#TheAIAlphabet: B for Bellman Equation

The AI Alphabet | Published July 20, 2023 | Sunantha Sanjeeva Rao

AI, AI on the screen, which route is the best of all?

If AI concepts were retold as fairy tales, this twist on the famous line from Snow White and the Seven Dwarfs would be the best way to summarize the Bellman Equation.

Simply put, this fundamental building block of Reinforcement Learning helps us figure out the best route to pursue under a given set of conditions. To find that best option, it takes into account both the immediate reward and the rewards expected in the future. It can be used to train agents to play games, control robots, and even make financial decisions.

Let’s take Google Maps. One destination, two routes. To show you the best route, the intelligence behind it considers traffic and distance.

Say,

Route A is considerably shorter than route B
But route A has heavy traffic
So, the waiting time + travel time on route A exceeds the travel time on route B

The system is sure to recommend route B, despite it being longer.
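To make the comparison concrete, here is a minimal Python sketch of the route choice described above. The travel and waiting times are made-up numbers for illustration, and `total_minutes` is a hypothetical helper, not something taken from any real maps service.

```python
# Hypothetical figures in minutes; they are assumptions, not real map data.
routes = {
    "A": {"travel": 20, "waiting": 25},  # shorter road, but stuck in traffic
    "B": {"travel": 35, "waiting": 0},   # longer road, no traffic
}


def total_minutes(route):
    """Total cost of a route: time spent driving plus time spent waiting."""
    return route["travel"] + route["waiting"]


# Pick the route with the smallest total time, not the shortest distance.
best_route = min(routes, key=lambda name: total_minutes(routes[name]))
print(best_route)
```

With these assumed numbers, route A costs 45 minutes in total and route B costs 35, so the sketch picks B even though it is the longer road.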

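If we had to make it more concise,

Value of current state = Immediate reward + γ * Value of next state

Here γ (gamma) is the discount factor, a number between 0 and 1 that decides how much future rewards count compared to the immediate one. The best choice in any state is the action that makes this sum the largest.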
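The one-line formula can be turned into a small program. What follows is only a sketch under stated assumptions: the road network, the rewards (negative minutes, so less travel time means more reward), the discount factor of 0.9, and the function names `value_iteration` and `best_action` are all invented for illustration. It applies the formula repeatedly to every state, a procedure commonly called value iteration, and takes the maximum over the available actions.

```python
GAMMA = 0.9  # discount factor (an assumed value)

# Hypothetical road network: state -> {action: (reward, next_state)}.
# Rewards are negative minutes, so shorter trips earn higher reward.
graph = {
    "home": {"take_A": (-5, "A_jam"), "take_B": (-10, "B_road")},
    "A_jam": {"drive": (-40, "destination")},
    "B_road": {"drive": (-25, "destination")},
    "destination": {},  # terminal state: no actions, value stays 0
}


def value_iteration(graph, gamma, sweeps=50):
    """Repeatedly apply: value = max over actions of (reward + gamma * next value)."""
    values = {state: 0.0 for state in graph}
    for _ in range(sweeps):
        for state, actions in graph.items():
            if actions:
                values[state] = max(
                    reward + gamma * values[next_state]
                    for reward, next_state in actions.values()
                )
    return values


def best_action(graph, values, gamma, state):
    """Return the action with the largest reward + gamma * value of next state."""
    actions = graph[state]
    return max(
        actions,
        key=lambda a: actions[a][0] + gamma * values[actions[a][1]],
    )


values = value_iteration(graph, GAMMA)
print(best_action(graph, values, GAMMA, "home"))
```

In this toy network, turning onto route A looks better at first (5 minutes versus 10), but once the value of the next state is added in, route B wins: -10 + 0.9 * (-25) = -32.5 beats -5 + 0.9 * (-40) = -41.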
