Bellman Equation

When the agent is in state i at time slot n and takes action a, it transitions to the next state j at time slot n+1 with a given transition probability and incurs expected cost c(i, a). However, given the available actions, it is not enough to select the action that...
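To make the setup concrete, here is a minimal sketch of a Bellman update on a toy problem. Everything in it is an assumption for illustration rather than part of the text above: the two-state, two-action model, the numbers in `P` and `c`, the discounted-cost criterion with factor `GAMMA`, and the function names `bellman_backup` and `value_iteration`. The update scores each action by its expected immediate cost c(i, a) plus the discounted expected value of the next state j, and keeps the minimum.

```python
# Hypothetical 2-state, 2-action model; all numbers are illustrative assumptions.
# P[a][i][j]: probability of moving from state i to state j under action a.
P = [
    [[0.9, 0.1], [0.2, 0.8]],  # action 0
    [[0.5, 0.5], [0.6, 0.4]],  # action 1
]
# c[i][a]: expected one-step cost of taking action a in state i.
c = [[1.0, 2.0], [4.0, 0.5]]
GAMMA = 0.9  # assumed discount factor


def bellman_backup(V):
    """One update: V(i) <- min_a [ c(i, a) + GAMMA * sum_j P(j | i, a) * V(j) ]."""
    n_states, n_actions = len(V), len(P)
    new_V = []
    for i in range(n_states):
        # Cost-to-go of each action: immediate cost plus discounted future cost.
        q = [
            c[i][a] + GAMMA * sum(P[a][i][j] * V[j] for j in range(n_states))
            for a in range(n_actions)
        ]
        new_V.append(min(q))
    return new_V


def value_iteration(tol=1e-10, max_iter=10_000):
    """Repeat the update until successive value estimates differ by less than tol."""
    V = [0.0] * len(c)
    for _ in range(max_iter):
        new_V = bellman_backup(V)
        if max(abs(x - y) for x, y in zip(new_V, V)) < tol:
            return new_V
        V = new_V
    return V


if __name__ == "__main__":
    print(value_iteration())
```

Starting from an all-zero value estimate, the first update reduces to picking the cheapest immediate cost in each state; later updates also account for where each action tends to lead. Under the assumed discount factor below 1, the repeated update converges to a fixed point of the equation.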