
Question:

Argue that, for fixed \(i\), the maximum in the optimal value function \(W(i)=\max _{\mathbf{u}} W(i, \mathbf{u})\), taken over stationary policies only, must be attained by some policy. Does your argument extend to the case where the supremum is taken over all admissible policies?


Step by Step Answer:
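
One possible line of argument, offered as a sketch rather than the textbook's own solution. It assumes a finite state space \(S\) and a finite action set \(A(j)\) for each state \(j\), neither of which is stated in the question. The symbols \(S\), \(A(j)\), and \(\mathcal{U}_s\) are introduced here only for illustration. A stationary policy assigns to each state \(j\) one action in \(A(j)\), so under these assumptions the set \(\mathcal{U}_s\) of stationary policies is finite:

\[
|\mathcal{U}_s| = \prod_{j \in S} |A(j)| < \infty
\quad\Longrightarrow\quad
W(i) = \max_{\mathbf{u} \in \mathcal{U}_s} W(i, \mathbf{u}) = W(i, \mathbf{u}^{*}) \ \text{ for some } \mathbf{u}^{*} \in \mathcal{U}_s .
\]

For fixed \(i\), the values \(W(i, \mathbf{u})\) with \(\mathbf{u} \in \mathcal{U}_s\) form a finite collection, and a finite collection of values always contains its largest element, so the maximum is attained. This counting argument does not extend as it stands to all admissible policies: a policy that may depend on time or on the observed history can be chosen in infinitely many ways, and a supremum over an infinite set need not be attained without a further argument, for example one showing that stationary policies already achieve the supremum.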
