Question:

Value iteration: 

(i) Is a model-free method for finding optimal policies. 

(ii) Is sensitive to local optima. 

(iii) Is tedious to do by hand. 

(iv) Is guaranteed to converge when the discount factor satisfies 0 < γ < 1.
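
As background for weighing the options, here is a minimal sketch of value iteration on a small MDP. The two-state MDP, its probabilities and rewards, and the names `P`, `value_iteration`, and `greedy_policy` are all illustrative assumptions, not part of the original question. The point to notice is that the update reads the transition probabilities and rewards directly, and that the sweep repeats the Bellman optimality backup until the values stop changing.

```python
# Hypothetical 2-state, 2-action MDP (every number here is an assumption
# made up for illustration). P[s][a] is a list of
# (probability, next_state, reward) triples, i.e. an explicit model.
P = {
    0: {0: [(1.0, 0, 0.0)], 1: [(0.8, 1, 1.0), (0.2, 0, 0.0)]},
    1: {0: [(1.0, 0, 0.0)], 1: [(1.0, 1, 2.0)]},
}


def q_value(P, V, s, a, gamma):
    """Expected return of taking action a in state s, then following V."""
    return sum(p * (r + gamma * V[s2]) for p, s2, r in P[s][a])


def value_iteration(P, gamma=0.9, tol=1e-8, max_iters=10_000):
    """Repeat the Bellman optimality backup until the largest change < tol."""
    V = {s: 0.0 for s in P}
    for _ in range(max_iters):
        new_V = {s: max(q_value(P, V, s, a, gamma) for a in P[s]) for s in P}
        delta = max(abs(new_V[s] - V[s]) for s in P)
        V = new_V
        if delta < tol:
            break
    return V


def greedy_policy(P, V, gamma=0.9):
    """Pick, in each state, the action with the highest one-step lookahead."""
    return {s: max(P[s], key=lambda a: q_value(P, V, s, a, gamma)) for s in P}


if __name__ == "__main__":
    V = value_iteration(P)
    print(V, greedy_policy(P, V))
```

For this toy model the fixed point can be checked by hand: V(1) = 2 / (1 - 0.9) = 20 and V(0) = 15.2 / 0.82, roughly 18.54, with action 1 greedy in both states. With 0 < γ < 1 the backup is a contraction in the max norm, so the iterates approach that single fixed point from any starting values; even this two-state case needs well over a hundred sweeps at this tolerance, which gives a sense of the effort involved in doing it by hand.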
