Question

Value Iteration Example: 4 x 4 Grid World
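k = 2: Given the state values obtained at k = 1, consider updating state 9. There are four actions (W, E, N, S), and for each we have

$V_2^W(s=9) = r_9^W + P_{98}^W \, V_1(s=8) = -1 + 1 \times (-1) = -2$, where $r_9^W = -1$, $P_{98}^W = 1$, $V_1(s=8) = -1$,

$V_2^E(s=9) = r_9^E + P_{9,10}^E \, V_1(s=10) = -1 + 1 \times (-1) = -2$, where $r_9^E = -1$, $P_{9,10}^E = 1$, $V_1(s=10) = -1$,

$V_2^N(s=9) = r_9^N + P_{95}^N \, V_1(s=5) = -1 + 1 \times (-1) = -2$, where $r_9^N = -1$, $P_{95}^N = 1$, $V_1(s=5) = -1$,

$V_2^S(s=9) = r_9^S + P_{9,13}^S \, V_1(s=13) = -1 + 1 \times (-1) = -2$, where $r_9^S = -1$, $P_{9,13}^S = 1$, $V_1(s=13) = -1$.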
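The four per-action computations for state 9 can be checked with a few lines of Python. This is a minimal sketch: the names `V1`, `successor`, and `action_values` are hypothetical helpers introduced for illustration, and the numbers are the ones given in the text (reward -1, transition probability 1, and a k = 1 value of -1 for states 5, 8, 10, and 13).

```python
# One-step lookahead for state 9 at k = 2, using the k = 1 values from the text.
# All names here are illustrative; they do not come from the original lecture.

V1 = {5: -1.0, 8: -1.0, 10: -1.0, 13: -1.0}       # state values at k = 1
successor = {"W": 8, "E": 10, "N": 5, "S": 13}    # deterministic moves out of state 9
reward = -1.0                                     # r_9^a for every action a
prob = 1.0                                        # P_{9,s'}^a for the single successor s'


def action_values(state_values, successors, reward, prob):
    """Return r + P * V_k(s') for each action, given one successor per action."""
    return {a: reward + prob * state_values[s_next]
            for a, s_next in successors.items()}


q2 = action_values(V1, successor, reward, prob)
print(q2)  # each of the four actions evaluates to -2.0
```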
[Figure: grid of state values at k = 1]
We then compare the values of each action and choose the one with the maximum value.
Since $V_2^W(s=9) = V_2^E(s=9) = V_2^N(s=9) = V_2^S(s=9) = -2$, we have
$V_2(s=9) = \max_{a \in A} \{ V_2^W(s=9), V_2^E(s=9), V_2^N(s=9), V_2^S(s=9) \} = -2$.
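The per-action values and the final maximization for state 9 are one instance of the general value-iteration update. Written in the same notation, and assuming no discounting (the worked example uses no discount factor), the update for any state $s$ can be sketched as:

```latex
V_{k+1}(s) \;=\; \max_{a \in A} \Big[\, r_s^a + \sum_{s'} P_{ss'}^a \, V_k(s') \,\Big]
```

In this grid world each action leads to exactly one successor state with probability 1, so the sum reduces to a single term, as in the computation for state 9.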
For the grid world example we discussed in the lecture, consider using value iteration, given the state values at k = 2:
[Figure: grid of state values at k = 2]
Answer the following questions:
1.1: What is the state value of state "1" when k = 3?
1.2: What is the state value of state "9" when k = 3?
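Because the k = 2 grid is not reproduced here, one way to approach both questions is to rerun the sweeps from scratch. The sketch below does that under assumptions that are NOT stated in the question and should be checked against the lecture: states are numbered 0 to 15 row by row (consistent with state 9 having neighbors 8, 10, 5, and 13), states 0 and 15 are terminal with value 0, every move earns reward -1, moves off the grid leave the state unchanged, there is no discounting, and all values start at 0. The function names are hypothetical.

```python
# Synchronous value iteration on a 4 x 4 grid world (illustrative sketch).
# ASSUMPTIONS, not given in the question: terminal states are 0 and 15,
# V_0 = 0 everywhere, reward = -1 per move, gamma = 1, off-grid moves stay put.

N = 4
TERMINAL = {0, 15}
MOVES = {"N": (-1, 0), "S": (1, 0), "W": (0, -1), "E": (0, 1)}


def step(state, action):
    """Return the successor state for a deterministic move."""
    row, col = divmod(state, N)
    d_row, d_col = MOVES[action]
    new_row, new_col = row + d_row, col + d_col
    if 0 <= new_row < N and 0 <= new_col < N:
        return new_row * N + new_col
    return state  # bumping into a wall leaves the state unchanged


def sweep(values, reward=-1.0, gamma=1.0):
    """Apply one value-iteration update to every state, using the old values."""
    updated = []
    for s in range(N * N):
        if s in TERMINAL:
            updated.append(0.0)
        else:
            updated.append(max(reward + gamma * values[step(s, a)] for a in MOVES))
    return updated


def value_iteration(k):
    """Return the state values after k sweeps, starting from all zeros."""
    values = [0.0] * (N * N)
    for _ in range(k):
        values = sweep(values)
    return values


V2 = value_iteration(2)
V3 = value_iteration(3)
print("V2(9) =", V2[9])
print("V3(1) =", V3[1], " V3(9) =", V3[9])
```

Under these assumptions the sketch reproduces V_2(9) = -2 from the worked example and gives V_3(1) = -1 and V_3(9) = -3. A different terminal layout, reward, or discount factor would change these numbers, so the k = 2 grid from the lecture remains the authoritative starting point.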