Question

1 Gridworld [15 pts]

Consider the following grid environment. Starting from any unshaded square, you can move up, down, left, or right. Actions are deterministic and always succeed (e.g. going left from state 16 goes to state 15) unless they will cause the agent to run into a wall. The thicker edges indicate walls, and attempting to move in the direction of a wall results in staying in the same square (e.g. going in any direction other than left from state 16 stays in 16).

Taking any action from the green target square (no. 12) earns a reward of $r_g$ (so $r(12, a) = r_g \ \forall a$) and ends the episode. Taking any action from the red square of death (no. 5) earns a reward of $r_r$ (so $r(5, a) = r_r \ \forall a$) and ends the episode. From every other square, taking any action is associated with a reward $r_s \in \{-1, 0, +1\}$ (even if the action results in the agent staying in the same square). Assume the discount factor $\gamma = 1$, $r_g = +5$, and $r_r = -5$ unless otherwise specified.

(a) (3 pts) Define the value of $r_s$ that would cause the optimal policy to return the shortest path to the green target square (no. 12). Using this $r_s$, find the optimal value for each square.

(b) (3 pts) Let's refer to the value function derived in (a) as $V^{\pi_g}_{\text{old}}$ and the policy as $\pi_g$. Suppose we are now in a new gridworld where all the rewards ($r_s$, $r_g$, and $r_r$) have +2 added to them. Consider still following $\pi_g$ of the original gridworld. What will the new values $V^{\pi_g}_{\text{new}}$ be in this second gridworld?
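The grid figure is not reproduced in this text, so the exact layout of walls and squares is unknown here. As a rough illustration of the mechanics only, the sketch below runs undiscounted value iteration and policy evaluation on a hypothetical 1x4 corridor (state 0 is a red terminal square, state 3 is a green terminal square). The corridor, the state numbering, and all function names are assumptions made for this example and are not taken from the question.

```python
# Hypothetical 1x4 corridor, NOT the grid from the question's figure:
# state 0 = red terminal, states 1 and 2 = ordinary, state 3 = green terminal.
STATES = [0, 1, 2, 3]
TERMINAL = {0, 3}
ACTIONS = {"left": -1, "right": +1}


def step(state, action):
    """Deterministic move; walls at both ends keep the agent in place."""
    nxt = state + ACTIONS[action]
    return nxt if 0 <= nxt <= 3 else state


def reward(state, r_s, r_g, r_r):
    """Reward for taking any action from `state`."""
    if state == 3:
        return r_g
    if state == 0:
        return r_r
    return r_s


def value_iteration(r_s, r_g=5, r_r=-5, sweeps=50):
    """Synchronous value iteration with gamma = 1 and a fixed sweep budget."""
    v = {s: 0.0 for s in STATES}
    for _ in range(sweeps):
        new_v = {}
        for s in STATES:
            r = reward(s, r_s, r_g, r_r)
            if s in TERMINAL:
                new_v[s] = r  # the episode ends after this action
            else:
                new_v[s] = max(r + v[step(s, a)] for a in ACTIONS)
        v = new_v
    return v


def greedy_policy(v):
    """Choose the action leading to the highest-valued successor.

    Rewards here do not depend on the action, so this is the greedy policy.
    """
    return {s: max(ACTIONS, key=lambda a: v[step(s, a)]) for s in STATES}


def evaluate_policy(policy, r_s, r_g, r_r, max_steps=100):
    """Sum undiscounted rewards along the policy's path from each state."""
    values = {}
    for start in STATES:
        s, total = start, 0.0
        for _ in range(max_steps):
            total += reward(s, r_s, r_g, r_r)
            if s in TERMINAL:
                break
            s = step(s, policy[s])
        values[start] = total
    return values


if __name__ == "__main__":
    v_old = value_iteration(r_s=-1)
    pi = greedy_policy(v_old)
    print("old values:", v_old)
    # Same policy, every reward shifted by +2.
    print("new values:", evaluate_policy(pi, r_s=1, r_g=7, r_r=-3))
```

In this toy corridor, a step reward of -1 makes every extra move costly, so the greedy policy heads straight for the green square. With a step reward of 0 the non-terminal values tie, and with +1 the values keep growing as the sweep budget increases, because with $\gamma = 1$ the agent is paid for never terminating. For the shifted rewards, each state's value under the fixed policy rises by 2 for every action taken before the episode ends, including the final action from the terminal square.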
