Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Q5. Feature-based Representation (20 points) Consider the following feature based representation of the Q-function: Q(8,0) = wifi(sa) + w2f2(8.a) with: fi(s, a) = 1/ (Manhattan

image text in transcribed

Q5. Feature-based Representation (20 points) Consider the following feature based representation of the Q-function: Q(8,0) = wifi(sa) + w2f2(8.a) with: fi(s, a) = 1/ (Manhattan distance to nearest dot after having executed action a in state s) $2(, a) =(Manhattan distance to nearest ghost after having executed action a in state s) Q5.1. Assume wi = 1 and W2 = 10. Assume that the red and blue ghosts are both sitting on top of a dot. Provide the values of Q(s, west) and Q(s, south). Based on this approximate Q-function, which action would be chosen? Q5.2. Assume Pac-Man moves West. This results in the state s' shown below. Pac-Man receives reward 9 (10 for eating a dot and -1 living penalty). Provide the values of Q(s', west) and Q(s', east). What is the sample value (assuming 7 = 1)? Q5.3. Now provide the update to the weights. Let a=0.5. Q5. Feature-based Representation (20 points) Consider the following feature based representation of the Q-function: Q(8,0) = wifi(sa) + w2f2(8.a) with: fi(s, a) = 1/ (Manhattan distance to nearest dot after having executed action a in state s) $2(, a) =(Manhattan distance to nearest ghost after having executed action a in state s) Q5.1. Assume wi = 1 and W2 = 10. Assume that the red and blue ghosts are both sitting on top of a dot. Provide the values of Q(s, west) and Q(s, south). Based on this approximate Q-function, which action would be chosen? Q5.2. Assume Pac-Man moves West. This results in the state s' shown below. Pac-Man receives reward 9 (10 for eating a dot and -1 living penalty). Provide the values of Q(s', west) and Q(s', east). What is the sample value (assuming 7 = 1)? Q5.3. Now provide the update to the weights. Let a=0.5

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Advances In Databases And Information Systems 23rd European Conference Adbis 2019 Bled Slovenia September 8 11 2019 Proceedings Lncs 11695

Authors: Tatjana Welzer ,Johann Eder ,Vili Podgorelec ,Aida Kamisalic Latific

1st Edition

3030287297, 978-3030287290

More Books

Students also viewed these Databases questions

Question

Explain in detail the different methods of performance appraisal .

Answered: 1 week ago

Question

KEY QUESTION Refer to columns 1 and 6 in the table for question

Answered: 1 week ago