Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Your Task ( part 2 ) : Now that you've implemented q - learning for one task, you will move to the mountain car task.

Your Task (part 2):
Now that you've implemented q-learning for one task, you will move to the mountain car task.
Instead of 2 actions (left, right), this task has three (left, null, right). The task also has different
state variables (only 2)
?x : the location of the robot is the left, -.45 is approximately the valley, 0.6 is the
rightmost part of the board, 0.5 is the location of the flag)
? xdot: the velocity of the robot (this can go from -0.07 to 0.07)
This will require you to change the number of bins for state descritization as well as the
alpha and gamma values. Additionally, you need to implement the exploration vs
exploitation part for this problem as well.
Once your model is trained it will be saved the Q-table as 'car.npy' file. Make sure to that you
don't change this file name.
image text in transcribed

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image_2

Step: 3

blur-text-image_3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Postgresql 16 Administration Cookbook Solve Real World Database Administration Challenges With 180+ Practical Recipes And Best Practices

Authors: Gianni Ciolli ,Boriss Mejias ,Jimmy Angelakos ,Vibhor Kumar ,Simon Riggs

1st Edition

1835460585, 978-1835460580

More Books

Students also viewed these Databases questions