Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

How can I compute MDP in Machine Learning! I add the formulas for it C. Compute the state-action value functions obtained by Sarsa and Q-learning

image text in transcribed
How can I compute MDP in Machine Learning! I add the formulas for it
image text in transcribed
C. Compute the state-action value functions obtained by Sarsa and Q-learning for the MDP in the following figure, under an E-greedy policy with = 0.2. The edges of the graph are actions, labelled with their name, probability, and immediate reward when non-zero. The nodes are states, labelled with their name. For this MDP, y=0.5. 2,0.5 a 0.5 Ja,0.5 a,1,1 a,1,-10 Sarsa update: Qx+1(s, a) = (x(s, a) + a(R4+1 + y Q(s', a') - Ox(s,a)). Q-learning update: Qx+1(s, a) = (x(s, a) + a(Rt+1 + max ' Y Qx(s', a') - Qx(s, a)) C. Compute the state-action value functions obtained by Sarsa and Q-learning for the MDP in the following figure, under an E-greedy policy with = 0.2. The edges of the graph are actions, labelled with their name, probability, and immediate reward when non-zero. The nodes are states, labelled with their name. For this MDP, y=0.5. 2,0.5 a 0.5 Ja,0.5 a,1,1 a,1,-10 Sarsa update: Qx+1(s, a) = (x(s, a) + a(R4+1 + y Q(s', a') - Ox(s,a)). Q-learning update: Qx+1(s, a) = (x(s, a) + a(Rt+1 + max ' Y Qx(s', a') - Qx(s, a))

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Deductive And Object Oriented Databases Second International Conference Dood 91 Munich Germany December 18 1991 Proceedings Lncs 566

Authors: Claude Delobel ,Michael Kifer ,Yoshifumi Masunaga

1st Edition

3540550151, 978-3540550150

More Books

Students also viewed these Databases questions