Question
Consider training an MDP using the following sequence of states, actions, and rewards:
S1, reward 0, action 1
S2, reward 10, action 1
S2, reward 10, action 2
S1, reward 0, action 1
S2, reward 10, action 2
S1, reward 0, action 2
S3, reward 0, action 1
S3, reward 0, action 1
S4, reward 100, action 1
S4, reward 100, action 2
S2, reward 10
(a) Suppose you use certainty-equivalent learning to calculate the J* values. Fill in the table below, using discount factor γ = 0.5.
| State | S1 | S2 | S3 | S4 |
| --- | --- | --- | --- | --- |
| J* value |  |  |  |  |
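Leaving the answer cells blank, here is a minimal sketch of how part (a) could be computed, assuming the reward is attached to the state being left (as the trace suggests), the transition model is the maximum-likelihood estimate from the observed counts, and J* is then found by value iteration on that estimated model. Names like `transitions` and `P` are illustrative, not part of the question.

```python
from collections import defaultdict

# Observed trajectory: (state, reward, action, next_state).
# The final observation "S2, reward 10" has no action, so it only
# serves as the successor of the last full transition.
transitions = [
    ("S1", 0, 1, "S2"),
    ("S2", 10, 1, "S2"),
    ("S2", 10, 2, "S1"),
    ("S1", 0, 1, "S2"),
    ("S2", 10, 2, "S1"),
    ("S1", 0, 2, "S3"),
    ("S3", 0, 1, "S3"),
    ("S3", 0, 1, "S4"),
    ("S4", 100, 1, "S4"),
    ("S4", 100, 2, "S2"),
]

# Maximum-likelihood model: empirical transition frequencies, and the
# (apparently deterministic) reward attached to each state.
counts = defaultdict(lambda: defaultdict(int))
reward = {}
for s, r, a, s2 in transitions:
    counts[(s, a)][s2] += 1
    reward[s] = r

P = {
    sa: {s2: c / sum(nexts.values()) for s2, c in nexts.items()}
    for sa, nexts in counts.items()
}

# Value iteration on the estimated model:
#   J(s) = R(s) + gamma * max_a sum_{s'} P(s'|s,a) * J(s')
gamma = 0.5
J = {s: 0.0 for s in reward}
for _ in range(100):  # plenty of sweeps to converge at gamma = 0.5
    J = {
        s: reward[s] + gamma * max(
            sum(p * J[s2] for s2, p in P[(s, a)].items())
            for a in (1, 2) if (s, a) in P  # only actions actually observed
        )
        for s in J
    }

print({s: round(v, 2) for s, v in sorted(J.items())})
```

Under these assumptions the fixed point is easy to sanity-check by hand: every observed transition out of S4 under action 1 returns to S4, so J*(S4) = 100 + 0.5·J*(S4), i.e. J*(S4) = 200, and the remaining entries follow the same way from the estimated model.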
(b) Suppose you instead use Q-learning. Assume that all Q-values are initialized to 0. Fill in the table below to show how the Q-values change after the first six transitions, using discount factor γ = 0.5 and learning rate α = 0.5.
| State, action pair | (S1, 1) | (S1, 2) | (S2, 1) | (S2, 2) | (S3, 1) | (S3, 2) | (S4, 1) | (S4, 2) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Q-value at start | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| After observing S1, reward 0, action 1 → S2 |  |  |  |  |  |  |  |  |
| After observing S2, reward 10, action 1 → S2 |  |  |  |  |  |  |  |  |
| After observing S2, reward 10, action 2 → S1 |  |  |  |  |  |  |  |  |
| After observing S1, reward 0, action 1 → S2 |  |  |  |  |  |  |  |  |
| After observing S2, reward 10, action 2 → S1 |  |  |  |  |  |  |  |  |
| After observing S1, reward 0, action 2 → S3 |  |  |  |  |  |  |  |  |
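A matching sketch for part (b), assuming the standard tabular update Q(s, a) ← (1 − α)·Q(s, a) + α·(r + γ·max_a′ Q(s′, a′)) with the same state-attached reward convention; again the names are illustrative.

```python
# Tabular Q-learning over the first six observed transitions, assuming
# the standard update
#   Q(s,a) <- (1 - alpha) * Q(s,a) + alpha * (r + gamma * max_a' Q(s',a'))
# with the reward attached to the state being left, as in the trace above.
alpha, gamma = 0.5, 0.5
states, actions = ("S1", "S2", "S3", "S4"), (1, 2)
Q = {(s, a): 0.0 for s in states for a in actions}

first_six = [
    ("S1", 0, 1, "S2"),
    ("S2", 10, 1, "S2"),
    ("S2", 10, 2, "S1"),
    ("S1", 0, 1, "S2"),
    ("S2", 10, 2, "S1"),
    ("S1", 0, 2, "S3"),
]

for s, r, a, s2 in first_six:
    target = r + gamma * max(Q[(s2, b)] for b in actions)
    Q[(s, a)] = (1 - alpha) * Q[(s, a)] + alpha * target
    print(f"after ({s}, a={a}) -> {s2}: Q({s},{a}) = {Q[(s, a)]}")
```

Each step changes only the entry for the pair just visited; the second transition, for example, gives Q(S2, 1) = 0.5·0 + 0.5·(10 + 0.5·0) = 5.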