Question
Consider the Markov decision process of Figure 4, whose transitions are labeled with the names of the actions and the transition probabilities, and whose states are labeled with the corresponding rewards. It models the possible actions of a character being chased by enemies. In the Tranquil state nothing happens: no enemies are on the horizon, and the character receives a reward of +2. If it waits and does nothing, it risks being located and caught by the enemy (probability 0.2); if caught, it incurs a penalty of -5. If it moves from the Tranquil state, it has a 50% chance of being ambushed and surrounded by the enemy. Once surrounded or caught, it may attempt to escape or defend itself.
1. In the following table, indicate the utility values of the states at the end of each of the first two iterations of the value-iteration algorithm, assuming a discount (attenuation) factor of 0.9 and starting initially (iteration 0) with all state values equal to 0. Indicate only the values; do not detail the calculations.
2. Give the action plan (policy) resulting from iteration 2.
[Figure 4: Markov decision process with states Tranquil (+2), Surrounded (-1), and Caught (-5); transitions are labeled with the actions Wait, Move, Escape, and Defend and their probabilities (e.g., from Tranquil: Wait 0.8 / 0.2, Move 0.5 / 0.5).]
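For reference, assuming the usual formulation in which each state s carries a reward R(s) (as in Figure 4), one standard value-iteration backup is

    U_{k+1}(s) = R(s) + \gamma \max_{a} \sum_{s'} P(s' \mid s, a) \, U_k(s'),

with \gamma = 0.9 and U_0(s) = 0 for all states; the policy after iteration 2 picks, in each state, the action that achieves the maximum.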
Step by Step Solution
State         Utility after iteration 1    Utility after iteration 2
Tranquil          1.8                          1.94
Surrounded       -0.75                        -1.09
Caught           -4.5                         -4.5

The action plan (policy) resulting from iteration 2 is to Move from the Tranquil state ...
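To reproduce numbers of this kind, below is a minimal value-iteration sketch in Python. The rewards and the Tranquil-state transitions are taken from the problem text; the transition probabilities for Surrounded and Caught are placeholders (the figure is only partially legible here) and must be replaced with the actual values from Figure 4. The exact intermediate utilities also depend on the convention used (for example, whether the immediate reward is itself discounted), so the printout may differ from the table above.

# Minimal value-iteration sketch for the MDP of Figure 4.
# The Surrounded and Caught transition rows are ASSUMED placeholders;
# replace them with the probabilities shown in the figure.

GAMMA = 0.9  # discount factor from the problem statement

# State rewards from the problem statement.
rewards = {"Tranquil": 2.0, "Surrounded": -1.0, "Caught": -5.0}

# transitions[state][action] = list of (probability, next_state).
transitions = {
    "Tranquil": {
        "Wait": [(0.8, "Tranquil"), (0.2, "Caught")],        # from the text
        "Move": [(0.5, "Tranquil"), (0.5, "Surrounded")],     # from the text
    },
    "Surrounded": {
        "Escape": [(0.7, "Tranquil"), (0.3, "Caught")],       # assumed
        "Defend": [(0.55, "Surrounded"), (0.45, "Caught")],   # assumed
    },
    "Caught": {
        "Escape": [(0.3, "Tranquil"), (0.7, "Caught")],       # assumed
        "Defend": [(0.25, "Surrounded"), (0.75, "Caught")],   # assumed
    },
}

def value_iteration_step(U):
    """One Bellman backup: U'(s) = R(s) + gamma * max_a sum_s' P(s'|s,a) U(s')."""
    new_U, policy = {}, {}
    for s, actions in transitions.items():
        q = {a: sum(p * U[s2] for p, s2 in outcomes)
             for a, outcomes in actions.items()}
        best = max(q, key=q.get)          # greedy action under current U
        new_U[s] = rewards[s] + GAMMA * q[best]
        policy[s] = best
    return new_U, policy

U = {s: 0.0 for s in transitions}         # iteration 0: all utilities are 0
for k in (1, 2):
    U, policy = value_iteration_step(U)
    print(f"iteration {k}: {U}  policy: {policy}")

Running two steps of this loop gives both the requested utilities and the greedy policy after iteration 2, once the placeholder probabilities are filled in from the figure.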