Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Mar 13, 2024

Question 2 - Value Iteration [35 points] In this question, you will be using an applet to improve your understanding of value iteration. You can

Question 2 - Value Iteration [35 points] In this question, you will be using an applet to improve your understanding of value iteration. You can find the applet at https://artint info/demos/mdp/vi.html Note: modern browsers don't seem to like Java. There are workarounds, but a vastly less painful way to access the applet is to make sure you have the Java appletviewer installed (which you should if you have the JDK installed), and then from your command line run appletviewer https://artint . info/ demos/mdp/vi . html (You may need to first navigate to the directory where the appletviewer program is located.) There are some questions listed on that website; for this assignment, please disregard those questions and only answer the following ones. In this assignment, we are using a discount factor of 0.9, initial values of UCI(s) = 0 for all s, and the "absorbing states" option (explained in detail on the website with the applet) We will refer to states as (x,y), meaning the state in the x-th column and the y-th row: e.g. (1,1) for the state at the top left, and (10,1) for the state at the top right. (a) (10 points) The figure below shows the values U.")(s) in each state, that is, the values after one step of value iteration. We will focus on the entry in a single state, namely state (10,8), the state to the right of the absorbing state with reward 10 (which is located at (9,9)). Show in detail how UO( (10,8) ) is computed using the values U"(s). Value Iteration Step Discount 13 . Resch Meeting States

Question 2 - Value Iteration [35 points] In this question, you will be using an applet to improve your understanding of value iteration. You can find the applet at https://artint.info/demos/mdp/vi.html Note: modern browsers don't seem to like Java. There are workarounds, but a vastly less painful way to access the applet is to make sure you have the Java appletviewer installed (which you should if you have the JDK installed), and then from your command line run appletviewer https://artint.info/demos/mdp/vi.html (You may need to first navigate to the directory where the appletviewer program is located.) There are some questions listed on that website; for this assignment, please disregard those questions and only answer the following ones. In this assignment, we are using a discount factor of 0.9, initial values of U(s) = 0 for all s, and the "absorbing states" option (explained in detail on the website with the applet). We will refer to states as (x,y), meaning the state in the x-th column and the y-th row: e.g. (1,1) for the state at the top left, and (10,1) for the state at the top right. (a) (10 points) The figure below shows the values U(s) in each state, that is, the values after one step of value iteration. We will focus on the entry in a single state, namely state (10,8), the state to the right of the absorbing state with reward 10 (which is located at (9,8)). Show in detail how U(10,8)) is computed using the values U)(s). Value Iteration 01 01 01 01 Disco Step Resel Inal Value Absorbing States

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Introduction to Data Mining

Introduction to Data Mining

Authors: Pang Ning Tan, Michael Steinbach, Vipin Kumar

1st edition

321321367, 978-0321321367

More Books

Students also viewed these Algorithms questions

Question

You construct a sample in which Whites, Blacks, and Latinos are randomly selected from the U.S. adult population. The composition of your final sample is 25% White, 30% Black, and 30% Latino. You've...

Answered: 1 week ago

Question

★★★★★

Planning is one of the most important management functions in any business. A front office managers first step in planning should involve determine the departments goals. Planning also includes...

Answered: 1 week ago

Question

★★★★★

Question: Old MathJax webview Old MathJax webview i need ans of these question but the source is alot Old MathJax webviewOld MathJax webview i need ans of these question but the source is alot these...

Answered: 1 week ago

Question

★★★★★

In Problems 11 54, simplify each expression. Assume that all variables are positive when they appear. 2 ( 5 29 )

Answered: 1 week ago

Question

★★★★★

The parent of Nester Co. (a U.S. firm) has no international business but plans to invest $20 million in a business in Switzerland. Since the operating costs of this business are very low, Nester Co....

Answered: 1 week ago

Question

★★★★★

Write each system in Problems 1316 as a matrix equation of the form AX = B. X1 X - 3x + 2x3 -3 -2x + 3x 1 -2 x + X1 x + 4x3 X2 = || ||

Answered: 1 week ago

Question

★★★★★

Let X be a random variable denoting the number of roulette spins until a black-colored number appears. Then, the distribution of X is geometric. (To answer the question, it is immaterial how many...

Answered: 1 week ago

Question

★★★★★

Bridget Krumb, Inc., purchased inventory costing $125,000 and sold 80% of the goods for $190,000. All purchases and sales were on account. Krumb later collected 25% of the accounts receivable. 1....

Answered: 1 week ago

Question

★★★★★

1. Categorize the following accounts as Current Liabilities or Long-term Labilities: 2. Why do we categorize liabilities on the Balance Sheet? 3. A business takes out a 5 year loan for $50,000 with...

Answered: 1 week ago

Question

★★★★★

Alabama Atlantic is a lumber company that has three sources of wood and five markets to be supplied. The annual availability of wood at sources 1, 2, and 3 is 15, 20, and 15 million board feet,...

Answered: 1 week ago

Question

★★★★★

The Conjugate Zeros Theorem says that the complex zeros of a polynomial with real coefficients occur in complex conjugate pairs. Explain how this fact proves that a polynomial with real coefficients...

Answered: 1 week ago

Question

★★★★★

solve this problem by showing the manual calculation and choose the correct answer(mechanical vibrations) Find the natural frequencies of the system for k = 300 N/m, k2 = 500 N/m, k3 = 200 N/m, m = 2...

Answered: 1 week ago

Question

★★★★★

MegaHoldings Group, a significant conglomerate, and MiniFirm Ltd, its subsidiary, are involved in a financial transaction. Initially, on January 1, 2021, MegaHoldings Group issued bonds into the...

Answered: 1 week ago

Question

★★★★★

D Check out the figure below: 10. Annual rate of per capita GDP growth (%) 4 2 Z " Annual rate of population growth (%) What does the above graph imply about the relationship between income growth...

Answered: 1 week ago

Question

★★★★★

2) years, a rancher received $900 from an investment that earned 3% interest compounding annually. Using the table below, how much did the rancher invest? Express your answer to two (2) decimal...

Answered: 1 week ago

Question

★★★★★

Research your findings on the issues on Environmental Engineering , What types of positions may be included in a modern safety and health team in those area, Certification specifications etc. ( Try...

Answered: 1 week ago

Question

★★★★★

give me the data field for every class pls Following Assignment 1 and 2; make your changes to enhance the covenant System and make it suitable for university by adding a Faculty and Person classes,...

Answered: 1 week ago

Question

★★★★★

Explain the buyers position in a typical negotiation for a business. Explain the sellers position. What tips would you offer a buyer about to begin negotiating the purchase of a business?

Answered: 1 week ago

Question

★★★★★

Consider the data set shown in Table 7.12. Table 7.12. Data set for Exercise 4. (a) For each combination of rules given below, specify the rule that has the highest confidence. i. 15 ii. 15 iii. 15...

Answered: 1 week ago

Question

★★★★★

Consider the data set shown in Table 5.1 Table 5.1. Data set for Exercise 7. (a) Estimate the conditional probabilities for P(A|+), P(B|+), P(C|+), P(A|), P(B|), and P(C|). Answer: P(A = 1|) = 2/5 =...

Answered: 1 week ago

Question

★★★★★

Discuss why a document-term matrix is an example of a data set that has asymmetric discrete or asymmetric continuous features.

Answered: 1 week ago

Question

★★★★★

One year ago, Jasmin and Derek opened investment accounts with a discount broker. In their C$ account, they purchased 300 Bank of Montreal (BMO) shares at C$54.20 per share and six Government of...

Answered: 1 week ago

Question

★★★★★

Calculate the missing value.

Answered: 1 week ago

Question

★★★★★

A provincial government allocates 29% of its budget to education, 31% to health care, and 21% to social services. If the dollar amount budgeted for education is $13.7 billion, how much is budgeted...

Answered: 1 week ago

Previous Question Next Question