Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Q. 4(Discounted rewards).Let{Xn}be a Markov chain with transition matrixP. Whenever the Markov chain is in the statei, you get a rewardf(i). However, we must discount

Q. 4(Discounted rewards).Let{Xn}be a Markov chain with transition matrixP. Whenever the Markov chain is in the statei, you get a rewardf(i). However, we must discount the value of rewards we will get in the future (a dollar you receive next year is worth less than a dollar you receive now, because if you receive the dollar now you can put it in your bank account and earn interest on it). Therefore, the total value of the rewards you will receive in the future is

r(i)=Enf(X),

in n=0

where0< <1is the discount factor (that is,1/is the interest rate on your bank account). We can naturally viewrandfas vectors as explained in class.

  1. Show that the vectorrsatisfies the following equation:r=f+P r.
  2. Suppse that{Xn}is a Markov chain on two statesD={1,2}with

f(1)=1, f(2)=2, =0.5, P11=P22=0.75, P12=P21=0.25.

Use the above formula to computer(1)andr(2).

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

An Introduction to Analysis

Authors: William R. Wade

4th edition

132296381, 978-0132296380

More Books

Students also viewed these Mathematics questions