33. If Ri denotes the random amount that is earned in period i, then i=1 i1Ri...

Question:

33. If Ri denotes the random amount that is earned in period i, then ∞

i=1 βi−1Ri , where 0 <β< 1 is a specified constant, is called the total discounted reward with discount factor β. Let T be a geometric random variable with parameter 1 − β

that is independent of the Ri . Show that the expected total discounted reward is equal to the expected total (undiscounted) reward earned by time T . That is, show that E



i=1

βi−1Ri



= E



T i=1 Ri



Fantastic news! We've Found the answer you've been seeking!

Step by Step Answer:

Related Book For  book-img-for-question
Question Posted: