33. If Ri denotes the random amount that is earned in period i, then i=1 i1Ri...
Question:
33. If Ri denotes the random amount that is earned in period i, then ∞
i=1 βi−1Ri , where 0 <β< 1 is a specified constant, is called the total discounted reward with discount factor β. Let T be a geometric random variable with parameter 1 − β
that is independent of the Ri . Show that the expected total discounted reward is equal to the expected total (undiscounted) reward earned by time T . That is, show that E
∞
i=1
βi−1Ri
= E
T i=1 Ri
Fantastic news! We've Found the answer you've been seeking!
Step by Step Answer:
Related Book For
Question Posted: