33. If $R_i$ denotes the random amount that is earned in period $i$, then $\sum_{i=1}^{\infty} \beta^{i-1} R_i$, ...
Question:
33. If $R_i$ denotes the random amount that is earned in period $i$, then $\sum_{i=1}^{\infty} \beta^{i-1} R_i$, where $0 < \beta < 1$ is a specified constant, is called the total discounted reward with discount factor $\beta$. Let $T$ be a geometric random variable with parameter $1 - \beta$ that is independent of the $R_i$. Show that the expected total discounted reward is equal to the expected total (undiscounted) reward earned by time $T$. That is, show that

$$E\left[\sum_{i=1}^{\infty} \beta^{i-1} R_i\right] = E\left[\sum_{i=1}^{T} R_i\right]$$
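The claimed identity can be sanity-checked numerically before proving it. The following is a minimal Monte Carlo sketch, not part of the original problem. It assumes, purely for illustration, that the rewards are i.i.d. uniform on [0, 2] (so each has mean 1), that the discount factor is 0.8, and that the infinite sum is truncated at 100 periods. The function names `discounted_reward`, `sample_geometric`, and `simulate` are hypothetical.

```python
import random


def discounted_reward(rewards, beta):
    """Sum of beta**(i-1) * R_i for i = 1, 2, ... over a finite reward list."""
    # enumerate starts at 0, which matches the exponent i - 1 for period i.
    return sum(beta ** k * r for k, r in enumerate(rewards))


def sample_geometric(p, rng):
    """Sample T with P(T = n) = (1 - p)**(n - 1) * p for n = 1, 2, ..."""
    n = 1
    while rng.random() >= p:  # each trial succeeds with probability p
        n += 1
    return n


def simulate(beta=0.8, horizon=100, trials=20_000, seed=0):
    """Estimate both sides of the identity by averaging over many trials.

    Assumption: rewards are i.i.d. Uniform(0, 2); the problem itself does not
    fix a distribution. The horizon truncates the infinite sum, which is
    harmless here because beta**horizon is negligible.
    """
    rng = random.Random(seed)
    disc_total = 0.0
    stop_total = 0.0
    for _ in range(trials):
        rewards = [rng.uniform(0.0, 2.0) for _ in range(horizon)]
        disc_total += discounted_reward(rewards, beta)
        # T is drawn independently of the rewards, with parameter 1 - beta.
        t = sample_geometric(1.0 - beta, rng)
        stop_total += sum(rewards[:t])  # undiscounted reward earned by time T
    return disc_total / trials, stop_total / trials


if __name__ == "__main__":
    discounted, stopped = simulate()
    print(f"discounted estimate: {discounted:.3f}")
    print(f"stopped-sum estimate: {stopped:.3f}")
```

Under these assumptions both estimates should be close to 1 / (1 - 0.8) = 5. The stopped-sum estimate is the noisier of the two because the variance of $T$ contributes to it.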
Step by Step Answer:
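One way to argue the result is sketched below. It is not taken from the source page. It assumes that expectation and the infinite sum can be interchanged (for example, when the $R_i$ are nonnegative, or when $\sum_i \beta^{i-1} E|R_i| < \infty$), and it uses the indicator notation $I\{\cdot\}$, which is introduced here for convenience. The key fact is that a geometric random variable with parameter $1 - \beta$ satisfies $P(T \ge i) = \beta^{i-1}$.

```latex
\begin{aligned}
E\left[\sum_{i=1}^{T} R_i\right]
  &= E\left[\sum_{i=1}^{\infty} R_i \, I\{T \ge i\}\right] \\
  &= \sum_{i=1}^{\infty} E[R_i] \, P(T \ge i)
     && \text{(interchange, and independence of } T \text{ and } R_i\text{)} \\
  &= \sum_{i=1}^{\infty} \beta^{i-1} E[R_i]
     && \text{(since } P(T \ge i) = \beta^{i-1}\text{)} \\
  &= E\left[\sum_{i=1}^{\infty} \beta^{i-1} R_i\right].
\end{aligned}
```

The tail probability follows from $P(T = n) = \beta^{n-1}(1 - \beta)$ for $n = 1, 2, \ldots$, since $T \ge i$ exactly when the first $i - 1$ trials are all failures, each of which has probability $\beta$.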