Question:

Exercise 9.8 Explain why we often use discounting of future rewards in MDPs.

How would an agent act differently if the discount factor was 0.6 as opposed to 0.9?

Step by Step Answer:
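
One way to see the effect the question asks about is to compute the discounted return, the sum of gamma^t * r_t over time steps t, for two courses of action under each discount factor. The sketch below is only an illustration: the two reward sequences (take_now and wait) and the helper names are hypothetical and not taken from the exercise. It assumes the agent simply prefers the sequence with the larger discounted return.

```python
def discounted_return(rewards, gamma):
    """Return the sum of gamma**t * r_t over a finite reward sequence."""
    return sum((gamma ** t) * r for t, r in enumerate(rewards))


# Hypothetical reward sequences, chosen only for illustration:
# a small reward immediately versus a larger reward three steps later.
OPTIONS = {
    "take_now": [1.0, 0.0, 0.0, 0.0],
    "wait": [0.0, 0.0, 0.0, 3.0],
}


def preferred(gamma):
    """Name of the option with the larger discounted return for this gamma."""
    return max(OPTIONS, key=lambda name: discounted_return(OPTIONS[name], gamma))


if __name__ == "__main__":
    for gamma in (0.6, 0.9):
        values = {name: discounted_return(r, gamma) for name, r in OPTIONS.items()}
        print(gamma, values, "prefers", preferred(gamma))
```

With these assumed rewards, the delayed reward of 3 is worth 3 * 0.6^3 = 0.648 at a discount factor of 0.6, which is less than the immediate reward of 1, so the agent takes the immediate reward. At 0.9 it is worth 3 * 0.9^3 = 2.187, so the agent waits. In general, a lower discount factor makes the agent more short-sighted. Discounting also keeps the total finite over an unbounded horizon: with rewards of magnitude at most r_max, the discounted sum is at most r_max / (1 - gamma) in magnitude, which is 2.5 * r_max for 0.6 and 10 * r_max for 0.9.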