Exercise 9.8 Explain why we often use discounting of future rewards in MDPs. How would an agent
Question:
Exercise 9.8 Explain why we often use discounting of future rewards in MDPs.
How would an agent act differently if the discount factor was 0.6 as opposed to 0.9?
Fantastic news! We've Found the answer you've been seeking!
Step by Step Answer:
Related Book For
Artificial Intelligence Foundations Of Computational Agents
ISBN: 9780521519007
1st Edition
Authors: David L. Poole, Alan K. Mackworth
Question Posted: