Question: Players MAX and MIN are playing a game with a finite depth of possible moves. MAX calculates the minimax value of the root to be

Players MAX and MIN are playing a game with a finite depth of possible moves. MAX calculates the minimax value of the root to be M. Assume that each player has at least 2 possible actions at every turn and that every distinct sequence of moves leads to a distinct score. Which of the following are true? 

a. Assume MIN is playing suboptimally, and MAX does not know this. The outcome of the game can be better than M (i.e. higher for MAX). 

b. Assume MAX knows player MIN is playing randomly. There exists a policy for MAX such that MAX can guarantee a better outcome than M. 

c. Assume MAX knows MIN is playing suboptimally on every move and knows the policy πMIN that MIN is using (MAX knows exactly how MIN will play). There exists a policy for MAX such that MAX can guarantee a better outcome than M.

d. Assume MAX knows MIN is playing suboptimally at all times but does not know the policy πMIN that MIN is using (MAX knows MIN will choose a suboptimal action at each turn, but does not know which suboptimal action). There exists a policy for MAX such that MAX can guarantee a better outcome than M. 

Step by Step Solution

3.50 Rating (163 Votes )

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock

a True MAX may take advantage of MINs errors without having to know about it ahead of time b F... View full answer

blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Artificial Intelligence A Modern approach Questions!