Question
Q.4. Figure (a) below shows a house with 6 rooms, where the rooms labeled 0-4 are internal
rooms and 5 is the outside "room". Doors lead from each room to some others, as shown.
An Agent can be placed in any of the rooms, which can be considered his starting State.
The objective is to take Actions to move from the starting room to the goal room, which can
also be called the Goal State. Transiting from one room (state) to another through a door
is considered an Action that leads to a Reward. The Reward associated with each Action
is expressed in Table R, which is like a Reward Matrix: each row represents a State
and each column an Action, and the value of that Action is the corresponding element.
All infeasible Actions (i.e., no doors exist to execute such actions) are marked as such in Table R.
You are to use the Q-Learning process to update the Q-matrix shown as Table Q. As you
know, the Q-Learning process proceeds through Episodes, and in each Episode a
sequence of states is followed, which updates the Q-matrix at each step until the Goal
(Terminal) state is reached, signaling the end of that Episode. The Q-Learning update
equation is the following:
where all notations and symbols follow from what you have seen in your class.
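For reference, a common textbook form of the tabular Q-Learning update is sketched below. It assumes the class notation uses a learning rate $\alpha$ and a discount factor $\gamma$; the symbols in the original question may differ.

```latex
Q(s_t, a_t) \leftarrow Q(s_t, a_t) + \alpha \left[ r_{t+1} + \gamma \max_{a'} Q(s_{t+1}, a') - Q(s_t, a_t) \right]
```

In this form, $s_t$ is the current room, $a_t$ is the door taken, $r_{t+1}$ is the corresponding entry of the Reward Matrix, and the maximum is taken over the Actions available in the next room $s_{t+1}$.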
Fig a: the house with six labelled rooms.
Table R: Rows are states, columns are
Actions. Values are Rewards.
Table Q: Rows are states, columns are
Actions. Elements are Q-values.
You are to take the given parameter values and then use the given Q-matrix and R-matrix to update
the Q-matrix at each state in the following Episode (sequence of states):
Please provide a handwritten solution if possible
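The per-step update described in the question can be sketched in Python as follows. This is a minimal illustration and not the worked solution: the function name `run_episode`, the `infeasible` marker argument, and the 3-room matrices in the usage example are all hypothetical, since the actual Table R, Table Q, parameter values, and episode sequence are not available here. It assumes the standard update with a learning rate `alpha` and a discount factor `gamma`, and that taking Action `a` means moving into room `a`.

```python
def run_episode(Q, R, states, alpha, gamma, infeasible=None):
    """Apply the tabular Q-learning update along one fixed sequence of states.

    Q, R       : square lists of lists indexed [state][action]; the action
                 index is assumed to be the room being moved into.
    states     : the episode's sequence of states, ending at the goal.
    infeasible : hypothetical marker used in R for "no door"; if given,
                 those actions are excluded from the max over the next state.
    """
    Q = [row[:] for row in Q]  # work on a copy so the input table is untouched
    for s, s_next in zip(states, states[1:]):
        a = s_next  # moving through the door into room s_next
        if infeasible is None:
            candidates = Q[s_next]
        else:
            candidates = [q for q, r in zip(Q[s_next], R[s_next]) if r != infeasible]
        best_next = max(candidates, default=0.0)
        # Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))
        Q[s][a] += alpha * (R[s][a] + gamma * best_next - Q[s][a])
    return Q


# Usage with made-up numbers (NOT the values from the question):
R = [[-1, 0, -1],
     [0, -1, 100],
     [-1, 0, 100]]
Q0 = [[0.0] * 3 for _ in range(3)]
Q1 = run_episode(Q0, R, states=[0, 1, 2], alpha=0.5, gamma=0.8, infeasible=-1)
print(Q1[1][2])  # 50.0
```

To work the actual problem, the same function would be called with the question's own Table R, Table Q, parameter values, and episode sequence, feeding each episode's returned table into the next.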