Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Consider the vacuum-cleaning problem outlined below: Here we have a vacuum-cleaning agent that can sense the environment and perform actions to move around and vacuum-clean

Consider the vacuum-cleaning problem outlined below:

Here we have a vacuum-cleaning agent that can sense the environment and perform actions to move around and vacuum-clean dirty squares.

We make the following assumptions:

We assume a 3 by 3 grid world known to the agent. The environment is fully observable: the percepts give complete information about the Dirty/Clean status of every square and the agents location.

The environment is deterministic: A clean square remains clean and a dirty square remains dirty unless the agent cleans it up.

The actions available for the agent are: Left, Right, Up, Down, Suck. Each action takes place in one time "step".

The actions and percepts are perfectly reliable.

Each of the actions will incur a penalty of 1 point.

The agent will get a penalty of -2 points per dirty square per time step,

The agents performance is measured by the sum of negative points received on reaching a state with all squares in the environment being "Clean". As usual, a rational agent should max- imize its performance score.

(a) Define the vacuum-cleaning problem as a state-space search problem. Specify the states, actions, the transition model, and the goal state(s). The initial state could be any of the states in the state space.

You should also specify a cost function COST (n, a, n) > 0, which defines the cost of one step transition from node n to node n by applying action a, such that the cost of a path in the search tree should be exactly the absolute value of the sum of penalty points due to dirty square(s) and moves.

(Hint: COST (n, a, n) should be related to w, the number of dirty squares at the state s for noden, and related to the fact whether the action a is "Suck", and the current square is dirty or not).

(b) Which of the algorithms defined in Chapter 3 would be appropriate for this problem? Should the algorithm use tree search or graph search?

(c) You are asked to define an admissible (hopefully also consistent) heuristic function h to be used with the A* algorithm for solving the problem of searching for an optimal sequence of moves that maximize the agents performance score. (Hint: your h(n) should be related to w, the number of dirty squares at the state for node n, andd, the shortest distance between the current square and a dirty square, etc.)

You should also define an alternative admissible heuristic function h2 (which is different fromh), such that h2(n) is always lower than or equal to h(n) for any node n, namely h dominatesh2.

(d) Show the first level of a search tree starting from the initial state in which the top 3 squares, and the 3 right-most squares are dirty, and the agent is at the left bottom corner square. The frontier nodes at the first level should consist of three nodes, one is obtained from the initial state (root node) by the action "Up", one is obtained by the action "Right", and one is obtained by the action "Suck". Write the action as well as the edge cost along each edge from the root node to a frontier node. Then write down below each frontier node n: g(n), h(n) and h2(n).

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Database Processing Fundamentals, Design, and Implementation

Authors: David M. Kroenke, David J. Auer

14th edition

133876705, 9781292107639, 1292107634, 978-0133876703

More Books

Students also viewed these Databases questions