Question: 6. The following table lists a dataset containing the details of six patients. Each patient is described in terms of three binary descriptive features (OBESE,

6. The following table lists a dataset containing the details of six patients. Each patient is described in terms of three binary descriptive features (OBESE, SMOKER, and DRINKS ALCOHOL) and a target feature

(CANCER RISK)

ID OBESE SMOKER DRINKS ALCOHOL CANCER RISK 1 true false true low

a. Which of the descriptive features will the ID3 decision tree induction algorithm choose as the feature for the root node of the decision tree?

b. When designing a dataset, it is generally a bad idea if all the descriptive features are indicators of the target feature taking a particular value. For example, a potential criticism of the design of the dataset in this question is that all the descriptive features are indicators of the CANCER RISK target feature taking the same level, high. Can you think of any descriptive features that could be added to this dataset that are indicators of the low target level?

ID OBESE SMOKER DRINKS ALCOHOL CANCER RISK 1 true false true low 2 23 true true true high 3 true false true low 4 false true true high 5 false true false low 6 false true true high

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Principles Algorithms And Systems Questions!