Question: Consider a dataset with three columns of binary attributes A, A and a binary label attribute Y. There are eight types of data point

Consider a dataset with three columns of binary attributes A, A and a binary label attribute Y. There are eight types of data point in total, and their corresponding proportions in the dataset are captured in the column P. type 1 2 3 4 5 6 7 8 A A2 Y P 0 0 0 8% 0 1 0 0 1 1 1 1 0 1 OTT 1 0 0 1 1 0|1|0| 0 1 Hol 0 1 29% 2% 18% 16% 2% 1% 24% (a) What is the GINI index of the dataset? (b) What is the GINI index of the split on A? (c) What is the GINI index of the split on A? (d) Construct a decision tree of exactly five nodes for the dataset using the Hunt's algorithm. (e) What is the accuracy of the decision tree built in (d)?
Step by Step Solution
3.56 Rating (187 Votes )
There are 3 Steps involved in it
a The GINI index of the dataset can be calculated as follows GINI 1 PY02 PY12 1 05372 04632 0499 b T... View full answer
Get step-by-step solutions from verified subject matter experts
