Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Consider the dataset shown in Table 1 for a binary classification problem. Customer ID Housing Type Gender Marital Status 1 Apartment Male Married Class CO

image text in transcribed

image text in transcribed

Consider the dataset shown in Table 1 for a binary classification problem. Customer ID Housing Type Gender Marital Status 1 Apartment Male Married Class CO 2 Male Single C1 3 Female Married C1 4 House House Apartment Apartment Hostel Female Single CO 5 Male Married CO 6 Male Single ci 7 House Female Married C1 8 Female CO Single Married 9 Male CO 10 Male Single C1 Apartment Apartment House Hostel Hostel House 11 Female Married C1 12 Female Single CO 13 Male Married CO 14 Hostel Male Single C1 15 Hostel Female Married C1 16 Apartment Female Single CO Table 1 a. [1 points) Compute the entropy for the overall data. b. [2 points] Compute the entropy for each of the four attributes (consider a multi-way split using each unique value of an attribute). c. [3 points] Compute the Information Gain (IG) obtained by splitting the overall data using each of the four attributes. Which attribute provides the highest IG, and which attribute provides the lowest IG. d. [2.5 points] Compute the Gain Ratio for splitting over each of the four attributes. Which attribute provides the highest Gain Ratio? e. [1.5 points] For splitting at the root node, would you choose the attribute that provides the maximum IG, or the attribute that provides maximum Gain Ratio? Briefly explain your choice

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Students also viewed these Databases questions