Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Consider the dataset shown in Table 1 for a binary classification problem. Customer ID 1 Housing Type Gender Marital Status Apartment Male Married House Male

image text in transcribed

image text in transcribed

Consider the dataset shown in Table 1 for a binary classification problem. Customer ID 1 Housing Type Gender Marital Status Apartment Male Married House Male Single Married House Female Apartment Female Single CO w Apartment Male Married Hostel Male C Single Married House Female Apartment Female Single 18 5 8 8 5 5 8 8 5 5 8 8 5 5 8 Apartment Male Male House Married Single Married Hostel Female Hostel Female Single House Male Married Hostel Male Single Hostel Married Female Female Apartment Single Table 1 a. [1.5 points) Compute the Gini index, entropy, and misclassification error for the overall data b. [6 points) Compute the Gini index, entropy, and misclassification error for each of the four attributes (consider a multi-way split using each unique value of an attribute). c. [3 points) Compute the information Gain (IG) obtained by splitting the overall data using each of the four attributes. Which attribute provides the highest IG, and which attribute provides the lowest IG. Consider the dataset shown in Table 1 for a binary classification problem. Customer ID 1 Housing Type Gender Marital Status Apartment Male Married House Male Single Married House Female Apartment Female Single CO w Apartment Male Married Hostel Male C Single Married House Female Apartment Female Single 18 5 8 8 5 5 8 8 5 5 8 8 5 5 8 Apartment Male Male House Married Single Married Hostel Female Hostel Female Single House Male Married Hostel Male Single Hostel Married Female Female Apartment Single Table 1 a. [1.5 points) Compute the Gini index, entropy, and misclassification error for the overall data b. [6 points) Compute the Gini index, entropy, and misclassification error for each of the four attributes (consider a multi-way split using each unique value of an attribute). c. [3 points) Compute the information Gain (IG) obtained by splitting the overall data using each of the four attributes. Which attribute provides the highest IG, and which attribute provides the lowest IG

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Sams Teach Yourself Beginning Databases In 24 Hours

Authors: Ryan Stephens, Ron Plew

1st Edition

067232492X, 978-0672324925

More Books

Students also viewed these Databases questions