Question
Decision Trees: A survey was sent to the employees of a large company to ask them the following questions:
Do you work in the data analytics department? (Y or N)
Are you above the age of 30? (Y or N)
Have you spent more than 5 years in this company? (Y or N)
Is your current gross income more than USD 50,000 per year? (Y or N)
The following table summarizes the responses to the survey. For each entry, Number of Instances represents the number of respondents having the corresponding values for the attributes Analytics Department, Age>30, and Tenure>5.
Analytics Department | Age>30 | Tenure>5 | Number of Instances of Income > 50K | Number of Instances of Income <= 50K |
Y | Y | Y | 25 | 0 |
N | Y | Y | 15 | 0 |
Y | N | Y | 10 | 5 |
Y | Y | N | 0 | 0 |
N | N | Y | 0 | 0 |
N | Y | N | 25 | 15 |
Y | N | N | 0 | 10 |
N | N | N | 0 | 20 |
Given the data above, answer the following questions:
Find support and confidence for the rule:
If Analytics Department = Y, then Income > 50K
Find support and confidence for the rule:
If Analytics Department = Y and Tenure > 5, then Income > 50K
Using the 1-rule method discussed in class, find the relevant sets of classification rules for the target variable by testing each of the input attributes Analytics Department, Age > 30, and Tenure > 5. Which of these three sets of rules has the lowest misclassification rate?
Considering Income > 50K as the target variable, which of the attributes would you select as the root of a decision tree constructed using the information gain impurity measure?
Use the Gini index impurity measure and construct the full decision tree for this data set.
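The quantities asked for above can be computed directly from the table. The following is a minimal sketch, not the course's official solution. It assumes that support is the fraction of all respondents who satisfy both the rule's condition and its conclusion, and that confidence is the fraction of respondents satisfying the condition who also satisfy the conclusion; if the class defines support differently, adjust accordingly. All names (`ROWS`, `counts`, `support_confidence`, `one_rule_error`, `impurity_reduction`) are hypothetical helpers introduced for illustration.

```python
from math import log2

# Survey table: (analytics, age_gt_30, tenure_gt_5, n_income_high, n_income_low)
ROWS = [
    ("Y", "Y", "Y", 25, 0),
    ("N", "Y", "Y", 15, 0),
    ("Y", "N", "Y", 10, 5),
    ("Y", "Y", "N", 0, 0),
    ("N", "N", "Y", 0, 0),
    ("N", "Y", "N", 25, 15),
    ("Y", "N", "N", 0, 10),
    ("N", "N", "N", 0, 20),
]
# Column index of each input attribute in ROWS
ATTRS = {"Analytics Department": 0, "Age>30": 1, "Tenure>5": 2}


def counts(rows, cond=None):
    """Return (n_high, n_low) over rows matching cond, a {column: value} dict."""
    cond = cond or {}
    sel = [r for r in rows if all(r[i] == v for i, v in cond.items())]
    return sum(r[3] for r in sel), sum(r[4] for r in sel)


def support_confidence(rows, cond):
    """Support and confidence of the rule 'if cond then Income > 50K'.

    Assumption: support = P(cond and Income > 50K), confidence = P(Income > 50K | cond).
    """
    high, low = counts(rows, cond)
    total = sum(counts(rows))
    return high / total, high / (high + low)


def one_rule_error(rows, idx):
    """Misclassification rate when each attribute value predicts its majority class."""
    total = sum(counts(rows))
    errors = sum(min(counts(rows, {idx: v})) for v in "YN")
    return errors / total


def entropy(high, low):
    n = high + low
    return -sum((c / n) * log2(c / n) for c in (high, low) if c)


def gini(high, low):
    n = high + low
    return 1 - (high / n) ** 2 - (low / n) ** 2


def impurity_reduction(rows, idx, impurity):
    """Parent impurity minus the size-weighted impurity of the Y and N children."""
    high, low = counts(rows)
    total = high + low
    children = 0.0
    for v in "YN":
        h, l = counts(rows, {idx: v})
        if h + l:  # skip empty branches
            children += (h + l) / total * impurity(h, l)
    return impurity(high, low) - children


if __name__ == "__main__":
    print("Rule 1:", support_confidence(ROWS, {0: "Y"}))
    print("Rule 2:", support_confidence(ROWS, {0: "Y", 2: "Y"}))
    for name, idx in ATTRS.items():
        print(
            name,
            "1R error:", round(one_rule_error(ROWS, idx), 4),
            "info gain:", round(impurity_reduction(ROWS, idx, entropy), 4),
            "Gini reduction:", round(impurity_reduction(ROWS, idx, gini), 4),
        )
```

The sketch only scores candidate root splits. To build the full Gini tree, the same `impurity_reduction` call could be repeated on the subset of rows in each branch until a branch is pure or no attributes remain.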