Question
Consider the task of building a decision tree classifier from random data, where the attribute values are generated randomly irrespective of the class labels. Assume
Consider the task of building a decision tree classifier from random data, where the attribute values are generated randomly irrespective of the class labels. Assume the data set contains records from two classes, + and ?. Half of the data set is used for training while the remaining half is used for testing. Answer each question below and justify your answer. (i) Suppose there are an equal number of positive and negative records in the data and the decision tree classifier predicts every test record to be positive. What is the expected error rate of the classifier on the test data? (ii) Repeat the above analysis in (i) assuming that the classifier predicts each test record to be positive with probability 0.8 and to be negative with probability 0.2. (iii) Suppose two-thirds of the data belong to the positive class and the remaining one-third belong to the negative class. What is the expected error of a classifier that predicts every test record to be positive? (iv) Repeat the above analysis in (iii) assuming that the classifier predicts each test record to be positive class with probability 2/3 and negative class with probability 1/3.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started