Question

2. (a) Four classification algorithms were fitted to a simulated dataset with a two-class classification problem: logistic regression, decision tree, random forest and a support vector machine with a radial basis kernel. The data with fits of the four models are displayed on page 6. Which of the plots correspond to the classification algorithms? [5 marks]

(b) What are the restrictions on the types of variables in the four classification algorithms in (a)? [4 marks]

(c) Name three other methods that are direct competitors to the classifiers in (a). [3 marks]

(d) Suppose we take a dataset, divide it into equally-sized training and testing sets and then try out two different classification procedures. First we use logistic regression and get an error rate of 20% on the training set and 30% on the test set. Next we use a k-nearest neighbours classifier with k = 1 and get an error rate of X% on the training set and Y% on the test set, where (X + Y)/2 = 18. Based on these results, which method should we prefer to use for classification of new results? Explain. [6 marks]

(e) A classifier gives the following result. In the table below, Group gives the true class, and Prob gives the estimated probability of Group A (positive) using the classifier.

      Group  Prob
 1    A      0.667
 2    A      0.450
 3    A      0.641
 4    A      0.456
 5    A      0.781
 6    A      0.943
 7    B      0.423
 8    B      0.736
 9    B      0.215
10    B      0.021

i. What are the predicted classes? Use a threshold of 0.5. What is the error rate? What is the false positive rate? The true positive rate? [8 marks]

ii. Describe how to construct the ROC curve for this problem. [7 marks]

3. (a) Explain how k-fold cross-validation is implemented
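The quantities asked for in part 2(e) can be checked with a short script. What follows is a minimal sketch, not an official solution. It assumes the ten rows of the table read as (true group, probability of A), that Group A is the positive class, and that an observation is predicted A when its probability is at least the threshold. No probability equals 0.5, so the choice between ">=" and ">" does not affect the result here. The function names are illustrative only.

```python
# Rows of the table in part (e): (true group, estimated probability of Group A).
data = [
    ("A", 0.667), ("A", 0.450), ("A", 0.641), ("A", 0.456), ("A", 0.781),
    ("A", 0.943), ("B", 0.423), ("B", 0.736), ("B", 0.215), ("B", 0.021),
]


def confusion(rows, threshold):
    """Return (TP, FP, TN, FN), predicting A when prob >= threshold (assumed rule)."""
    tp = fp = tn = fn = 0
    for group, prob in rows:
        predicted_a = prob >= threshold
        if group == "A":
            if predicted_a:
                tp += 1
            else:
                fn += 1
        else:
            if predicted_a:
                fp += 1
            else:
                tn += 1
    return tp, fp, tn, fn


def rates(rows, threshold):
    """Return (error rate, false positive rate, true positive rate)."""
    tp, fp, tn, fn = confusion(rows, threshold)
    error = (fp + fn) / len(rows)
    fpr = fp / (fp + tn)  # share of true B observations predicted as A
    tpr = tp / (tp + fn)  # share of true A observations predicted as A
    return error, fpr, tpr


def roc_points(rows):
    """Sweep the threshold over each distinct probability, highest first.

    An infinite threshold is added so the curve starts at (0, 0).
    Each point is (false positive rate, true positive rate).
    """
    thresholds = [float("inf")] + sorted({p for _, p in rows}, reverse=True)
    return [rates(rows, t)[1:] for t in thresholds]


if __name__ == "__main__":
    predictions = ["A" if p >= 0.5 else "B" for _, p in data]
    print("Predicted classes:", predictions)
    print("TP, FP, TN, FN:", confusion(data, 0.5))
    print("Error, FPR, TPR:", rates(data, 0.5))
    for fpr, tpr in roc_points(data):
        print(f"FPR = {fpr:.2f}, TPR = {tpr:.3f}")
```

Under these assumptions, a threshold of 0.5 misclassifies observations 2, 4 and 8, giving an error rate of 3/10, a false positive rate of 1/4 and a true positive rate of 4/6. Plotting the (FPR, TPR) pairs from `roc_points` in order and joining them gives the ROC curve, running from (0, 0) to (1, 1).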
