Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

ISE - 2 9 1 : Homework 0 4 Page 3 of 9 a . Build a Random forest classifier for predicting the class label

ISE-291: Homework 04
Page 3 of 9
a. Build a Random forest classifier for predicting the class label with 4 trees. Fit the classifier
using the training set. Set criterion to entropy and random_state to 62.
b. Draw the trees using sci-kit learn (sklearn)
c. Test the classifier on the testing data set, and print the confusion matrix and classification
metrics (Accuracy, sensitivity (Recall), Precision) of the Random forest classifier.
d. Repeat A-4(a-c) using a Random forest with 8 trees instead of 4.
A-5.[10 marks]: Calculate the Information Gain (IG) for the class variable Drug given the feature
selected BP as a root node.
A-6.[10 marks]: From the decision tree built in A-3, write three classification rules using the
normalized values first then return it to the original values.
A-7.[10 marks]: Write an association rule for " BP -> Cholestrol", which rule has the highest
accuracy? Write the corresponding support and accuracy.
A-8.[10 marks]: Repeat parts b, c, and d in A-3 using the Nave Bayes GaussianNB classifier.
A-9. Compare the performance of the Nave Bayes against the built decision tree and random forest
classifiers using confusion matrix. Based on the comparison, which one is the best to use with
the given datat set?
Zybook: Solve all the following questions in zybook, and then have screenshots for each of your
answers and paste them as images in markdown cells in jupyter file.
Note: Although this HW is group-based, this problem shall be tried in zybook by each student.
Then, only one submission is required per team. Your instructor will check the zybook to
ensure that this problem activities are tried by each student.
A-10.[5 marks]: Do Participation Activity 11.3.2: Classifying Sira and Cali beans.
A-11.[5 marks]: Do Participation Activity 11.3.4: Classification trees with more than two categories.
A-12.[5 marks]: Do Challenge Activity 11.3.1: Classification trees.
A-13.[5 marks]: From Section 11.3 an impurity measure called Gini index is used instead of entropy.
Compare the between the two measures (i.e., entropy vs. Gini index) in terms of the minimum valu

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Pro PowerShell For Database Developers

Authors: Bryan P Cafferky

1st Edition

1484205413, 9781484205419

More Books

Students also viewed these Databases questions

Question

Define self-esteem and discuss its impact on your life.

Answered: 1 week ago