Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Given the scenario below; it is a small sample dataset for the student engagement in class routines and a prediction to whether such student will

Given the scenario below; it is a small sample dataset for the student engagement in class routines and a prediction to whether such student will pass or fail a particular course. You're required to apply the decision tree using Entropy and Gain Ratio to determine the factors that most significantly influence the outcome.
Hints log2x=log10xlog102 or simply logxlog2 and if it happens you encounter log0 set it to 0
\table[[Study Hours,Attendance,\table[[Previous],[Grades]],Participation,Result],[Low,Poor,Low,No,Fail],[Medium,Fair,Medium,Yes,Pass],[High,Good,High,Yes,Pass],[Low,Good,Low,No,Fail],[Medium,Poor,Medium,No,Fail],[High,Fair,High,Yes,Pass],[Low,Fair,Low,Yes,Fail],[Medium,Good,Medium,No,Pass],[High,Poor,High,Yes,Pass],[Low,Fair,Medium,Yes,Fail]]
Repeat question a) using Gini index and gain ratio. Compare the final decision tree you obtain in both a) and here. Is there any difference? If there is or not, explain the reason.
Hints: the steps for computing gain ratio are the same in a) and b) and the splitting information formula remains the same in both cases i.e it uses Logarithm. Only the gini index computation differs.
image text in transcribed

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Data Mining Concepts And Techniques

Authors: Jiawei Han, Micheline Kamber, Jian Pei

3rd Edition

0123814790, 9780123814791

More Books

Students also viewed these Databases questions