Given the data set below for the binary class problem. Keep 2 decimal places. ID Attribute...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Given the data set below for the binary class problem. Keep 2 decimal places. ID Attribute A Attribute B Attribute C Class label 1 T F 2 T T T T T T F F 3 4 5 6 7 9 10 F T F T F F F T F 3.5 2.0 6.7 5.5 4.8 2.3 3.7 4.0 5.1 Y Y Y N Y N ZZZZ N N N N (a) Calculate the information gain when splitting on attribute A and B. Which attribute would the decision tree induction algorithm choose? Show all your work. [10] (b) Calculate the reduction in impurity using Gini index when splitting on A and B. Which attribute would the decision tree induction algorithm choose? Show all your work. [10] (c) Compare information gain and Gini index. Explain your results in part (a) and (b). [5] (d) For the attribute C, which is a continuous attribute, describe how you compute the information gain. [5] Given the data set below for the binary class problem. Keep 2 decimal places. ID Attribute A Attribute B Attribute C Class label 1 T F 2 T T T T T T F F 3 4 5 6 7 9 10 F T F T F F F T F 3.5 2.0 6.7 5.5 4.8 2.3 3.7 4.0 5.1 Y Y Y N Y N ZZZZ N N N N (a) Calculate the information gain when splitting on attribute A and B. Which attribute would the decision tree induction algorithm choose? Show all your work. [10] (b) Calculate the reduction in impurity using Gini index when splitting on A and B. Which attribute would the decision tree induction algorithm choose? Show all your work. [10] (c) Compare information gain and Gini index. Explain your results in part (a) and (b). [5] (d) For the attribute C, which is a continuous attribute, describe how you compute the information gain. [5]
Expert Answer:
Answer rating: 100% (QA)
a To calculate the information gain we first need to calculate the entropy of the class labels and the weighted entropy after splitting on each attribute Entropy of class labels There are 6 positive Y ... View the full answer
Related Book For
An Introduction To Statistical Methods And Data Analysis
ISBN: 9781305465527
7th Edition
Authors: R. Lyman Ott, Micheal T. Longnecker
Posted Date:
Students also viewed these programming questions
-
Question 3 You have worked for more than 10 years. You have spent your money wisely and invest the extra money you have in shares. Your investment in shares did quite well until the recent years when...
-
answer all questions as instructed below. attend all questions. 4 Computer Vision (a) Explain why such a tiny number of 2D Gabor wavelets as shown in this sequence are so efficient at representing...
-
"Fortran, Algol and Lisp invented most programming language concepts 50 years ago; adding the concept of object-orientation suffices to explain all programming languages to date". To what extent is...
-
Tiffee Company identifies the following items for possible inclusion in the physical inventory. Indicate whether each item should be included or excluded from the inventory taking. (a) 900 units of...
-
Suppose that 50% of all young adults prefer McDonald's to Burger King when asked to state a preference. A group of 10 young adults were randomly selected and their preferences recorded. a. What is...
-
An electron in a hydrogen atom in the level n 5 undergoes a transition to level n 3. What is the frequency of the emitted radiation?
-
Bitter man Company has completed all of its operating budgets. The sales budget for the year shows 50,000 units and total sales of $2,000,000. The total unit cost of making one unit of sales is $28....
-
Central Incorporated has two items in inventory as of December 31, 2011. Each item was purchased for $40. Company management chose to write down Item # 1 $28, which at year end was assessed to be its...
-
Use the Law of Detachment to write a conclusion from the following statements. If a person wants to be an engineer, then that person needs to get a degree in engineering. Michael wants to be an...
-
Question A: Stay Safe International manufactures industrial safety equipment at its plant in Evans- ville, Indiana. The company has initiated DRP to coordinate finished goods distribution from the...
-
If G, and G, are commutator subgroup and centre of the dihedral group D, respectively. Then select the incorrect statement. G G {e}
-
Beyer Company is considering buying an asset for $290,000. It is expected to produce the following net cash flows. Net cash flows Year 1 $70,000 Year 2 $40,000 Year 3 $70,000 Year 4 $200,000 Year 5...
-
2. Why vibrational excitation of molecules by electron impact mostly proceeds through intermediate electron attachment and not by simple elastic collision?
-
Plantwide Overhead Rate, Activity-Based Costing, Job Costs Foto-Fast Copy Shop provides a variety of photocopying and printing services. On June 5, the owner invested in some computer-aided...
-
Company D is expected to pay a dividend of $2 once a year. It is expected to sell for $44 1 year from today. The equity cost of capital is 15%. What is the expected capital gain rate from the sale of...
-
The managers of Rochester Manufacturing are discussing ways to allocate the cost of service departments such as Quality Control and Maintenance to the production departments. To aid them in this...
-
Excalibur Corporation sells video games for personal computers. The unadjusted trial balance as of December 31, 2024, appears below. December 31 is the company's reporting year-end. The company uses...
-
Q:1 Take any product or service offered in Pakistan and apply all determinents of customer Perceived value ?
-
Refer to the study described in Exercise 14.11. Suppose a new study is to be designed in which only three levels of milk fat and three levels of air will be used. Determine the number of replications...
-
The sales manager for a very exclusive brand of automobile declares that, after his staff had completed a new training program, less than 10% of their clients were dissatisfied with the service...
-
A medical researcher designed an experiment to study the impact of three exercise regimens (30, 60, and 90 minutes per week) on the total blood cholesterol level in active adult males. The...
-
Copying of the successful practices of others is called __________. (a) mimicry (b) scanning (c) grafting (d) strategy
-
The process of acquiring individuals, units, and/or firms to bring in useful knowledge to the organization is called __________. (a) grafting (b) strategy (c) scanning (d) mimicry
-
A firm which refuses to use debt in its capital structure even though it is capable of doing so will: a. Have too low of a WACC b. Incorrectly calculate prospective project NPVs as being too high c....
Study smarter with the SolutionInn App