3. [Decision Trees] You own a movie theater and are trying to understand your market: what...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
3. [Decision Trees] You own a movie theater and are trying to understand your market: what types of people frequently go to the movies? You start with the following dataset with data about 6 people with different age groups, income levels, and professions, and whether or not they frequently go to movie theaters. In particular, you are going to build a decision tree to predict whether or not someone is a frequent movie-goer. No High Income? Engineer? Movie Goer? T F T T 1 2 3 4 T F T F Yes Yes Yes No Recall the following definitions of entropy and information gain, respectively, which are useful for this problem: H(Z) == P(Y = y) log P(Y = y) Y IG(Z, j,t) = H (Z) H(Z[x; = t])P(x; = t) - H(Z[x; t])P(xj t). a. (4 pts) Based on the principle of information gain, which attribute is to be used for the first split? Be sure to show your computations. You can round the entropy and information gain values to two decimal places. b. (4 pts) Draw the complete (unpruned) decision tree, showing the class predictions at the leaves. Assuming you are using LaTeX, you may (i) very neatly hand draw the tree, photograph it, and include it as a figure, (ii) draw it using a graph- ics program or PowerPoint, or (iii) express the tree in a series of if statements, preferably using LaTeX's verbatim environment. c. (2 pts) From the Decision Tree constructed in the previous question, predict whether a person who has high income but is not an engineer is a movie goer. 3. [Decision Trees] You own a movie theater and are trying to understand your market: what types of people frequently go to the movies? You start with the following dataset with data about 6 people with different age groups, income levels, and professions, and whether or not they frequently go to movie theaters. In particular, you are going to build a decision tree to predict whether or not someone is a frequent movie-goer. No High Income? Engineer? Movie Goer? T F T T 1 2 3 4 T F T F Yes Yes Yes No Recall the following definitions of entropy and information gain, respectively, which are useful for this problem: H(Z) == P(Y = y) log P(Y = y) Y IG(Z, j,t) = H (Z) H(Z[x; = t])P(x; = t) - H(Z[x; t])P(xj t). a. (4 pts) Based on the principle of information gain, which attribute is to be used for the first split? Be sure to show your computations. You can round the entropy and information gain values to two decimal places. b. (4 pts) Draw the complete (unpruned) decision tree, showing the class predictions at the leaves. Assuming you are using LaTeX, you may (i) very neatly hand draw the tree, photograph it, and include it as a figure, (ii) draw it using a graph- ics program or PowerPoint, or (iii) express the tree in a series of if statements, preferably using LaTeX's verbatim environment. c. (2 pts) From the Decision Tree constructed in the previous question, predict whether a person who has high income but is not an engineer is a movie goer.
Expert Answer:
Answer rating: 100% (QA)
a To determine the attribute to be used for the first split we need to calculate the information gai... View the full answer
Related Book For
Business Intelligence And Analytics Systems For Decision Support
ISBN: 9781292009209
10th Global Edition
Authors: Efraim Turban, Ramesh Sharda, Dursun Delen, Pearson Education Limited, Dennis G. Zill
Posted Date:
Students also viewed these programming questions
-
If a trespass to property permanently deprives the property holder of the use of their property, what tort has been committed?
-
Planning is one of the most important management functions in any business. A front office managers first step in planning should involve determine the departments goals. Planning also includes...
-
Prosci's change management methodology is developed based on research with over 3,400 participants over the last twenty years. What is unique about the methodology is that it comes from real project...
-
You are the assistant vice president in charge of production for a firm that produces computers. Your firm's production function is f(L,K) = min (L,K) Where L and K are the quantities of the two...
-
According to the simple de Broglie model, how many wavelengths are there in an electron wave in the first orbit? In the second orbit? In the nth orbit?
-
The dynamic aggregate demand curve is a downward-sloping relationship between inflation and the quantity of output demanded by those who use it: a. Aggregate expenditure = Consumption + Investment +...
-
Identify all subsets of the real numbers to which the following real numbers belong: 1. 14 2. -14.223 3. \(\sqrt{17}\)
-
Excelsior Amusement Park has a fiscal year ending on September 30. Selected data from the September 30 worksheet are presented below. Instructions (a) Prepare a complete worksheet. (b) Prepare a...
-
pleasehelp with this Variance Analysis The company created a flexible budgeted income statement to compare to actual results for the year. From the data provided, you will calculate variances for...
-
Agata Polanska opened her first Pierogi Factory restaurant in Kitchener, Ontario in 2015. Customers love her authentic Polish pierogi, so Agata has steadily expanded her business to include four...
-
Floating Speed Boat has completed its journal entries for the month of June and posted them to the general ledger. Based on the ledger balances, an adjusted trial balance has been prepared. The...
-
A Kelvin thermometer indicates the same temperature as a Rankine thermometer. What is the temperature in Celsius? Input an integer.
-
Assume you hold a portfolio. Would you buy the asset under examination if your aim is to make the portfolio you hold risk-neutral? Discuss.
-
1. A firm is expected to generate return on equity of 13.2% in the next year. It is expected to generate EPS of $4.69 in the next year. It maintains a constant payout ratio of 45%. If the cost of...
-
Evaluate other areas of financial analysis for Coca Cola and Pepsi: capital spending, stock growth, beta values, credit rating service valuations (if possible), bond rating valuations (if possible),...
-
Inside our eyes, before the retina, is the liquid vitreous humor, with a refractive index about n = 1 . 3 . Find the light speed inside the vitreous humor
-
Rizzi Co. is growing quickly.Dividends are expected to grow at a 20% rate for the next three years, with the growth rate falling off to a constant 6% thereafter.If the required return is 12% and the...
-
Trade credit from suppliers is a very costly source of funds when discounts are lost. Explain why many firms rely on this source of funds to finance their temporary working capital.
-
Examine the difficulties to implement a new DSS over legacy systems.
-
List the different problem-solving search methods.
-
What is an ES?
-
Each business day, on average, a company writes checks totaling \($25,000\) to pay its suppliers. The usual clearing time for the checks is four days. Meanwhile, the company is receiving payments...
-
An undamped, unforced Duffing Equation, \(\ddot{x}+\omega^{2} x+\epsilon x^{3}=0\), can be solved exactly in terms of elliptic functions. Determine the solution of this equation and determine if...
-
Purple Feet Wine, Inc., receives an average of \($7,500\) in checks per day. The delay in clearing is typically six days. The current interest rate is .055 percent per day. a. What is the companys...
Study smarter with the SolutionInn App