Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Oct 24, 2023

Part B: Use the Simmons data set in module 10. See the Excel file titled Simmons-data-raw in Module 10. Watch the video that explains

Part B: Use the Simmons data set in module 10. See the Excel file titled Simmons-data-raw in Module 10. Watch PartB-3 (1 point): If you were to ROLL OUT the logistic regression model to PREDICT coupon usage for a LARGE The Rules are as stated by the decision tree. WHICH CHURN SEGMENT DO YOU RECOMMEND FOCUSING ON? WHY? Finally, Hints for writing a good report for Part A: The report for PART A does not have to be long (no more than 5 DATA SCIENCE: Machine Learning TEAM PROJECT This programming assignment contains 2 independent parts,

Part B: Use the Simmons data set in module 10. See the Excel file titled Simmons-data-raw in Module 10. Watch the video that explains the contents of the file. The data set uses two predictors X1 = Annual spend on a similar credit card and X2 = Presence/Absence of the Simmons loyalty card to PREDICT Y = Will customer use coupon or not? For Part 2, build a logistic regression model to predict Y = coupon usage from X1 and X2 and then answer the following questions. PartB-1 (2 points): What are the coefficents (BETAs) for the logistic regression model? Answer as below: LR coefficents BETAO (or constant term) BETA1 (coeff. For X1) BETA2 (coeff. For X2) Value PartB-2 (2 points): Use the model above to compare TWO customers Jack and Jill. Jack spends $2000 annually (note: X1 for Jack = 2) and HAS the Simmons card (X2 = 1). Jill spends $4000 annually (X1 = 4) and does NOT have the Simmons card (X2 = 0). Who is more likely to use the coupon? (Hint: A complete answer must evaluate their probabilities for response). Probability of Response Jack Jill XXXX is more likely to respond because... PartB-3 (1 point): If you were to ROLL OUT the logistic regression model to PREDICT coupon usage for a LARGE database of customers, what CUTOFF probability will you choose? (Hint: No right or wrong answer here, but a concept such as a CONFUSION MATRIX may help make your call for cutoff probability). The Rules are as stated by the decision tree. WHICH CHURN SEGMENT DO YOU RECOMMEND FOCUSING ON? WHY? Finally, in appendices for PART A, place Jupyter notebooks Parts A.2 and A.3. Appendix A.2: Decision Tree cross validation notebook: construct this notebook by combining ideas from Churn_Telco and Iris practice_crossval notebooks. Appendix A.3: Logistic regression cross validation notebook: construct this notebook by combining ideas from Churn_Telco, WBCD and Iris_practice_crossval notebooks. NOTE: The data set used for logistic regression is STILL the Telco churn dataset. Page < 3 of 5 C | ZOOM + Hints for writing a good report for Part A: The report for PART A does not have to be long (no more than 5 pages), but should be super-clear. Imagine you are presenting the report to senior management. Here are some suggestions: Part A.1: Describe the numbers below in a table: Decision Tree Cross-validation Fold Fold 1 Fold 2 Fold 3 Fold 4 Fold 5 Fold 6 Fold 7 Fold 8 Fold 9 Fold 10 Average Error % Std. Dev. Error % See IRIS PRACTICE CROSSVAL Jupyter notebook in Module 6 for how to create a 10-fold cross-validation (that notebook shows you a 5-fold). Next place Part A.4 (not parts A.2, A.3): Y N Logistic Regression $ Benefit $ cost Churn segment 1... Churn segment 2... n $ cost $ benefit Write a few sentences supporting the $benefit/cost numbers. How did you come up with these numbers? Next place Part A.5 (not parts A.2, A.3): RULES for identifying Description in words Page 2 of 5 C | ZOOM + DATA SCIENCE: Machine Learning TEAM PROJECT This programming assignment contains 2 independent parts, creatively called Part A and Part B ) PART A (telco churn analysis): (10 points) For this part of the project, you will analyze the TelCo CHURN data set. Divide the entire data set into training and test sets (the test set should be 25% of the original data set). PART A Deliverable: Apply 10-fold cross-validation to build TWO distinct models to predict customer CHURN. The two techniques are a) Decision Trees and b) Logistic Regression. Use the best combination of predictor variables for this purpose (ok to use the variables in the Lab notebook for Decision trees). There are 5 parts to PART A of your submission: A.1: For both Decision Trees and Logistic Regression, report the accuracy for the 10 fold Also, compute the VERAGE accuracy across folds as well as the STANDARD DEVIATION of accuracy across the 10 folds. Which technique (LR or Trees) has a higher average accuracy? (2 points) A.2: Accurate Jupyter notebook pdf in Appendix A.2 of your Decision Tree cross validation code (2 points) A.3: Accurate Jupyter notebook pdf in Appendix A.3 of your Logistic Regression cross validation code (2 points) A.4: Consider the 4 cells (p, Y), (p, N), (n, Y) and (n, N) (see chapter on confusion matrix from Provost book). For each of these cells come up with a BENEFIT/COST for every customer that falls into the cell. There is no right or wrong answer here, but this has NOTHING to do with parts A.1,A.2,A.3 above. This is based upon a BUSINESS understanding of the costs/benefits of misclassification. State your rationale for the numbers you provide (2 points). A.5: Look carefully at ALL the predicted "CHURN/LEAVE" node-leafs of your decision tree. As a business manager, describe each churning segment in words. Recommend ONE choice of CHURN segment where you will focus your resources to reduce churn. Why did you pick this one segment from all the available alternatives? (2 points) TOTAL for PART A: 10 points. 1 Page 1 of 5 C | ZOOM +

Step by Step Solution

★★★★★

3.36 Rating (168 Votes )

There are 3 Steps involved in it

Step: 1

R Code The following R code should produce the same results data1Y... blur-text-image

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Income Tax Fundamentals 2013

Income Tax Fundamentals 2013

Authors: Gerald E. Whittenburg, Martha Altus Buller, Steven L Gill

31st Edition

1111972516, 978-1285586618, 1285586611, 978-1285613109, 978-1111972516

More Books

Students also viewed these Databases questions

Question

Superior Company provided the following data for the year ended December 31 (all raw materials are used in production as direct materials): Selling expenses Purchases of raw materials Direct labor...

Answered: 1 week ago

Question

★★★★★

Planning is one of the most important management functions in any business. A front office managers first step in planning should involve determine the departments goals. Planning also includes...

Answered: 1 week ago

Question

★★★★★

Answer the following questions: 1. Build the management-research question hierarchy for Starbuck project 2. the Duetto Card team turned to Green field Online to recruit a panel for one of its online...

Answered: 1 week ago

Question

★★★★★

Under Social Security, the family of a worker who dies while fully insured at the time of death has a right to survivors' benefits. True False

Answered: 1 week ago

Question

★★★★★

AMR Corporation (parent company of American Airlines) reported the following for 2007 (in millions). Service cost.................................................$370 Interest on...

Answered: 1 week ago

Question

★★★★★

2.48 Refer to data on age and diameter of oak trees analyzed in Exercise 2.32. a Construct a residual plot. b Comment on the shape of the residual plot. c Comment on the appropriateness of the linear...

Answered: 1 week ago

Question

★★★★★

=+c) Calculate the lower control limit of the p chart.

Answered: 1 week ago

Question

★★★★★

At a recent meeting of the accounting staff in your company, the controller raised the issue of using present value techniques to conduct impairment tests for some of the companys fixed assets. Some...

Answered: 1 week ago

Question

★★★★★

The bank portion of the bank reconciliation for Langer Company at November 30, 2017, was as follows. LANGER COMPANY Bank Reconciliation November 30, 2017 Cash balance per bank $14,677.90 Add:...

Answered: 1 week ago

Question

★★★★★

Ben Johnson is the manager of the jewelry department of a large chain of department stores. The department store has succeeded on the basis of customer service and quality of merchandise. As a...

Answered: 1 week ago

Question

★★★★★

Testing a prototype of a new product is an example of a: Multiple Choice Unit-level activity Batch-level activity Product-level activity. C ) Organization-sustaining activity

Answered: 1 week ago

Question

★★★★★

A newly formed task force to determine how many nurses are needed on each shift meets each week. The task force performance depends on the sum of individual performance, known as [ Select ] ....

Answered: 1 week ago

Question

★★★★★

Consider a game in which two students simultaneously choose effort levels span style="font-family: CambriaMath;font-size:8pt;color:rgb(0,0,0);font-style:normal;font-variant:normal;">, span...

Answered: 1 week ago

Question

★★★★★

You own a fast food business that only serves burgers, fries, and soft drinks. Part 1: Create a process flow chart for the order process from the time the customer approaches the counter to the time...

Answered: 1 week ago

Question

★★★★★

A politician develops legislation to increase economic equity. Their opponent refutes the legislation, saying, "He is a billionaire, so how can his policy be legitimate?" What kind of misdirection...

Answered: 1 week ago

Question

★★★★★

Please read the article in the link below and answer the questions. https://kdvr.com/news/macys-typo-leads-to-1500-necklace-being-sold-for-47/ Macy's typo leads to $1,500 necklace being sold for $47...

Answered: 1 week ago

Question

★★★★★

What are the 5 Cs of credit? List and describe each of them.

Answered: 1 week ago

Question

★★★★★

A condenser (heat exchanger) brings 1 kg/s water flow at 10 kPa quality 95% to saturated liquid at 10 kPa, as shown in Fig. P4.91. The cooling is done by lake water at 20C that returns to the lake at...

Answered: 1 week ago

Question

★★★★★

Jenny earns $34,500 in 2012. Calculate the FICA tax that must be paid by: Jenny: ..............................Soc,Sec. ..................$______________...

Answered: 1 week ago

Question

★★★★★

Deborah purchases a new $30,000 car in 2012 to use exclusively in her business. If Deborah does not elect to expense or take bonus depreciation in 2012 and holds the car until it is fully...

Answered: 1 week ago

Question

★★★★★

If Charles, a 16-year-old child model, earns $50,000 a year and is completely self supporting even though he lives with his parents, can his parents claim him as a dependent? Why or why not?...

Answered: 1 week ago

Question

★★★★★

For the data from Problem 2 in this chapter, address the following questions, using the information provided in the accompanying SAS output. a. State the regression model that incorporates the...

Answered: 1 week ago

Question

★★★★★

Refer to the data in Problem 19 of Chapter 5. The relationship between the rate of owner occupancy of housing units (OWNEROCC) and the median monthly ownership costs (OWNCOST) was studied in that...

Answered: 1 week ago

Question

★★★★★

Using the data from Problem 2 in Chapter 5 and/or the SAS output given here, answer the following questions about the separate straight-line regressions of SBP on QUET for smokers (SMK = 1) and...

Answered: 1 week ago

Previous Question Next Question