The accompanying data set contains four predictor variables (x 1 , x 2 , x 3 ,
Question:
The accompanying data set contains four predictor variables (x1, x2, x3, and x4) and the target variable (y).
a. Bin predictor variables x1, x2, x3, and x4. For Analytic Solver, choose the Equal interval option and 2 bins for each of the four variables. For R, bin x1 into [0, 40000) and [40000, 80000); x2 into [0, 50) and [50, 100); x3 into [50, 75) and [75, 100); and x4 into [0, 20000) and [20000, 40000). What are the bin numbers for the variables of the first two observations?
b. Partition the transformed data to develop a naïve Bayes classification model where “1” denotes the positive or success class for y. Report the accuracy, sensitivity, specificity, and precision rates for the validation data set.
c. Generate the ROC curve. What is the area under the ROC curve (or the AUC value)?
d. Can the naïve Bayes model be used to effectively classify the data? Explain your answer.
Step by Step Answer:
Business Analytics Communicating With Numbers
ISBN: 9781260785005
1st Edition
Authors: Sanjiv Jaggia, Alison Kelly, Kevin Lertwachara, Leida Chen