Question

Boosting and Bagging using the eBay auction data (file eBayAuctions Ch 13.jmp) with the variable Competitive as the target.

Create a classification tree (Analyze > Predictive Modeling > Partition). Cast the variable "Competitive?" to Y, Response, the variable "Validation" to the Validation field, and all other variables to X, Factor.

In the Partition platform dialog window, use the "Go" button to create the model. Looking at the test set, what is the overall accuracy (red triangle option Show Fit Details)? What is the lift at portion = 0.20 (red triangle option Lift Curve)? Save the prediction formula to the data table (click Red Triangle > Save Columns > Save Prediction Formula).

  • The overall accuracy over the test set is 1-Misclassification = [ Select ] ["1-0.2340=0.766", "1 - 0.1362 = 0.8638", "1-0.367=0.633"]
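The relationship used above (accuracy = 1 - misclassification rate) can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not JMP's implementation; the function names and the toy labels are hypothetical and are not taken from the eBay data.

```python
def misclassification_rate(actual, predicted):
    """Fraction of rows whose predicted class differs from the actual class."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    if not actual:
        raise ValueError("need at least one row")
    wrong = sum(1 for a, p in zip(actual, predicted) if a != p)
    return wrong / len(actual)


def accuracy(actual, predicted):
    """Overall accuracy is the complement of the misclassification rate."""
    return 1.0 - misclassification_rate(actual, predicted)


# Hypothetical toy labels (1 = competitive auction, 0 = not competitive).
actual = [1, 0, 1, 1, 0, 0, 1, 0, 1, 0]
predicted = [1, 0, 0, 1, 0, 1, 1, 0, 1, 0]

print(misclassification_rate(actual, predicted))  # 2 of 10 rows are wrong
print(accuracy(actual, predicted))
```

In JMP itself, the misclassification rate is read from the Fit Details report for the relevant row of the Validation column, and the subtraction is done by hand.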

Review the lift curve over the test set. The lift at portion = 0.2 for Competitive = 1 is approximately [ Select ] ["1.8", "1.4", "1.2"] , and the lift for Competitive = 0 is approximately [ Select ] ["1.9", "1.4", "2.3"] .
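To make the lift reading concrete: lift at portion 0.2 compares the rate of the target class among the 20% of rows with the highest predicted probability for that class to the rate of that class over all rows. The sketch below assumes that definition; `lift_at_portion` and the toy data are hypothetical, and JMP's own handling of ties and rounding may differ.

```python
def lift_at_portion(actual, prob_of_target, target=1, portion=0.2):
    """Lift = target rate in the top `portion` of rows (ranked by predicted
    probability of `target`) divided by the target rate over all rows."""
    n = len(actual)
    if n == 0 or n != len(prob_of_target):
        raise ValueError("inputs must be non-empty and of equal length")
    overall_rate = sum(1 for a in actual if a == target) / n
    if overall_rate == 0:
        raise ValueError("target class does not occur in the data")
    # Rank rows from highest to lowest predicted probability of the target.
    ranked = sorted(zip(prob_of_target, actual), key=lambda pair: pair[0], reverse=True)
    top_n = max(1, round(portion * n))
    top_rate = sum(1 for _, a in ranked[:top_n] if a == target) / top_n
    return top_rate / overall_rate


# Hypothetical toy data: actual classes and predicted P(Competitive = 1).
actual = [1, 1, 0, 1, 0, 1, 0, 0, 1, 0]
p1 = [0.95, 0.90, 0.80, 0.70, 0.60, 0.55, 0.40, 0.30, 0.20, 0.10]
p0 = [1 - p for p in p1]  # predicted P(Competitive = 0)

print(lift_at_portion(actual, p1, target=1))
print(lift_at_portion(actual, p0, target=0))
```

A lift of 1 means the model ranks rows no better than random selection for that class; each class has its own lift curve, which is why the question asks for two values.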

Run the same tree (use red triangle option Redo > Relaunch Analysis), but first select Boosted Tree as the method (from the drop-down menu) and set the random seed to 123 (use the Random Seed field in the next dialog window). Don't change the other default settings. What is the overall accuracy? What is the lift at portion = 0.20? Save the prediction formula to the data table (click Red Triangle > Save Columns > Save Prediction Formula).

The overall accuracy of the Boosted Tree method on the test set is [ Select ] ["1 - 0.1594 = 0.8406", "1-0.2567=0.7433", "1-0.1122=0.8878"]

  • Review the lift curve on the test set (Red Triangle Option Lift Curve). The lift at portion = 0.2 for Competitive = 1 is approximately [ Select ] ["1.78", "1.05", "0.65"] , and the lift for Competitive = 0 is approximately [ Select ] ["1.9", "1.25", "1.4"] .

Now, try the same tree (use red triangle option Redo > Relaunch Analysis) with the Bootstrap Forest method selected, set the random seed to 123 (in the next dialog window), and accept the default settings. For the test set, what is the overall accuracy? What is the lift at portion = 0.20? Again, save the prediction formula to the data table (click Red Triangle > Save Columns > Save Prediction Formula).

The overall accuracy of the Bootstrap Forest Method on the test set is 1-Misclassification = [ Select ] ["1-0.1362=0.8638", "1 - 0.2879 = 0.7121", "1-0.7889=0.2111"] .

  • The lift at portion = 0.2 for Competitive = 1 is approximately [ Select ] ["1.01", "1.77", "0.67"] , and the lift for Competitive = 0 is approximately [ Select ] ["0.45", "1.21", "1.98"] .
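The reason the exercise fixes the random seed is that bagging methods draw random bootstrap samples, so results change from run to run unless the seed is set. The sketch below illustrates only that idea (sampling rows with replacement under a fixed seed, then combining votes); it is not JMP's Bootstrap Forest algorithm, the helper names are hypothetical, and seed 123 in Python will not reproduce JMP's numbers.

```python
import random
from collections import Counter


def bootstrap_sample(rows, rng):
    """Draw len(rows) rows with replacement (one bootstrap sample)."""
    return [rng.choice(rows) for _ in rows]


def draw_samples(rows, n_samples, seed):
    """Draw several bootstrap samples from one seeded random generator."""
    rng = random.Random(seed)
    return [bootstrap_sample(rows, rng) for _ in range(n_samples)]


def majority_vote(votes):
    """Combine class votes from several models by taking the most common label."""
    return Counter(votes).most_common(1)[0][0]


rows = list(range(10))  # hypothetical row indices of a training set

run_a = draw_samples(rows, n_samples=3, seed=123)
run_b = draw_samples(rows, n_samples=3, seed=123)

print(run_a == run_b)            # same seed, so the same samples are drawn
print(majority_vote([1, 0, 1]))  # two of three hypothetical trees vote for class 1
```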

Compare the three models using the Model Comparison platform under Analyze > Predictive Modeling. In the Model Comparison platform's launch window, cast the variable "Validation" to the Group field and click "OK". Compare the misclassification rates of the three models over the validation set. Which model has the best accuracy?

  • The Model [ Select ] ["Bootstrap Forest", "Classification tree (Partition)", "Boosted Tree"] has the best accuracy over the validation set
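The comparison above amounts to computing each model's misclassification rate separately for every value of the grouping column and picking the model with the lowest rate in the validation group. A minimal sketch, using hypothetical model names and toy predictions rather than the actual eBay results:

```python
def misclassification_by_group(actual, predicted, group):
    """Misclassification rate computed separately for each group label."""
    totals, wrong = {}, {}
    for a, p, g in zip(actual, predicted, group):
        totals[g] = totals.get(g, 0) + 1
        wrong[g] = wrong.get(g, 0) + (1 if a != p else 0)
    return {g: wrong[g] / totals[g] for g in totals}


# Hypothetical toy data: 4 training rows followed by 4 validation rows.
actual = [1, 0, 1, 0, 1, 0, 1, 0]
group = ["Training"] * 4 + ["Validation"] * 4
predictions = {
    "model_a": [1, 0, 1, 0, 1, 1, 0, 0],
    "model_b": [1, 0, 0, 0, 1, 0, 1, 1],
    "model_c": [1, 1, 1, 0, 1, 0, 1, 0],
}

# Keep only the validation-group rate for each model, then pick the lowest.
validation_rates = {
    name: misclassification_by_group(actual, pred, group)["Validation"]
    for name, pred in predictions.items()
}
best = min(validation_rates, key=validation_rates.get)

print(validation_rates)
print(best)
```

Comparing on the validation group rather than the training group matters because training-set error usually understates how a model performs on new data.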

Now, choose Model Averaging (red triangle option) in the Model Comparison for Validation. This will create another prediction column in your data set. Now relaunch the Model Comparison platform (Redo > Relaunch Analysis) with the "Validation" variable in the Group field. Compare the misclassification rates of all models. What is the misclassification rate of Model Averaged on the test set?

  • The misclassification rate of Model Averaged on the validation set is [ Select ] ["0.4531", "0.3452", "0.0848"]
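As a rough illustration of what an averaged model does, the sketch below assumes that model averaging takes the row-wise arithmetic mean of the predicted probabilities from the saved models and then classifies with a 0.5 cutoff. The function names, the cutoff, and the probabilities are assumptions for illustration, not values from the eBay data.

```python
def average_probabilities(*model_probs):
    """Row-wise arithmetic mean of predicted probabilities from several models."""
    if not model_probs:
        raise ValueError("need at least one model")
    n = len(model_probs[0])
    if any(len(p) != n for p in model_probs):
        raise ValueError("all models must score the same rows")
    return [sum(row) / len(row) for row in zip(*model_probs)]


def classify(probs, cutoff=0.5):
    """Predict class 1 when the averaged probability reaches the cutoff."""
    return [1 if p >= cutoff else 0 for p in probs]


# Hypothetical P(Competitive = 1) from three models for the same four rows.
tree_p = [0.9, 0.4, 0.2, 0.6]
boosted_p = [0.8, 0.6, 0.1, 0.3]
forest_p = [0.7, 0.8, 0.3, 0.3]

averaged = average_probabilities(tree_p, boosted_p, forest_p)
classes = classify(averaged)

print(averaged)
print(classes)
```

The averaged predictions can then be scored with the same misclassification-rate calculation as any single model, which is what relaunching Model Comparison does.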

Note: If you make a mistake at any step listed above, download the dataset again and start the analysis from scratch.

Step by Step Solution

Step: 1

To solve this task, follow these detailed steps using the given dataset in JMP.

Step 1: Creating a Classification Tree

1. Load the Dataset: Open JMP and lo...
