Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Jun 23, 2024

First, we split the data set into a training set and a test set by using the following command lines. library(ISLR) data(Credit) set.seed(15) Credit

First, we split the data set into a training set and a test set by using the following command lines.

library(ISLR)

data("Credit")

set.seed(15)

Credit <- Credit[,-1] # remove ID column

train <- sample(nrow(Credit), 300)

Credit.train <- Credit[train, ]

Credit.test <- Credit[-train, ]

(a) Fit a tree to the training data, with Balance as the response and the other variables. Use the summary() function to produce summary statistics about the tree, and describe the results obtained. What is the training MSE? How many terminal nodes does the tree have?

(b) Type in the name of the tree object in order to get a detailed text output. Pick one of the terminal nodes, and interpret the information displayed.

(c) Create a plot of the tree, and interpret the results.

(d) Predict the response on the test data. What is the test MSE?

(e) Apply the cv.tree() function to the training set in order to determine the optimal tree size.

(f) Produce a plot with tree size on the x-axis and cross-validated error on the y-axis.

(g) Which tree size corresponds to the lowest cross-validated error? 1

(h) Produce a pruned tree corresponding to the optimal tree size obtained using cross-validation. If cross-validation does not lead to selection of a pruned tree, then create a pruned tree with five terminal nodes.

(i) Compare the training MSEs between the pruned and unpruned trees. Which is higher?

(j) Compare the test MSEs between the pruned and unpruned trees. Which is higher?

(k) Fit a bagging model to the training set with Balance as the response and the other variables. Use 1,000 trees (ntree = 1000). Use the importance() function to determine which variables are most important.

(l) Use the bagging model to predict the response on the test data. Compute the test MSE.

(m) Fit a random forest model to the training set with Balance as the response and the other variables. Use 1,000 trees (ntree = 1000). Use the importance() function to determine which variables are most important.

(n) Use the random forest to predict the response on the test data. Compute the test MSE.

(o) Fit a boosting model to the training set with Balance as the response and the other variables. Use 1,000 trees, and a shrinkage value of 0.01 ( = 0.01). Which predictors appear to be the most important?

(p) Use the boosting model to predict the response on the test data. Compute the test MSE.

(q) Fit a GAM to the training set with Balance as the response and the other variables, and use the GAM to predict the response on the test data. Compute the test MSE.

(r) Compare the test MSEs between the unpruned trees, pruned trees, bagging, random forest, boosting, and GAM. Which performs the best?

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Discrete Mathematics, Revised

Discrete Mathematics, Revised

Authors: Seymour Lipschutz, Marc Lipson

3rd Edition

0071615873, 9780071615877

More Books

Students also viewed these Mathematics questions

Question

★★★★★

Duke Energy Corporation runs 30 coal-fired electric generating units at eight plants in North and South Carolina. The units were placed in service between 1940 and 1975, and each includes a boiler...

Answered: 1 week ago

Question

★★★★★

Once a product is defined, what documents are used to assist production personnel in its manufacture? lop1

Answered: 1 week ago

Question

★★★★★

=+Explain how the four principles of experimental design are used in the Acela experiment described in the previous section (see page 724)

Answered: 1 week ago

Question

★★★★★

Refer to the preceding facts for Postmans acquisition of 80% of Spartans common stock and the bond transactions. Postman uses the simple equity method to account for its investment in Spartan. On...

Answered: 1 week ago

Question

★★★★★

Direct or reverse acting of controller if control Valve is a ) air - to - close b ) air - to - open?

Answered: 1 week ago

Question

★★★★★

Explain a) General duty requirements b) Code of conduct c) Due diligence I

Answered: 1 week ago

Question

★★★★★

4. [0/3 Points] DETAILS PREVIOUS ANSWERS LARCALC11 3.2.009. MY Find the two x-intercepts of the function f and show that f'(x) = 0 at some point between the two x-intercepts. f(x) = xVx+9 (x, y ) = x...

Answered: 1 week ago

Question

★★★★★

For the company 7-eleven, briefly describe their current inventory management, processes and functions employed within that infrastructure.

Answered: 1 week ago

Question

★★★★★

The information that follows pertains to Julia Company: (a) Temporary differences for the year 2024 are summarized below. Expenses deducted in the tax return, but not included in the income...

Answered: 1 week ago

Question

★★★★★

Provide 2 business threats as part of SWAT Analysis for Booktopia Pty Ltd (Australian online bookstore) based on one of the following trends: Political, Economic, Social and Cultural, Technological,...

Answered: 1 week ago

Question

★★★★★

A BA is documenting their planning and monitoring experience. Which task covers the question, "Have you decided how your deliverables were going to be reviewed and accepted?"

Answered: 1 week ago

Question

★★★★★

2 Part 2 of 2 6 points Required information [The following information applies to the questions displayed below.] Tamar Company manufactures a single product in two departments: Forming and Assembly....

Answered: 1 week ago

Question

★★★★★

1. A horizontal venturimeter, with inlet and throat diameters 300 mm and 100 mm, respectively is used to measure the flow of oil of specific gravity 0.88. The pressure intensity at inlet is 130kN/m...

Answered: 1 week ago

Question

★★★★★

Do not take criticism personally, even if it is meant to be personal.

Answered: 1 week ago

Question

★★★★★

2. To understand that as a general rule it takes about 3 minutes before the other part(s) have made up a 90% impression of you.

Answered: 1 week ago

Question

★★★★★

3. To understand that the first impression means a lot, and it takes a lot of energy to change this first impression.

Answered: 1 week ago

Previous Question Next Question