Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Sep 08, 2024

Question 1-K-Nearest Neighbors (KNN): (R studio) In this question, you will use the K-Nearest Neighbors (KNN) algorithm to predict whether the S&P 500 will go

Question 1-K-Nearest Neighbors (KNN): (R studio) In this question, you will use the K-Nearest Neighbors (KNN) algorithm to predict whether the S&P 500 will go up the next day or not. The dataset that you will use for this question is named Smarket and is part of the package ISLR. Install and load package ISLR. Now you have access to the package ISLR which contains the Smarket dataset. Run the following command to view the dataset: View(Smarket) Pass data frame Smarket to the function colnames. This should give you the column names of the Smarket dataframe. The results should be similar to the following: "volume" "Lag]" "Today" "Lag2" "Lag3" "Direction" "Lag4" "Lag5" Pass column Year of the data frame Smarket to the function unique. This should give you the unique values in the column Year which should be as follows: [1] 2001 2002 2003 2004 2005 Read pages 12 and 13 of the ISL documentation available at https://cran.r- project.org/web/packages/ISLR/ISLR.pdf to familiarize yourself more with this data frame. We would like to predict the Direction column using the columns Lag1, Lag2, Lag3, Lag4 , Lag5, and Volume. Divide the Smarket into two: 2001 through 2004 will be training data, and 2005 will be test data. Name the training dataset Smarket_train and test dataset Smarket test.

Create a subset of the Smarket_train data set which only includes columns 2 to 7 (Lag1, Lag?, Lag3, Lag4, Lag5, and Volume) and name it Smarket_train x.

Use the scale function to z-score standardize the Smarket_train data frames. The output of the scale function is a matrix. Don't forget to convert it to a dataframe using the as.data.frame function. Save the standardized Smarket on a variable named Smarket_train_z. Repeat the previous two steps on the Smarket test datasets, as well. You can name the outputs of these steps Smarket_ test _x and Smarket test z. Create a subset of the Smarket_train data set which only includes the Direction column and name it Smarket train label. Do the same task for the Smarket train dataset and name it Smarket_train_label. Run knn function, using k = 1, to obtain predictions for the test data. Check the accuracy of your prediction using the CrossTable function. Repeat the previous two tasks using k = 3 and k = 5. Which k has the highest accuracy?

Step by Step Solution

There are 3 Steps involved in it

Step: 1

Get Instant Access with AI-Powered Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

Step: 3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Students also viewed these Databases questions

Question

What is other comprehensive income (OCI) and where is it presented in the final accounts?

Answered: 1 week ago

Question

★★★★★

The balance sheet for the Heir Jordan Corporation follows. Based on this information and the income statement in the previous problem, supply the missing information using the percentage of sales...

Answered: 1 week ago

Question

★★★★★

Total undeposited taxes for Wednesday, May 12, are $2,600. Under the semiweekly rule, this deposit must be made no later than: (a) May 19 (b) May 11 (c) June 15 (d) May 12

Answered: 1 week ago

Question

★★★★★

AAC company is reviewing the behaviour of its various activities. MC is considering various costs under process costing system. Its costing system utilises two cost categories, direct materials and...

Answered: 1 week ago

Question

★★★★★

2. Rationalize each denominator. 53 e. 23+4 32 f. 23-5 3. Rationalize each numerator. 3-7 e. 4 23+ 7 f. 5 7. Calculate the slope of the tangent to each curve at the given point or value of x. 2 c. y=...

Answered: 1 week ago

Question

★★★★★

Develop a causal circle based on the following: Walmart has been receiving complaints regarding length of time spent waiting in line.

Answered: 1 week ago

Question

★★★★★

What would you consider specific information as opposed to passive information that may only help in a bind, Is there specific information that you would research for?

Answered: 1 week ago

Question

★★★★★

2.2. Choose a mineral as an example (eg. copper) and briefly describe how the following factors may influence the quantity of this mineral that will be demanded: 2.2.1. Price of the mineral (2)...

Answered: 1 week ago

Question

★★★★★

Do you find overt use of group-specific symbols/language offensive or exploitative?

Answered: 1 week ago

Previous Question Next Question