
Question


I need help on the following question:

Q2 (Linear Regression) Write Python code in a Jupyter notebook that implements the gradient descent algorithm to train a linear regression model on the Boston housing dataset:

https://towardsdatascience.com/linear-regression-on-boston-housing-dataset-f409b7e4a155

Split the dataset into a training set (70% of samples) and a testing set (30% of samples). Print the root mean squared errors (RMSE) on the training and testing sets in the Jupyter notebook.

This is the code I have so far:

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from sklearn.datasets import load_boston
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error

# Load the Boston Housing dataset
boston_dataset = load_boston()

# Convert the dataset into a Pandas dataframe
boston = pd.DataFrame(boston_dataset.data, columns=boston_dataset.feature_names)
boston['MEDV'] = boston_dataset.target

# Split the dataset into a training set (70% samples) and a testing set (30% samples)
train_set, test_set, train_target, test_target = train_test_split(
    boston.drop("MEDV", axis=1), boston["MEDV"], test_size=0.3, random_state=42)

# Add a column of ones to the dataset (bias term)
train_set = np.c_[np.ones((train_set.shape[0], 1)), train_set]
test_set = np.c_[np.ones((test_set.shape[0], 1)), test_set]

# Initialize the weights and learning rate
weights = np.zeros(train_set.shape[1])
alpha = 0.01

# Define the number of iterations for gradient descent
num_iters = 1000

# Compute the cost function
def compute_cost(X, y, weights):
    y_pred = X.dot(weights)
    cost = (1/2*m) * np.sum(np.power((y_pred - y), 2))
    return cost

# Perform gradient descent
m = train_set.shape[0]
for i in range(num_iters):
    y_pred = train_set.dot(weights)
    weights = weights - (alpha/m) * (train_set.T.dot(y_pred - train_target))

# Compute the root mean squared error (RMSE) on the training set
train_rmse = np.sqrt(mean_squared_error(train_target, train_set.dot(weights)))
print("RMSE on the training set: ", train_rmse)

# Compute the root mean squared error (RMSE) on the testing set
test_rmse = np.sqrt(mean_squared_error(test_target, test_set.dot(weights)))
print("RMSE on the testing set: ", test_rmse)

I'm getting the following error: "ValueError: Input contains NaN, infinity or a value too large for dtype('float64')". How do I fix this? Please let me know, thank you!
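The most likely cause: the Boston features are on very different scales (TAX and B, for example, are in the hundreds), so with alpha = 0.01 the gradient descent updates diverge, the weights overflow to inf/NaN, and mean_squared_error then raises that ValueError. Below is a minimal sketch of one possible fix, not a verified solution: it standardizes the features before running the loop, and it also corrects the 1/(2*m) factor in compute_cost, which is a separate bug unrelated to the crash. Variable names follow the code above.

import numpy as np
import pandas as pd
from sklearn.datasets import load_boston
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error

# Load the data as in the original code
boston_dataset = load_boston()
boston = pd.DataFrame(boston_dataset.data, columns=boston_dataset.feature_names)
boston['MEDV'] = boston_dataset.target

# 70/30 split, as required
train_set, test_set, train_target, test_target = train_test_split(
    boston.drop("MEDV", axis=1).values, boston["MEDV"].values,
    test_size=0.3, random_state=42)

# Standardize the features using training-set statistics only.
# Without this, the raw feature scales make the update steps grow,
# the weights overflow to inf/NaN, and the ValueError appears.
mu = train_set.mean(axis=0)
sigma = train_set.std(axis=0)
train_set = (train_set - mu) / sigma
test_set = (test_set - mu) / sigma

# Add the bias column after scaling
train_set = np.c_[np.ones((train_set.shape[0], 1)), train_set]
test_set = np.c_[np.ones((test_set.shape[0], 1)), test_set]

m = train_set.shape[0]
weights = np.zeros(train_set.shape[1])
alpha = 0.01
num_iters = 1000

def compute_cost(X, y, weights):
    # Note the 1/(2*m) factor; the original (1/2*m) multiplies by m instead
    y_pred = X.dot(weights)
    return (1 / (2 * m)) * np.sum((y_pred - y) ** 2)

for i in range(num_iters):
    y_pred = train_set.dot(weights)
    weights = weights - (alpha / m) * train_set.T.dot(y_pred - train_target)
    if i % 100 == 0:
        # The cost should decrease steadily; if it grows, the learning rate is too high
        print(f"iteration {i}: cost = {compute_cost(train_set, train_target, weights):.2f}")

train_rmse = np.sqrt(mean_squared_error(train_target, train_set.dot(weights)))
test_rmse = np.sqrt(mean_squared_error(test_target, test_set.dot(weights)))
print("RMSE on the training set:", train_rmse)
print("RMSE on the testing set:", test_rmse)

Standardizing keeps one learning rate effective for every feature; an alternative would be a much smaller learning rate on the raw features, which also avoids the overflow but converges far more slowly. One caveat: load_boston was removed from recent scikit-learn releases (1.2 and later), so on a newer install the dataset has to be loaded from another source before this sketch applies.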
