Answered step by step
Verified Expert Solution
Question
1 Approved Answer
House price dataset. (20 points) The HOUSES dataset contains a collection of recent real estate listings in San Luis ObiSpO county and around it. The
House price dataset. (20 points) The HOUSES dataset contains a collection of recent real estate listings in San Luis ObiSpO county and around it. The dataset is provided in RealEstate.csv. You may use \"one-hot- keying\" to expand the categorical variables. The dataset contains the following useful elds (You may excluding the Location and MLS in your linear regression model). You can use any package for this question. ('0) Price: the most recent listing price of the house (in dollars). Bedrooms: number of bedrooms. Bathrooms: number of bathrooms. Size: size of the house in square feet. Price/Sth: price of the house per square foot. Status: Short Sale, Foreclosure and Regular. (10 points) Fit the Ridge regression model to predict Price from all variable. You can use one-hot keying to expand the categorical variable Status. Use 5-fold cross validation to select the regularizer optimal parameter, and show the CV curve. Report the tted model (i.e., the parameters), and the sum-of-squares residuals. You can use any package. The suggested search range for the regularization parameter is from 80 to 150. (10 points) Use lasso to select variables. Use 5-fold cross validation to select the regularizer optimal parameter, and show the CV curve. Report the tted model (i.e., the parameters selected and their coefficient). Show the Lasso solution path. You can use any package for this. The suggested search range for the regularization parameter is from 6000 to 8000
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started