Question
In this project, we use a data set describing the sale of individual residential property in Ames, Iowa from 2006 to 2010, compiled by De Cock [1]. The data set contains 2930 observations and a large number of explanatory variables (23 nominal, 23 ordinal, 14 discrete, and 20 continuous) involved in assessing home values. The data set can be found here (https://www.statcrunch.com/app/index.html?dataid=3998101#).
The variables of interest are listed below.
Variable | Description |
Price | Sale price in USD. |
Area | Above grade (ground) living area square feet. |
Neighborhood | Physical locations within Ames city limits (map available). |
Bldg.Type | Type of dwelling. |
House.Style | Style of dwelling. |
Year.Built | Original construction date. |
Overall.Qual | Rates the overall material and finish of the house. |
Overall.Cond | Rates the overall condition of the house. |
Full.Bath | Full bathrooms above grade. |
Half.Bath | Half baths above grade. |
Fireplaces | Number of fireplaces. |
Yr.Sold | Year Sold (YYYY). |
Use this data set to answer the following questions.
- [10 points] Analyze the distribution of the following variables using the proper summary measures (mean/median/std dev/IQR/relative frequency/etc.) and graphs (histogram/boxplot/bar graph/pie chart/etc.). Do it separately for each variable.
  - Price
  - Bldg.Type
- [25 points] Draw the scatter plot for the bivariate data collected for Area and Price. Which of these two variables is the response variable? Which is the explanatory variable? Determine the least-squares regression line for the relation between these two variables. Interpret the meaning of the slope in context.
- [6 points] Suppose one property is randomly selected from this data set.
  - What is the probability that this property is a single family home?
  - What is the probability that this property is a single family home given that it is in the Somerset (Somerst in the data) neighborhood of Ames?
- Create side-by-side boxplots for the sales price of properties with different numbers of full bathrooms above grade. Be sure to give a few sentences comparing the similarities and differences of sales price across the full-bathroom categories.
- [10 points] Create a 95% confidence interval for the mean sales price of individual residential property in Ames, Iowa from 2006 to 2010. Be sure to include a statement interpreting the confidence interval result within the context.
- [15 points] Is the type of dwelling (Variable: Bldg.Type) related to the year the property is sold (Variable: Yr.Sold)? Use a 0.01 significance level to determine this. Be sure to demonstrate all 5 steps of the hypothesis testing process.
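The least-squares line, the confidence interval for the mean, and the test of independence asked for above can be sketched in plain Python. This is only an illustration under stated assumptions: the function names are hypothetical, the interval uses a large-sample z critical value rather than a t value (a reasonable approximation with 2930 observations, but an assumption), and the toy inputs are invented placeholders, not values from the Ames data.

```python
import math
from collections import Counter
from statistics import NormalDist, mean, stdev


def least_squares_line(x, y):
    """Return (intercept, slope) of the least-squares line y = b0 + b1 * x."""
    x_bar, y_bar = mean(x), mean(y)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope


def mean_confidence_interval(values, level=0.95):
    """Large-sample (z-based) confidence interval for a population mean."""
    z = NormalDist().inv_cdf(0.5 + level / 2)
    margin = z * stdev(values) / math.sqrt(len(values))
    centre = mean(values)
    return centre - margin, centre + margin


def chi_square_independence(rows, cols):
    """Return (statistic, degrees of freedom) for a test of independence."""
    n = len(rows)
    row_totals, col_totals = Counter(rows), Counter(cols)
    observed = Counter(zip(rows, cols))
    statistic = 0.0
    for r, r_total in row_totals.items():
        for c, c_total in col_totals.items():
            expected = r_total * c_total / n  # expected count if independent
            statistic += (observed[(r, c)] - expected) ** 2 / expected
    dof = (len(row_totals) - 1) * (len(col_totals) - 1)
    return statistic, dof


# Toy inputs for illustration only; these are NOT values from the Ames data.
area = [1000, 1500, 2000, 2500]
price = [100000, 150000, 200000, 250000]
intercept, slope = least_squares_line(area, price)
low, high = mean_confidence_interval(price)

bldg_type = ["1Fam", "1Fam", "Twnhs", "Twnhs"]
yr_sold = [2006, 2007, 2006, 2007]
statistic, dof = chi_square_independence(bldg_type, yr_sold)
```

With the real columns in place of the toy lists, the slope would be read as the estimated change in Price (USD) per additional square foot of Area, and the chi-square statistic would be compared with the critical value for the computed degrees of freedom at the 0.01 level.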
Reference: [1] Dean De Cock, Truman State University. Ames, Iowa: Alternative to the Boston Housing Data as an End of Semester Regression Project. Journal of Statistics Education, Volume 19, Number 3 (2011). www.amstat.org/publications/jse/v19n3/decock.pdf