For these questions, I am confused about what is being asked. I had to use Excel to create the graphs.
| Region | State | County | Listing price | $ per square foot | Square feet | Random |
|---|---|---|---|---|---|---|
| East North Central | oh | summit | 185,800 | $101 | 1,847 | 0.142249 |
| East North Central | in | allen | 398,000 | $113 | 3,525 | 0.80362 |
| East North Central | oh | clark | 240,500 | $137 | 1,752 | 0.829857 |
| East North Central | in | henry | 235,000 | $148 | 1,588 | 0.393109 |
| East North Central | il | rock island | 166,300 | $127 | 1,305 | 0.939367 |
| East North Central | wi | milwaukee | 184,900 | $111 | 1,666 | 0.495266 |
| East North Central | il | sangamon | 213,500 | $130 | 1,643 | 0.615068 |
| East North Central | wi | portage | 531,000 | $109 | 4,888 | 0.558889 |
| East North Central | in | marion | 323,300 | $95 | 3,408 | 0.03697 |
| East North Central | in | madison | 229,100 | $187 | 1,224 | 0.937702 |
| East North Central | il | st. clair | 201,400 | $164 | 1,225 | 0.084852 |
| East North Central | oh | ross | 257,200 | $127 | 2,018 | 0.429272 |
| East North Central | | macoupin | 197,600 | $111 | 1,783 | 0.013935 |
| East North Central | | peoria | 187,900 | $131 | 1,434 | 0.323474 |
| East North Central | il | stephenson | 235,600 | $140 | 1,682 | 0.290222 |
| East North Central | mi | muskegon | 230,400 | $131 | 1,757 | 0.945958 |
| East North Central | il | winnebago | 236,700 | $140 | 1,692 | 0.626204 |
| East North Central | il | champaign | 246,700 | $121 | 2,031 | 0.602284 |
| East North Central | oh | pickaway | 265,700 | $143 | 1,853 | 0.642864 |
| East North Central | oh | jefferson | 246,500 | $136 | 1,814 | 0.835402 |
| East North Central | oh | huron | 199,700 | $147 | 1,359 | 0.518165 |
| East North Central | oh | muskingum | 188,300 | $94 | 1,999 | 0.877228 |
| East North Central | oh | stark | 201,000 | $163 | 1,230 | 0.402844 |
| East North Central | oh | sandusky | 253,900 | $146 | 1,738 | 0.381931 |
| East North Central | il | jackson | 154,300 | $105 | 1,463 | 0.870019 |
| East North Central | il | henry | 257,700 | $123 | 2,087 | 0.798229 |
| East North Central | wi | marathon | 431,200 | $119 | 3,638 | 0.212339 |
| East North Central | mi | lenawee | 191,500 | $118 | 1,628 | 0.47723 |
| East North Central | oh | darke | 160,800 | $114 | 1,416 | 0.079001 |
| East North Central | mi | jackson | 139,200 | $116 | 1,201 | 0.285844 |

| Statistic | Sample | National |
|---|---|---|
| Mean price | $239,690 | $342,365 |
| Median price | $229,750 | $318,000 |
| Std dev price | $84,080 | $125,914 |
| Mean sq ft | 1,930 | 2,111 |
| Median sq ft | 1,715 | 1,881 |
| Std dev sq ft | 841 | 921 |

Data Analysis
[Discuss how the regional sample created is reflective of the national market. Compare and contrast your regional sample with the national population using the National Statistics and Graphs document found in the Module Two Assignment Guidelines and Rubric. Explain how you have made sure that the sample is random. Explain your methods to get a truly random sample]

The Pattern
[Based on your graph, define each variable, and explain which variable will be useful for making predictions and why.]
[Describe the association between x and y in the scatterplot and determine its shape. Identify any outliers you see in the graph and explain why these occur and what they represent]
[If you had a 1,800 square foot house, based on the regression equation in the graph, what price would you choose to list at? Explain]

Scatterplot
[Chart: trend line y = 93.646x + 58973, R² = 0.8782; vertical axis from 100,000 to 600,000, horizontal axis from 1,000 to 5,000]

Scenario
Smart businesses in all industries use data to provide an intuitive analysis of how they can get a competitive advantage. The real estate industry heavily uses linear regression to estimate home prices, as cost of housing is currently the largest expense for most families. Additionally, in order to help new homeowners and home sellers with important decisions, real estate professionals need to go beyond showing property inventory. They need to be well versed in the relationship between price, square footage, build year, location, and so many other factors that can help predict the business environment and provide the best advice to their clients.

Prompt
You have been recently hired as a junior analyst by D.M. Pan Real Estate Company. The sales team has tasked you with preparing a report that examines the relationship between the selling price of properties and their size in square feet.
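The relationship between listing price and square feet in the posted sample can be checked outside Excel. The following is a minimal Python sketch, assuming the 30 rows listed in the sample table; it fits an ordinary least-squares line by hand and then evaluates the chart's trend-line equation at 1,800 square feet. The names `fit_line` and `predict_price` are illustrative only and are not part of the assignment.

```python
# Square feet (x) and listing price (y) for the 30 sampled homes, in table order.
SQFT = [1847, 3525, 1752, 1588, 1305, 1666, 1643, 4888, 3408, 1224,
        1225, 2018, 1783, 1434, 1682, 1757, 1692, 2031, 1853, 1814,
        1359, 1999, 1230, 1738, 1463, 2087, 3638, 1628, 1416, 1201]
PRICE = [185800, 398000, 240500, 235000, 166300, 184900, 213500, 531000,
         323300, 229100, 201400, 257200, 197600, 187900, 235600, 230400,
         236700, 246700, 265700, 246500, 199700, 188300, 201000, 253900,
         154300, 257700, 431200, 191500, 160800, 139200]


def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept, plus R squared."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r_squared = sxy ** 2 / (sxx * syy)
    return slope, intercept, r_squared


def predict_price(sqft, slope=93.646, intercept=58973):
    """Evaluate the trend line; defaults are the coefficients shown on the chart."""
    return slope * sqft + intercept


slope, intercept, r_squared = fit_line(SQFT, PRICE)
print(f"fitted line: y = {slope:.3f}x + {intercept:.0f}, R^2 = {r_squared:.4f}")
print(f"chart equation at 1,800 sq ft: {predict_price(1800):,.0f}")
```

With the posted data the fitted slope, intercept, and R² land very close to the chart's y = 93.646x + 58973 and R² = 0.8782, and the chart equation gives a listing price of roughly $227,500 for an 1,800 square foot house.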
You have been provided with a Real Estate Data Spreadsheet that includes properties sold nationwide in recent years. The team has asked you to select a region, complete an initial analysis, and provide the report to the team.

Note: In the report you prepare for the sales team, the response variable (y) should be the listing price and the predictor variable (x) should be the square feet.

Specifically, you must address the following rubric criteria, using the Module Two Assignment Template Word Document:

- Generate a Representative Sample of the Data
  o Select a region and generate a simple random sample of 30 from the data.
  o Report the mean, median, and standard deviation of the listing price and the square foot variables.
- Analyze Your Sample
  o Discuss how the regional sample created is or is not reflective of the national market.
    - Compare and contrast your sample with the population using the National Summary Statistics and Graphs Real Estate Data PDF document.
  o Explain how you have made sure that the sample is random.
    - Explain your methods to get a truly random sample.
- Generate Scatterplot
  o Create a scatterplot of the x and y variables noted above and include a trend line and the regression equation.
- Observe Patterns
  o Answer the following questions based on the scatterplot:
    - Define x and y. Which variable is useful for making predictions?
    - Is there an association between x and y? Describe the association you see in the scatterplot.
    - What do you see as the shape (linear or nonlinear)?
    - If you had a 1,800 square foot house, based on the regression equation in the graph, what price would you choose to list at?
    - Do you see any potential outliers in the scatterplot?
      - Why do you think the outliers appeared in the scatterplot you generated?
      - What do they represent?
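The first criterion (a simple random sample of 30, followed by the mean, median, and standard deviation) can be sketched as follows. This is a Python illustration under stated assumptions, not the Excel workflow used in the post: `simple_random_sample`, `summarize`, the row layout, and the placeholder population are all hypothetical. Only `POSTED_PRICES` is taken from the posted sample.

```python
import random
import statistics


def simple_random_sample(rows, region, n=30, seed=None):
    """Draw n rows without replacement from one region, each equally likely."""
    candidates = [row for row in rows if row["region"] == region]
    if len(candidates) < n:
        raise ValueError("region has fewer rows than the requested sample size")
    rng = random.Random(seed)  # a fixed seed makes the draw repeatable
    return rng.sample(candidates, n)


def summarize(values):
    """Mean, median, and sample standard deviation (n - 1 denominator)."""
    return {
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "stdev": statistics.stdev(values),
    }


# Hypothetical placeholder population; the real spreadsheet is not reproduced here.
population = [{"region": "East North Central", "row_id": i} for i in range(200)]
population += [{"region": "Pacific", "row_id": i} for i in range(200, 300)]
sample = simple_random_sample(population, "East North Central", n=30, seed=1)
print(len(sample), "rows drawn")

# Listing prices of the 30 homes in the posted sample.
POSTED_PRICES = [185800, 398000, 240500, 235000, 166300, 184900, 213500, 531000,
                 323300, 229100, 201400, 257200, 197600, 187900, 235600, 230400,
                 236700, 246700, 265700, 246500, 199700, 188300, 201000, 253900,
                 154300, 257700, 431200, 191500, 160800, 139200]
price_summary = summarize(POSTED_PRICES)
print(price_summary)
```

For the posted prices this gives a mean of 239,690, a median of 229,750, and a standard deviation of about 84,080, which agrees with the sample figures reported in the post. The same `summarize` call applies to the square feet column.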
Module Two Assignment Rubric

| Criteria | Exemplary | Proficient | Needs Improvement | Not Evident | Value |
|---|---|---|---|---|---|
| Generate a Representative Sample of the Data | N/A | Includes a random sample of 30 from a region and descriptive statistics for the sample (100%) | Shows progress toward proficiency, but with errors or omissions; areas for improvement may include a sample that is not truly random or has incorrect descriptive statistics (55%) | Does not attempt criterion (0%) | 20 |
| Analyze Your Sample | Exceeds proficiency in an exceptionally clear manner (100%) | Discusses how the regional sample created is or is not reflective of the national market and explains how the sample is random (85%) | Shows progress toward proficiency, but with errors or omissions; areas for improvement may include inaccurate descriptions of the extent to which the sample is reflective of the population and random (55%) | Does not attempt criterion (0%) | 25 |
| Generate Scatterplot | Exceeds proficiency in an exceptionally clear manner (100%) | Creates a scatterplot of the x and y variables including a trend line and the regression equation (85%) | Shows progress toward proficiency, but with errors or omissions; areas for improvement may include inaccuracies within the scatterplot or definitions of x and y (55%) | Does not attempt criterion (0%) | 20 |
| Observe Patterns | Exceeds proficiency in an exceptionally clear, insightful, or sophisticated manner (100%) | Makes cost projections based on the regression equation and discusses outliers (85%) | Shows progress toward proficiency, but with errors or omissions; areas for improvement may include inaccuracies in descriptions of association or shape or inaccuracies in cost projections or discussion of outliers (55%) | Does not attempt criterion (0%) | 25 |
| Articulation of Response | Exceeds proficiency in an exceptionally clear, insightful, sophisticated, or creative manner (100%) | Clearly conveys meaning with correct grammar, sentence structure, and spelling, demonstrating an understanding of audience and purpose (85%) | Shows progress toward proficiency, but with errors in grammar, sentence structure, and spelling, negatively impacting readability (55%) | Submission has critical errors in grammar, sentence structure, and spelling, preventing understanding of ideas (0%) | 10 |
| Total | | | | | 100% |