Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Statistics Research and Report Assignment - Brief Date Due: Week 6/ Session 12 Worth: 20% of your final grade 5 pm, 10 August 2022 In

Statistics Research and Report Assignment - Brief

Date Due: Week 6/ Session 12 Worth: 20% of your final grade 5 pm, 10 August 2022

In this assignment you will examine data used by a Real Estate investment advisor. She wants you to answer some specific questions put by clients about houses prices in the neighbourhood encompassed by 4 suburbs around city of Melbourne. The data is contained in the file 'Real_Estate.xls' and contains the following columns (variables):

Variable Name

ID Price Bedrooms Size Pool

Distance Suburb Garage

Random Sample:

Description

House Identity number Selling Price of the house (in 000's) Number of bedrooms House Size (m2) 0=House without a Pool 1=House with a Pool Distance from city centre (km) Suburb number 0=House without a Garage 1=House with a Garage

Before you begin your analysis, you are required to take a random sample of size 110 from the 170 cases in the file. Use the file Random_Sample_Generator-2.xls to do this. Answers to the questions below are to be based on your sample of 110 cases. Make sure to keep a safe copy of the sample you use since you cannot use Random_Sample_Generator-2.xls to reproduce it. Provide a printout of the data in your sample, with ID numbers in ascending order.

Part 1: Initial Data Analysis

Variable List

Using the variables listed in the table above, state for each variable whether it is

qualitative or quantitative.

If it is qualitative, state whether it is nominal or ordinal, and if it is quantitative, state

whether it is discrete or continuous.

Histogram

Create a histogram showing the distribution of selling price of the house.

Comment upon the shape of the distribution: is it symmetric? If it is not, is it positively or

negatively skewed?

Are there any outliers present? If so, are they of particular interest?

State which central measure would be best to use to describe the centre of this distribution,

and the reason(s) why.

Descriptive statistics

Prepare a table that shows the 5-number summary of price for houses in the 4 suburbs.

Construct side-by-side boxplots for the price of the houses in the 4 suburbs. Briefly

comment upon any differences you observe in house price for each suburb.

Are there any outliers present? If so, are they of particular interest?

State which central measure would be best to use to describe the centre of this distribution,

and the reason(s) why.

Prepare a summary table that shows the mean and standard deviation of Price for houses in

the 4 Suburbs according (subject) to the variable Bedrooms. Think carefully about the layout of the rows and columns of your table. As well as means and standard deviations you should also include the number of houses in each group. So each cell in your final table should contain the mean, the standard deviation and n, the number of houses in that group.

Refer to part (e). Comment, in bullet point form, on the Price of any combinations for Suburb and Bedrooms variables (i.e. cells in the table).

Statistical inferences One of the clients wants information on size of houses as it relates to price.

Produce a scatter plot of Price vs Size (Size should be on the horizontal axis). Make sure you

label your axes properly and that your graph has an appropriate title.

Refer to part (a). Briefly, describe the nature of the relationship between these 2 variables.

Now, create a new variable (column) labelled Size Group which divides Size up into two size

groups as follows:

Produce suitable graphs or charts to help in providing the information requested on the Size of the house as it relates to Price.

Construct 95% confidence interval for small and large houses Price.

Refer to (ii). Is there any interaction (overlap) between the 2 Confidence Intervals?

What does this tell you about the Prices for the two Sizes.

Part 2: Research Questions

Based on your random sample, identify and investigate TWO research questions of your own using inferential statistics (estimation and hypothesis testing).

ID Price Bedrooms Size Pool Distance Suburbs Garage
123 590 6 248 0 11.7 3 0
137 600 4 242 1 10.6 3 1
4 340 2 139 0 7.9 1 0
106 540 5 234 1 9.4 4 1
140 330 2 128 1 10.3 3 1
169 580 4 203 1 9.1 1 1
121 400 2 144 1 11.9 1 1
87 530 5 227 0 12.5 3 0
146 390 2 138 1 8.1 1 1
147 490 3 135 1 4.8 2 0
19 280 2 137 1 11.4 4 0
62 280 3 122 1 18.3 4 1
144 440 2 158 1 7.5 1 1
69 540 3 165 0 3.5 2 0
30 420 2 135 0 4.5 2 0
154 570 3 162 1 3.9 2 0
43 400 3 174 0 6.1 3 0
158 330 1 95 1 3.3 2 0
33 480 2 143 1 5.9 2 0
20 460 2 145 0 2.4 2 0
102 400 3 188 1 18.6 4 1
141 480 3 129 1 3 2 0
152 490 4 199 1 8 3 0
2 340 2 142 0 10.3 1 0
109 390 2 135 1 7.5 1 1
118 550 4 216 0 8.1 1 0
48 360 2 151 0 9.7 1 0
143 600 3 171 1 2.5 2 0
64 390 2 149 1 10.9 1 0
46 320 2 136 0 9.6 1 0
58 430 3 176 1 8.6 3 0
22 300 3 151 0 19.3 4 1
11 300 2 151 0 13.7 4 1
129 490 2 143 1 3.2 2 0
63 510 5 216 0 8.8 3 0
57 360 2 143 1 10 3 1
27 470 2 152 0 3.7 2 0
165 490 4 162 1 6.9 1 1
12 230 3 125 0 19.9 4 0
85 450 2 142 0 2.8 2 0
36 580 3 178 0 2.2 2 0
139 490 3 135 1 4.5 2 0
78 430 4 199 1 19.4 4 1
15 300 3 156 0 15.5 4 0
35 450 3 208 0 12.1 3 0
72 450 4 194 0 10.2 3 0
131 410 2 148 1 9.6 1 1
103 440 5 209 0 20.5 4 1
100 630 5 225 1 6 1 0
107 430 3 160 0 7.7 1 1
91 530 6 248 0 18 4 1
138 340 1 115 0 3.9 2 0
164 330 1 98 1 4.3 2 0
134 550 5 246 1 13.6 4 1
41 340 2 145 0 7.6 3 1
168 630 3 182 1 3.4 2 0
1 300 2 124 0 8.6 1 0
114 350 3 143 1 10.1 3 0
99 470 6 218 0 19.6 4 1
93 490 3 148 0 4 2 0
120 510 5 240 0 14.6 4 1
42 280 3 128 0 14.1 3 0
162 500 6 210 1 14.7 4 1
17 260 2 136 0 19.3 4 1
68 380 4 172 1 20.2 4 1
124 470 4 171 0 9.1 1 1
115 390 3 178 1 15.3 4 1
53 400 3 170 0 9.7 3 1
126 520 2 151 1 2.7 2 0
110 480 2 139 1 2.5 2 0
67 360 2 151 0 5.5 3 1
34 480 2 155 0 3.8 2 0
133 550 4 203 0 7.1 1 1
88 370 3 171 1 16.7 4 1
132 390 3 145 1 7.6 3 1
9 280 2 149 0 19.1 4 1
83 510 3 155 0 3.8 2 0
18 270 2 126 0 7.1 4 1
24 360 2 157 0 9.7 3 1
95 410 3 198 1 21.8 4 1
160 460 3 158 1 6.9 1 1
96 400 4 183 0 9.7 4 1
151 440 4 156 1 12.9 1 0
130 330 1 110 0 4.3 2 0
50 350 3 151 0 5.7 3 0
128 410 3 155 0 10.5 1 1
89 320 3 138 1 10.4 4 1
111 500 3 154 0 5.1 2 0
55 400 4 192 0 13.8 4 1
65 380 2 146 0 8.4 1 1
166 560 3 162 1 5.8 2 0
81 320 3 129 0 7.2 3 1
10 420 2 134 0 5 2 0
38 240 2 122 0 13.4 4 1
49 390 3 168 0 14.1 3 1
13 360 2 137 0 5.6 1 1
122 490 5 202 0 7.4 3 0
75 470 3 141 0 4 2 0
7 370 2 140 0 5.2 1 1
51 310 3 147 1 14.8 4 0
23 420 2 133 0 3.6 2 0
105 390 2 151 1 9.5 1 0
125 470 3 162 1 9.2 1 1
32 290 2 138 0 10.9 3 0
28 290 2 135 1 15 4 1
84 340 2 128 1 9 1 0
26 490 2 145 1 4.4 2 0
5 310 2 155 0 10.9 4 1
155 470 4 191 1 9 3 0
16 430 3 158 0 7.4 1 1

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Encyclopedia Of Distances

Authors: Michel Marie Deza, Elena Deza

3rd Edition

3662443422, 9783662443422

More Books

Students also viewed these Mathematics questions