Question
Suppose a graduate student does a survey of undergraduate study habits on his university campus. He collects data on students who are in different years in college by asking them how many hours of course work they do for each class in a typical week. A sample of four students provides the following data on year in college and hours of course work per class:
Student | Year in College | Course Work Hours per Class |
---|---|---|
1 | Freshman (1) | 8 |
2 | Sophomore (2) | 8 |
3 | Junior (3) | 7 |
4 | Senior (4) | 5 |
A scatter plot of the sample data is shown here (blue circle symbols). The line Y = -2X + 11 is shown in orange.
Think about how close the line Y = -2X + 11 is to the sample points. Look at the graph and find each point's vertical distance from the line. If the point sits above the line, the distance is positive; if the point sits below the line, the distance is negative.
The sum of the vertical distances between the sample points and the orange line is ____________ , and the sum of the squared vertical distances between the sample points and the orange line is _____________ .
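The arithmetic behind these two blanks can be checked with a short sketch. It assumes the sign convention stated above (observed Y minus the Y on the line, so points above the line are positive); the helper name `residuals` is made up for illustration.

```python
# Sample data from the table: year in college (X) and hours per class (Y).
years = [1, 2, 3, 4]
hours = [8, 8, 7, 5]

def residuals(slope, intercept, xs, ys):
    """Signed vertical distance of each point from Y = slope*X + intercept.

    Positive means the point sits above the line, negative means below.
    """
    return [y - (slope * x + intercept) for x, y in zip(xs, ys)]

# Orange line from the question: Y = -2X + 11.
orange = residuals(-2, 11, years, hours)
sum_dist = sum(orange)                # sum of the signed distances
sum_sq = sum(d ** 2 for d in orange)  # sum of the squared distances

print(orange, sum_dist, sum_sq)
```

With these data the distances come out as -1, 1, 2, and 2, so the sum is 4 and the sum of squares is 10.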
On the graph, place the black point (X symbol) to plot the point (M_X, M_Y), where M_X is the mean year for the four students (1, 2, 3, and 4) in the sample and M_Y is the mean hours of course work per class for the four students (8, 8, 7, and 5) in the sample. Then use the green line (triangle symbols) to plot the line that has the same slope as (is parallel to) the line Y = -2X + 11, but with the additional property that the vertical distances between the points and the line sum to 0. To plot the line, drag the green line onto the graph. Move the green triangles to adjust the slope.
The line you just plotted _______________ through the point (M_X, M_Y).
The sum of the squared vertical distances between the sample points and the line that you just plotted is ______ .
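One way to find the parallel line is to note that, for a fixed slope, the signed distances sum to 0 exactly when the line passes through the point of means. The sketch below relies on that fact; the variable names are illustrative and not part of the original exercise.

```python
# Sample data from the table.
years = [1, 2, 3, 4]
hours = [8, 8, 7, 5]

mean_x = sum(years) / len(years)  # mean year: 2.5
mean_y = sum(hours) / len(hours)  # mean hours per class: 7.0

# Keep the slope of Y = -2X + 11 and pick the intercept that sends
# the line through the point of means.
slope = -2
intercept = mean_y - slope * mean_x  # 7 + 5 = 12

green = [y - (slope * x + intercept) for x, y in zip(years, hours)]
green_sum = sum(green)
green_sq = sum(d ** 2 for d in green)

print(intercept, green, green_sum, green_sq)
```

Under these assumptions the parallel line is Y = -2X + 12, its distances are -2, 0, 1, and 1, they sum to 0, and the squared distances sum to 6.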
Which of the following describes the plotted line with the smallest total squared error?
___ The line you plotted that has a sum of the distances equal to 0
___ Neither; the two lines fit the data equally well
___ Y = -2X + 11
Suppose you fit the regression line to the four sample points on the graph. On the basis of your work so far, being as specific as you can be, you know that the total squared error is ________________________ .
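The regression line minimizes the total squared error, so its error cannot exceed that of any other line drawn through the same points, including the two lines considered earlier. A minimal sketch of the fit, using the textbook least-squares formulas rather than any particular statistics library, lets you check the bound numerically:

```python
# Sample data from the table.
years = [1, 2, 3, 4]
hours = [8, 8, 7, 5]

n = len(years)
mean_x = sum(years) / n
mean_y = sum(hours) / n

# Least-squares slope = S_xy / S_xx; the intercept sends the line
# through the point of means.
s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(years, hours))
s_xx = sum((x - mean_x) ** 2 for x in years)
slope = s_xy / s_xx
intercept = mean_y - slope * mean_x

# Total squared error of the fitted line.
sse = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(years, hours))

print(slope, intercept, sse)
```

For these four points the fit is Y = -1X + 9.5 with a total squared error of 1, which is consistent with the bound of at most 6 implied by the line through the means with slope -2.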