Expand Your Knowledge: Residual Plot The least-squares line usually does not go through all the sample data
Question:
Expand Your Knowledge: Residual Plot The least-squares line usually does not go through all the sample data points x, y . In fact, for a specifi ed x value from a data pair (x, y), there is usually a difference between the predicted value and the y value paired with x. This difference is called the residual.
One way to assess how well a least-squares line serves as a model for the data is a residual plot. To make a residual plot, we put the x values in order on the horizontal axis and plot the corresponding residuals y yˆ in the vertical direction. Because the mean of the residuals is always zero for a least-squares model, we place a horizontal line at zero. The accompanying fi gure shows a residual plot for the data of Guided Exercise 4, in which the relationship between the number of ads run per week and the number of cars sold that week was explored. To make the residual plot, fi rst compute all the residuals.
Remember that x and y are the given data values, and yˆ is computed from the least-squares line yˆ < 6.56 1 1.01x.
Residual Residual x y yˆ y yˆ x y yˆ y yˆ
6 15 12.6 2.4 16 20 22.7 2.7 20 31 26.8 4.2 28 40 34.8 5.2 0 10 6.6 3.4 18 25 24.7 0.3 14 16 20.7 4.7 10 12 16.7 4.7 25 28 31.8 3.8 8 15 14.6 0.4
(a) If the least-squares line provides a reasonable model for the data, the pattern of points in the plot will seem random and unstructured about the horizontal line at 0.
Is this the case for the residual plot?
(b) If a point on the residual plot seems far outside the pattern of other points, it might refl ect an unusual data point (x, y), called an outlier. Such points may have quite an infl uence on the least-squares model. Do there appear to be any outliers in the data for the residual plot?
Step by Step Answer:
Understanding Basic Statistics
ISBN: 9781305548893
7th Edition
Authors: Charles Henry Brase, Corrinne Pellillo Brase