Question
1.The following table gives data on the Boston Red Sox wins and runs.All of the problems in this Chapter Writeup refer to this data. Construct
1.The following table gives data on the Boston Red Sox wins and runs.All of the problems in this Chapter Writeup refer to this data.
Construct a scatterplot with runs on the horizontal axis and wins on the vertical axis. Do you think there is a linear relationship between the number of runs and the number of wins in a given year? (Space provided on the next page for the graph. Make the graph large, we will use it again later.)
YEAR
GAMES PLAYED
RUNS
WINS
2009
162
872
95
2008
162
845
95
2007
162
867
96
2006
162
820
86
2005
162
910
95
2004
162
949
98
2003
162
961
95
2002
162
859
93
2001
161
772
82
2000
162
792
85
1999
162
836
94
1998
162
876
92
1997
162
851
78
1996
162
928
85
1995*
144
791
86
1994*
115
552
54
1993
162
686
80
1992
162
599
73
1991
162
731
84
1990
162
699
88
*failure inunion/management negotiations, fewer games were played.
Which of the following statements is true?
a.The variables display linear relationship
b.The variables display a nonlinear relationship
c.The variable are not related
RUNS
WINS
872
95
845
95
867
96
820
86
910
95
949
98
961
95
859
93
772
82
792
85
836
94
876
92
851
78
928
85
791
86
552
54
686
80
599
73
731
84
699
88
2.The linear regression line for the data in problem 1 is:
Wins = 24.5+ 0.08 Runs
Use this model to predict the number of wins the Red Socks would have in a season where they had 800 Runs. (Round your answer to one decimal place)
3.What is the slope of the regression line?
4.Interpret the slope?
5.What is the y-intercept of the regression line?
6.Can you interpret the y-intercept? Why or why not? If you can, do so.
7.In 1994, baseball management and the players union failed to come to an agreement and the management initiated a work stoppage. The two sides did not come to an agreement until 1995. As a result less baseball games were played in both the 1994 and 1995 season.The 1994 and 1995 data seem to be outside the pattern of the rest of the data. Determine if these two data points are influential by:
On the scatterplot you produced in problem 1:
a.Plot the regression line from the full data set on the on the scatter plot.The regression equation is:Wins = 24.5 + 0.08Runs, mark it "ALL SEASONS"
b.Plot the regression line from data set without the partial seasons on the on the scatter plot.The regression equation is:Wins = 43.3 + 0.05RUNS, mark it "ONLY FULL SEASON"
Do the partial seasons seem to be influential? Explain.
8.Using the linear regression model for "ALL GAMES" in the Red Socks data,
Wins = 24.5+ 0.08 Runs. Consider the data for the year 2004, (Runs = 949, Wins = 98) Calculate the residual for this year.
9.The coefficient of determination = 67.2% for the Red Socks data. Find the linear correlation coefficient. Round your answer to 2 decimal places.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started