We conducted a second regression analysis on the data from the previous exercise. In addition to depression
Question:
a. From this software output, write the regression equation.
b. As the first independent variable, depression at year 1, increases by 1 point, what happens to the predicted score on anxiety at year 3?
c. As the second independent variable, anxiety at year 1, increases by 1 point, what happens to the predicted score on anxiety at year 3?
d. Compare the predictive utility of depression at year 1 using the regression equation in the previous exercise and using the regression equation you just wrote in part (a) of this exercise. In which regression equation is depression at year 1 a better predictor? Given that were using the same sample, is depression at year 1 actually better at predicting anxiety at year 3 in one regression equation versus the other? Why do you think theres a difference?
e. The accompanying table is the correlation matrix for the three variables. As you can see, all three are highly correlated with one another. If we look at the intersection of each pair of variables, the number next to Pearson correlation is the correlation coefficient. For example, the correlation between Anxiety year 1 and Depression year 1 is .549. Which two variables show the strongest correlation? How might this explain the fact that depression at year 1 seems to be a better predictor when its the only independent variable than when anxiety at year 1 also is included? What does this tell us about the importance of including third variables in the regression analyses when possible?
f. Lets say you want to add a fourth independent variable. You have to choose among three possible independent variables: (1) a variable highly correlated with both independent variables and the dependent variable, (2) a variable highly correlated with the dependent variable but not correlated with either independent variable, and (3) a variable not correlated with either of the independent variables or with the dependent variable. Which of the three variables is likely to make the multiple regression equation better? That is, which is likely to increase the proportionate reduction in error? Explain.
Step by Step Answer:
Essentials Of Statistics For The Behavioral Sciences
ISBN: 9781464107771
3rd Edition
Authors: Susan A. Nolan