Periodically, software engineers must provide estimates of their effort in developing new software. In the Journal of
Question:
Periodically, software engineers must provide estimates of their effort in developing new software. In the Journal of Empirical Software Engineering (Vol. 9, 2004), multiple regression was used to predict the accuracy of these effort estimates. The dependent variable, defined as the relative error in estimating effort,
y = (Actual effort – Estimated effort)/(Actual effort)
was determined for each in a sample of n = 49 software development tasks. Eight independent variables were evaluated as potential predictors of relative error using stepwise regression. Each of these was formulated as a dummy variable, as shown in the table.
a. In Step 1 of the stepwise regression, how many different one-variable models are fit to the data?
b. In Step 1, the variable x1 is selected as the “best” one variable predictor. How is this determined?
c. In Step 2 of the stepwise regression, how many different two-variable models (where x1 is one of the variables) are fit to the data?
d. The only two variables selected for entry into the stepwise regression model were x1 and x8. The stepwise regression yielded the following prediction equation:
ŷ = .12 - .28x1 + .27x8
Give a practical interpretation of the β-estimates multiplied by x1 and x8.
e. Why should a researcher be wary of using the model, part d, as the final model for predicting effort (y)?
Step by Step Answer:
Statistics For Engineering And The Sciences
ISBN: 9781498728850
6th Edition
Authors: William M. Mendenhall, Terry L. Sincich