OLS Regression Analysis Interpretation Solutions

advertisement
UNC-Wilmington
Department of Economics and Finance
ECN 377
Dr. Chris Dumas
Regression Analysis Practice--Solutions
1) The results of an OLS regression analysis include: n = 62, k = 8, RSS = 200, ESS = 800, and TSS = 1000.
Calculate Ftest.
Ftest = [RSS/(k-1)] / [ESS/(n-k)] = [200/(8-1)] / [800/(62-8)] = 1.929
What is the hypothesis that is tested using Ftest?
H0 = all B’s are zero
H1 = one or more of the B’s is not equal to zero
What are the values of d.f.numerator and d.f.denominator for this F-test?
d.f. numerator = k - 1 = 7. d.f. denominator = n - k = 54.
For alpha = 5% level of significance, use the F-table to find Fcrit.
d.f. numerator = k - 1 = 7. d.f. denominator = n - k = 54. alpha = 5%.
So, Fcritical = between 2.10 and 2.25 (approximately)
Does the regression(as a whole) explain a statistically significant (at the alpha = 5% level of significance)
percentage of the variation in the dependent variable? Very briefly, why or why not?
Since Ftest < Fcritical, we accept H0.
Thus, the regression (as a whole) does not explain a statistically significant percentage of variation in the
dependent variable at the alpha = 5% level of significance.
Calculate R2.
R2 = RSS / TSS, however, because the regression equation is not statistically significant (based on the Ftest), R2 is not valid and should not be calculated.
Calculate Rbar2.
Rbar2 = 1-(1-R2)((n-1)/(n-k)), but, because the regression equation is not statistically significant (based on
the F-test), R-bar2 is not valid and should not be calculated.
If the regression is/were statistically significant, which should be used for this particular regression, R2 or
Rbar2?
Because the regression equation is not statistically significant (based on the F-test), neither R2 nor R-bar2 is
valid, and neither should be used.
Briefly, what does R2 (or Rbar2) tell us?
R2 (or Rbar2) tells us the percentage of the variation in Y that is explained by the regression model, but
neither should be used in this example, as the regression equation is not statistically significant (based on
the F-test).
1
UNC-Wilmington
Department of Economics and Finance
ECN 377
Dr. Chris Dumas
2) Suppose you do some consulting work for a client who is interested in the potential relationships between the price of ear
buds (Pbuds) and the prices of two of the materials used in ear bud production, the price of resin (Presin) and the price of
copper wire (Pwire). You decide to run the following OLS regression analysis in SAS:
proc reg data=dataset01;
model Pbuds = Presin Pwire;
run;
The results of the analysis are shown below.
The REG Procedure
Model: MODEL1
Dependent Variable: Pbuds
Number of Observations Read
Number of Observations Used
26
26
Analysis of Variance
Source
DF
Sum of
Squares
Mean
Square
Model
Error
Corrected Total
2
23
25
1551.00585
509.95569
2060.96154
775.50293
22.17199
Root MSE
Dependent Mean
Coeff Var
4.70871
14.96154
31.47212
R-Square
Adj R-Sq
F Value
Pr > F
34.98
<.0001
0.7526
0.7310
Parameter Estimates
Variable
Label
Intercept
Presin
Pwire
Intercept
Presin
Pwire
DF
Parameter
Estimate
Standard
Error
t Value
Pr > |t|
1
1
1
7.70949
0.90064
0.03093
1.46982
0.14345
0.05922
5.25
6.28
0.52
<.0001
<.0001
0.6064
2
UNC-Wilmington
Department of Economics and Finance
ECN 377
Dr. Chris Dumas
Referring to the OLS regression analysis results on the previous page, answer the following questions,
assuming α = 0.05:
What was the dependent variable in the regression analysis, and what were the independent variables?
The dependent variables is Pbuds. The independent variables are Presin and Pwire.
What was the sample size (n) used in the analysis?
n = 26
What does the F-Value tell you about the regression model?
The F-Value tells us whether the regression model, as a whole, explains a statistically-significant percentage of the
variation in the Y variable.
What are the values of d.f.numerator and d.f.denominator for use in finding Fcritical from the F-table?
d.f.numerator = k – 1 = 3 – 1 = 2
d.f.denominator = n – k = 26 – 3 = 23
What is the value of Fcritical from the F-table (using α = 0.05)?
Fcritical = 3.49 approximately
Is the F-Value significant? Briefly, how can you tell?
From the regression output, Ftest = 34.98. Since Ftest > Fcritical, the F-Value is significant, which means that we reject
H0 = all β’s are zero, and we accept H1 = one or more of the β’s is not equal to zero.
What do R-Square and Adj R-Sq tell you? For this regression, which should you look at, and (very briefly) why?
R-Square and Adj R-Sq tell us the percentage of the variation in Y that is explained by the regression model. For
this regression, we should look at Adj R-Sq, because the regression model has more than one X variable.
In SAS, the SER is called “Root MSE?” Briefly, what does it tell us?
SER tells us the average distance of a data point from the regression line/curve.
Briefly, what do the numbers in the “Parameter Estimate” column tell us?
The numbers in the “Parameter Estimate” column tell us the values of the 𝛽̂ ’s. The Parameter Estimate for the
“Intercept” is 𝛽̂ 0. The Parameter Estimate for Presin is the 𝛽̂ 1 in the regression equation. The Parameter Estimate
for Pwire is the 𝛽̂ 2 in the regression equation.
Briefly, what do the numbers given in the “t Value” column tell us?
The numbers in the “t Value” column tell us the ttest values for the t-tests of whether each 𝛽̂ parameter is equal to
zero. We can compare these ttest values with tcritical numbers from a t-table to test H0: 𝛽̂ = 0 vs. H1: 𝛽̂ ≠ 0. (Since
this is a two-sided test, use α/2 when retrieving tcritical from the t-table.)
Briefly, what do the numbers in the “Pr > |t| column tell us?
The numbers in the “Pr > |t|” column tell us the p-values for the t-tests of the 𝛽̂ parameters. We can compare these
p-values to the α/2-value in order to test H0: 𝛽̂ = 0 vs. H1: 𝛽̂ ≠ 0.
Is the effect of Presin on Pbuds statistically significant (is the 𝛽̂ for Presin different from zero)? If so, what is the sign
and magnitude of the effect of Presin on Pbuds?
 The effect of Presin on Pbuds is statistically significant at the α/2 significance level, because the t test value for
Presin is 6.28, which is greater than tcritical = 2.069 from the t-table (two-sided, α/2 = 0.025, d.f. = n-k = 26-3 =
23).
 Parameter estimate 𝛽̂ 1 gives the effect of Presin (variable X1) on Pbuds (the Y variable). The value of 𝛽̂ 1 is
0.90064. This means that a one unit increase in Presin (variable X1) results in a 0.90064 unit increase in Pbuds
(the Y variable). We know that the effect of X1 on Y is positive because 𝛽̂ 1 is positive.
Is the effect of Pwire on Pbuds statistically significant (is the 𝛽̂ for Pwire different from zero)? If so, what is the sign
and magnitude of the effect of Pwire on Pbuds?
 The effect of Pwire on Pbuds is not statistically significant at the α/2 significance level, because the ttest value
for Pwire is 0.52, which is less than tcritical = 2.069 from the t-table (two-sided, α/2 = 0.025, d.f. = n-k = 26-3 =
23).
 Because the effect of Pwire on Pbuds is not statistically significant, the effect of Pwire on Pbuds is zero.
3
Download