1) Suppose that you have been asked to estimate a... jogging a mile or more on the school track to...

advertisement
1) Suppose that you have been asked to estimate a regression model to explain the number of people
jogging a mile or more on the school track to help decide whether to build a second track to handle all the
joggers at SFU. You collect data by living in a press box for the spring term, and you run two possible
explanatory equations:
A : Yˆ  125.0  15.0 X 1  1.0 X 2  1.5 X 3 , R 2  0.75
B : Yˆ  123.0  14.0 X  5.5 X  3.7 X , R 2  0.73
1
2
4
Where Y = the number of joggers on a given day, X1 = inches of rain that day, X2 = hours of sunshine that
day, X3 = the high temperature for that day (in Celsius), and X4 = the number of classes with term papers
due the next day.
a) Which of the two (admittedly hypothetical) equations do your prefer and why?
b) How is it possible to get different estimate signs for the coefficient of the same variable using the
same data?
2) Researchers have looked into driver fatality rates. One in particular concluded that the variance of
driving speed (the extent to which vehicles sharing the same highway drive at dramatically different
speeds) is important in determining fatality rates. As part of this analysis, the following equation was
estimated with data from US states from two different years.
Year 1: Fˆi  ˆ0  0.176Vi  0.0136Ci  7.75H i
R 2 = 0.624, N = 41
Year 2: Fˆi  ˆ0  0.190Vi  0.0071Ci  5.29 H i
R 2 = 0.532, N = 44
Where: Fi = the fatalities on rural interstate highways (per 100 million vehicle miles traveled)
in the i-th state
Vi = the driving speed variance in the i-th state
Ci = driving citations per driver in the i-th state
Hi = hospitals per square mile (adjusted) in the i-th state
a) Think about the intuition behind each variable and develop expected signs for each coefficient. Do the
reported support your expectations?
b) Should we attach much meaning to the differences between the estimated coefficients from the two
years? Why or why not? Under what circumstances might you be concerned about such differences?
c) The equation for the first year has the higher adjusted R2 but which equation has the higher R2?
Download