Economics 4818 - Introduction to Econometrics Midterm 1 - Answers 1. Discuss the e¤ect of the following phenomena on the unbiasedness and the variance of the OLS estimators, cj : 1. heteroskedasticity Answer: Heteroskedasticity means that the error variance is not a constant and so Assumption 5 of the Gauss-Markov assumptions is violated, that is V ar (u) 6= 2 . (3 points) E¤ ect on the unbiasedness of cj : Heteroskedasticity does not a¤ect the unbiasedness of the OLS estimators, cj . (3 points) E¤ ect on the variance of cj : Under heteroskedasticity, the usual formula 2 for the variance of OLS estimators, V ar cj = SST 1 R2 , is no longer valid. ( j j) 2. perfect collinearity Answer: Perfect collinearity means that some independant variable, xj , is a perfect linear combination of other independent variables. (3 points) E¤ ect on the unbiasedness of cj : Perfect collinearity violates Assumption 3 for the unbiasedness of OLS estimators, cj . In fact, under perfect collinearity, the OLS estimators, cj , practically cannot be computed. That’s why we have to drop one of the variables in the perfect linear combination. (3 points) E¤ ect on the variance of cj : If the independant variable, xj , is a perfect linear combination of other independent variables, then Rj2 = 1 and the variance of 2 c, which is given by V ar c = , is not de…ined (or it is in…nitely large). j j SSTj (1 Rj2 ) 3. multicollinearity Answer: Multicollinearity means that some independant variables are higly correlated with each other. (3 points) E¤ ect on the unbiasedness of cj : Multicollinearity does not a¤ect the unbiasedness of the OLS estimators, cj . (3 points) E¤ ect on the variance of cj : If the independant variable, xj , is highly correlated with other independent variables, then Rj2 is "close to 1". Thus, the variance 2 of cj , given by V ar cj = SST 1 R2 , is large and this decreases the precision of j( j) c. j 4. including an irrelevant variable Answer: 1 (3 points) E¤ ect on the unbiasedness of cj : Including an irelevant variable does not a¤ect the unbiasedness of the OLS estimators, cj . (3 points) E¤ ect on the variance of cj : Including an irelevant variable increases the variances of the OLS estimators of the relevant independent variables. In particular, 2 Rj2 increases and thus, the variance of cj , given by V ar cj = SST 1 R2 , increases. ( j j) 5. omitting a relevant variable Answer: (3 points) E¤ ect on the unbiasedness of cj : If the omitted relevant variable is correlated with some of the other independent variables in the model, omitting the relevant variable biases the OLS estimators cj . In particular, the omitted relevant variable will be included in the error term u, and will violate the zero conditional mean assumption (Assumption 4). If the omitted relevant variable is not correlated with any of the other independent variables in the model, omitting the relevant variable will not bias the OLS estimators cj . (3 points) E¤ ect on the variance of cj : Omitting a relevant variable actually decreases the variances of the OLS estimators of the remaining independent variables. In 2 particular, Rj2 decreases and thus, the variance of cj , given by V ar cj = SST 1 R2 , j( j) decreases. 2. A) What does R2 measure? Answer: (5 points) R2 measures how well the OLS regression line …ts the data in the sample. More speci…cally, it measures what portion of the variation in y is explained by the variation in the independent variables x1 ; x2 ; ::::; xk . Mathematically, R2 = SSE SST . B) Is a regression useless if its R2 is low? Explain. Answer: (5 points) Not necessarily. However, it means that the independent variables in the regression, x1 ; x2 ; ::::; xk , are not very important factors of y. C) If R2 = 1, how close is the OLS regression line to the actual data points? Answer: (5 points) All data points are on the OLS regression line. 3. Suppose you have a simple model explaining monthly salary in dollars (wage) in terms of IQ score (IQ): wage = 0 + The OLS regression line for this model is: 2 1 IQ +u wage [ = 116:9 + 8:30IQ n = 936; R2 = :096 A) Interpret the OLS estimates in this regression. Answer: (5 points) The interpretation of the intercept estimate is not very meaningfull: an individual with IQ = 0 has a predicted wage of $116:9. The interpretation of the slope estmate is that, other things equal, an increase in IQ of 1 point is predicted to increase an individual’s wage by $8:30. B) What kinds of factors are contained in the eror term u? List at least two factors that are included in u and are correlated with IQ. Answer: (5 points) Education and innate ability, for example, are included in u and are correlated with IQ. Thus the zero conditional mean assumption is violated. C) Does this simple regression model uncover the ceteris paribus e¤ect of IQ on wage? Explain. Answer: (5 points) No, it doesn’t. The OLS estimates c0 and c1 are biased since the zero conditional mean assumption is violated. D) Does IQ explain most of the variation in wage in the above regression? Answer: (5 points) No, IQ explains only 9:6% of the variation in wage. Thus, there are other important factors of wage, which explain more of its variation. E) Suppose you have estimated an alternative speci…cation of the wage ship: IQ relation- log\ (wage) = 5:89 + :0088IQ n = 936; R2 = :099 Interpret the slope estimate in this model? Answer: (5 points) Other things equal, an increase in IQ of 1 point is predicted to increase an individual’s wage by 100 (:0088) = 0:8%. F) Which regression model is preferred - the one with dependent variable wage or the one with dependent variable log (wage)? Explain. Answer: (5 points) The second model is preferred because it explains more of the variation in wage. 3 4. Suppose you have the model price = 0 + 1 sqrf t + 2 bdrms +u where price is the house price measured in thousands of dollars, sqrf t is the size of the house in square feet and bdrms is the number of bedrooms in the house. The OLS regression line for this model is: [ = price 19:32 + :128sqrf t + 15:20bdrms n = 88; R2 = :632 A) Interpret the estimated coe¢ cient on sqrf t. Answer: (5 points) Holding bdrms …xed, a 1 square foot increase in the square footage of a house is predicted to increase the price of the house by $128. B) What s the predicted increase in price for a house having one more bedroom, holding square footage constant? Answer: (5 points) $15; 200. C) Are the variances of the slope estimators c1 and c2 likely to be large or small? Explain. Answer: (5 points) The slope estimators c1 and c2 are likely to have large variances because sqrf t and bdrms are highly correlated. In particular, the terms R12 and R22 are 2 2 large and so V ar c1 = SST 1 R2 and V ar c1 = SST 1 R2 are large. ) ( 1( 2 1 2) D) If you run the regression ] = f0 + f1 sqrf t; price what would be the likely sign and magnitude of f1 relative to c1 = :128? Explain. Answer: (5 points) f1 is biased since the relevant variable bdrms is omitted from the second regression model. The bias is given by the formula Bias f1 = 1 e1 . Here, 1 has been estimated to be positive, c1 = :128, and sqrf t and bdrms are positively correlated, e1 > 0. So, Bias f > 0. Thus, f is likely to have a positive sign and to be larger in 1 1 magnitude than c1 = :128. 4