Problem Set for ECON 3150/4150 Spring 05

advertisement
Problem Set for ECON 3150/4150 Spring 05
In order to get familiar with the textbook and the problems we should be able to solve in this
introductory econometric course, the first problems for the seminars are taken from the
textbook “Undergraduate Econometrics” by R. C. Hill, W.E. Griffiths and G. G. Judge.
Problem set 1. These problems are taken from chapter 2, and are intended to refresh and
revive your memory of well known facts you have learnt in previous courses in statistics.
Solve the exercises: 2.17, 2.18, 2.22, 2.26
Problem set 2. The problems here are meant to illustrate what type of information we get from
linear regression analysis. In addition they are exercises in using regression programs.
Solve the exercises: 3.9, you will find the data in the file capm.xls, 3.15, 3.16 using the data in
the file br-1.xls.
Problem set 3. The problems here summarize properties of the least square method (OLS). In
particular they remind you that OLS estimators and estimators in general are random
variables.
Solve the exercises: 4.5, 4.9, 4.12, 4.16
Problem set 4. The point with the exercises here is to become familiar with procedures for
testing hypotheses on regression coefficients and constructing confidence intervals.
Solve the exercises: 5.13 using data in the file insur.xls, 5.14 using data in the learn.xls, 5.15,
5.18.
Problem set 5. The step from simple regression to multiple regression is in many ways
immediate. The problems listed below is meant to give you some training in using multiple
regression.
Solve the exercises: 7.9 using the data in meat.xls, 7.10 using the data in clothes.xls.
Problem set 6. Testing simultaneous hypotheses on the regression coefficients - F tests.
Solve the exercises: 8.9 using the file codd.xls, 8.13 using the data in the file cars.xls.
Problem set 7. This is an exercise in demand analysis. You will find the data in the file table
8-3.xls
Demand analysis:
We are concerned with analysing the demand for beer in a sample of households. Our sample
reports observations on 5 variables: the quantity of beer demanded (QB ) , the price of beer
(PB) , the price of other liquor ( PL) , a price index of the remaining goods and services on the
households’ budgets (PR) , and finally the households income (INC ). We are not certain of
what will be the appropriate specification of this demand so we will try different forms. We
start with the ln-ln functional form:
(1) ln(QBi )  0  1 ln( PBi )  2 ln( PLi )  3 ln( PRi )  4 ln( INCi )   i
where  i denotes the random disturbances.
Question A. How will you interpret the regression parameters in this demand function.
Use PcGive and the attached data file table 8-3.xls to estimate this model.
Question B. Do you think the signs of the estimates are reasonable? Substantiate your
assertions. Explain how the numbers in the column P  t are calculated.
Standard consumer demand theory tells us that if prices and income increase by the same
proportion we should expect no change in the quantity demanded. The consumers are said to
have no money illusion.
Question C. Show that the assumption of no money illusion applied to the demand function
(1) implies the restriction:
(2) 1   2  3   4  0
Question D. Use results and information from your regression to test the hypothesis:
H 0 : 1   2  3   4  0
against
with significance level   0.05 .
H a : 1   2  3   4  0
Now we are told that the ln-ln functional form (1) has certain theoretical drawbacks. In
addition to (1) we thus wish to analyse two forms which are linear in the relative price and the
real income.
(3) QBi  1  2 ( PLi PBi )  3 ( PRi / PBi )  4 ( INCi PBi )  ui
(4) QBi  0 (1 PBi )  1  2 ( PLi / PBi )  3 ( PRi / PBi )   4 ( INCi / PBi )  vi
where u i and vi denote random disturbances.
Use PcGive to estimate the regression equations (3) and (4).
Question E. Which of the two equations (3) or (4) do you think is appropriate for testing the
assumption of no money illusion? State the reason for your choice and explain how you
would test this hypothesis.
Question F. Assume that the number of household members ( Ni ) have been wrongly
excluded from the regressions (3) and (4). Choose one of these regressions for further study,
and explain formally how this misspecification effects the OLS estimates of the  ' s when:
(i) N i is uncorrelated with the explanatory variables already used in the equation.
(ii) N i is correlated with the explanatory variables already used in the equation.
Problem set 8. The next exercise illustrates the use of dummy variables in regression analysis.
Problem set 9. Households’ expenditures on food.
Problem set 10. Exercises with dummy variables.
Solve the exercises: 9.6 using the data file tuna.xls, 9.8
Problem set 11. This is an exercise with non-linear models.
We are interested in investigating how households’ expenditures on food vary with their
income . We have observations on the households’ expenditures on food (Y) and incomes
( R ) for 50 Norwegian households, both variables are measured in 1000 kroner. In addition do
we have observations on the number of members in the households.
~
For a given income R do we assume that the expected value of Y , denoted Y , is given by
the function:
(1)
R
~
Y 
R
where  and  denote unknown, positive parameters.
(a) Discuss if, in your opinion, the function (1) gives a good description of the relation
between expenditures on food and income.
Since (1) is non-linear in the parameters, we are unable to estimate the parameters  og 
with ordinary OLS regression. We shall therefore approximate (1), first with a linear and then
with quadratic funksjon. The linear approximation Yi is given by:
(2)
Yi   1 Ri   i
where  1 


and  i denotes random disturbances,
i  1,2, ….. ,50
(b) Show that OLS estimator of  1 is given by:
50
(3)
ˆ1 
Y R
i 1
50
i
i
R
i 1
2
i
Out-print 1 shows the results of the OLS applied to regression (2) together with the
histogram of the residuals ˆi .
(c) Give your comments to this regression, calculate the Jarque-Bera observator when
(skewness) S  0.6332 and kurtosis k  4.5288.
~
In order to improve the approximation of Y , we now approximate (1) by a quadratic
function.. The quadratic approximation of Yi is given by:
(4)
Yi   1 Ri   2 Ri   i
2
where  2  

2
and  i are random disturbances,
i  1,2,......50
Outprint 2 shows the results of OLS applied to the regression (4) together with histogram of
the residuals ˆi . The variable RR in the outprint corresponds to R 2 in the regression
equation (4).
(d) Do you think that the results as they appear in this out-print are reasonable?
Substantiate your answer.
(e) Explain how you will use these results to deduce estimates of  og  and calculate
̂ and ˆ .
(f) Use the expenditure function (1) to derive the Engel elasticity. Use your results above
to calculate this elasticity for a household with income equal to 100 000
We suspect that the parameter  1 in equation (4) depends on the number of member in the
household S , but we are uncertain how to specify this dependency. There are two proposals:
(5)  1   0  1 S
eller
(6)  1   0  1 S  
where  denotes the random disturbances satisfying the usual conditions.
Out-print 3 shows the results of the regression:
i  1,2,.......,50
(7) Yi   0 Ri  1 ( RS )i   2 Ri2  ui
where u i denotes the random disturbances in this regression
(g) Give your comments to this out-print.
(h) Explain your opinion about choosing (5) or (6).
Out-print 1
EQ( 1) Modelling Y by OLS-CS (using data.eksoppg.h2003.xls)
The estimation sample is: 1 to 50
R
sigma
Coefficient
0.233338
6.55912
Std.Error
0.01760
RSS
t-value
13.3
t-prob
0.000
2108.08312
no. of observations
50 no. of parameters
1
mean(Y)
12.5025 var(Y)
37.0663
Density
0.45
r:Y
N(0,1)
0.40
0.35
0.30
0.25
0.20
0.15
0.10
0.05
-3.0
-2.5
-2.0
-1.5
-1.0
-0.5
0.0
0.5
1.0
1.5
2.0
2.5
3.0
3.5
4.0
no. of observations
50 no. of parameters
1
mean(Y)
12.5025 var(Y)
37.0663
Out-print 2
Q( 2) Modelling Y by OLS-CS (using data.eksoppg.h2003.xls)
The estimation sample is: 1 to 50
R
RR
Coefficient
Std.Error
0.371463
0.05604
-0.00213237 0.0008260
t-value
6.63
-2.58
t-prob
0.000
0.013
sigma
6.20997 RSS
1851.05693
no. of observations
50 no. of parameters
2
mean(Y)
12.5025 var(Y)
37.0663
4.5
0.5
Density
r:Y
N(0,1)
0.4
0.3
0.2
0.1
-3.0
-2.5
-2.0
-1.5
-1.0
-0.5
0.0
0.5
1.0
1.5
2.0
2.5
3.0
3.5
4.0
4.5
Out-print 3
Q( 3) Modelling Y by OLS-CS (using data.eksoppg.h2003.xls)
The estimation sample is: 1 to 50
R
RS
RR
Coefficient
0.170144
0.0623471
-0.00238358
Std.Error
t-value
0.06098
2.79
0.01248
5.00
0.0006765 -3.52
sigma
5.07192 RSS
1209.04536
no. of observations
50 no. of parameters
3
mean(Y)
12.5025 var(Y)
37.0663
t-prob
0.008
0.000
0.001
Density
r:Y
0.40
N(0,1)
0.35
0.30
0.25
0.20
0.15
0.10
0.05
-2.5
-2.0
-1.5
-1.0
-0.5
0.0
0.5
1.0
1.5
2.0
2.5
3.0
3.5
4.0
Download