Notation: Bold face = vectors and matrices

advertisement
Lin Ma Olga Kerimova Tan Meng
ECON4160 Seminar 2
11. 02.2009
Notation:
Bold face = vectors and matrices
(......)ʼ is a transposed row vector = column vector. Notation is used for easier
representation.
diag[..] is a matrix with zeroes everywhere except the leading diagonal. The elements
on this diagonal are listed in the bracket.
Exercise 7
1) Which assumptions would you make about the disturbances ut , vt and wt ?
The three equations may be rewritten as follows, such that all three have Ct on the LHS
and the RHSʼs are comparable.
(aʼ) -- multiplying (a) by Pt
(bʼ) -- (b) is unchanged
(cʼ) -- multiplying (c) by Rt
=> utPt = vt = wtRt
Thus, the assumptions on ut are sufficient for assumptions on vt and wt to follow (see
below).
Assumptions:
E(ut|Pt,Rt) = 0; E(vt|Pt,Rt) = 0; E(wt|Pt,Rt) = 0; t = 1, ... ,n
Note: since there is no intercept in (b), these zero conditional mean assumptions will
not be automatically fulfilled.
E(ut,us|Pt,Rt,Ps,Rs) = σ2ωts t,s = 1, ... ,n and σ2 some unknown constant
E(vt,vs|Pt,Rt,Ps,Rs) = E(utPt,usPs|Pt,Rt,Ps,Rs) = PtPsE(ut,us|Pt,Rt,Ps,Rs) = PtPsσ2ωts; t,s =
1, ... ,n
E(wt,ws|Pt,Rt,Ps,Rs) = E(ut(Pt/Rt),us(Ps/Rs)|Pt,Rt,Ps,Rs) = (Pt/Rt)(Ps/Rs)σ2ωts; t,s = 1, ... ,n
Note: ωts,t≠s may well be non-zero (autocorrelation) and ωtt ≠1 (heteroscedasticity)
Assumptions in vector form:
E(u|X) = 0; E(v|X) = 0; E(w|X) = 0
E(uuʼ|X) = σ2 Ω
E(vvʼ|X) = E((uP)(uP)ʼ|X) = PPʼE(uuʼ|X) = PPʼσ2 Ω
E(wwʼ|X) = E(u(P/R)(u(P/R))ʼ|X) = (P/R)(P/R)ʼE(uuʼ|X) = (P/R)(P/R)ʼσ2 Ω
where Ω is a nxn matrix with element (t,s) = ωts; X is a nx2 matrix with column 1 being
the vector of prices (P1 P2 ... Pn)ʼ and column 2 being the vector of Income (R1 R2 ...
Rn)ʼ. u, v and w are nx1 vectors of disturbances such that u = (u1 u2 ... un)ʼ and so on.
P is a vector of prices, i.e. a nx1 vector contained in column 1 of X. Similarly, R is a nx1
vector of income contained in column 2 of X. P/R is a nx1 vector (P1/R1 P2/R2 ...Pn/Rn)ʼ
such that Pt/Rt is row t of this vector. (P/R)(P/R)ʼ is the product of a column vector and
a row vector, yielding an nxn matrix of which (Pt/Rt)(Ps/Rs) is one element (in row t and
column s). Note: (uP) is not the product of vector u and vector P (that multiplication
would be impossible). uPnx1 = (u1P1 u2P2 ... unPn)ʼ. Hence, uP(uP)ʼ is an nxn vector.
Lin Ma Olga Kerimova Tan Meng
ECON4160 Seminar 2
11. 02.2009
2) Explain how you would estimate the marginal propensity to consume, β, (i) by OLS
and (ii) by GLS, in the three cases. What can you say about the properties of
estimators?
OLS
Estimate all three equation by standard OLS - minimise sum of squared residuals. The
estimates of β will differ as the residuals are assumed to be different and hence the
estimated error terms differ. The estimators will be unbiased, consistent but inefficient
(since the disturbances are autocorrelated, heteroscedastic or both). The OLS
estimators are asymptotically normally distributed. Only if Ω = In (identity matrix), u will
be well behaved (no autocorrelation and no heteroscedasticity) and, therefore, the
estimator of β in (a) will be efficient as well as unbiased and consistent. The other 2
estimators (in (b) and (c)), however, will remain inefficient (due to heteroscedasticity) as
for example E(vt,vs|Pt,Rt,Ps,Rs)=Pt2σ2 for t=s and 0 for t≠s. Similarly for w.
GLS (Generalised Least Squares)
First assume that there may be both Autocorrelation and Heteroscedasticity, i.e. a
general analysis follows.
In a case where E(uuʼ|X) = σ2 Ω and supposing that Ω is known, we can transform the
model and estimate by GLS. If σ2Ω is positive definite, then a non-singular matrix T
with element (t,s) = tts will exist such that TΩTʼ = In or equivalently Ω-1 = TʼT. Thus, we
need to find the inverse of Ω and factorise it, which is possible when Ω is positive
definite.
Rewrite (a), (b) and (c) in matrix form for convenience:
(a) y = Zβ + u where ynx1 = (C1/P1 C2/P2 ... Cn/Pn)ʼ and Z is nx2 with column 1
containing 1ʼs (for the intercept) and column 2 being an nx1 vector (R1/P1 R2/P2 ....
Rn/Pn)ʼ. β is a 2x1 vector (α β)ʼ. u is an nx1 vector of disturbances (u1 u2 ... un)ʼ.
(b) y = Zβ + v where ynx1 = (C1 C2 ... Cn)ʼ and Z is nx2 with column 1 being an nx1
vector (P1 P2 ....Pn)ʼ and column 2 being an nx1 vector (R1 R2 .... Rn)ʼ. β is a 2x1
vector (α β)ʼ. v is an nx1 vector of disturbances (v1 v2 ... vn)ʼ.
(c) y = Zβ + w where ynx1 = (C1/R1 C2/R2 ... Cn/Rn)ʼ and Z is nx2 with column 1 being an
nx1 vector (P1/R1 P2/R2 .... Pn/Rn)ʼ and column 2 containing 1ʼs (for the intercept) . β
is a 2x1 vector (α β)ʼ. w is an nx1 vector of disturbances (w1 w2 ... wn)ʼ.
Now transform the models by premultiplying by a transformation matrix T.
(a) becomes: Ty = TZβ + Tu (equivalently y* = Z*β + u*)
Then E(u*u*ʼ|Z) = E(TuuʼTʼ|Z) = Tσ2 ΩTʼ = σ2 TΩTʼ = σ2In
Thus, β can now be estimated by OLS on the transformed model.
(b) becomes: Ty = TZβ + Tv (equivalently y* = Z*β + v*)
Here we need to find T such that TPPʼΩTʼ = In
Then E(v*v*ʼ|Z) = E(TvvʼTʼ|Z) = TPPʼσ2 ΩTʼ = σ2 TPPʼΩTʼ = σ2In
Thus, β can now be estimated by OLS on the transformed model.
(c) becomes: Ty = TZβ + Tw (equivalently y* = Z*β + w*)
Here we need to find T such that T(P/R)(P/R)ʼΩTʼ = In
Then E(w*w*ʼ|Z) = E(TwwʼTʼ|Z) = T(P/R)(P/R)ʼσ2 ΩTʼ = σ2T(P/R)(P/R)ʼΩTʼ = σ2In
Thus, β can now be estimated by OLS on the transformed model.
The estimators are minimum variance linear unbiased estimators (MVLUE). The
estimators are consistent provided that plim[(1/n)Z*ʼZ*] = Q* where Q* is a finite
positive definite matrix. The estimators are also asymptotically normally distributed.
Lin Ma Olga Kerimova Tan Meng
ECON4160 Seminar 2
11. 02.2009
Two specific cases:
Only Heteroscedasticity - weighted least squares (equation (a))
ω11
ω11
ω22
Ω
ω22
ωnn
ωnn
Thus the transformed model y* = Z*β + u* has the following elements: yt* = yt/√ωtt; zkt*
= zkt/√ωtt; ut* = ut/√ωtt.
In case (b) Tnxn = diag[(1/√ωtt)Pt] for t=1, .... ,n.
In case (c) Tnxn = diag[(1/√ωtt)Rt/Pt] for t=1, .... ,n.
Only Autocorrelation - AR(1)
Ω
Then y1* = (√1-ρ2)y1; zk1* = (√1-ρ2) zk1
And for t = 2, .... ,n:
yt* = yt - ρyt-1 ; zkt* = zkt - ρzk,t-1
3) In the consumption functions above we have assumed that the consumers do
have money illusion, since a proportional change in Pt and Rt is assumed to leave
real value of consumption, Ct /Pt , unchanged. Could you, by modifying
specifications (a), (b) or (c) in suitable ways, use them as a starting point
investigating econometrically whether the consumers have money illusion?
not
the
the
for
One possibility is to include the price index Pt in specification in (a), which, as it stands,
is in real terms only. Thus,
Ct/Pt = α + βRt/Pt + δPt + ut
To test for money illusion, test δ=0 vs. alternative δ≠0 by means of a simple t-test. If
there is no money illusion, real consumption depends on real income only and does not
respond to proportional changes in prices and income). If there is money illusion (δ≠0),
a proportionate change in prices and income will change real consumption, indicating
that consumers respond to nominal changes (in the price level in this case) even if in
real terms nothing had changed.
An even clearer way to illustrate money illusion, perhaps, is to include nominal income
in (a):
Ct/Pt = α + βRt/Pt + φRt + ut
Lin Ma Olga Kerimova Tan Meng
ECON4160 Seminar 2
11. 02.2009
Here, one would test the hypothesis φ=0 (no money illusion). As before, if there is no
money illusion, a proportionate change in prices and income will leave real
consumption unchanged, i.e. consumers think in real terms rather than in nominal
terms. However, if the hypothesis is rejected and money illusion is present, a
proportionate change in prices and nominal income will in fact change real
consumption. If nominal income increases proportionately to an increase in prices,
consumers not suffering from money illusion will realise that in real terms their income
is the same and not change their consumption. However, consumers under money
illusion, will respond to an increase in their nominal income by consuming more (real
consumption increases). Real consumption will vary with nominal income.
Exercise 9
(a) Describe one of more methods which you would find useful in investigating whether
the disturbances u or v or w in models (a)–(c) in Exercise 7 exhibit
heteroscedasticity.
1. Plot graph
Plot êt² against independent variables. We can observe whether there exits an
increasing or decreasing pattern.
(a) êt² = [C t /P t – (C t /P t)ols]2
(b) êt² = [C t – (C t)ols]2
(c) êt² = [C t /R t – (C t /R t)ols]2
Homoscedasticity
Heteroscedasticity
2. Whiteʼs test
H0: σt²=σ2 for all t (no heteroscedasticity)
H1: Not H0.
A simple operational of Whiteʼs test is carried out by obtaining nR² in the regression of
êt² on a constant and all unique variables contained in x and all the squares and cross
products of the variables in x.The estimated models for the three cases, where a, b, c,
d, e, and f are coefficient estimates, are:
Lin Ma Olga Kerimova Tan Meng
a) êt²= a + b(Rt/Pt) + c(Rt /Pt)²
b) êt² = aPt + bPt² + cPtR t + dRt² + f
c) êt²= a + b(Pt /Rt) + c(Pt /Rt)²
ECON4160 Seminar 2
11. 02.2009
êt² = [C t /P t – (C t /P t)ols]2
êt² = [C t – (C t)ols]2
êt² = [C t /R t – (C t /R t)ols]2
Find the nR² (where n the number of observations) of each regression. The statistic is
asymptotically distributed as Chi-squared with P-1 degrees of freedom, where P is the
number of regressors in the equation, including the constant.
(a) 3-1 = 2 d.o.f
(b) 5-1 = 4 d.o.f
(c) 3-1 = 2 d.o.f.
Chi-squared with 5% : critical value = 5.99
Chi-squared with 5% : critical value = 9.49
Chi-squared with 5% : critical value = 5.99
Compare nR² with the relevant critical value. If nR² ≥ critical value reject hypothesis of
homoscedasticity. If nR²< critical value, one canʼt reject hypothesis of homoscedasticity.
3. Breusch-Pagan Test
This test requires a specific alternative hypothesis
H0: σt2 = σ2 f(α0 + α′zt ), where zt is a vector of independent variables. Model is
homoscedastic if α = 0.
We specify the test based on:
(a) zt=[1, Rt /Pt]
(b) zt=[1, Pt, Rt]
(c) zt=[1, Pt /Rt]
By using the least squares residuals, get gt = et²/(eʼe/n)-1 and then find
LM=½[gʼZ(ZʼZ)ˉ¹Zʼg], and compare the value with the critical value for the chi-squared
distribution with:
(a) 1 d.o.f.
Chi-square with 5% : critical value = 3.84
(b) 2 d.o.f.
Chi-square with 5% : critical value = 5.99
(c) 1 d.o.f.
Chi-square with 5% : critical value = 3.84
Z is an n×P matrix of observations on (1, zt ).
Conclude whether the null hypothesis of homoscedasticity is rejected or not.
Note: the Koenker and Bassettʼs method could be much more powerful with:
LM=[1/V](u - ūt)ʼZ(ZʼZ)ˉ¹Zʼ(u - ūt) where u=(e12, e22, . . . , en2) and t is an n×1 column of
1s and ū = eʼe/n.
Lin Ma Olga Kerimova Tan Meng
ECON4160 Seminar 2
11. 02.2009
9 (b) Describe one or more methods that you would find useful in order to investigate
whether the disturbance of the model you have chosen in Exercise 7, exhibits
autocorrelation.
1. Plot êt
We can plot êt which could represent u t, v t, or wt. For example:
positive autocorrelation
negative autocorrelation
2. Durbin-Watson test
Let μ t =ρμ t -1+ε t
-1<ρ<1
(μ is the random disturbance, ut, vt or wt in the case)
H0: ρ=0
H1: H0 is not true
Find the D.W. value. Find the dL and dL with D.W. distribution according to the degrees
of freedom, n, and the number of regressors excluding the intercept (check the table
Greene, p. 1098).
If 0<D.W.<dL
If dL<D.W.<dU
If dU <D.W.<4−dU
If 4−dU <D.W.<4 - dL If 4−dL <D.W.<4
positive autocorrelation
canʼt decide
no autocorrelation
canʼt decide
negative autocorrelation
Lin Ma Olga Kerimova Tan Meng
ECON4160 Seminar 2
11. 02.2009
Note: The Durbin–Watson test is not likely to be valid when there is a lagged
dependent variable in the model. The D.W. statistic will usually be biased towards no
autocorrelation. However, there are other tests, such as the LM and Q tests as well as
Durbinʼs h test, that can be used whether or not the regression contains a lagged
dependent variable.
Exercise 10
Explain the differences between the Generalized Least Squares (GLS) and the
Feasible Generalized Least Squares (FGLS) methods. Explain how you would proceed
to apply the latter method in a specific situation. What can you say about the properties
of the FGLS method?
The FGLS method is just a “little modification” on the GLS method. To obtain the FGLS
estimators is to replace the covariance matrix in the GLS method by an estimated
covariance matrix ΩFGLS. Generally the covariance matrix contains unknown
parameters, so it is impossible to use GLS. Instead, we use FGLS. However, the FGLS
method hinges on knowing the form of heteroscedasticity and/or autocorrelation, in
which case it is possible to find a consistent estimator of Ω.
Specific situation
Assume the disturbances are heteroscedastic but non-autocorrelated. Also the
disturbances satisfy the assumption that it is uncorrelated with explanatory variables.
Then we do the following steps to apply the FGLS:
Step1: calculate the ordinary least squares (OLS) estimators as usual. Then use the
OLS
estimators to compute the error terms, ei, as estimators of the residuals, ui.
Step2: Use ei to construct OLS covariance matrix Ω-hat = diag[e12 e22 ... en2]. In this
specific case, Ω-hat is the estimated covariance matrix ΩFGLS in FGLS method.
Step3: Estimate the FGLS estimators by using ΩFGLS to replace the true covariance
matrix, Ω, of the disturbances. Then we get the FGLS estimator:
The above steps can be iterated to get the estimation of Ω to convergence.
Properties of the FGLS estimators
The GLS estimator is the minimum variance linear unbiased estimator in the
generalized regression model. The FGLS estimator has the same large-sample
property as its infeasible counterpart GLS estimator. This result does not depend on
using an efficient estimator of Ω, but simply any consistent estimator of Ω. However,
care has to be taken that the asymptotic efficiency of FGLS estimators may not carry
over to small samples because of the variability introduced by the estimated Ω. So with
small sample, it might not be a good choice to use FGLS.
Download