Dynamic equations and their estimation (long version containing optional material, from slide 18). Ragnar Nymoen Department of Economics, UiO 29 March 2009 (updated from 23 March) ECON 4610: Lecture 10 Overview In economics, the typical case is that there are dynamic e¤ects of a change in an explanatory variable on the dependent variable. This means that the models that we have mainly considered so far, which are static models, are of limited relevance for the modelling of time series data. However, it possible to extend the regression model, and the simultaneous equations model as well, so that they become relevant for time series data. Dynamic modelling is however a big …eld within econometrics, and we are only give a partial introduction here. Syllabus: B: Lecture note DL Greene: 20.1, 20.2.1, 20.3 K: 9.4 ECON 4610: Lecture 10 Notation for dynamic equations We can re-interpret the linear regression model that we started out with in lecture 1. Using Greene’s notation: yt D 1 C 2 x2t C ... C k xKt C "t , t D 1, 2, ..., T . In the case of a single explanatory variable, we can set x2t D xt , x3t D xt 1 , x4t D xt 2 and so on, hence: yt D 1C K X2 i D0 2 Ci xt i C "t , t D 1, 2, ..., T . (1) However, this is cumbersome, so in Ch 20, Greene changes notation to: p X yt D C (2) i xt i C " t , t D 1, 2, ..., T . i D0 where it is understood that corresponds to 1 in (1), and that there is a natural re-numbering of the slope coe¢ cients. p denotes the lag-length of the model. ECON 4610: Lecture 10 Lag and di¤erence operator The lag operator L is de…ned as Lxt D xt 1 and the following rules apply: L2 xt q p .L C L /xt 0 L xt D L.Lxt / D xt D xt q C xt 2, p, 0 and Lq Lp xt D Lq Cp xt D xt D xt , and L a D a if a is a constant. The di¤erence operator is de…ned as 1q xt D xt xt q D .1 Lq /xt with the …rst di¤erence as an important special case: 1xt D xt xt 1 D .1 L/xt ECON 4610: Lecture 10 q p Polynomial in the lag operator Using the lag operator, (2) can be written as yt D or yt D p X C. i i L /xt i D0 C "t , t D 1, 2, ..., T . C B.L/xt C "t , t D 1, 2, ..., T . where B.L/ is a polynomial in L: B.L/ D 0 0L C 1 1L C 2 2 L C ... C p pL D ECON 4610: Lecture 10 p X i D0 i iL The distributed lag model and its multipliers Model (2), or the same equation with lag-operator notation, is called the distributed lag model, which we refer to as DL model from now on, because the e¤ects of a change in xt has an e¤ect on y which is distributed over several periods. Let j denote the marginal e¤ect that a change in x has on y after j periods. The simplest example is when the change in x occurs in period t and lasts for just one period, and then x returns to the old level in period t C 1, then: @yt D 0 0 D @xt @yt C1 D 1 1 D @xt @yt Cp D p p D @xt @yt Cp C1 j D D 0 for j > p. @xt ECON 4610: Lecture 10 A permanent change in x Assume that x changes in period t and stays permanently at that new level, then 0 1 p j @yt D 0 @xt @yt C1 @yt C1 D C D @xt C1 @xt p X D i D D i D0 p 0 C 1 for j > p. In both examples, 0 is called the impact multiplier, and j , j > p is called the equilibrium multiplier or the long-run multiplier. The intermediate e¤ects are called the dynamic multiplies in the case of a temporary change, and the cumulated multipliers in the case of a permanent change. ECON 4610: Lecture 10 Estimation of the DL-model In principle there are no new issues: If xt is uncorrelated with all the disturbances, then OLS estimates of the DL-model are unbiased. OLS may not be fully e¢ cient, and standard errors may need to be corrected for e.g. heteroscedasticity. If xt is uncorrelated with "t , "t C1 , ..., the DL-model includes only pre-determined explanatory variables, and OLS is consistent, and …nite sample biases that are small. If xt is correlated with "t then we need to use IV to obtain consistent estimators. The problems associated with the estimation of DL model are more practical and about relevance: Why should there be an exact “cut o¤” point in the lag distribution, and how do we know p? The number of parameters may become large, combined with multicollinearity, the estimates may become imprecise. ECON 4610: Lecture 10 Dealing with multicollinearity Consider yt D C 0 xt C 1 xt 1 C "t , t D 1, 2, ..., T . where xt and xt 1 are often quite highly correlated. However, we can transform the equation in a way that does not a¤ect the disturbance "t , and therefore does not a¤ect the statistical properties of the model: yt D C D C 0 1xt 0 1xt C. C 0 C 1 xt 1 1 /xt 1 C "t C "t In this equation, the regressors have a low correlation, meaning that if we concentrate on the estimation of the impact multiplier 0 , and the long-run multiplier 1 collinearity is not a practical problem. This generalizes. ECON 4610: Lecture 10 Speci…c distributed lags In order to deal with the problem of unknown lag order p, and that the number of variables may become large, it has been proposed to specify the distributed lag more closely. Two models are much referred to: The polynomial lag distribution, or “Almon lag” after one of its inventors. The geometric lag distribution. We brie‡y explain these, and then mention some arguments for taking a slightly more general starting point for dynamic modelling than the DL model. ECON 4610: Lecture 10 Almon lag It is often reasonable to assume that the dynamic multipliers …rst increase, and reach a peak before they decline. A model which captures that idea is: j D 0 C with 1 > 0, 2 < 0 and equation gives: yt D . C 0 0 xt Cp 1j 2j C > 1 C. 0C 1C 2 1 Cp 2 /xt p 2 2. , j D 0, 1, ..., p Substitution into the original 2 /xt 1 C "t C. 0 C2 1 C4 Collecting terms gives yt z0t D D z1t D z2t D C Xp 0 z0t 1 z1t C x , j D0 t j Xp j D0 Xp j D0 C 2 z2t C "t jxt j , j 2 xt j . ECON 4610: Lecture 10 2 /xt 2 C ... Almon lag, restrictions The number of parameters is reduced by p quadratic form. 2 in the case of We can think of this as p 2 restrictions on the unrestricted DL model with p lags: The coe¢ cients of xt , xt 1 and xt 2 are not restricted by the assumed quadric polynomial, but the coe¢ cients of xt 3 and all longer lags are restricted, eg. 3 D 0 C3 1 C9 2 () 3 3 2 C3 1 0 D0 which is a linear restriction on the unrestricted DL-model with p D 3. So the validity of the quadratic Almon lag speci…cation can be tested by …rst estimating an unrestricted model with p set to a high number, and then testing the p 2 restriction by a F -test. Can generalize to cubic form for example: The number of restriction is p 3. And p q in general (q is the order of polynomial). ECON 4610: Lecture 10 Almon lag, comments In principle, the Almon lag model is attractive, but in practice multicollinearity is often a big problem, cf the expressions for z0t and z1t . Hence, the variances of bj tend to be large Another problem, is that it is di¢ cult in practice to obtain Almon lag models that “survive” mis-speci…cation tests: For example: Tests of residual autocorrelation often reject the null of no autocorrelation. The source of this problem may be the DL model itself: since the Almon lag model is only a restricted version of that, and suggest that a more general starting point for the speci…cation of a dynamic model is needed. ECON 4610: Lecture 10 Geometric lag distribution and the Koyck-transformation If there are economic reasons to believe that the dynamic multipliers are declining then a simple model is j D j 0, 0< < 1, j D 1, 2, .... where it is understood that p is set p to in…nity. Hence (2) becomes: 1 X i yt D C 0 xt i C "t (3) i D0 The Koyck-transformation: lag (3) one period and multiply through by : yt 1 D C 0 1 X i xt i 1 i D0 C "t 1 and subtract from (3) yt D .1 /C 0 xt C yt 1 C ECON 4610: Lecture 10 t (4) OLS estimators and Koyck transformation The disturbance in (4) is autocorrelated: t D "t "t 1 . The combination of a lagged regressor (yt 1 ) and autocorrelation means that OLS yields inconsistent estimators. Recall that in a model with no exogenous regressors the estimator will be biased if X plim yt 1 disturbance 6D 0 t which in this case becomes X plim yt 1 "t t plim X yt 1 "t 1 t the …rst term is zero (cf …rst lecture), but the second is not, so inconsistency follows. ECON 4610: Lecture 10 A more general approach: ADL model The autocorrelation above was induced by the Koyck transformation. It does not imply that all models with lagged regressors have autocorrelated residuals. In modern dynamic econometric modelling the DL model is no longer seen as the right framework, instead the stochastic di¤erence equation p q X X yt D C x C i yt i 1 C " t , t D 1, 2, ..., T . (5) i t i i D0 i D0 has taken over this role. (5) is called the autoregressive distributed lag model, or ADL. Classical disturbance properties are usually assumed with reference to the principle that the lag orders p and q can be chosen so that the model explains all the systematic variation in yt . Hence residual autocorrelation is typically seen as a sign that the model is not general enough, and the advise is to re-specify the model to install classical disturbance properties ECON 4610: Lecture 10 Attributes of the ADL model Dynamic multipliersP and the long run multiplier are well q de…ned (as long as i D0 i < 1). Available from PcGive after estimation by OLS The rationale for OLS is that yt 1 and higher order lags are pre-determined variables as long at there is no residual autocorrelation. Hence, with reference to lecture 1, the estimators Pq are consistent, and small sample bias is not large unless i D0 i is very close to unity. De…nes a large typology of di¤erent models, all of them are of interest in economics. ECON 4610: Lecture 10 A model typology Several single equation models are encompassed by the ADL(1,1)-model: yt D C 1 yt 1 C 0 xt C 1 xt 1 C "t (6) where "t white-noise., E[xt " t Cj ] D E[xt 1 "t 1Cj ] D 0, j D 0, 1, 2, ..... ADL equations can be part of a simultaneous equations system, E[xt " t ] 6D 0 in that case, but herer we consider the case of predetermined xt , and OLS yields consistent estimates. A selection of models is Static model Model with di¤erenced data Common-factor model Equilibrium correction(EqCM) ECON 4610: Lecture 10 Static model If 1 D 1 D 0 (6) reduces to yt D C 0 xt C "t . White-noise "t requires that the autocorrelation of yt is matched exactly by xt , thus 1 D 1 D 0. Unusual in practice. Instead, a regression yt D KO 0 C KO 1 xt C uO t (7) usually produces a residual uO t which is autocorrelated because of the omitted variables (xt 1 and yt 1 /. The static model is nevertheless important in econometrics, because of interpretation as long-run equilibrium relationship. ECON 4610: Lecture 10 Di¤erenced data 1 D 1 and 1 D 0, yields: 1yt D C 0 1xt C "t . (8) In this case neither the expectation nor the variance of yt exists, so yt is not covariance-stationary. In addition yt contains a deterministic trend if 6D 0. Below, we shall see that (8) represents a situation where yt and xt are integrated of degree 1, but not cointegrated. ECON 4610: Lecture 10 Common-factor model ADL(1,1) with lag-operator notation .1 1 L/yt D C. D C .1 0 C 1 L/xt C "t . (9) Writing the equation as .1 1 L/yt 1 L/ 0 xt C "t shows that the COMFAC restriction . 0 C 1 L/ D .1 1 L/ 0 , or alternatively, 1 D 1 0, must hold if the order of the dynamics is to be reduced by 1. ECON 4610: Lecture 10 (10) Imposing the restriction yields yt D /.1 1 L/ C 0 xt C "t /.1 1 L/ or yt D C 0 xt C ut , ut D 1 ut 1 C "t (11) Note that ut is AR(1) and that the long run multiplier is equal to the short run multiplier 0 yt D .1 1/ C 0 xt 0 1 xt 1 C 1 yt 1 C "t , (12) so @yt /@xt D 0 and, for yt D y and xt D x, @y /@x D 0 . Greater ‡exibility is obtained by starting from a general ADL(p,q). By imposing m COMFAC restrictions, the order of the dynamics can be reduced to p m and q m. ECON 4610: Lecture 10 Equilibrium correction (EqCM) EqCM is a 1 ADL(1,1). yt yt 1 1 transformation of the ADL-model. Consider a C. D C. D 1 1/yt 1 1 1/yt C 1 C 0 xt 1 xt 1 C 0 .xt C "t (13) xt 1/ C . 0 C 1 /xt 1 C 1 /xt 1 C "t , i.e., 1yt D C 0 1xt C. 1 1/yt 1 C. 0 C "t . To establish the equilibrium relationship, we start from the ADL-model .1 .L//yt D 0 C .L/xt C " t where .L/ D yt 1L and .L/ D 1 .L/ 1 0 C 1L (14) (15) in a ADL(1,1). .L/ "t xt D .L/ 1 .L/ ECON 4610: Lecture 10 (16) To establish the long-run conditional expectation, set L D 1 and take the expectation: yt .1/ 1 1 E[yt K0 .1/ "t xt D .1/ 1 .1/ K1 xt ] D 0 (17) (18) where K0 D K1 D 1 1 .1/ .1/ D .1/ D . 0 1 (19) 1 1 C 1/ 1 ECON 4610: Lecture 10 (20) (18) implies that the conditional long-run mean, y can be de…ned as y D K0 C K1 x. (21) Use this notation to write eq. (14) as 1yt 1yt D 0 1xt D C C. 0 1xt 1 C. 1/.yt 1 1 1/.y yt 1 / C "t K1 x/t (22) 1 C "t , (23) which makes the equilibrium correction mechanism explicit. The static equilibrium is restrictive, since many series have non-zero growth rates. The dynamic equilibrium is de…ned in such a way that y D KQ 0 C K 1 x (24) holds in a situation where 1yt D gy and 1xt D gx . In equilibrium, gy D K1 gx from (24). ECON 4610: Lecture 10 (25) Inserting in (23) gives K 1 gx D 0 C 0 gx C. 1 1/.y K 1 x/ (26) and consequently yD 0 C. .1 0 K1 /gx C K x D KQ 0 C K1 x, 1/ Hence KQ 0 in (24) is KQ 0 D 0 C. .1 0 K1 /gx . 1/ KQ 0 D K0 only if gx D gy D 0. ECON 4610: Lecture 10 (27) Generalization to ADL(p,q) The ADL.p, q/ can be written as a EqCM in several equivalent ways. For p D q D 4: yt 4 X i yt i i D1 D C 4 X i xt i i D0 C "t , (28) one possibility is to put the levels term at the fourth lag 1yt D C 3 X i D1 C. .1/ 0 1yt i 1/yt i 4 C 3 X 0 i 1xt i i D0 C .1/xt 4 (29) C "t where 0 i 0 i D D i X j j D1 i X j D0 j, 1, i D 1, 2, 3, i D 0, 1, 2, 3. ECON 4610: Lecture 10 (30) Alternatively, 1yt D C 3 X i D1 i C. .1/ 1yt i 1/yt C 1 3 X i 1xt i D0 C .1/xt 1 i (31) C "t with i D 0 D i D 4 X j, 4 X j, j Di C1 i D 1, 2, 3, (32) 0 j Di C1 i D 1, 2, 3. ECON 4610: Lecture 10 In both cases: K1 D 1 .1/ D 1 .1/ .1 P4 j D0 P4 and 0 0 D 0 D j D1 j j/ 0, but 0 i 0 i 6D i , i D 1, 2, 3, 6D i , i D 1, 2, 3. ECON 4610: Lecture 10 (33) Estimation OLS for the ADL[p, q ]-model yt p X j yt j j D1 D C q X j xt j j D0 C "t (34) yields an estimator bT for the parameter vector D [ , 1 , 2 , ... 0 , ..., k ] with the following properties ("t is w.n.; E[Xt "s ] D 0 for all t,s): p T .bT / ! N.0, •/ tT rFT T !1 ! T !1 ! T !1 N.0, 1/ (35) 2 .r / • is a constant covariance matrix, tT the “t-value” and rFT the “F-statistic” for a r linear restrictions. ECON 4610: Lecture 10 (35) holds as long as the roots of the characteristic polynomial (34): p. / D p 1 p 1 ::: p , are inside the unit-circle. The proof is an application of the Mann-Wald theorem, and is not shown here. Finite sample bias arise because E[Yt j "t i ] 6D 0 for j i . An important issue is how fast this bias (the Hurwicz-bias) is disappearing.— How large is “large”? The AR(1) process yt D C 1 yt 1 C "t , "t N.0, 2 /, t D 1, : : : T . from the …rst lecture illustrates several features. ECON 4610: Lecture 10 (36)