US Wine Sales vs Population 1934

advertisement
Regression with Autocorrelated Errors
U.S. Wine Consumption and Adult
Population – 1934-2002
Data Description
• Y=U.S. Annual Wine Consumption (Millions of
Gallons)
• X=U.S. Adult Population (Millions of People)
• Years – 1934-2002 (Post Prohibition)
• Model:
Yt   0  1 X t   t
 t   t  1 t 1  ...   q t q
 t ~ iid 0, 
2
     
Ordinary Least Squares Regression
Regression Statistics
Multiple R
0.965612383
R Square
0.932407274
Adjusted R Square
0.931398427
Standard Error
48.64438813
Observations
69
Intercept
apop_m
ANOVA
df
1
67
68
Regression
Residual
Total
Coefficients
-347.9736
4.3092
SS
2186985.91
158540.53
2345526.43
Standard Error
t Stat
21.9895
-15.8245
0.1417
30.4012
^
 e  e 
2
t 1
t
n
e
t 1
2
t

19008.8044
 0.11989871
158540.53
d L  p  1, n  70   1.58 dU  p  1, n  70   1.64
DW  dU  Autocorrel ation is present
Significance F
0.0000
Upper 95%
-304.0824
4.5921
 e  2366.28  48.64
n
t 2
Lower 95%
-391.8649
4.0263
F
924.23
^
W t  347.97  4.31Pt
Durbin - Watson Test : DW 
P-value
0.0000
0.0000
MS
2186985.91
2366.28
Residuals versus Time
150
100
Residuals
50
0
-50
-100
1
3
5
7
9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 41 43 45 47 49 51 53 55 57 59 61 63 65 67 69
Covariance Structure (q=1)
Yt   0  1 X t   t
 t   t   t 1  t ~ iid 0,  2 
    
1    1
2
Assuming : E  1   0 V  1  
1  2
 2 2  2 1   2    2 2
2
 E  2   0 V  2   V  2    V  1    


1  2
1  2
1  2
2
2
 2
COV  1 ,  2   COV  1 , 2  1   V  1  
1  2
In General :
2
E  t   0 V  t  
1  2
 k 2
COV  t ,  t  k  
1  2
 
 1  
 1
   n 1 
 


1
  n2 
  2  
2  
V ε   V 


2
 

1




 
 
 n 1

n2
  



1


 n
  2  1   2 V  t   V  t   COV  t ,  t 1 


COV  t ,  t 1 
V  t 
Generalized Least Squares (q=1)
 1

2  
V  V Y   V ε  
1  2  
 n 1
 

1

 n2
 1  2

 
 0
1
Define : T  
 

 0
 0
  n 1 

  n2 

 


1 
0
0

0
1

0
1


0
0

 

0
0
0
0
 1
 
0

0
0



0
1
Consider t he Case where n  4 :
 1  2


T 1 V  
 0

 0
0
0
1
0

1
0

 1  2

2

 0
T 1 VT 1' 
1  2  0

 0
0

2
0 
2
0 1  

1
 1  2
1  2
1


 2
 3
 

2

1

2
1

0
 2 1  2
 1   2 
1  2
0
0
3

2
2

  1  2

1 
 1  2

 0
 0

 0
 3 1  2  1  2

 2 1   2   0
 1   2    0

2
1     0
 1  2
1  2
0
 2 1  2
 1   2 
1  2
0
0

0
1

0
1
0
0
 3 1  2 

 2 1   2 
 1   2  

1   2 
0 

0 

 

1 
1   2
0
0
0 


1  2
0
0 
2  0
  2 I  V T 1 Y  T 1 VT 1'   2 I  Transform Y*  T 1 Y  T 1 Xβ  T 1ε
2
2
1   0
0
1 
0 


0
0
1   2 
 0


Estimated GLS (q=1)
^
1 n 2
 (0)   et
n t 1
^
^ 1
T

^ 2
 
 ^
  ( 0)
^

 

 0



 0

 0
1 n
 (1)   et et 1
n t 2
^
^

0
0

0
0
1
0

0
0

1

0
0





^
 (1)
0


1
0
0

0


^
 ( 0)
^
^
^
  (0)    (1)


0


0

0


0

1
^
0
^ 2
^
1
^
β EG LS
^ 1 ^ 1




X'
T
'
T
X




1
^ 1
X' T ' T
^ 1 ^ 1





V  β EG LS    e 
X'
T
'
T
X






^
^
^ 2
^ ^
^


(
0
)



(
1
)



s2  
n  p '1
^ 1
^ 2
e
Y
^
1


SE   i  


^
ti 
 EGLS ,i
 ^

SE   EGLS ,i 


^
^
s 2 /  ( 0)
1
^
^
^

 ^


 Y  X β EG LS ' T ' T  Y  X β EG LS 




n  p '1
ti 
i
 ^ 
SE   i 


Estimated GLS (q=1) – Wine/Population Data
^
^
 (0)  2297.69  (1)  2147.89
^
  0.9348
^ 2
  289.82
0
0 
0
0
0
 0.3552
- 0.9348

1
0

0
0
0



1
^

0
- 0.9348 1 
0
0
0
T 











0
0
0  - 0.9348
1
0


0
0
0 
0
- 0.9348 1

^
^ 2
- 347.23
β EGLS  
 e  280.5479

 4.2540 
^ ^
4.2540
4.2540

 5482.225 - 31.515
V  β EGLS   
t


 9.358
1

0.2066 
0.2066 0.4546

  - 31.515
s 2  4.39
 
SE   i  
 
^
^
^
s 2 /  (0)  .04372
ti 
i
^ 
SE   i 
 

0.9348
 21.38
0.04372
Transformed residuals versus Year - EGLS (q=1)
60
40
Residuals
20
0
-20
-40
-60
1
3
5
7
9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 41 43 45 47 49 51 53 55 57 59 61 63 65 67 69
DW  1.67 d L  p  1, n  70  1.58 dU  p  1, n  70  1.64
DW  dU  Do not concludeAu tocorrelat ion is present
Untransformed data and Fitted Equation from EGLS
700
600
500
Wine Sales
400
Y
Yhat_w
300
200
100
0
50
70
90
110
130
150
Population
170
190
210
230
250
Estimated GLS (General q) - I
1) Fit Ordinary Least Squares Regression and Obtain OLS Residuals :
β OLS  X' X  X' Y e OLS  Y  X β OLS
^
^
1
2) Estimate the autocovari ances to lag q and the following qxq matrix and qx1 vector :
^
^
 ^

^ 

(
0
)

(
1
)


(
q

1
)
 ^

 ^ (1) 


e
e
^
^

t
t

h
^
^
^

 (0)   (q  2) γ   (2) 
 (h)  t  h 1
h  0,1,..., q Γ q    (1)
q



n









^
^
^ (q  1) ^ (q  2) 



 ( 0) 

 (q )
3) Estimate the Coefficien ts of the lagged errors - qx1 vector and V  t    2 :
n
^
^ 1 ^
^ 2
^
^ ^
ρ  Γ q γ q    ( 0)  ρ ' γ q
^ 1
^ 1
^
^
4) Obtain the Cholesky Decomposit ion of Γ q : Γ q  P q ' P q
^ 1/2
^
5) Obtain the transform ation matrix : T  V
^ 2 ^
T11   P q
^
 q
0

T2   

0

 0

^
q

(q  q)
^
1
q  (n  q) 
T12  0
(assuming n  2(q  1)) :
T1  T11 T12 
0 
0
0
0
0
 1 1 
0
0
0
0




1
^


 
^
^
0
0
0
0  q

1
0
0
0 
^
0
q
 1
0
1
^

0
0


0

1
q  n 
(n  q)  n 
^
T 
T   1  ( n  n)
T2 
Estimated GLS (General q) - II
1
^ ^
 ^ ^ 
6) Obtain the estimated generalize d least squares estimate : β EGLS   X' T' T X  X' T' T Y


7) Obtain the Estimated error vari ance from the transform ed model :
^
^
^

 ^ ^

 Y  X β EGLS ' T' T Y  X β EGLS 
^ 2



e  
n  p ' q
^
 ^  ^ ^ 
8) Obtain the estimated variance - covariance matrix of β EGLS : V  β EGLS    e  X' T' T X 




^
2
^
1
^
9) Obtain estimated t - statistics for regression coefficien ts : ti 
 EGLS ,i
^

SE   EGLS ,i 


10) Obtain residual MS for estimates of autoregres sive parameters  and the standard errors :
^ ^
^


(
0
)

ρ
'
γ
q
^


^ 
s2  
SE   i   s 2  ii
n  p ' q
 
11) Obtain t - statistics for the  i :
^
^
ti 
i
^ 
SE   i 
 
^ 1
 ii  i diagonal element of Γ q
th
and compare with the t - distributi on with df  n  p 'q
SAS Proc Autoreg Output
The AUTOREG Procedure
Dependent Variable
wine
Ordinary Least Squares Estimates
SSE
158540.525
DFE
MSE
2366
Root MSE
SBC
738.318203
AIC
Regress R-Square
0.9324
Total R-Square
Durbin-Watson
0.1199
Variable
Intercept
adpop
DF
1
1
Estimate
-347.9736
4.3092
Standard
Error
21.9895
0.1417
t Value
-15.82
30.40
67
48.64439
733.84999
0.9324
Approx
Pr > |t|
<.0001
<.0001
Estimates of Autocorrelations
Lag
Covariance
Correlation
0
1
2297.7
2147.9
1.000000
0.934807
-1 9 8 7 6 5 4 3 2 1 0 1 2 3 4 5 6 7 8 9 1
|
|
|********************|
|******************* |
SAS Proc Autoreg Output
Preliminary MSE
289.8
Estimates of Autoregressive Parameters
Standard
Lag
Coefficient
Error
t Value
1
-0.934807
0.043717
-21.38
SSE
MSE
SBC
Regress R-Square
Durbin-Watson
Yule-Walker Estimates
18516.1612
DFE
280.54790
Root MSE
596.454422
AIC
0.5702
Total R-Square
1.6728
Variable
DF
Estimate
Standard
Error
Intercept
adpop
1
1
-347.2297
4.2540
74.0420
0.4546
66
16.74956
589.752103
0.9921
t Value
Approx
Pr > |t|
-4.69
9.36
<.0001
<.0001
Download