(Three-Variable) Linear Regression Model

advertisement
ECONOMICS 351* -- NOTE 12
M.G. Abbott
ECON 351* -- NOTE 12
OLS Estimation of the Multiple (Three-Variable)
Linear Regression Model
This note derives the Ordinary Least Squares (OLS) coefficient estimators for the
three-variable multiple linear regression model.
•
The population regression equation, or PRE, takes the form:
Yi = β0 + β1X1i + β 2 X 2i + u i
(1)
where ui is an iid random error term.
•
The OLS sample regression equation (OLS-SRE) for equation (1) can be
written as
Yi = βˆ 0 + βˆ 1X1i + βˆ 2 X 2i + û i = Ŷi + û i
(i = 1, ..., N).
(2)
where the β$ j are the OLS estimators of the corresponding population regression
coefficients βj (j = 0, 1, 2),
û i = Yi − Ŷi = Yi − βˆ 0 − βˆ 1X1i − βˆ 2 X 2i
(i = 1, ..., N)
are the OLS residuals, and
Ŷi = βˆ 0 + βˆ 1X1i + βˆ 2 X 2i
(i = 1, ..., N)
are the OLS estimated (or predicted) values of Yi.
The function f (X1i , X 2i ) = βˆ 0 + βˆ 1X1i + βˆ 2 X 2i is called the OLS sample
regression function (or OLS-SRF).
ECON 351* -- Note 12: OLS Estimation in the Multiple CLRM
… Page 1 of 17 pages
ECONOMICS 351* -- NOTE 12
M.G. Abbott
1. The OLS Estimation Criterion
The OLS coefficient estimators are those formulas (or expressions) for β̂0 , β̂1 ,
and β̂ 2 that minimize the sum of squared residuals RSS for any given sample of
size N.
The OLS estimation criterion is therefore:
(
)
Minimize RSS βˆ 0 , βˆ 1 , βˆ 2 =
N
∑ û i2
=
i =1
∑ (Yi − βˆ 0 − βˆ 1X1i − βˆ 2 X 2i )
N
2
(3)
i =1
{ β$ j }
(
)
(
)
Interpretation of the RSS βˆ 0 , βˆ 1 , βˆ 2 function:
♦
The knowns in the RSS βˆ 0 , βˆ 1 , βˆ 2 function are the sample observations
(Yi , 1, X1i , X 2i ) for i = 1, …, N. In other words, the N sample values of the
observable variables Y, X1, X2 are taken as known (or given).
♦
The unknowns in the RSS βˆ 0 , βˆ 1 , βˆ 2 function are therefore the coefficient
estimators βˆ , βˆ , βˆ .
(
0
♦
1
)
2
(
For purposes of deriving the OLS coefficient estimators, the RSS βˆ 0 , βˆ 1 , βˆ 2
function is interpreted as a function of the three unknowns βˆ , βˆ , βˆ .
0
ECON 351* -- Note 12: OLS Estimation in the Multiple CLRM
1
)
2
… Page 2 of 17 pages
ECONOMICS 351* -- NOTE 12
M.G. Abbott
2. The OLS Normal Equations: Derivation of the FOCs
(
)
STEP 1: Re-write the RSS βˆ 0, βˆ 1, βˆ 2 function in (3) as follows:
(
)
RSS βˆ 0 , βˆ 1 , βˆ 2 =
N
∑ û i2 =
i =1
N
∑ f (û i )
where
i =1
f (û i ) = û i2
û i = Yi − βˆ 0 − βˆ 1X1i − βˆ 2 X 2i
(3)
Note: The function f (û i ) = û i2 is a function of û i , and û i is in turn a function
of β̂0 , β̂1 , and β̂ 2 .
(
)
STEP 2: Partially differentiate the RSS βˆ 0, βˆ 1, βˆ 2 function in (3) with respect
to β̂ , βˆ , and β̂ :
0
•
1
Using the chain rule of differentiation, each partial derivative of the
RSS βˆ 0, βˆ 1, βˆ 2 function takes the general form
(
∂RSS
=
∂βˆ j
•
2
)
N
∑
i =1
d f ∂û i
.
d û i ∂βˆ j
(4)
Using the power rule of differentiation, the derivative df d û i is
df
d (û i2 )
=
= 2 û i .
d û i
d û i
The partial derivatives ∂ RSS ∂βˆ j for j = 0, 1, 2 are therefore
N d f ∂ û
N
N
∂RSS
∂û
∂û
i
= ∑
= ∑ 2 û i i = 2 ∑ û i i
ˆ
i =1 d û i ∂β
i =1
i =1
∂βˆ j
∂βˆ j
∂βˆ j
j
ECON 351* -- Note 12: OLS Estimation in the Multiple CLRM
j = 0, 1, 2.
(5)
… Page 3 of 17 pages
ECONOMICS 351* -- NOTE 12
•
Since the i-th residual is û i = Yi − βˆ 0 − βˆ 1X1i − βˆ 2 X 2i , the partial derivatives
∂ u$ ∂β$ for j = 0, 1, 2 are:
i
j
∂û i
= −1;
∂βˆ 0
•
M.G. Abbott
∂û i
= −X1i ;
∂βˆ 1
∂û i
= −X 2i .
∂βˆ 2
Substitute the partial derivatives ∂ u$ i ∂β$ j for j = 0, 1, 2 into equation (5):
N
∂RSS
∂û
= 2 ∑ û i i
i =1
∂βˆ j
∂βˆ j
j = 0, 1, 2.
(5)
The partial derivatives ∂RSS ∂β$ j for j = 0, 1, 2 thus take the form:
N
N
N
∂RSS
∂û i
= 2 ∑ û i
= 2 ∑ û i (−1) = − 2 ∑ û i
∂βˆ 0
∂βˆ 0
i =1
i =1
i =1
(6.1)
N
N
N
∂RSS
∂û i
= 2 ∑ û i
= 2 ∑ û i (−X1i ) = − 2 ∑ X1i û i
∂βˆ 1
∂βˆ 1
i =1
i =1
i =1
(6.2)
N
N
N
∂RSS
∂û
= 2 ∑ û i i = 2 ∑ û i (−X 2i ) = − 2 ∑ X 2i û i
∂βˆ 2
∂βˆ 2
i =1
i =1
i =1
(6.3)
ECON 351* -- Note 12: OLS Estimation in the Multiple CLRM
… Page 4 of 17 pages
ECONOMICS 351* -- NOTE 12
M.G. Abbott
STEP 3: Obtain the first-order conditions (FOCs) for a minimum of the RSS
function by setting the partial derivatives (6.1)-(6.3) equal to zero, then dividing
each equation by −2, and finally setting û i = Yi − βˆ 0 − βˆ 1X1i − βˆ 2 X 2i :
•
N
∂RSS
= − 2 ∑ û i
∂βˆ 0
i =1
∂RSS
=0
∂βˆ 0
⇒
(6.1)
N
− 2 ∑ û i = 0
N
∑ û i = 0
(7.1)
∑ (Yi − βˆ 0 − βˆ 1X1i − βˆ 2 X 2i ) = 0
(8.1)
i =1
⇒
i =1
N
⇒
i =1
•
N
∂RSS
= − 2 ∑ X1i û i
∂βˆ 1
i =1
∂RSS
=0
∂βˆ
⇒
(6.2)
N
− 2 ∑ X1i û i = 0
⇒
∑ X1i û i
= 0
(7.2)
∑ X1i (Yi − βˆ 0 − βˆ 1X1i − βˆ 2 X 2i ) = 0
(8.2)
i =1
1
N
i =1
N
⇒
i =1
•
N
∂RSS
= − 2 ∑ X 2i û i
i =1
∂βˆ 2
∂RSS
=0
∂βˆ 2
⇒
⇒
(6.3)
N
− 2 ∑ X 2i û i = 0
N
∑ X 2i û i = 0
(7.3)
∑ X 2i (Yi − βˆ 0 − βˆ 1X1i − βˆ 2 X 2i ) = 0
(8.3)
i =1
⇒
i =1
N
i =1
ECON 351* -- Note 12: OLS Estimation in the Multiple CLRM
… Page 5 of 17 pages
ECONOMICS 351* -- NOTE 12
M.G. Abbott
STEP 4: Rearrange each of the equations (8.1)-(8.3) to put them in the
conventional form of the OLS normal equations. Thus, taking summations and
rearranging terms, we obtain the OLS normal equations:
•
∑ (Yi − βˆ 0 − βˆ 1X1i − βˆ 2 X 2i ) = 0
N
i =1
N
(8.1)
N
N
i =1
N
i =1
N
i =1
N
i =1
N
N
i =1
i =1
i =1
∑ Yi − Nβˆ 0 − βˆ 1 ∑ X1i − βˆ 2 ∑ X 2i = 0
i =1
N
− Nβˆ 0 − βˆ 1 ∑ X1i − βˆ 2 ∑ X 2i = −∑ Yi
i =1
Nβˆ 0 + βˆ 1 ∑ X1i + βˆ 2 ∑ X 2i = ∑ Yi
•
(N1)
∑ X1i (Yi − βˆ 0 − βˆ 1X1i − βˆ 2 X 2i ) = 0
N
(8.2)
i =1
N
∑ (X1i Yi − βˆ 0 X1i − βˆ 1X12i − βˆ 2 X1i X 2i ) = 0
i =1
N
N
N
N
i =1
N
i =1
N
i =1
N
∑ X1i Yi − βˆ 0 ∑ X1i − βˆ 1 ∑ X12i − βˆ 2 ∑ X1i X 2i = 0
i =1
N
− βˆ 0 ∑ X1i − βˆ 1 ∑ X12i − βˆ 2 ∑ X1i X 2i = −∑ X1i Yi
i =1
N
i =1
N
i =1
N
N
i =1
i =1
i =1
i =1
i =1
β̂ 0 ∑ X1i + β̂1 ∑ X12i + β̂ 2 ∑ X1i X 2i = ∑ X1i Yi
•
∑ X 2i (Yi − βˆ 0 − βˆ 1X1i − βˆ 2 X 2i ) = 0
N
(N2)
(8.3)
i =1
N
∑ (X 2i Yi − βˆ 0 X 2i − βˆ 1X 2i X1i − βˆ 2 X 22i ) = 0
i =1
N
N
N
N
i =1
N
i =1
N
i =1
N
N
i =1
N
i =1
N
i =1
N
i =1
N
i =1
i =1
i =1
i =1
∑ X 2i Yi − βˆ 0 ∑ X 2i − βˆ 1 ∑ X 2i X1i − βˆ 2 ∑ X 22i = 0
i =1
− βˆ 0 ∑ X 2i − βˆ 1 ∑ X 2i X1i − βˆ 2 ∑ X 22i = −∑ X 2i Yi
βˆ 0 ∑ X 2i + βˆ 1 ∑ X 2i X1i + βˆ 2 ∑ X 22i = ∑ X 2i Yi
ECON 351* -- Note 12: OLS Estimation in the Multiple CLRM
(N3)
… Page 6 of 17 pages
ECONOMICS 351* -- NOTE 12
M.G. Abbott
RESULT: Assemble the three OLS normal equations (N1)-(N3):
N
N
N
i =1
i =1
i =1
Nβˆ 0 + βˆ 1 ∑ X1i + βˆ 2 ∑ X 2i = ∑ Yi
(N1)
N
N
N
N
i =1
i =1
i =1
i =1
N
N
N
N
i =1
i =1
i =1
i =1
βˆ 0 ∑ X1i + βˆ 1 ∑ X12i + βˆ 2 ∑ X1i X 2i = ∑ X1i Yi
βˆ 0 ∑ X 2i + βˆ 1 ∑ X 2i X1i + βˆ 2 ∑ X 22i = ∑ X 2i Yi
(N2)
(N3)
•
The OLS normal equations (N1)-(N3) constitute three linear equations in the
three unknowns β̂0 , β̂1 , and β̂ 2 .
•
Solution of the OLS normal equations (N1)-(N3) yields explicit expressions
(or formulas) for β̂0 , β̂1 , and β̂ 2 ; these expressions are the OLS estimators β̂0 ,
β̂1 , and β̂ 2 of the partial regression coefficients β0, β1, and β2 respectively.
ECON 351* -- Note 12: OLS Estimation in the Multiple CLRM
… Page 7 of 17 pages
ECONOMICS 351* -- NOTE 12
M.G. Abbott
3. Expressions for the OLS Coefficient Estimators
•
The expressions (formulas) for the OLS estimators are most conveniently
written in deviation-from-means form, which uses lower case letters to denote
the deviations of the sample values of each observable variable from their
respective sample means. Thus, define the deviations-from-means of Yi, X1i,
and X2i as:
y i ≡ Yi − Y;
x 1i ≡ X1i − X1 ;
x 2i ≡ X 2i − X 2 ;
where
Σ i Yi
is the sample mean of the Yi values;
N
ΣX
X1 = Σ i X1i N = i 1i is the sample mean of the X1i values;
N
ΣX
X 2 = Σ i X 2i N = i 2i is the sample mean of the X2i values.
N
Y = Σ i Yi N =
‰
‰
The OLS slope coefficient estimators β̂1 and β̂2 in deviation-from-means
form are:
(Σ x 2 )(Σ x y ) − (Σi x1i x 2i )(Σi x 2i yi ) ;
βˆ 1 = i 2i i2 1i i 2
(Σi x1i )(Σi x 2i ) − (Σi x1i x 2i )2
(9.2)
2
ˆβ = (Σ i x1i )(Σ i x 2i y i ) − (Σ i x1i x 2i )(Σ i x1i y i ) .
2
(Σi x12i )(Σi x 22i ) − (Σi x1i x 2i )2
(9.3)
The OLS intercept coefficient estimator β̂0 is:
βˆ 0 = Y − βˆ 1X1 − βˆ 2 X 2 .
ECON 351* -- Note 12: OLS Estimation in the Multiple CLRM
(9.1)
… Page 8 of 17 pages
ECONOMICS 351* -- NOTE 12
M.G. Abbott
4. The OLS Variance-Covariance Estimators
‰
An unbiased estimator of the error variance σ2
•
For the general multiple linear regression model with K regression
coefficients, an unbiased estimator of the error variance σ2 is the degrees-offreedom-adjusted estimator
Σ i û i2
RSS
σˆ =
=
(N − K ) (N − K )
2
where K = k + 1 = the total number of regression coefficients in the PRF.
•
For the three-variable multiple linear regression model (such as regression
equation (1) above) for which K = 3, the unbiased estimator of the error
variance σ2 is therefore
Σ i u$ 2i
RSS
=
.
σ$ =
( N − 3) ( N − 3)
2
(10)
where (N − 3) is the degrees of freedom for the residual sum of squares RSS in
the OLS-SRE (2). σ$ 2 is an unbiased estimator of σ2 because it can be shown
that E( ∑ i u$ 2i ) = E(RSS) = (N − 3)σ2 .
•
The error variance estimator σ$ 2 is used to obtain unbiased estimators of the
variances and covariances of the OLS coefficient estimators β̂0 , β̂1 , and β̂ 2 .
ECON 351* -- Note 12: OLS Estimation in the Multiple CLRM
… Page 9 of 17 pages
ECONOMICS 351* -- NOTE 12
‰
Formulas for the variances and covariances of the slope coefficient
estimators β̂1 and β̂ 2 in the three-variable multiple regression model
( )
σ 2 Σ i x 22i
;
(Σi x12i )(Σi x 22i ) − (Σi x1i x 2i )2
( )
σ 2 Σ i x 12i
;
(Σ i x12i )(Σ i x 22i ) − (Σ i x1i x 2i )2
(
)
Var βˆ 1 =
Var βˆ 2 =
Cov βˆ 1 , βˆ 2 =
‰
M.G. Abbott
σ 2 Σ i x1i x 2i
.
(Σi x12i )(Σi x 22i ) − (Σi x1i x 2i )2
Unbiased estimators of the variances of the slope coefficient estimators β̂1
and β̂ 2 are obtained by substituting the unbiased estimator σ$ 2 for the unknown
error variance σ 2 in the formulas for Var(βˆ ) and Var(βˆ ) :
1
( )
σˆ 2 Σ i x 22i
;
(Σ i x12i )(Σ i x 22i ) − (Σ i x1i x 2i )2
(11.1)
( )
σˆ 2 Σ i x 12i
;
(Σ i x12i )(Σ i x 22i ) − (Σ i x1i x 2i )2
(11.2)
Vâr βˆ 1 =
Vâr βˆ 2 =
‰
2
Similarly, an unbiased estimator of the covariance between the slope
coefficient estimators β̂1 and β̂ 2 is obtained by substituting the unbiased
estimator σ$ 2 for the unknown error variance σ 2 in the formula for
Cov(βˆ 1 , βˆ 2 ) :
(
)
Côv βˆ 1 , βˆ 2 =
σˆ 2 Σ i x 1i x 2i
.
(Σ i x12i )(Σ i x 22i ) − (Σ i x1i x 2i )2
ECON 351* -- Note 12: OLS Estimation in the Multiple CLRM
(11.3)
… Page 10 of 17 pages
ECONOMICS 351* -- NOTE 12
‰
M.G. Abbott
Interpretive formula for the variances of the OLS slope coefficient
estimators β̂ j , j = 1, 2, …, k
Consider the general multiple linear regression equation given by the PRE
Yi = β0 + β1X1i + L + β jX ji + L + β k X ki + u i
(12.1)
OLS estimation of the PRE in (11) yields the OLS SRE
Yi = βˆ 0 + βˆ 1X1i + L + βˆ jX ji + L + βˆ k X ki + û i
•
(12.2)
The formula for Var(βˆ j ) for j = 1, 2, …, k can be written as
Var(βˆ j ) =
σ2
TSS j ( 1 − R 2j )
for j = 1, 2, …, k
(13)
where
TSS j ≡ ∑ x 2ji ≡ ∑ (X ji − X j ) ≡ the total sample variation in the regressor Xj;
N
N
i =1
i =1
2
R 2j ≡ the R2 from the OLS regression of regressor Xj on all the other K−1
regressors in (12.1), including the intercept. That is, R 2j measures the
proportion of the total sample variation in Xj that is explained by the
other regressors in the PRE. Alternatively, R 2j measures the degree of
linear dependence between the sample values Xji of the regressor Xj
and the sample values of the other regressors in regression equation
(12.1).
This formula for Var (βˆ j ) is given in J. Kmenta, Elements of Econometrics, 2nd
edition (1986), pp. 437-438.
ECON 351* -- Note 12: OLS Estimation in the Multiple CLRM
… Page 11 of 17 pages
ECONOMICS 351* -- NOTE 12
•
M.G. Abbott
Determinants of Var(βˆ j ) .
Var(βˆ j ) =
σ2
TSS j ( 1 − R 2j )
for j = 1, 2, …, k
(13)
Three factors determine Var(βˆ j ) :
1. the error variance σ 2 ;
2. the total sample variation in Xj, TSSj;
3. the degree of linear dependence between Xj and the other regressors in
the model, as measured by R 2j .
•
Var(βˆ j ) is smaller:
(1) the smaller is σ2, the error variance in the true model;
(2) the larger is TSSj, the total sample variation in the regressor Xj;
TSS j ≡ ∑ x 2ji ≡ ∑ (X ji − X j ) is larger
•
N
N
i =1
i =1
2
the larger the values of x 2ji = ( X ji − X j ) 2 for i = 1, …, N, meaning the
greater the sample variation in the Xji values around their sample mean;
•
the larger is N, the size of the estimation sample;
(3) the smaller (closer to 0) is R 2j , the lower the degree of linear dependence
between the sample values Xji of the regressor Xj and the sample values of
the other regressors in the PRE.
ECON 351* -- Note 12: OLS Estimation in the Multiple CLRM
… Page 12 of 17 pages
ECONOMICS 351* -- NOTE 12
•
M.G. Abbott
Conversely, Var(βˆ j ) is larger:
(1) the larger is σ2, the error variance in the true model;
(2) the smaller is TSSj, the total sample variation in the regressor Xj;
TSS j ≡ ∑ x 2ji ≡ ∑ (X ji − X j ) is smaller
•
N
N
i =1
i =1
2
the smaller the values of x 2ji = ( X ji − X j ) 2 for i = 1, …, N, meaning the
greater the sample variation in the Xji values around their sample mean;
•
the smaller is N, the size of the estimation sample;
(3) the larger (closer to 1) is R 2j , the greater the degree of linear dependence
between the sample values Xji of the regressor Xj and the sample values of
the other regressors in the PRE.
Note: Assumption A8, the absence of perfect multicollinearity, rules out the
value 1 for R 2j .
If R 2j = 1, then the sample values of the regressor Xj exhibit an exact linear
dependence -- are perfectly multicollinear -- with one or more of the other
regressors in the model, in which case it is impossible to compute either
(1) the OLS estimate β̂ j of the slope coefficient βj associated with the
regressor Xj, or
(2) the estimated value of Var(βˆ j ) , the estimated variance of β̂ j .
If R 2j < 1, then multicollinearity is simply a question of degree: the closer to
1 is the sample value of R 2 , the larger is Var (βˆ ) .
j
j
Var(βˆ j ) → ∞
as
R 2j → 1
ECON 351* -- Note 12: OLS Estimation in the Multiple CLRM
… Page 13 of 17 pages
ECONOMICS 351* -- NOTE 12
M.G. Abbott
5. Computational Properties of the OLS-SRE (2)
Yi = βˆ 0 + βˆ 1X1i + βˆ 2 X 2i + û i = Ŷi + û i
(i = 1, ..., N).
(2)
5.1 The OLS-SRE passes through the point of sample means (Y, X1 , X 2 ) ; i.e.,
Y = βˆ 0 + βˆ 1X1 + βˆ 2 X 2 .
(C1)
Proof of (C1): Follows directly from dividing OLS normal equation (N1) by
N.
N
N
i =1
i =1
Nβˆ 0 + βˆ 1 ∑ X1i + βˆ 2 ∑ X 2i =
N
Nβˆ 0 ˆ
+ β1
N
∑ X1i
i =1
N
N
+ βˆ 2
∑ X 2i
i =1
N
N
∑ Yi
(N1)
i =1
N
=
∑ Yi
i =1
N
dividing (N1) by N
βˆ 0 + βˆ 1X1 + βˆ 2 X 2 = Y .
ECON 351* -- Note 12: OLS Estimation in the Multiple CLRM
… Page 14 of 17 pages
ECONOMICS 351* -- NOTE 12
M.G. Abbott
5.2 The sum of the estimated Yi's (the Ŷi 's) equals the sum of the observed Yi's;
or the sample mean of the estimated Yi's (the Ŷi 's) equals the sample mean
of the observed Yi's.
N
N
∑ Ŷi
N
∑ Ŷi = ∑ Yi
i =1
i =1
or
N
i =1
N
=
∑ Yi
i =1
Ŷ = Y
or
N
(C2)
Proof of (C2):
1. Substitute βˆ 0 = Y − βˆ 1X1 − βˆ 2 X 2 in the expression for Ŷi :
Ŷi = βˆ 0 + βˆ 1X1i + βˆ 2 X 2i
= Y − βˆ 1X1 − βˆ 2 X 2 + βˆ 1X1i + βˆ 2 X 2i
= Y + βˆ (X − X ) + βˆ (X − X )
(
)
1
1i
1
2
2i
2
= Y + βˆ 1x1i + βˆ 2 x 2i
since
x1i ≡ (X1i − X1 )
x 2i ≡ (X 2i − X 2 )
2. Now sum the above equation over the sample:
N
N
N
i =1
i =1
i =1
∑ Ŷi = NY + βˆ 1 ∑ x1i + βˆ 2 ∑ x 2i
= NY
because
N
N
i =1
i =1
∑ x1i = ∑ x 2i = 0.
3. Thus, dividing both sides of the above equation by N, we obtain the result
N
N
N
∑ Ŷi
i =1
N
∑ Ŷi = NY ⇒
i =1
= Y or
∑ Ŷi
i =1
N
N
=
∑ Yi
i =1
N
N
N
i =1
i =1
$ = Y or ∑ Ŷ = ∑ Y .
or Y
i
i
ECON 351* -- Note 12: OLS Estimation in the Multiple CLRM
… Page 15 of 17 pages
ECONOMICS 351* -- NOTE 12
M.G. Abbott
5.3 The sum of the OLS residuals û i (i = 1, …, N) equals zero, or the sample
mean of the OLS residuals û i equals zero.
N
N
∑ u$ i
=0
or
û =
i =1
∑ û i
i =1
N
= 0.
(C3)
Proof of (C3): An immediate implication of equation (7.1).
∑(
N
i =1
)
Yi − βˆ 0 − βˆ 1X1i − βˆ 2 X 2i = 0
N
∑ u$ i
⇔
=0
(7.1)
i =1
5.4 The OLS residuals û i (i = 1, …, N) are uncorrelated with the sample values
of the regressors X1i and X2i (i = 1, …, N): i.e.,
N
∑ X1i û i = 0
i =1
N
and
∑ X 2i û i = 0 .
(C4)
i =1
Proof of (C4): An immediate implication of equations (7.2) and (7.3).
N
(
)
(
)
∑ X1i Yi − βˆ 0 − βˆ 1X1i − βˆ 2 X 2i = 0
i =1
N
∑ X 2i Yi − βˆ 0 − βˆ 1X1i − βˆ 2 X 2i = 0
i =1
⇔
N
∑ X1i û i = 0
(7.2)
i =1
⇔
N
∑ X 2i û i = 0
(7.3)
i =1
ECON 351* -- Note 12: OLS Estimation in the Multiple CLRM
… Page 16 of 17 pages
ECONOMICS 351* -- NOTE 12
M.G. Abbott
5.5 The OLS residuals û i (i = 1, …, N) are uncorrelated with the estimated
values of Yi, the Ŷi values (i = 1, …, N): i.e.,
N
∑ Ŷi û i = 0 .
(C5)
i =1
Proof of (C5): Follows from properties (C3) and (C4) above.
$ are given by the following expression for the OLS sample regression
1. The Y
i
function (the OLS-SRF):
Ŷi = βˆ 0 + β1X1i + βˆ 2 X 2i .
2. Multiply the above equation by u$ i :
Ŷi û i = βˆ 0 û i + βˆ 1X1i û i + βˆ 2 X 2i û i .
3. Summing both sides of the above equation over the sample gives the result
N
N
N
N
i =1
i =1
i =1
i =1
∑ Ŷi û i = βˆ 0 ∑ û i + βˆ 1 ∑ X1i û i + βˆ 2 ∑ X 2i û i
=0
because
N
∑ û i = 0
by property (C3)
i =1
and
N
∑ X1i û i =
i =1
N
∑ X 2i û i = 0
by property (C4).
i =1
ECON 351* -- Note 12: OLS Estimation in the Multiple CLRM
… Page 17 of 17 pages
Download