Document 10639898

advertisement
Least Squares Estimation of the Expected
Value of the Response Vector
c
Copyright 2012
Dan Nettleton (Iowa State University)
Statistics 611
1 / 19
Consider the General Linear Model (GLM) previously introduced,
where
y = Xβ + ε,
with E(ε) = 0.
c
Copyright 2012
Dan Nettleton (Iowa State University)
Statistics 611
2 / 19
Suppose we want to estimate
E(y) = Xβ
based on our observed response y.
c
Copyright 2012
Dan Nettleton (Iowa State University)
Statistics 611
3 / 19
Because we know
E(y) = Xβ ∈ C(X),
a reasonable strategy may be to find the vector in C(X) that is closest
to y in terms of Euclidean distance.
c
Copyright 2012
Dan Nettleton (Iowa State University)
Statistics 611
4 / 19
For example, suppose
" #
y1
=
y2
" #
1
[µ] +
1
" #
ε1
ε2
.
Then
E
" #!
y1
y2
c
Copyright 2012
Dan Nettleton (Iowa State University)
=
" #
1
1
[µ] =
" #
µ
µ
.
Statistics 611
5 / 19
If we observe y =
" #
4
2
and have to guess
E(y) = Xβ =
" #
1
1
[µ] =
" #
µ
µ
,
it may be reasonable to find the value of µ that makes
" #
4
2
" #
µ
µ
as close to
as possible.
c
Copyright 2012
Dan Nettleton (Iowa State University)
Statistics 611
6 / 19
In general, we seek a vector β̂ 3 Xβ̂ is closer to y than any other
vector in C(X).
c
Copyright 2012
Dan Nettleton (Iowa State University)
Statistics 611
7 / 19
If
Q(b) = ky − Xbk2
= (y − Xb)0 (y − Xb),
we seek β̂ 3 Q(β̂) ≤ Q(b) ∀ b ∈ Rp .
c
Copyright 2012
Dan Nettleton (Iowa State University)
Statistics 611
8 / 19
If
Q(β̂) ≤ Q(b) ∀ b ∈ Rp ,
Xβ̂ is called the Least Squares Estimate (LSE) of E(y) = Xβ.
c
Copyright 2012
Dan Nettleton (Iowa State University)
Statistics 611
9 / 19
Suppose PX is the orthogonal projection matrix onto C(X).
Show that PX y is the unique vector in C(X) that is closest to y.
c
Copyright 2012
Dan Nettleton (Iowa State University)
Statistics 611
10 / 19
Because PX y ∈ C(X), we know ∃ at least one vector b 3 PX y = Xb.
We can take β̂ to be any such vector b. Then we have shown that β̂
will minimize Q(b) over b ∈ Rp .
c
Copyright 2012
Dan Nettleton (Iowa State University)
Statistics 611
17 / 19
Now we know that the least square estimator of E(y) = Xβ is
Xβ̂ = PX y.
How do we find PX ?
c
Copyright 2012
Dan Nettleton (Iowa State University)
Statistics 611
18 / 19
We need to find the symmetric and idempotent matrix that projects
onto C(X).
In the next set of notes we will show
PX = X(X0 X)− X0 .
c
Copyright 2012
Dan Nettleton (Iowa State University)
Statistics 611
19 / 19
Download