Document 10714802

advertisement
STAT 501
Spring 2005
Assignment 1
Name ________________
Reading Assignment:
Johnson and Wichern, Chapter 1, Sections 2.5 and 2.6, Chapter 4, and Chapter 3. Review
matrix operations in Chapter 2 and Supplement 2A.
Written Assignment: Due Monday, January 24, in class.
1.
Consider a random vector X = (X1 X 2 X 3 )′ with mean vector µ = (3, 2, 1)′ and
covariance matrix Σ. The eigenvalues of Σ are λ1 = 12, λ2 = 6, λ3 = 2 and the
corresponding eigenvectors are
⎡ 1/ 3 ⎤
⎢
⎥
e 1 = ⎢ 1/ 3 ⎥
~
⎢
⎥
⎢⎣ 1/ 3 ⎥⎦
⎡ 2/ 6
⎢
e 2 = ⎢ − 1/ 6
~
⎢
⎢⎣ − 1/ 6
⎤
⎥
⎥
⎥
⎥⎦
⎡ 0
⎤
⎢
⎥
e 3 = ⎢ 1/ 2 ⎥
~
⎢ − 1/ 2 ⎥
⎣
⎦
Evaluate the following quantities. Parts (a), (b), (c), (d) and (g) should be done without
using a computer.
(a)
⎮Σ⎮ = _____
(d)
⎡
⎢
∑= ⎢
⎢
⎢
⎣
(f)
⎡
⎢
−1/ 2
∑
= ⎢
⎢
⎢
⎣
(b) trace(Σ) = _____
⎤
⎥
⎥
⎥
⎥
⎦
(e)
⎤
⎥
⎥
⎥
⎥
⎦
(c) V(e′ 1 X ) = _____
~ ~
⎡
⎢
∑ −1 = ⎢
⎢
⎢
⎣
⎤
⎥
⎥
⎥
⎥
⎦
2
(g)
Define Y1 = e1′ X , Y2 = e′2 X , and Y3 = e′3X. Find the mean vector and
covariance matrix for Y = (Y1 Y2 Y3 )′ = Γ′ X , where the i-th column of Γ is the i-th
eigenvector for Σ. Evaluate
⎡
⎢
E( Y) = ⎢
⎢
⎢
⎣
⎤
⎥
⎥
⎥
⎥
⎦
⎡
⎢
V(Y) = Γ′ ∑ Γ = ⎢
⎢
⎢
⎣
⎤
⎥
⎥
⎥
⎥
⎦
(h) The linear transformation, Y = Γ′ X , examined in part (g) is often called a rotation
because it corresponds to simply rotating the coordinate axes. Use the definition of
eigenvectors to show that lengths of vectors are not changed by this transformation,
i.e., show Y′ Y = X′ X when Y = Γ′ X .
(i)
Let Γ be a matrix for which the i-th column of Γ is the i-th eigenvector for Σ, and
let Y = Γ′ X , µ X = E(X) , Σ X = V(X) , µ Y = Γ′ µ X , and ∑ Y = V(Y) = Γ′ ∑X Γ .
Determine which of the following measures of variability or distance are unaffected
by rotations. Justify your answers by either giving a counter example or a proof using
the properties of eigenvalues, eigenvectors, and matrix operations given in chapter 2
of Johnson and Wichern,
(a) Is |ΣX| = |ΣY| ?
(b) Is trace (ΣX) = trace (ΣY) ?
(c) Is ΣX = ΣY ?
−1
(d) Is (X − µ X )′ ∑X
(X − µ X ) = (Y − µ Y )′ ∑ −Y1(Y − µ Y ) ?
(e) Is (X − µ X )′(X − µ X ) = (Y − µ Y )′(Y − µ Y ) ?
2.
Consider a bivariate normal population with mean vector µ
where
−3
⎡ 2 ⎤
⎡ 5
and
µ = ⎢
∑ = ⎢
⎥
9
⎣ 1 ⎦
⎣ −3
(a)
Write down a formula for the joint density function.
and covariance matrix Σ,
⎤
⎥
⎦
3
(b)
Find the eigenvalues and eigenvectors for Σ.
⎛
e1 = ⎜⎜
⎜
⎝
λ1 = ______ λ2 = _____
(c) Find values for ⎮Σ⎮ = _________
⎞
⎟
⎟
⎟
⎠
⎛
e2 = ⎜⎜
⎜
⎝
⎞
⎟
⎟
⎟
⎠
trace (Σ) = _________
(d)
Write down the formula for the boundary of the smallest region such that there is
probability 0.5 that a randomly selected observation will be inside the boundary.
(e)
Sketch the boundary of the region you identified in part (d).
X2
4
3
2
1
X1
-4
-3
-2
-1
1
2
3
4
5
6
-1
-2
3.
(f)
Determine the area of the region described in parts (d) and (e). ______________
(g)
Determine the area of the smallest region such that there is probability
0.95 that a randomly selected observation will be in that region. ______________
The joint density function for X1 and X 2 is
f (x1, x 2 ) = 3π −1 exp{−4.5x12 + 3x1x 2 − 2.5x 22 − 3x1 + 17x 2 − 32.5}
Find the mean vector and the covariance matrix for this bivariate distribution.
4
4.
Let X be a normally distributed random vector with
~
⎛ −3 ⎞
µ = ⎜⎜ 1 ⎟⎟
⎜ −2 ⎟
⎝
⎠
(a)
and
⎛ 4
∑ = ⎜⎜ 0
⎜ −1
⎝
Which of the following are pairs of independent random variables?
(i)
X1 and X2
(iv) (X1 , X3 ) and X2
(ii) X1 and X3
(v) X1 + X3 and X1 - 2X2
(iii) X1 and X1 + 3X2 – 2X3
(vi) X1 + X2+ X3 and 4X1 - 2X2 + 3X3
(b)
What is the distribution of Y = (X1 , X 2 )′ ?
(c)
What is the conditional distribution of Y = (X1 , X 2 )′ given X 3 = x 3 ?
(d)
Find the correlation between X1 and X2 and a formula for the partial correlation
between X1 and X2 given X3 = x3.
ρ12 = ____________
(e)
5.
0 −1 ⎞
5 0 ⎟⎟
0 2 ⎟⎠
ρ12•3 = _______________
What is the conditional distribution of X2 given X1 = x1 and X3 = x3 ?
⎛
⎞
Suppose X ~ N 4 ⎜ µ , ∑ ⎟ where
~
⎝~ ~⎠
⎛ −4 ⎞
⎜ 3 ⎟
⎟
µ = ⎜
⎜
⎟
0
~
⎜
⎟
⎝ 1 ⎠
⎛
⎜
∑ = ⎜
⎜
⎜
⎝
4
0
2
2
0 2 2
1 1 0
1 9 −2
0 −2 4
⎞
⎟
⎟
⎟
⎟
⎠
(a)
What is the distribution of Z1 = 3X1 − 5X2 + X4 ?
(b)
What is the joint distribution of Z1 in part (a) and Z2 = 2X1 − X3 + 3X4 .
(c)
⎛X ⎞
Find the conditional distribution of ⎜ 1 ⎟ given X3 = 1 .
⎝ X4 ⎠
(d)
Find the partial correlation between X1 and X4 given X2 = x2 and X3 = x3 .
5
6.
⎛
⎛
⎞
⎞
Let X be N ⎜⎜ µ1 , ∑1 ⎟⎟ and Y be N ⎜⎜ µ 2 , ∑ 2 ⎟⎟ where X and Y are independent
~
~
~
~
⎝ ~
⎝ ~
⎠
⎠
and
⎛ 8 −2 ⎞
∑1 = ⎜⎜
⎟⎟
⎝− 2 4 ⎠
⎛ 2⎞
µ1 = ⎜⎜ ⎟⎟
~
⎝ 6⎠
⎛ 3 ⎞
⎛ 4
∑ 2 = ⎜⎜
⎝ 2
µ 2 = ⎜⎜ ⎟⎟
~
⎝ 4 ⎠
2⎞
⎟
4 ⎟⎠
(a) Evaluate Cov ⎛⎜ X − Y , X + Y ⎞⎟
~
~
~⎠
⎝~
(b)
Are X − Y and X + Y independent random vectors? Explain.
~
~
~
~
(c) Show that the joint distribution for the four dimensional random vector
⎡ X − Y ⎤
~ ⎥
⎢ ~
is a multivariate normal distribution.
⎢ X + Y ⎥
~ ⎦
⎣ ~
7.
Let d ( p , q ) denote a measure of distance between p and q . Johnson and Wichern
~
~ ~
~
indicate that any distance measure should possess the following four properties:
d ( p , q ) = d ( q , p)
(i)
~ ~
~ ~
d (p , q) > 0
(ii)
if p ≠ q
~
~ ~
d (p , q) = 0
(iii)
~
if p = q
~
~ ~
~
d ( p , q ) ≤ d ( p , r ) + d ( r , q ) , where r = (r1 , r2 )' .
(iv)
~ ~
~ ~
~ ~
~
These properties are satisfied, for example, by Euclidean distance, i.e., .
d ( p , q ) = [ ( p − q ) ' ( p − q )]1 / 2
~ ~
~
~
~
~
(a) In this class we will often consider distance measures of the form
d ( p , q ) = [ ( p − q ) ' A ( p − q )]1 / 2 . Sometimes A will be the inverse of a covariance
~ ~
~
~
~
~
matrix which implies that A is symmetric and positive definite. Show that properties
(i) through (iv) are satisfied when A is symmetric and positive definite.
6
(b) Does A have to be both symmetric and positive definite for
d ( p , q ) = [ ( p − q ) ' A ( p − q )]1 / 2 to satisfy properties (i) through (iv)?
~ ~
~
~
~
~
Present your proofs, counter examples or explanations.
(c) For k-dimensional vectors it is obvious that the measure
d ( p , q ) = max{| p1 − q1 |, | p2 − q 2 |, ..., | p k − q k | }
~ ~
satisfies properties (i), (ii), (iii). Either prove or disprove that it satisfies
property (iv), the triangle inequality.
8.
(a) By multiplication of partitioned matrices, verify for yourself that
⎡ Iq×q B ⎤
⎢
⎥
⎢⎣ 0r ×q I r ×r ⎥⎦
−1
⎡ Iq×q
=⎢
⎢⎣ 0r ×q
−B ⎤
⎡ I r ×r 0r ×q ⎤
⎥ and ⎢
⎥
I r ×r ⎥⎦
⎢⎣ B Iq×q ⎥⎦
−1
⎡ I r ×r 0r ×q
=⎢
⎢⎣ − B Iq×q
⎤
⎥
⎥⎦
for any q × r matrix B, where 0a ×b denotes a matrix of zeros and Ia ×a denotes
an identity matrix.
(b) Show that
Iq×q
B
0r ×q I r ×r
=
I r ×r 0r ×q
= 1 for any q × r matrix B.
B Iq×q
⎡A A ⎤
(c) By multiplication of partitioned matrices verify that if A = ⎢ 11 12 ⎥ is a square
⎣ A 21 A 22 ⎦
matrix for which A11 is q × q matrix and A 22 is an r × r matrix and the inverse of A 22
exists, then
−1
−1 ⎤
⎡ A11 − A12 A 22
0q×r
A 21 0q×r ⎤ ⎡ Iq×q − A12 A 22
⎡ A A ⎤ ⎡ Iq×q
⎢
⎥=⎢
⎥ ⎢ 11 12 ⎥ ⎢
−1
0r ×q
A 22 ⎥⎦ ⎢⎣ 0r ×q
I r ×r ⎥⎦ ⎣ A 21 A 22 ⎦ ⎢⎣ − A 22
⎢⎣
A 21 I r ×r
(d) Use the results from parts (a) through (c) to show that
−1
A = A 22 A11 − A12 A 22
A 21 for any square matrix A with A 22 ≠ 0 .
⎤
⎥
⎥⎦
7
(e) Use the results from parts (a) through (d) to show that
⎡ Iq×q
A −1 = ⎢
−1
A 21
⎢⎣ − A 22
0q × r ⎤
⎥
I r ×r ⎥⎦
−1
⎢( A11 − A12 A 22 A 21 )
−1 ⎡
⎢
⎢⎣
0r ×q
−1
⎤
−1 ⎤ −1
0q×r ⎥ ⎡ Iq×q − A12 A 22
⎢
⎥
⎥
I
⎥
−1 ⎢ 0r ×q
r ×r ⎦
A 22 ⎥⎦ ⎣
for any symmetric matrix A with A 22 ≠ 0 .
9. Consider a multivariate normal distribution with positive definite covariance matrix
⎡ ∑11 ∑12 ⎤
−1
∑=⎢
⎥ . Note that from part (d) in problem 8, ∑ = ∑22 ∑11 − ∑12 ∑22 ∑ 21 ,
∑
∑
⎣ 21 22 ⎦
the product of determinants of covariance matrices for a marginal and a conditional
normal distribution. Use the result from part (e) in problem 8 to expand
⎛ x1 − µ1 ⎞′ ⎡ ∑11 ∑12 ⎤ −1 ⎛ x1 − µ1 ⎞
1
−
(x − µ)′ ∑ (x − µ) = ⎜ ⎟
⎟
⎜
⎜⎝ x 2 − µ2 ⎟⎠ ⎢⎣ ∑21 ∑22 ⎥⎦ ⎜⎝ x 2 − µ 2 ⎟⎠
in an appropriate way to show that the multivariate normal density function is a product of
the density functions for the marginal distribution of X 2 and a conditional distribution.
What happens when ∑12 = 0 , i.e. when X1 and X 2 are uncorrelated?
For additional practice you could do problems 3.6, 3.8, 3.9, 3.10, 3.14, 3.16 at the end of
Chapter 3 and problems 4.1, 4.3, 4.4, 4.5, 4.14, 4.15, 4.16 , 4.17 at the end of Chapter 4.
Problem 4.8 gives a trivial example of a non-normal bivariate distribution with normal marginal
distributions. We will consider a more substantial example on the next assignment. Do not hand
in these additional problems; answers will be given on the solution sheet for this assignment.
Download