Multivariate Statistical Analysis Mid Term November 9, 2012 2 pages, 14 problems, 100 points 5 1. Find the eigenvalues and normalized eigenvectors for π = [ 4 2. Verify the spectral decomposition theorem for Problem 1. 3. Find max π±′ππ± π±≠0 π±′π± and min π±′ππ± π±≠0 π±′π± 4 ]. (7%) 5 (7%) , where S is defined in Problem 1. (7%) 4. Under what situations will the generalized variance be zero? (7%) 5. Find the maximum likelihood estimate of the 2 × 1 mean vector π and the 2 × 3 6 4 4 2 covariance matrix πΊ based on the random sample [ ] from a bivariate 5 7 4 7 normal population. (7%) 6. Set up a null hypothesis H 0 : μ ο½ μ 0 and an alternative hypothesis H 1 : μ οΉ μ 0 , with μ '0 ο½ ο1 ο 1ο for bivariate normal distribution N 2 (μ, Σ) . Will the Hotelling’s T 2 test rejects the null hypothesis at the level of significance ο‘ ο½ 0.05 ? Suppose that the sample size is 30, the sample mean x'ο½ ο0.5 0.5ο , and the sample ο© 5 4οΉ covariance matrix S ο½ οͺ οΊ . (7%) ο« 4 5ο» 7. Find the semi-major and semi-minor axes and directions of the Hotelling’s 90% T 2 confidence region for Problem 6. (7%) 8. Determine the 90% simultaneous T 2 confidence intervals for Problem 6. (7%) 9. Determine the 90% Bonferroni simultaneous confidence intervals for Problem 6. Note that if your t corresponding to the desired ο‘ can not be found directly in the table, use the following linear interpolation formula to get an approximate value: t ο½ t1 ο« m(ο‘ ο ο‘1 ), m ο½ (t2 ο t1 ) /(ο‘ 2 ο ο‘1 ) , where t1 and t 2 are the two t values corresponding to ο‘1 and ο‘2, respectively, with ο‘1 and ο‘2 being the two ο‘ values in the table and closest to the desired ο‘. (7%) 10. Explain briefly why the lengths of the one-at-a-time confidence intervals are often shorter than those of the corresponding T 2 simultaneous confidence intervals. (7%) 11. Given the data ο© ο 6οΉ X ο½ οͺ 4 4οΊ οͺ οΊ οͺο« 2 8οΊο» with 1 missing components, use the prediction-estimation algorithm (the EM algorithm) to estimate μ and Σ . Determine the initial estimate and iterate to find the first revised estimates. (7%) 12. Bars of soap are manufactured in each of two ways. The characteristics X1 = lather and X2 = mildness are measured. The summary statistics for 61 and 61 bars produced by methods 1 and 2, respectively, are x1 ο½ ο8 4ο' , x 2 ο½ ο10 3ο' , ο© 5 4οΉ ο©7 2οΉ , . S2 ο½ οͺ S1 ο½ οͺ οΊ οΊ . Test the hypothesis that the population means ο1 = ο« 4 5ο» ο«2 7ο» ο2 at 10% significance level, using the spooled covariance matrix to estimate the population covariance matrix οο and assuming that both populations are with multivariate normal distribution. (7%) 13. Find the 90% Bonferroni simultaneous confidence interval for the individual mean difference in Problem 12. (7%) 14. Explain why paired comparison of univariate means is more accurate than unpaired one by considering their confidence intervals of significance ο‘ for a pair of paired samples. In other words, use individual formulae of the confidence intervals for the same paired samples, and compare their lengths. Assume that the number of pair of samples n is not too small. (9%) 2