STA 6207 – Exam 1 – Fall 2014 PRINT Name _____________________ Conduct all tests at the = 0.05 Significance level. All Models on test are of the forms: Scalar: Y 0 1 X 1 ... p X p ~ N 0, 2 independent Matrix: Y Xβ ε ε ~ N 0, 2I Critical Values for t, 2, and F Distributions F Distributions Indexed by Numerator Degrees of Freedom CDF - Lower tail probabilities df | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | t.95 6.314 2.920 2.353 2.132 2.015 1.943 1.895 1.860 1.833 1.812 1.796 1.782 1.771 1.761 1.753 1.746 1.740 1.734 1.729 1.725 1.721 1.717 1.714 1.711 1.708 1.706 1.703 1.701 1.699 1.697 1.684 1.676 1.671 1.667 1.664 1.662 1.660 1.659 1.658 1.657 1.656 1.655 1.654 1.654 1.653 1.653 1.653 1.645 t.975 .295 F.95,1 F.95,2 F.95,3 F.95,4 F.95,5 F.95,6 F.95,7 F.95,8 12.706 4.303 3.182 2.776 2.571 2.447 2.365 2.306 2.262 2.228 2.201 2.179 2.160 2.145 2.131 2.120 2.110 2.101 2.093 2.086 2.080 2.074 2.069 2.064 2.060 2.056 2.052 2.048 2.045 2.042 2.021 2.009 2.000 1.994 1.990 1.987 1.984 1.982 1.980 1.978 1.977 1.976 1.975 1.974 1.973 1.973 1.972 1.960 3.841 5.991 7.815 9.488 11.070 12.592 14.067 15.507 16.919 18.307 19.675 21.026 22.362 23.685 24.996 26.296 27.587 28.869 30.144 31.410 32.671 33.924 35.172 36.415 37.652 38.885 40.113 41.337 42.557 43.773 55.758 67.505 79.082 90.531 101.879 113.145 124.342 135.480 146.567 157.610 168.613 179.581 190.516 201.423 212.304 223.160 233.994 --- 161.448 18.513 10.128 7.709 6.608 5.987 5.591 5.318 5.117 4.965 4.844 4.747 4.667 4.600 4.543 4.494 4.451 4.414 4.381 4.351 4.325 4.301 4.279 4.260 4.242 4.225 4.210 4.196 4.183 4.171 4.085 4.034 4.001 3.978 3.960 3.947 3.936 3.927 3.920 3.914 3.909 3.904 3.900 3.897 3.894 3.891 3.888 3.841 199.500 19.000 9.552 6.944 5.786 5.143 4.737 4.459 4.256 4.103 3.982 3.885 3.806 3.739 3.682 3.634 3.592 3.555 3.522 3.493 3.467 3.443 3.422 3.403 3.385 3.369 3.354 3.340 3.328 3.316 3.232 3.183 3.150 3.128 3.111 3.098 3.087 3.079 3.072 3.066 3.061 3.056 3.053 3.049 3.046 3.043 3.041 2.995 215.707 19.164 9.277 6.591 5.409 4.757 4.347 4.066 3.863 3.708 3.587 3.490 3.411 3.344 3.287 3.239 3.197 3.160 3.127 3.098 3.072 3.049 3.028 3.009 2.991 2.975 2.960 2.947 2.934 2.922 2.839 2.790 2.758 2.736 2.719 2.706 2.696 2.687 2.680 2.674 2.669 2.665 2.661 2.658 2.655 2.652 2.650 2.605 224.583 19.247 9.117 6.388 5.192 4.534 4.120 3.838 3.633 3.478 3.357 3.259 3.179 3.112 3.056 3.007 2.965 2.928 2.895 2.866 2.840 2.817 2.796 2.776 2.759 2.743 2.728 2.714 2.701 2.690 2.606 2.557 2.525 2.503 2.486 2.473 2.463 2.454 2.447 2.441 2.436 2.432 2.428 2.425 2.422 2.419 2.417 2.372 230.162 19.296 9.013 6.256 5.050 4.387 3.972 3.687 3.482 3.326 3.204 3.106 3.025 2.958 2.901 2.852 2.810 2.773 2.740 2.711 2.685 2.661 2.640 2.621 2.603 2.587 2.572 2.558 2.545 2.534 2.449 2.400 2.368 2.346 2.329 2.316 2.305 2.297 2.290 2.284 2.279 2.274 2.271 2.267 2.264 2.262 2.259 2.214 233.986 19.330 8.941 6.163 4.950 4.284 3.866 3.581 3.374 3.217 3.095 2.996 2.915 2.848 2.790 2.741 2.699 2.661 2.628 2.599 2.573 2.549 2.528 2.508 2.490 2.474 2.459 2.445 2.432 2.421 2.336 2.286 2.254 2.231 2.214 2.201 2.191 2.182 2.175 2.169 2.164 2.160 2.156 2.152 2.149 2.147 2.144 2.099 236.768 19.353 8.887 6.094 4.876 4.207 3.787 3.500 3.293 3.135 3.012 2.913 2.832 2.764 2.707 2.657 2.614 2.577 2.544 2.514 2.488 2.464 2.442 2.423 2.405 2.388 2.373 2.359 2.346 2.334 2.249 2.199 2.167 2.143 2.126 2.113 2.103 2.094 2.087 2.081 2.076 2.071 2.067 2.064 2.061 2.058 2.056 2.010 238.883 19.371 8.845 6.041 4.818 4.147 3.726 3.438 3.230 3.072 2.948 2.849 2.767 2.699 2.641 2.591 2.548 2.510 2.477 2.447 2.420 2.397 2.375 2.355 2.337 2.321 2.305 2.291 2.278 2.266 2.180 2.130 2.097 2.074 2.056 2.043 2.032 2.024 2.016 2.010 2.005 2.001 1.997 1.993 1.990 1.987 1.985 1.938 | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Q.1. Regression models were fit, relating height (Y, in mm) to hand length (X1, in mm), foot length (X2, in mm) and gender (X3=1 if male, 0 if female) based on a sample of 80 males and 75 females. Consider these 4 models: Model 1: E Y 0 1 X 1 2 X 2 3 X 3 13 X 1 X 3 23 X 2 X 3 Model 2: E Y 0 1 X 1 2 X 2 3 X 3 Model 3 (Males Only): E Y 0 1 X 1 2 X 2 Model 4 (Females Only): E Y 0 1 X 1 2 X 2 ANOVA Regression Residual Total Intercept Hand Foot Male MaleHand MaleFoot Model1 df 5 149 154 ANOVA SS 1201091 157138 1358229 Coefficients Standard Error 744.14 83.68 2.38 0.51 1.73 0.39 -304.72 125.47 0.91 0.68 0.65 0.52 Regression Residual Total Intercept Hand Foot Male Model2 df 3 151 154 ANOVA SS 1193101 165128 1358229 CoefficientsStandard Error 582.16 60.55 2.81 0.34 2.06 0.26 39.61 8.50 Regression Residual Total Intercept Hand Foot Model3 df ANOVA 2 77 79 SS 208298 88305 296603 Coefficients Standard Error 439.42 97.49 3.29 0.47 2.38 0.35 Regression Residual Total Intercept Hand Foot Model4 df 2 72 74 SS 110552 68833 179385 Coefficients Standard Error 744.14 79.67 2.38 0.49 1.73 0.37 p.1.a. Confirm the equivalence of the regression coefficients (but not standard errors) based on the appropriate models (Hint: set up the fitted equations based on the two models): Females: Males: p.1.b. Test H0: = 0 (No interactions between Hand and Gender or Foot and Gender). Test Statistic: __________________ Rejection Region: ______________________ p-value > or < 0.05? p.1.c. Use Hartley’s Test to test whether the error variances among the individual regressions are equal: t 1 B ln MSE i ln si2 C i 1 1 t 1 1 C 1 i 3 t 1 i 1 Test Statistic B = _____________________ Rejection Region: ________________ p-value > or < 0.05? p.1.d. What fraction of the total variation in height is explained by the set of predictors: hand length, foot length, and gender (but no interactions)? p.1.e. Compute the standard deviations among the 80 Male heights and among the 75 Female heights (ignoring hand and foot length). Males: SD = ___________________________ Females: SD = _________________________________ Q.2. A study related Personal Best Shot Put distance (Y, in meters) to best preseason power clean lift (X, in kilograms). The following models were fit, based on a sample of n = 24 male collegiate shot putters: ^ Model 1: E Y 0 1 X SSE1 43.41 R12 .686 Y X 4.4353 0.0898 X Model 2: E Y 0 1 X 2 X 2 SSE2 37.41 R22 .729 Y X , X 2 12.08 0.3285 X 0.00084 X 2 ^ p.2.a. Use Model 2 to test H0: (Y is not related to X) Test Statistic____________________ Rejection Region: ____________________ Reject H0? Yes or No Reject H0? Yes or No p.2.b. Use Models 1 and 2 to test H0: (Y is linearly related to X) Test Statistic: ___________________ Rejection Region: ____________________ p.2.c. Give an estimate of the level of X is that maximizes E{Y}. X* = ___________________________ Q.3. A study was conducted to determine whether having been exposed to an advertisement claiming a natural ingredient is contained in a perfume had an effect on subjects’ rating of the perfume’s scent. There were 112 subjects of which, 56 were exposed to the ad, and 56 were not. We fit the following regression model: Yi 0 1 X i i i 1,...,112 X'X 1 if Subject i was exposed to the ad Xi 0 if Subject i was not exposed to the ad X'Y 112 56 56 56 587 337 Y'Y 3683.05 p.3.a. First, we fit a model with only an intercept term, what will P0 X 0 X 0 ' X 0 X 0 ' be (symbolically, do 1 not write out a 112x112 matrix!)? Compute R(0). P0 = ______________________________ R(0) = ______________________________ p.3.b. Compute X'X 1 ^ and β NOTE: Write X'X 1 as 1 A for the appropriate A X'X p.3.c. Compute R(0 , 1) , R(0), and MSResidual R(0 , 1) = ________________ R(0) = _________________ MSResidual = __________________ p.3.d. Use the t-test and the F-test to test H0: 0 vs HA≠ t-Statistic: ____________________ Rejection Region: ___________________________ F-Statistic: ____________________ Rejection Region: ___________________________ Q.4. An experiment was conducted to measure the subsoil pressure of a steel ground roller. There were 3 replicates at each of 4 depths (X=5, 10, 15, 20 cm). The response was measured force (100s of Newtons). ^ The fitted regression equation is Y 49.371 2.036 X We want to test H0: E{Yj} = 0 + 1Xj j X_j 1 2 3 4 Ybar_j SD_j 5 40.38 10 28.87 15 16.23 20 11.15 HA: E{Yj} = j ≠ 0 + 1Xj Yhat_j Pure Error Lack of Fit 4.32 6.83 3.76 4.48 p.4.a. Compute the Pure Error Sum of Squares and Degrees of Freedom. Hint: What is SD_j equal to? SSPE = __________________________________ dfPE = _________________________________ p.4.b. Compute the Lack-of-Fit Sum of Squares and Degrees of Freedom. SSLF = __________________________________ dfLF = _________________________________ p.4.c. Conduct the F-test for Lack-of-Fit Test Statistic: ________________________ Rejection Region: ______________ Reject H0? Yes / No Q.5. A series of models were fit, relating Average January High Temperature (Y, in degrees F) to Elevation (X1, in 100s ft above sea level), and Latitude (degrees North Lat) for n = 369 weather stations in Texas. Latitude and Elevation were centered in the regression models. CP SS Re s Model 2 p 'n MSRes Complete AIC n ln SS Re s Model 2 p 'n ln( n) SBC BIC n ln SS Re s Model ln( n) p 'n ln( n) Variables in Model ELEV ( E ) LAT ( L ) E,L E,L,E*L E,L,E^2,L^2 E,L,E*L,E^2,L^2 SS(RES) C_p AIC SBC 7986.3 4764.2 1138.6 1146.4 1168.0 429.2 437.0 616.2 32.8 195.2 207.0 603.9 26.9 205.4 575.0 10.3 173.7 565.2 6 169.3 192.8 p.5.a. Complete the table. p.5.b. Based on each criteria, which model do you choose? Cp: _____________________________ AIC: ____________________ SBC: ______________________