252x0572 11/28/05 (Page layout view!) ECO252 QBA2 THIRD HOUR EXAM Dec 1 2005 Name Hour of Class Registered MWF 2, MWF3, TR 12:30, TR2 I. (40 points) Do all the following (2 points each unless noted otherwise). Do not answer question ‘yes’ or ‘no’ without giving reasons. Show your work in questions that are not multiple choice. 1. Turn in your computer problems 2 and 3 marked to show the following: (5 points, 2 point penalty for not doing.) a) In problem 2 – what is tested and what are the results? b) In problem 3 – what coefficients are significant? What is your evidence? c) In the last graph in problem 3, where is the regression line? [5] 2. (Dummeldinger) As part of a study to investigate the effect of helmet design on football injuries, head width measurements were taken for 30 subjects randomly selected from each of 3 groups (High school football players, college football players and college students who do not play football – so that there are a total of 90 observations) with the object of comparing the typical head widths of the three groups. If the researchers assume that the data in each of these three groups comes from a Normally distributed population, they should use the following method. a) The Kruskal-Wallis test. b) One-way ANOVA c) The Friedman test d) Two-Way ANOVA (2) [7] 3. (Sandy) Which of the following is not an assumption required for 1-way ANOVA wth 4 columns.. a) 1 2 3 4 . b) All of the columns are random samples c) All of the population have to be Normally distributed. d) 1 2 3 4 (2) [9] 4. If we are comparing the means of 5 random samples and find the following: x1 10 x 2 12 x 3 11 x 4 13 x 5 14 s1 1.0 s 2 1.2 s 3 1.3 s 4 1.5 s 5 1.5 n1 7 n 2 7 n 3 7 n 4 7 The appropriate test statistic is: x1 x 2 x 3 x 4 x 5 a) D s12 s 22 s 32 s 42 s 52 n1 n 2 n3 n 4 n5 b) c) d) e) n5 7 F with 7 and 4 degrees of freedom F with 4 and 30 degrees of freedom F with 4 and 7 degrees of freedom. 2 with 18 degrees of freedom f) 2 with 34 degrees of freedom (2) [11] 5. If we are doing a 2-way ANOVA and find the following: Two-way ANOVA: C8 versus C9, C10 Two-way ANOVA: C5 versus C6, C7 Source Rows Columns Interaction Error Total S = 1.955 DF 3 2 6 60 71 SS 32.374 7.861 28.999 229.406 298.639 R-Sq = 23.18% MS 10.7914 3.9304 4.8331 3.8234 F 2.82 1.03 1.26 P 0.046 0.364 0.288 R-Sq(adj) = 9.10% The following are significant at the 5% level. (3) a) Differences between Row means only b) Differences between Column means only c) Both differences between Column means and Interaction d) Interaction only d) All are significant at the 5% level e) None are significant at the 5% level f) Not enough information. [14] 6. If we do a 1-way ANOVA and find the following. One-way ANOVA: C1, C2, C3, C4 Source Factor Error Total Level C1 C2 C3 C4 DF 3 68 71 N 18 18 18 18 SS 32.37 266.27 298.64 Mean 11.916 12.436 12.927 13.736 MS 10.79 3.92 F 2.76 P 0.049 Individual 95% CIs For Mean Based on Pooled StDev StDev +---------+---------+---------+--------1.095 (--------*--------) 2.195 (--------*---------) 1.929 (--------*---------) 2.434 (--------*---------) +---------+---------+---------+--------11.0 12.0 13.0 14.0 Give a 1% Tukey confidence interval (or equivalent test) for 1 3 and explain whether this shows a significant difference between these two means. (3) [17] Extra Credit – do the same with a Scheffe interval. (2) Extra Credit – Do the same for an individual confidence interval for the difference and explain why it is more likely to show a significant difference than the other two. (2) 2 7. If we do a 1-way ANOVA and find the following: (Sandy 12.50, 12.51) One-way ANOVA: Source DF SS Factor ? 7.30310 Error ? 101.358 Total 116 108.661 MS F 1.46062 1.60 0.913131 P The degrees of freedom for the F test are (2) [19] a) 4, 100 b) 5, 111 c) 4, 111 d) 5, 115. e) 5, 116 f) 4, 115 8. If we do a 1-way ANOVA and assume that your answer in 7 is correct, pick an appropriate value for [21] F with a 10% significance level from the table and explain your results. (2) 9. If we do a simple regression and find the following: (Sandy 13.1, 13.2) xy 1200 , x 5 , y 10, n 10 , x 2 500 . The predicted value of y when x 4 is: a) b) c) d) e) 5.2 7.2 8.6 9.6 Answer cant’t be obtained with information given. 10. Assume the following data: x y 7 -2 9 -6 5 -1 8 -9 29 -18 Find the following. Show your work! x , xy , R 2 2 (4) [25] (4) [29] 3 11. The coefficient of determination is defined as a) Total (squared) variation in y divided by the explained variation. b) Explained variation in y divided by the total (squared) variation in y c) Unexplained variation in y divided by the total (squared) variation in y . d) Sum of the explained and unexplained variation in y divided by the total variation in [31] y. ————— 11/28/2005 8:40:25 PM ———————————————————— Welcome to Minitab, press F1 for help. MTB > Regress c1 1 c2; SUBC> Constant; SUBC> Brief 3. Regression Analysis: Y versus X The regression equation is Y = 4.53 + 10.2 X Predictor Constant X Coef 4.531 10.198 SE Coef 7.217 1.256 T 0.63 8.12 P 0.548 0.000 Analysis of Variance Source Regression Residual Error Total Obs 1 2 3 4 5 6 7 8 9 10 X 2.00 7.00 5.00 8.00 5.00 8.00 5.00 3.00 7.00 4.00 Y DF 1 8 9 22.00 68.00 68.00 96.00 46.00 80.00 52.00 38.00 78.00 48.00 SS 3993.5 484.9 4478.4 MS 3993.5 60.6 F 65.89 P 0.000 Ŷ 24.93 75.92 55.52 86.11 55.52 86.11 55.52 35.12 75.92 45.32 y 596 , x 54 and x 2 330 12. From the computer output above, find the following: a) R 2 (2) b) s e (2) c) A 90% confidence interval for 1 (2) d) A 90% prediction interval for Y when X 5. (3) [40] 4 5