Statistics 503, Divisions 1 and 2 Midterm Examination 2 Thursday, November 9, 2000 Time: 2 hours, 7:00 pm – 9:00 pm Name (please print): ANSWERS Section . (1 = 10:30, 2 = 11:30) Show all your work and calculations. Partial credit will be given for work that is partially correct. Points will be deducted for false statements shown in the solution to a problem, even if the final answer is correct. Numbers in the right-hand margin indicate the number of points for each question. If extra space is needed, use the back of a page and indicate this clearly on the question page. This exam is closed-book. You may not consult any notes or books during this exam. One formula page, and tables for several distributions and binomial coefficients are provided separately. There is no need to hand them in. Calculators are permitted. Please sign below to indicate your agreement with the following honour code. Honour code: I promise not to cheat on this exam. I will neither give nor receive any unauthorized assistance. I promise not to share information about this exam with anyone who may be taking it at a different time. I have not been told anything about the exam by someone who has already taken it. Signature: . Question 1 Possible 16 2 23 3 15 4 14 5 15 6 17 Total 100 Actual 1 Date: . (16) 1. Short-answer questions (2 pts each). Fill in the blanks to make the following sentences correct, or answer the question in the space provided. It is possible for more than one word to go in a single blank space. a) When the alternative hypothesis is ___false_________ and the null hypothesis is (2) ___rejected__________ then a type I error has been made. b) When the alternative hypothesis is ___true__________ and the null hypothesis is (2) _____not rejected_______ then a type II error has been made. c) Either __decreasing_________ the sample size or __increasing______ the variance will (2) decrease the power of a hypothesis test. d) In a directional two-independent-samples t-test, I would need a sample of size __24__ to achieve (1) 80% power to detect an effect of size 0.95 with = 0.01. e) A t-test is performed with HA : 1 > 2 and a test statistic of ts = –1.24 is observed. Draw a sketch (2) of the area corresponding to the P-value, and bracket the p-value when df = 12. (Area under the t-distribution curve to the right of –1.24. ) Since the P-value for ts = +1.24 would be bracketed between 0.1 and 0.2, the P-value for ts = –1.24 must satisfy 0.8 < P < 0.9. f) If = 0.02 and the P-value is 0.015 I should ___reject______ the null hypothesis. (1) g) The type of ANOVA studied in this class and performed on problem 2 is an appropriate analysis (2) for a _____________completely randomized _______________ design. h) A(n) _experimental _observational OR OR double-blind__ study is usually the best kind to do, but sometimes a(n) (2) blind/open_________ must be done instead, for ethical or practical reasons. i) Blocking on __extraneous_______________ variables is a good way to take them into account. (2) In this type of study design, subjects are first grouped into blocks and then __randomized_______ within each block. j) (Bonus question – if correct, score can replace any of the above. No penalty for getting it wrong.) Give a clear definition of P-value. The P-value is the probabillity, assuming the null hypothesis to be true, of observing a test statistic at least as extreme as what was actually observed. 2 (2) 2. In a study of weight loss methods, efforts were made to change the body weights of 20 moderately (23) overweight student volunteers by interventions involving diet, increased physical activity, or both. Specifically, the 20 students were randomly divided into 4 groups of 5 students each, and the groups each received a different treatment as shown. Their weight losses (before – after) are given in the accompanying table. Control Diet Activity Diet & Only Only Activity ANOVA Table 5 2 8 12 SS df MS -2 8 0 6 3 4 2 15 Between 205.35 3 68.45 2 12 6 8 Within 185.20 16 11.58 0 4 2 10 n 5 5 5 5 Total 390.55 19 mean 1.6 6.0 3.6 10.2 SD 2.70 4.00 3.29 3.49 a) Complete the ANOVA table for these data. Do not (10) do any more work than is necessary. Use only 2 decimal places when writing your answers in the table. y 5 1.6 6.0 3.6 10.2 20 5.35; n * 20; SSB 5 1.6 5.35 5 6.0 5.35 5 3.6 5.35 5 10.2 5.35 205.35; dfb k 1 3 2 2 2 2 2 2 2 2 SSW 4 2.7 4.00 3.29 3.49 185.20; dfw n * k 20 4 16 205.35 185.20 MSB 68.45; MSW 11.58; SST 205.35 185.20 390.55; dft 3 16 19 3 16 b) Perform an appropriate hypothesis with significance level 0.01. Do any of the treatments influence body weight? Let 1, 2, 3, 4 be the mean body weight of moderately overweight students with no treatment, diet only, activity only, and diet and activity, respectively. H0: 1 = 2 = 3 = 4; mean body weight of students is the same under all four treatments HA: students’ mean body weights are not all the same under the four treatments Use a global F-test Fs = MSB/MSW has an F-distribution with df 3,16 under H0 Test at level = 0.01. Critical value is F.01 = 5.29. If Fs > 5.29 then will reject H0. Fs 68.45 5.91 . 5.91 > 5.29 so reject H0. 11.58 This study provides evidence at the 0.01 significance level that mean body weights of moderately overweight students are not all the same under the four treatments. 3 (13) (15) 3. A certain breed of rat is known to show a mean weight gain of 65g during the first 3 months of life. Sixteen rats of this breed are fed a new diet from birth until age 3 months. These 16 have a mean weight gain of 60.75g with standard deviation 3.84g. Assuming that weight gain follows a normal distribution, test whether the new diet has any effect on 3-month weight gain in this breed of rat. Does the new diet have an effect on 3-month weight gain in this breed of rat? Let be the mean 3-month weight of this breed of rat on the new diet. [Let 0 = 65g be the mean 3-month weight of this breed of rat on a standard diet.] H0: = 65; the mean 3-month weight of this breed of rat is unchanged on the new diet. HA: 65; the mean 3-month weight of this breed of rat is different on the new diet. Use a one-sample, non-directional t-test. ts y 65 has a t-distribution with 16 – 1 = 15 df under H0. SE Test at level = 0.05. Critical value is t.025 = 2.131. If ts > 2.131 or ts < –2.131 then will reject H0. SE 3.84 16 0.96; ts 60.75 65 0.96 4.43 Since –4.43 < –2.131 we reject H0. This study provides evidence at the 0.05 significance level that the mean 3-month weight of this breed of rat is different on the new diet. 4 (14) 4. An experiment was performed to consider the effect of body mass on toxin metabolism. Twenty-three yearling sheep were fed pingue (a toxin-producing weed). Of these, 13 died and the remaining 10 survived. The weights of the sheep (in kg) just before they were fed pingue were as shown. n mean SD K Died 21 25 27 27 28 29 29 29 30 30 32 34 34 13 28.8 3.53 62 Survived 20 26 26 28 29 30 30 33 34 34 10 29.0 4.32 68 a) Briefly justify your choice of which type of hypothesis test to perform. (3) We are not told to assume a normal distribution, and the sample size is small. Furthermore, the data do not look normal, since there are many observations in the right tail (near 34) , while the left tail is very spread out; thus the distribution seems skewed. Therefore I will use a Wilcoxon-Mann-Whitney (WMW) test. b) Perform an appropriate hypothesis test at level 0.05. Does body mass have an effect on toxin metabolism? H0: the body mass distribution of sheep who survive is the same as that of those who die after pingue exposure. H1: the body mass distribution of sheep who survive is different from that of those who die after pingue exposure. Use a WMW test (non-directional). Us = max(K1, K2) follows the Mann-Whitney null distribution with n = 13, n’ = 10under H0. Test at level 0.05. Critical value is U.05 = 97. If Us 97 then will reject H0. Check: K1 + K2 =62 + 68 = 130 = 13x10 = n1 n2 Us = max(62, 68) = 68 68 < 97 so do not reject H0. This study does not provide evidence at the 0.05 significance level that the body mass distribution of sheep who survive is different from that of those who die after pingue exposure. 5 (11) (15) 5. Twelve participants were chosen at random to participate in a blood pressure study. Of interest is the idea of postural hypotension, namely, that standing up tends to lower blood pressure. The systolic blood pressure of each participant was measured both while standing and while supine (lying down). The results are shown in the accompanying table. Assuming that systolic blood pressure follows a normal distribution, use these data to test whether blood pressure is decreased while standing. Subject Standing Supine Difference 1 132 136 -4 2 146 145 1 3 135 140 -5 4 141 147 -6 5 139 142 -3 6 162 160 2 7 128 137 -9 8 137 136 1 9 145 149 -4 10 151 158 -7 11 131 130 1 12 143 150 -7 mean 140.83 144.17 -3.33 s 9.49 9.12 3.75 Is blood pressure decreased while standing? Let 1 be the mean systolic blood pressure of people while standing. Let 2 be the mean systolic blood pressure of people while supine. H0: 1 = 2 ; mean systolic blood pressure is the same whether standing or supine. HA: 1 < 2 ; mean systolic blood pressure is lower while standing than while supine Use a paired-sample, directional t-test. ts y1 y 2 SEdiff has a t-distribution with 12 – 1 = 11 df under H0. Test at level = 0.05; critical value is t.05 = 1.796. If ts < –1.796 then will reject H0. SEdiff 3.75 12 1.08; ts 3.33 3.08 1.08 Since –3.08 < –1.796 we reject H0. This study provides evidence at the 0.05 significance level that mean systolic blood pressure is lower while standing than while supine. 6 (17) 6. Ten students were given a mathematics aptitude test in a quiet room. The same students were given a similar test in a room with background music. Their performances on the tests were as shown. Do not make any assumptions about the distribution of the test scores. a) Test an appropriate hypothesis at the 0.10 significance level. Student Without With Music Music 1 2 3 4 5 6 7 8 9 (14) mean 114 121 141 102 99 114 127 150 129 112 122 136 107 96 109 121 146 127 122.7 120.4 Does background music affect math test scores? Let be the probability that a student will score higher without music. H0: = 0.5; a student is equally likely to score higher with music or without music. HA: 0.5; a student is more likely to score higher in one of the two situations. Use a non-directional sign test. Bs = max(N+, N–) has a folded binomial (n = 9, = 0.5) distribution under H0. Test at level = 0.01; critical value is 8. If Bs 8 then will reject H0. N+ = 7, N– = 2, so Bs = 7 Since 7 < 8 we do not reject H0. This study does not provide evidence at the 0.10 significance level that a student is more likely to score higher in one of the two situations. (3) b) Calculate the P-value for this test. With Y binomial(n = 9, = 0.5), P-value 2 Pr Y 7 2 Pr Y 7 Pr Y 8 Pr Y 9 9 9 9 7 2 8 1 9 0 2 0.5 0.5 0.5 0.5 0.5 0.5 8 9 7 2 36 9 1 0.5 0.1797 9 7