252y0552 10/31/05 (Open in ‘Print Layout’ format) ECO252 QBA2 FIRST HOUR EXAM October 17-18 2005 Name _ Key______ Hour of class registered _____ Class attended if different ____ Show your work! Make Diagrams! Exam is normed on 50 points. Answers without reasons are not usually acceptable. I. (8 points) Do all the following. If you do not use the standard Normal table, explain! x ~ N 4.5, 6 Make diagrams. For x draw a Normal curve with a vertical line in the center at 4.5. For z draw a Normal curve with a vertical line in the center at zero. 3 4.5 17 .5 4.5 z 1. P17.5 x 3 P P3.67 z 0.25 6 6 P3.67 z 0 P0.25 z 0 .4999 .0987 .4012 1 252y0552 10/31/05 (Open in ‘Print Layout’ format) 7 4. 5 4.5 4.5 z 2. P4.5 x 7.00 P P0 z 0.42 .1628 6 6 10 .22 4.5 3. Px 10 .22 P z Pz 0.95 Pz 0 P0 z 0.95 .5 .3289 .1711 6 2 252y0552 10/31/05 (Open in ‘Print Layout’ format) 4. x.06 There is no excuse for not being able to do this at this point, since you should have done it 3 times in the take-home. To find z .06 make a Normal diagram for z showing a mean at 0 and 50% above 0, divided into 6% above z .06 and 44% below z .04 . So P0 z z.06 .4400 The closest we can come is P0 z 1.55 .4394 or P0 z 1.56 .4406 . So use z .06 1.555 . x.06 z.06 4.5 1.555 6 13.83 3 252y0552 10/31/05 (Open in ‘Print Layout’ format) II. (5 points-2 point penalty for not trying part a.) A dealer wishes to test the manufacture’s claim that the Toyota Caramba gets 28 miles per gallon. The data below is found. (Recomputing what I’ve done for you is a great way to waste time.) a. Compute the sample standard deviation, s , of the miles per gallon. Show your work! (2) b. Is the population mean significantly different (using a 95% confidence level) from 28 mpg? Show your work! (3) c. (Extra Credit) Find an approximate p value for your null hypothesis. (2) Solution: a. Compute the sample standard x Row x2 deviation, s , of the miles per gallon. Show your 1 23.56 555.074 work! (2) 2 26.09 680.688 3 27.55 759.003 x 263 .98 x 26 .398 4 26.92 724.686 n 10 5 29.20 852.640 6 29.02 842.160 x 2 nx 2 7010 .07 10 26 .398 2 2 s 7 22.48 505.350 n 1 9 8 27.45 753.503 41 .52596 9 25.90 670.810 4.61400 10 25.81 666.156 9 sum 263.98 7010.070 s 4.61400 2.148021. Note that excessive rounding can throw this answer way off. Using x 26 , I got 7010 10 (26 ) 2 7010 6760 250 27 .778 and s 5.3541 . 9 9 9 b. Is the population mean significantly different (using a 95% confidence level) from 28 mpg? Show your work! (3) 9 Given: x 26 .398 s 2.14802 , n 7 and .05 . DF n 1 9 and t .025 2.262 . s2 We are testing H 0 : 28 . Use only one of the following! 4.61400 0.461400 0.6793 . 10 n 10 x t 2 s x 26.398 2.262 0.6793 26.398 1.5366 or 21.861 to 27.935. Confidence Interval: s x s 2.14802 Since 28 is not on the confidence interval, it is significantly different from the sample mean. 4.61400 0.461400 0.6793 . 10 n 10 t 2 s x 28 2.262 0.6793 28 1.5366 or 26.46 to 29.53. Critical Value: s x xcv s 2.14802 Since 26.398 is not between the critical values, it is significantly different from the population mean. Test Ratio: s x s 2.14802 4.61400 0.461400 0.6793 . 10 n 10 x 0 26 .398 28 t 2.358 . The ‘accept’ zone is between -2.262 and 2.262. Since -2.197 sx 0.6793 is not between these values, reject the null hypothesis. c. (Extra Credit) Find an approximate p value for your null hypothesis. (2) 9 9 2.262 and t .01 2.821 . Since pval 2Pt 2.358 , it is between .02 2.358 is between t .025 and .05. 4 252y0552 10/31/05 (Open in ‘Print Layout’ format) III. Do all of the following problems (2 each unless marked otherwise adding to 18+ points). Show your work except in multiple choice questions. (Actually – it doesn’t hurt there either.) If the answer is ‘None of the above,’ put in the correct answer. [10] 1. For sample sizes greater than 100, the sampling distribution of the mean will be approximately normally distributed a) *Regardless of the shape of the population. b) Only if the shape of the population is symmetrical. c) Only if the standard deviation of the samples are known. d) Only if the population is normally distributed. 2. When determining the sample size for a proportion for a given level of confidence and sampling error, the closer to 0.50 that p is estimated to be the __________ the sample size required. a) Smaller b) *Larger c) Sample size is not affected. d) The effect cannot be determined from the information given. 3. Which of the following would be an appropriate null hypothesis? a) The population proportion is less than 0.65. b) The sample proportion is less than 0.65. c) *The population proportion is at most 0.65. d) The sample proportion is at most 0.65. 4. A Type I error is committed when a) *We reject a null hypothesis that is true. b) We don't reject a null hypothesis that is true. Note that b) and c) are not errors. c) We reject a null hypothesis that is false. d) We don't reject a null hypothesis that is false 5. 6. If we are performing a two-tailed test of whether = 100, the power of the test in detecting a shift of the mean to 105 will be ________ its power detecting a shift of the mean to 96. a) *Less than 96 is closer than 105 to 100. b) Greater than c) Equal to d) Not comparable to From a sample of 100 students we find that the mean expenditure for books is $316.40 with a sample standard deviation of $43.20. You are asked to test whether the (population) mean expenditure is above $314 using a 10% significance level. a) What are the null and alternative hypotheses? (2) b) What is the ‘rejection zone’ (in terms of x ) (2) c) What is your conclusion? (2) [16] Solution: This is basically the revised version of Exercise 9.54 [9.52 in 9th] (9.50 in 8th edition) that I warned you about. a) Since the problem asks if the mean is above $314 314 , and this does not contain an equality, it must be an alternate hypothesis. Our hypotheses are H 0 : 314 , (The average cost of textbooks per semester at a large university is no more than $314.) and H 1 : 314 (The average cost of textbooks per semester at a large university is more than $314.) This is a right-sided test. 5 252y0552 10/31/05 (Open in ‘Print Layout’ format) b) Given: 0 314 .00, s 43 .20, n 100 , df n 1 99, x 316 .40 and .10 . So sx s 43 .20 4.320 . Note that tn 1 t ..99 10 1.290 . We need a critical value for x n 100 above $314. Common sense says that if the sample mean is too far above $314, we will not believe H 0 : 314 . The formula for a critical value for the sample mean is x t n1 s , but we want a single value above 314, so use x t n1 s cv 0 2 x cv 0 x 314 .00 1.290 4.320 319 .57 . Make a diagram showing an almost Normal curve with a mean at 314 and a shaded 'reject' zone above 319.57. c) Since x 316 .40 is not in the ‘reject’ zone, we do not reject the null hypothesis and cannot conclude that the mean is above $314. 7. From a sample of 10 students we find that the mean expenditure for books is $316.40 with a sample standard deviation of $43.20. You are asked to test whether the (population) mean expenditure is below some number. You compute a t ratio and find that it is 1.960. The p-value is a) Between .025 and .05 b) Between .05 and .10 c) Between .95 and .975 d) Between 1.90 and 1.95 e) Exactly .025 f) None of the above – provide a more suitable answer. [18] Explanation: For this see the writeup that I announced two weeks ago. If our alternate hypothesis is that the mean expenditure is below some number, we want Pt 1.960 . There are n 1 9 degrees of freedom. The table says that t.9025 2.262 and t.905 1.833 . Thus we can see that .025 Pt 1.960 .05 , and we can conclude that .95 Pt 1.645 .975 . 8. An employer states that the median wage in a factory is $45000. You believe that it is lower. From a sample of 300 employees you find that 170 have wages below $45000. The median for the sample is $41000, the mean is $47273 and the sample standard deviation is $4051. The test should reflect your beliefs and, if you use p in a hypothesis, you must define it. Your hypotheses are (3) a) H 0 : 45000 H 1 : 45000 b) H 0 : 45000 H 1 : 45000 c) H 0 : 45000 H 1 : 45000 d) H 0 : 45000 H 0 : p .5 * and If p is the proportion above 0 . H 1 : 45000 H 1 : p .5 e) H 0 : 45000 H 0 : p .5 and H 1 : 45000 H 1 : p .5 f) H 0 : 45000 H 0 : p .5 and H 1 : 45000 H 1 : p .5 H 0 : 45000 H 0 : p .5 g) * and If p is the proportion below 0 . H : 45000 1 H 1 : p .5 6 252y0552 10/31/05 (Open in ‘Print Layout’ format) h) i) H 0 : 45000 H : p .5 and 0 H 1 : 45000 H 1 : p .5 H 0 : 45000 H : p .5 and 0 H : 45000 1 H 1 : p .5 [21] Your belief is that the median is below 45000. This is an alternate hypothesis because it does not contain an equality. The null hypothesis must be that the median is at least 45000. The correct hypothesis involving the proportion can be found in the following table from the outline. Hypotheses about a median Hypotheses about a proportion If p is the proportion If p is the proportion above 0 below 0 H 0 : 0 H 1 : 0 H 0 : 0 H 1 : 0 H 0 : p .5 H 1 : p .5 H 0 : p .5 H 1 : p .5 H 0 : p .5 H 1 : p .5 H 0 : p .5 H 1 : p .5 H 0 : 0 H 1 : 0 H 0 : p .5 H 1 : p .5 H 0 : p .5 H 1 : p .5 9. (Extra credit) An employer states that the median wage in a factory is $45000. You believe that it is lower. From a sample of 300 employees you find that 170 have wages below $45000. The median for the sample is $41000, the mean is $47273 and the sample standard deviation is $4051. The test should reflect your beliefs and, if you use p in a hypothesis, you must define it. Finish the problem. (3) 170 .5667 This is the most natural way to define the Solution: The proportion below $45000 is p 300 H 0 : 45000 H 0 : p .5 proportion here. So our hypotheses are and . We also will assume .05 , H 1 : 45000 H 1 : p .5 so that z z.05 1.645 , and we know that p 0 q 0 .5 and n 300 . This is a right-side test. From Table 3, we can use one of the methods.. Interval for Confidence Hypotheses Test Ratio Critical Value Interval Proportion p p z 2 s p pq n q 1 p sp H 0 : p p0 H1 : p p0 z p p0 p pcv p0 z 2 p p0 q0 n q0 1 p0 p .5.5 .5667 .4333 .0008333 .02887 and s p .0008185 .02861 300 300 (i) Critical Value: pcv p0 z p .5 1.645.02887 = .5475. The rejection region is above We can say p .5475 and the observed proportion, .5667 is in that region, so we reject the null hypothesis. p p 0 .5667 .5 (ii) Test Ratio: z = 2.3104. The rejection region is above 1.645 and this p .02887 value of z is in that region, so we reject the null hypothesis. 7 252y0552 10/31/05 (Open in ‘Print Layout’ format) (iii) Confidence Interval: p p z s p .5667 1.645.02861 = .5196. There is no way that the proportion can be both above .5196 and below .5 as stated in the null hypothesis, so we reject the null hypothesis. 10. (Extra Credit – Nasty but not that hard – problem due to Prem S. Mann.) Use .01 . A quality control inspector is checking machines that are supposed to produce packages of cookies with a mean weight of 32 oz and a standard deviation of .015 oz. She takes a sample of 40 packages and finds a sample standard deviation of .029 oz. Do a one-sided test on the variance to see if the machine is out of control. Do the test by finding a critical value for the sample variance (3) (You can do it in a more familiar way, but you won’t get as much credit!) [27] Solution: H 0 : .015 H 1: .015 , n 40 , s .029 and .01 . Table 3 says s cv 2 DF z 2 2 DF . If we are doing a right sided test we want a critical value above .015, we use .015 8.84176 0.13222 .0203 . 2.327 8.84176 6.5148 This is a right-side test, so that the ‘reject’ region is the area under the curve to the right of 0.0203. Since s .029 is above this value, reject the null hypothesis and fix the machine. z z.01 2.372 . 2 DF 239 78 8.84176 s cv 8 252y0552 10/31/05 (Open in ‘Print Layout’ format) ECO252 QBA2 FIRST EXAM October 17-18 2005 TAKE HOME SECTION Name: _________________________ Student Number and class: _________________________ IV. Do sections adding to at least 20 points - Anything extra you do helps, and grades wrap around) . Show your work! State H 0 and H 1 where appropriate. You have not done a hypothesis test unless you have stated your hypotheses, run the numbers and stated your conclusion. (Use a 95% confidence level unless another level is specified.) Answers without reasons are not usually acceptable. Neatness counts! 1. (Prem S. Mann - modified) Exhibit T1: According to a 1992 survey, 45% of the American population would support higher taxes to pay for health insurance. A state government is considering offering a health insurance plan and took a survey of 400 residents and found that 50% would support higher taxes. Use this result to test whether the results of the 1992 survey apply in the state. Use a 99% confidence level. Personalize these results as follows. Change 45% by replacing 5 by the last digit of your student number. We will call your result the ‘proportion of interest.’ (Example: Seymour Butz’s student number is 976512, so he changes 45% to 42%; 42% is his proportion of interest.) a. State the null and alternative hypotheses in each case and find a critical value for each case. What is the ‘reject’ region? Compute a test ratio and find a p-value for the hypothesis in each case. (12) (i) The state government wants to test that the fraction of people who favored higher taxes for health insurance is below the proportion of interest (ii) The state government wants to test that the fraction of people who favored higher taxes for health insurance is above the proportion of interest. (iii) The state government wants to test that the fraction of people who favored higher taxes for health insurance is equal to the proportion of interest. [12] b. (Extra credit) Find the power of the test if the true proportion is 50% and: (i) The alternate hypothesis is that the fraction of people is above the proportion of interest. (2) (ii) The alternate hypothesis is that the fraction of people does not equal the proportion of interest. (2) c. Assuming that the proportion of interest is correct, is the sample size given above adequate to find the true proportion within .005 (1/2 of 1%)? (Don’t say yes or no without calculating the size that you actually need!) (2) d. If the alternate hypothesis is that the fraction of people is above the proportion of interest, create an appropriate confidence interval for the hypothesis test. (2) [20] Solution: a) From the formula table we have: Interval for Confidence Hypotheses Test Ratio Critical Value Interval Proportion p p0 p p z 2 s p pcv p0 z 2 p H 0 : p p0 z p H1 : p p0 pq p0 q0 sp p n n q 1 p q 1 p 0 0 9 252y0552 10/31/05 (Open in ‘Print Layout’ format) Only the solution for Version 0 is given here. The solutions for the remainder will be in 252y0551s or 252y0551h. Note that my rule on grading parts like (ii) below is to assume that your alternate hypothesis was correct and to ask if the critical values agree with it. p0 q0 .40 .60 .0006 .024495 n 400 (i) The state government wants to test that the fraction of people who favored higher taxes for health insurance is below the proportion of interest H 0 : p .40, H 1 : p .40, n 400 , p .50 and .01 . z.01 2.327 . The critical value must be below .40. pcv p 0 z p .40 2.327.024495 = .3430. The ‘reject’ zone is a) p 0 .40 p Version 0 .50 .40 Pz 4.08 1 below .3430. pval P p .50 P z .024495 (ii) The state government wants to test that the fraction of people who favored higher taxes for health insurance is above the proportion of interest. H 0 : p .40, H 1 : p .40, n 400 , p .50 and .01 . z.01 2.327 . The critical value must be above .40. pcv p 0 z p .40 2.327.024495 = .4570 . The ‘reject’ zone .50 .40 Pz 4.08 0 is above .4570. pval P p .50 P z .024495 (iii) The state government wants to test that the fraction of people who favored higher taxes for health insurance is equal to the proportion of interest. [12] H 0 : p .40, H 1 : p .40, n 400 , p .50 and .01 . z.005 2.576 . The critical value must be on either side of .40. pcv p 0 z 2 p .40 2.576.024495 .40 .0631 . The ‘reject’ zone is below .3369 and above .4631. .50 .40 pval 2 P p .50 2 P z 2 Pz 4.08 0 .024495 pq .5.5 .5 .025 . n 400 20 b) (i) H 1 : p .40, P p .4570 p1 .50 p Pz .4570 .5 Pz 1.72 .5 .4573 .0427 Power 1 .0427 .9573 .025 .4631 .5 .3369 .5 z (ii) H 1 : p .40, P .3369 p .4631 p1 .50 P . 025 .025 P6.52 z 1.48 .5 .4306 .0694 Power 1 .0694 .9306 c) p 0 .40 n pqz 2 e2 .40 .60 2.576 2 .005 2 pq .5.5 .5 .025 . n 400 20 H 0 : p .40, we must reject it. d) s p 63703 .42 This is above 400, so the sample size is inadequate. p p z s p .5 2.327.025 .4418 If the null hypothesis is Minitab Calculation of critical values and test ratios for all Versions of question 1 of the Take-home exam. ————— 10/17/2005 7:01:49 PM ———————————————————— Available in 252y0551s 10 252y0552 10/31/05 (Open in ‘Print Layout’ format) 2. Customers at a bank complain about long lines and a survey shows a median waiting time of 8 minutes. Personalize the data as follows: Use the second-to-last digit of your student number – subtract it from the last digit of every single number. (Example: Seymour Butz’s student number is 976512, so he changes {6.7 6.3 5.7 ….} to {6.6 6.1 5.6 …}) You may drop any number in the data set that you use for this problem that is exactly equal to 8. Use .05 . Do not assume that the population is Normal! Exhibit T2: Changes are made and a new survey of 32 customers is taken. The times are as follows: 6.7 7.6 8.5 6.3 7.6 8.9 5.7 7.7 9.0 5.5 7.7 9.1 4.9 8.0 9.3 7.0 8.0 9.5 7.1 8.1 9.6 7.2 8.2 9.7 7.3 8.4 9.8 7.3 8.4 10.3 7.4 8.4 a. Test that the median is below 8. State your null and alternate hypotheses clearly. (2) b. (Extra credit) Find a 95% (or slightly more) two-sided confidence interval for the median. (2) [24] Solution: The data sets for this problem are below. Let n be the count of the sample and x be the number of numbers below 8. Row x1 x2 x3 x4 x5 x6 x7 x8 x9 x10 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 4.8 5.4 5.6 6.2 6.6 6.9 7.0 7.1 7.2 7.2 7.3 7.5 7.5 7.6 7.6 7.9 7.9 8.1 8.3 8.3 8.3 8.4 8.8 8.9 9.0 9.2 9.4 9.5 9.6 9.7 10.2 4.7 5.3 5.5 6.1 6.5 6.8 6.9 7.0 7.1 7.1 7.2 7.4 7.4 7.5 7.5 7.8 7.8 7.9 8.2 8.2 8.2 8.3 8.7 8.8 8.9 9.1 9.3 9.4 9.5 9.6 10.1 n x 31 17 31 18 4.6 5.2 5.4 6.0 6.4 6.7 6.8 6.9 7.0 7.0 7.1 7.3 7.3 7.4 7.4 7.7 7.7 7.8 7.9 8.1 8.1 8.1 8.2 8.6 8.7 8.8 9.0 9.2 9.3 9.4 9.5 10.0 32 19 4.5 5.1 5.3 5.9 6.3 6.6 6.7 6.8 6.9 6.9 7.0 7.2 7.2 7.3 7.3 7.6 7.6 7.7 7.8 8.1 8.5 8.6 8.7 8.9 9.1 9.2 9.3 9.4 9.9 4.4 5.0 5.2 5.8 6.2 6.5 6.6 6.7 6.8 6.8 6.9 7.1 7.1 7.2 7.2 7.5 7.5 7.6 7.7 7.9 7.9 7.9 8.4 8.5 8.6 8.8 9.0 9.1 9.2 9.3 9.8 29 19 31 22 4.3 4.9 5.1 5.7 6.1 6.4 6.5 6.6 6.7 6.7 6.8 7.0 7.0 7.1 7.1 7.4 7.4 7.5 7.6 7.8 7.8 7.8 7.9 8.3 8.4 8.5 8.7 8.9 9.0 9.1 9.2 9.7 32 23 4.2 4.8 5.0 5.6 6.0 6.3 6.4 6.5 6.6 6.6 6.7 6.9 6.9 7.0 7.0 7.3 7.3 7.4 7.5 7.7 7.7 7.7 7.8 8.2 8.3 8.4 8.6 8.8 8.9 9.0 9.1 9.6 32 23 4.1 4.7 4.9 5.5 5.9 6.2 6.3 6.4 6.5 6.5 6.6 6.8 6.8 6.9 6.9 7.2 7.2 7.3 7.4 7.6 7.6 7.6 7.7 8.1 8.2 8.3 8.5 8.7 8.8 8.9 9.0 9.5 32 23 4.0 4.6 4.8 5.4 5.8 6.1 6.2 6.3 6.4 6.4 6.5 6.7 6.7 6.8 6.8 7.1 7.1 7.2 7.3 7.5 7.5 7.5 7.6 8.1 8.2 8.4 8.6 8.7 8.8 8.9 9.4 4.9 5.5 5.7 6.3 6.7 7.0 7.1 7.2 7.3 7.3 7.4 7.6 7.6 7.7 7.7 8.1 8.2 8.4 8.4 8.4 8.5 8.9 9.0 9.1 9.3 9.5 9.6 9.7 9.8 10.3 31 23 30 15 11 252y0552 10/31/05 (Open in ‘Print Layout’ format) a) Test that the median is below 8. State your null and alternate hypotheses clearly. (2) Hypotheses about Hypotheses about a proportion a median If p is the proportion If p is the proportion above 0 below 0 H 0 : 0 H 0 : p .5 H 0 : p .5 H 1 : p .5 H 1 : 0 H 1 : p .5 It seems easiest to let p be the proportion below 8. If you defined p as the proportion above .8, the p H : 8 H : p .5 values should be identical. Our Hypotheses are 0 and 0 . According to the outline, in H 1 : 8 H 1 : p .5 the absence of a binomial table we must use the normal approximation to the binomial distribution. If p p0 x p is our observed proportion, we use z . But for the sign test, p .5 and q 1 .5 .5 . n p0 q0 n So z x n .5 .25 n x n .5 .5 2x n x .5n n x .5n 2 . n .5 n n n (For relatively small values of n , a continuity correction is advisable, so try z 2x 1 n , where the + n n n , and the applies if x . ) Values of x and n are repeated below for all ten possible 2 2 n 2x n 2x 1 n cases. Both z1 and the more correct z 2 (except when x . ) are computed. Since 2 n n the alternative hypothesis is p .5, the probabilities in the two right columns are p-values. If you used applies if x z1 p p x n .5 , the values of z 1 below are correct. The exact probabilities were computed by .25 n Minitab in the last column. 2x n 2x 1 n x n z1 z2 P z z1 P z z 2 1 F x 1 n n 1 2 3 4 5 6 7 8 9 10 p 17 18 19 19 22 23 23 23 23 15 31 31 32 29 31 32 32 32 31 30 0.53882 0.89803 1.06066 1.67126 2.33487 2.47487 2.47487 2.47487 2.69408 0.00000 0.35921 0.71842 0.88388 1.48556 2.15526 2.29810 2.29810 2.29810 2.51447 0.00000 .5-.2054=.2946 .5-.3133=.1867 .5-.3554=.1445 .5-.4525=.0475* .5-.4901=.0099* .5-.4934=.0066* .5-.4934=.0066* .5-.4934=.0066* .5-.4964=.0036* .5 .5-.1404=.3596 .5-.2642=.2358 .5-.3106=.1894 .5-.4319=.0681 .5-.4846=.0154* .5-.4893=.0107* .5-.4893=.0107* .5-.4893=.0107* .5-.4940=.0060* .5 0.360050 0.236565 0.188543 0.068023 0.014725 0.010031 0.010031 0.010031 0.005337 0.572232 Since .05 , and the starred items are below .05, these are the cases in which we reject the hypothesis that the median is, at least 8. It makes about as much sense to reject a hypothesis about a median because the sample median x.50 is above or below 0 with no other information as it does to reject a hypothesis about a mean because x is above or below 0 without more information. 12 252y0552 10/31/05 (Open in ‘Print Layout’ format) b) (Extra credit) Find a 95% (or slightly more) two-sided confidence interval for the median. (2) I am going to assume that you continued to use the data sets above. The outline says that an approximate value for the index of the lower limit of the interval is k n 1 z .2 n 2 . In this formula z z.025 1.96. 3 We use this formula with the following results. k Row n sqrtn k * rounded down n k * 1 1 2 3 4 5 6 7 8 9 10 31 31 32 29 31 32 32 32 31 30 5.56776 5.56776 5.65685 5.38516 5.56776 5.65685 5.65685 5.65685 5.56776 5.47723 10.5436 10.5436 10.9563 9.7225 10.5436 10.9563 10.9563 10.9563 10.5436 10.1323 10 10 10 9 10 10 10 10 10 10 22 22 23 23 22 23 23 23 22 21 interval 7.2 8.4 7.1 8.3 7.0 8.2 6.9 8.6 6.8 8.4 6.7 7.9 6.6 7.8 6.5 7.7 6.4 7.5 7.3 8.5 13 252y0552 10/31/05 (Open in ‘Print Layout’ format) 3. Use the personalized data from Problem 2 (but do not drop any numbers.) test that the mean is below 8. Minitab found the following the original numbers, which were in C4: Descriptive Statistics: C4 Variable N Mean SE Mean StDev Minimum C4 32 7.944 0.228 1.291 4.900 Sum of squares (uncorrected) of C4 = 2070.94 Q1 7.225 Median 8.000 Q3 8.975 Maximum 10.300 I do not know the values for your numbers, but the following (copied from last year’s exam) should be useful: x a x na, x a2 x 2 2a x na2 . Your value of a is negative or zero. Can you say that the population mean is below 8? Use .05 . If you remembered what we learned last semester, you could have done this calculation with little or no work. The relevant formulas are: For the mean Ex a Ex a For the variance Var x a Var x So that good old Seymour, whose student number is 976512, subtracts 0.1 from every number and gets Mean E x 0.1 E x 0.1 7.944 0.1 7.844 Variance Var x 0.1 Var x 1.291 2 a. State your null and alternative hypotheses (1) b. Find a critical value for the sample mean and a ‘reject’ zone. Make a diagram! (1) c. Do you reject the null hypothesis? Use the diagram to show why. (1) d. Find an approximate p-value for the null hypothesis. Make sure that I know where you got your results. (1) e. Test to see if the population standard deviation is 2. (2) f. (Extra credit) What would the critical values be for this test of the standard deviation? (1) g. Assume that the population standard deviation is 2, restate your critical value for the mean and create a power curve for your test. (5) h. Assume that the population standard deviation is 2 and find a 94% (this does not say 95%!) two sided confidence interval for the mean. (1) i. Using a 96% confidence level and assuming that the population standard deviation is 2, test that the mean is below 8. (1) j. Using a 96% confidence interval an assuming that the population standard deviation is 2, how large a sample do you need to have an error in the mean of .005 ? (1) [39] Solution: a. State your null and alternative hypotheses (1) Since the problem asks if the mean is below 8 , and this does not contain an equality, it must be an alternate hypothesis. Our hypotheses are H 0 : 8 , (The average wait is at least 8 minutes.) and H 1 : 8 (The average wait is less than 8 minutes.) This is a left-sided test. b. Find a critical value for the sample mean and a ‘reject’ zone. Make a diagram! (1) Given: 0 8, s 1.291, n 32, df n 1 31, and .05 . So sx s 1.291 31 1.696 . We need a critical value for x below 8. 0.228 . Note that tn 1 t .10 n 32 Common sense says that if the sample mean is too far below 8, we will not believe H 0 : 8 . The formula for a critical value for the sample mean is x t n1 s , but we want a single value cv 0 2 x below 8, so use xcv 0 tn1 s x 8 1.696 0.228 7.613 . Make a diagram showing an almost Normal curve with a mean at 8 and a shaded 'reject' zone below 7.613. I should not have to say that if you have found a critical value, the ‘reject’ zone should be either above or below it, depending on whether it is a right or left side test. 0 , the null hypothesis mean is never in the ‘reject’ zone. If your critical value is x cv and x is in the ‘reject’ zone, you 14 252y0552 10/31/05 (Open in ‘Print Layout’ format) reject the null hypothesis. Telling me that you reject a hypothesis because x cv is above or below 0 just doesn’t cut it! c. Do you reject the null hypothesis? Use the diagram to show why. (1) x 7.844 7.744 7.644 7.544 7.444 7.344 7.244 7.144 7.044 7.944 Do not Do not Do not Reject Reject Reject Reject Reject Reject Do not reject reject reject Not below Not below Not below Below cv Below cv Below cv Below cv Below cv Below cv Not below reject cv cv cv cv d. Find an approximate p-value for the null hypothesis. Make sure that I know where you got your results. n 32 means df 31 . The closest values from the t table to the computed t are shown. x 0 x 8 The alternative hypothesis, H 1 : 8 , means that the p-value is P t P t s 0 .228 x Version Nearest t Values Approx. p-value x t 31 31 1 7.844 -0.68421 p-value is between .20 and .25. t .25 0.680 t .20 0.853 2 31 31 t .15 1.054 t .10 1.309 7.744 -1.12281 3 7.644 -1.56140 4 7.544 -2.00000 31 t .05 1.309 31 t 1.309 5 7.444 -2.43860 2.040 6 7.344 -2.87719 7 7.244 -3.31579 8 7.144 -3.75439 9 7.044 -4.19298 10 .05 31 t .025 31 t .01 31 t .005 31 t .001 31 t .001 31 t .45 7.944 -0.24561 2.453 2.744 2.453 p-value is between .01 and .025 2.453 p-value is between .005 and .01. 3.375 p-value is between .001 and .005. .025 31 t .01 31 t .01 31 t .001 3.375 p-value is between .025 and .05 p-value is between .025 and .05 p-value is below .005 3.375 0.127 p-value is between .10 and .15. 31 t .025 2.040 31 t 2.040 p-value is below .005 31 t .40 0.256 p-value is between .40 and .45. e. Test to see if the population standard deviation is 2. s 1.291 df 31 . , (2) The outline says To test H 0 : 0 against H1 : 0 i. Test Ratio: 2 ii. Critical Value: n 1s 2 2 s cv 02 iii. Confidence Interval: s 2DF z 2 2DF or for large samples z 2 2 2DF 1 2 02 2 n 1 or n 1s 2 22 12 2 02 n 1 2 or for large samples (from table 3) s cv n 1s 2 12 2 2 DF z 2 2 DF . or for large samples s 2DF z 2 2DF 15 252y0552 10/31/05 (Open in ‘Print Layout’ format) If we use a test ratio, 2 n 1s 2 02 311.291 2 22 12 .9168 . Since our degrees of freedom are too large for the table, use z 2 2 2DF 1 212 .9168 231 1 25.8336 61 5.0827 7.8102 2.7275 . If we are doing a 2-sided 5% test, we do not reject the null hypothesis if z is between -1.960 and 1.960. It’s not, so we reject the null hypothesis. If we use a confidence interval, s 2DF z 2 2DF s 2DF z 2 2DF . This becomes 1.2917.8740 1.2917.8740 or 1.034 1.718 . Since 0 is not in this interval, reject the null 1.960 7.8740 1.96 7.7840 hypothesis . f. (Extra credit) What would the critical values be for this test of the standard deviation? (1) Table 3 says s cv 2 DF z 2 2 DF 2 231 1.960 231 27.8740 . The two critical values are 7.8740 1.960 27.8740 27.8740 15 .748 15 .748 1.6014 and 2.2663 . Since the standard deviation is 7.8740 1.960 9.834 7.8740 1.960 5.914 not between them, we reject the null hypothesis. pval 2Pz 2.73 2.5 .4968 .0064 . g. Assume that the population standard deviation is 2, restate your critical value for the mean and create a power curve for your test. (5). Our hypotheses are H 0 : 8 , (The average wait is at least 8 minutes.) and H 1 : 8 (The average wait is less than 8 minutes.) This is a left-sided test. If 2 , n 32 , x 2 0.35355 and .05 , our critical value is 1.645 2 8 0.3762 7.4184 . We use the 32 32 following points: 8, 7.709, 7.4184, 7.127 and 6.836. Our results are as follows. 7.4184 8 Px 7.4184 8 P z Pz 1.645 = .95 1 8 .35355 Power 1 .95 .05 7.4184 7.709 1 7.709 P x 7.4184 7.709 P z Pz 0.82 =.5 + 0.2939 =.7939 .35355 Power 1 .7939 .2061 7.4184 7.4184 1 7.4184 P x 7.4184 7.4184 P z Pz 0 .5 .35355 Power 1 .5 .5 7.4184 7.127 1 7.127 P x 7.4184 7.127 P z Pz 0.82 = .5-.2939 = .2061 .35355 Power 1 .2061 .7939 7.4184 6.836 1 6.836 P x 7.4184 6.836 P z Pz 1.647 = .5 - .4500 = .0500 .35355 Power 1 .05 .95 h. Assume that the population standard deviation is 2 and find a 94% (this does not say 95%!) two sided 2 0.35355 . To find confidence interval for the mean. (1) .06 . We know x z x . x 2 32 z .03 make a Normal diagram for z showing a mean at 0 and 50% above 0, divided into 3% above z .03 and 16 252y0552 10/31/05 (Open in ‘Print Layout’ format) 47% below z .03 . So P0 z z.03 .4700 The closest we can come is P0 z 1.88 .4699 . So z .03 1.88 x z x x 1.88.35355 x 0.665 . Substitute your value of the sample mean. 2 i. Using a 96% confidence level and assuming that the population standard deviation is 2, test that the mean is below 8. (1) Our hypotheses are H 0 : 8 , (The average wait is at least 8 minutes.) and H 1 : 8 (The average wait is less than 8 minutes.) This is a left-sided test. We can do this by a critical value, a test ratio or a confidence interval. (i) .04 To find z .04 make a Normal diagram for z showing a mean at 0 and 50% above 0, divided into 4% above z .04 and 46% below z .04 . So P0 z z.04 .4600 The closest we can come is P0 z 1.75 .4579 . So x cv 0 z x 8 1.750.35355 7.381 If your value of the sample mean is below 7.381, reject the null hypothesis. x 0 x 8 x 8 (ii) z . To get a p-value find P z . So, for example, if 0 . 35355 x 0.35355 7.944 8 x 7.944 , pval P z Pz 0.16 .5 .0636 .4364 . If the p-value is below 0.35355 x 0 x 8 .04 or if the test ratio z is below -1.75 , reject the null hypothesis. x 0.35355 (iii) The one-sided confidence interval is x z.04 x x 1.75.35355 x 0.6187 . If this value is below 8, reject the null hypothesis. j. Using a 96% confidence level and assuming that the population standard deviation is 2, how large a sample do you need to have an error in the mean of .005 ? (1) .04 To find z .02 make a Normal diagram for z showing a mean at 0 and 50% above 0, divided into 2% above z .02 and 48% below z .04 . So P0 z z.02 .4800 The closest we can come is P0 z 2.08 .4798 From the outline n z 22 2 e 2 2.08 2 22 3461 .12 . Use 3462. .005 For Minitab Computations for Problem 3, see 252y0551s.. What follows below is excerpted from the solution to the first exam from Spring 2005. Possible Rubric for Statistics Exams. I have been hearing a lot about rubrics lately, and have taken a while to be assured that they are not the materials that the third pig built his house out of. My first attempt at this came to me in a recent assessment meeting. It is slightly expanded here. 1. 2. 3. 4. 5. Did the student make a good effort to understand the question? This would include asking the instructor and consulting notes and texts if he/she did not understand what was desired. Was the method used to solve the problem the best and most appropriate for the problem? Was the method used correctly? Did the student present the solution in such a way that the instructor can understand how the student got the answers presented? This should include all formulas, equations and tables used. Is it evident from the way the work is presented that the student understood what he/she was doing? Is it legible? Was the conclusion stated clearly? Was the null hypotheses rejected or not rejected? Was a valid null and alternate hypothesis clearly stated at the beginning? What were the 17 252y0552 10/31/05 (Open in ‘Print Layout’ format) 6. implications of the conclusion for a relevant goal, for example the decision to buy a new product? Did the student demonstrate knowledge of the difference between sample statistics and population parameters? Was a statistical test using sample statistics used to evaluate a null hypothesis containing population parameters and an equality? In view of what was said here, it is incredible that, on every exam I give, students give me confidence intervals and tests for means when I ask for confidence intervals and tests for medians, variances and even proportions. Check the wording on the questions that you misunderstood. Can you identify what wording in the question made you think it was about a mean? Can you tell me what it was? It is also remarkable that there are any people out there who do not know that proportions, probabilities and p-values (which are probabilities) must be between one and zero. It is also amazing to me that that so many of you cannot express the difference between t and z . In the most practical sense a value of t comes from the t table and must be used with s , the sample standard deviation in confidence intervals and tests for the population mean. There are only a few other cases where we use t and they will be discussed later in the course. On the other hand z , which comes only from the bottom line of the t table, but can be calculated using the table of the standardized Normal distribution, must be used with , the population standard deviation, in confidence intervals and tests for the mean. z is also used in large sample tests for the population proportion, population mean, population standard deviation, population median and the means of the Poisson and Binomial distribution if the correct formulas are used, but don’t push it. The Normal distribution should not be used if more accurate methods are available. In any case, look at Things You Should Never Do on and Exam or Anywhere Else before you do another assignment and frequently thereafter. 18