251x0741 5/3/07 ECO 251 QBA1 FINAL EXAM, Version 1 MAY 7 and 9, 2007 Name Class ________________ Part I. Do all the Following (14 Points) Make Diagrams! Show your work! Illegible and poorly presented sections will be penalized. Exam is normed on 75 points. There are actually 123+ possible points. If you haven’t done it lately, take a fast look at ECO 251 - Things That You Should Never Do on a Statistics Exam (or Anywhere Else). x ~ N 10, 7.9 1. Px 14 2. P0 x 9 3. F 2.00 (Cumulative Probability) 4. P2 x 36 5. P10 x 10 6. x.34 7. A symmetrical region around the mean with a probability of 32%. 1 251x0741 5/3/07 II. (10 points+, 2 point penalty for not trying parts a) and b) Show your work! Mark individual sections clearly. Answers without work and/or reasons cannot be accepted! (Webster) A local bar sells ’16 oz’ glasses of beer. A group of students buys 22 glasses of beer and, using their own measuring cup, tries to estimate the mean contents. The measurements are below. 13.5 15.3 13.8 15.4 14.2 15.4 14.3 15.6 14.6 15.6 14.6 15.8 14.6 15.8 22 Using Minitab, I have computed the following i 1 14.9 15.9 15.0 16.1 15.1 16.7 15.3 16.9 20 x 334 .4 and x 2 4533 .84 . i 1 a) Note that the sum of squares is the sum of only the first 20 numbers. To show me that you know how to x 2 using the total that I have given to you. (1) compute a sum of squares, compute b) Compute the sample variance of the contents of the 22 glasses given above. (2) c) Compute a 99% confidence interval for the mean contents (3) d) Is the mean significantly different from 16 oz.? Why? (1) e) Because of all the hoopla around the students’ experiment only 100 glasses of beer were sold that night. Assuming that 100 is the population size from which the sample of 22 was taken, recompute the confidence interval in c). (2) [9] f) Assume that the costs to the bar owner in drawing a glass of beer are $0.15 per glass plus $0.015 per oz, use the mean and variance that you found in b) to compute the mean and variance of the costs to the bar owner. (The formula relating cost to ounces is w 0.015 x 0.15 .) There will be no credit in this section unless you use the mean and variance from b). (2.5) [11.5] 2 251x0741 5/3/07 III. Do at least 5 of the following 6 sections (at least 12 each) (or do items adding to at least 48 points Anything extra you do helps, and grades wrap around) . Show your work! Please indicate clearly what sections of the problem you are answering! If you are following a rule like E ax aEx please state it! If you are using a formula, state it! If you answer a 'yes' or 'no' question, explain why! If you are using the Poisson or Binomial table, state things like n , p or the mean. Avoid crossing out answers that you think are inappropriate - you might get partial credit. Choose the problems that you do carefully – most of us are unlikely to be able to do more than half of the entire possible credit in this section!) This is not an opinion questionnaire. Answers without reasons or supporting calculations or table references will not be accepted (except in multiple choice questions)!!!! Answers that are hard to follow will be penalized. Note that some sections extend over more than one page. A. Remember the problem on the previous page. The data is repeated here. x is volume in ounces. y is simply the order in which the beers were drawn. The students believe that the bartender got more generous as the evening passed. To test this they computed the correlation of the volume with the order. You have x and x 2 from the previous problem. It is quite easy to compute y 253 and y 2 3795 20 and Minitab says xy 3237 .3 i 1 1) Note that the sum xy is the sum of only the first 20 numbers. To show me that you know how to compute a sum of this type, compute xy using the total that I have given to you. (1) 2). Compute the sample covariance s xy between x and y . (2) Explain what this covariance tells us. (0.5) 3) Compute the sample correlation rxy between x and y . (2) Explain what this correlation tells us about the relationship between the size of the beer and the order in which it was drawn. (1) [6.5] 4) Using the correlation and covariance you computed in b) and c) and the equation relating the size of the beer and its cost that you used on the last page, find the covariance and correlation between cost of the beer to the bar owner and the order in which it was drawn. (4) [10.5] 5) The true test of a correlation is a significance test which not only looks at the size of the correlation, but r r the size of the sample. To do such a test compute t . On the assumption that there is no sr 1 r 2 n2 significant correlation and using a 99% confidence level, 99% of the values of this t-ratio will fall between n 2) n 2) t .(005 and t .(005 . If your value of the t ratio does not fall between these two values, you can say that there is significant correlation. Find the appropriate values of t, compute the t-ratio and tell me if the correlation is significant. (5) [15.5] Row 1 2 3 4 5 6 7 8 9 10 11 x 13.5 13.8 14.2 14.3 14.6 14.6 14.6 14.9 15.0 15.1 15.3 y 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 15.3 15.4 15.4 15.6 15.6 15.8 15.8 15.9 16.1 16.7 16.9 12 13 14 15 16 17 18 19 20 21 22 3 251x0741 5/3/07 B. Let us define the following events. A1 : y 3, A2 : y 5 , B1 : x 1 B 2 : x 2 . B1 A1 A2 B2 1) Create a joint probability table like the one at the left on the assumption that the four events are the only ones that can occur, P A1 .4 , PB1 .3 and that A1 and B1 are independent. (3) 2) Compute the variance of y and the covariance and correlation between x and y in 1). (3) 3) Repeat 1) on the assumption that A1 and B1 are independent are mutually exclusive. (2) 4) Compute the variance of y and the covariance and correlation between x and y in 3). (3) 5) Use the addition rule to show that if P A1 .4 and PB1 .3 , these two events cannot be collectively exhaustive no matter what the relationship between these two events. (1) [12] 4 251x0741 5/3/07 C. Answer the following 6 multiple choice questions. (These should be 2 each, but to discourage guessing, how about 2.5 each for right answers and 0.5 penalty for wrong answers.) 1) Which of the following is a major difference between the binomial and the hypergeometric distributions? a). The sum of the outcomes can be greater than 1 for the hypergeometric b) The probability of a success changes in the hypergeometric distribution. c) The number of trials changes in the hypergeometric distribution d) The outcomes cannot be whole numbers in the hypergeometric distribution e) None of the above is correct. 2) The continuity correction factor is used when a) The sample size is at least 5. b) Both np and nq are at least 30. c) A continuous distribution is used to approximate a discrete distribution d) A discrete distribution is used to approximate a continuous distribution e) A binomial distribution is used to approximate the hypergeometric distribution f) None of the above. 3) Suppose a population consisted of 20 items. How many different samples of n = 3 are possible? a) 20 b) 40 c) 120 d) 20 3 8000 e) 1140 f) 6840 g) None of the above is correct. 4) The finite population correction factor is used when a) n/N is more than .05. b) N is more than 1000 c) np is greater than 5. d) n is more than 30. e) None of the above 5) In the formula x z x , the 2 2 is a) The confidence level b) The area of one tail of the sampling distribution of the mean c) The probability that the interval would not include the mean d) The proportion of confidence intervals that will contain the mean e) None of the above. 6) All Normal distributions have a) A finite range b) The same coefficient of variation c) The same probability density function f x d) The same area between and 2 e) All of the above must be true. 5 251x0741 5/3/07 D. 1) We are baking chocolate chip cookies again. Assume that all cookies are discarded that have less than 4 chips, but the mean is only 4.5, what proportion of cookies will be discarded. (1) 2) In view of the results of problem 1 and using only the distributions for which you have tables, what is the lowest value the mean can take if we want to be sure that no more than 1% of the cookies are discarded? 3) Find P1 x 15 for the following distributions. If you must substitute one distribution for another, show that the substitution criterion is satisfied. a) Poisson with parameter of 2.5 (1) b) Poisson with parameter of 36 (2) c) Binomial with n 35 and p .02 . (2) d) Binomial with n 35 and p .2 (2) e) Binomial with n 50 and p .2 (1) f) Binomial with n 50 and p .55 (2) [13] g) Continuous uniform c 8 and d 13 (1) 4) Find Px 1 for the following. a) Hypergeometric N 16 , p .25 , n 5 (2) b) Hypergeometric N 600 , p .25 , n 5 (2) c) Geometric p .25 (1) [19 – 58.5] 6 251x0741 5/3/07 E. 1) Do the following confidence intervals. (Webster) Assume that a real estate developer wants to estimate the mean family income in an area where a mall is proposed. The developer assumes that the population standard deviation for income is $7200 a) If a survey of 100 families yields a sample mean of $35500, create a 90% confidence interval for the mean. (3) b) Find a confidence interval using the data in a) with a 32% confidence level. (2) c) Assume a 90% confidence level again and that the sample of 100 families comes from a neighborhood of only 1000 families. Repeat a) (3) d) (Extra Credit) Assume that your results in a) are correct and that there are 30000 families in the target area. Can you make the 90% confidence interval in a) into a confidence interval for total income? (2) d) Is the income found in a) significantly different from $36000? Why? (1) [9] e) A Business Week article claims that 25% of CEOs are ‘outsiders.’ A survey of 350 corporations is taken to check this ‘fact’ and the survey finds that 77 of the 350 firms have outsider CEOs. Create a 90% confidence interval for the proportion of firms that have outsider CEOs. Does this conflict with the statement in Business Week? Why? (4) 2) The amount of time a bank teller spends with each customer has a population mean 3.10 minutes and a population standard deviation 0.40 minute. a) If a random sample of 16 customers is selected from a large normally distributed population, what is the probability that the average (mean) time spent per customer is less than 3 minutes? (2.5) b) How would this change if the sample was taken from a population of only 144 customers? Give a specific answer! (2) c) If the teller puts in a 420 minute day, and the conditions in 2a) prevail, what is the probability of serving 150 customers or more? (2.5) [20] d) Under the circumstances in a), what is the probability that the time spent with 1 of the 16 customers is less than 3 minutes? (2) e) Under the circumstances in a), what is the probability that the time spent with each of the 16 customers in 2a) is less than 3 minutes apiece? (2) [26 – 84.5] 7 251x0741 5/3/07 F. 1) (Dunleavy) Assume that for every 100,000 people treated in an emergency room, 5,000 are misdiagnosed, and 1,000 misdiagnosed patients die. Overall, 90,000 of the 100,000 live through the experience. Define the following events MD misdiagnosed ; S survives ; MD not misdiagnosed and S dies. Don’t just guess, make a table. Distinguish clearly between conditional and joint probabilities. a) What is the probability that a patient will be diagnosed correctly? (1) b) What is the probability that a patient will be diagnosed correctly and live? (1) c) What is the probability that a patient will die given that they were diagnosed correctly? (2) d) Are the diagnosis and survival independent? (1) e) If a patient dies, what is the probability that the patient was not misdiagnosed? (2) [7] f) What is the value of P MD S ? 2) OK. I ran out of ideas, so here is a mini-Jorcillator problem. Here is the joint probability table and the table relating joint events to failure times. Failure of the phillinx in period 1, 2, 3, 4 are events A1, A2 , A3 , and A4 . Failure of the flubberall in period 1, 2, 3, 4 are events B1, B2 , B3 , and B4 . Failure of the jorcillator in period 1, 2, 3, 4 are events C1, C2 , C3 , and C 4 . a) Assuming that the Jorcillator lives as long as one of its components lives, fill in the second table with failure times of 1, 2, 3 and 4. (1) A1 A2 A3 A4 B1 B2 B3 B4 .20 .12 .04 .04 .40 .15 .09 .03 .03 .30 .10 .06 .02 .02 .20 .05 .03 .01 .01 .10 b) Find PC1 (1) B1 .5 .3 .1 .1 1.0 B2 B3 B4 A1 A2 A3 A4 c) Find PC 2 (1) d) Find PC3 (1) e) Find PC 4 (1) f) Find P C3 A3 (1) [13 – 97.5] 8 251x0741 5/3/07 Blank page for calculations. 9