12/13/00 251y0043 ECO251 QBA1 Name key FINAL EXAM Class ________________ DECEMBER 15, 2000 Hint: Though you should never give 2 answers to a problem, never cross out the answer to a problem that you have given up on. I may see something worth partial credit in the answer. Part I. Do all the Following (20 Points) Make Diagrams! A. z ~ N (0,1) 1. Pz 2.93 Pz 0 P0 z 2.93 .5 .4983 .0017 Diagram! These are expected and will be provided in class for version 2. Half of you didn't read this question: You assumed that it said Pz , because that was on last year's exam. 2. P3.42 z 0.19 P3.42 z 0 P0.19 z 0 .4997 .0753 .4244 3. P1.07 z 1.07 2P0 z 1.07 2.3577 .7154 4. P1.07 z 2.57 P0 z 2.57 P0 z 1.07 .4949 .3577 .1372 5. The 93rd percentile of the distribution. z .07 is the point with a probability of .07 above it or .93 below it. Since 93% is below this point and 50% is below zero, from the diagram P0 z z.11 .4300 . The closest we can come to this probability using the Normal table is P0 z 1.48 .4306 So z .07 1.48 . Note that probabilities cannot be above 1 or below 0. 1 12/13/00 251y0043 B. x ~ N 1.2 , 5 . Remember z x . As usual, people made diagrams of x with zero in the middle. Make up your mind! If you are diagramming x , put the mean in the middle; if you are diagramming z put zero in the middle. 2.93 1.2 1. Px 2.93 P z Pz 0.35 Pz 0 P0 z 0.35 .5 .1368 .3632 5 0.19 1.2 3.42 1.2 z 2. P3.42 x 0.19 P P 0.92 z 0.28 5 5 P 0.92 z 0 P 0.28 z 0 .3212 .1103 .2109 1.07 1.2 1.07 1.2 z 3. P1.07 x 1.07 P P 0.45 z 0.03 5 5 P0.45 z 0 P0.03 z 0 .1736 .0120 .1616 2.57 1.2 1.07 1.2 z 4. P1.07 x 2.57 P P 0.03 z 0.27 5 5 P0 z 0.27 P0 z 0.03 .1064 .0120 .1184 5. The 93rd percentile of the distribution.. x.07 is the point with a probability of .07 above it or .93 below it. Because it is above the 50th percentile, which is the mean for the Normal distribution, the point we want will be above 1.2. On the previous page we found z .07 1.48 . . So x.97 z.97 1.2 1.485 8.60 . To check to see if this is correct: Px 8.60 8.60 1.2 P z Pz 1.48 Pz 0 P0 z 1.48 .5 .4306 .9306 .93 5 2 12/13/00 251y0043 II. (4 points-2 point penalty for not trying .) Show your work! We are investigating the cost of a business trip for people in the financial industry. The data for a sample of 6 are below. x 1433 1225 1433 1573 1333 941 Compute the sample standard deviation, s . Show your work. (4) Solution: x 1433 1225 1433 1573 1333 941 7938 1 2 3 4 5 6 x 7938 1323 x n 6 s x2 x 2 nx 2 n 1 x2 2053489 1500625 2053489 2474329 1776889 885481 10744302 10744302 61323 2 242328 48465 .6 5 5 s x 48465 .6 220 .1490 How about those jokers who are still trying to compute x x 2 by computing x x ! It 2 didn't work last term and it won't work this term. I don't encourage using definitional formulas, but, if you insist on using them, I will be happy to give you a personal tutorial on how to do it. They are generally harder to use than computational formulas. 3 12/13/00 251y0043 III. Do at least 4 of the following 6 Problems (at least 12 each) (or do sections adding to at least 48 points Anything extra you do helps, and grades wrap around) . Show your work! Please indicate clearly what sections of the problem you are answering! If you are following a rule like E ax aEx please state it! If you are using a formula, state it! If you answer a 'yes' or 'no' question, explain why! If you are using the Poisson or Binomial table, state things like n , p or the mean. Avoid crossing out answers that you think are inappropriate - you might get partial credit. 1. a. Find z.085 (2) b. If x ~ N 18, 5 , find a symmetrical interval around the mean with a probability of 83%. (2) c. If x ~ N 18, 5 , find F 22 .6 (2) d. If computer parts last an average of eighteen months, with a standard deviation of 5, what is the probability that 15 of them will all last less than 22.6 months. (2) e. If a sample mean is taken from a sample of 15 from a normal population with a mean of 18 and a standard deviation of 5, what is the standard deviation of the sample mean? (2) f. What is the probability that the sample mean in 5) above is less than 22.6? (2) g. If sample mean of 14.3 comes from a sample of 15 from a normal population with a population standard deviation of 5, do an 83% confidence interval for the population mean. (3) Solution: From the problem statement x ~ N 18, 5 , i.e. 18 and 5 . a) z.085 is the point with a probability of .085 above it or .915 below it. It is the 91.5 percentile of z. From the diagram P0 z z.085 .4150 . the closest we can come to this probability using the Normal table is P0 z 1.37 .4147 . So z.085 1.37 . Make a diagram! b) A symmetrical interval around the mean with a probability of 83%: We can split the 83% into two halves of 41.5%, on either side of zero. Make a diagram! From the diagram, we want two points z.085 and z .915 z .085 so that Pz.915 z z.085 .8500 . And, since we found z.085 1.37 in part a), our interval for z is -1.37 to 1.37. The interval for x can then be written x z.075 18 1.37 5 18 6.85 or 11.15 to 24.85. 24 .85 18 11 .15 18 z To check this P11 .15 x 24 .85 P 5 5 P1.37 z 1.37 2P0 z 1.37 2(.4147 ) .8294 83% 22 .6 18 c) F 22 .6 Px 22 .6 P z Pz 0.92 Pz 0 P0 z 0.92 .5 .3212 .8212 5 15 15 0 d) Binomial n 15 , p .8212 -- P15 C15 p q .8212 15 .05209 e) 15 , n 15 , 5 . So x x n 5 1.29099 . 15 22 .6 18 f) Px 22 .6 P z Pz 3.56 Pz 0 P0 z 3.56 .5 .4998 .9998 1.29099 g) Given: x 14.3 , n 15 , 5 . Step 1: Confidence level is 83%.; significance level is 5 1.29099 . 1 .83 .17 . Step2: z 2 z.085 1.37 . Step 3: x x n 15 Step 4: x z x 14.3 1.37 1.29099 14.3 1.77 or 12.53 to 16.06. More formally 2 P12.53 16.06 83% 4 12/13/00 251y0043 2. According to Ken Black, a survey was taken asking how long a group of firms has done business with India. (You may assume that the underlying distribution is normal.) A random sample of 27 responses gave a mean of 10.655 years. a. Assume that the population standard deviation is known to be 7.3, construct a 98% confidence interval for the population mean.(4) b. Assume that the population standard deviation is unknown and that 7.3 is the sample standard deviation instead. Construct the 98% confidence interval again. (4) c. Assume that once again the population standard deviation is unknown and that the sample mean of 10.655 and the sample standard deviation of 7.3 are taken from a sample of 27 that is part of a population of 300. Construct the 98% confidence interval again. (4) d. Assume that we use a smaller sample and a higher confidence level. What would happen to the confidence interval and why? (2) Solution: x 10.655 . For all these sections the confidence level is 98% and the significance level is 1 .98 .02 . n 27 . Repeat after me! " z goes with (sigma - population variance); t goes with s (sample variance)!" 7.3 1.4049 . a) From the t-table, z z.01 2.327 . x x 2 n 27 x z x 1`0.655 2.327 1.4049 10.655 3.2692 or 7.3558 to 13.8942. More formally 2 P7.3558 13.8942 98% s 7.3 26 1.4049 b) The degrees of freedom are n 1 27 1 26 . tn1 t.01 2.479 s x x 2 n 27 Putting this all together x t n1 s 10.655 2.479 1.4049 10.655 3.5921 or 7.0629 to 14.2471. 2 x 26 More formally, P7.0629 14.2471 .98 . If your exam said a 90% interval here, use t .05 1.706 . c) The population size, N 300 , is less than 20 times the sample size n 27 so s x sx n N n N 1 300 27 1.4049 0.9555 1.3424 and x tn1 s x 10 .655 2.479 1.3424 2 300 1 27 10.655 3.3279 or 7.3271 to 13.9829. More formally, P7.3271 13.9829 .98 . If your exam 7.3 300 27 1.4818 0.9555 1.4159 . 27 300 1 d) The effect of a smaller sample is to decrease the denominator in the standard error of the s mean, s x . This makes the standard error larger and since the size of the confidence interval is n proportional to the standard error, the interval is also larger. The effect of a higher confidence level is to lower the significance level and thus to require a larger value of t or z . This also makes the confidence interval larger. said the sample standard deviation was 7.7, use s x 7.7 5 12/13/00 251y0043 3. Describe the meaning and give the probabilities for the problems below. Example: For the Hypergeometric Distribution, P1 x 2 when N 18, n 5, and M 9 is the probability of between 1 and 2 successes when a sample of 5 is taken from a population of 18 where there 9! 9! 9! 9! C19 C 49 C 29 C 39 1!8! 4!5! 2!7! 3!6! are 9 successes in the population. P1 x 2 .48529 . Use tables 18 18! 18! C 518 C5 5!13! 5!13! where appropriate! a. Binomial P3 x 8 , when p .20 and n 13 (2) b Binomial P4 x 10 , when p .80 and n 13 (2) c Poisson P3 x 8 , when m 18 (2) d Geometric P4 x 10 , when p .20 (2) e. No meaning is needed for the following: (i) Binomial Px 7 , when p .20 and n 13 (1) (ii) Geometric Px 7 , when p .20 (1) (iii) Binomial Px 7 , when p .20 and n 13 (1) (iv) Geometric Px 7 , when p .20 (1) (v) Poisson Px 7 , when m 18 (1) (vi) Continuous Uniform P5 x 11 when c 0 and d 9 (2) (vii) Normal x ~ N 18, 5 . P x 500 when n 25 (2) Solution: a. Binomial Distribution with p .20 and n 13 The probability of between 3 and 8 successes in 13 tries when the probability of success on a single try is .20 is P3 x 8 Px 8 Px 2 .99983 .50165 .49818 b. Binomial Distribution with p .80 and n 13 . The probability of between 4 and 10 successes in 13 tries when the probability of success on a single try is .80 is P4 x 10 . It can't be done directly with tables that stop at p .5 , so try to do it with failures. (The probability of failure is 1 - .80 = .20.) 4 successes correspond to 9 failures out of 13 tries. 10 successes correspond to 3 failures. So try 3 to 10 successes when p .20 and n 13 . P3 x 9 Px 9 Px 2 .99998 .50165 .49833 c. Poisson Distribution with parameter of 18 (parameter of 18 means 18 m 2 ) The probability of between 3 and 8 successes in a unit of space or time when the average number of successes is 18 is P3 x 8 Px 8 Px 2 .00706 .00000 .00706 d. Geometric Distribution with p .20 The probability that the first success will occur between try 3 and try 8 when the probability of success on any one try is .20 is P4 x 10 . Remember that F c Px c 1 q c , because success at try c or earlier implies that there cannot have been failures on the first c tries. q 1 p 1 .20 .80 . P4 x 10 Px 10 Px 3 F 10 F 3 1 .8010 1 .803 .80 .80 3 10 .51200 .10737 .40463 6 12/13/00 251y0043 e. No meaning is needed for the following: (i) Binomial Distribution with p .20 , n 13 . Px 7 1 Px 6 1 .99300 .00700 (ii) Geometric Distribution with p .20 Px 7 1 Px 6 1 1 .806 .806 .26214 (iii) Binomial Distribution with p .20 , n 13 . Px 7 Px 7 Px 6 .99875 .99300 .00575 or P7 C713 p 7 q 6 C713.20 7 .80 6 .00576. (iv) Geometric Distribution with p .20 Px 7 q x 1 p .80 6 .20 .05242 or F 7 F 6 Px 7 Px 6 1 .807 1 .806 .79028 .73786 .05243 (v) Poisson Px 7 , when m 18 Px 7 .0018500 (directly from table). (vi) Continuous Uniform P5 x 11 when c 0 and d 9 Make a diagram! f x 1 1 1 1 . P5 x 11 P5 x 9 4 .44444 d c 90 9 9 x 500 when. 15 , n 25 , 5 . So x 500 20 , P x 500 Px 20 1.00 . and since x n 25 (vii) Normal x ~ N 18, 5 . P x x n 5 25 20 18 P z Pz 2.00 Pz 0 P0 z 2.00 .5 .4772 .8772 1 f. Extra Credit! A company designs washing machines to go an average of eight years without a major breakdown. (exponential distribution). (i) What is the probability that the machine lasts more than 2 years? (2) (ii) What is the probability that the machine lasts longer than 1 year? (1) (iii) What is the standard deviation of the time that it lasts? (1) (iv) ( Difficult!!!) If the company offers a warranty against major breakdowns and wants to be sure that the no more than 20% of the machines break down in the warranty period, what is the longest time for which the company could offer the warranty? (4) I'll do it if you do it! For one answer, see 251y0042. 7 12/13/00 251y0043 4. Assume that P A .1 and PB .3 b. If A and B are mutually exclusive what is (i) P A B , (ii) PA B , (iii) PA B ? (3) c. If PB A .2 what is (i) P A B , (ii) PA B , (iii) PA B ? (3) a. If A and B are independent what is (i) P A B , (ii) P A B , (iii) P A B ? (3) d. Make a joint probability table of A, B, A and B , assuming that P B A .4 (3) e. A drug rehabilitation program has a 25% success rate. After completing the program, George is given a drug test and fails it. If the drug test says that non-drug- users are drug users 1% of the time (a false positive) and the drug test also fails to detect drugs in a drug user 5% of the time (a false negative), what is the chance that George is still using drugs? (4) Solution: Note: A is the complement of A ; P A 1 P A . a) Independence means P A B P A , so that P A B P APB . This implies (i) P A B P APB . What will it take to convince you that if A and B are independent, P A B 0 ? Since P A .1 and PB .4 , P A B P APB .1.3 .03 . So (ii) P A B P A .1 and P A B P A PB P A B .1 .3 .03 .37 (iii) P A B table. A B B 1 P A B 1 .37 .63 . One of the best ways to see this is with a joint probability A B becomes P B B P A B P A B P A P A 1 P B P A B P A B it add up we get B B A A .03 .27 .07 .63 .1 .9 .3 .7 A A .03 .3 .7 .1 .9 , because of independence. If we just make 1. 0 . P A B P A B P A B P A B .03 .27 .07 .37 . 1 .0 P A B .63 can be read right off the table. And, if you really need it, PA B b) Mutual exclusiveness means P A B 0 . (i) So P A B P A PB P A B .1 .3 0 .4 (ii) PA B (iii) P A B P A B . P B P A B 0 PB 1 P A B 1 .4 .6 . The joint probability table starts out as B A A .0 .7 B .1 .9 because of the mutual exclusiveness and becomes B B A A 0 .3 .1 .6 .1 .9 .3 1.0 .3 .7 1 .0 P A B P A B P A B P A B 0 .3 .1 .4 . P A B .6 can be read right off the table. 9 points of this question were repeated from the hour exam with the numbers only slightly 8 changed. Nevertheless most of you made exactly the same mistakes again. You are supposed to learn from your mistakes! 9 12/13/00 251y0043 c) P B A .2 means that P A B PB A P B A P A .2.1 .02 . So (i) P A B P A PB P A B .1 .3 .02 .38 (iii) P A B (ii) P A B 1 P A B 1 .38 .62 . The table reads B B P A B .02 .0667 PB .3 A A .02 .28 .08 .62 .1 .9 .3 .7 1.0 d) P B A .4 means that P A B PB A P B A P A .4.1 .04 . If we put .04 in the upper leftB hand corner and fill in the blanks, we get B A A .04 .26 .06 .64 .1 .9 .3 .7 1.0 The following laws have not been repealed: The Addition Rule: P A B P A PB P A B The Multiplication Rule: P A B P A B PB PB A P B A PB e) A drug rehabilitation program has a 25% success rate. (If S is the probability that George kicked drugs PS .25 .) After completing the program, George is given a drug test and fails it. If the drug test says that non-drug- users are drug users 1% of the time (a false positive) (If P is the event that the test gives a positive P P S .01 ) and the drug test also fails to detect drugs in a drug user 5% of the time (a false negative) ( P P S .05 ), what is the chance that George is still using drugs? I somehow, don’t think that anyone got to this without Bayes' rule. P PS P S PSP P P P PS P S P P S P S PP S PS P P S 1 P P S 1 .05 .95 so P S P If you used a table for this S S P P .0250 .2250 .7125 .7375 .0375 .2625 . From the above P S 1 PS 1 .25 .75 and .95.75 .7125 .9661 .95.75 .01.25 .7125 .025 .25 .75 1.00 10 12/13/00 251y0043 5. a. A real estate office sells 1050 houses in 300 working-day year. (Poisson distribution - if you approximate it by another distribution tell why you can do it.) (i) What is the mean and variance of the number of houses sold in a day? (2) (ii) What is the probability that at least one house is sold in a day? (2) (iii) What is the probability that more than 5 houses are sold in a day? (2) (iv) What is the probability that more than 100 houses are sold in a 25 working-day month? (3) 5. b. The probability that a buyer is unable to get a mortgage is .25. In a month in which offers are made on 30 houses, what is the probability that at least one mortgage will be denied? (i) Do this problem using the Binomial distribution (2) (ii) Can the solution to this problem be approximated by the Poisson or Normal distribution? Test for both and use both, one or neither distribution to solve the problem depending on the results of your tests. (4) Solution: Because this involves a situation in which you are given the average number of events per unit space or time this would normally be considered to be a Poisson problem. a) (i) Since the period involved is one working day, the mean is m 1050300 3.5 . This is also the variance since the mean and variance are identical for the Poisson distribution. (ii) From the Poisson table for a mean of 3.5 , Px 0 1 Px 0 1 .03020 .96980 (iii) From the same table Px 5 1 Px 5 1 .85761 .14239 (iv) The mean is now m 3.525 87.5 . Because the mean is over 20, use the Normal 100 .5 87 .5 PP x 100 PN x 100 .5 P z 87 .5 Pz 1.39 .5 ..4177 .0823 (DIAGRAM?) b) (i) Binomial Distribution with p .25 , n 30 . distribution. Px 1 1 P0 1 C 030 p 0 q 30 1 .25 0 .75 30 1 .00075 .99925 (ii) Test for Normal distribution: np 30.25 7.5 5 , nq n np 30 7.5 22.5 5 . We can use the Normal distribution because these are both above 5. n 30 120 500 . We can't use the Poisson distribution because Test for the Poisson distribution: p .25 the ratio is below 500. If we use the Normal distribution with np 30.25 7.5 and 2 npq 7.5.75 5.625 , 0.5 7.5 PB x 1 PN x 0.5 P z Pz 2.95 .5 .4984 .9984 5.625 11 12/13/00 251y0043 6. a. The columns below state the Dow Jones Industrial Average(in hundreds) x and the 6-month treasury bill rate y . x y According to Minitab, the mean and standard 22 8.36 deviation for the Dow are respectively 39.75 and 27 7.51 14.49. The mean and standard deviation for the t29 5.42 bill rate are 5.420 and 1.810. This was a gift! Why did so many 32 3.45 46 3.02 x2 , x, y of you compute 44 5.51 y2 ? and 56 5.02 62 5.07 (i) Compute the sample covariance between the Dow and the t-bill rate. (4) (ii) Compute and interpret the correlation between the two quantities. (3) (iii) If the t-bill rate was 1 point lower for all the values shown here (i.e. 8.36 was 7.36, 7.51 was 6.51 etc.), what would the values of the standard deviation for the t-bill rate, the covariance and the correlation be? The only accepted answer here will be values found by using the formulas for the variance, covariance and correlations of functions of x , and the values of the standard deviation, covariance and correlation found in (i) and (ii). (3) Solution: x y xy n8 22 8.36 183.92 x 39 .75, s x 14 .49 27 7.51 202.77 y 5.42 , s y 1.810 29 5.42 157.18 32 3.45 110.40 xy 1631 .09 (i) 46 3.02 138.92 44 5.51 242.44 xy nx y 1631 .09 839 .75 5.42 56 5.02 281.12 s xy n 1 7 62 5.07 314.34 1631.09 92 .47 13 .2100 7 (ii) rxy s xy sx s y 13 .21 .5037 . Since rxy2 .254 is not very large on a 0 to 1 scale, we cannot say 14 .49 1.810 that this is a strong relationship. However, it seems quite likely that interest rates and the Dow move inversely. (iii) According to the syllabus supplement, if w ax b , and v cy d , w2 Varw a 2Varx a 2 x2 , v2 Var v c 2Var y c 2 y2 , Covw, v wv acCovx, y ac xy and wv signac xy In this case w x and v y 1 , so a 1 , b 0 , c 1 and d 1 . Since we are working with sample statistics, we can substitute s v2 for v2 , s xy for xy , rxy for xy etc. so the new variance is s v2 Varv 12 Var y s 2y , the new covariance is Covw, v s wv 11Covx, y s xy and the new correlation is rwv sign11rxy rxy . In other words, all quantities are unchanged. 12 12/13/00 251y0043 b. Find the missing value in the following joint probability table and compute the population covariance and correlation between x and y and the variance of x y . (5) x 2 7 4 .6 y 9 .1 .3 Solution: Since the numbers in the table already add to 1, the missing number must be zero. x y Px xPx x 2 Px 2 4 .6 9 .1 .7 1.4 2.8 14 .7 x E x E xy yP y y 2 P y 2.4 9.6 To summarize 3.6 32 .4 6.0 42 .0 2 2 Px 1, P y 1, 17 .5 xPx 1.4 , E x x y E y 2 7 Px 0 .6 .3 .4 .3 1.0 2.1 0.7 2 Px 22 .2 , y E y P y 42 .0 .624 xyPxy .129 yP y 6.0 and 0 7 4 4.8 0 12 .3 .3 7 9 1.8 18 .9 xy Covxy Exy x y 12.3 0.76.0 8.1 , x2 E x 2 x2 17 .5 0.72 17 .01 and y2 E y 2 y2 42.0 6.02 6.00 . So that xy xy x y 8.1 .80178 . 17 .01 6.00 ( x 4.12432 , y 2.44949 ) Var x y x2 y2 2 xy Varx Var y 2Covx, y 17.01 6.00 28.1 6.81 13