251y0131 11/19/01 Open in page layout ECO251 QBA1 THIRD EXAM NOVEMBER 20, 2001 NAME: _____KEY_________ SECTION ENROLLED: MWF TR 10 11 12:30 (Circle both days and time separately or just write down days and time) Throughout this exam show your work! Please indicate clearly what sections of the problem you are answering! If you are following a rule like E ax aEx , please state it! If you are using a formula, state it! If you answer a 'yes' or 'no' question, explain why! Part I. Do all the Following (10+ Points) Penalty for not doing question 1. Show your work! (Keller, Warrack simplified) A manufacturing firm has recorded its total machine time (in thousands of hours) ( electrical power costs (in $1000s) ( y ) over 12 months. x ) and its Month Hours Costs x 1 2 3 4 5 6 7 8 9 10 11 12 Note that 6 9 8 7 10 10 5 7 6 9 8 11 96 y 0.76 1.00 0.89 0.88 1.07 1.03 0.70 0.91 0.75 0.95 0.87 1.11 10.92 y 0.910000 , s 2y 0.0168001 . Please don’t do these again! Compute the following: 1. The sample variance for the hours (4). 2. The sample covariance between the hours and the cost (3) 3. The sample correlation between the hours and the cost (2) 4. Interpret the correlation (1) Answers to questions 5) and 6) must be based on the results in questions 1-4. Do not recompute the answers after changing x or y ! y column rise by 20% (i.e. to 5. If there is a retroactive energy cost rise of 20%, so that all the numbers in the 6. .912 , 1.20, 1.068, 1.056 etc.), what would the variance, covariance and correlation computed above be? (1.5) If, instead, a monthly charge of $200 ($0.2 thousand) is added to all the numbers in the y column (i.e. they become 0.96, 1.29, 1.09, 1.08 etc.), what would the variance, covariance and correlation computed above be? (1.5) Solution: 1) From the computations below s x2 3.45455 (so that s x 1.85864 ). 2) From the computations below s xy 0.233637 3) From the computations below rxy .9698 . 4) Since the sign is positive, x and y tend to move together. Since rxy2 .94 , on a zero to one scale, this is a strong relationship. We would be justified in saying that hours of use enable us to predict costs very well. Note that r 2 1 is impossible as is a negative variance. 1 251y0131 obs 1 2 3 4 5 6 7 8 9 10 11 12 11/19/01 x x2 y y2 6 36 0.76 0.5776 9 81 1.00 1.0000 8 64 0.89 0.7921 7 49 0.88 0.7744 10 100 1.07 1.1449 10 100 1.03 1.0609 5 25 0.70 0.4900 7 49 0.91 0.8281 6 36 0.75 0.5625 9 81 0.95 0.9025 8 64 0.87 0.7569 11 121 1.11 1.2321 96 806 10.92 10.1220 x 2 806 , x 96 , xy 4.56 9.00 7.12 6.16 10.70 10.30 3.50 6.37 4.50 8.55 6.96 12.21 89.93 y 10 .92 , x s x2 x 96 8.000 n x 12 2 nx 2 n 1 806 12 8.000 2 11 3.45455 y y 10.92 0.9100 s 2y n 12 y 2 ny 2 n 1 10 .1220 12 0.9100 2 11 0.01680 y 2 10 .1220 and xy 89.93 Note that the y and y 2 columns were not needed! I can't think of a bigger waste of effort except recomputing y in parts 5 and 6. s xy rxy xy nx y 89.93 128.0000 0.9100 2.5700 0.233637 n 1 s xy sx s y 11 11 0.233637 so 0.9698 3.45455 0.016800 5) There is a retroactive energy cost rise of 20%, so that all the numbers in the y column rise by 20%. Varcy d c 2Var y , so Var 1.2 y 0 1.22 Var y 1.22 0.0168 0.0242 . Also, Cov(ax b, cy d ) acCov( x, y) , so Cov( x 0, 1.2 y 0) 11.2Cov( x, y) 11.20.233637 0.2804 and if w ax b and v cy d , wv signac xy . signac sign11.2 , so the correlation between the two is unchanged at 0.9698. 6) Instead, a monthly charge of $200 ($0.2 thousand) is added to all the numbers in the y column. Varcy d c 2Var y , so Var y 0.2 12 Var y 12 0.0168 0.0168 . Also, Cov(ax b, cy d ) acCov( x, y) , so Cov( x 0, y 0.2) 11Cov( x, y) 110.233637 0.233637 and if w ax b and v cy d , wv signac xy . signac sign11 , so the correlation between the two is 0.9698. In short, adding 0.2 to y has no effect. 2 251y0131 11/19/01 Part II. Do the following problems ( do at least 40 points ). Show your work! 1. The following table represents the joint probability of x and y . x 0 4 .1 y 8 0 10 .1 4 0 .1 8 .1 0 0 .1 12 .1 .2 a. b. Fill in the missing number. (1) Are x and y independent? Why?(2) c. Compute xy , the covariance of x and y , and interpret it. (3) d. Compute xy , the correlation of x and y , and interpret it. (3) e. Find the probability that f. (i) Find the distribution of x y is less than 11. (1) x y . (2) x y .(3) Solution: a) the missing number must be .2 because the numbers inside the table must add to 1. b) Check for independence: First you need to find Px and P y . Look at the upper left hand (ii) Using only the results of a)-d), find the mean and variance of probability below. Its value is .1 and it represents Px 0 y 4 . If x and y were independent , we would have Px 0 y 4 Px 0 P y 4 .4.2 . 8 . Since this is not true, we can say that x and y are not independent. A faster way to do this is to note that the zeroes are not parts of rows or columns of zeroes. c) Below, we find that xy Covxy Exy x y 32 84 0 - meaning: see d) d) Below, it says that xy xy x y 0 0 . This means that there is no linear relationship 22 .4 43 .2 between x and y . x 4 y 8 10 Px xPx x 2 Px E xy .2 12 P y yP y y 2 P y .2 .4 1.6 6.4 .1 .2 1.6 12 .8 .2 .4 4.0 40 .0 .5 1.0 4.0 59 .2 1.6 6.0 8.0 0 1.6 12 .8 72 86 .4 0 4 .1 0 0 .1 .1 0 .2 .1 0 0.4 8 .1 0 .1 Px 1 , E x xPx 8.0 , E x x Px 86 .4 , P y 1 , E y yP y 4.0 and E y y P y 59 .2 To summarize x 2 2 2 2 y .10 4 04 4 .18 4 .212 4 xyPxy 008 .148 088 .112 8 .1010 0410 .1810 .212 10 0 3.2 9.6 0 0 3.2 0 9.6 32 .0 0 0 8 24 3 251y0131 11/19/01 xy Covxy Exy x y 32 84 0 , y2 E y 2 y2 59.2 42 43.2 . x2 E x 2 x2 86.4 82 22.4 and So that xy xy x y 0 0. 22 .4 43 .2 ( x 4.73286 , y 6.57287 ) Note that 2 1 is impossible as is a negative variance. e) Repeat the joint probability table, and, in a second table, show the sums of x and y . Note that 8 is repeated, so that the probability of 8 is Px y 8 .2 0 .2. x 0 4 .1 . y 8 0 10 .1 x 4 0 .1 8 .1 0 0 .1 12 .2 .1 .2 0 4 8 12 4 4 0 4 8 y 8 8 12 16 20 10 10 14 18 22 It is easiest to do f(i) first. If you look at the table below Px y 11 P4 P0 P4 P8 P10 .1 0 .1 .2 .1 .5 f(i)) x y x y Px y Px y -4 .1 14 0 0 0 16 0 4 .1 18 .1 8 .2 20 .1 10 .1 22 .2 12 .1 It would be a good idea to take out the zero probabilities and get the table below. Note that you were not supposed to do the last 2 columns, but that they serve as a check on f(ii)), verifying that E x y 12 and Var x y E x y 2 E x y 2 209 .6 12 2 65 .6. x y -4 4 8 10 12 18 20 22 Px y .1 .1 .2 .1 .1 .1 .1 .2 1.0 x y Px y -0.4 0.4 1.6 1.0 1.2 1.8 2.0 4.4 12.0 x y 2 P x y 1.6 1.6 12.8 10.0 14.4 32.4 40.0 96.8 209.6 f(ii)) Ex y Ex E y x y 8 4 12 and Var x y x2 y2 2 xy Var x Var y 2Covx, y 22 .4 43 .2 20 65 .6 4 251y0131 11/19/01 2. A warehouse will accept a lot of jorcillators if a randomly picked box of 50 jorcillators contains no more than two defective jorcillators. Assume that, in fact, 2% of jorcillators in the lot are defective. a) Without using a table, figure out the probability that no jorcillators in a box of 50 will be defective. (2) b) Without using a table, figure out the probability that no more than two jorcillators in a box of 50 will be defective. (3) c) Using our test for use of the Poisson distribution to show that it is appropriate to use the Poisson distribution, find an approximate answer to a) using a table. (2) d) Find an approximate answer to b) using a table. (2) e) Assume instead that jorcillators come in boxes of 1250. What is the probability that the number of defective jorcillators in a box will be between 11 and 20? (3) f) Again consider a box of 1250 jorcillators. We will reject the hypothesis (that the defect rate for jorcillators is less than or equal to 2%) if the number, x 0 , of defective jorcillators in a box is so large that Px x 0 .05 . Using a table and the mean that you used is e), what is the minimum number of defective jorcillators in a box that would lead to rejection of the hypothesis? (3). Solution: a) Binomial (because we are taking 50 items in succession with a constant probability of getting n! a defective item.) p .02, n 50 . Px C xn p x q n x , q 1 p , C xn . Both n x ! x! q and p must be positive and not above 1. P0 C 050 p 0 q 50 50 ! .02 0 .98 50 11.364169 .36417 50 ! 0 ! b) Px 2 P0 P1 P2 C050 p 0 q 50 C150 p1q 49 C250 p 2 q 48 .36417 50 ! .02 1 .98 49 50 ! .02 2 .98 48 49 !1! 48 ! 2 ! .36417 50 .02 .37160 502149 .0004 .379185 .36417 .37160 .18580 .92157 n 50 2500 is above 500, use the Poisson distribution table with parameter c) Since p .02 m np 50.02 1. Px 0 .36788 . d) From the same table e) Since Px 2 .91970. n 1250 62500 is above 500, use the Poisson distribution table with parameter p .02 m np 1250 .02 25 . P11 x 20 Px 20 Px 10 .18549 .00059 .1849 . f) Again m 25. We want the minimum number x 0 , so that Px x 0 .05 . This is the same as saying Px x 0 .95 . If we check the Poisson table we find that Px 33 .95022 is the first value above 95%. So. If there are 33 or more defective jorcillators in the box, we reject the hypothesis. 5 251y0131 11/19/01 3. A company produces camcorders. In any batch of camcorders 80% will last five or more years without a repair, 20% will need repairs. a. If I take 5 camcorders from a batch of 20, what is the probability that exactly four will need a repair before 5 years pass? (3) b. If I take 5 camcorders from a batch of 20, what is the probability that at least one will need a repair before 5 years pass. (2) c. What is the expected value and standard deviation of the number that will need repairs before 5 years pass. (2) d. Assume instead that the batch from which I take the 5 camcorders is a batch of 1000. Now find the probability that exactly four will need a repair before 5 years pass without using the Hypergeometric distribution. (1) e. Assume that the batch from which I take the 5 camcorders is a batch of 1000. Now find the probability that at least one will need a repair before 5 years pass without using the Hypergeometric distribution. (2) f. What is the mean and variance of the number in e) that will need repairs before 5 years pass? (2) g. In d) and e) I said not to use the Hypergeometric distribution. This was to make your computations easier. Explain what criterion I had to satisfy to be able to do this. (1) h. Assume that the batch of camcorders is 10000. How many camcorders must I take (I took 5 before) to be able to use the Poisson distribution to find the probability that at least one needs a repair? (2) Solution: This problem is basically a hypergeometric problem (because we are taking a sample of 5 items from a population of only 25, so that the probability of a defective item keeps changing). C M C N M N n Px x Nn x , where M Np , np and 2 npq . N 1 Cn a) N 20 , n 5, M Np 20.20 4 . P4 b) Px 1 1 P0 1 C 04 C 516 C 520 C 44 C116 C 520 116 5 4 3 2 1 16 .00103 . 20! 20 19 18 17 16 15!5! 16! 1 16 15 14 13 12 11! 5! 1 1 .7183 . 20! 20 19 18 17 16 15! 5! 20 5 15 N n 5.20 .80 0.8 .63158 . c) np 5.20 1 and 2 npq 20 1 19 N 1 0.63158 .7947 . d) Use the binomial distribution when N is large compared with n . p .2, n 5 . Px C xn p x q n x , q 1 p , C xn n! . From the table n x ! x! P4 Px 4 Px 3 .99968 .99328 .0064 . Or P4 C 45 p 4 q 1 5.24 .8 .0064 . e) With or without the table Px 1 1 P0 1 .85 1 .32768 .67232 . f) np 5.20 1 and 2 npq 5.20.80 0.800 or 1000 5 995 N n 0.8 0.7944 . So .89 . 5.20 .80 npq N 1 1000 1 999 2 g) The condition to replace The Hypergeometric by the bimomial is N 20 n 20 5 100 . Since N 1000 , we have satisfied this condition. n n 500 to get n 500 .2 100 We can use the 500 . If p .2, we can solve h) We need .2 p Poisson distribution if the sample size is above 100, as long as it is not above 5% of the population. 6 251y0131 11/19/01 4. I sell camcorders for $300 apiece. My variable cost to produce a camcorder is $150 a unit and my fixed costs are $2000 a month. Thus, if I sell 20 camcorders, my revenue will be $300 (20) = $6000 and my costs will be $150 (20) + $2000 = $5000 and I will have profits of $1000. Assume that the mean number of units sold per month are 25 and that sales follow the Poisson distribution . Let x represent my sales. a) If it takes a month to make a camcorder, what is the minimum number of camcorders that I should keep on hand to be sure that I have a probability of no more than 1% of running out before I can produce more? (2) b) What is the probability that I will sell between 20 and 30 camcorders in a month? (2) c) What is the probability that I will sell less than 10 in a month? (1) d) Write an equation for my costs in terms of x and find the mean and standard deviation of my costs. (3) e) Using an equation for revenue as well as your equation for costs find the covariance and correlation between revenues and costs. (4) f) Using only your means and variances for costs and revenues and the covariance you found, find the mean and variance of my profits. Remember, profits are revenues minus costs. (3) Solution: For a) - c) all we need is a Poisson table with a parameter of 25. a) The first number we can use on the table is 37, since Px 37 .99079 , and 37 is the smallest number with a cumulative probability above 99%. This means that Px 37 1 .99079 .00921 (below 1%). b) The table says P20 x 30 Px 30 Px 19 .86331 .13357 .72974 . c) Px 10 Px 9 .00022 . d) As explained in the problems, the Outline says that if w ax b and v cy d , E w aEx b, E v cE y d , Varw w2 Varax b a 2Varx, Varv v2 Varcy d c 2Var y , Cov(w, v) Cov(ax b, cy d ) acCov( x, y) and Corr u, v signacCorr x., y . In problem K3 we found Covx, x Varx and Corr x, x 1 . For costs let v 150 x 2000 , so that c 150 and d 2000 . Remember that Varx E x x 25 , since the mean and variance are equal in the Poisson distribution.. Then, if we replace y by x in the formulas, Ev cEx d 150 25 2000 5750 and Varv v2 Varcx d c 2Varx 150 2 25 562500 . the standard deviation is v 562500 150 25 750 . e) For revenue, we need w 300 x, so that a 300 and b 0. Using the formulas above or the results of problem K3, we see that Cov(w, v) Cov(ax b, cx d ) acCov( x, x) acVarx 300 150 25 1125000 . Corr u, v signacCorrx, x sign300 150 1 sign45000 1 1. f) Using the formulas above, Ew E300 x 300 Ex 300 25 7500 and Var w 300 2 25 2250000 From the outline or the syllabus supplement, we know that we can add and subtract means and that Var(ax cy) a 2Var( x) c 2Var( y) 2acCov( x, y) This means that E w v E w E v 7500 5750 1750 and Var(w v) Var(w) Var(v) 2Cov(w, v) 2250000 562500 21125000 562500 . To confirm this try Pr ofit 300x 150x 2000 150x 2000. So EPr ofit 150 25 2000 1750 and Var Pr ofit 150 2 25 562500 . 7 251y0131 5. 11/19/01 a. Identify the distribution in the problem, give and explain its mean and give its standard deviation. Do not answer the question in the problems in part a.! Example: An average of 200 cars go through a toll booth each hour, what is the probability that over 150 go through the toll booth in a half hour? Solution: Poisson (with a parameter of 100) . The mean is 100 , the average number of cars that go through the booth in a half hour when the mean is 200. 100 10. Do not tell me the probability that over 150 go through the booth! (i) 14 customers enter my store. Each one has a 22% chance of buying something. What is the chance that at least one buys something? (3) (ii) Each customer that enters my store has a 22% chance of buying something. What is the chance that the 5 th customer is the first to buy? (3) (iii) In a package of 50 condoms, 8% are defective. If I sell 3, what is the chance that 2 are defective? (3) A jorcillator has two components, a phillinx and a flubberall. As long as both are working the jorcillator will not be junked. If either fails the jorcillator must be junked. From here on you are actually doing the problems! b. If a phillinx has a lifespan approximated by a uniform distribution between 3 and 13 years, find: (i) The mean and standard deviation of its life. (2) (ii) The probability that it will last between 0 and 5 years (2) (iii) The probability that it will last between 5 and 10 years (1) (iv) The probability that it will last more than 10 years (1) c. If a flubberall has a lifespan approximated by a uniform distribution between 5 and 20 years, do (i) b(ii)-b(iv) above for the flubberall (2) and then (assuming that the lifespan of the two components is independent) find: (ii) The probability that the jorcillator will last between 0 and 5 years. (2) (iii) The probability that the jorcillator will last between 5 and 10 years. (3) (iv) The probability that the jorcillator will last more than 10 years. (2) Solution: a) (i) 14 customers enter my store. Each one has a 22% chance of buying something. What is the chance that at least one buys something? (3) Binomial Distribution with p .22 , n 14 . q 1 p 1 .22 .78 . np 14.22 3.08 , the average number out of 14 customers that will buy something. 2 npq 3.08.78 2.4024 . npq 2.4024 1.5500 . (ii) Each customer that enters my store has a 22% chance of buying something. What is the chance that the 5th customer is the first to buy? (3) 1 1 4.54545 , the number of the Geometric Distribution with p .22 , p .22 average customer who buys first - in other words, on the average the fourth or fifth q .78 16 .1157 so customer to enter the store will actually buy something. 2 2 p .22 2 16.1157 4.0144. (iii) In a package of 50 condoms, 8% are defective. If I sell 3, what is the chance that 2 are defective? (3) Hypergeometric Distribution with p .08 , n 3 , N 50 . np 3.08 0.24, the average number that will be defective out of 3. N n 50 3 npq 0.24 .92 0.2118 so 0.2118 0.4602 . N 1 50 1 2 8 251y0131 11/19/01 A jorcillator has two components, a phillinx and a flubberall. As long as both are working the jorcillator will not be junked. If either fails the jorcillator must be junked. From here on you are actually doing the problems! b. If a phillinx has a lifespan approximated by a uniform distribution between 3 and 13 years, find: (v) The mean and standard deviation of its life. (2) (vi) The probability that it will last between 0 and 5 years (2) (vii) The probability that it will last between 5 and 10 years (1) (viii) The probability that it will last more than 10 years (1) c. If a flubberall has a lifespan approximated by a uniform distribution between 5 and 20 years, do (i) b(ii)-a(iv) above for the flubberall (2) and then (assuming that the lifespan of the two components is independent) find: (ii) The probability that the jorcillator will last between 0 and 5 years. (2) (iii) The probability that the jorcillator will last between 5 and 10 years. (3) (iv) The probability that the jorcillator will last more than 10 years. (2) Solution: b) For the Phillinx: c 3, d 13 Make a diagram. Show a box between 3 and 13 with a height of 1 d c 133 0.1 . Shade the areas between 3 and 5, between 5 and 10 and between 10 and 13. 1 c d 3 13 8 2 2 (i) 2 d c2 13 32 12 12 8.3333 8.3333 2.8868 Let A be the event that the Phillinx lasts 0 - 5 years. P A P0 x 5 5 3 1 2 . 10 10 1 5 (iii) Let B be the event that the Phillinx lasts 5 - 10 years. PB P5 x 10 10 5 . 10 10 (iv) Let C be the event that the Phillinx lasts more than 10 years. 1 3 PC Px 10 13 10 10 10 Note that these three probabilities must add to 1. (ii) c) (i) For the Flubberall: c 5, d 20 . Make a diagram. Show a box between 5 and 20 with a height of 1 1 d c 205 15 . Shade the areas between 3 and 5 (There is none!), between 5 and 10 and between 10 and 13. Let D be the event that the Flubberall lasts 0 - 5 years. PD P0 y 5 0 1 Let E be the event that the Flubberall lasts 5 - 10 years. PE P5 y 10 10 5 1 1 =.3333 15 3 1 2 Let F be the event that the Flubberall lasts more than 10 years. PF P y 10 20 10 15 3 =.6667 (ii) - (iv) A joint probability table follows on the left with the unions of joint events that make up the events that were specified summed to the right. Note that the probabilities sum to one and are not the same as on the 1998 exam. D E F Event Component Joint Events Probability Fails in P A D A E A F 2 4 2 2 4 6 A 0 0 00 .200 30 30 10 0-5 30 30 30 B D C D 10 5 B 0 5 30 C 0 0 3 30 1 3 30 6 30 2 3 10 3 10 1 Fails in 5-10 Fails in 10+ Sum PB E B F C E PC F 5 10 3 30 30 30 6 30 18 .600 30 6 .200 30 1.000 9 251y0131 11/19/01 Extra Credit: The main computer we use for payroll crashes on the average twice a month. It has just crashed. This means that the average waiting time before the next crash will be 1/2 of a month. Assume that a month consists of 4 weeks (A week is .25 months.) a. What is the probability that the time before the next crash will be less than 2 weeks? (3) b. What is the probability that the time before the next crash will be between 3 and 4 weeks? (2) c. What is the standard deviation for this distribution? (1) Solution: You were told in advance that this section would use the exponential distribution. In the cx 1 exponential distribution from the outline F x 1 , when the mean time to a success is . c 1 Note that this is only for x 0 . There is no probability below zero. c 1 1 a) Since .5, c 2 . Px 0.5 F 0.5 1 e2.5 1 e1 1 .3679 .6321 .5 b) P.75 x 1.0 F 1.0 F 0.75 1 e21.0 1 e20.75 e1.5 e2 .2231 .1353 .0878 c) Since 1 , 2. c 10