500y0711 12/26/06 ECO251 QBA1 FIRST EXAM ECO500 Business Statistics Name: _____________________ Student Number : _____________________ Remember – Neatness, or at least legibility, counts. In all questions an answer needs a calculation or explanation to count. Show your work! Part I. (10 points) The following numbers are a sample of 20. 22, 11, 22, 23, 17, 25, 17, 20, 26, 14, 23, 15, 34, 12, 15, 11, 19, 16, 34, 24 Compute the following: Show your work! a) The Median (1) b) The Standard Deviation (3) c) The Interquartile Range (3) d) The Coefficient of variation (1) e) A measure of skewness (2) Solution: The table computations which could be needed for this problem follow. Obviously, no one needs to do all of these. Row 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 x 22 11 22 23 17 25 17 20 26 14 23 15 34 12 15 11 19 16 34 24 400 x in order x 2 11 11 12 14 15 15 16 17 17 19 20 22 22 23 23 24 25 26 34 34 x x x x 2 x x 3 x3 484 10648 121 1331 484 10648 529 12167 289 4913 625 15625 289 4913 400 8000 676 17576 196 2744 529 12167 225 3375 1156 39304 144 1728 225 3375 121 1331 361 6859 256 4096 1156 39304 576 13824 8842 213928 2 -9 2 3 -3 5 -3 0 6 -6 3 -5 14 -8 -5 -9 -1 -4 14 4 0 4 81 4 9 9 25 9 0 36 36 9 25 196 64 25 81 1 16 196 16 842 8 -729 8 27 -27 125 -27 0 216 -216 27 -125 2744 -512 -125 -729 -1 -64 2744 64 3408 x 400 x 400, x 8842, x 213928 so x n 20 20 . If we use definitional formulas x x 0 (a check), x x 842 and x x 3408. 2 To summarize 3 2 3 a) Median: position pn 1 0.521 10.5 ; x1 p xa .bxa 1 xa so x.5 x10 .5x11 x10 19 .520 19 19 .5 . b) Standard Deviation: s s2 x x 2 n 1 2 x 2 nx 2 n 1 8842 20 20 2 842 44 .3158 or 19 19 842 44 .3158 . So s 44.3158 6.6570. 19 1 500y0711 12/26/06 c) Interquartile range: To find the first quartile position pn 1 0.2521 5.25 . x1 p x a .bx a 1 x a so x.75 x 5 .25x 6 x 5 15 .2515 15 15 To find the third quartile position pn 1 0.7521 15.75 . x1 p x a .bx a 1 x a so x.25 x15 .75x16 x15 23 .7524 23 23 .75 . IQR x.25 x.75 23.75 15 8.75. d) Coefficient of variation: C e) Measure of skewness: n k 3 x 3 3x (n 1)( n 2) s 6.6570 3.3285 x 20 x 2 2nx 3 20 213928 320 8842 220 20 3 1918 20 213928 530520 320000 20 3408 199 .2984 or 1918 1918 k n x x 3 20 3408 199 .2984 or g1 33 199 .29843 0.6756 or k 3 (n 1)( n 2) 19 18 s 6.6570 3mean median 320 19 .5 0.225 . All these measures of skewness indicate that the SK 2 6.6570 std .deviation distribution is skewed to the right. The last two tell us that the skewness is relatively small. 2 500y0711 12/26/06 Part II. (At least 40 points. Parentheses give points on individual questions. Brackets give cumulative point total.) Exam is normed on 50 points. 1. For the sample above, find the two numbers that would be at the limits of an interval of 2.5 standard deviations from the mean x 2.5s . If these were population parameters instead of sample statistics, what would be the minimum fraction of the data between these two numbers according to Tchebyschev's rule? The Empirical rule says that about 98.8% of the data should be between these two points. How does the fraction of the data actually between these two points compare? (5) Solution: We have x 20 and s 44.3158 6.6570, so the interval in question is x 2.5s 20 2.56.6570 20 16.64 or 3.36 to 36.64. The numbers in order are 11 11 12 14 15 15 16 17 17 19 20 22 22 23 23 24 25 26 34 34 All are between 3.36 and 36.64. The Tchebyschev Rule says that the proportion onside the interval 1 1 16 % , which means at least 84% should be in the interval. There is should be no more than 2 6 . 25 2.5 no reason why it shouldn’t be more than 84%. As far as the empirical rule is concerned, though the percent in the interval seems close to 98.8%, the rule does not apply since we have already shown that the distribution is not quite symmetrical. 2. (Most of the following problems are due to Mansfield and Schmidt) As you leave a restaurant the attendant gives you a randomly selected coat from the coatroom. If 10% of the coats are worth $200, 40% are worth $100 and 50% are worth $50, show that you have a valid distribution and compute the expected value and standard deviation of the value of the coat you receive. (5) Solution: Computations are below. Row 1 2 3 Px xPx x 2 Px x 200 100 50 0.1 0.4 0.5 1.0 20 40 25 85 4000 4000 1250 9250 x x x x Px x x 2 Px 115 15 -35 11.5 6.0 -17.5 0.0 1322.5 90.0 612.5 2025.0 We have shown that this is a valid distribution because all probabilitites are between zero and one and their xPx 85 and E x 2 x 2 Px 9250 . This means sum is 1. The next two columns show x that the (population) variance is x2 E x 2 2 x 9250 85 2 2025 . Thus the (population) standard deviation is x 2025 45 . The last three columns are totally unnecessary but show that x Px 0 , which must be true if the mean is correct and, using the definitional formula, that x Px 2025. This serves as a check on our work. x 2 x 2 x 3. If the gross profit on each calculator sold is $5 and the fixed cost is $100000 per year, so that if the firm sells 50000 calculators, annual profit is 550000 100000 150000 , what is the expected value and standard deviation of the firm’s annual profit? (4) ( I should have said that average sales are $75000 units with a standard deviation of $30000.) [14] Solution: The formula relating profit, w, to sales is w 5x 100000 . The formula table says E ax b aEx b and Varax b a 2Varx . So E 5x 100000 5E x 100000 and Var5x 100000 5 2 Varx , so that the standard deviation is 5 times the standard deviation of x . If you were around when I finally remembered the numbers, this gives E w E 5x 100000 575000 100000 275000 Varw Var5x 100000 5 2 30000 2 . The standard deviation is thus 530000 15000 . 3 500y0711 12/26/06 4. If a population has a mean weight of 140 pounds and a standard deviation of 30 pounds, what is the mean and standard deviation of the total weight of 3 randomly selected people? (3) [17] Solution: The section on functions of random variables in the Syllabus Supplement says the following. “ If x1 , x2 , x3 , xn are random variables, then E x1 x 2 x3 x n E x1 E x 2 E x3 E x n ,” and “If x1 , x 2 , x3 , x n are independent random variables, then Varx1 x 2 x3 x n Varx1 Var x 2 Varx3 Var x n . So, if the individuals weights are x1 , x 2 and x3 , we have E x1 x 2 x3 Ex1 Ex 2 E x3 3(140 ) 420 . Var x1 x 2 x3 Var x1 Var x 2 Var x3 330 2 3900 2700 , so that Std.Devx1 x 2 x3 2700 51.962 . 5. Two cards are drawn at random from a deck of cards. Identify the following events: A1 an ace on the first draw and A2 an ace on the second draw. Find the following: (a) P A1 , (b) P A1 A2 , (c) P A2 A1 , (d) P A2 A1 , (e) P A1 A2 , (g) P A2 . Can you explain why P A1 A2 P A1 P A2 ? Repeat the exercise for the following events H 1 , a head on the first flip and H 2 , a head on the second flip. By contrasting P A1 A2 and PH 1 H 2 , show that only one of these pairs of events are independent. (8) Solution: The difference between A1 and A2 on one hand and H 1 and H 2 on the other is that the first pair is not independent and the second one is. Since there are for aces in the deck, we can write that 4 .07692 . If we get an ace on the first draw, there are only 3 left in the deck so we have (a) P A1 52 3 4 .05882 . (d) But P A2 A1 .07843 . By the multiplication rule P A1 A2 (c) P A2 A1 51 51 3 4 .00452 The easiest way to do P A1 A2 is to note that there (b) P A2 A1 PA2 A1 P A1 51 52 47 48 .85068 and that (e) P A1 A2 are 48 non-aces in the deck, so that P A2 A1 P A2 A1 P A1 51 52 47 48 1 P A2 A1 1 P A2 A1 P A1 1 1 .85068 .14932 . It is also probably worth noting 51 52 4! 43 C 24 C 048 2! 2! that P A2 A1 2 1 .00452 . (g)But we still haven’t found P A2 . There are 52! 52 51 C 252 50! 2! 2 1 several ways to do this, all requiring some thought. P A1 A2 P A1 P A2 P A1 A2 , which is, of course why P A1 A2 P A1 P A2 . Perhaps this is best explained by saying that A1 and A2 are not mutually exclusive . So .14932 .07692 P A2 .00452 and P A2 .14932 .07692 .00452 .07692 or P A2 PA2 A1 P A1 P A2 A1 P A1 3 4 4 48 12 192 4 .07692 . If you tried to work 51 52 51 52 51 52 52 A1 A1 A2 this with a joint probability table, you would have A2 3 4 51 52 48 4 51 52 4 52 4 48 51 52 47 48 51 52 48 52 or 4 500y0711 12/26/06 A1 A2 A2 A1 .0045249 .0723982 .0723982 .8506787 .0769231 . Somehow, it all adds up. But we can see clearly that .9230769 .0769231 ..9230769 1.00000 43 4 4 P A2 A1 P A1 P A2 . So it seems that A1 and A2 are not mutually exclusive and 52 51 52 52 not independent. Because PH 1 PH 2 PH 2 H 1 P H 2 H 1 .5 we can say PH 2 H 1 PH 2 H 1 PH 1 11 1 . S we have PH 1 .5 , PH 1 H 2 .25 , P H 2 H 1 .5 , 22 4 P H 2 H 1 .5 , PH 1 H 2 .5 .5 .25 .75 , PH 2 . PH 1 H 2 PH 1 PH 2 and these two events are not mutually exclusive but are independent. 6. Your significant other is supposed to pick you up at 5pm. The probability that said other remembered to buy gas is 60%. If said other remembered there is a 90% probability that you will be picked up on time. If buying gas was forgotten the probability that you will be picked up on time is 60%. No one shows at 5pm. What is the probability that your significant other forgot to buy gas? To keep us all together on this, let G be the event that gas is remembered and T be the event that you are picked up on time. There is partial credit her for finding the probability of events like T G or T G . (5) [30] Solution: We are given that PG .6, P T G .9 and P T G .6 . We are asked for P G T .We could , or we could do a box solution. If we can talk about 100 dates, SO PT G PG write Bayes’ rule P G T PT will have remembered 60 times and forgotten 40. Since P T G .9 , on the 60 times that SO remembers, SO will be on time 54 times. Since P T G .6 , on the 40 times SO forgets, SO will be on time .640 24 times. Lets work this out. G T T 54 60 G 24 40 . If we just fill in the blanks we get G T T 54 6 G 24 16 100 gives us G G T T 54 6 60 24 16 40 78 22 100 60 40 100 . Adding . So On the 22 occasions that SO is not on time there are 16 on which gas was forgotten. The probability is thus 16 .72727 . Lets try this formally P T G .6 , so P T G 1 .6 .4 22 that PT G 1 .9 .1 . So PT .1.6 .4P.4 .06 .16 .22 . So Bayes’ rule says PT G PG .4.4 .16 PG T .72727 .22 .22 PT P T P T G P T G P T G PG P T G P G . We know that P G 1 PG 1 .6 .4 and 5 500y0711 12/26/06 7. The formula for the probability of x successes in n tries is Px C xn p x q n x , where p is the probability of a success on 1 try and q is the probability of failure. For example, if your probability of success on 1 try is .20 and the probability of failure is .80, the probability of 3 successes on 7 tries is 7! .20 3 .80 4 . This may help you in the following: You roll one die and then toss a coin the C 37 p 3 q 4 3!4! number of times indicated by the die; for example if you roll a two you toss the coin twice, Let y be the number of heads you get. Find the possible values of y and their probabilities. Show that y has a valid distribution and find the probabilities of each value of y . Find E y and y . (6) [36] Solution: I guess we could start with the probabilities we need. y will range from 1 to 6. Let w be our result from the die. For example if y 3 , we can get three heads in 4 ways: w 3 and we get 3 heads in 3 tries, w 4 and we get 3 heads in 4 tries; w 5 and we get 3 heads in 5 tries; w 6 and we get 3 heads in 1 1 1 1 6 tries, The probability for this is C 33 .53 C 34 .54 C 35 .55 C 36 .56 6 6 6 6 1 5! 3 4 .55 6! .56 1 .12500 .25000 .31250 .31250 1 1.0000 .1667 . 1.5 4.5 6 2! 3! 3! 3! 6 6 We can read the probabilities in the brackets from table E.6 in the text. Let w represent the amount on the face of the die. w Sum P0 P 1 P2 P3 P4 P5 P6 .5000 .5000 1.0000 1 .2500 .5000 .2500 1.0000 2 .1250 .3750 .3750 .1250 1.0000 3 .0625 .2500 .3750 .2500 .0625 1.0000 4 .0312 .1562 .3125 .3125 .1562 .0312 0.9998 5 .0156 .0937 .2344 .3125 .2344 .0937 .0156 0.9999 6 0.9843 1.8749 1.5469 1.0000 0.4531 0.1249 0.0156 5.9997 Sum There is a slight rounding error in these sums, but they should be good enough. The probabilities on the table are, as far as the present problem is concerned, conditional probabilities. For example, P 1 head w 3 .3750 . To allow for the die, multiply the column sums by 1 6 . Row y 1 2 3 4 5 6 7 0 1 2 3 4 5 6 P y 0.1641 0.3125 0.2578 0.1667 0.0755 0.0208 0.0026 1.0000 yP y y 2 P y 0.0000 0.3125 0.5156 0.5001 0.3020 0.1040 0.0156 1.7498 0.0000 0.3125 1.0312 1.5003 1.2080 0.5200 0.0936 4.6656 y y y y Py y y 2 P y -1.7498 -0.7498 0.2502 1.2502 2.2502 3.2502 4.2502 -0.287142 -0.234313 0.064502 0.208408 0.169890 0.067604 0.011051 0 0.502441 0.175688 0.016138 0.260552 0.382287 0.219727 0.046967 1.603800 We have shown that this is a valid distribution because all probabilitites are between zero and one and their yP y 1.7498 and E y 2 y 2 Px 4.6656 . This sum is 1. The next two columns show y means that the (population) variance is y2 E y 2 y2 4.6656 1.74982 1.6038 . Thus the (population) standard deviation is y 1.6038 1.2664 . The last three columns are totally unnecessary but show that formula, that y P y 0 , which must be true if the mean is correct and, using the definitional y P y 1.603800. y 2 y 2 y 6 500y0711 12/26/06 8. As everyone knows, a Veeblefetzer (Actually, it should have been a Jorcillator) has two components, a Phillinx and a Flubberall. Depending on the design of the Veeblefetzer, the Veeblefetzer will work a) as long as both components work or b) as long as either component works. Let us assume that rule b) is in effect and that the life of a phillinx, x1 , is a random variable with a continuous uniform distribution between 0 and 4, so that the probability of it failing in the first year is P0 x1 1 . The life of a Flubberall, x 2 , is a random variable with a continuous uniform distribution between 0.5 and 2.7. What is the probability that the Veeblefetzer fails in Year 1? Year 2? After Year 2? If you will lose $2million if it fails in the first year, break even if it fails in the second year and make $5 million of it lasts beyond the second year, what is the standard deviation of your earnings? (5) [41] Solution: For the first part of the problem find the uniform distribution probabilities. Define the following events. Period Phillinx Flubberall Fails Fails Make a diagram for the Phillinx. Show a 1 rectangle that goes from zero to 4, with a A1 B1 height of .25 Shade the areas from zero to 1, 1 2 A2 B2 to 2, and above 2. 3 A3 B3 A1 P A1 1 0.25 .25 P A2 2 1.25 .25 A2 A3 P A3 4 2.25 .50 Note that these add to 1. Make a diagram for the Flubberall. Show a rectangle that goes from 0.5 to 2.7, with a height of 1 1 .45455 Shade the areas from 0.5 to 1, 1 to 2, and above 2. 2.7 0.5 2.2 PB1 1 0.5.45455 .22727 B1 B2 B3 PB2 2 1.45455 .45455 PB3 2.7 2.45455 .31818 Note that these add to 1. Now we need a joint probability table. Event B1 B2 A1 A2 A3 Sum Sum .05682 .05682 B3 .11364 .11364 .11364 .22728 .45455 .07955 .07955 .15909 .31818 .25 .25 .50 1.0000 .22727 Assume that the Veeblefetzer will work as long as one component works. Fill in the table with the period in which the Veeblefetzer will fail. Event Sum B1 B2 B3 A1 A2 A3 Sum Period 1 Period 2 Period 3 Period 2 Period 2 Period 3 Period 3 Period 3 Period 3 7 500y0711 12/26/06 Use these two tables to figure out the probability that the Veeblefetzer will fail in each period. Period 1 Component Joint Events A1 B1 Probability .05682 2 A1 B2 , A2 B1 , A2 B 2 3 A1 B3 , A2 B3 , A3 B1 , A3 B2 , A3 B3 .05682 + .11364 + .11364 = .28410 .11364 + .22728 + .07955 + .07955 + .15909 = .65911 These probabilities add to one, So we can figure out our return as in the previous problems. Computations are below. Row 6 7 8 -2 0 5 xPx x 2 P x x x -0.11364 0.00000 3.29555 3.18191 0.2273 0.0000 16.4778 16.7050 -5.18191 -3.18191 1.81809 P x x 0.05682 0.28410 0.65911 1.00003 x x Px x x 2 Px -0.29444 -0.90398 1.19832 0 1.52574 2.87639 2.17866 6.58078 x2 E x 2 x2 16 .7050 3.18191 2 6.5805 . Thus the (population) standard deviation is x 6.58078 2.565 . A1 B1 9. The following joint probability represents returns for two different stocks. B 2 B3 a)Find the following: P A1 B2 , P A1 B2 , P A1 B2 A2 A3 .05 .05 .20 .05 .30 .05 .20 .05 .05 The returns on stock 1 are A1 x1 10 % , A2 x1 20% and A3 x1 30% The returns on stock 1 are B1 x 2 5% , B 2 x 2 10% and B 3 x 2 15% b) Find E x1 , E x 2 , x21 , x22 , x x , x x 1 2 1 2 c) If your return is x1 x 2 , find the mean and standard deviation of the return. d) If your return is P1 x1 P2 x 2 and P1 P2 1 , find the proportions P1 and P2 that will lead to a minimum risk. (9) Solution: a) Complete the joint probability table as follows. A1 A2 A3 sum B1 .05 .05 .20 .30 B 2 .05 .30 .05 .40 From this we see P A1 B2 .05 , B3 .20 .05 .05 .30 .30 .40 .30 1.00 P A1 B2 P A1 PB2 P A1 B2 .30 .40 .05 .65 PA1 B2 P A1 B2 .05 .125 P B 2 .40 8 500y0711 12/26/06 b) E x1 x 2 .05.10 .05 .05.20 .05 .20 .30 .05 x1 x 2 Px1 , x 2 .05.10 .10 .30 .20 .10 .05.30 .10 .20 .10 .15 .05.20 .15 .05.30 .15 .00050 .00300 .00600 .00150 .01850 . .00150 .00225 .00025 .00050 .00300 We can now use the following tableau to compute the means and variances of x1 and x 2 . y .05 .10 .15 Px1 x1 Px1 x12 Px x1 .20 .05 .30 .05 .10 .05 .05 .20 .30 .20 .05 .05 .30 .40 .30 .0300 .0800 .0900 .0030 .0160 .0270 Px 2 x 2 Px 2 x 22 Px 2 .30 .015 .00075 .40 .040 .00400 .30 .045 .00675 1.00 .200 .046 .100 .01150 Px 1 (a check), Ex x Px 0.200 , E x x Px 1 , E x Px 0.100 and E x x Px 0.01150 To summarize 2 2 1 x2 x1 2 1 1 2 2 2 2 2 1 2 2 2 2P x 2 0.046 , 2 x1x2 Covx1 x2 Ex1 x2 x1 x2 0.01850 0.2000.100 0.0015 Ex x21 E x11 x21 0.046 0.2002 0.006 and x22 x1x2 2 2 x1x2 x1 x2 2 x2 0.01150 0.1002 .0015 . So that 0.0015 0.006 0.0015 0.0015 2 0.2500 .5 . 0.006 0.0015 c) The outline says a. Ex y Ex E y and Var x y x2 y2 2 xy Var x Var y 2Covx, y So E x1 x 2 0.200 0.100 0.300 and Varx1 x 2 0.006 .0015 2.0015 .0045 . 9 500y0711 12/26/06 d) The outline says If R P1 R1 P2 R2 and P1 P2 1 , then E R P1 E R1 P2 E R2 and VarR P12VarR1 P22VarR2 2P1 P2 CovR1 , R2 P12 0.006 1 P1 2 .0015 2P1 1 P1 0.0015 P12 0.006 1 2P1 P12 .0015 2P1 2P12 0.0015 0.006 0.0015 .0030 .0030 .0030P1 .0015 .0105P12 .0060P1 .0015 . Using Calculus this hits a minimum when 2.0105 P1 .0060 or P1 .2857 . P12 To find this by groping around, note the following. If P1 0 , .0105P12 .0060P1 .0015 .0015 . Std deviation is .0387. If P1 0.1 . .0105P12 .0060P1 .0015 .0105 0.01 .0060 0.1 .0015 .000105 .00060 .0015 = .001005. Std deviation is .0317. If P1 0.2 . .0105P12 .0060P1 .0015 .0105 0.04 .0060 0.4 .0015 .00042 .0012 .0015 = .000720. Std deviation is .0268. If P1 0.3 . .0105P12 .0060P1 .0015 .0105 0.09 .0060 0.3 .0015 .000945 .0018 .0015 =.000645. Std deviation is .0254. If P1 0.4 . .0105P12 .0060P1 .0015 .0105 0.16 .0060 0.4 .0015 .00168 .0024 .0015 =.00078. Std deviation is .0279. If P1 0.5 . .0105P12 .0060P1 .0015 = .001125. Std deviation is .0335. If P1 0.6 . .0105P12 .0060P1 .0015 = .001680. Std deviation is .0410. If P1 0.7 . .0105P12 .0060P1 .0015 = .002445. Std deviation is .0494. If P1 0.8 . .0105P12 .0060P1 .0015 = .003420. Std deviation is .0585. If P1 0.9 . .0105P12 .0060P1 .0015 = .004605. Std deviation is .0679. If P1 1.0 . .0105P12 .0060P1 .0015 = .00600. Std deviation is .0775. We find a minimum between P1 0.2 and P1 0.3 . If you now try P1 0.20, P1 0.21 etc,, you should find the lowest between .28 and .29, which is good enough. [50] 10