251distrex3 1/19/07 (Open this document in 'Page Layout' view!) Finding Probabilities for the Normal Distribution when the Distribution is not Standardized We already know from the previous section (251disrtex2) that any probability for a Normally distributed variable (the standard notation to say that a given distribution is Normal with a certain mean and standard deviation is ~ N , ) can be found using the Standardized Normal distribution by using the transformation z x . This means that if x ~ N 50, 100 and we want P200 x 250 , we replace values of x 50 x with z . We thus say: 100 250 50 200 50 P200 x 250 P z P1.50 z 2.00 100 100 e. P2.00 x 35.00 f. P0 x 1.00 Since there is no particular reason to repeat the problems from class, here are some that I had available that seem to cover all the bases. Assume that x ~ N (2,9) . Do the following: a. P0.50 x 4.00 b. P0.41 x 0.41 g. P 3.00 x 1.00 h. Find probabilities for the following intervals: Below -3.4, 3.4 x 0.7 , 0.7 x 2.0 , 2.0 x 4.7 , 4.7 x 7.4 and above 7.4. Make diagrams. c. F 0.24 Px 0.24 (Cumulative) d. F 3.00 Px 3.00 Solution: Material in italics below is a description of the diagrams you were asked to make or a general explanation, and the written description will not be part of your solution. General comment - I can't give you much credit for an answer with a negative probability or one above 1 because there is no such thing!!! . 4.00 2 0.50 2 z P 0.17 z 0.22 a. P0.50 x 4.00 P 9 9 P 0.17 z 0 P0 z 0.22 .0675 .0871 .1546 For x ~ N (2,9) make a Normal curve centered at 2 and shade the area from .50 to 4; for z , make a Normal curve centered at zero and shade the area from -0.17 to 0.22. Since the area in either diagram is on both sides of the mean, you add. 0.4 f 0.3 0.2 0.1 0.0 -4 -3 -2 -1 0 x 1 2 3 4 Note that on all these graphs, the x axis should be labeled z and a vertical line should be added at zero. 251distrex3 1/19/07 (Open this document in 'Page Layout' view!) 0.41 2 0.41 2 z P 0.27 z 0.18 b. P0.41 x 0.41 P 9 9 P0.27 z 0 P0.18 z 0 .1064 .0714 .0350 In this problem many students made diagrams showing, correctly, a mean of 2, but then showing 0.41 above it. Also, I got many probabilities like P0 z 0.20 , which don't make any sense because -0.20 is below zero and this says that it is above zero. For x ~ N (2,9) make a Normal curve centered at 2 and shade the area from -0.41 to 0.41; for z make a Normal curve centered at zero and shade the area from – 0.27 to -0.18. Since the area in either diagram is on one side of the mean, you subtract. Like all diagrams in these examples, the diagram below is for z, since most students prefer to make z diagrams rather than x diagrams. A z diagram is always centered at zero. An x diagram is always centered at the population mean (2 in this case).Students who did this problem either (a) assumed that if -0.41 became -0.27, then +0.41 would become +0.27 or (b) didn’t get control of their calculators and got +0.41 – 2 for their first calculation. Doing 0 – 0.41 – 2 might help. 0.4 f 0.3 0.2 0.1 0.0 -4 -3 -2 -1 0 x 1 2 3 4 251distrex3 1/19/07 (Open this document in 'Page Layout' view!) 0.24 2 Pz 0.20 Pz 0 P0.20 z 0 c. F 0.24 Px 0.24 P z 9 .5000 .0793 .4207 . This is the Cumulative distribution. ( is a notation for ‘equal by definition’) We already know from the previous section (251disrtex2) that since the zero is the halfway point under the curve, Pz 0 .5 and Pz 0 .5 . For x ~ N (2,9) make a Normal curve centered at 2 and shade the area below 0.24; for z make a Normal curve centered at zero and shade the area below -0.20. Since the area in either diagram is on one side of the mean, you subtract. Moral: For a cumulative distribution F x 0 Px x0 when x 0 is below the mean, change x 0 to z 0 , and subtract P0 z z 0 (from the standardized normal table) from 0.5. 0.4 f 0.3 0.2 0.1 0.0 -4 -3 -2 -1 0 x . 1 2 3 4 251distrex3 1/19/07 (Open this document in 'Page Layout' view!) 3.00 2 Pz 0.11 Pz 0 P0 z 0.11 d. F 3.00 Px 3.00 P z 9 .5000 .0438 .5438 This is the Cumulative distribution. ( is a notation for ‘equal by definition’) For x ~ N (2,9) make a Normal curve centered at 2 and shade the entire area below 3.00; for z , make a Normal curve centered at zero and shade the area below 0.11. Since the area in either diagram is on both sides of the mean, you add. Moral: For a cumulative distribution F x 0 Px x0 when x 0 is above the mean, change x 0 to z 0 , and add P0 z z 0 (from the standardized normal table) to 0.5. 0.4 f 0.3 0.2 0.1 0.0 -4 -3 -2 -1 0 x 1 2 3 4 251distrex3 1/19/07 (Open this document in 'Page Layout' view!) 35 2 2.00 2 z P0 z 3.67 P0 z 3.67 .4999 e. P2.00 x 35.00 P 9 9 There are two surprises in this problem. First 3.67 is not on the usual standardized table and, second, one value of z is zero. For x ~ N (2,9) make a Normal curve centered at 2 and shade the area above 2.0; for z , make a Normal curve centered at zero and shade the area above 0. Since the area in either diagram starts at the mean, you imply look up P0 z 3.67 .This would also work for something like P0.50 x 2.00 . To find 3.67, if you have my table, look at the bottom of the table. It is reproduced below. Since 3.67 is between 3.62 and 3.89, we say P0 z 3.67 .4999 . If you don’t have my table and you know that probabilities between zero and any number above 4 are .5000 and that the conventional table will tell you that P0 x 3.09 .4990 , you can usually make a pretty good guess. For values above 3.09, see below If z0 P0 z z 0 is is between 3.08 3.11 3.14 3.18 3.22 3.27 3.33 3.39 3.49 3.62 3.90 and and and and and and and and and and and 0 1 3.10 3.13 3.17 3.21 3.26 3.32 3.38 3.48 3.61 3.89 up .4990 .4991 .4992 .4993 .4994 .4995 .4996 .4997 .4998 .4999 .5000 0.4 f 0.3 0.2 0.1 0.0 -4 -3 -2 -1 x 2 3 4 251distrex3 1/19/07 (Open this document in 'Page Layout' view!) 1 2 0 2 z P 0.22 z 0.11 f. P0 x 1.00 P 9 9 P 0.22 z 0 P 0.11 z 0 .0871 .0438 .0433 Students who did this problem often failed to change zero to -0.22. For x ~ N (2,9) make a Normal curve centered at 2 and shade the area from 0 to 1.00; for z , make a Normal curve centered at zero and shade the area from -0.22 to 0.11. Since the area in either diagram is on both sides of the mean, you subtract. 0.4 f 0.3 0.2 0.1 0.0 -4 -3 -2 -1 0 1 2 3 4 x 1.00 2 3.00 2 z P 0.56 z 0.11 g. P 3.00 x 1.00 P 9 9 P 0.56 z 0 P 0.11 z 0 .2123 .0438 .1685 0.4 f 0.3 0.2 0.1 0.0 -4 -3 -2 -1 0 x 1 2 3 4 251distrex3 1/19/07 (Open this document in 'Page Layout' view!) h. Find probabilities for the following intervals: Below -3.4, 3.4 x 0.7 , 0.7 x 2.0 , 2.0 x 4.7 , 4.7 x 7.4 and above 7.4. This type of problem is a preface to a procedure commonly used in Economics 252 called the Kolmogorov – Smirnov test or the Chi-squared test, or, in the case with the Normal distribution where x and s are known rather than and and are used in their place, the Lilliefors test. They are based on the fact that the sums of the differences squared between the proportion of points on intervals in a hypothesized distribution and the proportion of the points on the same interval in a sample from the actual distribution have a well known distribution. To start, we divide the hypothesized distribution into intervals and figure out their probabilities. You can, of course, do each of these separately, but a mass production technique based on the cumulative distribution is somewhat faster, though I am waiting for one of my students to invent one that is more efficient. The way this is done is that you take your values of x ~ N (2,9) above in x2 your first column, convert them to z in the second column, compute F z as in c.) and d.) above 9 in the third column, and difference the third column into the 4 th column by subtracting each number in the column except the first one from the number above it. For x ~ N (2,9) make a Normal curve centered at 2 and shade the areas below-3.4, from -3.4 to -0.07, -0.07 to 2.0, 2.0 to 4.7, 4.7 to 7.4 and above 7.4; for z make a Normal curve centered at zero and shade the areas below -0.6, between -0.6 and -0.3 etc. In either case, you will notice that the areas that you marked off are symmetrical about the mean, which means that the probabilities above the mean are unnecessary. x Row 1 -3.4 2 -0.7 3 2.0 4 4.7 5 7.4 6 F z z -0.6 -0.3 0.0 0.3 0.6 .5 .5 .5 .5 .5 1 - .2257 = .2743 - .1179 = .3821 = .5000 + .1179 = .6179 + .2257 = .7757 = 1.0000 Probability .2743 .3821 .5000 .6179 .7757 1.0000 - .2743 .3821 .5000 .6179 .7757 = = = = = = .2743 .1078 .1179 .1179 .1078 .2743 So Px 3.4 F 3.4 .2743 , P3.4 x 0.7 .1078 , P0.7 x 2.0 .1179 , P2.0 x 4.7 P0.7 x 2.0 .1179 , P4.7 x 7.4 P3.4 x 0.7 .1078 and Px 7.4 1 F 7.4 Px 3.4 F 3.4 .2743 .