Understanding Probability Tables and the Excel Probability Functions

advertisement
Understanding Probability Tables and the Excel Probability Functions
In class we have talked about the TAIL and the BODY of a probability
distribution. For example, the first chart below is the binomial distribution from the
lecture where we have chosen the tail area to be the outcomes 6 and ABOVE. That is, 6
is the CRITICAL VALUE in this example because it defines the point where the tail area
begins.
One source of confusion is that probability tables are organized in different ways.
For the binomial distribution, the Ramanathan book reports the TAIL
PROBABILITY for the associated critical value. So for this example the tail probability
(i.e., the tail area) is .0498. This is the value in Table A.6., p. 596, n=8, p=0.4, x=6.
Excel does something different. Excel reports the probability in the BODY, not
the tail. The body of the distribution in this example goes up to a maximum outcome of
5, because the tail begins at 6. So the tail area in Excel would be calculated as 1 –
P(body). Specifically, we would calculate it in a cell as =1–BINOMDIST(5,8,.4,1). Note
the last parameter is 1 to give the cumulative probability of the entire body. If the last
parameter instead was 0 then BINOMDIST would provide only the probability of getting
5 exactly.
Binomial Distribution
p = 0.4, n = 8
30.0%
25.0%
Probability
20.0%
15.0%
10.0%
upper tail
5.0%
0.0%
0
1
2
3
4
5
6
7
8
Successes
The normal distribution table unfortunately is even more confusing. In the chart
below the red area is the tail area for z  1.645. That is, 1.645 is the critical value.
Ramanathan in Table A.1 does NOT report the tail probability. He doesn’t report
the entire body probability either. He gives the probability from 0 to the critical value.
Because the standard normal curve is centered at 0, we know that the probability from 0
to infinity is 0.5. The tail probability therefore equals 0.5 minus the value in the table.
Since Table A.1 gives a table value of 0.45 (the midpoint between 1.64 and 1.645), the
tail probability is 0.05.
Ramanathan presents a different normal table in the inside cover of the book (the
table called the percentage points of the t-distribution). Remember I said in class that a tdistribution with infinity degrees of freedom is the same as a normal distribution. If you
look on the last row of this table you will also find 1.645. The very top row gives the
probability in the upper tail, i.e., 0.05.
Excel is a more consistent because it gives the body probability. So a cell with
=NORMSDIST(1.645) gives the value 0.95, which is the total cumulative probability up
to this critical value. The tail probability is then 1 minus this value. In one step, the tail
probability would be =1–NORMSDIST(1.645). [NORMSDIST assumes a standard
normal, with mean 0 and std. deviation 1).]
0.450
0.400
0.350
0.300
f(z)
0.250
0.200
0.150
tail area
0.100
0.050
2.95
2.78
2.61
2.44
2.1
2.27
1.93
1.76
1.59
1.42
1.25
1.08
0.91
0.74
0.4
0.57
0.23
-0.1
0.06
-0.3
-0.5
-0.6
-1
-0.8
-1.1
-1.3
-1.5
-1.6
-2
-1.8
-2.2
-2.3
-2.5
-2.7
-3
-2.8
0.000
z
The moral is you have to be careful when you read tables or use software (different
programs may report tail probs instead of cumulative probs, etc.) The key thing is to
have a good mental picture of the difference between the tail and the body. As we saw in
the gladiolus bulb problem, it is easy to get confused. Unfortunately, this is the way the
world is. But you now have the tools to figure it out!
Download