here - BCIT Commons

advertisement
MATH 2441
Probability and Statistics for Biological Sciences
Calculating Probabilities: I
Equally-likely Simple Events and Branching Diagrams
Some of the most complicated problems in all of mathematics has to do with the determination of
probabilities. However, most of our work with probabilities in this course falls into just a few well-defined
situations which can be handled using one of a few relatively simple approaches.
Recall the long-run relative frequency definition of probability described in the preceding document:
Pr( A) 
m
N
where
N = the number of times the experiment is repeated
and
m = the number of times the event A occurs in those N repetitions
This defines Pr(A) well if as N becomes very large in principle, the ratio m/N becomes more and more
constant (apologies for the non-technical language here!).
This definition can be exploited in a variety of ways to come up with probabilities of practical value. In this
document, we look at a few simple examples to introduce some useful methods.
Equally-Likely Simple Events
In many situations, the outcomes of an experiment can be expressed in terms of a set of simple events
which are all equally likely because of the nature of the experiment. In such a case, if there are N equally
likely simple events, then we know that each one of them must have a probability of 1/N. Furthermore, if we
have a compound event A which corresponds to m distinct simple events, then Pr(A) = m/N (the sum of m
terms, each equal to 1/N). This reduces the problem to one of counting how many simple events there are
in A -- not necessarily an easy task in some cases -- but at least something to start with.
The counting process is often facilitated by what we will call a branching diagram, which illustrates the tally
of events in an experiment. We will illustrate the basic methodology with several simple examples.
Example 1: Tossing Fair Coins
By a "fair coin" we mean one which has the same
likelihood of falling tails as of falling heads. To develop
a list of all possible outcomes of an experiment in which
two distinct coins are tossed, we proceed as follows.
Outcomes
H
HH
T
HT
H
TH
T
TT
H
The first coin can fall either heads up (H) or tails up (T)
 the two outcomes having the same likelihood. When
the second coin is tossed, it can likewise result in H or
T, with equal likelihood. This two-coin experiment is
pictured in the branching diagram to the right. Writing
the overall outcome of the experiment as a string of H's
and T's to indicate how each coin in the sequence fell,
we see from this picture that there is exactly four distinct
outcomes to an experiment in which two coins are
flipped:
T
First Coin
Second Coin
HH meaning the first coin falls heads up and the second coin falls heads up
HT meaning the first coin falls heads up and the second coin falls tails up
TH meaning the first coin falls tails up and the second coin falls heads up
David W. Sabo (1999)
Calculating Probabilities: I
Page 1 of 11
and
TT meaning the first coin falls tails up and the second coin falls tails up.
How do we know there are just these four outcomes and that we've gotten all possible outcomes? Well, the
first coin can fall only H or T, no other possibilities for it. In this diagram, this shows up as the list with the
two possibilities {H, T} in a vertical column above the label "first coin." The dotted branching lines on the
extreme left of the diagram indicate that these two outcomes are the only ones that can happen in passing
from the state "no coins flipped" to the state "the first coin is flipped."
Then, whatever the outcome for the first coin, the second can either fall H or T. In the diagram, this is
shown by attaching an {H, T} branch to each of the two possible outcomes for the first coin.
Each path through the diagram from "no coin flipped" to an outcome for the last coin flipped corresponds to
a way in which the overall two-coin-flip experiment can occur. In fact, each outcome in the right-most
column corresponds to a distinct way in which the two-coin experiment can occur. Because of the way this
diagram is constructed, we know that there are no other ways the two-coin experiment can occur.
Furthermore, at each branch point, the probability of the two paths are equal (because of our definition of a
"fair" coin). Thus, each path through this diagram has the same probability. Thus, the four simple events
{HH, HT, TH, TT} are equally likely, and so each must have a probability of 1/4 or 0.25:
Pr(HH) = 0.25,
Pr(HT) = 0.25,
Pr(TH) = 0.25,
Pr(TT) = 0.25
It is now easy to see how to obtain probabilities for
an experiment in which we flip three coins. We
just need to attach an {H, T} branch to each
outcome for the second coin above, which gives
the diagram to the right. Every outcome of the
two-coin experiment leads to two possible
outcomes in a three-coin experiment, for a total of
eight. Since each path at a branch is equallylikely for fair coins, the eight distinct outcomes of
the three-coin experiment are themselves equally
likely, and so each has a probability of 1/8. Thus,
we can write
HHH
H
H
T
HHT
H
HTH
T
HTT
H
THH
T
THT
H
TTH
T
TTT
T
H
T
Pr(HHH) = 1/8 or 0.125
Pr(HHT) = 1/8 or 0.125,
T
etc.
H
First Coin
Second Coin
Third Coin
Each of the eight outcomes, {HHH, HHT, HTH,
HTT, THH, THT, TTH, TTT} are simple events.
We cannot express any of them as a set of one or more of the other seven. They are also mutually
exclusive, since if the three coins fall in one of the eight patterns, they cannot have fallen in any of the other
seven patterns.
An example of a compound event is the event:
A = two coins fall heads up when three coins are flipped
because
A = {HHT, HTH, THH}
That is, A represents a set of three of the simpler events from our list. From the third property we listed for
probabilities in the previous document, we know that since HHT, HTH, and THH are mutually exclusive
simple events, each with probability 0.125, then
Pr(A) = Pr(HHT) + Pr(HTH) + Pr(THH) = 0.125 + 0.125 + 0.125 = 0.375
The probability of getting two coins heads up when three coins are flipped is 0.375. This means that if we
flip a set of three coins repeated, then approximately 37.5% of the time, the coins will fall showing two
heads.
Page 2 of 11
Calculating Probabilities: I
David W. Sabo (1999)
In this same way, the probability can now be calculated for any event B which can be expressed as some
collection of the eight simple events listed above.
We could continue the process to experiments in which four coins are flipped (for which there would be 2 4 =
16 possible outcomes), and so on. All that changes is that the diagram, and hence the list of outcomes
becomes longer and longer. Later in the course, we will present some formulas or procedures that simplify
the process. This coin toss experiment is a special case of such an important type of random experiment
that it has its own name: the binomial experiment. It has important applications in many areas of
biological sciences.

Example 2: The Selection of Contestants (or… the unfair coin!)
Let us suppose that three individuals, 15-years older or older, are to be selected at random from the
population of Burnaby to win some prize. When the winners are announced, it is found that they are Larry,
aged 24; Curley, aged 55, and Moe, aged 63 -- that is, two of the three successful contestants are 55 years
old or older. Accusations that the contest was rigged in favor of senior citizens are made. Assuming that no
one claims to have seen the organizers overtly tampering with the selection process, is there any way to
decide whether further investigation is necessary here?
S
SSS
Solution
S
One way to decide if there is any value in investigating the
Y
SSY
S
charge of rigging the selection further is to determine how
S
SYS
unlikely it might be that two of three contestants would be 55
Y
years old or older if everyone in Burnaby had an equal
Y
SYY
likelihood of being selected (that is, the selection process was
S
YSS
fair). We can view the contestant selection process as a little
S
like the process of flipping a coin as far as age categories are
Y
YSY
concerned. We will let S stand for the event that a contestant
Y
S
YYS
is 55 years old or older, and let Y stand for the event that the
Y
First
selected contestant is younger than 55 years. Then the first
Contestant
Y
YYY
contestant selected will either be S or Y, and whatever the first
Second
Third
contestant is, the second can also either be S or Y, and
Contestant
Contestant
similarly the third will also be either S or Y. In fact, you can see
that the possible ways of choosing three contestants (as far as their S-ness or Y-ness is concerned) is given
by the same kind of branching diagram as used about for the three-coin experiment, except that the letters H
and T are replaced by S and Y, as shown to the right.
Of the eight ways in which the contestants could have turned out (as far as their S or Y characteristics are
concerned), three of them involve the selection of two S's and one Y -- exactly what happened in this
contest. We need to figure out probabilities for each of these three outcomes.
In the case of flipping a fair coin, we knew that H and T occurred with equal probability for each coin, and so
we were able to conclude that all eight final outcomes in the branching diagram were equally-likely, and
hence each had a probability of 1/8 or 0.125. In this example, there is no reason to assume that in selecting
a person at random from Burnaby's 15 or over population, we will be equally likely to get an S or a Y. In
fact, consulting Government of British Columbia statistical data for 1997 (the latest year available at the time
of writing this), we find estimates that of the 157,742 persons 15 years or older in Burnaby that year, there
are 116,606 persons (or 73.9%) between 15 and 54 years old and 41,136 persons (or 26.1%) 55 years or
older. Since probabilities are the same thing as relative frequencies, we can equate these percentages for
each group with the probability that a person of that group will be selected in a truly fair random draw.
In fact, if we repeated selecting trios of contestants at random from the population of Burnaby, we would
have found that 73.9% of the first contestants selected would be Y, and 26.1% of the first contests would be
S. Of the 73.9% of the trios in which the first contestant is Y, 73.9% would have the second contestant a Y.
Thus the proportion of trios in which the first two contestants were Y's would be 73.9% of 73.9% or (0.739) 2.
Continuing in this way, we obtain the following percentages for each of the eight combinations of contestants
that can result:
David W. Sabo (1999)
Calculating Probabilities: I
Page 3 of 11
26.1%
S
26.1%
73.9%
S
26.1%
73.9%
26.1%
Y 73.9%
73.9%
26.1%
S
26.1%
73.9%
Y
73.9%
Y
26.1%
73.9%
First
Contestant
Second
Contestant
S
SSS
(0.261)(0.261)(0.261)
Y
SSY
(0.261)(0.261)(0.739)
S
SYS
(0.261)(0.739)(0.261)
Y
S
SYY
YSS
(0.261)(0.739)(0.739)
Y
YSY
(0.739)(0.261)(0.739)
S
YYS
(0.739)(0.739)(0.261)
Y
YYY
(0.739)(0.739)(0.739)
Third
Contestant
(0.739)(0.261)(0.261)
The products to the right indicate the fraction of a large number of trios of randomly selected contestants
that will have the indicated selection history. We can summarize this information in a table:
contestant
selection:
SSS
SSY
SYS
SYY
YSS
YSY
YYS
YYY
proportion of a large number of trios of
contestants with this pattern:
(0.261)(0.261)(0.261)
(0.261)(0.261)(0.739)
(0.261)(0.739)(0.261)
(0.261)(0.739)(0.739)
(0.739)(0.261)(0.261)
(0.739)(0.261)(0.739)
(0.739)(0.739)(0.261)
(0.739)(0.739)(0.739)
=
=
=
=
=
=
=
=
0.0178
0.0503
0.0503
0.1425
0.0503
0.1425
0.1425
0.4036
Now we can ask: what is the probability that a truly random selection of three contestants would have
resulted in two S's and one Y? From the table, we see that in the long run, if many trios of three Burnaby
residents are selected randomly, we would expect that a proportion 0.0503 + 0.0503 + 0.0503 = 0.1509
would be of such a type (just add up the proportions which are SSY, SYS, or YSS). Thus, we can say that
the probability of getting two S's and one Y in a randomly selected group of three individuals is 0.1509, or
very close to 15%.
Is this number small enough to cause us to doubt such an event could happen? Most people would agree
"not really." You will see as the course progresses that the conventional meaning of "rare occurrence" is
usually taken to be one with a probability of less than 5%, and in some cases, even much smaller. If we had
found that the probability of obtaining the combination of two S's and one Y in a truly random selection
process was, say, only 0.001, we would have grounds for doubting that a truly random selection had
occurred. In that case, the fact that the event actually occurred casts doubt in most people that the
probability of occurrence could really be as small as 0.001. However, there are many things happen to each
of us each day that have probabilities of 15% and we are not surprised. So, the conclusion here has to be
that there really isn't evidence on the face of things to support a claim of tampering with the random
selection process.

This last example gives rise to a number of aspects of probability calculations which are worth noting at this
point, even though more detailed treatment of some must be put off until later in the course:
(i.)
The 15% probability obtained in this example seems to give more of a "non-answer" than an
answer to the original question of whether the contestants resulted from a truly fair or random
Page 4 of 11
Calculating Probabilities: I
David W. Sabo (1999)
selection process. The probability isn't so small that we find it difficult to believe the event could
have happened, yet it isn't so large that it seems to indicate the observed outcome is highly likely.
It would still be possible to detect an unfair selection process if there was a succession of such
contests in which two S's and one Y were picked consistently. What this result means is that over
the long run, 15% of such selections should result in two S's and one Y. Thus, for a pair of
contests, 15% will have the first group of contestants made up of two S's and one Y, and 15% of
these will have the second group made up of two S's and one Y. Thus, the probability of a two
contests in a row each involving two S's and one Y is just (0.1509) 2  0.023 or about one in 40.
This is a fairly low probability, and you might be justified in having some suspicion of unfairness if
two contests in a row involved two S's and one Y. Now, if you looked at three successive contests,
then the probability of all three involving two S's and one Y would be (0.1509) 3  0.0034, or about
one in 300. To observe such an outcome for three successive contests is thus quite surprising,
and probably means the selection process should be investigated.
Would the occurrence of three consecutive contests involving two S's and one Y mean that the
organizers are deliberately cheating? Not necessarily. These outcomes could just be the result of
an incredible string of coincidences. Or, it could be the result of an unwittingly faulty selection
process. Our calculations are based on the assumption that all 157,742 residents of Burnaby have
the same likelihood of being selected. But, there may be all sorts of reasons why a convenient
contestant selection process might inadvertently favor those who are 55 years old or older. For
example, perhaps contestants are selected by first selecting a random telephone number, and then
phoning them in the middle of the day. Residents under the age of 55 may be less likely to be
home to receive such a call, and so less likely to be selected as contestants. You can probably
think of other procedures that might favor residents in the S category over residents in the Y
category. This may not be such a big deal in selecting contestants for a contest perhaps, but such
sampling biases can result in seriously incorrect conclusions in scientific and technical research.
Ways to avoid such errors fall under the topics of experimental design and sampling theory.
(ii.)
Make sure that you see that Example 2 is just a slight generalization of Example 1, and that you
can explain why we referred to Example 2 as "the unfair coin" example. By "unfair coin", we mean
a coin which is more likely to fall with one side up than with the other.
Situations such as these two, which consist of a fixed number of repetitions (or trials) of the same
process, and that process can result in one of just two possible simple outcomes, and the
probability of each simple outcome is the same for all trials, is a pattern that recurs in a large
number of scientific and technical situations. The process is so common that it has been given its
own special name: the binomial experiment. The binomial experiment has been studied
extensively, and we will look at it more generally and in more detail later in the course.
(iii.)
The calculations in Example 2 show some obvious patterns. The probabilities associated with each
of the eight simple outcomes are quite easy to set up. We have that for the selection of any one
contestant, Pr(Y) = 0.739 and Pr(S) = 0.261. Then, the probability of any given sequence of S's
and Y's is just the product of 0.739 for each S and 0.261 for each Y. This occurs because every
time we take an S branch in the diagram, the proportion of occurrences is reduced by a factor of
0.261, and every time we take a Y branch in the diagram, the proportion of occurrences is reduced
by a factor of 0.739.
Furthermore, if we ask for the probability of, say, selecting two S's and one Y without regard to their
order of selection, we see that we are really asking for the probability of occurrence of one of the
three simple outcomes {SSY, SYS, YSS}. Notice that these three outcomes amount to all the
possible distinct arrangements of two S's and one Y. It's quite easy to convince yourself that there
are only three distinct arrangements of two S's and one Y -- either use the branching diagram
above, or spend a few minutes trying to think of a fourth distinct arrangement. We will present a
formula later for working out this number in more complicated cases where intuition or drawing a
branching diagram may not be adequate or convenient.
(iv.)
Finally, a comment about using the numbers Pr(Y) = 0.739 and Pr(S) = 0.261. The basic definition
of a probability as a relative frequency justified this. We had the information that there were
157,742 residents of Burnaby in 1997 of which 116,606 were Y's. Thus, the Y's had a relative
frequency of 116,606/157,742  0.739. Assuming that the original population figures are accurate
(and one should have some reservations perhaps -- how could you count over 116,000 people right
down to the last individual, as implied by the 6 in the last figure?), identification of population
David W. Sabo (1999)
Calculating Probabilities: I
Page 5 of 11
proportions with probabilities is valid.
More usually, we don't have exact values for population numbers. We would then have to estimate
population proportions using information about proportions in random samples of that population.
Thus, another source of potential error in conclusions would be possible sampling errors in getting
estimates of these population proportions.
Finally, there is one somewhat more subtle issue in connection with these probabilities that needs
mentioning, although it is of no importance at all in this particular example. We assumed that the
correct composition of the population was 116,606 Y's and 41,136 S's for a total of 157,742 people
in all. If a person is picked at random from this population, then the probability of picking a Y is
given by 116,606/157,742. We used this probability for the second and third contestants selected
as well. However, if the three contestants must be distinct individuals, then the numbers in the
populations from which the second and third contestants are selected are not the same as for the
population from which the first contestant is selected, and so the probabilities are not really the
same either. For instance, if the first contestant selected is a Y, then the second contestant will be
selected from a population of 116,605 Y's out of a total of 157,741 people -- the first contestant has
been removed from the list of people eligible to be the second contestant. So, for the second
contestant, Pr(Y) is really 116,605/157,741 rather than 116,606/157,742. In this case, the
difference between these two proportions is certainly negligible, but that would not be true if we
were drawing items from a very small population. We speak of this being sampling without
replacement, to indicate that when a person is picked as one contestant, they are not "returned" to
the pool of eligible contestants for the selection of the next contestant. (For the alternative method,
sampling with replacement, an element of the population would be selected, its identity recorded,
and then it would be returned to the population and be eligible for selection again.) The method we
used in Example 2 is exactly correct for a sampling with replacement, and is approximately correct
for sampling without replacement when the sampling process itself doesn't significantly change the
proportions of the population.
After that lengthy discussion of Example 2, we must move along quickly to keep the length of this document
under control. However, we have two more examples that illustrate extensions of, and perhaps alternatives
to, the simple branching diagram in the calculation of probabilities for some basic, but important,
experimental patterns: rolling dice and lotteries.
Example 3: Rolling (Fair) Dice
You are familiar with the usually cube-shaped dice (we're going use the word "dice" to
refer to one or more than one of these things -- some people call a single one a "die"
and more than one "dice", but then surely we get into all sorts of trouble with words like
"rice" and "spice" and "mice", not to mention "ice" ). By a "fair" dice, we mean one for which every face
has the same likelihood of ending up on top when the dice is tossed.
When one dice is tossed, the possible outcomes are just the values {1, 2, 3, 4, 5, 6}. If the dice is fair, then
the six outcomes are equally likely, and so we can write:
Pr(1) = 1/6
Pr(4) = 1/6
Pr(2) = 1/6
Pr(5) = 1/6
Pr(3) = 1/6
Pr(6) = 1/6
So, what is the probability that you will get an even number when you roll one dice? Simply
Pr(even number) = Pr(2 or 4 or 6) = Pr(2) + Pr(4) + Pr(6) = 1/6 +1/6 + 1/6 = 0.5,
since the events {1, 2, 3, 4, 5, 6} are mutually exclusive simple events.
Things get a bit more complicated when we deal with two dice. One way to sort things out is by using a
branching diagram with two columns: the first indicating how the first dice lands, and the second indicating
how the second dice lands. This is shown in abbreviated form below.
Page 6 of 11
Calculating Probabilities: I
David W. Sabo (1999)
The numbers down the left edge of the figure is the outcome for
the first dice. Only the first two of six possible outcomes is
shown. For each of the six ways the first dice can fall, the
second can fall in one of six ways, as shown in the second
column. Thus, the complete branching diagram would consist of
six sections of six branches each. From this we conclude that
there are 36 distinct ways that two dice can fall. Further, for fair
dice, each outcome has the same likelihood, so that the 36
simple outcomes for two dice are all equally likely. We can also
represent the outcomes for the two-dice experiment as a pair of
numbers in brackets, indicating how the first and second dice fall
respectively.
1 (1, 1)
2 (1, 2)
3 (1, 3)
4 (1, 4)
1
5 (1, 5)
6 (1, 6)
For a three dice experiment, each of the 36 branches in the
figure to the right would split into six more branches, giving a
total of 36 x 6 = 216 possible outcomes.
2
1
2
(2, 1)
3
4
(2, 3)
5
6
Even for a two-dice experiment, the branching diagram is a bit
cumbersome to work with. Thus, it is easy to see even from this
incomplete diagram that
(2, 2)
(2, 4)
(2, 5)
(2, 6)
second dice
6
Pr( two ones) = 1/36
because the outcome (1, 1) can only occur in the first set of branches emerging from the "first dice falls 1"
event on the left. The pattern of two-dice events listed down the right is clear enough that we know for sure
there will not be a (1, 1) lower down as well.
It's somewhat more difficult to determine Pr(sum of dots on both dice = 7). We see two simple events of this
type in the part of the branching diagram that's shown: (1, 6) and (2, 5). With a bit of thought, you should be
able to see that the complete diagram would have four more outcomes corresponding to a sum of 7, namely,
(3, 4), (4, 3), (5, 2) and (6, 1). Thus, Pr(sum = 7) corresponds to any one of six equally likely simple events
and so
Pr(sum = 7) = 6 x (1/36) = 1/6.
To compute probabilities of various sums for the two-dice experiment, it might be simpler to display the
possible outcomes in a more tabular form:
first
dice
1
2
3
4
5
6
1
2
2
3
4
5
6
7
3
4
5
6
7
8
second dice
3
4
4
5
6
7
8
9
5
6
7
8
9
10
5
6
6
7
8
9
10
11
7
8
9
10
11
12
The numbers in the body of the table give the sum of the two dice corresponding to the outcomes for each in
the left-hand column and top row. Each of the 36 entries in the body of the table represent equally-likely
outcomes, so each have a probability of 1/36. Thus, to work out Pr(sum of two dice = 7), we just count up
the number of 7's that show up in the body of this table. Since there are six of them, we reconfirm the result
obtained just above that Pr(sum = 7) = 6/36 or 1/6.
Note that we can describe the possible outcomes of the two-dice experiment in terms of the sum of the two
dice. Described in this way, the list of possible outcomes is {2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12}. However, only
two of these eleven outcomes are simple events (the others correspond to two or more different ways that
the dice can fall), and they are not equally-likely. The "equally-likeliless" of a set of events must be the
consequence of some known aspect of the experiment.

David W. Sabo (1999)
Calculating Probabilities: I
Page 7 of 11
Example 4: Lotto 649
Very briefly, we look at one more example that is amenable to analysis based on branching diagrams -lotteries based on selecting several of a larger set of numbers. The most familiar version in this area of the
world is probably the Lotto 649. Ticket holders must select six numbers from 1 through 49. The grand prize
is won if those same six numbers are drawn by the lottery operators. Great effort is made to ensure that all
possible six-number combinations have the same likelihood. To work out the probability of winning the
grand prize, we just need to determine how many different six-number combinations can be made up from
the numbers between 1 and 49 inclusive.
We can only show a very small part of the branching diagram in this case, but hopefully it's enough to help
you see the pattern:
2
2
3
5
6
3
4
fifth and sixth
numbers
1
49
first
number:
49 choices
49
1
third
number:
47 choices
left
fourth
number:
46 choices
left
3
4
2
49
49
second number:
48 choices left
Think of the experiment as one in which we select the first number (which can be any one of the 49 possible
numbers), and then the second number (but now, the second number has only 48 possibilities, since one of
the 49 has already been taken in the first selection). Once the first two numbers are selected, there are only
47 candidates left to be the third number. Once the first three numbers are selected, there are only 46
candidates left to be the fourth number. The fifth is chosen from among 45 remaining possibilities, and the
sixth number from among 44 possibilities.
Notice in the set of branches representing the second number, we omit the number that has already been
picked as the first number. In the set of branches representing the selection of the third number, we omit the
two numbers that have already been selected in each case.
Anyway, it is quite easy to see how many branches the entire diagram would have if drawn up completely.
There are 49 sections corresponding to each possible choice of the first number. For each of these 49
choices of a first number, there are 48 choices for the second number. Thus, there are 49 x 48 = 2352 ways
of selecting the first two numbers. For each of these, there are 47 ways of selecting the third number, so the
number of ways of selecting the first three numbers is (49 x 48) x 47 = 110,544. For each of these, there are
46 choices of the fourth number, giving (49 x 48 x 47) x 46 = 5,085,024 ways of choosing the first four
numbers. For each of these, there are 45 ways of choosing the fifth number, giving a total of (49 x 48 x 47 x
46) x 45 = 228,826,080 ways of choosing the first five numbers. Finally, for each of these ways of choosing
the first five numbers, there are 44 ways of choosing the sixth number. Thus, it looks like the number of
different ways in which the six numbers in the Lotto 649 can be chosen is
49 x 48 x 47 x 46 x 45 x 44 = 10,068,347,520
(CP-1)
Fortunately, this is not quite correct or there would be very few winners (though each would win an
extremely big prize). But it is a start.
Page 8 of 11
Calculating Probabilities: I
David W. Sabo (1999)
The error in the calculation leading to (CP-1) is not too hard to track down. Look at the branching diagram
again. If we followed the top path all the way through the six stages, we'd come up with the selection of the
six numbers {1, 2, 3, 4, 5, 6} in that order. Now, take the top path through the diagram starting with a 2 as
the first number chosen. Then the six numbers chosen would be in order {2, 1, 3, 4, 5, 6}. But this is exactly
the same lottery ticket as the first one. Only the identity of the six numbers chosen is important -- the order
in which you choose the six numbers is irrelevant. So, it looks like our branching diagram is counting some
tickets more than once, and so the number obtained in (CP-1) is too large.
If you think about it for a minute, you'll realize that the ticket with the six numbers {1, 2, 3, 4, 5, 6} will show
up in our list once for every possible order in which these six numbers can be written. In fact, every
combination of six numbers will show up in our list that many times. In how many different orders can you
write six numbers? Once again, imagine a branching diagram. The first number in the sequence can be
any one of the six numbers. Then, for each choice of the first number, the second number in the sequence
can be any of the five remaining numbers. So, there are 6 x 5 = 30 ways to place the first two numbers in
the sequence. The third number can be any of the remaining four numbers after the first two are placed, so
there are (6 x 5) x 4 = 120 distinct sequences for the first three numbers picked. Then the fourth number
can be any of the three numbers not placed so far, meaning that there are (6 x 5 x 4) x 3 = 360 distinct
sequences of the first four numbers. The fifth number placed can be any of the remaining two numbers, so
that there are (6 x 5 x 4 x 3) x 2 = 720 distinct sequences for the first five numbers. Finally, once the first
five numbers have been placed, there is only one choice left for the sixth. Thus, there are (6 x 5 x 4 x 3 x 2)
x 1 = 720 different arrangements of any combination of six numbers. This means that the result (CP-1)
counts every valid Lotto 649 ticket 720 times!.
So, dividing (CP-1) by 720 gives
number of distinct Lotto 649 tickets = 10,068,347,520/720 = 13,983,816
(CP-2)
or just under 14 million. Each of these tickets are equally likely to be prize winners. Thus, the probability of
winning the Lotto 649 is 1/13,983,816 (or about 1 in 14 million). This means that in the long run, there is
about one winning ticket for every 14 million or so tickets sold.
Humans have a hard time understanding either very large numbers (such as 14 million) or very small
numbers (such as 1/14 million). One way to get a sense of what a probability like this might mean in
practice is to look at other events in life. For example, approximately 300 people in the United States are
killed each year by lightning. Assuming the 300 million or so people in the US are all equally likely to be
struck by lighting (which is not entirely accurate because lightning strikes are more frequent in certain parts
of the country than in others), then each person in the US has a probability of about 1 in a million of being
fatally struck by lightning each year -- a roughly fourteen times greater chance than winning the grand prize
with a single Lotto 649 ticket.

Later in the course, we will return to several important features that appear in this Lotto 649 example. We
just note them here briefly.
Factorials, Permutations, Combinations
First, you'll notice that products of successive whole numbers (or integers) occur often in counting up
branches of a diagram such as the one just above. It is convenient to introduce the so-called factorial
function for an integer: it is the product of that integer and all smaller integers down to 1. The exclamation
point is used to indicate a factorial. So, for example,
6! = 6 x 5 x 4 x 3 x 2 x 1 = 720.
We would speak the left-hand side of this equation as "six-factorial". This number, 6!, occurred above when
we calculated the number of ways in which 6 items could be arranged in a row. Such alternative
arrangements of items are called permutations. Thus, one generalization from the work in the last example
is that
the number of permutations of n objects or items = n!
David W. Sabo (1999)
Calculating Probabilities: I
Page 9 of 11
Secondly, recall that in the result (CP-1) above, we got a product of a sequence of consecutive whole
numbers, but the sequence didn't go all the way down to 1, so it wasn't really a factorial. However, such
sequential products of whole numbers can always be written as a quotient of two factorials. In the case of
(CP-1), the product was
49 x 48 x 47 x 46 x 45 x 44
But note the following:
49! = 49 x 48 x 47 x 46 x 45 x 44 x 43 x 42 x ... x 3 x 2 x 1
43!
Thus, we can write
49  48  47  46  45  44 
49!
43!
In fact, this leads to a much more useful formula than just being able to express sequential products of
whole numbers as a quotient of factorials. In the calculation displayed in (CP-1) is really a calculation of
how many ways we can choose 6 things from 49 where the order in which we choose them is important. As
mentioned earlier, in the jargon of probability theory, those ordered groups of things are called
permutations. From that example, we conclude that the number of permutations of six items selected from
a pool of 49 items is given by
P49,6 
49!
49!

43! (49  6)!
where we have rewritten the denominator 43! in terms of the numbers 49 and 6 occurring in the problem.
Thus, it would appear in general that the number of permutations of m items selected from a pool of N items,
denoted by the symbol PN,m, is given by
PN ,m 
N!
( N  m)!
(CP-3)
This formula is quite important in some applications, and we will return to it again before the end of the
course.
Finally, if order of selection is not important, then the above example indicates that we must divide P N,m by
m! to get the number, CN,m, of unique combinations of m things chosen from a pool of N things. That is
C N ,m 
PN , m
m!

N!
( N  m)! m!
(CP-4)
Question
A laboratory has a supply of 20 white rats. Each of five rats selected at random from this pool of 20 is to be
assigned to a specific researcher for observation. In how many distinct ways can this assignment of 5 rats
to 5 researchers be done?
Solution
It is obvious that this question asks how many ways five items (rats) can be chosen from the pool of 20
items. Since each rat will be assigned to a specific researcher, we need to calculate the number of
permutations rather than the number of combinations. Order of assignment is important here, because if
two researchers switch rats, they've really created a new assignment (even when the same five rats overall
are used in the experiment). Thus, the answer to this question is:
Page 10 of 11
Calculating Probabilities: I
David W. Sabo (1999)
P20,5 
20!
20!

 20 19 18 17 16  1,860 ,480
(20  5)! 15!
(Some scientific calculators have function keys for PN,m and CN,m, so that the expansion into factorials and
the expansion of the factorials as done above would not be necessary.)

Question
A laboratory has a supply of 20 white rats. Five of these are to be selected for use in a particular
experiment. In how many different ways can this group of five rats be selected?
Solution:
This is similar to the previous question except that now none of the five selected rats are not going to be
treated in a distinctive way -- so order of selection is not important. Thus, the question asks for how many
combinations of five rats can be chosen from the pool of twenty, which is
C 20,5 
20!
20!

 15,504
(20  5)! 5! 15! 5!
Thus, there are 15,504 distinct samples of 5 rats that can be selected from the pool of 20 available rats.

David W. Sabo (1999)
Calculating Probabilities: I
Page 11 of 11
Download