Conditional Probability
Conditional Probability
The Law of Total Probability
Let A1 , A2 , . . . , Ak be mutually exclusive and exhaustive events.
Then for any other event B,
P(B) = P(B | A1 ) · P(A1 ) + P(B | A2 ) · P(A2 ) + · · ·
+ P(B | Ak ) · P(Ak )
=
k
X
P(B | Ai ) · P(Ai )
i=1
where exhaustive means A1 ∪ A2 ∪ · · · Ak = S.
Conditional Probability
Conditional Probability
Bayes’ Theorem
Let A1 , A2 , . . . , Ak be a collection of k mutually exclusive and
exhaustive events with prior probabilities P(Ai )(i = 1, 2, . . . , k).
Then for any other event B with P(B) > 0, the posterior
probability of Aj given that B has occurred is
P(Aj | B) =
P(B | Aj ) · P(Aj )
P(Aj ∩ B)
= Pk
P(B)
i=1 P(B | Ai ) · P(Ai )
j = 1, 2, . . . k
Independence
Independence
Definition
Two events A and B are independent if P(A | B) = P(A), and
are dependent otherwise.
Independence
Definition
Two events A and B are independent if P(A | B) = P(A), and
are dependent otherwise.
Independence
Independence
The Multiplication Rule for Independent Events
Proposition
Events A and B are independent if and only if
P(A ∩ B) = P(A) · P(B)
Independence
Independence
Independence of More Than Two Events
Definition
Events A1 , A2 , . . . , An are mutually independent if for every k
(k = 2, 3, . . . , n) and every subset of indices i1 , i2 , . . . , ik ,
P(Ai1 ∩ Ai2 ∩ · · · ∩ Aik ) = P(Aii ) · P(Ai2 ) · ··· · P(Aik ).
Random Variables
Random Variables
Definition
For a given sample sample space S of some experiment, a random
variable (rv) is any rule that associates a number with each
outcome in S. In mathematical language, a random variable is a
function whose domain is the sample space and whose range is the
set of real numbers.
Random Variables
Definition
For a given sample sample space S of some experiment, a random
variable (rv) is any rule that associates a number with each
outcome in S. In mathematical language, a random variable is a
function whose domain is the sample space and whose range is the
set of real numbers.
We use uppercase letters, such as X and Y to, denote random
variables and use lowercase letters, such as x and y , to denote
some particular value of the corresponding random variable. For
example, X (s) = x means that value x is associated with the
oucome s by the rv X .
Random Variables
Definition
For a given sample sample space S of some experiment, a random
variable (rv) is any rule that associates a number with each
outcome in S. In mathematical language, a random variable is a
function whose domain is the sample space and whose range is the
set of real numbers.
We use uppercase letters, such as X and Y to, denote random
variables and use lowercase letters, such as x and y , to denote
some particular value of the corresponding random variable. For
example, X (s) = x means that value x is associated with the
oucome s by the rv X .
Random Variables
Random Variables
Examples:
Random Variables
Examples:
1. Assume we toss a coin. Then S = {H, T}. We can define a rv
X by
X (H) = 1 and X (T) = 0
Random Variables
Examples:
1. Assume we toss a coin. Then S = {H, T}. We can define a rv
X by
X (H) = 1 and X (T) = 0
2. A techincian is going to check the quality of 10 prodcuts. For
each product the outcome is either successful (S) or defective (D).
Then we can define a rv Y by
(
1, successful
Y =
0, defective
Random Variables
Examples:
1. Assume we toss a coin. Then S = {H, T}. We can define a rv
X by
X (H) = 1 and X (T) = 0
2. A techincian is going to check the quality of 10 prodcuts. For
each product the outcome is either successful (S) or defective (D).
Then we can define a rv Y by
(
1, successful
Y =
0, defective
Definition
Any random variable whose only possible values are 0 and 1 is
called a Bernoulli random variable.
Random Variables
Random Variables
More examples:
3. (Example 3.3) We are investigating two gas stations. Each has
six gas pumps. Consider the experiment in which the number of
pumps in use at a particular time of day is determined for each of
the stations.
Define rv’s X , Y and U by
X = the total number of pumps in use at the two stations
Y = the difference between the number of pumps in use at station 1
and the number in use at station 2
U = the maximum of the numbers of pumps in use at the two station
Random Variables
More examples:
3. (Example 3.3) We are investigating two gas stations. Each has
six gas pumps. Consider the experiment in which the number of
pumps in use at a particular time of day is determined for each of
the stations.
Define rv’s X , Y and U by
X = the total number of pumps in use at the two stations
Y = the difference between the number of pumps in use at station 1
and the number in use at station 2
U = the maximum of the numbers of pumps in use at the two station
If this experiment is performed and s = (3, 4) results, then
X ((3, 4)) = 3 + 4 = 7, so we say that the observed value of X was
x = 7. Similarly, the observed value of Y would be
y = 3 − 4 = −1, and the observed value of U would be
u = max(3, 4) = 4.
Random Variables
Random Variables
More examples:
4. Assume we toss a coin until we get a Head. Then the sample
space would be S = {H, TH, TTH, TTTH, . . . } If we define a rv
X by X
X = the number we totally tossed
Then X ({H}) = 1, X ({TH}) = 2, X ({TTH}) = 3, . . . , and so on.
Random Variables
More examples:
4. Assume we toss a coin until we get a Head. Then the sample
space would be S = {H, TH, TTH, TTTH, . . . } If we define a rv
X by X
X = the number we totally tossed
Then X ({H}) = 1, X ({TH}) = 2, X ({TTH}) = 3, . . . , and so on.
In this case, the random variable X can be any positive integer,
which in all is infinite.
Random Variables
More examples:
4. Assume we toss a coin until we get a Head. Then the sample
space would be S = {H, TH, TTH, TTTH, . . . } If we define a rv
X by X
X = the number we totally tossed
Then X ({H}) = 1, X ({TH}) = 2, X ({TTH}) = 3, . . . , and so on.
In this case, the random variable X can be any positive integer,
which in all is infinite.
5. Assume we are going to measure the length of 100 desks.
Define the rv Y by
Y = the length of a particular desk
Y can also assume infinitly possible values.
Random Variables
Random Variables
Definition
A dicrete random variable is an rv whose possible values either
constitute a finite set or else can be listed in an infinite sequence in
which there is a first element, a second element, and so on
(“countably” infinite).
Random Variables
Definition
A dicrete random variable is an rv whose possible values either
constitute a finite set or else can be listed in an infinite sequence in
which there is a first element, a second element, and so on
(“countably” infinite).
A random variable is continuous if both of the following apply:
Random Variables
Definition
A dicrete random variable is an rv whose possible values either
constitute a finite set or else can be listed in an infinite sequence in
which there is a first element, a second element, and so on
(“countably” infinite).
A random variable is continuous if both of the following apply:
1. Its set of possible values consists either of all numbers in a
single interval on the number line (possibly infinite in extent, e.g.,
(−∞, ∞) ) or all numbers in a disjoint union of such intervals
(e.g., [0, 10] ∪ [20, 30]).
Random Variables
Definition
A dicrete random variable is an rv whose possible values either
constitute a finite set or else can be listed in an infinite sequence in
which there is a first element, a second element, and so on
(“countably” infinite).
A random variable is continuous if both of the following apply:
1. Its set of possible values consists either of all numbers in a
single interval on the number line (possibly infinite in extent, e.g.,
(−∞, ∞) ) or all numbers in a disjoint union of such intervals
(e.g., [0, 10] ∪ [20, 30]).
2. No possible value of the variable has positive probability, that is,
P(X = c) = 0 for any possible value c.
Examples
Probability Distributions for Discrete RV
Probability Distributions for Discrete RV
An example:
Assume we toss a coin 3 times and record the outcomes. Let Xi be
a random variable defined by
(
1, if the i th outcome is Head;
Xi =
0, if the i th outcome is Tail;
Let X be the random variable such that X = X1 + X2 + X3 , then
X represents the total number of Heads we could get from the
experiment.
Probability Distributions for Discrete RV
An example:
Assume we toss a coin 3 times and record the outcomes. Let Xi be
a random variable defined by
(
1, if the i th outcome is Head;
Xi =
0, if the i th outcome is Tail;
Let X be the random variable such that X = X1 + X2 + X3 , then
X represents the total number of Heads we could get from the
experiment.
If the probability for getting a Head for each toss is 0.7, then the
probabilities for all the outcomes are tabulated as following:
s
x
p(x)
HHH
3
0.343
HHT
2
0.147
HTH
2
0.147
HTT
1
0.063
THH
2
0.147
THT
1
0.063
TTH
1
0.063
TTT
0
0.027
Probability Distributions for Discrete RV
Probability Distributions for Discrete RV
Example continued:
s
HHH HHT
x
3
2
p(x) 0.343 0.147
HTH
2
0.147
HTT
1
0.063
THH
2
0.147
THT
1
0.063
TTH
1
0.063
TTT
0
0.027
Probability Distributions for Discrete RV
Example continued:
s
HHH HHT HTH HTT THH
x
3
2
2
1
2
p(x) 0.343 0.147 0.147 0.063 0.147
We can re-tabulate it only for the x values:
0
1
2
3
x
p(x) 0.027 0.189 0.441 0.343
THT
1
0.063
TTH
1
0.063
TTT
0
0.027
Probability Distributions for Discrete RV
Example continued:
s
HHH HHT HTH HTT THH
x
3
2
2
1
2
p(x) 0.343 0.147 0.147 0.063 0.147
We can re-tabulate it only for the x values:
0
1
2
3
x
p(x) 0.027 0.189 0.441 0.343
Now we can answer various questions.
THT
1
0.063
TTH
1
0.063
TTT
0
0.027
Probability Distributions for Discrete RV
Example continued:
s
HHH HHT HTH HTT THH THT
x
3
2
2
1
2
1
p(x) 0.343 0.147 0.147 0.063 0.147 0.063
We can re-tabulate it only for the x values:
0
1
2
3
x
p(x) 0.027 0.189 0.441 0.343
Now we can answer various questions.
The probability that there are at most 2 Heads is
TTH
1
0.063
P(X ≤ 2) = P(x = 0 or 1 or 2) = p(0) + p(1) + p(2) = 0.657
TTT
0
0.027
Probability Distributions for Discrete RV
Example continued:
s
HHH HHT HTH HTT THH THT
x
3
2
2
1
2
1
p(x) 0.343 0.147 0.147 0.063 0.147 0.063
We can re-tabulate it only for the x values:
0
1
2
3
x
p(x) 0.027 0.189 0.441 0.343
Now we can answer various questions.
The probability that there are at most 2 Heads is
TTH
1
0.063
P(X ≤ 2) = P(x = 0 or 1 or 2) = p(0) + p(1) + p(2) = 0.657
The probability that the number of Heads are is strictly
between 1 and 3 is
P(1 < X < 3) = P(X = 2) = p(2) = 0.441
TTT
0
0.027
Probability Distributions for Discrete RV
Probability Distributions for Discrete RV
Definition
The probability distribution or probability mass function (pmf)
of a discrete rv is defined for every number x by
p(x) = P(X = x) = P(all s ∈ S : X (s) = x).
Probability Distributions for Discrete RV
Definition
The probability distribution or probability mass function (pmf)
of a discrete rv is defined for every number x by
p(x) = P(X = x) = P(all s ∈ S : X (s) = x).
In words, for every possible value x of the random variable, the
pmf specifies the probability of observing that value when the
experiment
is performed. (The conditions p(x) ≥ 0 and
P
all possible x p(x) = 1 are required for any pmf.)
Probability Distributions for Discrete RV
Probability Distributions for Discrete RV
Example 3.8
Six lots of components are ready to be shipped by a certain
supplier. The number of defective components in each lot is as
follows:
Lot
1 2 3 4 5 6
Number of defectives 0 2 0 1 2 0
One of these lots is to be randomly selected for shipment to a
particular customer. Let X be the number of defectives in the
selected lot.
Probability Distributions for Discrete RV
Example 3.8
Six lots of components are ready to be shipped by a certain
supplier. The number of defective components in each lot is as
follows:
Lot
1 2 3 4 5 6
Number of defectives 0 2 0 1 2 0
One of these lots is to be randomly selected for shipment to a
particular customer. Let X be the number of defectives in the
selected lot.
The three possible X values are 0, 1 and 2. The pmf for X is
3
p(0) = P(X = 0) = P(lot 1 or 3 or 6 is selected) = = 0.500
6
1
p(1) = P(X = 1) = P(lot 4 is selected) = = 0.167
6
2
p(2) = P(X = 2) = P(lot 2 or 5 is selected) = = 0.333
6
Probability Distributions for Discrete RV
Probability Distributions for Discrete RV
Example 3.10:
Consider a group of five potential blood donors — a, b, c, d, and e
— of whom only a and b have type O+ blood. Five blood
smaples, one from each individual, will be typed in random order
until an O+ individual is identified. Let the rv Y = the number of
typings necessary to identify an O+ individual. Then what is the
pmf of Y ?
Probability Distributions for Discrete RV
Probability Distributions for Discrete RV
Example:
Consider whether the next customer coming to a certain gas
station buys gasoline or diesel. Let
(
1, if the customer purchases gasoline
X =
0, if the customer purchases diesel
If 30% of all customers in one month purchase diesel, then the pmf
for X is
p(0) = P(X = 0) = P(nextcustomerbuysdiesel) = 0.3
p(1) = P(X = 1) = P(nextcustomerbuysgasoline) = 0.7
p(x) = P(X = x) = 0 for x 6= 0 or 1
Probability Distributions for Discrete RV
Probability Distributions for Discrete RV
Example:
Consider whether the next customer coming to a certain gas
station buys gasoline or diesel. Let
(
1, if the customer purchases gasoline
X =
0, if the customer purchases diesel
If 100α% of all customers in one month purchase diesel, then the
pmf for X is
p(0) = P(X = 0) = P(nextcustomerbuysdiesel) = α
p(1) = P(X = 1) = P(nextcustomerbuysgasoline) = 1 − α
p(x) = P(X = x) = 0 for x 6= 0 or 1
here α is between 0 and 1.
Probability Distributions for Discrete RV
Probability Distributions for Discrete RV
Definition
Suppose p(x) depends on a quantity that can be assigned any one
of a number of possible values, with each different value
determining a different probability distribution. Such a quantity is
called a parameter of the distribution. The collection of all
probability distributions for different values of the parameter is
called a family of probability distribution.
Probability Distributions for Discrete RV
Definition
Suppose p(x) depends on a quantity that can be assigned any one
of a number of possible values, with each different value
determining a different probability distribution. Such a quantity is
called a parameter of the distribution. The collection of all
probability distributions for different values of the parameter is
called a family of probability distribution.
For the previous example, the quantity α is a parameter. Each
different value of α between 0 and 1 determines a different
member of a family of distributions; two such members are
0.3
p(x) = 0.7
0
if x = 0
if x = 1
otherwise
0.25
p(x) = 0.75
0
if x = 0
if x = 1
otherwise
Probability Distributions for Discrete RV
Probability Distributions for Discrete RV
Example:
Assume we are drawing cards from a 100 well-shuffled cards with
replacement. We keep drawing until we get a ♠. Let p = P({♠}),
i.e. there are 100 · p ♠’s. Assume the successive drawings are
independent and define X = the number of drawings. Then
p(1) = P(X = 1) = P({♠}) = p
p(2) = P(X = 2) = P({♠♠}) = (1 − p) · p
p(3) = P(X = 3) = P({♠♠♠}) = (1 − p) · (1 − p) · p
...
Probability Distributions for Discrete RV
Example:
Assume we are drawing cards from a 100 well-shuffled cards with
replacement. We keep drawing until we get a ♠. Let p = P({♠}),
i.e. there are 100 · p ♠’s. Assume the successive drawings are
independent and define X = the number of drawings. Then
p(1) = P(X = 1) = P({♠}) = p
p(2) = P(X = 2) = P({♠♠}) = (1 − p) · p
p(3) = P(X = 3) = P({♠♠♠}) = (1 − p) · (1 − p) · p
...
A general formula would be
(
(1 − p)x−1 · p
p(x) =
0
x = 1, 2, 3, . . .
otherwise
Probability Distributions for Discrete RV
Probability Distributions for Discrete RV
Example:
Assume we are drawing cards from a 100 well-shuffled cards with
replacement. We keep drawing until we get a ♠. Let p = P({♠}),
i.e. there are 100 · p ♠’s. Assume the successive drawings are
independent and define X = the number of drawings.
Probability Distributions for Discrete RV
Example:
Assume we are drawing cards from a 100 well-shuffled cards with
replacement. We keep drawing until we get a ♠. Let p = P({♠}),
i.e. there are 100 · p ♠’s. Assume the successive drawings are
independent and define X = the number of drawings.
If we know that there are 20 ♠’s, i.e. p = 0.2, then what is the
probability for us to draw at most 3 times? More than 2 times?
Probability Distributions for Discrete RV
Example:
Assume we are drawing cards from a 100 well-shuffled cards with
replacement. We keep drawing until we get a ♠. Let p = P({♠}),
i.e. there are 100 · p ♠’s. Assume the successive drawings are
independent and define X = the number of drawings.
If we know that there are 20 ♠’s, i.e. p = 0.2, then what is the
probability for us to draw at most 3 times? More than 2 times?
P(X ≤ 3) = p(1)+p(2)+p(3) = 0.2+0.2·0.8+0.2·(0.8)2 = 0.488
Probability Distributions for Discrete RV
Example:
Assume we are drawing cards from a 100 well-shuffled cards with
replacement. We keep drawing until we get a ♠. Let p = P({♠}),
i.e. there are 100 · p ♠’s. Assume the successive drawings are
independent and define X = the number of drawings.
If we know that there are 20 ♠’s, i.e. p = 0.2, then what is the
probability for us to draw at most 3 times? More than 2 times?
P(X ≤ 3) = p(1)+p(2)+p(3) = 0.2+0.2·0.8+0.2·(0.8)2 = 0.488
P(X > 2) = p(3)+p(4)+p(5)+· · · = 1−p(1)−p(2) = 1−0.2−0.2·0.8 = 0
Probability Distributions for Discrete RV
Probability Distributions for Discrete RV
Definition
The cumulative distribution function (cdf) F (x) of a discrete rv
X with pmf p(x) is defined for every number x by
X
F (x) = P(X ≤ x) =
p(y )
y :y ≤x
For any number x, F(x) is the probability that the observed value
of X will be at most x.
Probability Distributions for Discrete RV
Definition
The cumulative distribution function (cdf) F (x) of a discrete rv
X with pmf p(x) is defined for every number x by
X
F (x) = P(X ≤ x) =
p(y )
y :y ≤x
For any number x, F(x) is the probability that the observed value
of X will be at most x.
F (x) = P(X ≤ x) = P(X is less than or equal to x)
p(x) = P(X = x) = P(X is exactly equal to x)
Probability Distributions for Discrete RV
Probability Distributions for Discrete RV
Example 3.10 (continued):
0
0.4
F (y ) = 0.7
0.9
1
if
if
if
if
if
y <1
1≤y <2
2≤y <3
3≤y <4
y ≥2
Probability Distributions for Discrete RV
Example 3.10 (continued):
0
0.4
F (y ) = 0.7
0.9
1
if
if
if
if
if
y <1
1≤y <2
2≤y <3
3≤y <4
y ≥2
Probability Distributions for Discrete RV
Probability Distributions for Discrete RV
Example:
Assume we are drawing cards from a 100 well-shuffled cards with
replacement. We keep drawing until we get a ♠. Let α = P({♠}),
i.e. there are 100 · α ♠’s. Assume the successive drawings are
independent and define X = the number of drawings. The pmf
would be
(
(1 − α)x−1 · α x = 1, 2, 3, . . .
p(x) =
0
otherwise
Probability Distributions for Discrete RV
Example:
Assume we are drawing cards from a 100 well-shuffled cards with
replacement. We keep drawing until we get a ♠. Let α = P({♠}),
i.e. there are 100 · α ♠’s. Assume the successive drawings are
independent and define X = the number of drawings. The pmf
would be
(
(1 − α)x−1 · α x = 1, 2, 3, . . .
p(x) =
0
otherwise
Then for any positive interger x, we have
F (x) =
X
y ≤x
p(y ) =
x
x−1
X
X
(1 − α)(y −1) · α = α
(1 − α)y
y =1
(
1 − (1 − α)x
=
0
y =0
x ≥1
x <1
Probability Distributions for Discrete RV
Probability Distributions for Discrete RV
Example:
Assume we are drawing cards from a 100 well-shuffled cards with
replacement. We keep drawing until we get a ♠. Let α = P({♠}),
i.e. there are 100 · α ♠’s. Assume the successive drawings are
independent and define X = the number of drawings. The pmf
would be
Probability Distributions for Discrete RV
Probability Distributions for Discrete RV
pmf =⇒ cdf:
F (x) = P(X ≤ x) =
X
y :y ≤x
p(y )
Probability Distributions for Discrete RV
pmf =⇒ cdf:
F (x) = P(X ≤ x) =
X
y :y ≤x
It is also possible cdf =⇒ pmf:
p(y )
Probability Distributions for Discrete RV
pmf =⇒ cdf:
F (x) = P(X ≤ x) =
X
p(y )
y :y ≤x
It is also possible cdf =⇒ pmf:
p(x) = F (x) − F (x−)
where “x−” represents the largest possible X value that is strictly
less than x.
Probability Distributions for Discrete RV
Probability Distributions for Discrete RV
Proposition
For any two numbers a and b with a ≤ b,
P(a ≤ X ≤ b) = F (b) − F (a−)
where “a−” represents the largest possible X value that is strictly
less than a. In particular, if the only possible values are integers
and if a and b are integers, then
P(a ≤ X ≤ b) = P(X = a or a + 1 or . . . or b)
= F (b) − F (a − 1)
Taking a = b yields P(X = a) = F (a) − F (a − 1) in this case.
Probability Distributions for Discrete RV
Probability Distributions for Discrete RV
Example (Problem 23):
A consumer organization that evaluates new automobiles customarily
reports the number of major defects in each car examined. Let X denote
the number of major defects in a randomly selected car of a certain type.
The cdf of X is as follows:
0
x <0
0.06 0 ≤ x < 1
0.19 1 ≤ x < 2
0.39 2 ≤ x < 3
F (x) =
0.67 3 ≤ x < 4
0.92 4 ≤ x < 5
0.97 5 ≤ x < 6
1
x ≤6
Calculate the following probabilities directly from the cdf: (a)p(2),
(b)P(X > 3) and (c)P(2 ≤ X < 5).
0
You can add this document to your study collection(s)
Sign in Available only to authorized usersYou can add this document to your saved list
Sign in Available only to authorized users(For complaints, use another form )