Lesson 4 Chapter 3: Random Variables and Their Distributions

advertisement
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
Lesson 4
Chapter 3: Random Variables and
Their Distributions
Michael Akritas
Department of Statistics
The Pennsylvania State University
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
1
PMF, CDF and PDF
2
Mean, Variance and Percentiles
3
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
Chapter Overview
The pmf describes the probability distribution of a discrete
X . This chapter introduces the cumulative distribution
function (cdf), and the probability density function (pdf).
Introduces more general notions of mean value, variance
and percentiles.
Introduces the most common probability models, for both
discrete and continuous random variables, and their use
for computing probabilities.
Read Section 3.2.1 for review.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The cumulative distribution function
The cumulative distribution function, or cdf, of a (discrete or
continuous) X is denoted by F or FX and is defined by
FX (x) = P([X ≤ x]).
Example
The pmf and cdf of a random variable X are shown below.
Solution:
x
FX (x)
pX (x)
0
0.4
0.4
Michael Akritas
1
0.7
0.3
2
0.9
0.2
3
1
0.1
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
1.0
0.8
0.6
0.4
0.2
0.0
0
1
2
3
4
5
6
Figure: The CDF of a Discrete Distribution is a Step or Jump Function
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
CDFs have the following properties:
1
If X is discrete, FX (x) is a jump function.
1
2
The jump points are the points in SX .
P(X = x) equals the size of the jump at x. Formally, this is
written as
pX (k ) = FX (k ) − FX (k − 1).
3
FX (x) =
X
pX (k ).
k ≤x
2
P(a < X ≤ b) = FX (b) − FX (a).
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Probability Density Function
Continuous random variable cannot have a pmf because
P(X = x) = 0, for any value x. (Why?!!)
Definition
The probability density function, or pdf, fX , of a continuous
random variable X is a nonnegative function with the property
that P(a < X < b) equals the area under it and above the
interval (a, b).
Thus,
P(a < X < b) =
area under fX
=
between a and b.
Michael Akritas
Z
b
fX (x)dx.
a
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Probability Density Function
Continuous random variable cannot have a pmf because
P(X = x) = 0, for any value x. (Why?!!)
Definition
The probability density function, or pdf, fX , of a continuous
random variable X is a nonnegative function with the property
that P(a < X < b) equals the area under it and above the
interval (a, b).
Thus,
P(a < X < b) =
area under fX
=
between a and b.
Michael Akritas
Z
b
fX (x)dx.
a
Lesson 4 Chapter 3: Random Variables and Their Distributions
0.3
0.4
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
0.0
0.1
f(x)
0.2
P(1.0 < X < 2.0)
-3
-2
-1
Michael Akritas
0
1
2
3
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
Common Shapes of PDFs
symmetric
bimodal
positively skewed
negatively skewed
A positively skewed distribution is also called skewed to the
right, and a negatively skewed is also called skewed to the
left.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Uniform Random Variable
Consider selecting a number at random from the interval
[0, 1] in such a way that any two subintervals of [0, 1] of
equal length are equally likely to contain the selected
number.
For example, the subintervals [0.3, 0.4] and [0.6, 0.7] are
equally likely to contain the selected number.
If X denotes the outcome of such a selection, then X is
said to have the uniform in [0, 1] distribution; this is
denoted by X ∼ U(0, 1).
Since we know the probability with which X takes value in
any interval, we know its distribution.
What pdf describes it?
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Uniform PDF
If X ∼ U(0, 1), its pdf is:
0.0
0.2
0.4
0.6
0.8
1.0
P(0.2 < X < 0.6)
0.0
0.2
0.4
Michael Akritas
0.6
0.8
1.0
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Uniform cdf
0.0
0.2
0.4
0.6
0.8
1.0
If X ∼ U(0, 1), its cdf is (why?):
0.0
0.2
0.4
Michael Akritas
0.6
0.8
1.0
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
Proposition
If X has pdf fX (x) and cdf FX (x), then
Z x
1
FX (x) =
fX (y)dy , and
−∞
2
d
fX (x) =
FX (y)|y=x
dy
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
Example
(a) If the life time T , measured in hours, of a randomly selected
electrical component has pdf fT (t) = 0, for t < 0, and
fT (t) = 0.001 exp(−0.001t), for t ≥ 0, find the probability the
component will last between 900 and 1200 hours of operation.
e be the life time, measured in minutes, of the randomly
(b) Let T
e.
selected electrical component. Find the pdf of T
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
Read Examples 3.2.5, 3.2.8, 3.2.9.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Mean for Discrete Random Variables
Definition
The expected value, E(X ) or µX , of a discrete random variable
X , having a possibly infinite sample space SX , with pmf
p(x) = P(X = x), for x ∈ SX , is defined as
X
xp(x).
µX =
x in SX
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
This definition coincides with that given in Chapter 1 if X is
obtained from simple random sampling from a finite
population.
Example
Let X be the number of heads in two flips of a coin. Find µX
with both definitions.
See also Example 3.3.1.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
Example
Select a product item from the production line and let X take
the value 1 or 0 as the product item is defective or not. Let p be
the proportion of defective items in the conceptual population of
this experiment. Find µX .
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
Example
Inspect items, as they come off the production line, until the first
defective item is found. Let X = number of items inspected,
and p be the proportion of defective items. Find µX .
Solution: p(x) = P(X = x) = (1 − p)x−1 p, for
x ∈ SX = {1, 2, 3, . . .}. According to the definition,
E(X ) =
X
xp(x) =
x in SX
∞
X
x=1
x(1 − p)x−1 p =
1
.
p
See Example 3.3.3 for the derivation of the above series.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Mean for Continuous Random Variables
Definition
If X has pdf f (x), its expected value is defined by
Z ∞
µX =
xf (x)dx.
−∞
Example
(a) If X ∼ U(0, 1), show that µX = 0.5.
(b) If the pdf of X is f (x) = 2x, for 0 < x < 1, and zero
otherwise, find E(X).
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
Mean Value of a function h(X ) of X
Proposition
1
If X is discrete with sample space SX ,
X
E(h(X )) =
h(x)pX (x)
x in SX
2
If X is continuous,
Z
∞
E(h(X )) =
h(x)f (x)dx.
−∞
3
If the function h(x) is linear, i.e., h(x) = ax + b, then
E(aX + b) = aE(X ) + b.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
Example
A bookstore purchases 3 copies of a book at $6.00 each, and
sells them for $12.00 each. Unsold copies are returned for
$2.00 each. Let X = {number of copies sold}, and Y = {net
revenue}. If the pmf of X is
x
pX (x)
0
0.1
1
0.2
2
0.2
3
0.5
find the expected value of Y .
Solution. Here Y = h(X ) = 12X + 2(3 − X ) − 18 = 10X − 12.
By part 3Pof the above proposition, E(Y ) = 10E(X ) − 12. But
E(X ) = x xpX (x) = 2.1. Thus, E(Y ) = 10(2.1) − 12 = 9.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
Example
1
2
3
You are given a choice between accepting 3.52 = 12.25$
or roll a die and win X 2 . What will you choose and why?
2 + 22 + · · · + n2 = n(n+1)(2n+1)
1 + 2 + · · · + n = n(n+1)
,
1
2
6
If X ∼ U(0, 1), find E(X 2 ).
Let Y ∼ U(A, B), i.e., Y has the uniform in (A, B]
distribution. Show that E(Y ) = (B + A)/2.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
Variance and Standard Deviation
General definition
of the variance of X .
h
i
σX2 = E (X − µX )2
σX2 = E(X 2 ) − [E(X )]2
σX =
q
σX2 .
Short-cut formula
for the variance X
Definition of
standard deviation
Var(a + bX ) = b2 σX2
Michael Akritas
Variance of a
linear transformation
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
Example
(a) Select a product from the production line and let X take the
value 1 or 0 as the product is defective or not. If p is the
probability that the selected item is defective, find Var(X ) in
terms of p.
(b) Roll a die and let X denote the outcome. Find Var(X ).
(c) If X ∼ U(0,1). Find Var(X ).
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
Percentiles
Definition
Let F be the cdf of the continuous r.v. X . The 100(1-α)th
percentile (or quantile) of X is the number, xα , with the property
F (xα ) = P(X ≤ xα ) = 1 − α.
x0.5 is the median and is also denoted by q2 .
x0.75 is also called the lower quartile, and denoted by q1 .
x0.25 is also called the upper quartile, and denoted by q3 .
For any given α, xα can be found by solving the equation
F (xα ) = 1 − α,
Michael Akritas
for xα .
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
Example
Let the cdf F (x) of the r.v. X be such that F (x) = 0 for x ≤ 0,
F (x) =
x2
, for x between 0 and 2, and F (x) = 1 for x > 2 .
4
Find the three quartiles (the 25th, 50th, and 75th percentiles).
Solution: Solving F (xα ) = 1 − α, for xα we find
√
xα2 /4 = 1 − α, or xα = 2 1 − α.
Using α = 0.75, 0.5, and 0.25, we obtain
√
√
√
q1 = 2 0.25 = 1, q2 = 2 0.5 = 1.41, q3 = 2 0.75 = 1.73.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
Example
If X ∼ U(A, B), find the median µ̃X .
Solution: The cdf of X is
F (x) = (x − A)/(B − A),
for A ≤ x ≤ B,
for x ≤ A, F (x) = 0, and for x ≥ B, F (x) = 1.
To find µ̃X we need to solve the equation
F (µ̃X ) = 0.5
The solution to it is
µ̃X = A + 0.5 × (B − A).
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
Definition
The interquartile range, abbreviated by IQR, is the distance
between the 25th and 75th percentile. Thus,
IQR = q3 − q1 .
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
1
PMF, CDF and PDF
2
Mean, Variance and Percentiles
3
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
A r.v. X is called Bernoulli if it takes only two values.
The two values are referred to as success (S) and failure
(F), or are re-coded as 1 and 0. Thus, always, SX = {0, 1}.
If P(X = 1) = p, we write X ∼ Bernoulli(p) to indicate that
X is Bernoulli with probability of success p.
The pmf and cdf of X are:
x
p(x)
F (x)
0
1−p
1−p
1
p
1
The expected value of X is, E(X ) = p
The variance of X is, σX2 = p(1 − p).
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
If X1 , X2 , . . . , Xn are independent Bernoulli(p) RVs then
Y =
n
X
Xi = the total number of 1s,
i=1
is a Binomial RV. We write Y ∼ Bin(n, p).
The pmf of Y is (R command: dbinom(x,n,p)):
n k
P(Y = k ) =
p (1 − p)n−k , k ∈ SY = {0, 1, . . . , n}
k
The cdf of Y in R: pbinom(x,n,p).
The expected value of Y is E(Y ) = np
The variance of Y is σY2 = np(1 − p)
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
0.30
Bin(20,p)
0.15
0.10
0.05
0.00
P(X=k)
0.20
0.25
p = 0.3
p = 0.5
p = 0.7
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
The Binomial CDF Tables
Example
Suppose 70% of all purchases in a certain store are made with
credit card. Let X denote the number of credit card uses in the
next 10 purchases. Find a) µX and σX2 , and b) P(5 ≤ X ≤ 8).
Solution. It seems reasonable to assume that X ∼ Bin(10, 0.7).
a) E(X ) = np = 10(0.7) = 7, σX2 = 10(0.7)(0.3) = 2.1.
b) Using Table A.1, we have
P(5 ≤ X ≤ 8) = P(4 < X ≤ 8) = F (8) − F (4)
= 0.851 − 0.047 = 0.804.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Example
A company sells screws in packages of 10 and offers a
money-back guarantee if two or more of the screws are
defective. If a screws is defective with probability 0.01,
independently of other screws, what proportion of the packages
sold will the company replace?
Solution: 1 − P(X = 0) − P(X = 1) ∼
= 0.004
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Example
In order for the defendant to be convicted in a jury trial, at least
eight of the twelve jurors must enter a guilty vote. Assume each
juror makes the correct decision with probability 0.7
independently of other jurors. If 40% of the defendants in such
jury trials are innocent, what is the proportion of correct
verdicts?
Solution: We want P(B) for B = {jury renders correct verdict}.
If A = {defendant is innocent} then,
P(B) = P(B|A)P(A) + P(B|Ac )P(Ac ) = P(B|A)0.4 + P(B|Ac )0.6.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Solution Continued: Next, let X denote the number of jurors
who reach the correct verdict in a particular trial. Thus,
X ∼ Bin(12, 0.7), and
P(B|A) = P(X ≥ 5) = 1 −
4 X
12
0.7k 0.312−k = 0.9905,
k
k=0
12 X
12
c
P(B|A ) = P(X ≥ 8) =
0.7k 0.312−k = 0.724.
k
k=8
It follows that,
P(B) = P(B|A)0.4 + P(B|Ac )0.6 = 0.8306.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
1
PMF, CDF and PDF
2
Mean, Variance and Percentiles
3
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
The hypergeometric distribution arises when a simple random
sample of size n is taken from a finite population of N units of
which M are labeled 1 and the rest are labeled 0.
The number X of units labeled 1 in the sample is a
hypergeometric random variable with parameters n, M and N.
This is denoted by X ∼ Hypergeo(n, N, M)
If X ∼ Hypergeo(n, N, M), its pmf is
M N−M
P(X = x) =
x
n−x
N
n
. (In R: dhyper(x, M, N-M, n))
Note that P(X = x) = 0 if x > M, or if n − x > N − M.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
0.4
Hypergeometric(M1, 60 − M1, 10)
0.2
0.1
0.0
P(X=k)
0.3
M1 = 15
M1 = 30
M1 = 45
0
2
4
Michael Akritas
6
8
10
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
If X ∼ Hypergeo(n, N, M) then,
Its expected value is: µX = n M
N
M N −n
M
2
Its variance is: σX = n
1−
N
N N −1
N −n
is called finite population correction factor
N −1
Binomial Approximation to Hypergeometric
Probabilities
If n/N ≤ 0.05, or 0.1, then
P(X = x) ' P(Y = x), where Y ∼ Bin(n, p = M/N).
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
Example (Illustration of the binomial approximation)
We will contrast P(X = 2) for X ∼ Hypergeo(n = 10, N, M),
when M/N = 0.25, with its binomial approximation
P(Y = 2) = 0.282 for Y ∼ Bin(n = 10, p = 0.25).
1
If N = 20 and M = 5, then (R command:
, dhyper(2,5,15,10))
5 15
P(X = 2) =
2
2
20
10
If N = 100 and M = 25, then
25
P(X = 2) =
Michael Akritas
2
8
= 0.348.
75
8
100
10
= 0.292,
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Example
12 refrigerators have been returned due to a high-pitched
noise. 4 have a defective compressor and the rest less serious
problems. 6 are selected at random for problem identification.
Let X = # found with defective compressor. Give the sample
space of X , and find P(X = 3) as well as E(X ) and Var(X ).
Solution. Here N = 12, n = 6, M = 4. Thus, the possible
values of X are SX = {0, 1, 2, 3, 4}.
4 8
P(X = 3) =
E(X ) = 6
3
12
6
3 = 0.2424.
1 2 12 − 6
4
8
= 2, Var(X ) = 6
=
.
12
3 3 12 − 1
11
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
Applications of the Hypergeometric Distribution
Example (Quality Control)
A company buys electrical components in batches of size 10.
From each batch, 3 components are checked at random and if
all 3 are nondefective the batch is accepted. If 30% of the
batches have 4 defective components and 70% have only 1,
what proportion of batches does the company accept?
Solution: Let A be the event a batch is accepted.
P(A) = P(A|4 defectives)0.3 + P(A|1 defective)0.7
1 9
4 6
=
0
3 0.3 +
0
10
3
Michael Akritas
3 0.7 = 0.54.
10
3
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Example (The Capture/Recapture Method)
This method is used to estimate the size N of a wildlife
population. Suppose that 10 animals are captured, tagged and
released. On a later occasion, 20 animals are captured. Let X
N
be the number of recaptured animals. If all 20
possible groups
are equally likely, X is more likely to take small values if N is
large. The precise form of the hypergeometric pmf can be used
to estimate N from the value that X takes.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
1
PMF, CDF and PDF
2
Mean, Variance and Percentiles
3
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
In the negative binomial experiment, a Bernoulli
experiment is repeated independently until the r th 1 is
observed.
For example, products are inspected, as they come off the
assembly line, until the r th defective is found.
The number, X , of Bernoulli trials until the r th 1 is
observed is the negative binomial r.v.
If p is the probability of 1 in a Bernoulli trial, we write
X ∼ NBin(r , p)
If r = 1, X is called the geometric r.v.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
If X ∼ NBin(r , p), then
Its pmf is (R command: dnbinom(x-r, r, p))
x −1 r
P(X = x) =
p (1 − p)x−r , x = r , r + 1, . . .
r −1
Its cdf in R: pnbinom(x-r, r, p)
Its expected value is:
E(X ) =
Its variance is:
σX2 =
Michael Akritas
r
p
r (1 − p)
p2
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
Michael Akritas
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Example
Two athletic teams, A and B, play a best-of-three series of
games. Suppose team A is the stronger team and will win any
game with probability 0.6, independently from other games.
Find the probability that the stronger team will be the overall
winner.
Solution: Let X be the number of games needed for team A to
win twice. Then X ∼ NBin(2, 0.6). Team A will win the series if
X = 2 or X = 3. Thus,
P(Team A wins the series) = P(X = 2) + P(X = 3)
1
2
2
2−2
=
0.6 (1 − 0.6)
+
0.62 (1 − 0.6)3−2
1
1
= 0.36 + 0.288 = 0.648 (Found also in R: pnbinom(1,2,0.6))
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
1
PMF, CDF and PDF
2
Mean, Variance and Percentiles
3
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
A RV X with SX = {0, 1, 2, . . .} is a Poisson RV with
parameter λ, X ∼ Poisson(λ), if its pmf is
p(x) = P(X = k) = e−λ
λx
, x = 0, 1, 2, . . . ,
x!
for some λ > 0. (R command for this pmf: dpois(x,
lambda))
P∞
x=0
p(x) = 1 follows from eλ =
P∞
k=0 (λ
k
/k!).
Its cdf in R: ppois(x, lambda)
µX = λ, σX2 = λ.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
0.1
0.2
0.3
λ=1
λ=4
λ = 10
0.0
P(X=k)
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
0
5
Michael
Akritas
10Lesson 4 Chapter
15 3: Random Variables
20
and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
The Poisson random variable X can be:
1
the number of fish caught by an angler in an afternoon,
2
the number of new potholes in a stretch of I80 during the
winter months,
3
the number of disabled vehicles abandoned in I95 in a
year,
4
the number of earthquakes (or other natural disasters) in a
region of the United States in a month,
5
the number of wrongly dialed telephone numbers in a
given city in an hour,
6
the number of freak accidents, such as falls in the shower,
in a given time period.
7
the number of hits in a website in a day.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
In general, the Poisson distribution is used to model the
probability that a number of certain events occur in a
specified period of time (or distance, area or volume).
The events must occur at random and at a constant rate.
The occurrence of an event must not influence the timing
of subsequent events (i.e. events occur independently).
Its earliest use dealt with the number of alpha particles
emitted from a radioactive source in a given period of time.
Current applications include areas such as insurance
industry, tourist industry, traffic engineering, demography,
forestry and astronomy.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Example (Illustrative use of Table A.2)
Let X ∼ Poisson(5). Find: a) P(X ≤ 5), b) P(6 ≤ X ≤ 9), and
c) P(X ≥ 10).
Solution. a) P(X ≤ 5) = F (5) = 0.616.
b) Write
P(6 ≤ X ≤ 9) = P(5 < X ≤ 9) = P(X ≤ 9) − P(X ≤ 5)
= F (9) − F (5) = 0.968 − 0.616.
c) Write
P(X ≥ 10) = 1 − P(X ≤ 9) = 1 − F (9) = 1 − 0.968.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Example
Suppose the number of colds a person contracts in a year is a
Poisson random variable. Persons taking Vitamin C
supplements contract an average of 3 colds per year, and
persons not taking the supplements contract an average of 5.
1
Find the probability of no more than two colds for a person
taking, and for a person not taking, Vitamin C supplements.
2
Suppose 70% of the population takes Vitamin C. Find the
probability that a randomly selected person will have no
more than two colds in a given year.
3
Suppose that a randomly selected person contracts no
more than two colds in a given year. What is the probability
that he/she takes Vitamin C supplements?
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Proposition (Poisson Approximation to Binomial Probabilities)
If Y ∼ Bin(n, p), with n ≥ 100, p ≤ 0.01, and np ≤ 20, then
P(Y ≥ k) ' P(X ≥ k), k = 0, 1, 2, . . . , n,
where X ∼ Poisson(λ = np).
The enormous range of applications of the Poisson
distribution is due to this proposition. Read the two
paragraphs following the proof of Proposition 3.4.1 on
page 175.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Example
For the following 4 binomial random variables np = 3:
a) Y1 ∼ Bin(9, 1/3), b) Y2 ∼ Bin(18, 1/6),
c) Y3 ∼ Bin(30, 0.1), d) Y4 ∼ Bin(60, 0.05).
Compare the P(Yi ≤ 2) with P(X ≤ 2) where X ∼ Poisson(3).
2
Comparison: First, P(X ≤ 2) = e−3 1 + 3 + 32 = 0.4232.
Next,
a) P(Y1 ≤ 2) = 0.3772, b) P(Y2 ≤ 2) = 0.4027,
c) P(Y3 ≤ 2) = 0.4114, d) P(Y4 ≤ 2) = 0.4174.
Note: The conditions of the Proposition on n and p are not
satisfied for any of the four binomial RVs.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Example
Due to a serious defect, n = 10, 000 cars are recalled. The
probability that a car is defective is p = 0.0005. If Y is the
number of defective cars, find: (a) P(Y ≥ 10), and (b)
P(Y = 0).
Solution. Here Y ∼ Bin(10, 000, 0.0005), and all conditions of
the above Proposition are met. Thus, if X ∼ Poisson(np = 5),
(a) P(Y ≥ 10) ' P(X ≥ 10) = 1 − P(X ≤ 9) = 1 − 0.968.
(b) P(Y = 0) ' P(X = 0) = e−5 = 0.007.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Definition
The number of occurrences as a function of time, X (t), t ≥ 0,
is called a Poisson process with rate α per unit time, if the
following assumptions are satisfied.
1
The probability of exactly one occurrence in a short time
period of length h is approximately αh.
2
The probability of more than one occurrence in a short
time period is approximately 0.
3
The number of occurrences in nonoverlapping time
intervals are mutually independent.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
The parameter α in the first assumption specifies the ’rate’ of
the occurrences, i.e. the average number of occurrences per
time unit.
Proposition
Let X (t), t ≥ 0 be a Poisson(α) process.
1
For each fixed t0 , X (t0 ) ∼ Poisson(λ = α × t0 ). Thus,
P(X (t0 ) = k ) = e−αt0
2
(αt0 )k
, k = 0, 1, 2, · · ·
k!
If t1 < t2 are two positive numbers, then
X (t2 ) − X (t1 ) ∼ Poisson(α × (t2 − t1 ))
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Example
Continuous electrolytic inspection of a tin plate yields on
average 0.2 imperfections per minute. Find:
1
The probability of one imperfection in three minutes
2
The probability of at most one imperfection in 0.25 hours.
Solution. 1) Here α = 0.2, t = 3, λ = αt = 0.6. Thus,
P(X (3) = 1) = F (1; 0.6) − F (0; 0.6) = .878 − .549 = .329.
2) Here α = 0.2, t = 15, λ = αt = 3.0. Thus,
P(X (15) ≤ 1) = F 1; 3.0 = .199.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Example (Example 3.4.16, p. 180)
People enter a department store according to a Poisson
process with rate α per hour. It is known that 30% of those
entering the store will make a purchase of $50.00 or more. Find
the probability mass function of the number of customers who
will make purchases of $50.00 or more during the next hour.
ANSWER: Poisson(0.3α). (Proof omitted.)
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
1
PMF, CDF and PDF
2
Mean, Variance and Percentiles
3
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Normal distribution if the most important distribution in
probability and statistics.
X ∼ N(µ, σ 2 ) if its pdf is
f (x; µ, σ 2 ) = √
1
2πσ 2
e
−
(x−µ)2
2σ 2
, −∞ < x < ∞.
The cdf, F (x; µ, σ), does not have a closed form
expression.
R command for f (x; µ, σ 2 ): dnorm(x,µ,σ).
For example, dnorm(0,0,1) gives 0.3989423, which is the
value of f (0; µ = 0, σ 2 = 1).
R command for F (x; µ, σ 2 ): pnorm(x,µ,σ).
For example, pnorm(0,0,1) gives 0.5, which is the value of
F (0; µ = 0, σ 2 = 1).
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
The Standard Normal Distribution
When µ = 0 and σ = 1, X is said to have the standard normal
distribution and is denoted, universally, by Z . The pdf of Z is
1
2
φ(z) = √ e−z /2 , −∞ < z < ∞.
2π
The cdf of Z is denoted by Φ. Thus
Z z
Φ(z) = P(Z ≤ z) =
φ(x)dx.
−∞
Φ(z) has no closed form expression, but is tabulated in Table
A.3
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Historical Notes
It was discovered by Abraham DeMoivre in 1733, for
approximating binomial probabilities when n is large. He
called it the exponential bell-shaped curve.
DeMoivre was the first statistical consultant working out of
”Slaughter’s Coffee House”, a betting shop in Long Acres,
London.
In 1803, Karl Friedrich Gauss used it for predicting the
location of astronomical objects. Because of this it became
known as the Gaussian distribution.
By the late 19th century, statisticians had noted that most
data sets would have approximately bell-shaped
histograms. It came to be accepted that it was ”normal” for
any well-behaved data set to follow this curve. So the
Gaussian curve became the normal curve.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Figure: One side of the 10 Mark bill
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Basic Property of the Normal Distribution
Proposition
If X ∼ N(µ, σ 2 ), then
1
E(X ) = µ.
2
Var(X ) = σ 2 .
3
For an real numbers a, b
Y = a + bX ∼ N(a + bµ, b2 σ 2 ).
For example, if X ∼ N(4, 9) then
Y = 5 + 2X ∼ N(13, 36)
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Corollary
1
If Z ∼ N(0, 1), then X = µ + σZ ∼ N(µ, σ 2 ).
2
If X ∼ N(µ, σ 2 ), then
3
If X ∼ N(µ, σ 2 ), then
X −µ
∼ N(0, 1).
σ
xα = µ + σzα ,
Z =
where xα and zα denote the percentiles of X and Z (see figure
in next slide).
The corollary implies that probabilities and percentiles of
any normal random variable can be computed from
corresponding probabilities and percentiles of Z .
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
0.2
f(x)
0.3
0.4
Figure of the Standard Normal Percentile
0.0
0.1
area=alpha
z_alpha
-3
-2
-1
0
1
2
3
Normal percentiles in R: qnorm(p,µ,σ). For example,
qnorm(0.95,0,1) gives 1.644854, which is the value of z0.05 .
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Finding Probabilities via the Standard Normal Table
In Table A.3, z-values are identified from the left column, up to
the first decimal, and the top row, for the second decimal. Thus,
1 is identified by 1.0 in the left column and 0.00 in the top row.
Example (The 68-95-99.7% Property.)
Let Z ∼ N(0, 1). Then
1
P(−1 < Z < 1) = Φ(1) − Φ(−1) = .8413 − .1587 = .6826.
2
P(−2 < Z < 2) = Φ(2) − Φ(−2) = .9772 − .0228 = .9544.
3
P(−3 < Z < 3) = Φ(3) − Φ(−3) = .9987 − .0013 = .9974.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Example
Let X ∼ N(1.25, 0.462 ). Find a) P(1 ≤ X ≤ 1.75), and b)
P(X > 2).
X − 1.25
Solution. Use Z =
∼ N(0, 1) to express these
0.46
probabilities in terms of Z . Thus,
a)
1 − 1.25
X − 1.25
1.75 − 1.25
P(1 ≤ X ≤ 1.75) = P
≤
≤
.46
.46
.46
= P(−.54 < Z <1.09) = Φ(1.09)
− Φ(−.54) = .8621 − .2946.
2 − 1.25
b) P(X > 2) = P Z >
= 1 − Φ(1.63) = .0516.
.46
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
The 68-95-99.7% Property
The 68-95-99.7% rule applies for any normal random variable
X ∼ N(µ, σ 2 ):
P(µ − 1σ < X < µ + 1σ) = P(−1 < Z < 1) = 0.6826,
P(µ − 2σ < X < µ + 2σ) = P(−2 < Z < 2) = 0.9544,
P(µ − 3σ < X < µ + 3σ) = P(−3 < Z < 3) = 0.9974.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Finding Percentiles via the Standard Normal Table
To find zα , one first locates 1 − α in the body of Table A.3 and
then reads zα from the margins. If the exact value of 1 − α does
not exist in the main body of the table, then an approximation is
used as described in the following.
Example
Find z0.05 , the 95th percentile of Z .
Solution. 1 − α = 0.95 does not exist in the body of the table.
The entry that is closest to, but larger than 0.95 (i.e. 0.9505),
corresponds to 1.64. The entry that is closest to, but smaller
than 0.95 (which is 0.9495), corresponds to 1.65. We
approximate z0.05 by averaging 1.64 and 1.65: z.05 ' 1.645.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Example
Let X denote the weight of a randomly chosen frozen yogurt
cup. Suppose X ∼ N(8, .462 ). Find the value c that separates
the upper 5% of weight values from the lower 95%.
Solution. This is another way of asking for the 95-th percentile,
x.05 , of X . Using the formula xα = µ + σzα , we have
x.05 = 8 + .46z.05 = 8 + (.46)(1.645) = 8.76.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Example
A message consisting of a string of binary (either 0 or 1)
signals is transmitted from location A to location B. Due to
channel noise, however, when x is sent from A, B receives
y = x + e, where e ∼ N(0, 1) represents the noise. To minimize
error, location A sends x = 2 for 1 and x = −2 for 0. Location B
decodes the received signal y as 1, if y ≥ 0.5 and as 0 if
y < 0.5. Find the probability of an error in the decoded signal.
Solution. If E = signal is decoded incorrectly,
P(B|signal is 1) = P(x + e < 0.5|x = 2) = P(e < −1.5) = 0.0668,
P(B|signal is 0) = P(x + e ≥ 0.5|x = −2) = P(e ≥ 2.5) = 0.0062.
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Outline
PMF, CDF and PDF
Mean, Variance and Percentiles
Some Common Distributions
The Bernoulli and Binomial Random Variables
The Hypergeometric Random Variable
The Geometric and Negative Binomial Random Variables
The Poisson Random Variable and Process
The Normal Distribution
Go to previous lesson http://www.stat.psu.edu/
˜mga/401/course.info/lesson3.pdf
Go to next lesson http://www.stat.psu.edu/˜mga/
401/course.info/lesson5.pdf
Go to the Stat 401 home page http:
//www.stat.psu.edu/˜mga/401/course.info/
Michael Akritas
Lesson 4 Chapter 3: Random Variables and Their Distributions
Download