Probability distributions

advertisement
SUMMARY 1
Probability Distributions
The probability distribution of a discrete random variable lists all the possible values that the random
variable can assume and their corresponding probabilities.
It is useful to have standard probability models for certain types of problems. Some of the most
important probability models are:
For discrete variables:
1. The binomial probability distribution
2. The Poisson distribution
3. The Geometric distribution
4. The multinomial distribution
5. The Hyper-geometric distribution
For continuous variables:
1. Normal distributions
2. The sampling distribution of the mean
3. The student's t distribution
4. The Chi-square distribution
5. The F distribution
Binomial distribution Model:
Characteristics
1. There is a fixed number of trials n.
2. Each trial will result in either a success or a failure.
3. The probability of success p for each trial remains the same from trial to trial.
(q is the probability of failure and q = 1- p)
4. The trials are independent.
5. The variable x measures the number of successes in n trials.
( x=0, 1, 2, 3, . . . , n)
Properties:
1. The variable x is a discrete variable.
2. The expected value or average of x is given by   E(x)  np
3. The variance of the variable x can be obtain from the formula
  npq
4. Binomial probabilities can be obtained by using the formula below or by using tables or
calculators.
2
binom (n, p; x) 
n!
p xqn  x , q  n  x
x! n  x !
5. Binomial probability distributions are said to have two parameters: n and p.
Example. A baseball player has a batting average of 0.25. What is the probability that he will have 0
hits in 10 trips to the plate, 3 hits in 10 trips to the plate?
10!
0.25 0 (0.75)10  0.056
0!10!
10!
binom ( 10, 0.25, 3) 
0.253 (0.75) 7  0.2502822876
3!7!
binom (10, 0.25; 0) 
Example. In the previous problem, what is the average and the standard deviation for the number of
hits in 10 trips to the plate.
  np  10(0.25)  2.5 and   npq  10(0.25)(0.75)  1.369
-1-
Geometric probability distributions:
The variable x measures the number of trials needed to obtain the first success. Geometric
probabilities can be obtained from the formula
geometric( p, x)  q x 1 p,
  E ( x) 
1
1 p
, 2 
p
p2
Example. A medical diagnostic test could fail to detect a certain disease. Suppose that a given patient
has a certain disease and that the probability it will be detected by a certain test is 0.2. What is the
probability that a positive result will not be obtain until the fifth test.
4
x 1
p  0.8 (0.2)  0.08182
1
1
Note: it will take an average of  

 5 tests before a positive result is obtained from a
p 0.2
geometric( 0.2, 5)  q
patient with the disease. The standard deviation is

1 p
p2

1  0.2
(0.2) 2
 4.47
Poisson Distribution:
In a Poisson distribution we are given the expected number of successes  =E(x) and no limit on
the
number of trials.
 x e 
Poisson probabilities values are obtained from the formula P( x) 
, e  2.718
x!
   and   
2
Example: The A & P company receives 5.3 orders a day. Use the Poisson distribution to find the
probability that the company will receive exactly 7 orders.
7
x 
 5.3
Poison ( x  7) 
 e
x!

5.3
( 2.718)
7!
 0.1164
Poisson approximation to the binomial model:
When n is large and p is small a good approximation to binomial probabilities can be obtained
by using the Poisson distribution with  = u = np
Example. In a binomial problem n = 120 and p = 0.015. Use the Poisson approximation to the
binomial to find the probability of exactly 3 successes.
3
x 
1.8
1.8 (2.718)
 e

 0.1607005084
x!
3!
Note : binom (120, 0.015; 3)  0.1617229705 which is a good approximat ion .
  np  120(0.015)  1.8, Poisson( x  3) 
-2-
Hyper-geometric distributions: to determine the number of successes x in n dependent trials when
sampling without replacement from a population of size N with contains M successes.
Population N
Sample
M
M
successes
ess
size n
x successes
successe
s
The probability of x successes in a sample of size n taken from a p0pulation of size N with M
successes is given by the formula
 M  N  M 
 

x  n  x   M C x    N  M C n  x 

P( x) 

,
N
N Cn
 
n 
  E ( x) 
nM
nM  N  n ( N  M )
and  2 

N  1
N
N2
Example. In a lot of 20 radios been inspected there are 4 defective items. If an inspector inspects 5
radios, what is the probability that 2 of them are defective. N=20, M=4, N-M=16, n-5, x=2, n-x=3
 4 16 
  
2 3
6  560
P( x  2)     
 0.2167182663
15504
 20 
 
5 
Normal Distributions:
1. There are infinitely many normal distributions, one for each pair of combinations
of the two parameters  and  .
2. The normal distribution with  = 0 and  = 1 is called the standard normal distribution.
3. The area below any normal distribution is 1.
4. Probability = Area under the curve = proportion.
5. The variable x is a continuous variable.
6. Every normal distribution can be standardized by converting x to the z-scale.
7. How to find probabilities under a normal curve using the tables (is the same as finding areas.)
Step 1. Convert x to z by using the formula z 
x

. Round z to two decimal places
Step 2. Using z and the standard normal distribution table, find the area.
Step 3. Carefully conclude, answer the question precisely.
8. To find the value of x that corresponds to a given area or proportion,
Step 1. Carefully read the problem and draw a diagram indicating the given area
Step 2. Using the appropriate table area, find the corresponding z-value.
Step 3. Use the formula x    z to obtain x.
9. When n is large we usually compute binomial probabilities by using the normal approximation with
 = np,  2 = npq and a +0.5 continuity correction factor on x. This also provides a fairly good
approximation even when n is small, provided p is reasonable close to 1/2.
-3-
10. Solving problems using the TI-83
Problem- The variable X is normally distributed with   48 and   3.1
a) Find the ordinate (the y-value on the normal curve) when x=55
 2nd DISTR normalpdf (55, 48, 3.1)
answer: 0.0100541648
b) the ordinate (the y-value on the normal curve) at the mean value
 2nd DISTR normalpdf (48, 48, 3.1)
answer: 0.1286910582
c) Find the probability that a value is greater than 55?
 2nd DISTR normalcdf (55, 10^9, 48, 3.1)
answer: 0.0119707811
d) Find the probability that x is between 40 and 51?
 2nd DISTR normalcdf (40, 51, 48, 3.1)
answer: 0.8284825481
e) Find the value of x above which there is a 0.43 probability.
 2nd DISTR invNorm(0.57,48, 3.1)
answer: 48.54675989
f) Find the 90th percentile of the distribution.

2nd DISTR invNorm(0.90, 48, 3.1)
answer: 51.97280986
Problem. In a binomial problem n=45, p=0.6
a) Find the probability of exactly 30 successes
 2nd DISTR binompdf(45, 0.6, 30)
answer: 0.0818633582
b) Find the probability of at most 30 successes
 2nd DISTR binompcdf(45, 0.6, 30)
answer: 0.856960038
c) Find the probability of at least 30 successes
 1 - 2nd DISTR binomcdf(45, 0.6, 30)
answer: 0.143039962
Problem. In a Poisson problem with mean equal to 4.2 successes.
a) Find the probability of less than 5 successes
 2nd DISTR poissoncdf(4.2, 4)
answer: 0.5898270221
b) Find the probability of more than 5 successes
 2nd DISTR 1 - poissoncdf(4.2, 5)
answer: 0.2468571111
Problem. In a binomial problem n=1500 and p=0.12
a) Find the probability of at least 170 successes
 2nd DISTR 1 - binomcdf (1500, 0.12, 169)
answer: 0.7970876646
b) Use the Poisson approximation with   np  1500  0.12  180 to find the probability of at least
170 successes.
 2nd DISTR 1 - poissoncdf(180, 169)
answer: 0.7816766956
c) Use the Normal approximation with
  np  1500  0.12  180 ,   npq  1500(0.12)(0.88)  12.5857 and the 
correction factor to find probability of at least 170 successes.
 2nd DISTR normalcdf(169.5, 10^9, 180, 12.5857)
-4-
continuity
answer: 0.7979384958
Download