ppt - Department of Mathematics

advertisement
Chapter 4. Discrete Probability
Distributions
Section 4.2. Expected Values of Random
Variables
Jiaping Wang
Department of Mathematical Science
02/06/2013, Monday
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Outline
Expected Values
Variance and Standard Deviation
Some Properties
Tchebysheff’s Theorem
Homework #4
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Part 1. Expected Values
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Long-run Relative Frequency of
Occurrence
Suppose you and your friend are matching balanced coins. Each
of you flips a coin. If the upper faces match, you win $1; if they
do not match, you lose $1. So on average, how much will you
win per game in a long run?
From these flipping coins, you win +1 and -1
(lose money) in half of the time, so the
average is (-1)(1/2)+(1)(1/2) = 0.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Definition 4.4
Definition 4.4 The expected value of a discrete random
variable X with probability distribution p(x) is given as
(The sum is over all values of x for which p(x)>0)
We sometimes use the notation
E(X)=μ
for this equivalence.
Note: Not all expected values exist, the sum above must converge absolutely,
∑|x|p(x)<∞.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Theorem 4.1
Now if we change the wining function, g(x)=3x, which means, if coins are
matched, then you win $3, otherwise, you lose $3. So the expected value
becomes
(3)(1/2)+(-3)(1/2)=0
Theorem 4.1 If X is a discrete random variable with
probability p(x) and if g(x) is any real-valued function of
X, then
E(g(x)=∑g(x)p(x).
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Part 2. Variance and Standard
Deviation
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Definitions 4.5 and 4.6
The variance of a random variable X with expected value
μ is given by
V(X)=E[(X- μ)2]
Sometimes we use the notation
σ2 = E[(X- μ)2]
For this equivalence.
The standard deviation is a measure of variation that maintains the original units of
measure.
The standard deviation of a random variable is the square
root of the variance and is given by
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Cont.
Case I:
μ = (-1.0)(1/2)+(1)(1/2)=0;
σ2=(-1)2(1/2)+(1)2(1/2)=1.
Case II:
μ = (-1.5)(1/2)+(1)(1/4)+
(2)(1/4)=0;
σ2=(-1.5)2(1/2)+(1)2(1/4)+
1)2(1/4)=2.375.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Cont.
Age
Interval
Age
Midpoint
2000
2100
Under 5
3
6.9%
6.3%
μ=
3(0.069)+8(0.073)+…+(90)(0.033)
=36.6,
5-9
8
7.3%
6.2%
10-19
15
14.4%
12.8%
20-29
25
13.3%
12.3%
and σ2 = (3-36.6)2(0.069)+ (836.6)2(0.073) + … + (90-36.6)2(0.033)
= 510.76
30-39
35
15.5%
12.0%
40-49
45
15.3%
11.6%
50-59
55
10.8%
10.8%
Similarly, we can find for 2100,
μ = 42.5 and σ2 = 691.69
60-69
65
7.3%
9.8%
70-79
75
5.9%
8.3%
80 and
over
90
3.3%
9.9%
Consider the percentages as
probabilities, then for 2000
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Example 4.3
A department supervisor is considering purchasing a
photocopy machine. One consideration is how often
the machine will need repairs. Let X denote the number
of repairs during one year. Base on past performance,
the distribution of X is shown as follows
Number of
Repairs, x
0
1
2
3
p(x)
0.2
0.3
0.4
0.1
1. What is the expected number of repairs during a year?
2. What are the variance and standard deviation of the number of repairs during a
year?
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Solution
1. E(X)=∑xp(x) = (0)(0.2)+(1)(0.3)+(2)(0.4)+(3)(0.1)=1.4
2. V(X)= ∑(x-E(X))2p(x)
= (0-1.4) 2(0.2)+(1-1.4) 2(0.3)+(2-1.4) 2(0.4)+(3-1.4) 2(0.1)
= 0.84.
So the standard deviation is 0.92.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Part 3. Some Properties
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Theorem 4.2 For any random variable X and constants a
and b.
1. E(aX + b) = aE(X) + b
2. V(aX + b) = a2V(X)
Standardized random variable: If X has mean μ and
standard deviation σ, then
Y=(X – μ)/ σ
has E(Y)=0 and V(Y)=1, thus Y can be called the standardized random
variable of X.
Theorem 4.3 If X is a random variable with mean μ, then
V(X)= E(X2) – μ2
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Example 4.4
The department supervisor in Example 4.3 wants to
consider the cost of maintenance before purchasing
the photocopy machine. The cost of maintenance
consists of the expense of a service agreement and the
cost of repairs. The service agreement can be
purchased for $200. With the agreement, the cost of
each repair is $50. Find the mean, variance and
standard deviation of the annual costs of repair for the
photocopy machine.
Solution: Now the annual cost of the maintenance is 50X + 200. Recall that
E(X)=1.4 and V(X)=0.84 from Example 4.3, so we have
E(50X+200)=50E(X)+200=(50)(1.4)+200=270;
V(50X+200)=2500V(X)= (2500)(0.84)=2100, so the standard deviation is
$45.83.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Example 4.5
Use the results of Theorem 4.3 to compute the variance of
X as given in Example 4.3.
Number of
Repairs, x
0
1
2
3
p(x)
0.2
0.3
0.4
0.1
Solution: E(X)=1.4, E(X2)=∑x2p(x)=02(0.2)+12(0.3)+22(0.4)+32(0.1)=2.8
Then V(X)= E(X2)-E2(X) =2.8-1.56=0.84.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Part 4. Tchebysheff’s Inequality
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Theorem 4.4
Tchebysheff’s Theorem. Let X be a random variable with
mean μ and standard deviation σ. Then for any positive k,
P(|X – μ|/ σ < k) ≥ 1/k2
Consider Y=(X- μ)/ σ, then E(Y)=0 and V(Y)=1.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Example 4.6
The daily production of electric motors at a certain
factory averaged 120 with a standard deviation of
10.
1. What can be said about the fraction of days on
which the production level falls between 100 and
140?
2. Find the shortest interval certain to contain at least
90% of the daily production levels.
Solution: The interval from 100 to 140 means k=2 so that |X-μ|<2σ for μ=120
and σ=10. So based on Chebysheff’s inequality, we have 1-1/4=0.75, which
mean at least 75% of all days will have a total production values that falls in
this interval.
TO find k, we set (1-1/k2)=0.9 , then k=3.16, the interval will be from
120-(3.16)(10) to 120+(3.16)(10), or 88.4 to 151.6.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Example 4.7
The annual cost of maintenance for a certain photocopy
machine has a mean of $270 and a variance of $2100
(see example 4.4). The manager wants to budget
enough for maintenance that he is likely to go over the
budget amount. He is considering budgeting $400 for
maintenance. How often will the maintenance cost
exceed this amount?
Solution: Let X=400, so k=(400-270)/45.8=2.84, then the interval 270(2.84)(45.8) to 270 + (2.84)(45.8) or 140 to 400 contains at least 11/(2.84*2.84) = 0.88, thus at most 0.12 of the probability mass can exceed
$400.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Example 4.8
Suppose the random variable X has the probability mass
function given in the table
x
-1
0
1
p(x)
1/8
3/4
1/8
Evaluate Tchebysheff’s inequality for k=2.
Solution: Find mean E(X)=0, find V(X)=1/8 + 1/8=0.25, so σ = 0.5, thus
P(|X-E(X)|<2(0.5))=P(|X-E(X)|<1)=P(X=0)=3/4. By Chebysheff’s inequality, P(|XE(X)|<2 σ) ≥1-1/4=0.75.
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Homework #4
Page 100: 4.2
Page 101: 4.8
Page 102: 4.12
Page 116: 4.17
Page 118: 4.26
Page 119: 4.33
Page 120: 4.42
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL
Download