W Io C B

advertisement
Chapter 4
Location-Scale-Based Parametric Distributions
William Q. Meeker and Luis A. Escobar
Iowa State University and Louisiana State University
4-1
Copyright 1998-2008 W. Q. Meeker and L. A. Escobar.
Based on the authors’ text Statistical Methods for Reliability
Data, John Wiley & Sons Inc. 1998.
December 14, 2015
8h 8min
Motivation for Parametric Models
• Complements nonparametric techniques.
• Parametric models can be described concisely with just a
few parameters, instead of having to report an entire curve.
• It is possible to use a parametric model to extrapolate (in
time) to the lower or upper tail of a distribution.
• Parametric models provide smooth estimates of failure-time
distributions.
4-3
In practice it is often useful to compare various parametric
and nonparametric analyses of a data set.
Functions of the Parameters-Continued
E(T ) =
Z ∞
0
tf (t; θ ) dt =
Z ∞
0
[1 − F (t; θ )] dt.
• The mean time to failure, MTTF, of T (also known as
expectation of T )
R
If 0∞ tf (t; θ ) dt = ∞, we say that the mean of T does not
exist.
Z ∞
SD(T )
.
E(T )
Var(T ).
q0
[t − E(T )]2f (t; θ ) dt
• The variance (or the second central moment) of T and the
standard deviation
Var(T ) =
SD(T ) =
• Coefficient of variation γ2
γ2 =
4-5
Chapter 4
Location-Scale-Based Parametric Distributions
Objectives
• Explain importance of parametric models in the analysis of
reliability data.
• Define important functions of model parameter that are of
interest in reliability studies.
• Introduce the location-scale and log-location-scale families
of distributions.
• Describe the properties of the exponential distribution.
4-4
4-2
• Describe the Weibull and lognormal distributions and the
related underlying location-scale distributions.
Functions of the Parameters
• Cumulative distribution function (cdf) of T
F (t; θ ) = Pr(T ≤ t), t > 0.
f (t; θ )
, t > 0.
1 − F (t; θ )
F (tp; θ ) ≥ p.
• The p quantile of T is the smallest value tp such that
• Hazard function of T
h(t; θ ) =
Location-Scale Distributions
Y belongs to the location-scale family of distributions if the
cdf of Y can be expressed as
y−µ
, −∞ < y < ∞
F (y; µ, σ) = Pr(Y ≤ y) = Φ
σ
where −∞ < µ < ∞ is a location parameter and σ > 0 is a
scale parameter.
Φ is the cdf of Y when µ = 0 and σ = 1 and Φ does not
depend on any unknown parameters.
Note: The distribution of Z = (Y − µ)/σ does not depend
on any unknown parameters.
4-6
Importance of Location-Scale Distributions
Importance due to:
• Most widely used statistical distributions are either members of this class or closely related to this class of distributions: exponential, normal, Weibull, lognormal, loglogistic,
logistic, and extreme value distributions.
• Methods of inference, statistical theory, and computer software generated for the general family can be applied to this
large, important class of models.
t > γ,
4-7
• Theory for location-scale distributions is relatively simple.
Exponential Distribution
For T ∼ EXP(θ, γ),
t−γ
F (t; θ, γ) = 1 − exp −
θ
1
t−γ
f (t; θ, γ) =
exp −
θ
θ
1
f (t; θ, γ)
= ,
h(t; θ, γ) =
1 − F (t; θ, γ)
θ
where θ > 0 is a scale parameter and γ is both a location and
a threshold parameter. When γ = 0 one gets the well-known
one-parameter exponential distribution.
E(T ) = γ + θ,
5
6
6
7
7
Var(T ) = θ 2.
f(t)
0.0
0.4
0.8
1.2
3
4
σ
5
5
5
µ
6
0.3
0.5
0.8
t
5
7
Probability Density Function
Examples of Normal Distributions
4
t
t
5
Hazard Function
4
4-9
Quantiles: tp = γ − θ log(1 − p).
Moments: For integer m > 0, E[(T − γ)m] = m! θm. Then
3
Cumulative Distribution Function
1
F(t) .5
0
20
15
h(t) 10
5
0
3
4 - 11
3.0
1.5
2.0
2.0
0.5
0.0
1.0
θ
0
0
0
γ
2.0
0.5
1.0
2.0
t
4-8
3.0
Probability Density Function
Examples of Exponential Distributions
2.0
Cumulative Distribution Function
1
t
f(t) 1.0
1.0
F(t) .5
0.0
t
Hazard Function
0.0
2.0
1.5
1.0
1.0
3.0
0
h(t)
0.5
0.0
Motivation for the Exponential Distribution
• Simplest distribution used in the analysis of reliability data.
• Has the important characteristic that its hazard function is
constant (does not depend on time t).
• Popular distribution for some kinds of electronic components (e.g., capacitors or robust, high-quality integrated
circuits).
• This distribution would not be appropriate for a population
of electronic components having failure-causing quality-defects.
4 - 10
• Might be useful to describe failure times for components
that exhibit physical wearout only after expected technological life of the system in which the component would be
installed.
−∞ < y < ∞.
Normal (Gaussian) Distribution
For Y ∼ NOR(µ, σ)
1
φnor
σ
y−µ
F (y; µ, σ) = Φnor
σ
y−µ
,
σ
f (y; µ, σ) =
√
Rz
where φnor (z) = (1/ 2π) exp(−z 2/2) and Φnor (z) = −∞
φnor (w)dw
are pdf and cdf for a standardized normal (µ = 0, σ = 1).
−∞ < µ < ∞ is a location parameter; σ > 0 is a scale parameter.
Var(Y ) = σ 2.
−1
−1
Quantiles: yp = µ + σΦnor
(p) where Φnor
(p) is the p quantile
for a standardized normal.
Moments: For integer m > 0, E[(Y − µ)m] = 0 if m is odd,
and E[(Y − µ)m] = (m)!σ m/[2m/2 (m/2)!] if m is even. Thus
E(Y ) = µ,
4 - 12
0.0
t
2.0
2.0
3.0
3.0
f(t)
0.0
0.4
0.8
1.2
0.0
1.0
σ
0
0
0
µ
2.0
0.3
0.5
0.8
t
4 - 13
3.0
Probability Density Function
Examples of Lognormal Distributions
1.0
1.0
Hazard Function
t
Cumulative Distribution Function
1
2
3
4
0
F(t) .5
h(t)
1
0
0.0
Motivation for Lognormal Distribution
• The lognormal distribution is a common model for failure
times.
• It can be justified for a random variable that arises from the
product of a number of identically distributed independent
positive random quantities.
• It has been suggested as an appropriate model for failure
time caused by a degradation process with combinations of
random rates that combine multiplicatively.
• Widely used to describe time to fracture from fatigue crack
growth in metals.
• Useful in modeling failure time of a population electronic
components with a decreasing hazard function (due to a
small proportion of defects in the population).
4 - 15
• Useful for describing the failure-time distribution of certain
degradation processes.
Smallest Extreme Value Distribution
For Y ∼ SEV(µ, σ),
y−µ
F (y; µ, σ) = Φsev
σ
1
y−µ
f (y; µ, σ) =
φsev
σ
σ 1
y−µ
h(y; µ, σ) =
exp
, −∞ < y < ∞.
σ
σ
Φsev (z) = 1 − exp[− exp(z)], φsev (z) = exp[z − exp(z)] are cdf
and pdf for standardized SEV (µ = 0, σ = 1). −∞ < µ < ∞
is a location parameter and σ > 0 is a scale parameter.
−1
Quantiles: yp = µ + Φsev
(p)σ = µ + log [− log(1 − p)] σ.
Mean and Variance: E(Y ) = µ − σγ, Var(Y ) = σ 2π 2/6,
where γ ≈ .5772, π ≈ 3.1416.
4 - 17
Lognormal Distribution
"
#
t > 0.
If T ∼ LOGNOR(µ, σ) then log(T ) ∼ NOR(µ, σ) with
log(t) − µ
F (t; µ, σ) = Φnor
σ
"
#
1
log(t) − µ
φnor
,
σt
σ
f (t; µ, σ) =
φnor and Φnor are pdf and cdf for a standardized normal.
exp(µ) is a scale parameter; σ > 0 is a shape parameter.
h
4 - 14
t
t
45
50
55
55
Hazard Function
40
t
1.0
1.5
1.5
60
2.0
2.0
f(t)
0.02
35
40
µ
55
σ
50
50
50
50
5
6
7
t
45
4 - 16
60
Probability Density Function
30
0.5
η
1.5
β
1
1
1
t
1.0
2.0
Probability Density Function
0.0
0.8
1.0
1.5
i
exp(σ 2) − 1 .
Quantiles: t = exp µ + σΦ−1 (p) , where Φ−1 (p) is the p
p
nor
nor
quantile for a standardized normal.
Moments: For integer m > 0, E(T m) = exp mµ + m2σ 2/2 .
E(T ) = exp µ + σ 2/2 , Var(T ) = exp 2µ + σ 2
50
0.06
Examples of Smallest Extreme Value Distributions
45
Cumulative Distribution Function
1
40
f(t) 0.04
35
F(t) .5
30
0.0
1.5
1.0
35
0.5
t
1.0
Hazard Function
0.5
2.5
2.0
1.5
1.0
0.5
0.0
Examples of Weibull Distributions
60
0
h(t)
0.5
0.0
30
0.0
Cumulative Distribution Function
1
2
3
4
0
F(t) .5
h(t)
1
0
0.0
4 - 18
!β−1
!β−1


Weibull Distribution
t
η
,
t>0
t
exp −
η
Common Parameterization:
β
f (t) =
η
t
η
!β 

t
F (t) = Pr(T ≤ t) = 1 − exp −
η
β
η
h(t) =

!β 
β > 0 is shape parameter; η > 0 is scale parameter.
0
Z ∞
!
!
!#
4 - 19
wκ−1 exp(−w)dw is the gamma function.
2
1
Var(T ) = η 2 Γ 1 +
− Γ2 1 +
β
β
"
Quantiles: tp = η [− log(1 − p)]1/β .
Moments: For integer m > 0, E(T m) = η mΓ(1+m/β). Then
Γ(κ) =
1
,
E(T ) = ηΓ 1 +
β
where
Note: When β = 1 then T ∼ EXP(η).
Motivation for the Weibull Distribution
• The theory of extreme values shows that the Weibull distribution can be used to model the minimum of a large
number of independent positive random variables from a
certain class of distributions.
◮ Failure of the weakest link in a chain with many links
with failure mechanisms (e.g., creep or fatigue) in each
link acting approximately independent.
◮ Failure of a system with a large number of components in
series and with approximately independent failure mechanisms in each component.
4 - 21
• The more common justification for its use is empirical: the
Weibull distribution can be used to model failure-time data
with a decreasing or an increasing hazard function.
−∞ < y < ∞.
Largest Extreme Value Distribution
When Y ∼ LEV(µ, σ),
y−µ
F (y; µ, σ) = Φlev
σ
1
y−µ
f (y; µ, σ) =
φlev
σ
σ
exp − y−µ
o,
n
h
σ i
−1
σ exp exp − y−µ
σ
h(y; µ, σ) =
where Φlev (z) = exp[− exp(−z)] and φlev (z) = exp[−z−exp(−z)]
are the cdf and pdf for a standardized LEV (µ = 0, σ = 1)
distribution.
−∞ < µ < ∞ is a location parameter and σ > 0 is a scale
parameter.
4 - 23
Alternative Weibull Parameterization

!β−1
t
η
!β

"
=
 = Φsev
!β 

exp −
t
η
h
#
i
log(t) − µ
1
φsev
σt
σ
"
log(t) − µ
σ
Note: If T ∼ WEIB(µ, σ) then Y = log(T ) ∼ SEV(µ, σ).
t
η
For T ∼ WEIB(µ, σ) then
β
η
F (t; µ, σ) = 1 − exp −
f (t; µ, σ) =
Φsev (z) = 1 − exp[− exp(z)].
φsev (z) = exp[z − exp(z)]
where σ = 1/β, µ = log(η), and
Quantiles:
−1 (p)
tp = η [− log(1 − p)]1/β = exp µ + σΦsev
#
4 - 20
−1
where Φsev
(p) is the p quantile for a standardized SEV (i.e.,
µ = 0, σ = 1).
40
0.06
30
0.02
0
10
σ
10
10
10
µ
30
5
6
7
t
20
40
Probability Density Function
Examples of Largest Extreme Value Distributions
30
Cumulative Distribution Function
1
t
f(t) 0.04
20
F(t) .5
10
0.0
0
t
20
Hazard Function
10
40
0
0.20
0.15
h(t)0.10
0.05
0.0
0
4 - 22
Largest Extreme Value Distribution - Continued
Quantiles: yp = µ − σ log [− log(p)] .
Mean and Variance: E(Y ) = µ + σγ, Var(Y ) = σ 2π 2/6,
where γ ≈ .5772, π ≈ 3.1416.
Notes:
• The hazard is increasing but is bounded in the sense that
limy→∞ h(y; µ, σ) = 1/σ.
• If Y ∼ LEV(µ, σ) then −Y ∼ SEV(−µ, σ).
4 - 24
5
5
15
t
15
20
20
25
25
0.25
0.20
0.15
f(t)
0.10
0.05
0.0
5
10
µ
20
σ
15
15
15
1
2
3
t
15
4 - 25
25
Probability Density Function
Examples of Logistic Distributions
10
Hazard Function
10
Cumulative Distribution Function
1
0
F(t) .5
h(t)
1.0
0.8
0.6
0.4
0.2
0.0
t
Logistic Distribution-Continued
p
−1
(p) = µ + σ log 1−p
Quantiles: yp = µ + σΦlogis
, where
−1
Φlogis
(p) = log[p/(1 − p)] is the p quantile for a standardized
logistic distribution.
4 - 27
Moments: For integer m > 0, E[(Y − µ)m] = 0 if m is odd,
h
i
P∞
m if m
and E[(Y − µ)m] = 2σ m (m!) 1 − (1/2)m−1
i=1 (1/i)
is even. Thus
E(Y ) = µ,
σ 2π 2
Var(Y ) =
.
3
Loglogistic Distribution
"
#
t > 0.
If Y ∼ LOGIS(µ, σ) then T = exp(Y ) ∼ LOGLOGIS(µ, σ)
with
log(t) − µ
F (t; µ, σ) = Φlogis
σ
"
#
1
log(t) − µ
f (t; µ, σ) =
φlogis
σt
σ
"
#
1
log(t) − µ
Φlogis
,
σt
σ
h(t; µ, σ) =
exp(µ) is a scale parameter; σ > 0 is a shape parameter.
Φlogis and φlogis are cdf and pdf for a LOGIS(0, 1).
4 - 29
Logistic Distribution
For Y ∼ LOGIS(µ, σ),
y−µ
F (y; µ, σ) = Φlogis
σ
1
y−µ
f (y; µ, σ) =
φlogis
σ
σ 1
y−µ
h(y; µ, σ) =
Φlogis
, −∞ < y < ∞.
σ
σ
−∞ < µ < ∞ is a location parameter; σ > 0 is a scale parameter.
φlogis(z) =
Φlogis(z) =
t
2
3
3
4
4
exp(z)
[1 + exp(z)]2
exp(z)
.
1 + exp(z)
f(t)
0.0
0.4
0.8
1.2
0
1
σ
0
0
0
µ
3
0.2
0.4
0.6
t
2
4 - 28
4
Probability Density Function
Examples of Loglogistic Distributions
1
t
2
Hazard Function
1
4 - 26
φlogis and Φlogis are pdf and cdf for a standardized logistic
distribution defined by
0
Cumulative Distribution Function
1
1.0
2.0
3.0
0
F(t) .5
h(t)
0.0
0
Loglogistic Distribution-Continued
h
i
−1
Quantiles: tp = exp µ + σΦlogis
(p) = exp(µ) [p/(1 − p)]σ .
Moments: For integer m > 0,
E(T m) = exp(mµ) Γ(1 + mσ) Γ(1 − mσ).
The m moment is not finite when mσ ≥ 1.
For σ < 1,
E(T ) = exp(µ) Γ(1 + σ) Γ(1 − σ),
and for σ < 1/2,
h
i
Var(T ) = exp(2µ) Γ(1 + 2σ) Γ(1 − 2σ) − Γ2(1 + σ) Γ2(1 − σ) .
4 - 30
Other Topics in Chapter 4
Pseudorandom number generation.
4 - 31
Topics in Chapter 5
• Parametric models with threshold parameters.
• Important distributions used in reliability that can not be
translated into location-scale distributions: gamma, generalized gamma, etc.
• Finite (discrete) mixture distributions
P
i ξi = 1
F (t; θ ) = ξ1F1(t; θ 1) + · · · + ξk Fk (t; θ k )
where ξi ≥ 0, and
• Compound (continuous) mixture distributions.
4 - 32
If failure-times of units in a population are EXP(η) with
1/η ∼ GAM(θ, κ), then the unconditional failure time, T , of
a unit selected at random from the population has a Pareto
distribution of the form
1
F (t; θ, κ) = 1 −
, t > 0.
(1 + θt)κ
Download