Chapter 10A Planning Tests to Compare Populations or Processes William Q. Meeker and Luis A. Escobar Iowa State University and Louisiana State University µ1 µ1 10A - 3 10A - 1 Copyright 1998-2014 W. Q. Meeker and L. A. Escobar. Complements to the authors’ text Statistical Methods for Reliability Data, John Wiley & Sons Inc., 1998. December 14, 2015 8h 9min µ2 ∆ Comparison of 2 Location Parameters d= ∆ /σ µ2 Tests Comparing k Populations or Processes • Samples of equal size n to be taken from each population. • If interest centers on the lower tail of the distribution, there is little reason to wait for all units to fail (in a life test). • In some situations data may be censored. For example ◮ Testing all units from all populations simultaneously until a given censoring time tc results in Type I censored data. ◮ Testing each population (perhaps sequentially because of a limitation in test positions) until a given number of failures occurs results in Type II censored data. 10A - 5 Chapter 10A Planning Comparison of Populations or Processes Objectives • Describe general issues in planning two or more processes or populations. • Describe the planning to compare two population means or quantiles assuming that the population variances are equal. 10A - 2 • Describe generalization of the procedures to compare populations with different variances. Comparing Populations or Processes • Product design decisions often require choosing the best from among k different populations or processes. log(t) − µi . σ ! • Suppose that response (e.g., failure time or strength) from population i follows a log-location-scale (e.g., Weibull or lognormal) distribution F (t; µi, σ) = Φ • Parameter µi varies; constant σ. • Specific interest in choosing the population with the largest distribution quantile value (tp)i = exp[µi + σΦ−1(p)], i = 1, . . . , k among the k populations. 10A - 4 • With constant σ the ordering of the (tp)i values is the same as the ordering of the µi values. Tests Comparing k Populations or Processes–Continued • To select the population with the largest quantile, if σ is the same in each group, choose the population with b i. To plan the test, supposing that µ1 > the largest µ max {µ2, . . . , µk }, one would want an assessment of b 1 > max {µ b 2, . . . , µ b k }) . Pr(CS) = Pr(Correct Selection) = Pr (µ 10A - 6 µ4 µ5 µ3 ∆ Comparison of 5 Location Parameters µ2 d= ∆ /σ µ2 = µ3 = µ4 = µ5 µ1 µ1 10A - 7 Probability of Correct Selection when Choosing the b i Based on Type II Censored Data Largest µ from k Populations • With Type II censoring resulting in r of n failures from each population and any log-location-scale distribution, there is a relatively simple expression for Pr(CS). i • For the case µ1 > µ2 = µ3 = · · · = µk , which is the least favorable situation to choose the correct population, the probability of correct selection simplifies to b 1 > max {µ b 2, . . . , µ b k }) Pr(CS) = Pr (µ o n h , General Formula to Compute Pr(CS) when Selecting the Population with the Largest µ • Notation: b 1, . . . , µ bk) joint distribution of (µ b 1) : marginal of µ b1 f (µ b 1, . . . , µ bk) : f (µ b 2, . . . , µ bk | µ b 1) : conditional distribution of (µ b 2, . . . , µ b k ) given µ b1 f (µ • The general formula to compute Pr(CS) is −∞ Z ∞ b2 ≤ µ b 1, . . . , µ bk ≤ µ b1 | µ b 1 ) f (µ b 1)dµ b 1. Pr (µ b2 ≤ µ b 1, . . . , µ bk ≤ µ b 1) Pr(CS) = Pr (µ = 10A - 8 Probability of Correct Selection when Choosing the b i Based on Type II Censored Data Smallest µ from k Populations • There is a similar expression for the probability of correct selecting the smallest µi from a log-location-scale distribution. i , • For the case µ1 < µ2 = µ3 = · · · = µk , the probability of correct selection simplifies to b 1 < min {µ b 2, . . . , µ b k }) Pr(CS) = Pr (µ n o h = Pr Zb1 < min Zb2, . . . , Zbk − d b i − µi)/σ and d = (µ1 − µ2)/σ. where Zbi = (µ • The joint distribution of (Zb1, . . . , Zbk ) depends only on the definition of Φ, n, and r and can be evaluated effectively with simulation without having to specify d. = Pr Zb1 > max Zb2, . . . , Zbk − d • The joint distribution of (Zb1, . . . , Zbk ) depends only on the definition of Φ, n, and r and can be evaluated effectively with simulation without having to specify d. 10A - 10 • Simulate with given n and r. Plot Pr(CS) versus d. Repeat for different n and r combinations. b i − µi)/σ and d = (µ1 − µ2)/σ. where Zbi = (µ 10A - 9 1.0 0.9 0.8 0.7 0.6 0.5 0.0 1.0 Standardized Difference (d) 0.5 1.5 10A - 12 2.0 n=40, r=24 n=20, r=12 n=10, r=6 n=5, r=3 Probability of Correctly Selecting the Largest Weibull Population out of 2 Populations Probability of Correct Selection when Choosing the Largest Weibull Characteristic Life Based on Type II Censored Data from k = 2 Populations • Simulate with given n and r. Plot Pr(CS) versus d. Repeat for different n and r combinations. Elicitation of a Planning Value for d • For a log-location-scale distribution, a planning value d✷ for d can be obtained as a function of a planning value for σ and a specified increase on life of interest. • Suppose that there is interest in detecting a percent increase of p, 0 < p < 100, or more on life and that a planning value σ ✷ for σ is available. Then exp µ1 + σz(1−α) p = 100 exp µ2 + σz(1−α) 10A - 11 ◮ Equating 1 + p/100 to the increase on life, one gets 1+ p . 100 = exp (µ1 − µ2) . ∆ = µ1 − µ2 = log 1 + Taking logarithms ◮ Then 1 p ∆ = ✷ log 1 + . d✷ = σ✷ σ 100 Pr(Correct Selection) 1.5 n=40, r=24 n=20, r=12 n=10, r=6 n=5, r=3 Probability of Correctly Selecting the Largest Weibull Population out of 5 Populations Probability of Correct Selection when Choosing the Largest Weibull Characteristic Life Based on Type II Censored Data from k = 5 Populations 1.0 0.8 0.6 1.0 10A - 13 2.0 Probability of Correct Selection when Choosing the Largest Log-Mean based on Type I Censored Data from k Populations • With Type I censoring, test units from all populations are tested until a prespecified censoring time tc. • The expression for Pr(CS) is the same as with Type II censoring but the joint distribution of (Zb1, . . . , Zbk ) depends on the definition of Φ, n, (log(tc) − µ1)/σ, and (log(tc) − µ2)/σ. • Pr(CS) can be evaluated with simulation, but a new simulation is needed for every different specified combination of n, µ1, µ2, σ, and tc (or n, µ1/σ, µ2/σ, and log(tc)/σ). • There may be a possibility of 0 failures (if so, this will often show up in the simulation). 0.5 0.4 0.0 = k Y i=2 k Y i=2 k Y i=2 √ 10A - 15 b 1 − µi ) n(µ σ bi ≤ µ b 1) Pr (µ Φnor √ Φnor z1 + di n , b 1 − µ1)/σ and di = (µ1 − µi)/σ. n(µ ! 10A - 17 • Then • In this case, −∞ 10A - 18 √ ik−1 φnor (z1) dz1, Φnor z1 + d n • In this case, Pr(CS) is easy to compute numerically. where d = (µ1 − µ2)/σ. Pr(CS) = Z ∞ h • Although we would expect all of the µi values to be different, a conservative (i.e., lower) value of Pr(CS) is obtained by supposing that there there is a best population and that all of the k − 1 other populations have the same value of µ (i.e., µ1 > µ2 = µ3 = · · · = µk ). Special Case: Probability of Correct Selection when Choosing the Largest Log-Mean with Complete Data from k Lognormal Populations 10A - 16 • In this case there are some simpler formulas to compute Pr(CS) and the required sample size to obtain a prespecified Pr(CS). b i ∼ NOR(µi, σ 2/n) µ b i is • For complete samples from a lognormal distribution µ the mean of the logtimes to failure. Special Case: Probability of Correct Selection for Complete Samples from a Lognormal Distribution 10A - 14 0.2 Extensions Standardized Difference (d) • If the sample size n in each group is large enough, the Pr(CS) versus d curves for Type II censoring provide an approximation to the sample size needed for Type I censoring. √ b2 ≤ µ b1 . . . , µ bk ≤ µ b1 | µ b 1) = Pr (µ where z1 = = b i denote the sample mean of the log failure times from Let µ the sample from population i. b 2, . . . , µ bk) Special Case: Conditional Distribution of (µ for Underlying Lognormal Distributions and Complete Data • Estimation of Pr(CS) after data have been collected. • Bayesian approaches to allow incorporating prior information. • Non-log-location-scale distributions (e.g., gamma). • Dynamic rule to allow termination of simultaneous testing when there is confidence that Pr(CS) is at least a specified value. • Other censoring schemes, and stopping rules. For example, stop testing when at least r units fail in each group or at time tc, which ever comes first. • Evaluations when σ values differ among populations. Specification of population characteristics is much more complicated. • Results for the selection of the distribution with the smallest quantile are similar. Pr(Correct Selection) Special Case: Sample Size to Give a Specified Probability of Correct Selection when Choosing the Largest Log-Mean with Complete Data from k Lognormal Populations • The sample size required to obtain a probability of correct selection equal to 1 − α, when detection of a difference of n = c2/d2 µ1 − µ2 is important, is given by −∞ where d = (µ1 − µ2)/σ and c is the solution to the equation Z ∞ [Φnor (z1 + c)]k−1 φnor (z1) dz1. 1−α = • To identify small differences one needs large samples and to identify large differences smaller samples are sufficient. 10A - 19