Empirical likelihood confidence regions for comparison distributions

Empirical likelihood confidence regions for comparison distributions and ROC curves Gerda CLAESKENS1 , Bing-Yi JING2 , Liang PENG3 and Wang ZHOU2 Abstract Empirical likelihood confidence regions are constructed for the comparison distribution of two random samples whose distributions are to be tested for equality. A second, though related, application is to receiver operating characteristic curves, where the data are measurements of a diagnostic test from two populations. The authors investigate the smoothed empirical likelihood method for estimation in this context, and empirical likelihood based confidence intervals are obtained via Wilks’s theorem. A bootstrap approach allows for the construction of confidence bands. Empirical likelihood methods are appealing for this purpose since they do not require any pre-specified assumptions about the shape of the confidence regions. The method is illustrated via data analysis and a simulation study. Key words and phrases: Bootstrap; Comparison distribution; Confidence region; Coverage accuracy; Empirical likelihood; ROC curve; Smoothing. MSC 2000: Primary 62E20; secondary 62G15. 1 Department of Statistics, Texas A&M University, College Station, Texas 77843, U.S.A. gerda@stat.tamu.edu 2 Department of Mathematics, Hong Kong University of Science and Technology, Hong Kong majing@ust.hk, wangzhou@ust.hk 3 School of Mathematics, Georgia Institute of Technology, Georgia 30332, U.S.A. peng@math.gatech.edu 1 1. INTRODUCTION To compare two populations with distribution functions F1 and F2 , the comparison distribution (Parzen, 1998) is defined as D(t) = F1 (F2−1 (t)), for 0 ≤ t ≤ 1. Testing the null hypothesis of equality of these distributions is equivalent to testing the comparison distribution D for uniformity. The graph of D is known as the percentile-percentile (PP) plot. A graphical way of performing the test is to construct the PP-plot and check whether it lies on the 45-degree line (Holmgren, 1995). A plot strictly above or below this line indicates that one of the distributions is stochastically larger than the other. By constructing a confidence band for the comparison distribution, a formal two-sample test can be performed. We will use the method of smoothed empirical likelihood for this purpose. Kim (2000) constructs a goodness of fit test based on the corresponding comparison density function (Parzen, 1979), by testing whether this density function is equal to one. The comparison distribution is also the starting point for control median tests, where 1 − D(0.5) gives the survival probability in population F1 , beyond the median in population F2 . A confidence interval for 1−D(0.5) will be of particular interest here. For other treatments of control median tests, refer to, for example, Chakraborti & Mukerjee (1989), Li, Tiwari & Wells (1996) and references therein. A second motivation for constructing confidence regions in this setting, is the receiver operating characteristic (ROC) curve, which is an important tool to summarize the performance of a medical diagnostic test for determining whether or not a patient has a disease. In a medical test resulting in a continuous measurement T , the disease is diagnosed if T > t, for a given threshold t. Suppose the distribution function of T is F1 conditional on disease and F2 conditional on non-disease. The ROC curve is defined as the graph (1 − F2 (t), 1 − F1 (t)) for various values of the threshold t, or in other words, sensitivity versus 1 - specificity, power versus size for a test with critical region {T > t}. This enables one to summarize a test’s performance or to compare two diagnostic tests. For more information about ROC curves and their use, we refer to Swets & Pickett (1982), Pepe (1997) and Metz, Herman & Shen (1998). An alternative definition (see Pepe, 1997) is R(t) = 1−F1 {F2−1 (1−t)} for 0 ≤ t ≤ 1. Indeed, as also noted by Lloyd (1998), the ROC curve is the distribution function of F 0 = 1 − F2 (X1 ). Distribution F0 can not be estimated directly since we only observe data from F1 and F2 separately. To estimate the comparison distribution and ROC curve, there are several approaches. One way is to model F1 and F2 parametrically (see Tosteston & Begg, 1988 and Goddard & Hinberg, 1990). Hsieh & Turnbull (1996) estimate this curve empirically; denote by F 1n1 the −1 empirical distribution function of a sample of size n1 from the disease population and let F2n 2 be the empirical quantile function of a sample of size n2 from the non-disease population. The −1 (1 − t)}. empirical or nonparametric estimator for R(t) is defined as R̂n (t) = 1 − F1n1 {F2n 2 For a semi-parametric approach to estimate the curves D and R, we refer to Li, Tiwari & Wells (1999). Zou, Hall & Shapiro (1997) and Lloyd (1998) construct a smooth estimator −1 e (1 − t)} where the Feini ’s are smooth versions of of R(t) by considering R(t) = 1 − Fe1n1 {Fe2n 2 the Fini ’s, for example kernel distribution estimators. Lloyd & Yong (1999) show that the e kernel estimator R(t) for R(t) is better than the empirical estimator R̂n (t) in terms of having smaller mean squared error. In this paper we define a smooth estimator of both D and R by 2 using empirical likelihood methods. The main motivation to employ empirical likelihood (Owen, 1988, 1990) above other existing estimation methods, is the construction of confidence regions, which is the focus point of this paper. Empirical likelihood methods are very appealing for this purpose since they involve no pre-specified assumptions about the shape of the confidence regions, which is determined automatically by the sample, as pointed out by Owen (1990). Empirical likelihood regions are Bartlett correctable in most cases; refer to Hall & La Scala (1990) and DiCiccio, Hall & Romano (1991). As an interesting alternative to the symmetric confidence regions imposed by a normal approximation method, it also avoids the need to estimate the variance term which contains both the unknown curve of interest and its first derivative. Confidence intervals based on likelihood ratios also have the additional advantage of respecting the range of the parameter. A recent overview of empirical likelihood methods and applications is given by Owen (2001). We extend the method of empirical likelihood to obtain a nonparametric estimator of the comparison distribution function, which is a combination of a distribution function and a quantile function. Chen & Hall (1993) introduce the method of smoothed empirical likelihood for the construction of confidence intervals for quantiles. Likelihood ratio based confidence intervals and bands for survival and quantile functions are constructed by Li (1995), Murphy (1995), Li, Hollander, McKeague & Yang (1996), and Hollander, McKeague & Yang (1997) amongst others. Einmahl & McKeague (1999) construct confidence tubes for quantile-quantile (QQ) plots, which in essence involves a study of the function F1−1 (F2 (t)). PP-plots are to be preferred above QQ-plots when outliers might be present, and as an additional advantage they allow for scale invariant treatment comparisons (Holmgren, 1995). The structure of this paper is as follows. Section 2 defines and studies the smoothed empirical likelihood estimator for the comparison distribution and ROC curve. In Section 3 the empirical likelihood based confidence interval is presented, Section 4 applies a bootstrap method to construct confidence bands. A simulation study is conducted in Section 5.1 to compare the empirical likelihood method and the normal approximation method. In Section 5.2 we apply the smoothed empirical likelihood method to a dataset on the detection of pancreatic cancer. All proofs are deferred until the appendix. 2. ESTIMATION OF THE COMPARISON DISTRIBUTION AND ROC CURVE This section is devoted to point estimation. The smoothed empirical likelihood function and its score vector for this two sample setting are introduced and defined in Section 2.1. Estimation of the comparison distribution and values of the ROC curve is the topic of Section 2.2. 2.1 Notation and definitions Suppose that X11 , . . . , X1n1 and X21 , . . . , X2n2 are two random samples from populations with (unknown) distribution functions F1 and F2 , respectively. We wish to construct a point or interval estimator for the ROC value R(t) = 1 − F1 (F2−1 (1 − t)), where 0 ≤ t ≤ 1. Note that the comparison distribution function is related to the ROC function in the following way: D(t) = 1 − R(1 − t), 0 ≤ t ≤ 1. Therefore, the results below will be presented only for the ROC function R. 3 In order to construct the empirical likelihood estimator of R(t), for j = 1, 2, let G j be a smoothed version of the distribution function represented by the indicator function I{t > 0}. More specifically, for KRj a compactly supported r-th order kernel (r ≥ 2, see condition C3 below), define Gj (t) = u≤t Kj (u)du. If we take Kj a density function and r = 2, Gj is a well defined distribution function. For future use, define Ghj (t) = Gj (t/hj ) where hj = hj (nj ) > 0 is a bandwidth sequence, converging to zero as nj grows to infinity. Let p = (p1 , · · · , pn1 ) and q = (q1 , · · · , qn2 ) be two probability vectors, that is, consisting of nonnegative numbers adding to one. Define the estimators F̂h1 ,p (η) = n1 X i=1 pi Gh1 (η − X1i ) and F̂h2 ,q (η) = n2 X j=1 qj Gh2 (η − X2j ). For ROC curve estimation, where we denote θ = R(t), there clearly exists a value η such that η = F1−1 (1 − θ) = F2−1 (1 − t). (1) For this setting, we define the smoothed empirical likelihood for θ, as L(θ) = sup (p,q,η) Ãn 1 Y i=1  ! n 2 Y p i  qj  , j=1 subject to the following constraints F̂h1 ,p (η) = 1 − θ and F̂h2 ,q (η) = 1 − t. (2) 2 Note that without constraints (2), the maximum of L(θ) would be n1−n1 n−n . To better 2 appreciate this definition, consider the second constraint which is used in the non-disease population F2 only, to estimate the quantile η = F2−1 (1 − t) via empirical likelihood. For ROC curve estimation, however, we add the first constraint, which determines the estimation of the 1−θ quantile in the disease population F1 . A challenging aspect of this approach is that the parameter of interest is not this quantile, but instead R(t) = θ. The link between both populations, disease and non-disease is through the receiver operator characteristic relation (1). A similar motivation holds for the comparison distribution function D. Chen & Hall (1993) introduced smoothed empirical likelihood for the construction of confidence intervals for quantiles in one population only. The current setting requires a nontrivial extension of their results. Einmahl & McKeague (1999) define an empirical likelihood approach for the estimation of multiple quantile-quantile plots from independent samples. A quantile function Q differs from the comparison distribution function used as the basis of a PPplot in that the order of the quantile and distribution function is reversed; more specifically, Q(t) = F1−1 (F2 (t)), while D(t) = F1 (F2−1 (t)). For this reason, a different theoretical treatment is required. We now define the empirical log likelihood ratio (multiplied by minus two) as `(θ) = −2 n1 ³X ln (n1 pi ) + n2 X j=1 i=1 4 ´ ln (n2 qj ) . To make this definition more explicit, we define the weight functions w1 (η, X1i ) = Gh1 (η − X1i ) − (1 − θ), and w2 (η, X2j ) = Gh2 (η − X2j ) − (1 − t). By the method of Lagrange multipliers, also taking the probability contraints into account, P 1 P 2 that is, ni=1 pi = 1, nj=1 qj = 1 and pi , qj ≥ 0, it easily follows that 1 n1 1 = n2 pi = qj 1 , 1 + λ1 w1 (η, X1i ) 1 · , 1 + λ2 w2 (η, X2j ) · i = 1, . . . , n1 , j = 1, . . . , n2 , where (λ1 , λ2 , η) are solutions to the following three score equations nj X ∂ wj (η, Xji ) = 0 (j = 1, 2), `(θ) = ∂λj i=1 1 + λj wj (η, Xji ) (3) nj 2 X X wj0 (η, Xji ) ∂ `(θ) = = 0, λj ∂η j=1 i=1 1 + λj wj (η, Xji ) with wj0 (η, z) ∂wi (η, z) = h−1 = j Kj ∂η Ã η−z hj ! (4) for j = 1, 2. The empirical log likelihood ratio can now be rewritten as `(θ) = 2 nj 2 X X ln (1 + λj wj (η, Xji )) . j=1 i=1 To ease interpretation, in the above notation we suppressed the dependence of (λ1 , λ2 , η) on θ, which takes place via the function w1 . 2.2 Point estimation Although the focus of this paper is on the construction of confidence regions, we wish to mention the point estimator itself. The maximum smoothed empirical likelihood estimator is that value θ̂ for which the smoothed empirical likelihood function is maximized: θ̂ = arg max L(θ). θ We can obtain an estimator of θ by minimizing the empirical log likelihood ratio l(θ) given above. Throughout, we assume that the following conditions hold for j = 1, 2. (r−1) (C1) For each density function fj = Fj0 , there exists an integer r ≥ 2 such that fj in a neighborhood of η and is continuous at η. Furthermore, f1 (η)f2 (η) > 0. 5 exists (C2) As min(n1 , n2 ) → ∞, nj /(n1 + n2 ) → γj , where 0 < γj < 1. (C3) Let Kj (·) denote an rth-order (r ≥ 2) kernel satisfying Z    1, k = 0, 1 ≤ k ≤ r − 1, u Kj (u)du =  0,  c 6= 0, k = r. k Let us explain these conditions some more. Condition (C1) requires that the densities fj (j = 1, 2) be sufficiently smooth only in a neighborhood of η. The requirement that the densities are positive at η assures the correct order (n1 + n2 )−1 for the asymptotic variance of the estimator. Condition (C2) is the balancing condition between the two sample sizes, stating that their growth rate should be of equal order. Clearly, γ 1 + γ2 = 1. The kernel functions given in (C3) are of the type commonly used in nonparametric density or regression. The case r = 2 is the most common, where we can take Kj to be a symmetric probability density function. Theorem 1. With C representing either the ROC function R(·) = 1 − F1 {F2−1 (1 − ·)} or the comparison distribution D(·) = F1 {F2−1 (·)}, the following asymptotic normality result holds. Assume regularity conditions (C1)–(C3), if n1 h2r 1 → 0, the smoothed empirical likelihood estimator Ĉ(t) satisfies √ n1 (Ĉ(t) − C(t)) →D N (0, C(t){1 − C(t)} + {C 0 (t)}2 t(1 − t)γ1 /γ2 ). In the above, “→D ” denotes convergence in distribution. Notice that the smoothed empirical likelihood estimator has the same asymptotic behavior as the kernel estimator studied by Lloyd & Yong (1999). The reason is that there is no extra information involved in the empirical likelihood function. The proof of this theorem requires Taylor series expansions around θ, similar to those in the proof of Theorem 2, and is similar to asymptotic normality results obtained by Qin and Lawless (1994). Hence we do not include the proof here. The leading terms of the asymptotic variance of this estimator also coincide with the leading −1 (·)}. Note that for estimation asymptotic variance terms of the empirical estimator F1n1 {F2n 2 of the comparison distribution function, the variance features the squared comparison density function (Parzen, 1979, 1998). Under the condition that n1 h2r 1 → 0 there is no bias in the above asymptotic normality result. A bandwidth sequence tending to zero at a slower rate would introduce a bias originating from the smoothing procedures. Optimal bandwidth choice is an interesting topic, but is beyond the scope of this paper. The point estimator R(·) can be used to construct summary values such as the area under the ROC curve (Hanley & McNeil, 1982). This is an important quantity which summarizes the accuracy of a test. The area under the R(·) curve is equal to P (X1 − X2 > 0), and is related to the Wilcoxon statistic used for testing whether P (X1 − X2 > 0) = 0.5. RAn estimator of this probability can be obtained by numerical evaluation of the integral 01 R̂(t)dt. An 6 alternative is to estimate first the distribution of X1 − X2 via F̂j are estimators of the distribution functions Fj . R∞ −∞ F̂1 (s + t)dF̂2 (t) where the 3. WILKS’S THEOREM FOR CONFIDENCE INTERVAL CONSTRUCTION In this section we construct an empirical likelihood based confidence interval for the ROC value R(t) and for the value of the comparison distribution D(t). The same notation as introduced in Section 2 is adapted throughout. The idea behind confidence interval construction based on (empirical) likelihood ratios is to include in the interval all those values ϑ for which the (empirical) likelihood ratio test for the null hypothesis H0 : R(t) = ϑ, or H0 : D(t) = ϑ, respectively, cannot be rejected. By definition, see Section 2.1, large values of the empirical log likelihood ratio `(ϑ) provide evidence in favor of the (two-sided) alternative hypothesis. This motivates defining a smoothed empirical likelihood confidence interval I for either R(t) or D(t) by Ic = {ϑ : `(ϑ) ≤ c}. The value of the constant c > 0 determines the coverage probability of the interval I c . Its value will be determined via Wilks’s theorem, below. For the construction of a smoothed empirical likelihood ratio confidence interval, the explicit maximization of the likelihood function can be avoided. To construct the empirical likelihood function as described in Section 2.1, only equations (3)–(4) need to be solved, solutions of which may, for practical matters, be obtained in two stages. In the first stage, we fix η and solve for λ1 = λ1 (η) and λ2 = λ2 (η) from the set of equations (3). In the second stage, we obtain η̂ as a solution to equation (4) where now λj = λj (η). It is straightforward to obtain that this provides identical solutions to a direct “one-stage” solution to (3)–(4). In addition to regularity conditions (C1)–(C3), presented in Section 2.2, we assume condition (C4), which dictates the rates at which the bandwidths tend to zero, and ensures that this convergence is fast enough. 2r (C4) For i = 1, 2: ni h4r i → 0, and ni hi / ln ni → ∞ as ni → ∞. First, we state two lemmas which will facilitate obtaining the main result in Theorem 2. Lemma 1. Assume that conditions (C1) − (C4) hold. Then, with probability one, there exists a root ηe of equation (4), such that |ηe − η| < δ a.s. where δ = O(hr1 + hr2 ). Lemma 2. Assume that conditions (C1) − (C4) hold. Then, for ηe given in Lemma 1, γ1 f1 (η)λ1 (ηe) = −γ2 f2 (η)λ2 (ηe) + op (n−1/2 ) ³ ´ √ N (ηe − η) →D N 0, d−1 θ(1 − θ)t(1 − t) Ã ! √ γ2 f2 (η)2 N λ1 (ηe) →D N 0, , γ1 d ! Ã √ γ1 f1 (η)2 N λ2 (ηe) →D N 0, , γ2 d 7 where d = γ1 t(1 − t)f12 (η) + γ2 θ(1 − θ)f22 (η) and N = n1 + n2 . Both proofs, as well as the proof of the following theorem are given in the Appendix. Theorem 2. Under conditions (C1) − (C4), the empirical log likelihood ratio statistic `(θ) converges in distribution to a χ21 distributed random variable. Theorem 2 is the nonparametric version of Wilks’s theorem in the ROC and comparison distribution context. Note that the limit distribution of `(θ) does not depend on unknown quantities, and so is asymptotically pivotal. Based on Theorem 2, an effective approach to construct a (1 − α)-level confidence interval for θ is Ic1−α = {ϑ : `(ϑ) ≤ c1−α }, where c1−α is the 1 − α quantile of a χ21 distribution. This will give a confidence interval for θ with asymptotically correct coverage probability (1 − α), as stated in the next corollary. Corollary 1. Assume that conditions (C1) − (C4) hold. Then it follows that ³ ´ P θ ∈ Ic1−α = (1 − α) + o (1) . Empirical likelihood based confidence intervals not only have as an advantage that their shape is determined by the data, the method also avoids searching for a transformation resulting in less skewness of the distributions, a requirement for the normal approximation to perform well. See, Zhou, Hall & Shapiro (1997) who apply a logit transformation to 1 − F 1 (t) and 1 − F2 (t). 4. BOOTSTRAP CONFIDENCE BANDS In order to compare two diagnostic tests via their ROC curve, or to compare two distributions via their comparison distribution function, pointwise confidence intervals are not optimal. Instead of developing asymptotic properties for global curve estimation, which would be one way to construct a confidence region for {R(t) : a ≤ t ≤ b} and {D(t) : a ≤ t ≤ b}, a bootstrap algorithm provides an interesting alternative. In the context of empirical likelihood estimation for density functions, Hall & Owen (1993) explain that very large sample sizes are needed to get accurate confidence regions based on an infinite-parameter version of Wilks’s theorem. In the current context, if anything, this is not expected to be better. The main reason for the large sample sizes is the known slow convergence of normal extremes (see for example Hall, 1979). Lloyd (1998) uses percentiles of a nonparametric bootstrap to construct pointwise confidence bands for the ROC curve, this as an alternative to the pointwise normal approximation. Li, Tiwari & Wells (1996, 1999) construct a confidence band with global coverage 1-α by determining the appropriate percentile via bootstrap resampling and construct the band symmetrically around the estimated curve. Our construction provides a bootstrap confidence region 8 without losing the advantages of automated shape-determination by the empirical likelihood method. The level of the confidence bands is set by means of the bootstrap. This method is similar to one presented by Hall & Owen (1993) in the context of density estimation. The bootstrap method proceeds as follows. We nonparametrically draw a resample from ∗ ∗ ∗ ∗ each of the two samples separately to obtain {X11 , . . . , X1n } and {X21 , . . . , X2n }. Using 1 2 these bootstrap data, we form the bootstrap smoothed empirical log-likelihood ratio `∗ (θ) = 2 nj 2 X X j=1 i=1 ³ ´ ln 1 + λ∗j wj∗ (η ∗ , Xji∗ ) , where, for the ROC curve R(t) = θ, ∗ ∗ ∗ ∗ w1∗ (η ∗ , X1i ) = Gh1 (η ∗ − X1i ) − (1 − θ), and w2∗ (η ∗ , X2j ) = Gh2 (η ∗ − X2j ) − (1 − t). Analogously, for the comparison distribution D(t) = θ, ∗ ∗ ∗ ∗ w1∗ (η ∗ , X1i ) = Gh1 (η ∗ − X1i ) − θ, and w2∗ (η ∗ , X2j ) = Gh2 (η ∗ − X2j ) − t. The bootstrap values λ∗1 , λ∗2 and η ∗ are solutions to ∂ ∗ ∂ ∗ ` (θ) = 0, ` (θ) = 0 (j = 1, 2). ∂λj ∂η To construct a simultaneous confidence region for C(·), with C representing either D or R, over the interval (a, b), first we calculate the appropriate bootstrap critical value c ∗ such that P {`∗ (Ĉ(t)) ≤ c∗ for a ≤ t ≤ b|Xji , j = 1, 2; i = 1, . . . , nj } = 1 − α, where Ĉ(t) is the smoothed empirical likelihood estimator derived in Section 2.2. Hence the bootstrap confidence band C consists of those curves C(·) such that the corresponding log likelihood ratio statistic stays below c∗ over the interval (a, b), that is, C = {C(·) : `(C(t)) ≤ c∗ for a ≤ t ≤ b}. This bootstrap method is applied in Section 5.2 to the dataset on pancreatic cancer. 5. SIMULATION STUDY AND DATA ANALYSIS In this section we apply the smoothed empirical likelihood method. First, in Section 5.1 in the setting of ROC curves, we simulate data, and compare the simulated pointwise coverage probabilities of the smoothed empirical likelihood method to those obtained by normal approximations to kernel and empirical estimators, as well as by using bootstrap percentile techniques. In Section 5.2 we construct simultaneous confidence regions for the ROC curve and comparison distributions for a dataset from a study on pancreatic cancer. 5.1 Simulation study 9 In this section, we report results of a simulation study to investigate the performance of the smoothed empirical likelihood method for the construction of confidence intervals for the ROC values. In particular, we conducted a Monte Carlo study to compare the coverage accuracy of confidence intervals by the empirical likelihood method, by normal approximations and by using bootstrap percentiles. Confidence intervals by means of normal approximations based on the empirical estimator R̂n (t), see Introduction, can be obtained as follows. It can be verified that the asymptotic variance of R̂n (t) is given by ( R(t)(1 − R(t)) f1 (F2−1 (1 − t)) V (t) = + n1 f2 (F2−1 (1 − t)) )2 t(1 − t) . n2 A consistent estimator of V (t), say V̂ (t), can be obtained by replacing R(t), the Fi (·)’s and fi (·)’s in the above formula by their empirical versions (or smoothed ones when appropriate). Then, from Theorem 2 of Lloyd & Yong (1999), a two-sided confidence interval based on the normal approximation of R̂(t) is constructed via (N ) I1−α · q q ¸ = R̂n (t) − zα/2 V̂ (t), R̂n (t) + zα/2 V̂ (t) , where zα/2 is the 1 − α/2 quantile of a standard normal distribution. In order to avoid additional bandwidth choices for estimating unknown densities involved in estimation of the variance, we draw 300 bootstrap resamples of size n1 from X11 , · · · , X1n1 and 300 bootstrap resamples of size n2 from X21 , · · · , X2n2 , and then construct a bootstrap variance estimator for estimation of V (t). Similarly, we obtain a confidence interval by normal approximation, based P 1 −1 Gh1 (x − X1i ) on the kernel estimator R̃(t) = 1 − F̃1n1 {F̃2n (1 − p)}, where F̃1n1 (x) = n11 ni=1 2 1 P n2 and F̃2n2 (x) = n2 i=1 Gh2 (x − X2i ). The bootstrap percentile method based on either kernel estimator or empirical estimator was conducted by drawing 300 bootstrap resamples of size n1 from X11 , · · · , X1n1 and 300 bootstrap resamples of size n2 from X21 , · · · , X2n2 . We generated 10,000 pseudorandom samples of various realistic sample sizes from F1 = N (1, 1) and F2 = N (0, 1). We used the biquadratic kernel throughout, that is K(t) = −3/20 −3/20 15 (1 − t2 )2 I{|t| ≤ 1}, and we employed bandwidths h1 = n1 and h2 = n2 . It is easy to 16 check that conditions (C1) − (C4) are satisfied. Confidence intervals are calculated for values of t equal to 0.1, 0.3, 0.5, 0.7 and 0.9. Table 1 presents the results of this simulation study for a nominal coverage level equal to 90%, results for 95% nominal coverage are given in Table 2. From Tables 1 and 2, we make the following observations. The smoothed empirical likelihood (E.L.) method outperforms the normal approximation method and bootstrap percentile method by showing simulated coverage probabilities closer to the nominal value, for both 90% and 95% nominal coverage probabilities. The bootstrap percentile method is better than normal approximations. For almost all cases the normal approximation method based on empirical estimators has better coverage accuracy than the normal approximation method based on kernel estimators. The relatively poor performance for the case t = 0.9 may be explained by the fact that R(0.9) is very close to 99%. Therefore, neither method can be expected to work very well in this case. In summary, the smoothed empirical likelihood based confidence intervals for the ROC curve have better coverage probabilities than confidence intervals based 10 Table 1: Coverage accuracy of confidence intervals with nominal level 0.90. Results from a simulation study. E.L.: smoothed empirical likelihood; N.A.E.: normal approximation based on empirical estimators; N.A.K.: normal approximation based on kernel estimators; B.P.E.: bootstrap percentile based on empirical estimators; B.P.K.: bootstrap percentile based on kernel estimators. n1 = n2 = 50 n1 = n2 = 70 n1 = n2 = 100 n1 = 50, n2 = 70 n1 = 70, n2 = 50 Method t = 0.1 E.L. 0.903 N.A.E. 0.885 N.A.K. 0.871 B.P.E. 0.909 B.P.K. 0.893 E.L. 0.899 N.A.E. 0.885 N.A.K. 0.874 B.P.E. 0.909 B.P.K. 0.894 E.L. 0.898 N.A.E. 0.882 N.A.K. 0.872 B.P.E. 0.896 B.P.K. 0.888 E.L. 0.896 N.A.E. 0.888 N.A.K. 0.875 B.P.E. 0.914 B.P.K. 0.897 E.L. 0.896 N.A.E. 0.882 N.A.K. 0.871 B.P.E. 0.907 B.P.K. 0.896 t = 0.3 0.900 0.883 0.879 0.917 0.895 0.897 0.888 0.884 0.911 0.894 0.897 0.892 0.883 0.912 0.890 0.891 0.880 0.881 0.918 0.891 0.898 0.883 0.883 0.909 0.899 t = 0.5 0.896 0.881 0.882 0.918 0.892 0.896 0.883 0.886 0.908 0.893 0.893 0.887 0.884 0.908 0.886 0.891 0.884 0.884 0.918 0.891 0.894 0.888 0.887 0.910 0.896 t = 0.7 0.899 0.885 0.872 0.910 0.887 0.895 0.895 0.880 0.917 0.892 0.896 0.889 0.881 0.910 0.884 0.892 0.878 0.874 0.902 0.887 0.895 0.903 0.886 0.923 0.897 t = 0.9 0.826 0.798 0.748 0.748 0.759 0.826 0.807 0.766 0.790 0.783 0.835 0.824 0.800 0.833 0.821 0.805 0.755 0.715 0.707 0.739 0.841 0.848 0.794 0.818 0.807 on normal approximations and bootstrap percentile methods. This result is consistent with applications of empirical likelihood based confidence intervals for other population values. 5.2 Data analysis We apply our method in Section 4 to the data set analyzed by Wieand et al. (1989) on the accuracy of a carbohydrate antigenic determinant (CA19-9) in detecting pancreatic cancer. Concentrations of CA19-9 in sera (mL) from n2 =51 ‘control’ patients with pancreatitis and n1 =90 ‘case’ pancreatic cancer were collected. We are interested in obtaining a simultaneous confidence band for the ROC curve of CA19-9 in distinguishing between a control patient 11 Table 2: Coverage accuracy of confidence intervals with nominal level 0.95. Results from a simulation study. E.L.: smoothed empirical likelihood; N.A.E.: normal approximation based on empirical estimators; N.A.K.: normal approximation based on kernel estimators; B.P.E.: bootstrap percentile based on empirical estimators; B.P.K.: bootstrap percentile based on kernel estimators. n1 = n2 = 50 n1 = n2 = 70 n1 = n2 = 100 n1 = 50, n2 = 70 n1 = 70, n2 = 50 Method t = 0.1 E.L. 0.948 N.A.E. 0.930 N.A.K. 0.923 B.P.E.: 0.949 B.P.K.: 0.945 E.L. 0.951 N.A.E. 0.934 N.A.K. 0.927 B.P.E. 0.952 B.P.K. 0.943 E.L. 0.945 N.A.E. 0.932 N.A.K. 0.926 B.P.E. 0.949 B.P.K. 0.939 E.L. 0.947 N.A.E. 0.937 N.A.K. 0.929 B.P.E. 0.953 B.P.K. 0.945 E.L. 0.946 N.A.E 0.930 N.A.K. 0.923 B.P.E. 0.946 B.P.K. 0.943 t = 0.3 0.950 0.934 0.931 0.957 0.943 0.949 0.937 0.934 0.953 0.942 0.948 0.939 0.936 0.957 0.942 0.946 0.938 0.932 0.959 0.943 0.947 0.935 0.932 0.957 0.946 t = 0.5 0.952 0.928 0.929 0.959 0.944 0.946 0.930 0.933 0.951 0.941 0.945 0.935 0.932 0.954 0.938 0.945 0.929 0.931 0.958 0.940 0.945 0.935 0.935 0.956 0.945 t = 0.7 0.945 0.931 0.913 0.954 0.939 0.950 0.939 0.925 0.958 0.941 0.946 0.938 0.929 0.954 0.937 0.938 0.926 0.916 0.951 0.936 0.949 0.942 0.926 0.961 0.943 t = 0.9 0.880 0.824 0.775 0.801 0.817 0.887 0.836 0.794 0.842 0.842 0.892 0.856 0.831 0.877 0.871 0.867 0.779 0.744 0.757 0.794 0.896 0.873 0.825 0.873 0.864 and case patient, and for this purpose apply the bootstrap method as given in Section 4. The same kernel function as in Section 5.1 was used and a total of B = 10, 000 bootstrap resamples were taken. In Figure 1, confidence bands for the ROC curve R(p) with confidence levels 0.90 and 0.95 are plotted against p ranging from 0.20 to 0.99 by employing bandwidths h1 = 1.0, h2 = 0.2 (upper panel) and h1 = 10.0, h2 = 0.2 (lower panel), respectively. The reason of choosing large h1 is the sparseness of disease data. Also the empirical estimator and the kernel estimator are plotted in Figure 1. A graphical way of performing a test for the equality of disease and non-disease distribution functions is the following. If the 100(1 − α)% confidence band contains the bisector, a test 12 0.7 0.6 R(p) 0.8 0.9 1.0 h1=1.0, h2=0.2 0.4 0.5 Empirical estimator Kernel estimator 90% confidence band 95% confidence band 0.2 0.4 0.6 0.8 1.0 p 0.7 0.6 R(p) 0.8 0.9 1.0 h1=10.0, h2=0.2 0.4 0.5 Empirical estimator Kernel estimator 90% confidence band 95% confidence band 0.2 0.4 0.6 0.8 1.0 p Figure 1: Confidence bands, kernel estimator and empirical estimator for the ROC curve. Confidence bands with level 90% and 95%, and kernel estimator and empirical estimator are calculated for bandwidths h1 = 1 (upper panel), h1 = 10 (lower panel) and h2 =0.2. for H0 : F1 = F2 cannot be rejected at a significance level α. In our data example, it is clear that in this case there is evidence to conclude that the disease and non-disease population distributions are significantly different at the 5% level. 13 APPENDIX First we introduce a few lemmas, which will be useful in proving the main theorem. Throughout the proofs, we let c denote a generic constant, which may assume a different value at each occurrence. Denote n = min{n1 , n2 }, N = n1 + n2 , h = max{h1 , h2 }, and for j = 1, 2, wj1 (η) = nj 1 X wj (η, Xji ), nj i=1 wj2 (η) = nj 1 X w2 (η, Xji ). nj i=1 j Also, we shall use δ = δn = O(hr ) to denote a nonrandom nonnegative sequence. Proofs are given in the context of ROC estimation, it is straightforward to formulate the proofs for estimation of a comparison distribution D. A.1 Technical lemmas. Lemma 3.Assume that conditions (C1) and (C3) hold. For each fixed η, we have that, for i = 1, 2 and j = 1, 2, E{wi (η, Xj1 )} = Fi (η) − Fi (η0 ) + o(hrj ), Var{wi (η, Xj1 )} = Fi (η){1 − Fi (η)} + O(hj ). Proof. Without loss of generality, let i = j = 1. For the expected value of wi (η, Xj1 ) we calculate Z ∞ η − X11 [F1 (η − h1 u) − F1 (η)] K1 (u)du )} − F1 (η) = E{G1 ( h1 −∞ · ¸ Z ∞ 1 0 1 (r−1) 2 r r f1 (η)(−h1 u) + f1 (η)(−h1 u) + . . . + f1 = (η)(−h1 u) + o(h1 ) K1 (u)du = o(hr1 ). 2 r! −∞ And for the second moment, E ½ G21 µ η − X11 h1 = F1 (η) Z ∞ −∞ ¶¾ =2 dG21 (u) = F1 (η) + O(h1 ). Z ∞ (F1 (η) − h1 uf1 (η 0 )) G1 (u)dG1 (u) −∞ Z ∞ − −∞ h1 uf1 (η 0 )G1 (u)dG1 (u) Therefore, Var{w1 (η, X11 )} = F1 (η) − F12 (η) + O(h1 ), which ends the proof. Lemma 4. Assume that conditions (C1), (C3) and (C4) hold. Then uniformly for η ∈ {η : |η − η0 | < δ}, for j = 1, 2, −1/2 wj1 (η) = o(hrj ) + Op (δ + nj ) (5) −1/2 wj2 (η) = Fj (η0 ) (1 − Fj (η0 )) + Op (δ + hj + nj 14 ) (6) Proof. We shall only prove the case j = 1. By Taylor expansion, 1 w11 (η) = w 11 (η0 ) + w011 (η0 )(η − η0 ) + w0011 (η 0 )(η0 − η0 )2 , 2 (7) where η 0 is between η0 and η. From the central limit theorem and Lemma 3, it is obtained that −1/2 w11 (η0 ) = E{w1 (η0 , X11 )} + Op (n1 −1/2 ) = o(hr1 ) + Op (n1 ). The strong law of large numbers implies that w 011 (η0 ) = f1 (η0 )+O(hr1 ). From the boundedness of the kernel by condition (C3) and the bandwidth condition in (C4), it follows that there exists a constant c such that ¯ µ ¶¯ ¯ 0 η − X1i ¯ ¯K ¯ ≤ c, ¯ 1 ¯ and h1 h−2 1 δ ≤ c, which, in turn, implies that ¯ ¯ ¯ 00 ¯ ¯w 11 (η 0 )(η0 − η0 )2 ¯ ≤ Ã n1 η 0 − X1i δ2 X 0 K h21 n1 i=1 1 h1 ! = O(δ). (8) The result of (8), together with (7) proves (5). Similarly, we can prove (6). Lemma 5. Assume that conditions (C1), (C3) and (C4) hold. Then for j = 1, 2, ³ −1/2 wj1 (η0 + δ) = o(hrj ) + O δ + nj (ln nj )1/2 ³ ´ a.s. −1/2 wj2 (η0 + δ) = Fj (η0 ) (1 − Fj (η0 )) + O δ + hj + nj (ln nj )1/2 ´ a.s. Proof. We shall only prove the case j = 1. An application of Bernstein’s inequality (see, e.g., Serfling, 1980, p.95), and the Borel-Cantelli lemma imply that . −1/2 |w1k (η0 + δ) − Ew 1k (η0 + δ)| = O(n1 (ln n1 )1/2 ), a.s., k = 1, 2. Combining this with Lemma 3, yields ³ −1/2 w11 (η0 + δ) = o(hr1 ) + O δ + n1 (ln n1 )1/2 ³ ´ a.s. −1/2 w12 (η0 + δ) = F1 (η0 ) (1 − Fj (η0 )) + O δ + h1 + n1 which completes the proof. (ln n1 )1/2 ´ a.s., Lemma 6. Assume that conditions (C1) − (C4) hold. Then uniformly for η ∈ {η : |η − η 0 | < δ}, for j = 1, 2, ³ −1/2 λj (η) = o(hrj ) + Op δ + nj 15 ´ ; (9) furthermore, on the boundary points, we have ³ −1/2 λj (η0 ± δ) = o(hrj ) + O δ + nj (ln nj )1/2 ´ a.s. (10) Proof. We only prove the case j = 1. Rewrite (3) as n1 1 X w12 (η, X1i ) w11 (η) − λ1 (η) = 0, n1 i=1 1 + λ1 (η)w1 (η, X1i ) (11) and define Zn = max1≤i≤n |w1 (η, X1i )|. Clearly, there exists a constant c > 0 such that 0 ≤ Zn ≤ c. An application of Lemma 4 to (11), gives the following result, n1 w12 (η, X1i ) 1 X −1/2 = o(hr1 ) + Op (δ + n1 ) |λ1 (η)| · n1 i=1 1 + λ1 (η)w1 (η, X1i ) |λ1 (η)| |λ1 (η)| ≥ · [t0 (1 − t0 ) + op (1)] · w12 (η) = 1 + |λ1 (η)|Zn 1 + c|λ1 (η)| ³ −1/2 This implies that λ1 (η) = o(hr1 ) + Op δ + n1 Now let us prove (10). From Lemma 5, ³ ´ , which proves (9). −1/2 w11 (η0 + δ) = o(hr1 ) + O δ + n1 ³ (ln n1 )1/2 ´ −1/2 w12 (η0 + δ) = t0 (1 − t0 ) + O δ + h1 + n1 a.s. (ln n1 )1/2 Following similar steps as those to obtain (12), it can be shown that ³ −1/2 λ1 (η0 + δ) = o(hr1 ) + O δ + n1 ³ −1/2 Similarly, λ1 (η0 − δ) = o(hr1 ) + O δ + n1 A.2 Proof of Lemma 1. (12) (ln n1 )1/2 ´ ´ (13) ´ a.s. (14) a.s. (ln n1 )1/2 a.s. −1/2 For simplicity, write ∆δ = δ + hr1 + n1 (ln n1 )1/2 . Also denote H(θ0 , η) = n1 H1 (θ0 , η) + n2 H2 (θ0 , η), where nj 1 X Hj (θ0 , η) = ln (1 + λj (η)wj (η, Xji )) . nj i=1 By a Taylor expansion and (10), it follows that 1 H1 (θ0 , η0 + δ) = λ1 (η0 + δ) w11 (η0 + δ) − λ21 (η0 + δ) w12 (η0 + δ) + O(∆3δ ) a.s. 2 (15) From (11), it follows that 0 = w11 (η0 + δ) − λ1 (η0 + δ)w12 (η0 + δ) + λ21 (η0 + δ) 16 n1 1 X w13 (η0 + δ, X1i ) . n1 i=1 1 + λ1 (η0 + δ)w1 (η0 + δ, X1i ) Combining this with (14) and (10), yields 2 λ1 (η0 + δ) = w 11 (η0 + δ)w−1 12 (η0 + δ) + O(∆δ ) a.s. (16) Substituting (16) into (15) and using (13), for η0 ≤ η 0 ≤ η0 + δ, ³ ´ 1 2 −1 2 + O(∆3δ ) a.s. H1 (θ0 , η0 + δ) w−1 w (η ) + δf (η ) + O(δ h (η + δ) ) 11 0 1 0 0 1 2 12 ´ ´2 ³ ³ 1 −1/2 ) = + O(∆3δ ) a.s., (17) o(hr1 ) + O n1 (ln n1 )1/2 + δf1 (η0 ) + O(δ 2 h−1 1 2t0 (1 − t0 ) where we have used the following facts: w 011 (η0 ) → f1 (η0 ) a.s. and for η0 ≤ η ∗ ≤ η 0 ≤ η0 + δ, |w011 (η 0 ) − w011 (η0 )| = |w0011 (η ∗ )(η 0 − η0 )| ¯ ¶¯ µ 2δ ¯¯ 0 η ∗ − X1i ¯¯ −1 ≤ 2 E ¯K1 ¯ ≤ cδh1 h1 h1 a.s. Letting δ = 0 in (17), H1 (θ0 , η0 ) = ´´2 ³ ³ ³ ´ 1 −1/2 o(hr1 ) + O n1 (ln n1 )1/2 + O ∆30 2t0 (1 − t0 ) a.s. For δ = O(hr1 ), then under condition (C4), we have that H1 (θ0 , η0 + δ) ≥ H1 (θ0 , η0 ) a.s. Similarly, we can show that H1 (θ0 , η0 − δ) ≥ H1 (θ0 , η0 ) a.s. and that H2 (θ0 , η0 ± δ) ≥ H2 (θ0 , η0 ) a.s. Therefore, also H (θ0 , η0 ± δ) ≥ H (θ0 , η0 ) a.s. Since H (θ0 , η) is continuous in {η : |η − η0 | ≤ δ}, thus with probability one H (θ0 , η) achieves its minimum in the interior of the region {η : |η − η0 | ≤ δ}, say at ηe, such that ηe is a root of ∂H (θ0 , ηe) /∂η = 0, or equivalently ηe is a root of (4). A.3 Proof of Lemma 2. e = λ (ηe), and the components of the score vector For j = 1, 2, denote λj = λj (η), λ j j nj wj (η, Xji ) 1 X Qjn (η, λ1 , λ2 ) = , N i=1 1 + λj wj (η, Xji ) ³ nj 2 X wj0 (η, Xji ) 1 X . Q3n (η, λ1 , λ2 ) = λj N j=1 i=1 1 + λj wj (η, Xji ) ´ e ,λ e = 0, for i=1,2,3. By a Taylor expansion and Lemma 6, From Lemma 1, Qin ηe, λ 1 2 0 = Qin (η0 , 0, 0) + ³ ∂Qin ∂Qin e + ∂Qin (η , 0, 0)λ e (η0 , 0, 0) (ηe − η0 ) + (η0 , 0, 0)λ 1 0 2 ∂η ∂λ1 ∂λ2 + o(hr ) + Op (δ + n−1/2 ) ´2 . ³ Note that δ = O(hr ). Then, from the assumption (C4), we have o(hr ) + Op (δ + n−1/2 ) op (n−1/2 ). It is easy to show that, for j = 1, 2, ∂Q3n ∂Qjn (η0 , 0, 0) = (η0 , 0, 0) → γj fj (η0 ) a.s., ∂η ∂λj 17 ´2 = ∂Q1n ∂Q2n (η0 , 0, 0) → −γ1 θ0 (1 − θ0 ) a.s., (η0 , 0, 0) = −γ2 t0 (1 − t0 ) a.s. ∂λ1 ∂λ2 All other partial derivatives of components of the score vector with respect to the components of the parameter vector, evaluated at (η0 , 0, 0) are zero. Therefore,     e Q1n (η0 , 0, 0) λ 1   −1/2 −1  e Q2n (η0 , 0, 0)  ), = −S  + op (n    λ 2 0 ηe − η0 where the limiting hessian matrix   −γ1 θ0 (1 − θ0 ) 0 γ1 f1 (η0 ) 0 −γ t (1 − t ) γ2 f2 (η0 )  S=  . 2 0 0 γ1 f1 (η0 ) γ2 f2 (η0 ) 0 The inverse of S is obtained as S −1   −γ22 f22 (η0 ) γ1 γ2 f1 (η0 )f2 (η0 ) γ1 γ2 t0 (1 − t0 )f1 (η0 ) 1  2 2 γ1 γ2 θ0 (1 − θ0 )f2 (η0 )  −γ1 f1 (η0 ) =  γ1 γ2 f1 (η0 )f2 (η0 )  c0 γ1 γ2 t0 (1 − t0 )f1 (η0 ) γ1 γ2 θ0 (1 − θ0 )f2 (η0 ) γ1 γ2 t0 (1 − t0 )θ0 (1 − θ0 ) where c0 = det(S) = γ1 γ2 {γ1 f12 (η0 )t0 (1 − t0 ) + γ2 f22 (η0 )θ0 (1 − θ0 )}. Hence, −1/2 ηe − η0 = −c−1 ) 0 γ1 γ2 [t0 (1 − t0 )f1 (η0 )Q1n (η0 , 0, 0) + θ0 (1 − θ0 )f2 (η0 )Q2n (η0 , 0, 0)] + op (n −1/2 −1 e λ ) 1 = c0 γ2 f2 (η0 ) [γ2 f2 (η0 )Q1n (η0 , 0, 0) − γ1 f1 (η0 )Q2n (η0 , 0, 0)] + op (n −1 −1/2 e λ ). 2 = −c0 γ1 f1 (η0 ) [γ2 f2 (η0 )Q1n (η0 , 0, 0) − γ1 f1 (η0 )Q2n (η0 , 0, 0)] + op (n Therefore, Lemma 2 follows directly from the above and the fact that √ N Ã Q1n (η0 , 0, 0) Q2n (η0 , 0, 0) ! →D N (Ã 0 0 ! Ã , γ1 θ0 (1 − θ0 ) 0 0 γ2 t0 (1 − t0 ) A.4 Proof of Theorem 2. From Lemma 2 and a Taylor expansion, we have n1 2 λ (ηe)w12 (ηe) 2 1 n2 −n2 λ2 (ηe)w21 (ηe) + λ22 (ηe)w22 (ηe) + op (1). 2 l(θ0 ) = −n1 λ1 (ηe)w11 (ηe) + Choosing η = ηe in (11) and using Lemma 2, we get w11 (ηe) = λ1 (ηe)w12 (ηe) − λ21 (ηe) n1 w13 (ηe, X1i ) 1 X n1 i=1 1 + λ1 (ηe)w1 (ηe, X1i ) = λ1 (ηe)w12 (ηe) + op (n−1/2 ). 18 !) . Similarly, w 21 (ηe) = λ2 (ηe)w22 (ηe) + op (n−1/2 ). Using w12 (ηe) = t0 (1 − t0 ) + op (1), w22 (ηe) = θ0 (1 − θ0 ) + op (1), and Lemma 2, we have l(θ0 ) = = = = = n1 λ21 (ηe)w12 (ηe) + n2 λ22 (ηe)w22 (ηe) + op (1) n1 λ21 (ηe)t0 (1 − t0 ) + n2 λ22 (ηe)θ0 (1 − θ0 ) + op (1) N λ21 (ηe)γ1 t0 (1 − t0 ) + N λ22 (ηe)γ2 θ0 (1 − θ0 ) + op (1)  Ã γ1 f1 (η0 ) N λ21 (ηe) γ1 t0 (1 − t0 ) + γ2 f2 (η0 ) ! Ã √ 2 √ c0 γ1 N λ1 (ηe) √ + op (1) γ 2 f2 (η0 ) !2  γ2 θ0 (1 − θ0 ) + (1) op →D χ21 . ACKNOWLEDGEMENTS The research of G. Claeskens was partially funded by the National Cancer Institute (CA– 74552). The authors thank the reviewers of the manuscript for their helpful comments. REFERENCES S. Chakraborti & R. Mukerjee (1989). A confidence interval for a measure associated with the comparison of a treatment with a control. South African Statistical Journal, 23, 219–230. S. Chen & P. Hall (1993). Smoothed empirical likelihood confidence intervals for quantiles. The Annals of Statistics, 21, 1166 – 1181. T. J. DiCiccio, P. Hall & J. P. Romano (1991). Empirical likelihood is Bartlett-correctable. The Annals of Statistics, 19, 1053–1061. J. H. J. Einmahl & I. W. McKeague (1999). Confidence tubes for multiple quantile plots via empirical likelihood. The Annals of Statistics, 27, 1348–1367. M. J. Goddard & I. Hinberg (1990). Receiver operating characteristics (ROC) curves and non normal data: an empirical study. Statistics in Medicine, 9, 325–337. P. Hall (1979). On the rate of convergence of normal extremes. Journal of Applied Probability, 16, 433–439. P. Hall & B. La Scala (1990). Methodology and algorithms of empirical likelihood. International Statistical Review, 58, 109–127. P. Hall & A. B. Owen (1993). Empirical likelihood confidence bands in density estimation. Journal of Computational and Graphical Statistics, 2, 273–289. J. A. Hanley & B. J. McNeil (1982). The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology, 143, 29–36. M. Hollander, I. W. McKeague & J. Yang (1997). Likelihood-ratio based confidence bands for survival function. Journal of the American Statistical Association, 92, 215–226. 19 E. C. Holmgren (1995). The P − P plot as a method of comparing treatment effects. Journal of the American Statistical Association, 90, 360–365. F. Hsieh & B. W. Turnbull (1996). Non-parametric and semi-parametric estimation of the receiver operating characteristic curve. The Annals of Statistics, 24, 25–40. J. T. Kim (2000). An order selection criterion for testing goodness of fit. Journal of the American Statistical Association, 95, 829–835. G. Li (1995). On nonparametric likelihood ratio estimation of survival probabilities for censored data. Statistics and Probability Letters, 25, 95–104. G. Li, M. Hollander, I. W. McKeague & J. Yang (1996). Nonparametric likelihood ratio confidence bands for quantile functions from incomplete survival data. The Annals of Statistics, 24, 628–640. G. Li, R. C. Tiwari & M. T. Wells (1996). Quantile comparison functions in two-sample problems: with applications to comparisons of diagnostic markers. Journal of the American Statistical Association, 91, 689–698. G. Li, R. C. Tiwari & M. T. Wells (1999). Semiparametric inference for a quantile comparison function with applications to receiver operating characteristic curves. Biometrika, 86, 487–502. C. J. Lloyd (1998). The use of smoothed ROC curves to summarize and compare diagnostic systems. Journal of the American Statistical Association, 93, 1356–1364. C. J. Lloyd & Z. Yong (1999). Kernel estimators for the ROC curve are better than empirical. Statistics and Probability Letters, 44, 221–228. C. E. Metz, B. A. Herman & J. -H. Shen (1998). Maximum likelihood estimation of receiver operating characteristic (ROC) curves from continuously-distributed data Statistics in Medicine, 17, 1033–1053. S. A. Murphy (1995). Likelihood ratio-based confidence intervals in survival analysis. Journal of the American Statistical Association, 90, 1399–1405. A. B. Owen (1988). Empirical likelihood ratio confidence intervals for a single functional. Biometrika, 75, 237–249. A. B. Owen (1990). Empirical likelihood ratio confidence regions. The Annals of Statistics, 18, 90–120. A. B. Owen (2001). Empirical likelihood. Boca Raton: Chapman & Hall/CRC E. Parzen (1979) Nonparametric statistical data modeling. Journal of the American Statistical Association, 74, 105–131. E. Parzen (1998). Statistical methods mining, two sample data analysis, comparison distributions and quantile limit theorems. Asymptotic Methods in Probability and Statistics, B. Szyszkowicz (Ed.), 611–617. M. S. Pepe (1997). A regression modelling framework for receiver operating characteristic curves in medical diagnostic testing. Biometrika, 84, 595–608. 20 J. Qin & J. Lawless (1994). Empirical likelihood and general estimating equations. The Annals of Statistics, 22, 300–325. R. J. Serfling (1980). Approximation theorems of mathematical statistics. New York: Wiley. I. A. Swets & R. M. Pickett (1982). Evaluation of Diagnostic Systems: Methods from Signal Detection Theory. New York: Academic Press. A. N. Tosteston & C. B. Begg (1988). A general regression methodology for ROC curve estimation. Medical Decision Making, 8, 205–215. S. Wieand, M. H. Gail, B. R. James & K. L. James (1989). A family of nonparametric statistics for comparing diagnostic markers with paired and unpaired data. Biometrika, 76, 585–592. K. H. Zou, W. H. Hall & D. E. Shapiro (1997). Smooth non-parametric receiver operating characteristic (ROC) curves for continuous diagnostic tests. Statistics in Medicine, 16, 2143–2156. 21

Empirical likelihood confidence regions for comparison distributions

Related documents

Products

Support

Empirical likelihood confidence regions for comparison distributions

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib