Asymptotically best response-adaptive randomization procedures 夡 Feifang Hu

Journal of Statistical Planning and Inference 136 (2006) 1911 – 1922 www.elsevier.com/locate/jspi Asymptotically best response-adaptive randomization procedures夡 Feifang Hua, b,∗ , William F. Rosenbergerc, d, e , Li-Xin Zhange, f a Department of Statistics, University of Virginia, Halsey Hall, Charlottesville, VA 22904-4135, USA b Division of Biostatistics and Epidemiology, Department of Health Evaluation Services, University of Virginia, School of Medicine, USA c Department of Applied and Engineering Statistics, George Mason University, 4400 University Drive, Fairfax, VA 22030 USA d Department of Mathematics and Statistics, University of Maryland, Baltimore County, 1000 Hilltop Circle, Baltimore, MD 21250, USA e Department of Epidemiology and Preventive Medicine, School of Medicine, University of Maryland, USA f Department of Mathematics, Zhejiang University, Xixi Campus, Zhejiang, Hangzhou 310028, PR China Available online 31 August 2005 Abstract We derive a lower bound on the asymptotic variance of the allocation proportions from responseadaptive randomization procedures when the allocation proportions are asymptotically normal. A procedure that attains this lower bound is defined to be asymptotically best. We then compare the asymptotic variances of five procedures, for which allocation proportions converge, to the lower bound. We find that a procedure by Zelen and a procedure by Ivanova attain the lower bound and a procedure by Eisele and its extension to K > 2 treatments can attain the lower bound but are, in general, not asymptotically best. We discuss the tradeoffs among the benefits of randomization, the benefits of attaining the lower bound, and the benefits of targeting an optimal allocation. We conclude that none of these procedures possesses all of these benefits. © 2005 Elsevier B.V. All rights reserved. MSC: Primary 62G10 Keywords: Adaptive designs; Clinical trials; Doubly-adaptive biased coin design; Neyman allocation; Rao–Cramér lower bound; Urn models 夡 This research was supported by NSF grant DMS-0204232 (Hu and Rosenberger) and grants from the National Natural Science Foundation of China and the National Natural Science Foundation of Zhejiang Province (Zhang). ∗ Corresponding author. Department of Statistics, University of Virginia, Halsey Hall, Charlottesville, VA 22904-4135, USA. E-mail address: fh6e@virginia.edu (F. Hu). 0378-3758/$ - see front matter © 2005 Elsevier B.V. All rights reserved. doi:10.1016/j.jspi.2005.08.011 1912 F. Hu et al. / Journal of Statistical Planning and Inference 136 (2006) 1911 – 1922 1. Introduction Response-adaptive randomization procedures for clinical trials incorporate sequentially accruing response data into future randomization probabilities (Rosenberger and Lachin, 2002). Hu and Rosenberger (2003) have recently developed a template for a careful evaluation of a response-adaptive randomization procedure for binary responses. The template considers the competing goals of desiring high power for treatment comparisons while assigning fewer patients to the inferior treatment. It requires (1) a single limiting allocation, (2) a randomization procedure, and (3) the nondegenerate asymptotic distribution of the allocation proportions to each treatment following that randomization procedure. In particular, if the observed allocation proportions are asymptotically normal, the asymptotic power is an decreasing function of the asymptotic variance of the allocation proportions. For requirement (2), Rosenberger and Lachin (2002) define three classes of responseadaptive randomization procedures: urn models, sequential estimation procedures, and treatment effect mappings. There is, as yet, little information on the asymptotic distribution of allocation proportions from treatment effect mappings, and we will therefore focus in this paper on specific randomization procedures from only the classes of urn models and sequential estimation procedures. For K = 2 treatments, many urn models have been proposed. In this paper, we focus on two that have the same limiting allocation, for ease of comparison. These are Wei and Durham’s (1978) procedure, often referred to as the “randomized playthe-winner rule”, and Ivanova’s (2003) procedure, which she refers to as the “drop-the-loser rule”. Sequential estimation procedures for two treatments can be described in full generality for two treatments by Eisele’s (1994) procedure, which he called the “doubly-adaptive biased coin design”. Each of these procedures yields asymptotically normal allocation proportions, and the asymptotic variance is known, thus satisfying requirement (3) above. These procedures are generally analyzed according to a simple homogeneous parametric structure, with pA and pB being the success probabilities for those patients assigned to treatment A and B, respectively. We can also define qA = 1 − pA and qB = 1 − pB . These procedures have analogs for K > 2 treatments, although the theoretical developments and practical consequences are far more difficult to ascertain, and are the subject of current research. For example, Eisele’s procedure was generalized by Hu and Zhang (2004) to K > 2 treatments; the extension of Wei and Durham’s procedure is the generalized Friedman’s urn model of Wei (1979); and the extension of Ivanova’s procedure is the birth and death urn with immigration by Ivanova et al. (2000). For requirement (1), various limiting allocations (pA , pB ) can be proposed, where is the proportion of patients assigned to treatment A in the limit. In all practical cases, will be a function of the unknown parameters, and hence the argument pA , pB . We define “urn allocation”, which is the limiting allocation of many urn models, as (pA , pB ) = qB /(qA + qB ). When the treatments generated by a procedure converge to a desired limiting allocation, we say they target this allocation. Various target allocations have been proposed based on formal optimality criteria (cf. Rosenberger et al., 2001). If one wishes to maximize the power of the usual test comparing two binomial probabilities, it is well-known that Neyman allocation, the ratio of the standard deviations, should be used, given by = √ √ √ pA qA /( pA qA + pB qB ). However, Neyman allocation may be inappropriate for ethical reasons because it assigns more patients to the inferior treatment if pA + pB > 1. An F. Hu et al. / Journal of Statistical Planning and Inference 136 (2006) 1911 – 1922 1913 alternative allocation that minimizes the expected number of treatment failures for fixed √ √ √ power is (pA , pB ) = pA /( pA + pB ) (Rosenberger et al., 2001), which we refer to as “RSIHR allocation” as an acronym of the authors of the original paper. Other forms of optimal allocation are given in that paper as well. An advantage of Eisele’s procedure over the urn models is that we can select any desired allocation, including urn allocation, Neyman allocation, and RSIHR allocation. In this paper we derive a lower bound on the asymptotic variance of the allocation proportions for general response-adaptive randomization procedures. The technique used is an extension of the Rao–Cramér lower bound to dependent sequences of random variables. We show that Ivanova’s procedure has minimum asymptotic variance in the class of all response-adaptive procedures with limiting urn allocation. While Eisele’s procedure does not have minimum asymptotic variance, we can tune a parameter to obtain a nearly deterministic procedure that is minimum asymptotic variance. We also discuss several tradeoffs that we encounter in practice: the desire for high power, the desire for a fully randomized procedure, and the desire to target any allocation of interest. In Section 2, we present the theorem for the special case of binary response and K = 2 treatments, and then demonstrate the results for the response-adaptive randomization procedures we have discussed so far. In Section 3, we generalize the results to any response-adaptive randomization procedure for general K treatments under mild regularity conditions. In Section 4, we draw conclusions. Proofs are relegated to an appendix. 2. Results for two treatments In this section, we present the criterion for an asymptotically best response-adaptive randomization procedure for K = 2 treatments. We first describe the response-adaptive randomization procedures we will be examining and also give some important asymptotic properties. Let NA (n) be a random variable indicating the number of patients assigned to treatment A in n patients (NB (n) = n − NA (n) is then determined). Wei and Durham’s procedure. Wei and Durham’s (1978) famous randomized play-thewinner rule uses an urn model to allocate treatments. The urn starts with a fixed number of type A balls and type B balls in an urn. To randomize a patient, a ball is drawn, the corresponding treatment assigned, and the ball replaced. An additional ball of the same type is added if the patient’s response is success and an additional ball of the opposite type is added if the patient’s response is a failure. The urn can only target one value, namely (pA , pB ) = qB /(qA + qB ). Smythe and Rosenberger (1995) showed that if pA + pB < 23 , n1/2 (NA (n)/n − qB /(qA + qB )) → N(0, v), in law, where v= qA qB (5 − 2(qA + qB )) (2(qA + qB ) − 1)(qA + qB )2 . The limit is presumably nonnormal for pA + pB > 23 . Ivanova’s procedure. Ivanova (2003) considered an urn containing balls of three types, type A, type B, and type 0. A ball is drawn at random. If it is type A or type B, the corre- 1914 F. Hu et al. / Journal of Statistical Planning and Inference 136 (2006) 1911 – 1922 sponding treatment is assigned and the patient’s response is observed. If it is a success, the ball is replaced and the urn remains unchanged. If it is a failure, the ball is not replaced. If a type 0 ball is drawn, no subject is treated, and the ball is returned to the urn together with one ball of type A and one ball of type B. Ivanova (2003) shows that the limiting allocation of the procedure is also (pA , pB ) = qB /(qA + qB ), and also that n1/2 (NA (n)/n − qB /(qA + qB )) → N(0, v), in law, where v = qA qB (pA + pB )/(qA + qB )3 . Eisele’s procedure. Eisele’s (1994) procedure is designed to target any desired allocation (pA , pB ). He defines a function g(x, ) that represents the closeness of the current allocation proportion to the estimated target allocation, when replaced by the current values of maximum likelihood estimators of pA and pB . At the jth allocation, the procedure allocates patient j to treatment A with probability g(NA (j − 1)/(j − 1), (p̂A , p̂B )). Hu and Zhang (2004) define the following function g, evaluated at a generic value x, having nice interpretive properties. For nonnegative integer , g(x, ) = (/x) , (/x) + (1 − )((1 − )/(1 − x)) g(0, ) = 1, g(1, ) = 0. (1) Hu and Zhang also show that n1/2 (NA (n)/n − (pA , pB )) → N(0, v), in law, where v= qA qB ((1 + 2)(pA + pB ) + 2) (1 + 2)(qA + qB )3 . Note that v is a decreasing function of . We now discuss how to find a lower bound on the asymptotic variance of a responseadaptive randomization procedure. The lower bound depends on the unknown parameters as well as a limiting allocation . Any procedure that has the same limiting allocation can then be compared to this lower bound. The general theorem establishing a lower bound on the asymptotic variance of the allocation proportions for K treatments and general parameter vector will be given in Section 3 as Theorem 1 and proved in the appendix. To illustrate the method, we now state the theorem for binary responses and K = 2 treatments. Let I (pA , pB , NA (n)) be the Fisher’s information, where the expectation is taken conditional on NA (n), for estimating pA and pB . Suppose the following regularity conditions hold: 1. pA , pB ∈ (0, 1); F. Hu et al. / Journal of Statistical Planning and Inference 136 (2006) 1911 – 1922 1915 2. NA (n)/n converges to (pA , pB ) ∈ (0, 1) almost surely for the particular responseadaptive randomization procedure; √ 3. n(NA (n)/n − (pA , pB )) converges in law to N(0, v) for the particular procedure. Then a lower bound on the asymptotic variance of NA (n)/n is given by j(pA , pB ) j(pA , pB ) −1 I (pA , pB , n(n, pA , pB )) jpA jpB j(pA , pB ) j(pA , pB ) . (2) × jpA jpB We can compute ⎡ p A qA n(p A , pB ) I −1 (pA , pB , n(pA , pB )) = ⎣ 0 ⎤ 0 ⎦. p B qB n(1 − (pA , pB )) We refer to a response-adaptive allocation procedure that attains the lower bound as asymptotically best for that particular allocation (pA , pB ). It is best in the sense that, for a fixed allocation (pA , pB ), it maximizes an asymptotic approximation to the power of chi-square test for the difference of proportions (p̂A − p̂B )2 pA qA /NA (n) + pB qB /NB (n) as described in Hu and Rosenberger (2003), where p̂A and p̂B are the maximum likelihood estimators of pA and pB . We now apply this result for the three allocation rules mentioned in Section 1: urn allocation, Neyman allocation, and RSIHR allocation. For urn allocation, given by qB /(qA + qB ), from (2), the lower bound is given by qA qB (pA + pB ) n(qA + qB )3 . Since this is the asymptotic variance of Ivanova’s procedure, the procedure attains the lower bound. Hence, Ivanova’s procedure is the asymptotically best procedure among all procedures with limiting allocation qB /(qA + qB ). It is interesting to note that the Zelen’s (1969) deterministic procedure, termed “play-thewinner rule”, which assigns the opposite treatment to the next patient if the previous patient was a treatment failure, and the same treatment if the previous patient was a treatment success, is also an asymptotically best procedure. However, unlike Ivanova’s procedure, Zelen’s procedure is not randomized. That Zelen’s procedure satisfies regularity condition 2 was known in the original paper. To our knowledge, condition 3 along with v had not previously been known. This leads to the following proposition: Proposition 1. For Zelen’s (1969) procedure, n1/2 (NA (n)/n − qB /(qA + qB )) → N(0, v), 1916 F. Hu et al. / Journal of Statistical Planning and Inference 136 (2006) 1911 – 1922 in law, where v = qA qB (pA + pB )/(qA + qB )3 . For Eisele’s design with the g function given in (1), it can be seen that v qA qB (pA + pB )/(qA + qB )3 , with equality holding when → ∞. Thus the lower bound can be attained, but the result is a mostly deterministic procedure that assigns the next patient to A with probability 1 if the current allocation is less than (p̂A , p̂B ) and to B with probability 1 if the current allocation is greater than (p̂A , p̂B ). If the current allocation is equal to (p̂A , p̂B ), then allocation is according to a biased coin. Wei and Durham’s procedure is not an asymptotically best procedure (except trivially when pA =pB ), as their asymptotic variance v is always larger than for Ivanova’s procedure, a fact first pointed out by Ivanova (2003). Complete randomization, with limiting allocation (pA , pB ) = 21 , satisfies regularity conditions 2 and 3 with v = 41 , and this is not asymptotically optimal except in the trivial case where pA =pB = 21 . The asymptotic variance of Ivanova’s procedure is often less than 41 , except for very large values of pA and pB . So Ivanova’s procedure is fully randomized, and assigns more patients to the better treatment (asymptotically) often with smaller variability than complete randomization. When the limiting allocation is Neyman allocation, the lower bound is computed from (2) as 1 pB qB (qA − pA )2 pA qA (qB − pB )2 . + √ √ 3 √ √ pA q A pB qB 4n pA qA + pB qB Eisele’s procedure does not attain the lower bound (except as → ∞). When = 0, we have a procedure explored by Melfi and Page (1998, 2000) with limiting Neyman allocation, and they discovered that this procedure was highly variable. The high variability can reduce power substantially, which is undesirable when the desired allocation is supposed to maximize power. Essentially this procedure allocates to A with probability (p̂A , p̂B ) at each stage. Finally, when the limiting allocation is RSIHR allocation, the lower bound is computed from (2) as p B qA 1 pA qB . √ √ 3 √p + √p A B 4n pA + pB The use of Eisele’s procedure targeting RSIHR with = 2 was strongly recommended in Hu and Rosenberger (2003), although we see that this is not a best asymptotic procedure. We discuss this further in Section 4. 3. Main results for K treatments We now establish a data structure which allows for simplification of the proof of the main results for K treatments with general outcomes. The main results involve the asymptotic F. Hu et al. / Journal of Statistical Planning and Inference 136 (2006) 1911 – 1922 1917 properties of the maximum likelihood estimators from an exponential family and then the Rao–Cramér lower bound based on the Fisher information matrix. For the generalized Friedman’s urn model with K treatments, Rosenberger et al. (1997) proved these results, but the required conditions are not satisfied for many other response-adaptive randomization procedures. Let Tij , i =1, . . . , n, j =1, . . . , K be the n treatment assignments, where Tij =1 if the ith patient is assigned to treatment j, 0 otherwise. Let N(n)=(N1 (n), . . . , NK (n)) be the sample sizes on each treatment. Now define T1 = (T11 , . . . , T1K ), . . . , Tn = (Tn1 , . . . , TnK ). Note that each vector is a vector of K − 1 0s and one 1. For the responses, let X1 = (X11 , X12 , . . . , X1K ), . . . , Xn = (Xn1 , Xn2 , . . . , XnK ), where Xi represents the sequence of responses that would be observed if each treatment were assigned to the ith patient independently. However, only one element of Xi will be observable. We assume that X1 , . . . , Xn are independent and identically distributed, with X1j ∼ fj (·, j ), j = 1, . . . , K, where j ∈ j . We thus assume that Xm is independent of X1 , . . . , Xm−1 , T1 , . . . , Tm , but that Tm depends on X1 , . . . , Xm−1 , T1 , . . . , Tm−1 , m = 1, . . . , n. The value of this data structure for theoretical purposes is obscured by the complexity of the notation. We therefore give a simple example to illustrate the notation before moving to our main result. Example. If K =2 and we use Wei and Durham’s procedure, then f1 (·, 1 ) is Bernoulli(pA ) and f2 (·, 2 ) is Bernoulli(pB ). Also Tm = (1, 0) if treatment A was assigned to the mth patient, and Tm = (0, 1) if treatment B was assigned. Then Xm = (Xm1 , Xm2 ), where Xm1 ∼ f1 and Xm2 ∼ f2 . Clearly Xm is independent of X1 , . . . , Xm−1 and also of T1 , . . . , Tm , but we only observe the element of Xm corresponding to the element of Tm that is 1. However, Tm depends on all previous treatment assignments and responses according to the urn model. We now state the regularity conditions for the main result: 1. The parameter space j is an open subset of Rd , d 1, for j = 1, . . . , K. 2. The distributions f1 (·, 1 ), . . . , fK (·, K ) follow an exponential family. 3. For limiting allocation () = (1 (), . . . , K ()) ∈ (0, 1)K , Nj (n) → j () n almost surely for j = 1, . . . , K. 4. For positive definite matrix V(), N(n) − () → N(0, V()) (n) n in law. We require the following lemma, which gives the asymptotic distribution of the maximum likelihood estimator ˆ of : 1918 F. Hu et al. / Journal of Statistical Planning and Inference 136 (2006) 1911 – 1922 Lemma 1. Under regularity conditions 1–3, we have √ n(ˆ − ) → N(0, I−1 ()), where I() = diag{1 ()I1 (1 ), . . . , K ()IK (K )} and Ij (j ) is the Fisher’s information for a single observation on treatment j = 1, . . . , K. Remark. We can generalize to nonexponential families by imposing further regularity conditions, such as those given in Rosenberger et al. (1997). However, they replace condition 3 essentially with a requirement that E(Tij |Fi−1 ) → j (), j = 1, . . . , K almost surely, where Fn is the sigma algebra generated by the first n assignments and responses. This condition is not satisfied for Ivanova’s procedure and Zelen’s procedure, for example. However, our condition 3 is satisfied for all the procedures we examine. Now we state the main result: Theorem 1. Under regularity conditions 1–4, there exists a 0 ⊂ = 1 ⊗ · · · ⊗ K with Lebesgue measure 0 such that for every ∈ − 0 , V() j() −1 j() I () . j j We can now rigorously define an asymptotically best response-adaptive procedure as one in which V() attains the lower bound B() = j() −1 j() I () j j for a particular target allocation (). We now show that an extension of Eisele’s procedure for K > 2 treatments (Hu and Zhang, 2004) is not an asymptotically best procedure, in general, but can attain the lower bound. Hu and Zhang’s procedure. In the generalization of Eisele’s procedure to K > 2 treatments, g is a function mapping [0, 1]K ⊗ [0, 1]K to [0, 1]K . We allocate to treatment j with probability gj (N(n)/n, ()), where g(x, y) = (g1 (x, y), . . . , gK (x, y)) and yj (yj /xj ) ∧ c gj (x, y) = K j =1 {yj (yj /xj ) ∧ c} for some c > 1, where a ∧ b is the minimum of a and b. The constant c is introduced for technical reasons, and if c is chosen large it will have little influence on the allocation. F. Hu et al. / Journal of Statistical Planning and Inference 136 (2006) 1911 – 1922 1919 Regularity conditions 1–4 hold with V() = 1 2(1 + ) 1 () + B(), 1 + 2 1 + 2 where 1 () = diag{()} − ()() (see Hu and Zhang, 2004). Note that, for any (), the procedure is asymptotically best when → ∞, as in Eisele’s procedure for K = 2 treatments. The form of the asymptotic variance is particularly interesting because it demonstrates the relationship between the asymptotic variance and the lower bound for all choices of target allocations (). 4. Conclusions We have established a lower bound on the asymptotic variability of response-adaptive randomization procedures which provides a baseline for comparison of existing procedures and further guidance in developing new procedures. As such, this paper represents an evolving body of knowledge about these complex procedures; however, it also presents numerous dilemmas regarding the tradeoffs among randomization, variability, and optimality. On the surface, Ivanova’s procedure would seem to give us everything we want in a response-adaptive randomization procedure: it is fully randomized and it attains the lower bound. However, it can only target qB /(qA + qB ), which is not optimal in any formal sense, and previously reported simulations have shown that it can be slow to converge for large values of pA and pB . It also becomes more deterministic for small values of pA and pB (see Hu and Rosenberger, 2003). Eisele’s procedure and Hu and Zhang’s extension solve some of these deficiencies, in that they can target any desired allocation, and can approach the lower bound for large values of . However, the procedure becomes more deterministic as becomes larger, and hence careful tuning of must be done in order to counter the tradeoff. This leads to a challenge for researchers in the area: can a procedure be found that preserves randomization, attains the lower bound, and targets any allocation? We also note that Eisele’s and Hu and Zhang’s procedures are the only procedures mentioned in this paper that can be used for more general response types. Acknowledgements The authors thank an anonymous referee for helpful comments. This paper was revised while the authors were visiting the Department of Statistics and Applied Probability, National University of Singapore. They thank the Department for its hospitality. Appendix Proof of Proposition 1. Zelen’s rule induces a Markov chain {Tn , n 1} with states {0, 1} and transition matrix pA qA P= . qB p B 1920 F. Hu et al. / Journal of Statistical Planning and Inference 136 (2006) 1911 – 1922 The stationary probabilities are =(, 1−), where =qB /(qA +qB ). The n-step transition matrix can then be computed as ⎡ q + q (1 − q − q )n q − q (1 − q − q )n ⎤ B n P ⎢ (n) = ((pij )) = ⎣ qB A A B A A A B ⎥ qA + q B . qA + qB (1 − qA − qB )n ⎦ qA + q B qA + qB − qB (1 − qA − qB )n qA + q B By the central limit theorem for Markov chains (cf. Theorem 17.0.1 of Meyn and Tweedie, 1993), it follows that n √ 1 NA L (Tm − E (T1 )) → N(0, v), − = √ n n n m=1 where v = E (T1 − E (T1 ))2 + 2 ∞ E {(T1 − E (T1 ))(Tm+1 − E (T1 ))}. m=1 We compute E (T1 − E (T1 ))2 = 1 2 = q A qB (qA + qB )2 and E {(T1 − E (T1 ))(Tm+1 − E (T1 ))} = P (T1 = 1, Tm+1 = 1) − (P (T1 = 1))2 (m) = p11 P (T1 = 1) − (P (T1 = 1))2 qA qB (1 − qA − qB )m = . (qA + qB )2 It follows that v= qA qB 1+2 (qA + qB ) 2 ∞ (1 − qA − qB )m = m=1 qA qB (pA + pB ) (qA + qB )3 . Proof of Lemma 1. Let L() = K n (fj (Xi,j , j ))Ti,j = i=1 j =1 K Lj (j ) j =1 be the likelihood function based on the observed sample. It is sufficient to show that 1 j log Lj (j ) + op (n−1/2 ) jj Nj (n) n j log fj (Xi,j , j ) 1 = − Ti,j + op (n−1/2 ), nj (j ) jj ( j − j )Ij (j ) = − i=1 (3) F. Hu et al. / Journal of Statistical Planning and Inference 136 (2006) 1911 – 1922 1921 j = 1, . . . , K, as n Ti,j i=1 j log fj (Xi,j , j ) = Op (n1/2 ). jj Let Yi,j = −Ti,j j log fj (Xi,j , j ) jj and An = (X1 , . . . , Xn , T1 , . . . , Tn+1 ). Then {Yi = (Yi,1 , . . . , Yi,K ), Ai ; i = 1, 2, . . .} is a martingale sequence with 1 1 E(Yi Yi |Ai−1 ] = diag(Ti,1 I1 (1 ), . . . , Ti,K IK (K )) n n i=1 i=1 NK (n) N1 (n) p IK (K ) → I(). I1 (1 ), . . . , = diag n n n n Conditions of the martingale central limit theorem (e.g., Billingsley, 1961) hold by regularity conditions 2 and 3, and it follows that 1 L Yi → N(0, I()). √ n n i=1 It remains to show (3). For fixed j = 1, 2, . . . , K, define i (j ) = min{k : Nj (k) = i} = min{k > i−1 (j ) : Tk,j = 1}, where min{∅} = +∞. Let {i,j } be an independent copy of i,j =X (j ),j I {i (j ) < ∞}+i,j I {i (j )= {Xi,j }, which is also independent of {Ti }. Define X i ∞}, i = 1, 2, . . . . Then {Xi,j ; i = 1, 2, . . .} is a sequence of i.i.d. random variables, with the same distribution as X1,j . Let j (j ) = L j (j ; Nj (n)) = L Nj (n) i,j , j ) fj (X i=1 j ). Then on the event {Nj (n) → ∞}, and let Nj (n),j maximize L( j (j ; Nj (n)) and j = Nj (n),j . Lj (j ) = L (4) i,j : i = 1, 2, . . . , n} are independent and identically distributed. Under Notice that {X regularity condition 2, we have j (j ; Nj (n)) 1 j log L ( j − j )Ij (j ) = − + op (n−1/2 ). n jj By (4) and regularity condition 3, (3) follows. 1922 F. Hu et al. / Journal of Statistical Planning and Inference 136 (2006) 1911 – 1922 i,j : i=1, 2, . . . , m} are Proof of Theorem 1. From the proof of Lemma 1, we know that {X independent and identically distributed. Therefore the Rao–Cramer lower bound of () is j() −1 j() I () . j j Because N(n)/n is an asymptotically unbiased estimator of () and satisfies regularity conditions 2 and 4, by Theorem 4.16 of Shao (1999, p. 249) the theorem follows directly. References Billingsley, P., 1961. Statistical Inference for Markov Chains. University of Chicago Press, Chicago. Eisele, J.R., 1994. The doubly adaptive biased coin design for sequential clinical trials. J. Statist. Plann. Inference 38, 249–261. Hu, F., Rosenberger, W.F., 2003. Optimality, variability, power: evaluating response-adaptive randomization procedures for treatment comparisons. J. Amer. Statist. Assoc. 98, 671–678. Hu, F., Zhang, L.-X., 2004. Asymptotic properties of doubly adaptive biased coin designs for multi-treatment clinical trials. Ann. Statist. 32, 268–301. Ivanova, A.V., 2003. A play-the-winner type urn model with reduced variability. Metrika 58, 1–13. Ivanova, A., Rosenberger, W.F., Durham, S.D., Flournoy, N., 2000. A birth and death urn for randomized clinical trials: asymptotic methods. Sankhya B 62, 104–118. Melfi, V., Page, C., 1998. Variability in adaptive designs for estimation of success probabilities. In: Flournoy, N., Rosenberger, W.F., Wong, W.K. (Eds.), New Developments and Applications in Experimental Design. Institute of Mathematical Statistics, Hayward, pp. 106–114. Melfi, V.F., Page, C., 2000. Estimation after adaptive allocation. J. Statist. Plann. Inference 87, 353–363. Meyn, S.P., Tweedie, R.L., 1993. Markov Chains and Stochastic Stability. Springer, New York. Rosenberger, W.F., Lachin, J.M., 2002. Randomization in Clinical Trials: Theory and Practice. Wiley, New York. Rosenberger, W.F., Flournoy, N., Durham, S.D., 1997. Asymptotic normality of maximum likelihood estimators from multiparameter response-driven designs. J. Statist. Plann. Inference 60, 69–76. Rosenberger, W.F., Stallard, N., Ivanova, A., Harper, C.N., Ricks, M.L., 2001. Optimal adaptive designs for binary response trials. Biometrics 57, 909–913. Shao, J., 1999. Mathematical Statistics. Springer, New York. Smythe, R.T., Rosenberger, W.F., 1995. Play-the-winner, generalized Polya urns, and Markov branching processes. In: Flournoy, N., Rosenberger, W.F. (Eds.), Adaptive Designs. Institute of Mathematical Statistics, Hayward, pp. 13–22. Wei, L.J., 1979. The generalized Polya’s urn design for sequential medical trials. Ann. Statist. 7, 291–296. Wei, L.J., Durham, S.D., 1978. The randomized play-the-winner rule in medical trials. J. Amer. Statist. Assoc. 73, 840–843. Zelen, M., 1969. Play-the-winner rule and the controlled clinical trial. J. Amer. Statist. Assoc. 64, 131–146.

Asymptotically best response-adaptive randomization procedures 夡 Feifang Hu

Related documents

Products

Support

Asymptotically best response-adaptive randomization procedures 夡 Feifang Hu

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib