a l r n u o

J i on Electr o u a rn l o f P c r ob abil ity Vol. 12 (2007), Paper no. 52, pages 1402–1417. Journal URL http://www.math.washington.edu/~ejpecp/ Edgeworth expansions for a sample sum from a finite set of independent random variables∗ Zhishui Hu Department of Statistics and Finance, University of Science and Technology of China, Hefei, 230026, China E-mail: huzs@ustc.edu.cn John Robinson School of Mathematics and Statistics, The University of Sydney, NSW 2006, Australia, E-mail: johnr@maths.usyd.edu.au Qiying Wang School of Mathematics and Statistics, The University of Sydney, NSW 2006, Australia, E-mail: qiying@maths.usyd.edu.au URL: http://www.maths.usyd.edu.au/u/qiying Abstract Let {X1 , · · · , XN } be a set of N independent random variables, and let Sn be a sum of n random variables chosen without replacement from the set {X1 , · · · , XN } with equal probabilities. In this paper we give a one-term Edgeworth expansion of the remainder term for the normal approximation of Sn under mild conditions. Key words: Edgeworth expansion, finite population, sampling without replacement. AMS 2000 Subject Classification: Primary 60F05, 60F15; Secondary: 62E20. Submitted to EJP on October 16, 2006, final version accepted September 24, 2007. ∗ This research is supported in part by an Australian Research Council (ARC) discovery project. 1402 1 Introduction and main results Let {X1 , · · · , XN } be a set of independent random variables, µk = EXk , 1 ≤ k ≤ N . Let R = (R1 , · · · , RN ) be a random vector independent of X1 , · · · , XN , such that P (R Pn= r) = 1/N ! for any permutation r = (r1 , · · · , rN ) of the numbers 1, · · · , N , and put Sn = j=1 XRj , 1 ≤ n ≤ N , that is for a sum of n random variables chosen without replacement from the set {X1 , · · · , XN } with equal probabilities. In the situation that Xk = µk , 1 ≤ k ≤ N , are (nonrandom) real numbers, the sample sum Sn has been studied by a number of authors. The asymptotic normality was established by Erdös and Rényi (1959) under quite general conditions. The rate in the Erdös and Rényi central limit theorem was studied by Bikelis (1969) and later Höglund (1978). An Edgeworth expansion was obtained by Robinson (1978), Bickel and van Zwet (1978), Schneller (1989), Babu and Bai (1996) and later Bloznelis (2000a, b). Extensions to U -statistics and, more generally, symmetric statistics can be found in Nandi and Sen (1963), Zhao and Chen (1987, 1990), Kokic and Weber (1990), Bloznelis and Götze (2000, 2001) and Bloznelis (2003). In contrast to rich investigations for the case Xk = µk , 1 ≤ k ≤ N , are (nonrandom) real numbers, there are only a few results concerned with the asymptotics of general Sn discussed in √ this paper. von Bahr (1972) showed that the distribution of Sn / V arSn may be approximated by a normal distribution under certain mild conditions. The rate of the normal approximation has currently been established by Zhao, Wu and Wang (2004), in which the paper improved essentially earlier work by von Bahr (1972). Along the lines of Zhao, √ Wu and Wang (2004), this paper discusses Edgeworth expansions for the distribution of Sn / V arSn . Throughout the paper, let γ12 = N 1 X EXk EXk2 , N αj = k=1 N 1 X (EXk )j , N βj = k=1 N 1 X E(Xkj ), N k=1 for j = 1, 2, 3, 4, and p = n/N, q = 1 − p, b = 1 − pα2 . Theorem 1. Suppose that α1 = 0 and β2 = 1. Then, for all 1 ≤ n < N , ¯ ¯ √ ¯ ¯ sup ¯P (Sn / nb ≤ x) − Gn (x)¯ x ³ ´ ª © √ ≤ C ∆1n + (nq)−1 + 3 nq log (nq) exp − nq δN , where C is an absolute constant, Gn (x) = Φ(x) − β3 − 3pγ12 + 2p2 α3 ′′′ Φ (x), √ 6 nb3/2 with Φ(x) being a standard normal distribution, −1 ∆1n = (nb) N (nb2 )−1 X E(Xk − pEXk )4 , α4 + N k=1 1403 (1) and δN N ¯1 X ¯ ¯ itXk ¯ = 1− sup Ee ¯ ¯, √ δ0 b/(9L0 )≤|t|≤16 nb N k=1 where L0 = 1 N P k E|Xk |3 and δ0 is so small that 192δ02 + 24δ0 ≤ 1 − cos(1/16). Property (1) improves essentially a result of Mirakhmedov (1983). The related result in Theorem 1 of Mirakhmedov (1983) depends on max1≤k≤N EXk4 . Note that it is frequently the case that P 4 4 N −1 N k=1 EXk is bounded, but max1≤k≤N EXk tends to ∞. Also note that Corollary 1 of Mirakhmedov (1983) requires limt→∞ |EeitXk | ≤ ǫ < 1. This condition is quite restrictive since it takes away the most interesting case that the Xk are all degenerate. When Xk = µk , 1 ≤ P k ≤ N , are (nonrandom) real numbers, it is readily seen that α2 = 1, b = q, N 1 3 α3 = β3 = γ12 = N k=1 µk , −1 ∆1N + (nq) −1 ≤ 3 (nq) N 1 X 4 µk . N k=1 In this case, the property (1) reduces to √ sup |P (Sn / nq ≤ x) − G1n (x)| x −1 ≤ C (nq) √ where G1n = Φ(x)+ 6p−q nq b). 1 N PN N ª © 1 X 4 √ µk + 3 nq log (nq) exp − nq δN , N 3 k=1 µk k=1 Φ′′′ (x), which gives one of main results in Bloznelis (2000a, We next give a result complementary to Theorem 1. The result is better than Theorem 1 under certain conditions such as some of the Xk ’s are non-degenerate random variables and q is close to 0. Theorem 2. Suppose that α1 = 0 and β2 = 1. Then, for all 1 ≤ n ≤ N , ¯ ¯ √ ª © √ ¯ ¯ sup ¯P (Sn / nb ≤ x) − Gn (x)¯ ≤ C ∆2n + 3 n log n exp − n δ1N , x (2) where C is an absolute constant, Gn (x), L0 and δ0 are defined as in Theorem 1, ∆2n = (nb2 )−1 β4 and δ1N N 1 X ¯¯ itXk ¯¯ Ee . = 1− sup √ δ0 b/(9L0 )≤|t|≤16 nb N k=1 In the next section, we prove the main results. Throughout the paper we shall use C, C1 , C2 , ... to denote absolute constants whose value may differ at each occurrence. Also, I(A) P denotes the indicator function of a set A, ♯(A) denotes the number of elements in the set A, k denotes √ QN Q PN −1. k=1 . The symbol i will be used exclusively for k denotes k=1 , and 1404 2 Proofs of Theorems √ Let µk = EXk and Ψ(t) = E exp{itSn / nb}. Recall α1 = 0 and β2 = 1. As in (4) of Zhao, Wu and Wang (2004), Z Y −1 Eρk (ψ, t)dψ, (3) Ψ(t) = [Bn (p)] √ |ψ|≤π nq k where Bn (p) = √ n pn q N −n , X ∗ = X − pµ and 2πnqGn (p), Gn (p) = 2πCN k k k ½ ¾ ½ ¾ itXk∗ ipψ iqψ ipµk t ρk (ψ, t) = q exp − √ − √ + p exp √ + √ . nq nq nb nb √ The main idea of the Q proofs is outlined as follows. We first provide the expansions and the basic properties for k Eρk (ψ, t) in Lemmas 1–4. In Lemma 5, the idea in von Bahr (1972) is extended to give an expansion of Ψ(t) for the case n/N ≥ 1/2. The proofs of Theorems 1 and 2 are finally completed by virtue of the classical Esseen’s smoothing lemma. In the proofs of Lemmas 1–4, we assume that ∆1n < 1/16 and nq > 256, where ∆1n is defined as in Theorem 1. Throughout this section, we also define, h(ψ, t) = Y ³ i3 f (ψ, t) ´ −(ψ2 +t2 )/2 √ and g(ψ, t) = 1 + e , 6 N Eρk (ψ, t) k where f (ψ, t) = A0 ψ 3 + 3A1 ψt2 + A2 t3 , with q−p A0 = √ , pq A1 = √ (1 − 2pα2 ) pq , pb A2 = β3 − 3pγ12 + 2p2 α3 . p1/2 b3/2 (4) −1/4 Lemma 1. For |ψ| ≤ (nq)1/4 /4 and |t| ≤ ∆1n /4, we have |h(ψ, t) − g(ψ, t)| ≤ C[∆1n + (nq)−1 ](s4 + s8 ) exp{−s2 /3}, (5) where s2 = ψ 2 + t2 . Proof. Define a sequence of independent random vectors (Uk , Vk ), 1 ≤ k ≤ N , by the conditional distribution given Xk∗ as follows: p √ P (Uk = −p/ pq, Vk = −pµk / pb | Xk∗ ) = q, p √ P (Uk = q/ pq, Vk = Xk∗ / pb | Xk∗ ) = p. Let Wk = ψUk + tVk . As in Lemma 1 of Zhao, Wu and Wang (2004), tedious but simple calculations show that X X EWk3 = N f (ψ, t), EWk2 = N (ψ 2 + t2 ), EWk = 0, k k X k EWk4 ≤ 8 X E(ψ 4 Uk4 +t 4 Vk4 ) k 1405 2 ≤ 8N (ψ 4 /(nq) + t4 ∆1n ). (6) Furthermore, if we let Bn2 = L23N = P ³X k k EWk2 , LjN = E|Wk |3 ´2 P k ± E|Wk |j Bnj , j = 3, 4, then /Bn6 ≤ L4N ≤ 8((nq)−1 + ∆1n ), (7) −1/4 and whenever |ψ| ≤ (nq)1/4 /4 and |t| ≤ ∆1n /4, p ¤1/4 −1/4 £ −1/4 s := ψ 2 + t2 ≤ L4N 8(ψ 4 /(nq) + t4 ∆1n ) ≤ L4N /2. (8) Now, by recalling that Wk are independent r.v.s and noting that h(ψ, t) = Eei P k √ Wk / N = Eei s P k Wk /Bn , it follows from (7)-(8) and the classical result (see, for example, Theorem 8.6 in Bhattacharya −1/4 and Ranga Rao (1976)) that, for |ψ| ≤ (nq)1/4 /4 and |t| ≤ ∆1n /4, ¯ ´ 2 ¯¯ ³ 3 ¯ i ¯h(ψ, t) − 1 + √ f (ψ, t) e−s /2 ¯ ¯ ¯ 6 N ¯ ¯ ¯ ³ ´ 2 ¯ i3 s3 X ¯ ¯ = ¯h(ψ, t) − 1 + EWk3 e−s /2 ¯ 3 ¯ ¯ 6Bn ≤ C(L4N + k 2 2 4 L3N )(s + s8 )e−s /3 −1 4 8 ≤ C[∆1n + (nq) ](s + s ) exp{−s2 /3}. This proves (5) and hence completes the proof of Lemma 1. 2 Lemma 2. For |ψ| ≤ (nq)1/4 /4 and |t| ≤ 1/4, we have ¯ d h(ψ, t) d g(ψ, t) ¯ ¢ ¡ 2 ¯ ¯ − ¯ ≤ C(1 + ψ 12 ) (nq)−1 + ∆1n e−ψ /4 . ¯ dt dt Proof. We first show that if |ψ| ≤ (nq)1/4 /4 and |t| ≤ 1/4, then ¯ ¯ ¶ µ ¯ ¯ d h(ψ, t) i d f (ψ, t) h(ψ, t)¯¯ Λ(ψ, t) := ¯¯ + t+ √ dt dt 6 N ¢ ¡ 4 −1 ≤ C(1 + ψ ) (nq) + ∆1n |h(ψ, t)| . (9) (10) To proveP(10), define (Uk , Vk ) and Wk = ψUk + tVk as in Lemma 1. Recall that h(ψ, t) = √ E exp{i k Wk / N }. It is readily seen that i h(ψ, t) X d h(ψ, t) IN k , = √ dt N k (11) h √ i−1 h √ i where IN k = E exp{iWk / N } E Vk exp{iWk / N } . Recall nq > 256. It follows from (19) and (20) in Zhao, Wu and Wang (2004) that for |ψ| ≤ (nq)1/4 /4 and |t| ≤ 1/4, h √ i−1 = 1 + θ1 N −1 (ψ 2 + t2 EVk2 ), E exp{iWk / N } 1406 (12) where |θ1 | ≤ 1 and N −1 (ψ 2 + t2 EVk2 ) ≤ 1/4. This, together with Taylor’s expansion of eix , yields that (recall EVk = 0) ¯ ¯ ¯ ¯ ¯IN k − √i EVk Wk + 1 EVk W 2 ¯ k¯ ¯ 2N N ¶ µ ψ 2 + t2 EVk2 3 1 1 2 3 √ |EVk Wk | + ≤ 3/2 E|Vk ||Wk | + |EVk Wk | . (13) N 2N N N As in the proof of (6), for |t| ≤ 1/4, X X E|Vk ||Wk |3 ≤ C(1 + |ψ|3 ) E(Uk4 + Vk4 ) k k £ 2 3 ¤ (nq)−1 + ∆1n , ≤ C(1 + |ψ| )N X k (ψ 2 + EVk2 )|EVk Wk | ≤ C(1 + |ψ|3 ) 3 ≤ C(1 + |ψ| ) 3 X (1 + EVk2 )(EUk2 + EVk2 ) k X (1 + EVk4 ) k 2 ≤ C(1 + |ψ| )N [(nq)−1 + ∆1n ], X k (ψ 2 + EVk2 )|EVk Wk2 | ≤ C(1 + ψ 4 ) ≤ C(1 + ψ 4 ) X k ¡ ¢ (1 + EVk2 ) E|Vk |3 + E|Uk |3 X³ k (pq)−1/2 (1 + EVk2 ) + E|Vk |3 + EVk2 E|Vk |3 h ´i X³ √ ≤ C(1 + ψ ) N (pq)−1/2 + 1 + EVk4 + N EVk4 4 (14) (15) ´ k 4 ≤ C(1 + ψ )N 5/2 ¤ £ (nq)−1 + ∆1n , (16) where, in the proof of (16), we have used the estimates: |Vk |3 ≤ 1 + Vk4 and √ EVk2 E|Vk |3 ≤ (EVk2 )1/2 EVk4 ≤ N EVk4 . Now (10) follows from (11), (13)-(16) and X X N d f (ψ, t) . EVk Wk2 = N (2A1 ψt + A2 t2 ) = EVk Wk = tN, 3 dt (17) k k We next complete the proof of Lemma 2 by virtue of (10) and Lemma 1. We first notice that, by (6), for all ψ and t, ´1/2 X 1 ³X 1 X EWk4 EWk2 E|Wk |3 ≤ |f (ψ, t)| ≤ N N k k k √ 2 2 3/2 −1 1/2 (18) ≤ 3 N (ψ + t ) (∆1n + (nq) ) , and similarly by (17), for all ψ and t, ¯ d f (ψ, t) ¯ √ 3 X ¯ ¯ E|Vk Wk2 | ≤ 9 N (ψ 2 + t2 )3/2 (∆1n + (nq)−1 )1/2 . ¯ ¯ ≤ dt N k 1407 (19) It follows from (18) and Lemma 1 that for |ψ| ≤ (nq)1/4 /4 and |t| ≤ 1/4, |h(ψ, t) − e−(ψ 2 +t2 )/2 | ≤ C(∆1n + (nq)−1 )1/2 e−ψ 2 /4 . (20) Therefore, by noting d g(ψ, t) i d f (ψ, t) −(ψ2 +t2 )/2 = −tg(ψ, t) − √ e , dt dt 6 N simple calculations show that ¯ d h(ψ, t) d g(ψ, t) ¯ ¯ ¯ − ¯ ≤ Λ(ψ, t) + t|h(ψ, t) − g(ψ, t)| ¯ dt dt ¯ ¯ 1 ¯¯ d f (ψ, t) ¯¯ 2 2 |h(ψ, t) − e−(ψ +t )/2 | + √ ¯ ¯ dt 6 N ¢ ¡ 2 ≤ C(1 + ψ 12 ) (nq)−1 + ∆1n e−ψ /4 , where we have used (5), (10), (19) and (20). The proof of Lemma 2 is now complete. 2 Lemma 3. Assume that |ψ| ≤ (nq)1/4 /4. Then, |h(ψ, t)| ≤ C∆1n e−(ψ 2 +t2 )/4 , (21) −1/4 for ∆1n /4 ≤ |t| ≤ (∆∗ )−1 /16, where X X √ √ |µk |3 / nb + N −1 ∆∗ = N −1 E|Xk − pµk |3 /( nb3/2 ). k k √ Assume that (nq)1/4 /4 ≤ |ψ| ≤ π nq. Then, |h(ψ, t)| ≤ C(nq)−4 , (22) for all |t| ≤ δ0 (∆∗ )−1 , where δ0 is so small that 192δ02 + 24δ0 ≤ 1 − cos(1/16). If in addition |t| ≤ 1/4, then we also have ¯ d h(ψ, t) ¯ ¯ ¯ ¯ ¯ ≤ C(nq)−4 . dt (23) Proof. The proof of this lemma follows directly from an application of Lemmas 1-3 in Zhao, Wu and Wang(2004). The choice of δ0 can be found in the proof of Lemma 2 in Zhao, Wu and Wang(2004). We omit the details. 2 Lemma 4. Assume that n/N ≥ 1/2 and ∆2n ≤ (nq)−1 /25, where ∆2n is defined as in Theorem 1 √ 3/2 nb /L0 , 2. Then, for |t| ≤ 15 |h(ψ, t)| ≤ exp{−Ct2 }, where L0 = 1 N P k E|Xk |3 is defined as in Theorem 1. 1408 (24) Proof. We only need to note that the condition ∆2n ≤ (nq)−1 /25 implies that 1 X 1/2 5qα2 ≤ 5qβ4 ≤ b = Var(Xk ) + qα2 , N k that is, N1 k Var(Xk ) ≥ (4/5) b. Then (24) is obtained by repeating the proof of Lemma 4 in Zhao, Wu and Wang(2004). 2 P Lemma Assume thatª n/N © 5. 1/4 √ 1 min (n/β ) , 18 b n/L0 , 4 16 ≥ 1/2 and ∆2n ≤ 1. Then, for |u| ! Ã ¯ ¯ 3 u3 b3/2 √ i ¯ ¯ A2 ¯ ¯E exp{iuSn / n} − exp{−bu2 /2} 1 + √ 6 N ≤ Cn−1 β4 (u2 + u4 + u6 b) exp{−0.3 b u2 }, where L0 = 1 N P k ≤ (25) E|Xk |3 is defined as in Theorem 1. Proof. Write, for 1 ≤ k ≤ N , √ bk (u) = exp{u2 /(2n)}fk (u) − 1 fk (u) = E exp{iuXk / n}, P j and Bj = ((−1)j+1 /j) N k=1 bk (u) for 1 ≤ j ≤ n. As in von Bahr (1972), we have √ exp{u /2}E exp{iuSn / n} = 2 X n Y (pj Bj )ij CN,n,Pnj=1 jij , ij ! (26) ij ≥0, 1≤j≤n j=1 where CN,n,r = ( ¡ ¢. ³ r ¡N ¢´ N −r p n , r ≤ n; n−r 0, r > n. In view of (28) of Zhao, Wu and Wang (2004), for r > 0, CN,n,r ≤ 1, and for n ≥ 4 and r ≤ n, CN,n,r ≥ 1 − r2 /n. (27) To prove (25) by using (26), we need some preliminary results. P Write βjk = EXkj , j = 2, 3, 4. Recall that N −1 k β2k = 1. We have that β4 ≥ 1 and by Taylor’s 1 (n/β4 )1/4 , expansion, for |u| ≤ 16 1 θ4 1 2 u + 2 u4 + 3 u6 , where |θ4 | ≤ 1/24, 2n 8n n u2 iu iu3 θ5 u4 fk (u) = 1 + √ µk − β2k − 3/2 β3k + 2 β4k , where |θ5 | ≤ 1/24. 2n n n 6n exp{u2 /(2n)} = 1 + 1/4 1/2 3/4 Now, by noting that |µk | ≤ β4k ≤ 1 + β4k , |β2k | ≤ β4k ≤ 1 + β4k and |β3k | ≤ β4k ≤ 1 + β4k , 1 (n/β4 )1/4 , we obtain that, for |u| ≤ 16 bk (u) = exp{u2 /(2n)}fk (u) − 1 u2 iu3 iu (1 − β2k ) + 3/2 (3µk − β3k ) + R1k (u). = √ µk + 2n n 6n 1409 (28) where |R1k (u)| ≤ (1 + β4k )u4 /n2 . Furthermore, by noting that √ √ 1/4 β4k |u|/ n ≤ (N β4 )1/4 |u|/ n ≤ 1/8 since n/N ≥ 1/2 and |u| ≤ 1 1/4 , 16 (n/β4 ) we have u2 2 iu3 µk + 3/2 µk (1 − β2k ) + R2k (u), n n 3 iu b3k (u) = − 3/2 µ3k + R3k (u), n j |bk (u)| ≤ 3(1/2)j−4 (1 + β4k )u4 /n2 , for j ≥ 4, b2k (u) = − where |R2k (u)| ≤ 3(1 + β4k )u4 /n2 and |R3k (u)| ≤ 4(1 + β4k )u4 /n2 . Recalling that P 1 1/4 , k (1 − β2k ) = 0, it follows from (28)–(31) that, for |u| ≤ 16 (n/β4 ) pB1 = p X k iu3 bk (u) = − √ β3 + θ6 β4 u4 /n, 6 n p2 B2 = (−p2 /2) X b2k (u) = k p3 B3 = (p3 /3) X k |pj Bj | ≤ X k u2 p iu3 p α2 + √ γ12 + θ7 β4 u4 /n, 2 2 n ip2 u3 b3k (u) = − √ α3 + θ8 β4 u4 /n, 3 n |bjk (u)| ≤ 6β4 u4 (1/2)j−4 /n, for j ≥ 4. (29) (30) (31) P k µk = (32) (33) (34) (35) where |θ6 | ≤ 2, |θ7 | ≤ 3 and |θ8 | ≤ 3. By virtue of (35), it is readily seen that, for |u| ≤ 1 1/4 , 16 (n/β4 ) n X j=4 |pj Bj | ≤ 12β4 u4 /n, (36) Noting that |α3 | + |β3 | + |γ12 | ≤ 3L0 and recalling that ∆2n = (nb2ª)−1 β4 ≤ 1, it follows easily © √ 1 from (32)–(34) and (36) that, for |u| ≤ 16 min (n/β4 )1/4 , 18 b n/L0 , n X j=1 |pj Bj | = = u3 L0 1 u4 β4 p α2 u2 + θ9 √ + θ10 2 n n 1 p α2 u2 + θ11 b u2 , 2 (37) P where |θ9 | ≤ 3, |θ10 | ≤ 20 and |θ11 | ≤ 0.2. Also, if we let L(u) = 3j=1 pj Bj − pα2 u2 /2, we have ¯ ¯ ¯ ¯ 3 b3/2 i u ¯ ¯ (38) |L(u)| ≤ 0.2bu2 , ¯L(u) + √ A2 ¯ ≤ 8β4 u4 /n, ¯ ¯ 6 N where A2 is defined as in (4). As in the proof of (6), we may obtain ´2 ³ X X X EVk4 ≤ N ∆1n ≤ 17N β4 /(nb2 ). EVk2 EVk3 ≤ N −2 A22 = N −1 k k k 1410 This together with (38) yields, for |u| ≤ 1 1/4 , 16 (n/β4 ) h |u|3 b3/2 i2 8 √ |A2 | + β4 u4 n 6 N 6 2 8 ≤ u bβ4 /n + 128β4 u /n2 ≤ β4 (u4 + u6 b)/n. L2 (u) ≤ (39) We are now ready to prove (25) by using (26). Rewrite (26) as √ exp{u2 /2}E exp{iuSn / n} = I1 + I2 + I3 , (40) where I1 = n XY (pj Bj )ij CN,n,Pnj=1 jij , ij ! j=1 3 ´ Y (pj Bj )ij ³ CN,n,P3 jij − 1 , j=1 ij ! X I2 = ij ≥0, 1≤j≤3 j=1 3 Y (pj Bj )ij , ij ! X I3 = ij ≥0, 1≤j≤3 j=1 where the summation in the expression of I1 is over all ij ≥ 0, j = 1, 2, 3 and ij > 0 for at least one j = 4, · · · , n. As in Mirakhmedov (1983), it follows from (36)-(37) that |I1 | ≤ exp ≤ n X 3 nX j=1 o³ exp n nX |pj Bj | exp j=4 −1 ≤ Cn |pj Bj | j=1 4 n nX j=4 |pj Bj | o ´ |pj Bj | − 1 o β4 u exp{pα2 u2 /2 + 0.2bu2 }. As for I2 , it follows easily from (27) and (37) that |I2 | ≤ Cn 3 3 Y |pj Bj |ij ³ X ´2 jij ij ! X −1 ij ≥0, 1≤j≤3 j=1 ≤ Cn−1 ≤ Cn −1 ≤ Cn −1 X ij ≥0, 1≤j≤3 j=1 exp 3 nX j=1 2 j=1 3 Y i2j |pj Bj |ij j |p Bj | 4 ij ! 3 oX ¡ j=1 |pj Bj | + |pj Bj |2 β4 (u + u ) exp{pα2 u2 /2 + 0.2bu2 }. 1411 ¢ P3 We next estimate I3 . Recalling that b = 1 − pα2 and noting that I3 = e j=1 pj B j , we have ¯ ³ i3 u3 b3/2 ´ −bu2 /2 ¯¯ 2 ¯ A2 e ¯I3 e−u /2 − 1 + √ ¯ 6 N ¯ ¯ ¯ i3 u3 b3/2 ¯¯ 2 2 ¯ ¯ ¯ A2 ¯ ≤ e−bu /2 ¯eL(u) − 1 − L(u)¯ + e−bu /2 ¯L(u) − √ 6 N £ ¤ 2 ≤ (1/2)L2 (u)e|L(u)| + 8β4 u4 /n e−bu /2 2 ≤ C n−1 β4 (u4 + u6 b) e−0.3 bu , where L(u) = P3 j=1 p jB j − pα2 u2 /2 and we have used (38)-(39). Combining (40) and all above facts for I1 -I3 , we obtain ¯ Ã !¯ ¯ ¯ 3 u3 b3/2 √ i ¯ ¯ A2 ¯ ¯E exp{iuSn / n} − exp{−bu2 /2} 1 + √ ¯ ¯ 6 N ¯ ³ i3 u3 b3/2 ´ −bu2 /2 ¯¯ 2 ¯ A2 e ≤ exp{−u2 /2}(|I1 | + |I2 |) + ¯I3 e−u /2 − 1 + √ ¯ 6 N ≤ C n−1 β4 (u2 + u4 + u6 b) exp{−0.3bu2 }, which implies (25). The proof of Lemma 5 is now completed. 2 After these preliminaries, we are now ready to prove the theorems. Proof of Theorem 1. Without loss of generality, assume that nq > 256 and ∆1n < 1/16. Write T −1 = ∆1n + (nq)−1 and ³ i3 t3 A2 ´ gn (t) = 1 + √ exp{−t2 /2}, 6 N where A2 is defined as in Lemma 1. We shall prove, (i) if |t| ≤ 1/4, then ¯ ¯ ¯Ψ(t) − gn (t)¯ ≤ C|t| T −1 ; (41) ¯ ¯ ¯Ψ(t) − gn (t)¯ ≤ C T −1 (1 + t8 )e−t2 /4 + C (nq)−3 ; (42) (ii) if |t| ≤ δ0 (∆∗ )−1 , where δ0 and ∆∗ are defined as in Lemma 3, then (iii) if δ0 (∆∗ )−1 ≤ |t| ≤ T , then ¯ ¯ ª © ¯Ψ(t) − gn (t)¯ ≤ C T −1 e−t2 /4 + 3√nq exp − nq δN , (43) where δN is defined as in Theorem 1. √ Note that |A2 | ≤ N /4 by ∆1n ≤ 1/16 and the last second inequality of (39). We have m ≡ supx |G′n (x)| ≤ C(1 + N −1/2 |A2 |) ≤ 2C. So, by virtue of (41)-(43) and Esseen’s smoothing 1412 lemma, simple calculations show that ¯ ¯ √ ¯ ¯ sup ¯P (Sn / nb ≤ x) − Gn (x)¯ x Z Z ³Z + + ≤ |Ψ(t) − gn (t)| dt + Cm T −1 |t| 1/4≤|t|≤T1 |t|≤1/4 T1 ≤|t|≤T ª © √ −1 ≤ C (∆1n + (nq) ) + 3 nq log(nq) exp − nq δN , where T1 = min{δ0 (∆∗ )−1 , T }, which implies (1) and hence Theorem 1. We next prove (41)-(43). Throughout the proof, we write s2 = ψ 2 + t2 . R∞ Consider (42) first. Note that gn (t) = √12π −∞ g(ψ, t)dψ. It is readily seen that |Ψ(t) − gn (t)| ≤ II1 + II2 + II3 + II4 , (44) where −1 II1 = [Bn (p)] Z |ψ|≤(nq)1/4 /4 −1 II2 = [Bn (p)] |h(ψ, t) − g(ψ, t)| dψ, Z √ (nq)1/4 /4≤|ψ|≤π nq |h(ψ, t)| dψ, |f (ψ, t)| ´ −s2 /2 √ e dψ, 6 N |ψ|≥(nq)1/4 /4 ¯ ¯Z ∞ ³ |f (ψ, t)| ´ −s2 /2 ¯ ¯ e dψ. 1+ √ II4 = ¯[Bn (p)]−1 − (2π)−1/2 ¯ 6 N −∞ −1 II3 = [Bn (p)] Z ³ 1+ To estimate IIj , j = 1, 2, 3, 4, we first recall that, by (18), for all ψ and t, √ √ |f (ψ, t)| ≤ 3s3 N (∆1n + (nq)−1 )1/2 ≤ N s3 , (45) and by virtue of Stirling’s formula, 1≤ √ 2π/Bn (p) ≤ 1 + 1/nq. (46) In view of (45) and (46), it is readily seen that II3 + II4 ≤ C (nq)−1 (1 + t6 )e−t 2 /3 . (47) By using (22), we have II2 ≤ C (nq)−3 . (48) −1/4 As for II1 , if |t| ≤ min{∆1n /4, δ(∆∗ )−1 }, Lemma 1 implies that II1 ≤ C (∆1n + (nq)−1 )(1 + t8 )e−t 2 /4 ; (49) −1/4 if ∆1n /4 ≤ |t| ≤ δ(∆∗ )−1 , then it follows from (21) and (45) that Z Z ³ |f (ψ, t)| ´ −s2 /2 |h(ψ, t)|dψ + 1+ √ II1 ≤ e dψ 6 N |ψ|≤(nq)1/4 /4 |ψ|≤(nq)1/4 /4 ≤ C ∆1n e−t 2 /4 + C1 (1 + |t|3 )e−t 1413 2 /2 ≤ C ∆1n e−t 2 /4 . (50) Taking (47)-(50) into (44), we obtain the required (42). R∞ Secondly we prove (41). Recall that gn (t) = √12π −∞ g(ψ, t)dψ. As in (44), we have ¯ ¯ ¯ d Ψ(t) dgn (t) ¯ ¯ ¯ ¯ dt − dt ¯ ≤ III1 + III2 + III3 + III4 , where III1 III2 III3 III4 (51) ¯ ¯ ¯ d h(ψ, t) d g(ψ, t) ¯ ¯ dψ, ¯ − = [Bn (p)] ¯ dt dt ¯ |ψ|≤(nq)1/4 /4 ¯ ¯ Z ¯ d h(ψ, t) ¯ −1 ¯ ¯ dψ, = [Bn (p)] √ ¯ dt ¯ (nq)1/4 /4≤|ψ|≤π nq Z ³ 1 ¯¯ d f (ψ, t) ¯¯´ −s2 /2 |t||f (ψ, t)| −1 √ dψ, + √ ¯ |t| + = [Bn (p)] ¯ e dt 6 N 6 N |ψ|≥(nq)1/4 /4 ¯ ¯Z ∞ ³ 1 ¯¯ d f (ψ, t) ¯¯´ −s2 /2 |t||f (ψ, t)| ¯ −1 −1/2 ¯ √ √ dψ. = ¯[Bn (p)] − (2π) + |t| + ¯ e ¯ ¯ dt 6 N 6 N −∞ −1 Z By (18)-(19) and (46), we have that for |t| ≤ 1/4 III3 + III4 ≤ C(∆1n + (nq)−1 ). (52) By (9), (23) and (46), we have that for |t| ≤ 1/4 III1 + III2 ≤ C(∆1n + (nq)−1 ). (53) Taking these estimates into (51), we obtain for |t| ≤ 1/4, ¯ ¯ ¯ d Ψ(x) dgn (x) ¯ ¯ ¯ −1 ¯Ψ(t) − gn (t)¯ ≤ |t| sup ¯ ¯ ¯ dx − dx ¯ ≤ C|t| (∆1n + (nq) ), |x|≤1/4 which yields (41). Finally we prove (43). We first notice that ∆1n ≥ 1/(16nb). Indeed, if α2 ≤ 1/4, then ³ ´2 X ∆1n ≥ N −1 E(Xk − pµk )2 /(nb2 ) ≥ (1 − 2pα2 )2 /(nb2 ) ≥ 1/(16nb), P and if α2 > 1/4, then ∆1n ≥ (N −1 µ2k )2 /(nb) = α22 /(nb) ≥ 1/(16nb). This, together with the fact that X 9 E|Xk |3 , ∆∗ ≤ √ 3/2 N −1 nb k 1414 √ √ implies that if δ0 (∆∗ )−1 ≤ |t| ≤ T , then δ0 b/(9L0 ) ≤ |t|/ nb ≤ 16 nb and hence ¶ ³ Yµ √ ´ √ 2 1 − 2pq 1 − E cos(ψ/ nq + tXk / nb) |h(ψ, t)| ≤ k ( ≤ exp −2pq X³ k √ ´ 1 − E cos(ψ/ nq + tXk / nb) √ ) ¯!) ¯ X √ ¯ ¯ √ exp{iψ/ nq + itXk / nb}¯¯ ≤ exp −2N pq 1 − ¯¯(1/N )E k ( Ã ¯!) ¯ X √ ¯ ¯ E exp{itXk / nb}¯¯ ≤ exp −2N pq 1 − ¯¯(1/N ) ( Ã k ≤ exp{−2nq δN }. (54) 1/2 We also note that ∆∗ ≤ 2∆1n and this together with (45) implies that, for δ0 (∆∗ )−1 ≤ |t| ≤ T , Z ∞³ |f (ψ, t)| ´ −s2 /2 2 2 1+ √ |gn (t)| ≤ e dψ ≤ C(1 + |t|3 ) e−t /2 ≤ C∆1n e−t /4 . (55) 6 N −∞ Combining (54) and (55) and using the estimate (46), we obtain that, for δ0 (∆∗ )−1 ≤ |t| ≤ T , Z ¯ ¯ ¯Ψ(t) − gn (t)¯ ≤ [Bn (p)]−1 |h(ψ, t)| dψ + |gn (t)| √ |ψ|≤π nq −t2 /4 ≤ C ∆1n e © ª √ + 3 nq exp − nq δN , which yields (43). The proof of Theorem 1 is complete. 2 Proof of Theorem 2. Without loss of generality, assume ∆2n ≤ 1. We first prove the property (2) for n/N ≥ 1/2 and ∆2n ≤ (nq)−1 /25. ª © √ 1/2 1 √ 3/2 nb /L0 . As in the Write T ∗ = (nb2 )/β4 , T1∗ = b16 min (n/β4 )1/4 , 18 b n/L0 and T2∗ = 15 proof of Theorem 1, it follows from Esseen’s smoothing lemma that ¯ ¯ √ ¯ ¯ sup ¯P (Sn / nb ≤ x) − Gn (x)¯ x Z Z ³Z |Ψ(t) − gn (t)| + + ≤ dt + C ∆2n |t| T2∗ ≤|t|≤T ∗ T1∗ ≤|t|≤T2∗ |t|≤T1∗ = Λ1n + Λ2n + Λ3n + C ∆2n , say. (56) By virtue of Lemma 5, simple calculations show that Λ1n ≤ C ∆2n . Recall ∆1n ≤ 17 ∆2n . Applying Lemma 4 and similar arguments as in the proof of (50), we obtain that Λ2n ≤ C ∆2n R and also T ∗ ≤|t|≤T ∗ |gn|t|(t)| dt ≤ C∆2n . Therefore, to prove (2), it remains to show that, for 2 T2∗ ≤ |t| ≤ T ∗ , © √ (57) |Ψ(t)| ≤ 3 n exp − n δ1N }. In fact, by using (3) in Zhao, Wu and Wang (2004), for q > 0, Z π √ ¢ Y¡ √ √ −1 q + peiψ+itXj / nb dψ, e−inψ Ψ(t) = E exp{itSn / nb} = ( 2πGn (p)) −π 1415 k √ √ √ n pn q N −n . This, together with the fact that δ b/(5L ) ≤ |t|/ nb ≤ 16 nb where Gn (p) = 2πCN 0 0 whenever T2∗ ≤ |t| ≤ T ∗ , implies that |Ψ(t)| ≤ ≤ √ √ 2π(Gn (p))−1 √ Y¡ ¢ q + p|EeitXj / nb | k n X¡ √ ¢o 2π(Gn (p))−1 exp p |EeitXj / nb | − 1 k √ ≤ 3 n exp{−nδ1N }, √ √ where we have used the inequality π/2 ≤ nqGn (p) < 1 (see, for instance, Lemma 1 in Höglund(1978)). This proves (57) for q > 0. If q = 0, then N = n hence by the independence of Xk , nX¡ √ √ Y ¢o |Ψ(t)| = |EeitXj / nb | ≤ exp |EeitXj / nb | − 1 ≤ exp{−nδ1N }. k k This implies that (57) still holds for q = 0. We have now completed the proof of (57) and hence (2) for n/N ≥ 1/2 and ∆2n ≤ (nq)−1 /25. Note that β4 ≥ 1, ∆1n ≤ 17 ∆2n and b ≥ q ≥ 1/2 if n/N ≤ 1/2. We have that ∆1n + (nq)−1 ≤ 42 ∆2n , whenever n/N ≤ 1/2 or ∆2n > (nq)−1 /25. Based on this fact, by using a similar argument to that above and that in the proof of Theorem 1, we may obtain (2) for n/N ≤ 1/2 or ∆2n > (nq)−1 /25, as well. The details are omitted. The proof of Theorem 2 is now complete. 2 References [1] Babu, G. J. and Bai, Z. D. (1996). Mixtures of global and local Edgeworth expansions and their applications. J. Multivariate. Anal. 59 282-307. MR1423736 [2] von Bahr, B. (1972). On sampling from a finite set of independent random variables. Z. Wahrsch. Verw. Geb. 24 279–286. MR0331471 [3] Bhattacharya, R. N. and Ranga Rao, R. (1976). Normal approximation and asymptotic expansions. Wiley, New York. MR0436272 [4] Bickel, P. J. and von Zwet, W. R. (1978). Asymptotic expansions for the power of distribution-free tests in the two-sample problem. Ann. Statist. 6 937-1004. MR0499567 [5] Bikelis, A. (1969). On the estimation of the remainder term in the central limit theorem for samples from finite populations. Studia Sci. Math. Hungar. 4 345-354 in Russian. MR0254902 [6] Bloznelis, M. (2000a). One and two-term Edgeworth expansion for finite population sample mean. Exact results, I. Lith. Math. J. 40(3) 213-227. MR1803645 [7] Bloznelis, M. (2000b). One and two-term Edgeworth expansion for finite population sample mean. Exact results, II. Lith. Math. J. 40(4) 329-340. MR1819377 1416 [8] Bloznelis, M. (2003). Edgeqorth expansions for studentized versions of symmetric finite population statistics. Lith. Math. J. 43(3) 221-240. MR2019541 [9] Bloznelis, M. and Götze, F. (2000). An Edgeworth expansion for finite population U statistics. Bernoulli 6 729-760. MR1777694 [10] Bloznelis, M. and Götze, F. (2001). Orthogonal decomposition of finite population statistic and its applications to distributional asymptotics. Ann. Statist. 29 899-917. MR1865345 [11] Erdös, P. and Renyi, A. (1959). On the central limit theorem for samples from a finite population. Fubl. Math. Inst. Hungarian Acad. Sci. 4 49-61. MR0107294 [12] Höglund, T.(1978). Sampling from a finite population. A remainder term estimate. Scand. J. Statistic. 5 69-71. MR0471130 [13] Kokic, P. N. and Weber, N. C. (1990). An Edgeworth expansion for U -statistics based on samples from finite populations. Ann. Probab. 18 390-404. MR1043954 [14] Mirakjmedov, S. A. (1983). An asymptotic expansion for a sample sum from a finite sample. Theory Probab. Appl. 28(3) 492-502. [15] Nandi, H. K. and Sen, P. K. (1963). On the properties of U -statistics when the observations are not independent II: unbiased estimation of the parameters of a finite population. Calcutta Statist. Asso. Bull 12 993-1026. MR0161418 [16] Robinson, J. (1978). An asymptotic expansion for samples from a finite population. Ann. Statist. 6 1004-1011. MR0499568 [17] Schneller, W. (1989). Edgeworth expansions for linear rank statistics. Ann. Statist. 17 1103–1123. MR1015140 [18] Zhao, L.C. and Chen, X. R. (1987). Berry-Esseen bounds for finite population U -statistics. Sci. Sinica. Ser. A 30 113-127. MR0892467 [19] Zhao, L.C. and Chen, X. R. (1990). Normal approximation for finite population U -statistics. Acta Math. Appl. Sinica 6 263-272. MR1078067 [20] Zhao, L.C., Wu, C. Q. and Wang, Q. (2004). Berry-Esseen bound for a sample sum from a finite set of independent random variables. J. Theoretical Probab. 17 557-572. MR2091551 1417

a l r n u o

Related documents

Products

Support

a l r n u o

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib