ORDER STATISTICS AND LIMITING DISTRIBUTIONS

ORDER STATISTICS
• Let X1, X2,…,Xn be a r.s. of size n from a distribution of continuous type having pdf f(x), a<x<b. Let X(1) be the smallest of the Xi, X(2) the second smallest,…, and X(n) the largest.
[Figure: the n observations placed in increasing order on the interval (a, b).]
• X(i) is the i-th order statistic. In particular,
  $X_{(1)} = \min(X_1, X_2, \ldots, X_n)$ and $X_{(n)} = \max(X_1, X_2, \ldots, X_n)$.

ORDER STATISTICS
• It is often useful to consider the ordered random sample.
• Example: suppose a r.s. of five light bulbs is tested and the failure times are observed as (5, 11, 4, 100, 17). These will actually be observed in the order (4, 5, 11, 17, 100). Interest might be in the k-th smallest ordered observation, e.g. stopping the experiment after the k-th failure. We might also be interested in joint distributions of two or more order statistics, or in functions of them (e.g. range = max − min).

ORDER STATISTICS
• If X1, X2,…,Xn is a r.s. of size n from a population with continuous pdf f(x), then the joint pdf of the order statistics X(1), X(2),…,X(n) is
  $g(x_{(1)}, x_{(2)}, \ldots, x_{(n)}) = n!\, f(x_{(1)})\, f(x_{(2)}) \cdots f(x_{(n)})$ for $x_{(1)} < x_{(2)} < \cdots < x_{(n)}$.
• Order statistics are not independent, and the joint pdf of the ordered sample is not the same as the joint pdf of the unordered sample.
• Future reference: for discrete distributions we need to take ties into account (two X's being equal). See Casella and Berger (1990), p. 231.

Example
• Suppose that X1, X2, X3 is a r.s. from a population with pdf f(x)=2x for 0<x<1. Find the joint pdf of the order statistics and the marginal pdf of the smallest order statistic.

ORDER STATISTICS
• The Maximum Order Statistic: X(n)
  $G_{X_{(n)}}(y) = P(X_{(n)} \le y) = [F_X(y)]^n$
  $g_{X_{(n)}}(y) = n\,[F_X(y)]^{n-1}\, f_X(y)$

ORDER STATISTICS
• The Minimum Order Statistic: X(1)
  $G_{X_{(1)}}(y) = P(X_{(1)} \le y) = 1 - [1 - F_X(y)]^n$
  $g_{X_{(1)}}(y) = n\,[1 - F_X(y)]^{n-1}\, f_X(y)$

ORDER STATISTICS
• k-th Order Statistic: X(k)
[Diagram: of the n observations, k−1 fall below y (each with probability F_X(y)), one falls at y (density f_X(y)), and n−k fall above y (each with probability 1−F_X(y)); the number of possible orderings is n!/[(k−1)! 1! (n−k)!].]
  $g_{X_{(k)}}(y) = \dfrac{n!}{(k-1)!\,(n-k)!}\,[F_X(y)]^{k-1}\, f_X(y)\,[1 - F_X(y)]^{n-k}, \quad a < y < b$

Example
• Same example, but now using the previous formulas (without taking the integrals): suppose that X1, X2, X3 is a r.s. from a population with pdf f(x)=2x for 0<x<1. Find the marginal pdf of the smallest order statistic.

Example
• X~Uniform(0,1). A r.s. of size n is taken. Find the pdf of the k-th order statistic.
• Solution: Let Yk be the k-th order statistic. Since F(y)=y and f(y)=1 for 0<y<1,
  $g_{Y_k}(y) = \dfrac{n!}{(k-1)!\,(n-k)!}\, y^{k-1}(1-y)^{n-k} = \dfrac{\Gamma(n+1)}{\Gamma(k)\,\Gamma(n-k+1)}\, y^{k-1}(1-y)^{n-k}, \quad 0 < y < 1,$
  so $Y_k \sim \mathrm{Beta}(k,\, n-k+1)$.

ORDER STATISTICS
• Joint pdf of the k-th and j-th Order Statistics (for k<j)
[Diagram: k−1 observations below yk, one at yk, j−k−1 between yk and yj, one at yj, and n−j above yj; the number of possible orderings is n!/[(k−1)! 1! (j−k−1)! 1! (n−j)!].]
  $g_{X_{(k)},X_{(j)}}(y_k, y_j) = \dfrac{n!}{(k-1)!\,(j-k-1)!\,(n-j)!}\,[F(y_k)]^{k-1}\, f(y_k)\,[F(y_j)-F(y_k)]^{j-k-1}\, f(y_j)\,[1-F(y_j)]^{n-j}$

LIMITING DISTRIBUTION

Motivation
• The pdf of a rv often depends on the sample size n.
• If X1, X2,…, Xn is a sequence of rvs and Yn=u(X1, X2,…, Xn) is a function of them, sometimes it is possible to find the exact distribution of Yn (what we have been doing lately).
• However, sometimes it is only possible to obtain approximate results when n is large → limiting distributions.

CONVERGENCE IN DISTRIBUTION
• Let X1, X2,…, Xn be a sequence of rvs, let Yn=u(X1, X2,…, Xn) be a function of them, and let Fn(y) be the cdf of Yn for each n=1, 2,…. If
  $\lim_{n\to\infty} F_n(y) = F(y)$ at every y at which F(y) is continuous,
  then the sequence Yn is said to converge in distribution to a rv Y with cdf F(y). Notation: $Y_n \xrightarrow{d} Y$.
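To make the definition concrete, here is a small numerical sketch (added for illustration, not part of the original notes). It uses Yn = max(X1,…,Xn) for Xi ~ Unif(0,1), whose exact cdf is Fn(y) = y^n on (0,1); this also previews the first example below. The sample sizes and replication count are arbitrary choices.

```python
import numpy as np

def F_n(y, n):
    """Exact cdf of the maximum of n iid Unif(0,1) rvs: F_n(y) = y^n on [0, 1]."""
    return np.clip(y, 0.0, 1.0) ** n

ys = np.array([0.5, 0.9, 0.99, 1.0])   # points at which to evaluate the cdfs
for n in [5, 50, 500]:
    rng = np.random.default_rng(0)
    maxima = rng.random((10_000, n)).max(axis=1)          # simulated values of Y_n
    empirical = np.array([(maxima <= y).mean() for y in ys])
    print(f"n={n:3d}  F_n(ys)={np.round(F_n(ys, n), 4)}  empirical={np.round(empirical, 4)}")
```

As n grows, Fn(y) → 0 for every y < 1 while Fn(1) stays at 1, so the limit function is the cdf of a constant: the limiting distribution is degenerate at 1.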
CONVERGENCE IN DISTRIBUTION
• Theorem: If $\lim_{n\to\infty} F_n(y) = F(y)$ for every point y at which F(y) is continuous, then Yn is said to have a limiting distribution with cdf F(y). The term "asymptotic distribution" is sometimes used instead of "limiting distribution".
• The definition of convergence in distribution requires only that the limiting function agrees with the cdf at its points of continuity.

Example
Let X1,…, Xn be a random sample from Unif(0,1). Find the limiting distribution of the max order statistic, if it exists.

Example
Let {Xn} be a sequence of rvs with pmf
  $f_n(x) = P(X_n = x) = \begin{cases} 1, & \text{if } x = 2 + \frac{1}{n} \\ 0, & \text{o.w.} \end{cases}$
Find the limiting distribution of Xn, if it exists.

Example
Let Yn be the n-th order statistic of a random sample X1, …, Xn from Unif(0,θ). Find the limiting distribution of Zn = n(θ − Yn), if it exists.

Example
Let X1,…, Xn be a random sample from Exp(θ). Find the limiting distribution of the min order statistic, if it exists.

Example
• Suppose that X1, X2, …, Xn are iid from Exp(1). Find the limiting distribution of the max order statistic, if it exists.

LIMITING MOMENT GENERATING FUNCTIONS
• Let the rv Yn have an mgf Mn(t) that exists for all n. If
  $\lim_{n\to\infty} M_n(t) = M(t),$
  then Yn has a limiting distribution which is defined by M(t).

Example
$X \sim \mathrm{Bin}(n, p)$ with mgf $M_X(t) = (pe^t + 1 - p)^n$. Let λ = np (held fixed as n → ∞, so that p = λ/n). Then
  $\lim_{n\to\infty} M_{X_n}(t) = \lim_{n\to\infty} (pe^t + 1 - p)^n = \lim_{n\to\infty}\left[1 + \frac{\lambda(e^t - 1)}{n}\right]^n = e^{\lambda(e^t - 1)} = M_Y(t),$
  the mgf of Poisson(λ). The limiting distribution of the Binomial rv is the Poisson distribution.

CONVERGENCE IN PROBABILITY (STOCHASTIC CONVERGENCE)
• A rv Yn converges in probability to a rv Y if
  $\lim_{n\to\infty} P(|Y_n - Y| < \varepsilon) = 1$
  for every ε > 0.
• Special case: Y = c, where c is a constant not depending on n. Then the limiting distribution of Yn is degenerate at the point c.

CHEBYSHEV'S INEQUALITY
• Let X be an rv with E(X)=μ and V(X)=σ². Then
  $P(|X - \mu| < \varepsilon) \ge 1 - \dfrac{\sigma^2}{\varepsilon^2}, \quad \varepsilon > 0.$
• Chebyshev's Inequality can be used to prove stochastic convergence in many cases.

CONVERGENCE IN PROBABILITY (STOCHASTIC CONVERGENCE)
• Chebyshev's Inequality proves that Yn converges in probability to μ if the following three conditions are satisfied:
  1. E(Yn) = μn, where $\lim_{n\to\infty} \mu_n = \mu$.
  2. V(Yn) = σn² < ∞ for all n.
  3. $\lim_{n\to\infty} \sigma_n^2 = 0$.

Example
Let X be an rv with E(X)=μ and V(X)=σ²<∞. For a r.s. of size n, $\bar{X}_n$ is the sample mean. Is $\bar{X}_n \xrightarrow{p} \mu$?

WEAK LAW OF LARGE NUMBERS (WLLN)
• Let X1, X2,…,Xn be iid rvs with E(Xi)=μ and V(Xi)=σ²<∞. Define $\bar{X}_n = \frac{1}{n}\sum_{i=1}^{n} X_i$. Then, for every ε > 0,
  $\lim_{n\to\infty} P(|\bar{X}_n - \mu| < \varepsilon) = 1,$
  i.e. $\bar{X}_n$ converges in probability to μ.
• The WLLN states that the sample mean is a good estimate of the population mean: for large n, the sample mean is close to the population mean with probability close to 1. But more to come on the properties of estimators.

Example
• Let X1, X2,…,Xn be iid rvs from Ber(p). Then E(X)=p and Var(X)=p(1−p)<∞. By the WLLN,
  $\bar{X}_n = \dfrac{\sum_{i=1}^{n} X_i}{n} = \dfrac{\#\text{ of 1s}}{n} \xrightarrow{p} p$ as n → ∞.

STRONG LAW OF LARGE NUMBERS
• Let X1, X2,…,Xn be iid rvs with E(Xi)=μ and V(Xi)=σ²<∞. Define $\bar{X}_n = \frac{1}{n}\sum_{i=1}^{n} X_i$. Then
  $P\left(\lim_{n\to\infty} \bar{X}_n = \mu\right) = 1,$
  that is, $\bar{X}_n$ converges almost surely to μ.

Relation between convergences
• Almost sure convergence is the strongest:
  $Y_n \xrightarrow{a.s.} Y \;\Rightarrow\; Y_n \xrightarrow{p} Y \;\Rightarrow\; Y_n \xrightarrow{d} Y$
  (the reverse is not true).

THE CENTRAL LIMIT THEOREM
• Let X1, X2,…,Xn be a sequence of iid rvs with E(Xi)=μ and V(Xi)=σ²<∞. Define $\bar{X}_n = \frac{1}{n}\sum_{i=1}^{n} X_i$. Then
  $Z_n = \dfrac{\sqrt{n}\,(\bar{X}_n - \mu)}{\sigma} \xrightarrow{d} N(0,1)$
  or, equivalently,
  $Z_n = \dfrac{\sum_{i=1}^{n} X_i - n\mu}{\sigma\sqrt{n}} \xrightarrow{d} N(0,1).$
• The proof can be found in many books, e.g. Bain and Engelhardt (1992), p. 239.

Example
• Let X1, X2,…,Xn be iid rvs from Unif(0,1) and $Y_n = \sum_{i=1}^{n} X_i$.
• Find the approximate distribution of Yn.
• Find approximate values of
  – $P\left(\sum_{i=1}^{20} X_i \le 12\right)$
  – the 90th percentile of Yn.
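Below is a minimal sketch (assuming NumPy and SciPy; not part of the original notes) of how the CLT approximation for this example could be computed and then checked by simulation. Since each Xi ~ Unif(0,1) has mean 1/2 and variance 1/12, the CLT suggests Yn is approximately N(n/2, n/12).

```python
import numpy as np
from scipy import stats

n = 20
mu, var = 0.5, 1.0 / 12.0                                # mean and variance of Unif(0,1)
approx = stats.norm(loc=n * mu, scale=np.sqrt(n * var))  # CLT approximation: N(10, 20/12)

# Normal-approximation answers to the two questions.
print("approx P(Y_20 <= 12)      :", approx.cdf(12.0))
print("approx 90th percentile    :", approx.ppf(0.90))

# Monte Carlo check of the approximation.
rng = np.random.default_rng(1)
y = rng.random((200_000, n)).sum(axis=1)                 # 200,000 simulated values of Y_20
print("simulated P(Y_20 <= 12)   :", (y <= 12).mean())
print("simulated 90th percentile :", np.quantile(y, 0.90))
```

Comparing the normal-approximation values with the simulated ones gives a sense of how accurate the approximation is at this sample size.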
• Warning: n=20 is probably not a large enough sample size.

SLUTSKY'S THEOREM
• If Xn → X in distribution and Yn → a, a constant, in probability, then
  a) YnXn → aX in distribution;
  b) Xn + Yn → X + a in distribution.

SOME THEOREMS ON LIMITING DISTRIBUTIONS
• If Xn → c > 0 in probability, then for any function g(x) continuous at c, g(Xn) → g(c) in probability; e.g. $\sqrt{X_n} \xrightarrow{p} \sqrt{c}$.
• If Xn → c in probability and Yn → d in probability, then
  • aXn + bYn → ac + bd in probability;
  • XnYn → cd in probability;
  • 1/Xn → 1/c in probability for all c ≠ 0.

Example
• X ~ Gamma(θ, 1), so that E(X)=θ and V(X)=θ. Show that
  $\dfrac{\sqrt{n}\,(\bar{X}_n - \theta)}{\sqrt{\bar{X}_n}} \xrightarrow{d} N(0,1).$
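A minimal simulation sketch of the Slutsky argument in this last example (added for illustration; the shape value θ = 4, the sample size, and the number of replications are arbitrary choices). By the CLT, √n(X̄n − θ)/√θ →d N(0,1); since X̄n →p θ, the continuity result gives √(X̄n) →p √θ, and Slutsky's theorem then allows √θ to be replaced by √(X̄n).

```python
import numpy as np
from scipy import stats

theta, n, reps = 4.0, 100, 50_000
rng = np.random.default_rng(2)

x = rng.gamma(shape=theta, scale=1.0, size=(reps, n))  # X_i ~ Gamma(theta, 1): mean = var = theta
xbar = x.mean(axis=1)
z = np.sqrt(n) * (xbar - theta) / np.sqrt(xbar)        # sqrt(theta) replaced by sqrt(xbar)

# Compare empirical quantiles of z with standard normal quantiles.
for q in [0.05, 0.25, 0.50, 0.75, 0.95]:
    print(f"q={q:.2f}  empirical={np.quantile(z, q):+.3f}  N(0,1)={stats.norm.ppf(q):+.3f}")
```

The empirical quantiles of Zn should sit close to the N(0,1) quantiles, which is exactly what the combination of the CLT, the continuity theorem, and Slutsky's theorem predicts.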