Nonasymptotic bounds on the estimation error for regenerative MCMC Wojciech Niemiro

Nonasymptotic bounds on the estimation error for regenerative MCMC1 Wojciech Niemiro Nicolaus Copernicus University, Toruń and University of Warsaw Poland EPSRC Symposium Workshop on Markov Chain-Monte Carlo Warwick, March 2009 1 Work partially supported by Polish Ministry of Science and Higher Education Grant No. N N201387234. Outline 1 Introduction Computing integrals via MCMC Accuracy bounds 2 Regeneration Small set and regeneration A sequential-regenerative estimator 3 Accuracy bounds Mean Square Error Confidence estimation and Median of Averages How tight are the bounds? Outline 1 Introduction Computing integrals via MCMC Accuracy bounds 2 Regeneration Small set and regeneration A sequential-regenerative estimator 3 Accuracy bounds Mean Square Error Confidence estimation and Median of Averages How tight are the bounds? Outline 1 Introduction Computing integrals via MCMC Accuracy bounds 2 Regeneration Small set and regeneration A sequential-regenerative estimator 3 Accuracy bounds Mean Square Error Confidence estimation and Median of Averages How tight are the bounds? 1 Introduction Computing integrals via MCMC Accuracy bounds 2 Regeneration Small set and regeneration A sequential-regenerative estimator 3 Accuracy bounds Mean Square Error Confidence estimation and Median of Averages How tight are the bounds? Computing integrals via MCMC We are to compute Z f (x)π(x)dx =: π(f ), θ= X where X – state space, π – probability distribution on X , Markov chain X0 , X1 , . . . , Xt , . . . P(Xt ∈ ·) → π(·), (t → ∞). MCMC estimator θ̂T = T −1 1 X f (Xi ) → θ T i=0 (T → ∞). Computing integrals via MCMC We are to compute Z f (x)π(x)dx =: π(f ), θ= X where X – state space, π – probability distribution on X , Markov chain X0 , X1 , . . . , Xt , . . . P(Xt ∈ ·) → π(·), (t → ∞). MCMC estimator θ̂T = T −1 1 X f (Xi ) → θ T i=0 (T → ∞). Computing integrals via MCMC We are to compute Z f (x)π(x)dx =: π(f ), θ= X where X – state space, π – probability distribution on X , Markov chain X0 , X1 , . . . , Xt , . . . P(Xt ∈ ·) → π(·), (t → ∞). MCMC estimator θ̂T = T −1 1 X f (Xi ) → θ T i=0 (T → ∞). Accuracy bounds Mean square error: MSE = E (θ̂T − θ)2 ≤ ? Confidence bounds: P(|θ̂T − θ| > ε) ≤ ? Accuracy bounds Mean square error: MSE = E (θ̂T − θ)2 ≤ ? Confidence bounds: P(|θ̂T − θ| > ε) ≤ ? 1 Introduction Computing integrals via MCMC Accuracy bounds 2 Regeneration Small set and regeneration A sequential-regenerative estimator 3 Accuracy bounds Mean Square Error Confidence estimation and Median of Averages How tight are the bounds? Small set and regeneration P(x, A) = P(Xn ∈ A|Xn−1 = x). ASSUMPTION (Small set) There exist J ⊆ X , a probability measure ν and β > 0 such that P(x, ·) ≥ βI(x ∈ J)ν(·). Q(x, ·) := P(x, ·) − βI(x ∈ J)ν(·) is the ,,residual” (sub-stochastic) kernel. If Xn−1 6∈ J then draw Xn ∼ P(Xn−1 , ·), no regeneration; If Xn−1 ∈ J then with probability 1 − β draw Xn ∼ Q(Xn−1 , ·)/(1 − β), no regeneration; with probability β draw Xn ∼ ν(·), Regeneration. Small set and regeneration P(x, A) = P(Xn ∈ A|Xn−1 = x). ASSUMPTION (Small set) There exist J ⊆ X , a probability measure ν and β > 0 such that P(x, ·) ≥ βI(x ∈ J)ν(·). Q(x, ·) := P(x, ·) − βI(x ∈ J)ν(·) is the ,,residual” (sub-stochastic) kernel. If Xn−1 6∈ J then draw Xn ∼ P(Xn−1 , ·), no regeneration; If Xn−1 ∈ J then with probability 1 − β draw Xn ∼ Q(Xn−1 , ·)/(1 − β), no regeneration; with probability β draw Xn ∼ ν(·), Regeneration. Small set and regeneration P(x, A) = P(Xn ∈ A|Xn−1 = x). ASSUMPTION (Small set) There exist J ⊆ X , a probability measure ν and β > 0 such that P(x, ·) ≥ βI(x ∈ J)ν(·). Q(x, ·) := P(x, ·) − βI(x ∈ J)ν(·) is the ,,residual” (sub-stochastic) kernel. If Xn−1 6∈ J then draw Xn ∼ P(Xn−1 , ·), no regeneration; If Xn−1 ∈ J then with probability 1 − β draw Xn ∼ Q(Xn−1 , ·)/(1 − β), no regeneration; with probability β draw Xn ∼ ν(·), Regeneration. Regeneration Remark: Actual sampling from Q(Xn−1 , ·)/(1 − β) is not necessary (Mykland et al. 1995). Start with X0 ∼ ν(·) so that regeneration occurs at T0 = 0. Times of regeneration partition a trajectory into iid blocks: X0 , . . . , XT1 −1 , | {z } XT1 , . . . , XT2 −1 , | {z } XT , . . . , XT −1 , | 2 {z 3 } ↑ R ↑ R ↑ R T1 T2 −T1 R = Regeneration T3 −T2 ... Regeneration Remark: Actual sampling from Q(Xn−1 , ·)/(1 − β) is not necessary (Mykland et al. 1995). Start with X0 ∼ ν(·) so that regeneration occurs at T0 = 0. Times of regeneration partition a trajectory into iid blocks: X0 , . . . , XT1 −1 , | {z } XT1 , . . . , XT2 −1 , | {z } XT , . . . , XT −1 , | 2 {z 3 } ↑ R ↑ R ↑ R T1 T2 −T1 R = Regeneration T3 −T2 ... A sequential-regenerative estimator 0 = T0 , T1 , . . . , Tr , . . . – moments of regeneration. Fix n. R(n) := min{r : Tr ≥ n}. Estimator: θ̂TR(n) = 0, . . . , T1 − 1, ↑ R T1 , . . . . . . , ↑ R 1 TR(n) TR(n) −1 X f (Xi ). i=0 TR(n)−1 , . . . , n, . . . , TR (n) − 1, ↑ ↑ R n The estimator uses the part of trajectory written in blue TR(n) , . . . ↑ R 1 Introduction Computing integrals via MCMC Accuracy bounds 2 Regeneration Small set and regeneration A sequential-regenerative estimator 3 Accuracy bounds Mean Square Error Confidence estimation and Median of Averages How tight are the bounds? Mean Square Error The integral of interest and its MCMC estimator: Z f (x)π(x)dx, θ = π(f ) = X θ̂TR(n) = 1 TR(n) −1 TR(n) X f (Xi ) i=0 THEOREM Under Assumption (Small set), (i) MSE = E (θ̂TR(n ) − θ)2 ≤ 2 (f ) σas µ2 1+ , n n (ii) E TR(n) ≤ n + µ2 , where  2 TX 1 −1 1 2 σas (f ) = E (f (Xi ) − θ) , ET1 i=0 µ2 = ET12 − 1. ET1 Mean Square Error The integral of interest and its MCMC estimator: Z f (x)π(x)dx, θ = π(f ) = X θ̂TR(n) = 1 TR(n) −1 TR(n) X f (Xi ) i=0 THEOREM Under Assumption (Small set), (i) MSE = E (θ̂TR(n ) − θ)2 ≤ 2 (f ) σas µ2 1+ , n n (ii) E TR(n) ≤ n + µ2 , where  2 TX 1 −1 1 2 σas (f ) = E (f (Xi ) − θ) , ET1 i=0 µ2 = ET12 − 1. ET1 Proof (i) Two identities of Abraham Wald (sequential analysis): If R is a stopping time then E PR Var i=1 τk PR = EREτ1 , where τk = Tk − Tk −1 , i=1 dk = ERVard1 , where dk = PTk −1 i=Tk−1 (f (Xi ) (ii) Lorden’s theorem (1970, renewal theory). − θ). A geometric drift condition ASSUMPTION (Drift) There exist a function u : X → [1, ∞), constants λ < 1 and c < ∞ such that ( λu(x) for x 6∈ J, Qu(x) ≤ cλ for x ∈ J, where Q(x, R ·) := P(x, ·) − βI(x ∈ J)ν(·) and Qu(x) := X Q(x, dy)u(y). Explicit bounds under a drift condition 2 (f ) and µ which depend explicitly on the Bounds on σas 2 parameters λ, β, c and π(u): THEOREM If Assumptions (Small set), (Drift) hold and (f − θ)2 ≤ u then 4A1 + 9 p A2 (1 + A1 ) + π(u) , 1−λ (1 − λ)3/2 1 + A1 , µ2 ≤ 2 (π(u) + 1) 1−λ log c where A1 = , log(1 − β)−1 r c and A2 = 4 . 1−β 2 σas (f ) ≤ π(u) Confidence estimation Goal: P(|θ̂ − θ| ≤ ε) ≥ 1 − α. (given precision ε at a given level of confidence 1 − α). We are to choose an estimator and a sufficient number of samples. Possible approaches: Chebyshev inequality: P(|θ̂ − θ| > ε) ≤ MSE . ε2 Problem: appears to be too loose. Exponential inequalities? Problems with this approach: f is usually unbounded, involve quantities difficult to compute explicitly. Confidence estimation Goal: P(|θ̂ − θ| ≤ ε) ≥ 1 − α. (given precision ε at a given level of confidence 1 − α). We are to choose an estimator and a sufficient number of samples. Possible approaches: Chebyshev inequality: P(|θ̂ − θ| > ε) ≤ MSE . ε2 Problem: appears to be too loose. Exponential inequalities? Problems with this approach: f is usually unbounded, involve quantities difficult to compute explicitly. Confidence estimation Goal: P(|θ̂ − θ| ≤ ε) ≥ 1 − α. (given precision ε at a given level of confidence 1 − α). We are to choose an estimator and a sufficient number of samples. Possible approaches: Chebyshev inequality: P(|θ̂ − θ| > ε) ≤ MSE . ε2 Problem: appears to be too loose. Exponential inequalities? Problems with this approach: f is usually unbounded, involve quantities difficult to compute explicitly. Median of Averages (MA) Generate m independent copies of the Markov chain and compute estimators (averages): (1) (1) (1) X0 , X1 , . . . , Xt , . . . 7−→ θ̂(1) , ··· (m) (m) (m) X0 , X1 , . . . , Xt Estimator MA: ,... 7−→ θ̂ = med θ̂(1) , . . . , θ̂(m) . θ̂(m) . Median of Averages (MA) θ̂ = med θ̂(1) , . . . , θ̂(m) . Idea (Jerrum, Valiant and Vazirani, 1986): Fix an initial moderate level of confidence 1 − a < 1 − α and use Chebyshev inequality to get P(|θ̂(j) − θ| > ε) ≤ a (j = 1, . . . , m). Then boost the level of confidence from 1 − a to 1 − α by computing a median: P(|θ̂ − θ| > ε) ≤ 1 exp{−m/b} = α. 2 Optimizing the constants: 1 − a = 0.88031, b = 2.3147 (Niemiro and Pokarowski, 2009, J. Appl. Probab.) How tight are the bounds? Asymptotic level of confidence, based on CLT: lim P(|θ̂ − θ| > ε) = α, ε→0 for the number of samples n∼ i2 2 (f ) h σas −1 Φ (1 − α/2) , ε2 (θ̂ is a simple average over n samples) Nonasymptotic level of confidence: P(|θ̂ − θ| > ε) ≤ α, for the expected number of samples mETR(n) ∼ mn ∼ C · 2 (f ) σas log(2α)−1 , ε2 (θ̂ is MA with sequential/regenerative averages) Symbol ∼ refers to α, ε → 0. How tight are the bounds? Asymptotic level of confidence, based on CLT: lim P(|θ̂ − θ| > ε) = α, ε→0 for the number of samples n∼ i2 2 (f ) h σas −1 Φ (1 − α/2) , ε2 (θ̂ is a simple average over n samples) Nonasymptotic level of confidence: P(|θ̂ − θ| > ε) ≤ α, for the expected number of samples mETR(n) ∼ mn ∼ C · 2 (f ) σas log(2α)−1 , ε2 (θ̂ is MA with sequential/regenerative averages) Symbol ∼ refers to α, ε → 0. Complexity comparison Asymptotic: n∼ i2 2 (f ) h σas −1 Φ , (1 − α/2) ε2 Nonasymptotic: mETR(n) ∼ mn ∼ C · 2 (f ) σas log(2α)−1 , ε2 (θ̂ is MA using sequential/regenerative averages) h i2 Φ−1 (1 − α/2) ∼ 2 log(2α)−1 , C ≈ 19.34. (α → 0), Complexity comparison Asymptotic: n∼ i2 2 (f ) h σas −1 Φ , (1 − α/2) ε2 Nonasymptotic: mETR(n) ∼ mn ∼ C · 2 (f ) σas log(2α)−1 , ε2 (θ̂ is MA using sequential/regenerative averages) h i2 Φ−1 (1 − α/2) ∼ 2 log(2α)−1 , C ≈ 19.34. (α → 0), Problem How to obtain practically applicable and explicitly computable 2 (f ) ? bounds on σas

Nonasymptotic bounds on the estimation error for regenerative MCMC Wojciech Niemiro

Related documents

Products

Support

Nonasymptotic bounds on the estimation error for regenerative MCMC Wojciech Niemiro

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib