Introduction Island bootstrap approximation The double bootstrap algorithm Extensions On the convergence of Island particle models C. Dubarry, P. Del Moral, E. Moulines Institut Mines-Télécom, Télécom ParisTech/ Télécom SudParis, INRIA Bordeaux June 14, 2012 C. Dubarry, P. Del Moral, E. Moulines 1/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Outline 1 Introduction 2 Island bootstrap approximation 3 4 The double bootstrap algorithm Algorithm description Bias and variance of the double bootstrap Numerical application Extensions C. Dubarry, P. Del Moral, E. Moulines 1/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Outline 1 Introduction 2 Island bootstrap approximation 3 4 The double bootstrap algorithm Algorithm description Bias and variance of the double bootstrap Numerical application Extensions C. Dubarry, P. Del Moral, E. Moulines 2/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Notations (Xn , Xn )n≥0 is a sequence of measurable sets. Bb (Xn , Xn ) is the Banach space of all bounded and measurable functions on (Xn , Xn ). (Xn )n≥0 is a non-homogenous Markov chain with initial distribution η0 , and Markov kernels (Mn )n≥1 . Feynman-Kac ow def ηn (fn ) = γn(fn )/γn (1) , Y def γn (fn ) = E fn (Xn ) gp (Xp ) . 0≤p<n C. Dubarry, P. Del Moral, E. Moulines 3/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Feynman-Kac ow Dene by P(Xn , Xn ) the set of probabilities on (Xn , Xn ). The sequence of probabilities (ηn )n≥0 satises the following recursion: ηn+1 = Ψn (ηn )Mn+1 , where Ψn : P(Xn , Xn ) → P(Xn , Xn ) is dened by: def Ψn (ηn )(An ) = 1 ηn (gn ) Z C. Dubarry, P. Del Moral, E. Moulines gn (xn ) ηn (dxn ) , An ∈ Xn . An 4/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Outline 1 Introduction 2 Island bootstrap approximation 3 4 The double bootstrap algorithm Algorithm description Bias and variance of the double bootstrap Numerical application Extensions C. Dubarry, P. Del Moral, E. Moulines 5/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Particle approximation ⊗N1 1 Xp , X p ) = (XN Let N1 be an integer. For any integer p we set (X ). p , Xp Xn , X n ) to (X Xn+1 , X n+1 ) as the Dene the Markov kernel M n+1 from (X product measure def def M n+1 (xn , An+1 ) = Y Ψn (m(xn , ·))Mn+1 (Ain+1 ) , 1≤i≤N1 where m(xn , ·) stands for the empirical measure of xn given for any An ∈ Xn by def m(xn , An ) = N1 1 X δ i (An ) . N1 i=1 xn The particles are multinomially resampled with probabilities proportional 1 to their potential {gn (xin )}N i=1 ; new particle positions are then sampled from the Markov kernel Mn+1 . C. Dubarry, P. Del Moral, E. Moulines 6/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Particle approximation Dene a Markov chain {X n }n≥0 where for each n ∈ N, X n = (ξn1 , . . . , ξnN1 ) ∈ Xn with initial distribution η 0 = η0⊗N1 and transition kernel M n+1 . N1 -particle approximations def def ηnN1 (fn ) = m(X n , fn ) Y N def γnN1 (fn ) = ηnN1 (fn ) ηp 1 (gp ) . 0≤p<n C. Dubarry, P. Del Moral, E. Moulines 7/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Unbiasedness of the particle approximation Theorem (Del Moral, 199x) For any fn ∈ Bb (Xn , Xn ), γnN1 (fn ) is an unbiased estimator of γn (fn ): h i Y E γnN1 (fn ) = E ηnN1 (fn ) ηpN1 (gp ) 0≤p<n = E fn (Xn ) Y gp (Xp ) . 0≤p<n C. Dubarry, P. Del Moral, E. Moulines 8/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions The island Feynman-Kac model N1 1 For xn = (x1n , · · · , xN n ) ∈ Xn dene the sample averaged potential def g n (xn ) = m(xn , gn ) = N1 1 X gn (xin ) . N1 i=1 Feynman-Kac model γ n (1) η n (f n ) = γ n (f n )/γ Y γ n (f n ) = E f n (Xn ) g p (Xp ) , 0≤p<n C. Dubarry, P. Del Moral, E. Moulines 9/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions The island Feynman-Kac model Since g n (X p ) = ηnN1 (gp )P , the unbiasedness property implies that for any f n of i 1 the form f n (xn ) = N1−1 N i=1 fn (xn ) E fn (Xn ) Y gp (Xp ) = E f n (Xn ) 0≤p<n Y g p (Xp ) , 0≤p<n or equivalently γ n (f n ) = γn (fn ) and η n (f n ) = ηn (fn ) . C. Dubarry, P. Del Moral, E. Moulines 10/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions The island Feynman-Kac model From now on, a population of particles X n is called an island. Idea: we may apply the interacting particle system approximation of the Feynman-Kac semigroups both within each island but also across island. To be more specic, we will now describe the so-called double bootstrap algorithm where the bootstrap algorithm is applied both within an island but also across the islands. Of course, many other options are available (more to come !) C. Dubarry, P. Del Moral, E. Moulines 11/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Outline 1 Introduction 2 Island bootstrap approximation 3 4 The double bootstrap algorithm Algorithm description Bias and variance of the double bootstrap Numerical application Extensions C. Dubarry, P. Del Moral, E. Moulines 12/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Algorithm description Feynman-Kac at the island level Xn , X n ) the set of probabilities measures on (X X n , X n ). Dene by P(X The sequence of measures (η n )n≥0 satises the following recursion η n+1 = Ψn (η n )M n+1 , Xn , X n ) → P(X Xn , X n ) is dened by where Ψn : P(X def Ψn (η n )(An ) = 1 η n (g n ) C. Dubarry, P. Del Moral, E. Moulines Z g n (x) η n (dx) , An ∈ X n . An 13/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Algorithm description The double bootstrap algorithm selection i mutation ξin −−−−− −−→ b ξn −−−−− −−→ ξin+1 Let N2 be the number of interacting islands. i During the selection stage, we select randomly N2 islands b ξn 1≤i≤N2 2 among the current islands ξin 1≤i≤N2 ∈ X N n with probability proportional to the empirical mean of the individuals in each island g n (ξin ) = N1−1 N1 X gn (ξni,j ) , 1 ≤ i ≤ N2 . j=1 i 2 During the mutation transition, selected islands (b ξ n )N i=1 evolve randomly i to a new conguration ξn+1 according to the Markov transition M n+1 . C. Dubarry, P. Del Moral, E. Moulines 14/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Algorithm description The double bootstrap ⊗N2 ⊗N2 2 2 2 XN XN Dene the Markov kernel LN ) to (X n ,Xn n+1 from (X n+1 , X n+1 ) for N2 1 N2 N2 1 N2 any (xn , . . . , xn ) ∈ X n and (An+1 , . . . , An+1 ) ∈ X n by 1 N2 1 N2 2 LN n+1 (xn , . . . , xn , An+1 × · · · × An+1 ) Y def i 2 = Ψn (m(x1n , . . . , xN n , ·))M n+1 (An+1 ) , 1≤i≤N2 where stands for the empirical measure of the islands given for any An ∈ X n by 2 m(x1n , . . . , xN n , ·) 1 2 (xn , . . . , xN n ) def 2 m(x1n , . . . , xN n , An ) = C. Dubarry, P. Del Moral, E. Moulines N2 1 X δ i (An ) . N2 i=1 xn 15/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Algorithm description The double bootstrap algorithm p from 0 to n − 1 do N2 multinomially with proba. prop. to Sample I p = (Ipi )i=1 N2 P N1 i,j 1 . j=1 gp (ξp ) N1 1: 2: for 3: 4: i from 1 to N2 do 1 Sample J ip = (Jpi,j )N j=1 multinomially with proba. prop. to N1 i I ,j gp (ξpp ) . i=1 for j=1 i,j according to For 1 ≤ j ≤ N1 , sample independently ξp+1 5: j I i ,Jp Mp+1 (ξpp 6: 7: , ·). end for end for C. Dubarry, P. Del Moral, E. Moulines 16/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Bias and variance of the double bootstrap Bootstrap approximation: bias and variance Theorem For any time horizon n≥0 and any bounded function fn ∈ Bb (Xn , Xn ), we have h i lim N1 E ηnN1 (fn ) − ηn (fn ) = Bn (fn ) , N1 →∞ lim N1 Var ηnN1 (fn ) = Vn (fn ) , N1 →∞ where Bn (fn ) and Vn (fn ) can be computed explicitly. C. Dubarry, P. Del Moral, E. Moulines 17/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Bias and variance of the double bootstrap Double bootstrap approximation: bias and variance Theorem n ≥ 0 and any fn ∈ Bb (Xn , Xn ), we have h i 2 e lim lim N1 N2 E η N n (m(·, fn )) − η n (m(·, fn )) = Bn (fn ) + Bn (fn ) , N1 →∞ N2 →∞ 2 e lim lim N1 N2 Var η N n (m(·, fn )) = Vn (fn ) + Vn (fn ) , For any time horizon N1 →∞ N2 →∞ where en (fn ), Vn (fn ), Ven (fn ) Bn (fn ), B can be computed explicitly. The rate of the interacting island (N2 islands each with N1 individuals) is the same as the one of the single island model with N1 N2 particles. Even though the constant terms may be worst in the interacting island model, it allows to use parallel implementations. C. Dubarry, P. Del Moral, E. Moulines 18/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Bias and variance of the double bootstrap Independent islands Theorem n ≥ 0 and any fn ∈ Bb (Xn , Xn ), we have i o n h 2 eN lim N1 E η n (m(·, fn )) − ηn (fn ) = Bn (fn ) , N1 →∞ 2 eN lim N1 N2 Var η n (m(·, fn )) = Vn (fn ) , For any time horizon N1 →∞ where Bn (fn ) and Vn (fn ) are the same than for the single island model. Although the variance of the particle approximation is inversely proportional to N1 N2 , the bias is independent of N2 and is inversely proportional to N1 . C. Dubarry, P. Del Moral, E. Moulines 19/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Bias and variance of the double bootstrap Example 1 Linear Gaussian Model Xp+1 = φXp + σu Up , Yp = Xp + σv Vp , Computing the predictive distribution of the state Xn given the observations Y0:n−1 = y0:n−1 up to time n − 1 can be cast into the framework of Feynman-Kac model by setting for all p ≥ 0 1 Mp+1 (xp , dxp+1 ) = √ exp −(xp+1 − φxp )2 /(2σu2 ) dxp+1 , 2πσu 1 gp (xp ) = √ exp −(yp − xp )2 /(2σv2 ) . 2πσv C. Dubarry, P. Del Moral, E. Moulines 20/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Bias and variance of the double bootstrap How to choose between interacting and independent islands? Independent islands Interacting islands Squared bias Variance Sum 2 Bn (fn )2 N12 en (fn ) Bn (fn ) + B Vn (fn ) N1 N2 Vn (fn ) + Ven (fn ) N1 N2 Bn (fn )2 Vn (fn ) + N1 N2 N12 Vn (fn ) + Ven (fn ) N1 N2 Vn (fn ) Bn (fn )2 Vn (fn ) + Ven (fn ) + < N1 N2 N12 N1 N2 C. Dubarry, P. Del Moral, E. Moulines N12 N22 ⇔ N1 > Bn (fn )2 N2 . Ven (fn ) 21/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Numerical application Numerical application: Linear Gaussian Model The model is dened by Xp+1 = φXp + σu Up , Yp = Xp + σv Vp . n + 1 = 11 observations were generated with φ = 0.9, σu = 0.6 and σv = 1 . We have E [Xn |Y0:n−1 = y0:n−1 ] = ηn (Id). We compare interacting to independent islands through M h E 100 × 2 2 i 2 2 eN ηN −E η n (Id) − ηn (Id) n (Id) − ηn (Id) . 2 2 eN E η n (Id) − ηn (Id) C. Dubarry, P. Del Moral, E. Moulines 22/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Numerical application Results for the LGSS model Figure: Interacting versus independent island renormalized estimators. C. Dubarry, P. Del Moral, E. Moulines 23/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Outline 1 Introduction 2 Island bootstrap approximation 3 4 The double bootstrap algorithm Algorithm description Bias and variance of the double bootstrap Numerical application Extensions C. Dubarry, P. Del Moral, E. Moulines 24/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Eective Sample Size Interaction Dene Θn,α 2 PN1 i i w g (x ) n i=1 n n 1 1 N1 N1 = xn = (xn , wn , . . . , xn , wn ) ∈ X n PN1 ≥ αN1 . 2 i=1 (wni gn (xin )) Dene m(xn , ·) stands for the empirical measure of xn given for any An ∈ Xn by 1 def m(xn , An ) = PN1 i=1 C. Dubarry, P. Del Moral, E. Moulines N1 X wni wni δxin (An ) , i=1 25/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Eective Sample Size Interaction Consider the Markov kernel M n+1 M n+1 (xn , An+1 ) = (Q N1 i i i i g (xi ) (Bn+1 )Mn+1 (xn , An+1 ) i=1 δwn n n QN i i 1 i=1 δ1 (Bn+1 )Ψn (m(xn , ·))Mn+1 (An+1 ) xn ∈ Θn,α xn 6∈ Θn,α Dene a Markov chain {X n }n≥0 where for each n ∈ N, i h X n = (ξn1 , ωn1 ), . . . , (ξnN1 , ωnN1 ) ∈ X n , C. Dubarry, P. Del Moral, E. Moulines 26/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions ESS: particle approximation N1 -particle approximations of the measures ηn and γn 1 def ηnN1 (fn ) = m(X n , fn ) = PN1 N1 X i i=1 ωn i=1 def Y γnN1 (fn ) = ηnN1 (fn ) ωni fn ξni , ηpN1 (gp ) . 0≤p<n Theorem fn ∈ Bb (Xn , Xn ), γnN1 (fn ) is an unbiased estimator of γn (fn ): h i Y N Y N1 N E γn (fn ) = E ηn 1 (fn ) ηp 1 (gp ) = E fn (Xn ) gp (Xp ) . For any 0≤p<n C. Dubarry, P. Del Moral, E. Moulines 0≤p<n 27/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions ESS: Feynman-Kac approximation N1 1 For xn = (x1n , wn1 , · · · , xN n , wn ) ∈ X n we set 1 def g n (xn ) = m(xn , gn ) = PN1 i=1 N1 X wni wni gn xin . i=1 The associated Feynman-Kac model {(η n , γ n )}n≥0 is γ n (1) η n (f n ) = γ n (f n )/γ Y γ n (f n ) = E f n (Xn ) g p (Xp ) , 0≤p<n C. Dubarry, P. Del Moral, E. Moulines 28/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions ESS: Feynman-Kac approximation Since g n (X n ) = ηnN1 (gn ), for any f n of the form f n (xn ) = P N1 i=1 wni −1 P E fn (Xn ) N1 i=1 wni fn xin where fn ∈ Bb (Xn , Xn ), Y gp (Xp ) = E f n (Xn ) 0≤p<n Y g p (Xp ) , 0≤p<n Therefore γ n (f n ) = γn (fn ) η n (f n ) = ηn (fn ) . C. Dubarry, P. Del Moral, E. Moulines 29/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions 1: for p from 0 to n − 1 do 2: Selection step and weight actualization between islands: 2 2 PN2 i eff = PN2 Ωi g (ξi , ω i ) i 3: Set N / Ωp g p (ξi p p , ωp ) . 2 i=1 p p p i=1 eff 4: if N2 ≥ αIslands N2 then i i i i 5: For 1 ≤ i ≤ N2 , set Ω p+1 = Ωp g p (ξp , ω p ). i N2 6: Set I p = (I ) p i=1 = (1, 2, . . . , N2 ). 7: else N 2 8: Set Ωp+1 = Ωi p+1 i=1 = (1, . . . , 1). N N2 i i i i 2 9: Sample I p = (I ) p i=1 multinomially with proba. prop. to Ωp g p (ξp , ω p ) i=1 . 10: end if 11: Island mutation step: 12: for i from 1 to N2 do 13: Particle selection and weight actualization within each island: 14: same business as usual 15: end for 16: end for 17: Approximate ηn (fn ) by N1 N2 X X 1 1 i i,j i,j . Ωn P ωn fn ξn PN2 N1 i,j i Ω ω i=1 n i=1 j=1 n j=1 C. Dubarry, P. Del Moral, E. Moulines 30/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Results for the ESS model N2 = 100 N2 = 250 N2 = 500 N2 = 1000 N1 = 100 N1 = 250 N1 = 500 N1 = 1000 (1) ESS (2) Bootstrap (2) Independent C. Dubarry, P. Del Moral, E. Moulines Real value 31/32 Introduction Island bootstrap approximation The double bootstrap algorithm Extensions Number of interactions Table: Number of interactions between islands for the ESS within ESS estimator as a percentage of the one the ESS within bootstrap estimator in the LGM. PP P P N2 PP N1 P 100 250 500 1000 100 250 500 1000 4.32 0.88 0.04 0 4.76 0.60 0.02 0 4.92 0.34 0 0 4.98 0.32 0 0 C. Dubarry, P. Del Moral, E. Moulines 32/32