Infinite-Dimensional Diffusion Processes
Approximated by Finite Markov Chains on
Partitions

Leonid Petrov
IPPI — Institute for Information Transmission Problems (Moscow, Russia)
May 20, 2011
outline
1. de Finetti's Theorem
2. up/down Markov chains and limiting diffusions
3. Kingman's exchangeable random partitions
4. up/down Markov chains on partitions and limiting infinite-dimensional diffusions
de Finetti's theorem
Exchangeable binary sequences
ξ1, ξ2, ξ3, ... — random variables ∈ {0, 1};
the distribution of the sequence ξ = (ξ1, ξ2, ξ3, ...) is invariant under permutations.
de Finetti's Theorem
Exchangeable binary sequences ξ
↕
probability measures P on [0, 1]

de Finetti's theorem
From ξ to P
#{ξi = 1 : 1 ≤ i ≤ n} / n  →  P in law as n → ∞;
distributions on {0, 1/n, 2/n, ..., (n − 1)/n, 1} approximate the measure P on [0, 1].
From P to ξ
1. Sample a real number p ∈ [0, 1] according to P.
2. Sample a Bernoulli sequence ξ = (ξ1, ξ2, ...) with probability p of success.
Then
Pn(k) := Prob(#{ξi = 1 : 1 ≤ i ≤ n} = k) = C(n, k) ∫₀¹ p^k (1 − p)^{n−k} P(dp).
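The two-step sampling procedure can be checked against the mixture formula for Pn(k) numerically. A minimal sketch, taking P = Beta(2, 3) purely as an illustrative choice (any P on [0, 1] would do): the empirical law of the success count under two-step sampling is compared with C(n, k) ∫ p^k (1 − p)^{n−k} P(dp).

```python
# Check the two-step sampling against the mixture formula
#   P_n(k) = C(n, k) * Integral_0^1 p^k (1-p)^(n-k) P(dp),
# with P = Beta(2, 3) as an illustrative choice.
import random
from math import comb

def sample_count(n, rng):
    """Two-step sampling: draw p ~ P, then n Bernoulli(p) trials; return # successes."""
    p = rng.betavariate(2, 3)                        # step 1: p ~ P = Beta(2, 3)
    return sum(rng.random() < p for _ in range(n))   # step 2: Bernoulli sequence

def p_n(n, k, grid=20000):
    """Midpoint-rule integral C(n,k) * ∫ p^k (1-p)^(n-k) P(dp); Beta(2,3) density is 12 p (1-p)^2."""
    total = 0.0
    for i in range(grid):
        p = (i + 0.5) / grid
        total += p**k * (1 - p)**(n - k) * 12 * p * (1 - p)**2
    return comb(n, k) * total / grid

rng = random.Random(0)
n = 5
counts = [sample_count(n, rng) for _ in range(200000)]
empirical = [counts.count(k) / len(counts) for k in range(n + 1)]
for k in range(n + 1):
    assert abs(empirical[k] - p_n(n, k)) < 0.01      # Monte Carlo agrees with the formula
```

The same check works for any mixing measure P; only the density inside `p_n` changes.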
de Finetti's theorem
Bernoulli sequences are the extremal exchangeable binary sequences (P on [0, 1] is δp).
Any exchangeable binary sequence is a mixture of Bernoulli sequences.
finite level distributions
Pn on {0, 1, 2, ..., n − 1, n}
Pn(k) = Prob(#{ξi = 1 : 1 ≤ i ≤ n} = k).
(floors of the Pascal triangle)
The distributions {Pn}∞n=1 encode the exchangeable binary sequence ξ. The scaling limit of the Pn's is our measure P on [0, 1].
The Pn's are compatible with each other:
Pn(k) = ((n + 1 − k)/(n + 1)) Pn+1(k) + ((k + 1)/(n + 1)) Pn+1(k + 1)   ∀k
⇕
Can define a down kernel p↓n+1,n from {0, ..., n + 1} to {0, ..., n} such that Pn+1 ∘ p↓n+1,n = Pn for all n.
down transition kernel
p↓n+1,n(k, k) = (n + 1 − k)/(n + 1),   p↓n+1,n(k, k − 1) = k/(n + 1).
1. Markov transition kernel: Σm p↓n+1,n(k, m) = 1 for all k.
2. Can define through the Pascal triangle:
p↓n+1,n(k, m) = (# of paths to (n, m)) / (# of paths to (n + 1, k)).

summarizing
A binary sequence ξ is exchangeable iff the {Pn}'s are compatible with the down kernel: Pn+1 ∘ p↓n+1,n = Pn.
de Finetti's Theorem = classification of compatible (coherent) sequences of measures {Pn} on levels of the Pascal triangle.
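Both descriptions of the down kernel can be compared directly. A sketch in exact rational arithmetic: the closed formulas define a stochastic kernel and agree with the Pascal-triangle path-count ratio, the number of paths to the vertex (n, m) being the binomial coefficient C(n, m).

```python
# The down kernel on {0,...,n}, written two ways: closed formulas vs.
# ratio of path counts in the Pascal triangle.
from math import comb
from fractions import Fraction

def down_closed(n, k, m):
    """p↓_{n+1,n}(k, k) = (n+1-k)/(n+1), p↓_{n+1,n}(k, k-1) = k/(n+1)."""
    if m == k:
        return Fraction(n + 1 - k, n + 1)
    if m == k - 1:
        return Fraction(k, n + 1)
    return Fraction(0)

def down_paths(n, k, m):
    """p↓_{n+1,n}(k, m) = (# paths to (n, m)) / (# paths to (n+1, k))."""
    if m not in (k, k - 1) or not (0 <= m <= n):
        return Fraction(0)
    return Fraction(comb(n, m), comb(n + 1, k))

for n in range(1, 8):
    for k in range(n + 2):                   # k indexes the level n+1
        assert sum(down_closed(n, k, m) for m in range(n + 1)) == 1   # stochastic
        for m in range(n + 1):
            assert down_closed(n, k, m) == down_paths(n, k, m)        # same kernel
```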
up transition kernel
Fix a measure P on [0, 1] ⇔ a system {Pn}.
Define
p↑n,n+1(m, k) := p↓n+1,n(k, m) · Pn+1(k)/Pn(m).
Thus
Pn ∘ p↑n,n+1 = Pn+1   ∀n.
(random growth on the Pascal triangle; depends on P).
This is a restatement of the two-step sampling procedure for the exchangeable binary sequence ξ = (ξ1, ξ2, ...) corresponding to P.
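The up kernel can be sketched with exact arithmetic. Here P is taken, arbitrarily for illustration, to be the atomic mixture (1/2)δ_{1/3} + (1/2)δ_{3/4}, so that Pn(k) is computable in closed form from the integral formula; the identity Pn ∘ p↑n,n+1 = Pn+1 is then verified exactly.

```python
# The up kernel p↑_{n,n+1}(m, k) := p↓_{n+1,n}(k, m) * P_{n+1}(k) / P_n(m),
# for the illustrative atomic P = (1/2) δ_{1/3} + (1/2) δ_{3/4}.
from math import comb
from fractions import Fraction

ATOMS = [(Fraction(1, 2), Fraction(1, 3)), (Fraction(1, 2), Fraction(3, 4))]

def P_n(n, k):
    """P_n(k) = C(n,k) * ∫ p^k (1-p)^(n-k) P(dp) for the atomic P above."""
    return comb(n, k) * sum(w * p**k * (1 - p)**(n - k) for w, p in ATOMS)

def down(n, k, m):
    """Canonical down kernel of the Pascal triangle, from level n+1 to level n."""
    if m == k:
        return Fraction(n + 1 - k, n + 1)
    if m == k - 1:
        return Fraction(k, n + 1)
    return Fraction(0)

def up(n, m, k):
    """p↑_{n,n+1}(m, k): the P-dependent transform of the down kernel."""
    return down(n, k, m) * P_n(n + 1, k) / P_n(n, m)

for n in range(1, 7):
    for m in range(n + 1):
        assert sum(up(n, m, k) for k in range(n + 2)) == 1            # stochastic
    for k in range(n + 2):
        assert sum(P_n(n, m) * up(n, m, k) for m in range(n + 1)) == P_n(n + 1, k)
```

Stochasticity of the up kernel is exactly the coherence Pn+1 ∘ p↓ = Pn in disguise.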
up/down Markov chains
Fix a measure P on [0, 1].
Thus get the up kernel p↑n,n+1.
Also there is a canonical down kernel p↓n+1,n.
Use these kernels to obtain the up/down Markov chain on {0, 1, ..., n}:
Tn = p↑n,n+1 ∘ p↓n+1,n,   i.e.,
Tn(k, k̃) = Σ_{m=0}^{n+1} p↑n,n+1(k, m) p↓n+1,n(m, k̃).
Pn is reversible w.r.t. Tn for all n.
Is there a (scaling) limit of the chains Tn as n → ∞?
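The chain Tn = p↑ ∘ p↓ and its reversibility can be checked end to end. A sketch reusing the same illustrative two-atom P as above (any mixture of Bernoulli parameters works): stochasticity and detailed balance Pn(k) Tn(k, k̃) = Pn(k̃) Tn(k̃, k) are verified with exact rationals.

```python
# The up/down chain T_n = p↑ ∘ p↓ on {0,...,n}, checked to be reversible
# with respect to P_n; P is the illustrative mixture (1/2)δ_{1/3} + (1/2)δ_{3/4}.
from math import comb
from fractions import Fraction

ATOMS = [(Fraction(1, 2), Fraction(1, 3)), (Fraction(1, 2), Fraction(3, 4))]

def P_n(n, k):
    return comb(n, k) * sum(w * p**k * (1 - p)**(n - k) for w, p in ATOMS)

def down(n, k, m):
    if m == k:
        return Fraction(n + 1 - k, n + 1)
    if m == k - 1:
        return Fraction(k, n + 1)
    return Fraction(0)

def up(n, m, k):
    return down(n, k, m) * P_n(n + 1, k) / P_n(n, m)

def T(n, k, k2):
    """One up/down step: go up to level n+1, then back down to level n."""
    return sum(up(n, k, m) * down(n, m, k2) for m in range(n + 2))

for n in range(1, 6):
    for k in range(n + 1):
        assert sum(T(n, k, k2) for k2 in range(n + 1)) == 1
        for k2 in range(n + 1):
            # detailed balance: P_n(k) T_n(k, k~) = P_n(k~) T_n(k~, k)
            assert P_n(n, k) * T(n, k, k2) == P_n(n, k2) * T(n, k2, k)
```

Detailed balance is immediate from the symmetric form Pn(k) Tn(k, k̃) = Σm p↓(m, k) Pn+1(m) p↓(m, k̃).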
distinguished measures P on [0, 1]
Is there a (scaling) limit of the chains Tn as n → ∞?
→ Nothing general can be said for arbitrary P on [0, 1].
Beta distributions
Beta distributions Pa,b (where a, b > 0):
Pa,b(dx) = (Γ(a + b)/(Γ(a)Γ(b))) x^{a−1} (1 − x)^{b−1} dx.
One gets diffusion limits for the chains Tn.
The Pa,b's are nice from an algebraic point of view.
Remark: the Pa,b's from a Bayesian point of view: a Beta prior on the Bernoulli parameter p ∈ [0, 1] ⇒ a Beta posterior.
Beta distributions and the uniform distribution
Take Pa,b on [0, 1]; then
Pn(k) = (Γ(n + 1)Γ(a + b)/(Γ(k + 1)Γ(n − k + 1)Γ(a)Γ(b))) ∫₀¹ p^{k+a−1} (1 − p)^{n−k+b−1} dp
      = C(n, k) (a)k (b)n−k / (a + b)n,
where (x)m := x(x + 1)(x + 2) ... (x + m − 1).
Uniform distribution
For the uniform distribution (a = b = 1) on [0, 1], Pn is also uniform on {0, 1, ..., n}.
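The closed form with rising factorials is easy to evaluate exactly; a sketch checks the uniform case a = b = 1 and the normalization for other parameters.

```python
# Finite-level distributions for P = Beta(a, b), via the closed form
#   P_n(k) = C(n,k) (a)_k (b)_{n-k} / (a+b)_n,  (x)_m = x(x+1)...(x+m-1).
from math import comb
from fractions import Fraction

def rising(x, m):
    """Rising factorial (x)_m = x (x+1) ... (x+m-1)."""
    out = Fraction(1)
    for j in range(m):
        out *= x + j
    return out

def P_n_beta(n, k, a, b):
    return comb(n, k) * rising(a, k) * rising(b, n - k) / rising(a + b, n)

# a = b = 1 (uniform P on [0,1]) gives the uniform distribution on {0,...,n}
for n in range(1, 10):
    for k in range(n + 1):
        assert P_n_beta(n, k, Fraction(1), Fraction(1)) == Fraction(1, n + 1)

# arbitrary parameters still give a probability distribution
assert sum(P_n_beta(7, k, Fraction(1, 2), Fraction(5, 2)) for k in range(8)) == 1
```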
up/down Markov chains for uniform distribution
p↑n,n+1(k, k + 1) = (k + 1)/(n + 2),   p↑n,n+1(k, k) = (n − k + 1)/(n + 2).
Tn(k → k + 1) = (k + 1)(n − k)/((n + 1)(n + 2))
Tn(k → k − 1) = k(n − k + 1)/((n + 1)(n + 2))
Tn(k → k) = ((k + 1)² + (n − k + 1)²)/((n + 1)(n + 2))
With k/n = x ∈ [0, 1] and ∆x = 1/n:
Tn(k → k + 1) = Prob(x → x + ∆x) ∼ x(1 − x) + (1/n)(1 − x)
Tn(k → k − 1) = Prob(x → x − ∆x) ∼ x(1 − x) + (1/n) x
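The quoted one-step probabilities can be verified against the composition p↑ ∘ p↓ directly; a sketch in exact arithmetic:

```python
# Uniform case: check the closed-form transition probabilities of
# T_n = p↑ ∘ p↓ against the kernel composition.
from fractions import Fraction

def up_unif(n, k, k2):
    """p↑(k, k+1) = (k+1)/(n+2), p↑(k, k) = (n-k+1)/(n+2) for uniform P."""
    if k2 == k + 1:
        return Fraction(k + 1, n + 2)
    if k2 == k:
        return Fraction(n - k + 1, n + 2)
    return Fraction(0)

def down(n, k, m):
    if m == k:
        return Fraction(n + 1 - k, n + 1)
    if m == k - 1:
        return Fraction(k, n + 1)
    return Fraction(0)

def T_closed(n, k, k2):
    """The closed formulas quoted on the slide."""
    d = (n + 1) * (n + 2)
    if k2 == k + 1:
        return Fraction((k + 1) * (n - k), d)
    if k2 == k - 1:
        return Fraction(k * (n - k + 1), d)
    if k2 == k:
        return Fraction((k + 1)**2 + (n - k + 1)**2, d)
    return Fraction(0)

for n in range(1, 10):
    for k in range(n + 1):
        for k2 in range(n + 1):
            composed = sum(up_unif(n, k, m) * down(n, m, k2) for m in range(n + 2))
            assert composed == T_closed(n, k, k2)
```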
up/down Markov chains for uniform distribution
Scaling limit
Scale space by 1/n, scale time by 1/n² (i.e., one step of the nth Markov chain = a small time interval of order n⁻²).
Then as n → +∞,
n²(Tn − 1) → x(1 − x) d²/dx² + (1 − 2x) d/dx
First, check the convergence on polynomials (algebraically!). The processes have a polynomial core.
Then apply Trotter–Kurtz type theorems to conclude the convergence of the finite Markov chains to diffusion processes.
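The generator convergence can be observed numerically on a single polynomial. A sketch applies n²(Tn − 1) to f(x) = x³ at the lattice point nearest a fixed x (here x = 1/3, an arbitrary test point) and compares with x(1 − x) f″ + (1 − 2x) f′; the error decays like 1/n.

```python
# Test n^2 (T_n - 1) f  ->  x(1-x) f'' + (1-2x) f'  on f(x) = x^3.
from fractions import Fraction

def T_unif(n, k, k2):
    """Closed-form uniform up/down chain (as on the previous slide)."""
    d = (n + 1) * (n + 2)
    if k2 == k + 1:
        return Fraction((k + 1) * (n - k), d)
    if k2 == k - 1:
        return Fraction(k * (n - k + 1), d)
    if k2 == k:
        return Fraction((k + 1)**2 + (n - k + 1)**2, d)
    return Fraction(0)

f = lambda x: x**3
Lf = lambda x: x * (1 - x) * 6 * x + (1 - 2 * x) * 3 * x**2   # x(1-x)f'' + (1-2x)f'

x = Fraction(1, 3)
errs = []
for n in (30, 60, 120, 240):          # multiples of 3, so k/n = x exactly
    k = int(n * x)
    Tf = sum(T_unif(n, k, k2) * f(Fraction(k2, n)) for k2 in (k - 1, k, k + 1))
    gen = n**2 * (Tf - f(Fraction(k, n)))
    errs.append(abs(float(gen - Lf(x))))

assert errs[-1] < errs[0]             # the discrete generator approaches the operator
assert errs[-1] < 0.5
```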
outline
1. de Finetti's Theorem
2. up/down Markov chains and limiting diffusions
   intermission: a variety of models
3. Kingman's exchangeable random partitions
4. up/down Markov chains on partitions and limiting infinite-dimensional diffusions
intermission: a variety of models
Counting paths ⇒ p↓ ⇒ classification of coherent measures
Examples
Pascal triangle
Kingman's exchangeable random partitions
q-Pascal triangle, Stirling triangles, etc.
Young graph — the lattice of all partitions
Young graph with multiplicities (one choice of multiplicities is equivalent to Kingman's theory)
Schur graph — the lattice of all strict partitions (my yesterday's talk: a related Markov dynamics is Pfaffian)
Gelfand-Tsetlin schemes
...
intermission: a variety of models
Figure: Young graph

intermission: a variety of models
Figure: Kingman graph (= Young graph with multiplicities)
exchangeable random partitions
Definition
π is a random partition of N = {1, 2, 3, ...}, i.e., N = ⊔i∈I Ai.
π is called exchangeable if the distribution of π is invariant under permutations of N.
Kingman Simplex
∇∞ := {(x1 ≥ x2 ≥ · · · ≥ 0) : Σ xi ≤ 1}
Compact in the coordinatewise topology.
x ∈ ∇∞ ←→ discrete distribution
Probability measure on ∇∞ ←→ random discrete distribution
exchangeable random partitions
Kingman's representation
π — exchangeable random partition of N
↕
probability measure M on ∇∞ (the boundary measure)
Finite level random integer partitions — encoding of π
Restrict π to {1, 2, ..., n} ⊂ N, thus getting a random partition πn of the finite set {1, 2, ..., n}.
Exchangeability ⇒ the distribution of πn is encoded by the distribution of the decreasing block sizes
λ = (λ1 ≥ λ2 ≥ · · · ≥ λℓ(λ) > 0), λi ∈ Z, Σ λi = n.
⇒ a sequence of measures Mn on the sets
Pn := {λ = (λ1 ≥ · · · ≥ λℓ(λ)) — integer partitions with Σ λi = n}.
exchangeable random partitions
from {Mn} to M
Random x1 ≥ x2 ≥ · · · ≥ 0 are limiting values of λ1, λ2, ...:
λj/n → xj in law, j = 1, 2, ....
The sets of partitions Pn approximate ∇∞:
Pn ∋ λ = (λ1, ..., λℓ) ↪ (λ1/n, ..., λℓ/n, 0, 0, ...) ∈ ∇∞.
Images of the Mn's weakly converge to M on ∇∞.
From the point of view of random partitions π of N, the x1 ≥ x2 ≥ · · · ≥ 0 are the limiting frequencies of blocks, in decreasing order.
exchangeable random partitions
from M to {Mn} — two-step random sampling
1. Sample x = (x1, x2, ...) ∈ ∇∞ according to M on ∇∞. For simplicity, let Σ xi = 1.
2. Consider the measure with weights x1, x2, ... on N. Sample n independent numbers A1, ..., An ∈ N according to this measure.
3. λ1 ≥ λ2 ≥ · · · ≥ λℓ — multiplicities of A1, A2, ..., An in decreasing order:
e.g., (A1, ..., An) = (4, 3, 5, 1, 1, 3, 1) → λ = (3, 2, 1, 1).
The law of λ = (λ1, ..., λℓ) ∈ Pn is Mn.
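Steps 2-3 are a one-liner to simulate for a fixed frequency vector x. A minimal sketch, taking a finitely supported x chosen arbitrarily for illustration (a Dirac M = δx); it also reproduces the worked example from the slide.

```python
# Steps 2-3 of the two-step sampling: n i.i.d. picks with weights x,
# then the decreasing multiplicity vector λ.
import random
from collections import Counter

def sample_partition(x, n, rng):
    """Draw A_1,...,A_n i.i.d. with weights x; return multiplicities in decreasing order."""
    labels = rng.choices(range(len(x)), weights=x, k=n)
    return tuple(sorted(Counter(labels).values(), reverse=True))

rng = random.Random(1)
x = [0.5, 0.3, 0.2]                 # x1 >= x2 >= x3, summing to 1
lam = sample_partition(x, 7, rng)
assert sum(lam) == 7 and list(lam) == sorted(lam, reverse=True)

# the worked example from the slide: labels (4,3,5,1,1,3,1) -> λ = (3,2,1,1)
assert tuple(sorted(Counter([4, 3, 5, 1, 1, 3, 1]).values(), reverse=True)) == (3, 2, 1, 1)
```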
exchangeable random partitions
monomial symmetric functions
μ = (μ1, ..., μk) — integer partition,
mμ(y1, y2, ...) := Σ y_{i1}^{μ1} y_{i2}^{μ2} ... y_{ik}^{μk}
(sum over all distinct summands).
Examples:
m(k)(y1, y2, ...) = pk(y1, y2, ...) = Σ_{i=1}^∞ yi^k (Newton power sum)
m(1,...,1) (k ones) (y1, y2, ...) = Σ_{i1<···<ik} y_{i1} ... y_{ik} (elementary symmetric function)
For k > 1: m(k,1)(y1, y2, ...) = Σ_{i≠j} yi^k yj.
exchangeable random partitions
from M to {Mn} — two-step random sampling
For simplicity, let M be concentrated on {Σ xi = 1}. Then
Mn(λ) = C(n; λ1, λ2, ..., λℓ) ∫∇∞ mλ(x1, x2, ...) M(dx),
where C(n; λ1, ..., λℓ) = n!/(λ1! · · · λℓ!) is the multinomial coefficient.
partition structures
down kernel
The measures {Mn} are compatible with each other, through a certain canonical down transition kernel p↓n+1,n from Pn+1 to Pn:
p↓n+1,n(λ, μ) := C(n; μ1, ..., μℓ(μ)) / C(n + 1; λ1, ..., λℓ(λ))  if λ = μ + □,
p↓n+1,n(λ, μ) := 0 otherwise.
C(n; λ1, λ2, ..., λℓ) — number of paths in the Kingman graph from ∅ to λ ∈ Pn.
p↓n+1,n(λ, ·) = take (uniformly) a random box from λ and delete it.
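The coherence of the {Mn} under "delete a uniformly chosen box" can be checked exactly for the simplest M. A sketch, taking M = δx for a fixed finitely supported x (an illustrative choice, so that Mn is the exact law of the multiplicity vector of n i.i.d. picks from x):

```python
# Coherence M_{n+1} ∘ p↓ = M_n for the down kernel "delete a uniform box",
# with M = δ_x for a fixed finitely supported x (illustrative choice).
from fractions import Fraction
from itertools import product
from collections import Counter

X = [Fraction(1, 2), Fraction(1, 3), Fraction(1, 6)]   # x1 >= x2 >= x3, sum = 1

def M_n(n):
    """Exact law of the decreasing multiplicity vector λ of n i.i.d. picks from X."""
    law = Counter()
    for seq in product(range(len(X)), repeat=n):
        lam = tuple(sorted(Counter(seq).values(), reverse=True))
        w = Fraction(1)
        for i in seq:
            w *= X[i]
        law[lam] += w
    return law

def down(lam):
    """p↓(λ, ·): delete a uniformly chosen box of λ."""
    n1, out = sum(lam), Counter()
    for i, part in enumerate(lam):
        mu = list(lam)
        mu[i] -= 1
        mu = tuple(sorted((p for p in mu if p > 0), reverse=True))
        out[mu] += Fraction(part, n1)
    return out

for n in range(1, 6):
    upper, target = M_n(n + 1), M_n(n)
    pushed = Counter()
    for lam, w in upper.items():
        for mu, q in down(lam).items():
            pushed[mu] += w * q
    assert pushed == target          # M_{n+1} ∘ p↓ = M_n, exactly
```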
partition structures
Figure: Kingman graph
partition structures
Definition
{Mn} — a sequence of measures on Pn — is called a partition structure if it is compatible with p↓n+1,n:
Mn+1 ∘ p↓n+1,n = Mn
Kingman's theorem = classification of partition structures.
Any partition structure ⇒ up/down Markov chains.
Which partition structures are good for obtaining diffusions (the diffusions will be infinite-dimensional)?
Poisson-Dirichlet distributions
There is a distinguished two-parameter family of measures PD(α, θ) on ∇∞, 0 ≤ α < 1, θ > −α.
α = 0 — introduced by Kingman; definition through a Poisson point process on (0, ∞).
α ≠ 0 — Pitman(-Yor), '92–'95, motivated by stochastic processes; definition through a Cox point process on (0, ∞).
Ewens-Pitman sampling formula
Partition structure corresponding to PD(α, θ):
Mn(λ) = (n! / (∏i λi! · ∏k [λ:k]!)) · (θ(θ + α) · · · (θ + (ℓ(λ) − 1)α) / (θ)n) · ∏_{i=1}^{ℓ(λ)} ∏_{j=2}^{λi} (j − 1 − α),
where [λ:k] denotes the number of parts of λ equal to k.
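The sampling formula is easy to evaluate in exact arithmetic; a sketch checks that it defines a probability distribution on the partitions of n (the parameters α = 1/2, θ = 3/2 are an arbitrary admissible choice), plus the two hand-computable n = 2 values.

```python
# The Ewens-Pitman sampling formula M_n(λ), checked to sum to 1 over
# partitions of n; [λ:k] = number of parts of λ equal to k.
from fractions import Fraction
from collections import Counter

def partitions(n, largest=None):
    largest = n if largest is None else largest
    if n == 0:
        yield ()
        return
    for first in range(min(n, largest), 0, -1):
        for rest in partitions(n - first, first):
            yield (first,) + rest

def epsf(lam, alpha, theta):
    n, w = sum(lam), Fraction(1)
    for i in range(2, n + 1):
        w *= i                           # n!
    for i in range(len(lam)):
        w *= theta + i * alpha           # θ(θ+α)···(θ+(ℓ(λ)-1)α)
    for i in range(n):
        w /= theta + i                   # divide by (θ)_n
    for part in lam:
        for j in range(2, part + 1):
            w *= j - 1 - alpha           # ∏_{j=2}^{λi} (j-1-α)
            w /= j                       # ... and divide by λi! (= ∏_{j=2}^{λi} j)
    for mult in Counter(lam).values():
        for j in range(2, mult + 1):
            w /= j                       # divide by ∏_k [λ:k]!
    return w

alpha, theta = Fraction(1, 2), Fraction(3, 2)
for n in range(1, 8):
    assert sum(epsf(lam, alpha, theta) for lam in partitions(n)) == 1
# hand-checked n = 2 values: M_2((2)) = (1-α)/(1+θ), M_2((1,1)) = (θ+α)/(1+θ)
assert epsf((2,), alpha, theta) == (1 - alpha) / (1 + theta)
assert epsf((1, 1), alpha, theta) == (theta + alpha) / (1 + theta)
```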
up-down Markov chains
Having a partition structure {Mn} corresponding to PD(α, θ), define up/down Markov chains Tn^(α,θ) on Pn as before. They are reversible with respect to Mn.
One step of Tn^(α,θ) = move one box in the corresponding Young diagram from one place to another.
Remark: For α = 0, these chains have a population-genetic interpretation, and the limiting infinite-dimensional diffusions were constructed by Ethier and Kurtz in '81 (by approximating by finite-dimensional diffusions).
Scale space by 1/n (embed Pn into ∇∞) and scale time by 1/n².
The measures Mn converge to PD(α, θ); what about the Markov chains Tn^(α,θ)?
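A small sketch of one step of the chain on partitions: the up kernel is the general transform p↑(λ, Λ) = p↓(Λ, λ) Mn+1(Λ)/Mn(λ) of the "delete a uniform box" down kernel, with {Mn} given by the Ewens-Pitman formula; reversibility of Tn^(α,θ) with respect to Mn is checked exactly for small n.

```python
# One up/down step on partitions of n, built from the Ewens-Pitman
# partition structure; reversibility w.r.t. M_n is checked exactly.
from fractions import Fraction
from collections import Counter

def partitions(n, largest=None):
    largest = n if largest is None else largest
    if n == 0:
        yield ()
        return
    for first in range(min(n, largest), 0, -1):
        for rest in partitions(n - first, first):
            yield (first,) + rest

def epsf(lam, alpha, theta):
    """M_n(λ) of the Ewens-Pitman sampling formula, exact rationals."""
    n, w = sum(lam), Fraction(1)
    for i in range(2, n + 1):
        w *= i
    for i in range(len(lam)):
        w *= theta + i * alpha
    for i in range(n):
        w /= theta + i
    for part in lam:
        for j in range(2, part + 1):
            w *= Fraction(j - 1 - alpha) / j
    for mult in Counter(lam).values():
        for j in range(2, mult + 1):
            w /= j
    return w

def down(lam):
    """p↓(λ, ·): delete a uniformly chosen box of λ."""
    out = Counter()
    for i, part in enumerate(lam):
        mu = list(lam)
        mu[i] -= 1
        out[tuple(sorted((p for p in mu if p > 0), reverse=True))] += Fraction(part, sum(lam))
    return out

def T_step(lam, alpha, theta):
    """T_n(λ, ·): go up by p↑(λ, Λ) = p↓(Λ, λ) M_{n+1}(Λ)/M_n(λ), then down."""
    n, out = sum(lam), Counter()
    for Lam in partitions(n + 1):
        d = down(Lam)
        p_down = d.get(lam, Fraction(0))
        if p_down == 0:
            continue
        p_up = p_down * epsf(Lam, alpha, theta) / epsf(lam, alpha, theta)
        for mu, q in d.items():
            out[mu] += p_up * q
    return out

alpha, theta = Fraction(1, 2), Fraction(3, 2)
n = 4
law = {lam: epsf(lam, alpha, theta) for lam in partitions(n)}
Ts = {lam: T_step(lam, alpha, theta) for lam in partitions(n)}
for lam in law:
    assert sum(Ts[lam].values()) == 1                              # stochastic
    for mu in law:
        assert law[lam] * Ts[lam].get(mu, 0) == law[mu] * Ts[mu].get(lam, 0)
```

Stochasticity of the up kernel is exactly the partition-structure property Mn+1 ∘ p↓ = Mn.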
limiting infinite-dimensional diffusions
Theorem [P.]
1. As n → +∞, under the space and time scalings, the Markov chains Tn^(α,θ) converge to an infinite-dimensional diffusion process (Xα,θ(t))t≥0 on ∇∞.
2. The Poisson-Dirichlet distribution PD(α, θ) is the unique invariant probability distribution for Xα,θ(t). The process is reversible and ergodic with respect to PD(α, θ).
3. The generator of Xα,θ is explicitly computed:
Σ_{i,j=1}^∞ xi(δij − xj) ∂²/∂xi∂xj − Σ_{i=1}^∞ (θxi + α) ∂/∂xi.
It acts on continuous symmetric polynomials in the coordinates x1, x2, ....
limiting infinite-dimensional diffusions
Theorem [P.]
4. The spectrum of the generator in L²(∇∞, PD(α, θ)) is
{0} ∪ {−n(n − 1 + θ) : n = 2, 3, ...};
the eigenvalue 0 is simple, and the multiplicity of −n(n − 1 + θ) is the number of partitions of n with all parts ≥ 2.
Remark: connection with the Pascal triangle model
Degenerate parameters
α < 0 arbitrary, θ = −2α ⇒ partitions have ≤ 2 parts: de Finetti's model with (any) symmetric Beta distribution.
α < 0, θ = −Kα ⇒ K-dimensional generalization.
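The multiplicities in part 4 are easy to tabulate by direct enumeration. A small sketch; the cross-check against p(n) − p(n − 1) uses a standard partition identity (partitions of n containing a part 1 biject with partitions of n − 1), not a claim from the talk.

```python
# Multiplicity of the eigenvalue -n(n-1+θ): the number of partitions of n
# with all parts >= 2, by direct enumeration.
def partitions(n, largest=None, smallest=1):
    largest = n if largest is None else largest
    if n == 0:
        yield ()
        return
    for first in range(min(n, largest), smallest - 1, -1):
        for rest in partitions(n - first, first, smallest):
            yield (first,) + rest

def multiplicity(n):
    return sum(1 for _ in partitions(n, smallest=2))

assert [multiplicity(n) for n in range(2, 9)] == [1, 1, 2, 2, 4, 4, 7]

# standard identity: # partitions with all parts >= 2 equals p(n) - p(n-1)
p = lambda n: sum(1 for _ in partitions(n))
for n in range(2, 12):
    assert multiplicity(n) == p(n) - p(n - 1)
```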
scheme of proof
No finite-dimensional approximating diffusions for α ≠ 0!
1. The transition operators of the Markov chains Tn^(α,θ) act on symmetric functions in the coordinates λ1, ..., λℓ of a partition λ ∈ Pn.
2. Write the operators Tn^(α,θ) in a suitable basis (closely related to the monomial symmetric functions).
3. Pass to the n → +∞ limit of the generators (algebraically!). The processes' core is the algebra of symmetric functions.
4. Use the general Trotter–Kurtz technique to deduce convergence of the processes.
Thank you for your attention
Figure: x1(t) ≥ x2(t) ≥ x3(t) ≥ x4(t)