Why do we care about inequalities in (algebraic) statistics? Piotr Zwiernik

advertisement

Basic algebraic statistics

Constrained multinomial model

Example: Tripod tree

General comments

Why do we care about inequalities in (algebraic) statistics?

Piotr Zwiernik

Mittag-Leffler Institute, Stockholm

(j.w. Jim Q. Smith)

WOGAS 3,

Warwick, 2 April 2011

Why do we care about inequalities in (algebraic) statistics?

1 / 17

Basic algebraic statistics

Constrained multinomial model

Example: Tripod tree

Outline of the talk

General comments

Algebraic statistical models and constrained multinomial models.

How inequalities affect inference for the tripod tree model?

Extensions to more general models.

Why do we care about inequalities in (algebraic) statistics?

2 / 17

Basic algebraic statistics

Constrained multinomial model

Example: Tripod tree

Algebraic statistical model

General comments

X a random variable with m values, p

= ( p

1

, . . . , p m

) probability simplex ∆ = { p ∈

R m : P i p i

=

1 , p i

ASM: p

: Θ → ∆ , M = p

(Θ) , p polynomial map

≥ 0 }

ASM given by polynomial equations and inequalities

X , Y binary and X ⊥ Y : p

: [

0 , 1

] 2 → ∆

( θ x

, θ y

) 7→ (

1 − θ x

)(

1 − θ y

) , (

1 − θ x

) θ y

, θ x

(

1 − θ y

) , θ x

θ y implicit equation: p

00 p

11

− p

01 p

10

=

0

Why do we care about inequalities in (algebraic) statistics?

3 / 17

Basic algebraic statistics

Constrained multinomial model

Example: Tripod tree

General comments

ASM as a constrained multinomial model interpret

∆ as a set of parameters for the multinomial model.

M = p

(Θ) ⊆ ∆ gives a constrained multinomial model

Example:

Bin

(

2 , θ ) no inequalities codim

=

1

(1 , 0 , 0)

θ = 0 .

3

(0 , 1 , 0)

θ = 0 .

6

(0 , 0 , 1)

MixBin

(

2 , θ ) inequalities codim

=

0

θ = 0 .

3

π = 0 .

2

θ = 0 .

6

Why do we care about inequalities in (algebraic) statistics?

4 / 17

Basic algebraic statistics

Constrained multinomial model

Example: Tripod tree

The likelihood function

General comments

X ∈ { 1 , . . . , m } , X

( 1 ) , . . . , X

( N ) an iid sample from M likelihood: L

(

θ

; x

) ∝ Q m i = 1 p i

(

θ

) x i constrained: L

( p

; x

) ∝ Q m i = 1 p i x i such that p ∈ M = p

(Θ) unconstrained: L

( p

; x

) ∝ Q m i = 1 p i x i such that p ∈ ∆

Why do we care about inequalities in (algebraic) statistics?

5 / 17

Basic algebraic statistics

Constrained multinomial model

Example: Tripod tree

The constrained likelihood

General comments

typically MLE does not have a closed-form solution the MLE will often lie on the boundary of the parameters space no usual asymptotics for the LR statistic, MLE etc.

posterior heavily depends on prior specification

Why do we care about inequalities in (algebraic) statistics?

6 / 17

Basic algebraic statistics

Constrained multinomial model

Example: Tripod tree

The tripod tree model

General comments

X

1

⊥ X

2

⊥ X

3

| H or a Bayesian network a tripod tree seven free parameters the codimension is zero

1

2 h

3

Why do we care about inequalities in (algebraic) statistics?

7 / 17

1

Basic algebraic statistics

Constrained multinomial model

Example: Tripod tree

Alternative parametrization

General comments

M

T

: k

12

=

1

4

( 1 − δ 2

) η

1

η

2

, k

13

=

1

4

( 1 − δ 2

) η

1

η

3

, k

23

=

1

4

( 1 − δ 2

) η

2

η

3

, k

123

=

1

4

( 1 − δ 2

) δη

1

η

2

η

3

Why do we care about inequalities in (algebraic) statistics?

8 / 17

µ

12

= µ

23

= 0

General comments

Basic algebraic statistics

Constrained multinomial model

Example: Tripod tree

Application: Identifiability

M

T

µ

13

= µ

23

= 0

µ

12

= µ

13

= 0

T

η

2

= 0

η

1

= 0

δ 2 = 1

η

3

= 0

Why do we care about inequalities in (algebraic) statistics?

9 / 17

Basic algebraic statistics

Constrained multinomial model

Example: Tripod tree

The model structure

General comments

The model covers 8

% of the probability simplex.

Picture: µ

1

= µ

2

= µ

3

= 1

2 and k

123

=

0 , 0 .

005 , 0 .

02

Why do we care about inequalities in (algebraic) statistics?

10 / 17

Basic algebraic statistics

Constrained multinomial model

Example: Tripod tree

General comments

The constrained likelihood for the tripod

Three possible scenarios:

(i)

ˆ ∈ M

T and then ` and ` ( θ ) bimodal

( p

) for p ∈ M is unimodal

(ii)

ˆ /

T and ` ( p

) is multimodal with one global maximum but many local maxima.

(iii)

ˆ /

T and ` ( p

) has multiple global maxima.

Why do we care about inequalities in (algebraic) statistics?

11 / 17

Basic algebraic statistics

Constrained multinomial model

Example: Tripod tree

Implications for inference

General comments

x

000 x

010 x

001 x

011 x

100 x

110 x

101 x

111

=

2069 16 2242 331

2678 863 442 1359

.

only k

12 k

13 k

23

≥ 0 does not hold the EM algorithm gives 4 different maxima of L

(

θ

)

θ

( r )

1

θ

( 1 )

1 | 0

θ

( 1 )

1 | 1

θ

( 2 )

1 | 0

θ

( 2 )

1 | 1

θ

( 3 )

1 | 0

θ

( 3 )

1 | 1

1 0 .

4658 0 .

3371 0 .

5524 1 .

0000 0 .

0000 0 .

4159 0 .

0745

2 0 .

5342 0 .

5524 0 .

3371 0 .

0000 1 .

0000 0 .

0745 0 .

4159

3 0 .

4771 0 .

0000 0 .

9167 0 .

6369 0 .

4216 0 .

1468 0 .

3775

4 0 .

5229 0 .

9167 0 .

0000 0 .

4216 0 .

6369 0 .

3775 0 .

1468

Why do we care about inequalities in (algebraic) statistics?

12 / 17

Basic algebraic statistics

Constrained multinomial model

Example: Tripod tree

Does it generalize?

General comments

Why do we care about inequalities in (algebraic) statistics?

13 / 17

Basic algebraic statistics

Constrained multinomial model

Example: Tripod tree

Main theorems

General comments

Theorem[Z.,JQ Smith]

There exist explicit formulae for the MLEs if

ˆ ∈ M

T

.

Generically the map is 2 int ( V ) − to − 1 , for some point the preimage is a manifold with corners, otherwise it is a singular subset of the parameter space.

All the equations and inequalities defining the model can be listed.

Why do we care about inequalities in (algebraic) statistics?

14 / 17

Basic algebraic statistics

Constrained multinomial model

Example: Tripod tree

Models with equality constraints

General comments

Can we determine from

ˆ whether the MLE lies on the boundary?

Can we perform a full Bayesian analysis?

Can we test these models?

For the Bayesian perspective see the results of the Dutch group

Utrecht: Herbert Hoijtink, Irene Klugkist, Olav Laudy, Bernet Kato et al.

Amsterdam: Ruud Wetzels, Raoul P.P.P. Grasman, Eric-Jan

Wagenmakers

Why do we care about inequalities in (algebraic) statistics?

15 / 17

Basic algebraic statistics

Constrained multinomial model

Example: Tripod tree

General comments

Thank you!

Why do we care about inequalities in (algebraic) statistics?

16 / 17

Basic algebraic statistics

Constrained multinomial model

Example: Tripod tree

The bibliography

General comments

D AVIS -S TOBER , C.

, Analysis of multinomial models under inequality constraints: Applications to measurement theory , Journal of Mathematical

Psychology , 2009.

P. Z

WIERNIK AND

J. Q. S

MITH

, Tree-cumulants and the geometry of binary tree models , to appear in Bernoulli .

P. Z WIERNIK AND J. Q. S MITH , The geometry of tree models and some consequences for inference , on arXiv.

Why do we care about inequalities in (algebraic) statistics?

17 / 17

Download