   

Stat 543 EM Algorithm Example Suppose that n iid trials each produce one of k possible outcomes. If p j  the probability that any trial produces outcome j and X ij  I  trial i produces outcome j  and n n j   X ij  the number of trials producing outcome j i 1 a joint pmf for all nk of the variables X ij is  k 1 n f x | p   p    p j j j 1  j 1 k nj j   k 1   1   p j  j 1   nk A loglikelihood is k lX  p    n j ln p j j 1 n n n and it is possible to argue that an MLE of p is  1 , 2 , , k n n n  .  For sake of example, suppose that what is observed in n  10 trials in a problem where k  4 is as below trial 1 outcome is 1 X 11  1 trial 2 outcome is 3 X 23  1 trial 3 outcome is 2 or 4 X 32  X 34  1 trial 4 outcome is 2 X 42  1 trial 5 outcome is 3 X 53  1 trial 6 outcome is 2 or 3 trial 7 outcome is 1 X 71  1 trial 8 outcome is 1 or 2 trial 9 outcome is 2 trial 10 outcome is 4 X 62  X 63  1 X 81  X 82  1 X 92  1 X 104  1 1 The likelihood function based on this information is LY  p   p12 p22 p32 1  p1  p2  p3 1  p1  p3  p2  p3  p1  p2   p12 p22 p32 p4  p2  p4  p2  p3  p1  p2  and one might potentially simply try to optimize this directly. Another possibility is consider an EM algorithm. If I knew all X ij I would have the log-likelihood l X  p   n1 ln p1  n2 ln p2  n3 ln p3  n4 ln 1  p1  p2  p3  that I know how to optimize. I don't have all the n j ' s . But I do know that n1  2  X 81 n2  2  X 32  X 62  X 82 n3  2  X 63 n4  1  X 34 and for a particular p 0   p01 , p02 , p03 , p04  conditioned on the data in hand (call them Y  y ) E p0  X 81 | Y  y   p01 p01  p02 E p0  X 32 | Y  y   p02 p02  p04 E p0  X 62 | Y  y   p02 p02  p03 E p0  X 82 | Y  y   p02 p01  p02 E p0  X 63 | Y  y   p03 p02  p03 E p0  X 34 | Y  y   p04 p02  p04 So E p0 lX  p  | Y  y  is easy enough to compute. For 2 a  p0   2  p01 p01  p02   1 1 1 b  p 0   2  p02      p01  p02 p02  p03 p02  p04  p03 c  p0   2  p02  p03 d  p0   1  p04 p02  p04 note that a  p 0   b  p 0   c  p 0   d  p 0   10  n and then that E p0 lX  p  | Y  y   a  p 0  ln p1  b  p 0  ln p2  c  p 0  ln p3  d  p 0  ln p4 This is maximized at  a  p0  b  p0  c  p0  d  p0   , , , p*  p 0     10 10 10   10 So, with a starting value p 0 , define iterates by   p l 1  p* p  l  and hope to iterate to a fixed point optimizing the loglikelihood. 3

   

Related documents

Products

Support

   

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib