Suppose that X1,X2,...,Xn are i.i.d random variables on the interval

advertisement
Suppose that X1,X2,...,Xn are i.i.d random variables on the interval [0,1] with density
function: F(x|alpha) = [gamme(2alpha)/[gamma(alpha)]^2] * [x(1-x)]^(alpha -1) where
alpha > 0 is a parameter to be estimated from the sample. It can be shown that E[X] = 1/2
and Var[X] = 1/ [4(2alpha+1)] a. How does the shape of the density depend on alpha? b.
How can the method of moments be used to estimate alpha? c. What equation does the
m.l.e. of alpha satisfy? d. What is the asymptotic variance of the m.l.e.?
𝑓(π‘₯/𝛼) =
Π“(2𝛼)
2
[π‘₯(1 − π‘₯)]𝛼−1 , 𝛼 > 0,0 ≤ π‘₯ ≤ 1
(Π“(𝛼))
1
1
1
𝐸(𝑋 ) =
∫ π‘₯ π‘˜ π‘₯ 𝛼−1 (1 − π‘₯)𝛼−1 𝑑π‘₯ =
𝐡(𝛼 + π‘˜, 𝛼)
𝐡(𝛼, 𝛼)
𝐡(𝛼, 𝛼)
π‘˜
0
=
Π“(2𝛼)Π“(𝛼 + π‘˜)Π“(𝛼)
2
(Π“(𝛼)) Π“(2𝛼 + π‘˜)
𝐸(𝑋) =
𝐸(𝑋 2 ) =
=
Π“(2𝛼)Π“(𝛼 + π‘˜)
Π“(𝛼)Π“(2𝛼 + π‘˜)
Π“(2𝛼)Π“(𝛼 + 1)
𝛼
1
=
=
Π“(𝛼)Π“(2𝛼 + 1) 2𝛼 2
(𝛼 + 1)
Π“(2𝛼)Π“(𝛼 + 2)
𝛼(𝛼 + 1)
=
=
Π“(𝛼)Π“(2𝛼 + 2) 2𝛼(2𝛼 + 1) 2(2𝛼 + 1)
𝑉(𝑋) = 𝐸(𝑋 2 ) − [𝐸(𝑋)]2 =
(𝛼 + 1)
1 2𝛼 + 2 − 2𝛼 − 1
1
− =
=
2(2𝛼 + 1) 4
4(2𝛼 + 1)
4(2𝛼 + 1)
1
The mean of the distribution is 2 irrespective of the value of 𝛼
So the spread of the distribution decrease with 𝛼
Moment estimate
Since the first moment does not depend on 𝛼, use the second moment.
Equate population variance and sample variance
1
= 𝑠2
4(2𝛼 + 1)
4(2𝛼 + 1) =
8𝛼 + 4 =
1
𝑠2
1
𝑠2
8𝛼 =
1
−4
𝑠2
𝛼=
1
1
−
2
8𝑠
2
𝛼̂ =
1
1
−
8𝑠 2 2
The moment estimate is
MLE
𝐿(𝛼) =
Π“(2𝛼)
2 (∏[π‘₯𝑖
(Π“(𝛼))
(1 − π‘₯𝑖 )]𝛼−1 )
𝑛
𝑙𝑛(𝐿) = 𝑙𝑛(Π“(2𝛼) − 2𝑙𝑛(Π“(𝛼)) + (𝛼 − 1) ∑[𝑙𝑛(π‘₯𝑖 ) + 𝑙𝑛(1 − π‘₯𝑖 )]
𝑖=1
𝑛
πœ•π‘™π‘›(𝐿) πœ•π‘™π‘›(Π“(2𝛼)
πœ•π‘™π‘›(Π“(𝛼)
=
−2
+ ∑[𝑙𝑛(π‘₯𝑖 ) + 𝑙𝑛(1 − π‘₯𝑖 )]
πœ•π›Ό
πœ•π›Ό
πœ•π›Ό
𝑖=1
𝑛
πœ•π‘™π‘›(Π“(2𝛼)
πœ•π‘™π‘›(Π“(𝛼)
−2
+ ∑[𝑙𝑛(π‘₯𝑖 ) + 𝑙𝑛(1 − π‘₯𝑖 )] = 0
πœ•π›Ό
πœ•π›Ό
𝑖=1
Or
𝑛
∑[𝑙𝑛(π‘₯𝑖 ) + 𝑙𝑛(1 − π‘₯𝑖 )] = 2
𝑖=1
πœ•π‘™π‘›(Π“(𝛼) πœ•π‘™π‘›(Π“(2𝛼)
−
πœ•π›Ό
πœ•π›Ό
is the equation to be used for finding the MLE.
Fisher information
𝐼(𝛼) = −𝐸 (
πœ• 2 𝑙𝑛(𝐿)
πœ• 2 𝑙𝑛(Π“(2𝛼)
πœ• 2 𝑙𝑛(Π“(𝛼)
πœ• 2 𝑙𝑛(Π“(𝛼) πœ• 2 𝑙𝑛(Π“(2𝛼)
=
−𝐸
−
2
=
2
−
)
(
)
πœ•π›Ό 2
πœ•π›Ό 2
πœ•2𝛼
πœ•2𝛼
πœ•π›Ό 2
The asymptotic variance is
1
1
=
2
πœ• 𝑙𝑛(Π“(𝛼) πœ• 2 𝑙𝑛(Π“(2𝛼)
𝐼(𝛼)
2
−
πœ•2𝛼
πœ•π›Ό 2
𝐿(𝛼, 𝑋1, 𝑋2, . . , 𝑋𝑛) =
Π“(2𝛼)
2 (∏[π‘₯𝑖
(Π“(𝛼))
(1 − π‘₯𝑖 )]𝛼−1 ) =
Π“(2𝛼)
2𝑑
(Π“(𝛼))
𝛼−1
×1
where 𝑑 = ∏𝑛𝑖=1[π‘₯𝑖 (1 − π‘₯𝑖 )]
Now the first function
Π“(2𝛼)
2
(Π“(𝛼))
𝑑 𝛼−1 is a function of 𝛼 and 𝑑 alone and the second function 1 is
independent of 𝛼. Therefore by Neyman’s factorization criterion, 𝑑 = ∏𝑛𝑖=1[π‘₯𝑖 (1 − π‘₯𝑖 )] is a sufficient
statistic.
The MLE equation can be solved to get 𝛼 when the X1, X2, …, Xn are numerically given using tables of
digamma and trigamma functions using numerical methods or software packages. No simple formula is
possible.
Download