Chapter 7.3: Estimation - Interval Estimation for

ST 361 Estimation --- Interval Estimation for  (§7.3) ---------------------------------------------------------------------------------------------------------------------Topics: I. Interval estimation: confidence interval II. (Two-sided) Confidence interval for estimating population mean  (§7.2, 7.4) (a) When the population SD  is known: use Z distribution (§7.2) (b) When the population SD  is NOT known: use t distribution (§7.4) III. (Two-sided) confidence interval for estimating population proportion  (§7.3) IV. Two-sided confidence interval for estimating population mean difference 1  2 (§7.5) (a) when the population SD’s  1 , 2 are known (b) when the population SD’s  1 , 2 are NOT unknown ---------------------------------------------------------------------------------------------------------------------III. Confidence Interval for  , the population proportion (or the success probability)  The natural statistic for estimating  is the sample proportion of success p.  Recall that p   Also recall that if n is large (i.e., n  30, n  5 and n 1     5 ), the sampling distribution of p X , where X is total number of successes out of n trials. n can be approximated by p  N ( ,  (1   ) n ) The CI for  with confidence level ( 1   ) takes the following form: [ p  z / 2  (1   ) n , p  z / 2  (1   ) n ] Since  in the above interval is unknown, replace it by the sample proportion p and actually use the following CI: [ p  z / 2  p(1  p) p(1  p) , p  z / 2 ] n n Note that here we still use a Z value from the Standard normal distribution instead of a tdistribution, but this time it requires a more stringent criterion for “large n”. 1  In conclusion,  Still use Z-distribution to find the critical value z*  Need np > 10 and n(1-p)>10 for p to be normally distributed.  Confidence interval for  : (Assume n>30, np > 10 and n(1-p)>10) The Confidence interval for  is [ p  z / 2 p(1  p) p(1  p) , p  z / 2 ] n n Ex1. For a certain disease, there is a generic drug and a brand name. We want to estimate  = P(the brand name is more preferred by a patient). So 100 patients were asked, and 20 of them preferred the brand name. (a) What is your estimate of  ? What is the estimated SE of your estimator? Answer: A natural estimator of  is the sample proportion p = 20/100=0.2. The estimated standard error (SE) of p is p(1  p) 0.2  (1  0.2)   0.04 n 100 (b) Find the 95% CI for  Answer: Here n = 100>30, np = 100*0.2=20>10, n(1-p)=100*0.8=80>10. So we can use the CI formula for  given above. 1    0.95,    0.05,   / 2  0.025,  z0.025  1.96. So a 95% CI for  is [ p  z / 2 p(1  p) p(1  p) , p  z / 2 ] =[0.2 - 1.96*0.04, 0.2+1.96*0.04]=[0.122,0.278]. n n (c) What is the width of your CI? Answer: The width of the CI is 0.278 – 0.122=0.156. (d) If we want to make the width of our 95%CI for  be  0.1, how many patients should be sampled? Answer: The width of our 95% CI is 2  z / 2 p(1  p) p(1  p)  2 1.96  0.1 n n 2 0.2  0.8  2 1.96   0.1,  n     0.2  0.8  246. n  0.1  2 If we use p=0.2, then 2 1.96 If we treat p unknown (it is unknown before the sampling), we can use p=0.5 in the formula and 0.5  0.5  2 1.96   0.1,  n     0.5  0.5  384. n  0.1  2 solve 2 1.96 Ex2. Suppose that the proportion of the left-handed students at a certain university is  . A random sample of 200 students was collected and found that 40 out of the 200 students are left-handed. (a) Use an unbiased estimator to compute a point estimate of  . Answer: an unbiased estimator of  is the sample proportion p =40/200=0.2 (b) What is the approximate distribution of your estimator in (a)? Why? Answer: It can be well approximated by a normal distribution since n=200>30, np=40>10 and n(1-p)=160>10. (c) What is the estimated standard error of your estimator? (Hint: if the population proportion  is known, then  p   1    n ) Answer: The estimated standard error of p is p(1  p) 0.2  0.8   0.028 n 200 (d) Your estimator in (a) is unbiased because (circle one) i. Its distribution is normal ii. Its mean is equal to  iii. Its SE is equal to  1    n iv. It is based on a sample with size greater than 30 (e) What is a 95% confidence interval for  ? Answer: since n=200>30, np=40>10 and n(1-p)=160>10, we can use the formula given before: [ p  z / 2 p(1  p) p(1  p) , p  z / 2 ]  [0.2  1.96  0.028,0.2  1.96  0.028]  [0.145, 0.255] n n 3

Chapter 7.3: Estimation - Interval Estimation for

Related documents

Products

Support

Chapter 7.3: Estimation - Interval Estimation for

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib