note-estimation

advertisement
Chapter 8: Estimation
8.1 Point and Interval Estimates
Definition 8.1: A Point Estimate
The value of the sample statistics that is used to estimate a population parameter
is called a point estimate.
A sample statistics or briefly statistics is said to be the best linear estimator of
population parameters or briefly parameters, the estimator must be;
(i) Unbiased
(iii) Efficient
(ii) Consistent
(iv) Sufficient
Definition 8.2: Properties of Estimators
Unbiased Estimator
A statistics ˆ is said to be an unbiased estimator of parameter  if,
E (ˆ)  
Consistent Estimator
Let ˆ is unbiased estimator of  . If V( ˆ )  0 when sample size, n   hence
ˆ is said to be consistent estimator of  .
Efficient Estimator
An estimator is said to be more efficient from the others estimator if its variance
is smaller than variance of the others estimator.
Sufficient Estimator
 is a sufficient estimator if it makes use of all of the information in the sample.
The sample variance, for example, is more sufficient than the sample range,
since the former uses all the scores in the sample, the latter only two.
Definition 8.3: An Interval Estimate
In interval estimation, an interval is constructed around the point estimate and it
is stated that this interval is likely to contain the corresponding population
parameter.
Definition 8.4: Confidence Level and Confidence Interval
Each interval is constructed with regard to a given confidence level and is called
a confidence interval. The confidence level associated with a confidence interval
states how much confidence we have that this interval contains the true
population parameter. The confidence level is denoted by 1   100% .
8.1.1 Confidence Interval Estimates of Population Parameters
The (1-  )100% Confidence Interval for  for Large Samples (n  30)
(i)
(ii)
x  z
x  z

n
2
2
s
n
if  is known and normally distributed population
if  is not known, n large
x  z
or
2
s
s
   x  z
2
n
n
with (1-  )% CI
The (1-  )100% Confidence Interval for  for Small Samples (n<30)
(i)
(ii)
x  z
2
x  t n 1,
s
n
2
if  is known
s
if  is not known
n
The (1-  )100% Confidence Interval for p for Large Samples (n  30)
pˆ  z
2
pˆ 1  pˆ 
n
Example 8.1 :
A publishing company has just published a new textbook. Before the company
decides the price at which to sell this textbook, it wants to know the average
price of all such textbooks in the market. The research department at the
company took a sample of 36 comparable textbooks and collected the
information on their prices. This information produced a mean price RM 70.50
for this sample. It is known that the standard deviation of the prices of all such
textbooks is RM4.50.
(a) What is the point estimate of the mean price of all such college
textbooks?
(b) Construct a 90% confidence interval for the mean price of all such college
textbooks.
Solutions
(a) The point estimate of the mean price of all such college textbooks is
RM 70.50, that is
Point estimate of   x  RM 70.50
(b) It is known that, n = 36,   x  RM 70.50 and   RM 4.50 .
For 90% CI,
90% = 100(1–  )%
1–  = 0.90
 = 0.1

2
 0.05
z  z0.05  1.65
2
Hence, 90% CI
From normal distribution
table,
 
= x  z 

n
2
 4.50 
= 70.50  1.65

 36 
= 70.50  1.24
= [ RM 69.26, RM 71.74]
Thus, we are 90% confident that the mean price of all such college textbooks is
between RM 69.26 and RM 71.74
Example 8.2
Consider a survey on male students height in a certain IPTA, a random sample
of 100 male students are taken. The height of the male students is normally
distributed with mean 178.2 cm dan variance 17.75 cm2.
a) Construct a 95% CI for mean of the male students height
b) If mean of the female students height is 170.2 cm, at 98% CI, verify whether
if this can proof that male students are taller than female students.
Solutions
a) It is known that, n = 100,   x  178.2 and 2  17.75 .
For 95% CI,
95% = 100(1–  )%
1–  = 0.95
 = 0.05

 0.025
2
From normal distribution
table,
z   z0.025  1.96
2
Hence, 95% CI
 
= x  z 

n
2
 17.75 

= 178 .2  1.96
 100 
= 178.2  0.83
= [177.37, 179.03]
b) It is known that,   x  178.2 and 2  17.75 .
For 98% CI,
98% = 100(1–  )%
1–  = 0.98
 = 0.02

 0.01
2
From normal distribution
table,
z   z0.01  2.33
2
Hence, 98% CI
 
= x  z 

n
2
 17.75 

= 178 .2  2.33
100


= 178.2  0.98
= [177.22, 179.18]
We can see that mean of female students does not lies in the interval
[177.22, 179.18] hence, this indicate that male students are taller than
female students.
Example 8.3
According to the analysis of Women Magazine in June 2005, “Stress has
become a common part of everyday life among working women in Malaysia.
The demands of work, family and home place an increasing burden on average
Malaysian women”. According to this poll, 40% of working women included in
the survey indicated that they had a little amount of time to relax. The poll was
based on a randomly selected of 1502 working women aged 30 and above.
Construct a 95% confidence interval for the corresponding population
proportion.
Solution
Let p be the proportion of all working women age 30 and above, who have a
limited amount of time to relax, and let p̂ be the corresponding sample
proportion. From the given information,
n = 1502 , p̂ =0.40, qˆ  1  pˆ = 1 – 0.40 = 0.60
Hence, 95% CI
 pˆ qˆ 

= pˆ  z  
n
2

 0.40(0.60) 

= 0.40  1.96
1502


= 0.40  0.01264069
= [0.375, 0.425] or 37.5% to 42.5%
Thus, we can state with 95% confidence that the proportion of all working
women aged 30 and above who have a limited amount of time to relax is
between 37.5% and 42.5%.
Download