ST361: Ch 5.5 + Ch 5.6 Sampling Distribution

advertisement
ST361: Ch 5.5 + Ch 5.6 Sampling Distribution
Topics:
I. What is Sampling Distribution?
II. Sampling Distribution of a Sample Mean X
(a) X ~ Normal Distribution
(b) X ~ Non-normal Distribution
III. Central Limit Theorem
IV. Sampling Distribution of the Sample Proportion p
---------------------------------------------------------------------------------------------------------------------------I. Sampling Distribution

Population vs. Sample:

A Parameter is_____________________________________________

A Statistic is _________________________________________________

The observed value of statistic depends on the particular sample; hence it ________
from sample to sample. Such variability is called ____________________________

The probability distribution of the statistics is called _________________________
1
Ex1. A neighborhood has 5 houses A, B, C, D and E. They respectively have 3, 2, 5, 3, and 4
bedrooms. We randomly draw 3 houses at a time and calculate the sample statistics
median and mean. What is the sampling distribution of the sample median? What is the
sampling distribution of the sample mean?

Population =

Variable of interest =

Sample =
Houses drawn in the
sample
# of bedrooms
Sample median
Sample mean
ABC
3,2,5
3
10/3=3.3
ABE
3,2,4
3
9/3 = 3
ACD
3,5,3
3
11/3 = 3.7
ACE
3,5,4
ADE
3,3,4
ABD
10/3 = 3.3
BCD
BCE
2,5,4
4
11/3
BDE
2,3,4
3
9/3 = 3
CDE
5,3,4
2
II. Sampling Distribution of a Sample Mean X
Let X be the sample mean of a random sample X1 , X 2 ,..., X n from a population mean  and
SD  . (That is, X 
X1  X1    X n
.) We want to know the sampling distribution of X .
n
 If X ~ Normal (mean=  , SD=  ). Then X , the mean of a random sample of n
observations

follows a _____________________, with mean

X =

X
______________, and
X
=
X
and standard deviation
X .
____________
is also called standard error (SE) of X , or Standard error of the mean
Ex 2. Thousands of boxes contain nuts. The weights are normally distributed with mean  =1 lb
and SD  =0.01 lb. We inspect 4 boxes and get their weights X 1 , X 2 , X 3 , X 4 . The sample
mean is X 
X1  X 2  X 3  X 4
4
(a) What is the sampling distribution of X ? Mean and SE of X ?
(b) What is the probability that X lies between 0.99 and 1.01 lb?
3
 X ~ any non-normal distribution with mean=  , SD=  . The sampling distribution of X
based on samples of size n is
(a) If n is small (i.e., _______________ ), then

Distribution:

Mean  X and SE  X :
(b) If n is large (i.e., ________________ ), then

Distribution:

Mean  X and SE  X :
 These results follow from Central Limit Theorem (CLT)
III. Central Limit Theorem
Assume X follows an arbitrary distribution with mean  and SD  .
When sample size is sufficiently large (i.e., n  30), the sample distribution of X always
follows normal distribution with mean  and SE


n
Usually the ____________________ a distribution is, the __________ the sample
size will need to ensure normality of X
4
Ex3. Let X be the number of major defects for each new automobile tested. Suppose the number
of such defects for a certain model is with mean  =3.2 and SD  =2.4. A sample of 100
new cars is collected.
(a) What is the sampling distribution of X based on samples of size 100? What is its center
and what is the SE of X ?
(b) What is the probability that the sample average number of major defects exceeds 4?
 Comments:

If X be the sample mean of a random sample X1 , X 2 ,..., X n from a population mean
 and SD  , then regardless of the sample size n and the distribution of X,
 X  ____________,  X  ___________________

The variation of sample means is ________( , ,or  ) than variation of the original data
5

As sample size n increases,  x (the SE of X ) ____________, and the shape of the
sampling distribution becomes __________________ . This implies higher probability
around its mean  .
Ex4. The heights of college age students (denoted by X) are known to have mean  =115 and
SD  =30.
(a) What is the sampling distribution of X , the average height of 36 college age students?
What are the mean and SE of the sampling distribution of X ?
(b) What is the sampling distribution of X based on samples of 9 college age students? What
are the mean and SE of the sampling distribution of X ?
(c) Assume that we were told that the heights of college age students are normally
distributed. What is the sampling distribution of X based on samples of 9 college age
students? What are the mean and SE?
6
IV. Sampling Distribution of a Sample Proportion p
Ex. Consider a basket containing 100 balls with 2 colors: Red and White. The proportion of
Red balls is denoted by  (and is not known). Assume 20 balls were randomly picked
from the basket with replacement, and 14 balls out of the 20 balls were red.
(1) In the sample, what is the proportion of red balls?
(2) We refer such quantity, 14 / 20, as _________________________and denote it by_____.
(Note that ____________________________________________) Our question of
interests: what is the distribution of the sample proportion p ?
Thoughts: we can think a r.v such that X = 1 if “red” and X=0 if “not red”.
Then p can be view as ________________________________.
That is, p is ______________________.
Thus by ___________________________, p ~______________if n large.
(However, different criteria for “large n ” are needed here.)
Sampling Distribution of p
(a) If _large n (i.e.,____________________________), then the sample proportion p has

A ________________________ ( by ___________________________ )

Mean (denoted by  p )   , and SE (denoted by  p ) 
 1   
n
(b) If _small n (i.e.,_____________________________), then the sample proportion p has

________________________________

Mean (denoted by  p )  ______, and SE (denoted by  p ) = ____________
7
Ex5. In the population, the proportion of defectives  =12%.
(a) What is the sampling distribution of p based on 100 observations? What is the mean?
What is the standard error?
(b) What is the probability that p <0.05?
8
Download