Sampling Distributions

advertisement

Sampling

Distributions

The “What If?” game

Parameters

O Values that describe a characteristic of the POPULATION

O

O

Most of the time, there is no way for us to really know what this number is

μ = mean Of the

O

O p (or π ) = proportion

O α = y-int. of LSRL

O β = slope of LSRL

Statistics

O Value computed from a sample

O

O

Of the 𝑥 = mean

SAMPLE s = standard deviation

O 𝑝 = proportion

O a = y-int of LSRL

O b = slope of LSRL

Distribution

O All the values a variable can take, and the number of times that it takes each value

O A distribution is just a picture of the data

Put them together

Sampling Distribution

O The distribution of possible values of a statistic, from all the possible samples of the same size, from the same population

Sampling Distribution

O

O

O

I take a sample from a population, calculate a statistic.

What if I could take every possible sample of that size from that population, and calculate the same statistic every time, and then plot all of these values

The picture (distribution) of all those statistics from all the samples

(sampling)

Sampling Distributions

O We are going to be concerned with the distributions of sample proportions, 𝑝 , and the distributions of sample means, 𝑥.

O Since we often don’t know the true proportion, p, or the true mean, μ , the only information we have to base decisions on is the statistics 𝑝 and 𝑥.

Sample Proportions

O 𝑝 = 𝑥 𝑛

# in the sample that have this characteristic

Sample size

Assumptions - Proportions

O

O If we assume that our sample is not too big, less than 10% of the population so we can have independence

And

O If we assumer our sample is big enough, where np > 10 and n(1 – p) > 10

O Then we can use a normal curve to approximate the sampling distribution

If we’re going to use a normal model, we need:

O

Mean

O

Standard deviation

Suppose we have a population of six people: Alice,

Ben, Charles, Denise, Edward, & Frank

We are interested in the proportion of females.

This is called the parameter of interest.

What is the proportion of females?

Draw samples of two from this population.

How many different samples are possible?

6

C

2

=15

Find the 15 different samples that are possible & find the sample proportion of the number of females in each sample

Alice & Ben .5

Alice & Charles .5

Alice & Denise 1

Alice & Edward .5

Alice & Frank .5

Ben & Charles 0

Ben & Denise .5

Ben & Edward 0

Ben & Frank 0

Charles & Denise .5

Charles & Edward 0

Charles & Frank 0

Denise & Edward .5

Denise & Frank .5

Edward & Frank 0

Find the mean & standard deviation of all p-hats.

μ 

1

3

& σ  0 .

29814

Once you have your distribution of all the sample proportions in the whole wide world, from this size sample from the population…

μ

 p

The mean of all the sample proportions (statistics) in the whole wide world, all the p-hats, is equal to the value of the proportion for the whole population (parameter)

σ

 p

1

 p

 n

These are found on the formula chart!

O

Sample Means

𝑥 =

𝑥 𝑖 𝑛

Add all the individual values

Divide by how many there are

Assumptions - Means

O We want to be able to use a normal model

O Central Limit Theorem – When n is sufficiently large, the sampling distribution of 𝒙 is well approximated by a normal curve, even when the population distribution itself is not normal

Assumptions - Means

O So, what is

“sufficiently large”?

O n ≥ 30

Consider the population of 5 fish in my pond – the length of fish (in inches):

2, 7, 10, 11, 14

What is the mean and standard deviation of this population?

Let’s take samples of size 2

( n = 2) from this population:

How many samples of size 2 are possible?

5

C

2

= 10

Find all 10 of these m x

What is the mean

= 8.8

s x

= 2.4919

means .

sample means?

Repeat this procedure with sample size n = 3

Find all 10 of these samples and record the sample means .

What is the mean and standard deviation of the sample means?

m x

= 8.8

s x

= 1.6613

m x

= m s x

= 𝜎 𝑛

The mean of all the sample means

(statistics) in the whole wide world, all the x-bars, is equal to the value of the mean for the whole population

(parameter)

These are found on the formula chart!

Download