Standard Deviation (s)

advertisement
AP Statistics
1.2 Measuring Spread
Variance and Standard Deviation
Objectives: Given a data set, compute the standard deviation and variance as measures of spread.
Give two reasons why we use squared deviations rather than just average deviations
from the mean.
Explain what is meant by degrees of freedom.
Identify situations in which the standard deviation is the most appropriate measure of
spread and situations in which the IQR is the most appropriate measure.
Standard Deviation (s):
The five-number summary is not the most common numerical description of a distribution.
That belongs to the combination of the mean to measure center and the standard deviation to
measure spread. The standard deviation measures spread by looking at:
The Variance s2 and Standard Deviation s
The variance s2 of a set of observations is the average of the squares of the deviations of the
observations from their mean. In symbols, the variance of n observations x1, x2, …, xn is :
Or, more compactly:
The standard deviation s is the square root of the variance s2:
In words:
Some deviations will be positive and some negative because some of the observations fall on each
side of the mean. In fact the:
Squaring the deviations makes them all positive.
The Variance is the average squared deviation, therefore:
Example 1:
Radon is a naturally occurring gas and is the second leading cause of lung cancer in the United
States. It comes from the natural breakdown of uranium in the soil and enters buildings through
cracks and other holes in the foundations. Found throughout the United States, levels vary
considerably from state to state. There are several methods to reduce the levels of radon in your
home, and the Environmental Protection Agency recommends using one of these if the measured
level in your home is above 4 picocuries per liter. Four readings from Franklin County, Ohio, where
the county average is 9.32 picocuries per liter, were 5.2, 13.8, 8.6 and 16.8.
a) Find the mean step-by-step.
b)
Find the standard deviation step-by-step.
c) Now enter the data into your calculator and use the mean and standard deviation buttons to
obtain x̅ and s. Do the results agree with your hand calculations?
Degrees of Freedom:
Properties of the Standard Deviation

s measures spread about the mean and should be used only when:

s = 0 only when there is no spread/variability. This happens only when all
observations have the same value. Otherwise, s > 0. As the observations become
more spread out about their mean, _____________________________.

s, like the mean x̅ , is NOT RESISTANT. A few outliers can make s very large.
The use of squared deviations renders s even more sensitive than x̅ to a few extreme
observations.
Choosing a Summary:
The five-number summary is usually better than the mean and standard deviation for describing a
___________________________________________ or a distribution with strong ____________________. Use x̅ and s only
for reasonably __________________________ distributions that are free of outliers.
Example for choosing summaries:
Stemplots of annual returns for stocks and Treasury bills, 1950 to 2003. (a) Stock returns, in
whole percents. (b) Treasury bill returns, in percents and tenths of percent. Which one would you
use s and x̅ , and which one would you use the five-number summary?
**Remember! The choice of using mean-based (mean, standard deviation) or median-based
(median, five-number summary, IQR) statistics to describe a distribution should be based on
the ________________ of the distribution!
HW: pg. 89-90; 1.39 – 1.44
Download