Standard Deviation

advertisement
Although the 5 number summary is very useful
for describing a data set, it is not the most
widely used. The most common measures are
the mean for the center and the standard
deviation to measure spread. Standard deviation
measures how far the observations are from their
mean.
*
* The variance of a set of observations is the
average of the squares of the deviations of
each observation from the mean.
* The standard deviation (s) is simply the square
root of the variance. Your TI-83 calls this Sx.
*
*
2
2
2
( x1 - x ) + ( x2 - x ) + ...+ ( xn - x )
Variance = s =
n-1
2
1
2
( xn - x )
s =
n-1
2
* Seven men took part in a study of metabolic
rates. Here are the calories burned in 24 hours
by the men:
1792
1666
1362
1460
1867
1439
* Find s.
* s=189.24
*
1614
* The sum of the deviations from the mean will
always be 0. This is why we square and square
root when finding s.
*
* Now the question comes up as to why we divide by n-1
instead of n. Because the sum of the deviations is always
0, the last deviation can be found once we know the first
n-1 deviations. Since only n-1 of the squared deviations
can vary freely, we average by dividing the total by n-1.
The number n-1 is called the degrees of freedom of the
variance or standard deviation.
*
1. s measures spread about the mean - so, use s only when
using xbar.
2. s=0 if there is no spread (ie, all observations are the
same). Else, s>0. A big s value implies the data are spread
out.
3. s is nonresistant.
*
* The five number summary is usually better
than the mean and standard deviation for
describing a skewed distribution.
* Use x-bar and s for reasonably symmetric
distributions.
* Remember, always graph the data!
*
Download