MEASURES of SPREAD

advertisement
MEASURES of SPREAD
(Standard Deviation and Variance)
The measures of spread or dispersion of a data set are
quantities that indicate how closely a set of data clusters
around its centre. There are several different measures of
spread.
A deviation is the difference between an individual value and the mean for the data.
The larger the size of the deviations, the greater the spread in the data.
values below
values above
(–) deviation
mean
(+) deviation
NOTE: The sum of all the deviations of a set of data will equal zero!!
For a SAMPLE:
For a POPULATION:
deviation = x i  x
deviation = x i  
The standard deviation is the square root of the mean of the squares of the
deviations.
For a SAMPLE:
For a POPULATION:
 fi (xi  x)2
 fi (xi  )2
s
n 1

N
NOTE: (n-1) is used to compensate for the fact that a sample tends to underestimate
the deviations of a population.
For grouped data, where fi is the frequency for the given interval and mi is the
midpoint of the interval, the standard deviation is calculated using:
For a SAMPLE:
For a POPULATION:
 fi (mi  ) 2
 fi (mi  x) 2

N
n 1
The variance is the mean of the squares of the deviations (which is equal to the
square of the standard deviation).
s
For a SAMPLE:
s
2
fi ( x i  x ) 2


n 1
For a POPULATION:

2
fi ( x i  ) 2


N
A z-score is the number of standard deviations that a datum is from the mean.
For a SAMPLE:
z
(–) z-score
Example 
For a POPULATION:
xi  x
s
values below
z
xi  

values above
mean
z=0
(+) z-score
Twenty students each took six shots with a basketball from the free
throw line. The number of baskets made by each student was
recorded. The results were as follows:
3
3
5
5
4
3
5
4
3
2
4
2
5
4
3
3
4
2
6
2
Determine the mean, standard deviation, variance, and
z-scores for the basketball data.
Data
(xi)
TOTALS:
Frequency
(fi)
Deviation
(xi - x )
(xi - x )2
fi(xi - x )2
Z-score
x x
z i
s
Download