BI 152 Week 12 sp. 2011 Name _____________________________ DISPERSION Ambrose Ch. 5 In Ambrose we have been working in Chapter 4, where the types of data and measures of central tendency are described. Last week we saw two samples of data that had the same mean, median and mode. But they were quite different in structure. I have added a third example with a different mode. 1. What are the ranges? (see below.) Sample A had a wide range. Sample B had a narrow range. But that still doesn’t tell us how the observations are clustered about the mean. Example (I have added a third sample with same mean and median, different mode): Sample A Foot length (cm) Sample B Foot length (cm) 31.0 17.8 27.0 31.1 27.0 26.8 mean 27 median 27 mode 26.0 27.0 29.5 24.5 27.0 Range = 13.2 Standard deviation (s) s= 26.8 27 median 27 mode Sample C Foot Length (cm) 16. 15 27 38 38 26.8 27 median 38 mode Range = Range = s= s= How can we describe that difference in the way the numbers are spread out? We use measures of dispersion – how spread out is the distribution? Is it close to the mean or not?. 1 BI 152 Week 12 sp. 2011 Name _____________________________ Measures of dispersion Standard deviation is most commonly used. The most common ones are the Variance and the Standard Deviation. Variance is simply the square of the standard deviation. So, if standard deviation is symbolized by “s” then variance is designated by s 2. Both measure the spread of the data. Standard Deviation Definition: Standard deviation is a statistical measure of spread or variability. It is the value that divides the normal distribution up so that one standard deviation from the mean in both directions includes 68.26% of the data. What does that mean? All the area under the bell curve represents 100% of the observations. The standard deviation partitions that space so that the mean, + or – one standard deviation includes 68.26% of the data. That’s a definition. Distribution A The value of the standard deviation is applied to the X axis. It divides the distribution up into defined units. Two standard deviations on both sides of the mean include just over 95% of the data. And three standard deviations on both sides of the mean include 99.74% of the data. Notice that the area under the curve becomes smaller as you move out toward the tails of the distribution. That is why less and less of a percentage of the data is included in each standard deviation away from the mean. 2. Which distribution has a smaller standard deviation, A or B (assume the x axis has the same scale in both)? Distribution B 3. How do you know? 4. How does standard deviation relate to accepting or rejecting a null hypothesis? 2 BI 152 Week 12 sp. 2011 Name _____________________________ Calculating the Standard Deviation The standard deviation is calculated from the data using this formula. Note: the formula in Ambrose is a different version of this. But I like this one. You don’t need to memorize this. The Σ symbol (sigma) means “sum of” You add up what follows the sigma symbol. In this formula, X is an observation. M is the mean for that sample. (We use X with a bar over it for mean.) n = the number of observations in the sample. So, subtract the mean from the observation. This is the “deviation” from the mean. Then square that number. Do this for all observations. Add up those squared deviations. Divide the sum of those squared deviations by n-1. Finally, take the square root of that to get the value for the standard deviation. 5. Let’s do one by hand. I have calculated the first deviation and squared it. Sample A Foot length (cm) _ (x – x) = _ (x – x)2= _ Σ (x – x)2= n-1 (31-26.8) = 4.2 31.0 17.64 17.8 27.0 31.1 27.0 Mean = 26.8 _ Σ (x – x)2= Let’s never do that again. You can use excel to calculate the standard deviations. Simply type =STDEV(A1:Z99) (of course you will fill in the actual range of your data points). You can also use http://www.graphpad.com/quickcalcs/CImean1.cfm You just type in your data for the sample, one number per line, and it will calculate the mean and standard deviation and other descriptive statistics. For another explanation of standard deviation, with great illustrations, go to http://www.mathsisfun.com/data/standard-deviation.html 3 BI 152 Week 12 sp. 2011 Name _____________________________ Have you got the idea? 6. Try this. Standard A B deviation s=2 5.413 s = 6.82 (s) Which distribution goes with which standard deviation? I C s = 11.26 II III 7. Suppose the mean height in BI 152 was 170 cm and the standard deviation was 10 cm How tall would a person be who is one standard deviation taller than the class average? How tall would a person be who is two standard deviations smaller than the class average? What is the range of heights that includes 95% of the data? 4