LAB EXERCISE: Statistical Analysis (calculating 95% confidence intervals) Instructions for calculating means and 95% confidence intervals 1) Compute the mean or average for each condition or treatment. As you know, the average (indicated by X ) is obtained by summing all measurement values (X's) and dividing this sum by the number of measurements (n). This is written in mathematical symbols as follows: n Xi X= i 1 n n The symbol indicates summing the individual measurements (Xi’s) from the first value (i=1) to i 1 the last value (i=n). The symbol n is always used to indicate the sample size or number of available measurements. 2) Compute the standard deviation. The standard deviation, “sd”, can be computed as: n 2 ( X i X ) sd = i 1 n 1 The numerator indicates taking the difference between the mean and each individual measurement, squaring this difference, and summing the squared difference for all the measurements. The sum is then divided by the sample size minus one, and the square root is taken of the remaining value. Many calculators provide the mean and standard deviation with a single key stroke when the data are entered using a special key (usually indicated with a summation symbol, ∑, or the word DATA). A description of the appropriate steps and keys to use will be in the instruction manual for the calculator. 3) Compute the standard error of the mean. The standard error of the mean is frequently referred to as simply the standard error, and is represented symbolically as SE. It is computed as follows: sd SE = n 4) Compute the 95% confidence interval of the mean. This is the interval within which the actual mean for the population is predicted to fall. The confidence that the mean is within this interval is 95%, hence the name 95% confidence interval, or 95% CI. This interval is why we determined the Standard Error (SE)– remember that 95% of 1 the means of similar experiments are expected to fall within 2 S.E. of our mean. Thus, the lower 95% confidence limit, represented symbolically as 95% LCL, is computed as the mean minus 2*S.E. Similarly, the upper 95% confidence limit, represented symbolically as 95% UCL, is computed as the mean plus 2*S.E. These equations can be written as: _ 95% LCL = X- 2*SE _ 95% UCL = X+ 2*SE Interpretation of 95% confidence intervals Here is a Figure showing the findings of a previously conducted experiment. Look carefully at the Figure to see how much information is conveyed. Even though you haven't read the accompanying Research Report, you should able to deduce from this figure precisely what the experiment was about and what the results were. A good figure should accomplish that goal for the reader. Later you will want to return to this figure to look carefully at the particulars of the display format. Change in mass (g) 0.5 0.45 0.4 0.35 0.3 0.25 0.2 0.1 0.2 Acid Concentration (M) Figure 1. The mean mass change in antacid tablets exposed for ten minutes to one of two different acid concentrations. N=10 antacids per concentration. The central horizontal lines represent the means. The error bars represent the 95% confidence interval. Using the graph and the following rules it is possible to approximate a statistical test for answering this question by visually comparing details of the graphed samples. 2 Rules for visual comparison of samples in order to decide if means are significantly different. 1. There is a statistically significant difference when both means fall outside of the 95% confidence intervals of the samples being compared. 2. There is not a statistically significant difference when either mean falls within the 95% confidence interval of the sample to which it is being compared. Here is a hypothetical set of frequency distributions for growth of seedlings. Apply the rules for comparing means, and determine which data sets are statistically significantly different in the graph below: Change in mas (g) 35 30 25 20 A B C D E Group Figure 2. Change in mass of five groups of seedlings given identical soils, but with the addition of extra amounts of the specified substance (5 grams of each substance was added weekly). Group A was the control group (soil only), Group B had phosphorus added, Group C had Nitrate added, Group D had Ammonium added, and group E had aluminum hydroxide added. For each Group, the central horizontal line represents the mean and the error bars are 95% confidence intervals. Which groups grew significantly faster than the control? Significantly slower? What about relationships between the experimental groups? 3 BIOL201 Laboratory Assignment: Statistical Analysis and Graphing Name: ____________________________________________ Table 1 presents the raw data for an experiment on grip strength. Table 1. Individual measurements of maximum grip strength (in Newtons) in the dominant and non-dominant hands of nine students. Student ID # 1 2 3 4 5 6 7 8 9 Dominant 28.0 28.5 29.8 31.0 31.3 32.5 34.1 34.6 34.6 21.0 26.2 29.6 27.0 22.5 27.1 29.5 29.7 31.0 Non-dominant Problem 1. Calculate the standard deviation and standard error for both the dominant and nondominant hand. Step 1. Subtract each data value from the mean of the sample set. This equals the deviation value. Step 2. Square each of the deviation values. Step 3. Sum the deviations 2 (d2). Sample 1 data value lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll Sample 2 mean - lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll = = = = = = = = = = deviation d2 data value lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll lllllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll sum of d2 ________ mean - llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll = = = = = = = = = = deviation d2 llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll llllllllll sum of d 2 ________ Step 4. Calculate the standard deviation (sd) = square root ((sum of deviations 2)/ (sample size – 1)) Sample 1 sd ____________ Sample 2 sd ____________ 4 The standard error = standard deviation / square root (sample size [n]); SE = sd n Standard error of sample 1 = ____________ Standard error of sample 2 = ____________ Calculate the upper and lower confidence intervals (95% Confidence interval = X ± 2*SE) Sample 1 Sample 2 Upper confidence interval = mean + 2*S.E. Upper confidence interval = mean + 2*S.E. ___________ ___________ Lower confidence interval = mean - 2*S.E. Lower confidence interval = mean - 2*S.E. ___________ ___________ This data can be properly represented in Table format: Following is an example of the graph you will need to complete for your sample data. Table 2. Comparison of the mean grip strength in the dominant and non-dominant hands of CSUB students (n= 9, sd= standard deviation, S.E. = standard error of the mean). LCL and UCL are lower and upper confidence limits for the 95% confidence interval (CI) of the mean. 95% CI Treatment mean sd 2*SE LCL UCL Dominant Hand 50.0 0.71 0.10 49.9 50.1 Non-dominant Hand 50.4 24.40 21.8 28.2 72.2 Your assignment is to present a reduced data table which should include the mean, standard deviation, standard error, and the upper and lower confidence limits. In addition you must graph your results. In the captions be sure to include a brief statement about the data presented, the sample size, and include definitions for any abbreviations that occur in the table. Don’t forget to include the units of measurement. 5 II. Graph the variation within the data set. Use the graph below for your sample 1 and sample 2 data. Create a figure similar to Figure 2 using your values from Table 2. Include the independent and dependent variable with correct axis. 90 80 70 60 50 40 30 20 10 0 Is there a significant difference between the two samples? _________________________ In addition, you must generate at Table and a Graph for your stomata data set. Keep the graph and table but your lab instructor must initial here, indicating that your table and graph look correct. Lab instructors initials here _________. 6