LAB EXERCISE: Statistical Analysis (calculating 95% confidence intervals) Instructions for calculating means and 95% confidence intervals 1) Compute the mean or average for each condition or treatment. As you know, the average (indicated by X ) is obtained by summing all measurement values (X's) and dividing this sum by the number of measurements (n). This is written in mathematical symbols as follows: n Xi X= i 1 n n The symbol indicates summing the individual measurements (Xi’s) from the first value (i=1) to i 1 the last value (i=n). The symbol n is always used to indicate the sample size or number of available measurements. 2) Compute the standard deviation. The standard deviation, “sd”, can be computed as: n 2 ( X i X ) sd = i 1 n 1 The numerator indicates taking the difference between the mean and each individual measurement, squaring this difference, and summing the squared difference for all the measurements. The sum is then divided by the sample size minus one, and the square root is taken of the remaining value. Many calculators provide the mean and standard deviation with a single key stroke when the data are entered using a special key (usually indicated with a summation symbol, ∑, or the word DATA). A description of the appropriate steps and keys to use will be in the instruction manual for the calculator. 3) Compute the standard error of the mean. The standard error of the mean is frequently referred to as simply the standard error, and is represented symbolically as SE. It is computed as follows: sd SE = n 4) Compute the 95% confidence interval of the mean. This is the interval within which the actual mean for the population is predicted to fall. The confidence that the mean is within this interval is 95%, hence the name 95% confidence interval, or 95% CI. This interval is why we determined the Standard Error (SE)– remember that 95% of 1 the means of similar experiments are expected to fall within 2 S.E. of our mean. Thus, the lower 95% confidence limit, represented symbolically as 95% LCL, is computed as the mean minus 2*S.E. Similarly, the upper 95% confidence limit, represented symbolically as 95% UCL, is computed as the mean plus 2*S.E. These equations can be written as: _ 95% LCL = X- 2*SE _ 95% UCL = X+ 2*SE Interpretation of 95% confidence intervals Here is a Figure showing the findings of a previously conducted experiment. Look carefully at the Figure to see how much information is conveyed. Even though you haven't read the accompanying Research Report, you should able to deduce from this figure precisely what the experiment was about and what the results were. A good figure should accomplish that goal for the reader. Later you will want to return to this figure to look carefully at the particulars of the display format. Change in mass (g) 0.5 0.45 0.4 0.35 0.3 0.25 0.2 0.1 0.2 Acid Concentration (M) Figure 1. The mean mass change in antacid tablets exposed for ten minutes to one of two different acid concentrations. N=10 antacids per concentration. The central horizontal lines represent the means. The error bars represent the 95% confidence interval. 2 Using the graph and the following rules it is possible to approximate a statistical test for answering this question by visually comparing details of the graphed samples. Rules for visual comparison of samples in order to decide if means are significantly different. 1. There is a statistically significant difference when both means fall outside of the 95% confidence intervals of the samples being compared. 2. There is not a statistically significant difference when either mean falls within the 95% confidence interval of the sample to which it is being compared. Here is a hypothetical set of frequency distributions for growth of seedlings. Apply the rules for comparing means, and determine which data sets are statistically significantly different in the graph below: Change in mas (g) 35 30 25 20 A B C D E Group Figure 2. Change in mass of five groups of seedlings given identical soils, but with the addition of extra amounts of the specified substance (5 grams of each substance was added weekly). Group A was the control group (soil only), Group B had phosphorus added, Group C had Nitrate added, Group D had Ammonium added, and group E had aluminum hydroxide added. For each Group, the central horizontal line represents the mean and the error bars are 95% confidence intervals. Which groups grew significantly faster than the control? Significantly slower? What about relationships between the experimental groups? 3 Exercise Table 1 presents the raw data for an experiment on grip strength. Table 1. Individual measurements of maximum grip strength (in Newtons) in the dominant and non-dominant hands of nine students. Student ID # 1 2 3 4 5 6 7 8 9 Dominant 28.0 28.5 29.8 31.0 31.3 32.5 34.1 34.6 34.6 21.0 26.2 29.6 27.0 22.5 27.1 29.5 29.7 31.0 Non-dominant Your assignment is to present a reduced data table which should include the mean, standard deviation, standard error, and the upper and lower confidence limits. In addition you must graph your results. In the captions be sure to include a brief statement about the data presented, the sample size, and include definitions for any abbreviations that occur in the table. Don’t forget to include the units of measurement. 4