Statistical Analysis (calculating 95% confidence intervals)

advertisement
LAB EXERCISE: Statistical Analysis (calculating 95% confidence intervals)
Instructions for calculating means and 95% confidence intervals
1) Compute the mean or average for each condition or treatment.
As you know, the average (indicated by X ) is obtained by summing all measurement values (X's)
and dividing this sum by the number of measurements (n). This is written in mathematical symbols
as follows:
n
 Xi
X=
i 1
n
n
The symbol  indicates summing the individual measurements (Xi’s) from the first value (i=1) to
i 1
the last value (i=n). The symbol n is always used to indicate the sample size or number of available
measurements.
2) Compute the standard deviation.
The standard deviation, “sd”, can be computed as:
n
2
 ( X i X )
sd =
i 1
n 1
The numerator indicates taking the difference between the mean and each individual measurement,
squaring this difference, and summing the squared difference for all the measurements. The sum is
then divided by the sample size minus one, and the square root is taken of the remaining value.
Many calculators provide the mean and standard deviation with a single key stroke when the data
are entered using a special key (usually indicated with a summation symbol, ∑, or the word DATA).
A description of the appropriate steps and keys to use will be in the instruction manual for the
calculator.
3) Compute the standard error of the mean.
The standard error of the mean is frequently referred to as simply the standard error, and is
represented symbolically as SE. It is computed as follows:
sd
SE =
n
4) Compute the 95% confidence interval of the mean.
This is the interval within which the actual mean for the population is predicted to fall. The
confidence that the mean is within this interval is 95%, hence the name 95% confidence interval,
or 95% CI. This interval is why we determined the Standard Error (SE)– remember that 95% of
1
the means of similar experiments are expected to fall within 2 S.E. of our mean. Thus, the lower
95% confidence limit, represented symbolically as 95% LCL, is computed as the mean minus
2*S.E. Similarly, the upper 95% confidence limit, represented symbolically as 95% UCL, is
computed as the mean plus 2*S.E. These equations can be written as:
_
95% LCL = X- 2*SE
_
95% UCL = X+ 2*SE
Interpretation of 95% confidence intervals
Here is a Figure showing the findings of a previously conducted experiment. Look carefully at the
Figure to see how much information is conveyed. Even though you haven't read the accompanying
Research Report, you should able to deduce from this figure precisely what the experiment was
about and what the results were. A good figure should accomplish that goal for the reader. Later
you will want to return to this figure to look carefully at the particulars of the display format.
Change in mass (g)
0.5
0.45
0.4
0.35
0.3
0.25
0.2
0.1
0.2
Acid Concentration (M)
Figure 1. The mean mass change in antacid tablets exposed for ten minutes
to one of two different acid concentrations. N=10 antacids per
concentration. The central horizontal lines represent the means. The error
bars represent the 95% confidence interval.
Using the graph and the following rules it is possible to approximate a statistical test for answering
this question by visually comparing details of the graphed samples.
2
Rules for visual comparison of samples in order to decide if means are significantly different.
1. There is a statistically significant difference when both means fall outside of the
95% confidence intervals of the samples being compared.
2. There is not a statistically significant difference when either mean falls within the
95% confidence interval of the sample to which it is being compared.
Here is a hypothetical set of frequency distributions for growth of seedlings.
Apply the rules for comparing means, and determine which data sets are statistically significantly
different in the graph below:
Change in mas (g)
35
30
25
20
A
B
C
D
E
Group
Figure 2. Change in mass of five groups of seedlings given identical soils,
but with the addition of extra amounts of the specified substance (5 grams of
each substance was added weekly). Group A was the control group (soil
only), Group B had phosphorus added, Group C had Nitrate added, Group D
had Ammonium added, and group E had aluminum hydroxide added. For
each Group, the central horizontal line represents the mean and the error
bars are 95% confidence intervals.
Which groups grew significantly faster than the control?
Significantly slower?
What about relationships between the experimental groups?
3
BIOL201
Laboratory Assignment: Statistical Analysis and Graphing
Name: ____________________________________________
Table 1 presents the raw data for an experiment on grip strength.
Table 1. Individual measurements of maximum grip strength (in Newtons) in the dominant and
non-dominant hands of nine students.
Student ID # 1
2
3
4
5
6
7
8
9
Dominant
28.0
28.5
29.8
31.0
31.3
32.5
34.1
34.6
34.6
21.0
26.2
29.6
27.0
22.5
27.1
29.5
29.7
31.0
Non-dominant
Problem 1. Calculate the standard deviation and standard error for both the dominant and nondominant hand.
Step 1. Subtract each data value from the mean of the sample set. This equals the deviation value.
Step 2. Square each of the deviation values.
Step 3. Sum the deviations 2 (d2).
Sample 1
data value
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
Sample 2
mean
-
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
=
=
=
=
=
=
=
=
=
=
deviation
d2
data value
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
lllllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
sum of d2 ________
mean
-
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
=
=
=
=
=
=
=
=
=
=
deviation
d2
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
llllllllll
sum of d 2 ________
Step 4.
Calculate the standard deviation (sd) = square root ((sum of deviations 2)/ (sample size – 1))
Sample 1 sd ____________
Sample 2 sd ____________
4
The standard error = standard deviation / square root (sample size [n]); SE =
sd
n
Standard error of sample 1 = ____________ Standard error of sample 2 = ____________
Calculate the upper and lower confidence intervals (95% Confidence interval = X ± 2*SE)
Sample 1
Sample 2
Upper confidence interval = mean + 2*S.E.
Upper confidence interval = mean + 2*S.E.
___________
___________
Lower confidence interval = mean - 2*S.E.
Lower confidence interval = mean - 2*S.E.
___________
___________
This data can be properly represented in Table format:
Following is an example of the graph you will need to complete for your sample data.
Table 2. Comparison of the mean grip strength in the dominant and non-dominant hands of
CSUB students (n= 9, sd= standard deviation, S.E. = standard error of the mean). LCL and
UCL are lower and upper confidence limits for the 95% confidence interval (CI) of the mean.
95% CI
Treatment
mean
sd
2*SE
LCL
UCL
Dominant
Hand
50.0
0.71
0.10
49.9
50.1
Non-dominant
Hand
50.4
24.40
21.8
28.2
72.2
Your assignment is to present a reduced data table which should include the mean, standard
deviation, standard error, and the upper and lower confidence limits. In addition you must
graph your results. In the captions be sure to include a brief statement about the data
presented, the sample size, and include definitions for any abbreviations that occur in the
table. Don’t forget to include the units of measurement.
5
II. Graph the variation within the data set.
Use the graph below for your sample 1 and sample 2 data. Create a figure similar to Figure 2 using
your values from Table 2. Include the independent and dependent variable with correct axis.
90
80
70
60
50
40
30
20
10
0
Is there a significant difference between the two samples? _________________________
In addition, you must generate at Table and a Graph for your stomata data set. Keep the graph and
table but your lab instructor must initial here, indicating that your table and graph look correct.
Lab instructors initials here _________.
6
Download