NAME: ______________________________________ DATE: ____________ BLOCK: ____ Statistics can be used to determine if differences among groups are significant, or simply the result of predictable error. The statistical test most frequently used to determine whether data obtained experimentally provide a good fit, or approximation to the expected or theoretical data is the Chi-square test. This test can be used to determine if deviations from the expected values are due to chance alone, or to some other circumstance. For example, consider corn seedlings resulting from an F1 cross between parents heterozygous for the color. A Punnett square of the F1 cross Gg x Gg would predict that the expected proportion of green to albino seedlings would be 3:1. Use this information to fill in the expected (e) column and the (o-e) column in Table 1. Phenotype Green Albino Genotype GG or Gg gg Total Table 1 # observed (o) 72 12 84 # expected (e) (o-e) There is a small difference between the observed and the expected results, but are these data enough that the difference can be explained by random chance or variation in the sample? To determine if the observed data fall within acceptable limits, a Chi-square analysis is performed to test the validity of the null hypothesis. A null hypothesis states that there is NO statistically significant difference between the observed and expected data. If the Chisquare (Χ2) analysis indicates that the data vary too much from the expected ratio (for example, 3:1), an alternative hypothesis is accepted. Χ2= Σ (o-e)2 e where: o = observed number of individuals e = expected number of individuals Σ = the sum of the values (in this case the differences squared) This statistical test will examine the null hypothesis, which predicts that the data from the experimental cross above will be expected to fit the 3:1 ratio. Copy the data from Table 1 and complete Table 2. 1 Table 2 Phenotype Genotype # observed (o) Green GG or Gg 72 Albino gg 12 Total Number Observed # expected (e) (o-e) (o-e)2 (o-e)2 e Χ2 = Σ (o-e)2 e 84 Your calculations should give you a value of Χ2 = 5.14. This value is then compared to the following table of critical values. Table 3: Critical Values for Chi-square distribution Degree of Freedom Probability (p) 1 2 3 4 0.05 3.84 5.99 7.82 9.49 0.01 6.64 9.21 11.3 13.2 5 11.1 15.1 How to use the Critical Values Table: Determine the degrees of freedom (df) for your experiment. It is the number of categories minus 1. Since there are 2 possible phenotypes, for this experiment the df = 2 samples -1 = 1. If the experiment had gathered data for a cross such as GgTt x GgTt there would be 4 possible phenotypes and 3 degrees of freedom. The p value we will always use is 0.05. Find the row where p = 0.05. Under the 1 df column the critical value is 3.84. What does this mean? If the calculated Chi-square value is greater than or equal to the critical value from the table, then the null hypothesis is rejected. In our example, Χ2 = 5.14 and 5.14 > 3.84, therefore we reject the null hypothesis. We reject that there is no significant difference between the observed and the expected data. In other words, chance alone cannot explain the difference between the observed and expected and something went “askew”. With a p=0.05, you can say that you are 95% sure that your data do not fit the expected 3:1 ratio. Since these data do not fit the expected 3:1 ratio, you must then consider reasons for this variation. Additional experimentation may need to be done. You may have to increase the sample size. 2 Practice Problem 1 In a study of incomplete dominance in tobacco seedlings, the following data was collected from a cross between two heterozygous (GY) plants. Table 4 Genotype GG GY YY Phenotype Green Yellowish-green Albino Total Observed 22 50 12 84 Use the table below to help you calculate Χ2 for these data. The use the Critical value table and decide whether you will reject or accept the null hypothesis. Table 5 Phenotype Genotype # observed (o) Green GG 22 Yellowishgreen Albino GY 50 YY 12 # expected (e) (o-e) (o-e)2 (o-e)2 e Χ2 = Σ (o-e)2 e Total Number Observed 1. How many degrees of freedom do you have? ___________ 2. Χ2 = ______________ 3. Referring to the critical values chart, what is the critical value for these data (use p=0.05)? 4. Can you accept or reject the null hypothesis? Explain why. 3 Practice Problem 2 An investigator observes that when pure-breeding long wing Drosophila are mated with purebreeding short wing flies, the F1 offspring have intermediate wing length. The F1 offspring are allowed to interbreed and the following results are obtained. Observed 230 long wings 510 intermediate wings 260 short wings 1. What is the genotype of the F1 intermediate winged flies? 2. Write a hypothesis describing the mode of inheritance of wing length in Drosophila (this is your null hypothesis). Complete Table 6: Table 6 Phenotype Genotype # observed (o) # expected (e) (o-e) (o-e)2 (o-e)2 e Χ2 = Σ (o-e)2 e Total Number Observed 3. How many degrees of freedom do you have? ___________ 4. Χ2 = ______________ 5. Referring to the critical values chart, what is the critical value for these data (use p=0.05)? 6. Can you accept or reject the null hypothesis? Explain why. 4