Biology 336 Midterm answer key 1 General exam notes 120 points were possible. High score was 114.5 out of 120. Mean was 60% Revised score: Take original marks, minus question 3 marks, plus (0.4 * question 3 score), then divide by 102.5 Note: if this adjustment led to a lower mark, the original scoring was used for that individual. Mean is now 65%. Exam queries? Read this key completely, then feel free to come to office hours to ask. Poor performance? If your mark is higher on the final, the final will be weighted more heavily. If your mark is better on the midterm, the weighting will remain 25% for midterm, 35% for the final. The population genetic models are challenging: things may ‘click’ if you continue to work on them. If you were careful and thorough in your studying, and still did very poorly, come to office hours to discuss how you studied. Remarking of exams Time and care were taken to award partial credit. Remarks should be requested only for major errors. The entire exam will be re-marked if any question is re-marked: scores may drop if any of the original marks are found to be generous. Re-mark requests must be made in writing and submitted with the exam. Individuals requesting re-marking will not be eligible for a higher weight for the final exam. There were two versions of some questions: answers to both versions are given below. Biology 336 Midterm answer key 2 Question 1: questions and answers are the same for both exam versions. Mean score: 14.7 / 20. 18 individuals scored 20 / 20 1a (10 pts). HIV will quickly evolve resistance to AZT in patients that are treated with just AZT. Explain the requirements for evolution by natural selection and describe how drug resistant AZT fulfills the requirements. (3 sentences) Natural selection requires variation in a phenotype, selection favouring individuals with one phenotype over those with another, and heritability, so that the offspring of favoured parents will share their phenotype to some degree. HIV generates heritable variation, including variation in polymerase effectiveness when AZT is present, due to its high mutation rates. Virions with mutations that allow them to replicate in the presence of AZT will produce more copies and will thus become more common. 2 marks each for variation, selection, and heritability. 1 mark each for a concrete link between these concepts and the AZT example. Common error: not mentioning heritability. b. (5 pts) Explain the role played by stratigraphy in convincing Darwin and his contemporaries of the plausibility of evolution. (2 sentences) Stratigraphy demonstrated that the layers of rock were laid down over many millions of years. This convinced Darwin and others that sufficient time had passed for natural selection to have led to the existing diversity of organisms. 2 marks for demonstrating an understanding of stratigraphy, 1 mark for noting that stratigraphy demonstrates that the world is older than previously thought, and 2 marks for saying that the old age was necessary for the plausibility of evolution by the gradual process of natural selection. Common errors: - irrelevant discussion of homology and fossils - missing the importance of the age of the earth to natural seelctoin c. (5 pts) Identify the two major problems with this description of evolution of beak depth during the drought. Explain why each of these is a misconception. (2 sentences) During the drought in 1979 the finches faced a crisis that threatened the survival of the species. Unless the finches mutated to have larger beaks, the species would go extinct. Fortunately, individuals were able to adapt to the new, dryer conditions, and the species survived with larger beaks. First, mutations do not occur when required, but occur randomly, so the variation for larger beaks must have been present in the population. Second, individuals do not evolve, populations do, so the change in beak size did not occur in individuals but over time in the population. 2.5 marks for mutations occurring randomly not as needed. 2.5 marks for stating that individuals do not evolve, populations evolve. Many students missed the second misconception. Biology 336 Midterm answer key 3 2a. (20 points). Researchers have developed a mouse model for cystic fibrosis, a disease in humans that was typically lethal until recently. Mice homozygous for the cf mutation have poorly developed lungs and are more susceptible to infection, just as human cystic fibrosis patients are. Evolutionary biologists have collaborated with the medical geneticists to test hypotheses. In a (hypothetical) study, the scientists bred populations with a low frequency of cf, allowed them to randomly mate for a few generations, then exposed them to cholera (or the mouse version of it). There were two versions of this question. A Genotype Starting population Population post disease cf / cf 5 2 CF / cf 130 84 CF / CF 865 514 Starting population Population post disease 5 1 130 94 865 495 B a. (10 pts) Calculate the genotype and allele frequencies for this population after the epidemic. Is it in HWE? Be sure to show your work. A f(cf) = 0.073 f(CF) = 0.927 cf/cf CF/cf Observed genotype frequencies 0.003 0.14 Expected genotype numbers 3.2 81.5 deviates 0.47 0.07 X^2 = 0.54 << 3.84; not significantly out of equilibrium CF/CF 0.857 515.2 0.003 f(cf) = 0.081 f(CF) = 0.919 cf/cf CF/cf CF/CF Observed genotype frequencies 0.002 0.16 0.84 Expected genotype numbers 3.9 88.2 497.9 deviates 2.2 0.4 0.02 X^2 = 2.6 << 3.84; not significantly out of equilibrium Common errors: using expected frequencies rather than expected numbers to perform the chisquare test. b. (10 pts) Calculate the changes in allele frequency due to the epidemic and the relative fitnesses for the three genotypes. Show your work. A p(0) = 0.07; ∆p = 0.003; Absolute fitnesses Rel fit cf/cf 2/5 = 0.4 .62 CF/cf 0.65 1 CF/CF 0.59 .92 B p(0) = 0.07; ∆p = 0.011 Absolute fitness Relative fitness cf/cf 0.2 0.3 CF/cf 0.72 1 CF/CF 0.57 0.8 Biology 336 Midterm answer key 4 Common errors: - using the “after” frequencies in the formulae to calculate the after frequencies. (No formulae were needed: the “after” frequencies were calculated in part a, and the initial frequencies could be easily calculated. - using the wrong formulae, particularly the haploid model - calculating absolute fitness by dividing by the total number of individuals in the population, thereby confusing frequency with fitness. Mean score: 14.9 / 20. 35 individuals scored 20 / 20. Easy question! Question 2b: 20 points 2c. (10 pts). Do these data support the idea that selection due to cholera might maintain the cf allele in human populations? Explain how this might be true, and whether the data support it, being careful to refer to the data above. (2 sentences) Yes, the heterozygote has a higher fitness than either homozygote; heterozygote advantage will maintain both alleles in the population even if one is harmful. If humans were similar to mice, this could explain how cholera maintained the cf allele in the population. If calculations in part 2b resulted in different estimates of relative fitness, and these data were coherently discussed, partial credit was given. Common errors: - if the population is in HWE, selection is not acting. This is not a valid inference: a population may be affected by selection but the genotype frequencies do not change sufficiently to detect the process with a chi-square test - confusing heterozygote advantage (heterozygote has highest fitness) with a dominant favourable allele (deleterious allele in a heterozygote has high fitness, but not higher than one of the two homozygotes). Heterozygote advantage will maintain two alleles in a population indefinitely, while directional selection with a recessive deleterious allele will slow the loss of the deleterious allele from the population. - failing to mention the assumption that the mouse model applies to humans (1 point) 2d. (10 pts). An alternate hypothesis would propose that mutation maintains the cf allele in human populations. Estimate the mutation rate that would be required, and evaluate whether this hypothesis is reasonable given your knowledge of mutation. (Note: cf patients nearly always die before reproducing). Show your work. ~1 sentence for the evaluation. We will assume that cf is recessive (that is, without cholera, heterozygotes have same fitness as CF/CF: this assumption was given in part a)). Then, the frequency of cf = square root of (-u / s). We assume that s = -1 (zero fitness). So, f(cf) = sqrt(u) = 0.04. This means that u = (0.04)^2 = 0.0016. This is high: we estimate mutation rates per locus at 10-5 to 10-6. This makes it an unlikely explanation. Biology 336 Midterm answer key 5 Common errors: - many students used the f(cf) from part a. Note that this question asked about humans. An experimental population would not be a good one to evaluate for mutation / selection balance. - many students used s from part b, ignoring the information that cf patients die prior to reproducing, giving a very low fitness. Also, s in part b was a case of heterozygote advantage: mutation –selection balance is an alternate hypothesis, and assumes directional selection. - many students used the dominant equilibrium equation, rather than the recessive equation. 10.4 marks out of 20. No perfect scores: 11 scored 19 out of 20. 3. (20 pts) Two graduate students are performing mutation accumulation experiments in bacteria. Each follows a slightly different experimental protocol. One student starts with a bacterium in a large culture broth. Broth continuously flows into the system and out of the system, so that a large population is maintained for 500 days. The student maintains 50 different cultures during the experiment. The second student starts with a single bacterium in a small broth culture. Each day, the student transfers a single bacterium to a new culture, and maintains 50 replicate lines during the experiment. In both cases, the students freeze a sample every 50 days. Type Insertion Deletion Substitution Table 1A Type Insertion Deletion Substitution Table 1B Line 1 .10 .30 .60 Line 1 .10 .20 .70 Line 2 .40 .20 .40 Line 2 .30 .30 .40 Mutations observed per line At the end of the experiment, both students sequence three genes from each of the 50 lines at every 50 days. The data are shown below. The table at the left breaks down the frequency of each type of mutation found. 0.45 0.4 0.35 0.3 0.25 line 1 0.2 0.15 0.1 0.05 0 line 2 0 200 400 600 Days Figure 1: a. (10 pts). Explain why the differences in the methods led to the differences in the number of mutations observed per line (Figure 1). (1-2 sentences) Method 1 has high levels of competition (2 marks), which means that unfavourable mutations will become less common through time (3 marks), while method 2, with limited mutation, will maintain more unfavourable mutations (3 marks). This will result in lower levels of mutation detected for line 1 compared to line 2. The Ne (effective population sizes are different with Ne=v.small (1) for Method 2 and Ne=large (500) in Method 1. (2marks) Biology 336 Midterm answer key 6 2 marks only if you poined out that line 2 had more mutations 3 marks only if you recognized a difference in the competitive environment and made the correct association with the appropriate method. b. (10 pts). Based on the comparison of the frequency of the different types of mutations observed in this experiment (Table 1), explain the possible differences in the fitness effects of each type of mutation. Be sure to connect your hypotheses about fitness effects to the data presented! (1-2 sentences) A: The mutations seen in line 1 that are less common in line 2 are those that are more harmful. This means that insertions are more harmful than deletions or substitutions. B: The mutations seen in line 1 that are less common in line 2 are those that are more harmful. This means that deletions and insertions are more harmful than substitutions. 2 marks only for just pointing out the differences ie A. insertions > others for Line 2 or B. insertions and deletions>other. The other marks come from making the connection between fitness and their frequencies, given the method. COMMON MISTAKES: 1. The assumption that a larger population size would have a larger number of mutations. On its own this is true, but given the context of a selective environment in Method 1, this statement is problematic. This hypothesis does not fit with the data presented. 2. Mixing up the lines. Line 1=Method 1 and Line 2=Method 2. 3. Common misconceptions about a haploid bacterial organism (which means no heterozygotes!) a. Line 1=homozygous, Line 2=heterozygotes b. Bacteria have sex. They do not. They make exchange DNA via conjugation, transduction or transformation. c. Mating – implies sexual reproduction d. Inbreeding. 4. Many students assumed the experiments were run for different periods of time. The graph clearly shows this is not true. 5. Migration is occurring in Method 2. It is not. 6. Assumption that Method 2 exposed bacteria to different environments because they were transferred to a new culture or that nutrients were somehow different in Method 2. This is not in the question. Culture is the same in both methods one is simply using test tubes and Petri dishes while the other uses a large broth of culture. The nutrient media is the same in both. 7. Bacterial transfers in Method 2 DO NOT result in starting with a new line of bacteria. The line or genetic background comes from the starting bacteria but you pick a single bacterium and then allow bacteria to expand in population size each transfer. But always bottlenecking each time step to a population size of 1. Biology 336 Midterm answer key 7 Mean score: 7.7 out of 20. 21 individuals scored 20 out of 20. (This question was reweighted to 5 points due to its apparent difficulty.) 4. (20 pts). A friend is studying sunflowers growing in saline (= salty) areas next to the Salt Lake in Utah in the US and has surveyed the population at one locus. He surveyed seedlings when they had 4 leaves, then followed the plants to adulthood. Here are the number of seedlings found with each genotype, and the numbers that survived to adulthood. A. Genotype Seedlings Adults Genotype Seedlings Adults B. AA 491 245 AA 91 18 Aa 418 207 Aa 418 207 aa 91 18 aa 491 245 1 1 0.9 0.9 0.8 0.8 0.7 0.7 0.6 0.6 f(a) f(a) a. (10 pts) Assume that the relative fitness for each genotype remains constant over the next 20 years. Graph f(a) vs. time for that time period. (Ignore drift). (Do not calculate frequencies for all 20 generations, just sketch the general shape of the graph). Don't forget useful items such as labelled axes. 0.5 0.5 0.4 0.4 0.3 0.3 0.2 0.2 0.1 0.1 0 0 0 5 10 generation 15 20 0 5 10 15 20 generation b. (10 pts) Explain the pattern you graphed, noting how each factor affects the rate of evolution, and how they did so in this case. Show calculations where appropriate. (~3 sentences) A. A is favourable, with s = -0.6, which is large, so the decrease is fast. The initial frequency is 0.258, which gives many copies of the favourable allele, so it is fairly steep at first. Allele a is recessive (or nearly so: h = 0.012) so the rate of increase slows as a becomes more common as most of the copies of the unfavourable a alleles are in highfitness heterozygotes. B. a is favourable, with s = -0.6, which is large, so it increases fairly quickly. The initial frequency is 0.741, which gives many copies of the favourable homozygote so the rise is fairly rapid at first. Since a is almost dominant (h = 0.012: Aa is like AA) the rate of incline slows as A becomes rarer since it is mostly found in the high-fitness heterozygotes. Biology 336 Midterm answer key 8 For full marks the calculations for s & h must have been given. Using s, h and the allele frequencies why the curve was shaped as it was (ie why did it rates start relatively fast and why rates slowed). Mean score: 13.5 / 20. 10 individuals scored 20 / 20. 5a. (10) Imagine that there are two populations of plants, one of size 100 and one of size 2000. (For the moment, we will assume that N = Ne). How would the difference in population size would affect a) the fate of a slightly beneficial rare mutation, and b) the fate of a slightly deleterious mutation? (2-3 sentences: please briefly explain the process as well as the outcome). In smaller populations the relative effects of drift are stronger since the change in allele frequency due to sampling error of one individual is larger. This means that in a smaller population, a favourable allele as a higher probability of be lost and an unfavourable allele has a higher probability of fixing, although it is still more likely that favourable alleles will fix and unfavourable will be lost. 5b. (5) A new mutation has occurred in a population of 400 haploid algae. How long will it take this mutation to reach frequency of 0.8 if the mutation has a selective advantage of 30%? Show your work. (Version B: 600 algae, selective advantage of 25%). A: p(t) = 0.8; q(t) = 0.2; p(0) = 1/400; q(0) = 399/400; WA = 1, Wa = 0.7). t p (t ) p (0)W A . Since WA = 1, (0.8 / 0.2) = 0.0025 / (0.9975 * .7t); t ln .7 = ln 0.0063. = t q (t ) q (0)Wa t = 20.7 – 21 generations. B: p(t) = 0.8; q(t) = 0.2; p(0) = 1/600; q(0) = 599/600; WA = 1, Wa = 0.75). t p (t ) p (0)W A = . Since WA = 1, (0.8 / 0.2) = 0.002 / (0.998 * .75t); t ln .75 = ln 0.0004 q (t ) q (0)Wa t t = 27.05 – 27 generations. Note: the more precise fitness calculations of WA = 1, Wa = 0.77 or WA = 1, Wa = 0.83 were fine. Common errors: - Choosing arbitrary initial frequency anywhere from 0.01 to 0.0000001. The question indicates the starting frequency by saying “A new mutation has occurred in a population of 400 haploid algae.” This means f(mutant)=1/400 and f(nonmutant)=399/400. Sinc this is a HAPLOID algae there is no need to multiply the denominator by 2. 2 marks were taken off for choosing an arbitrary initial frequency 2 marks were taken off for making it a diploid. 1 mark for calculation errors that were horrendous. Midterm answer key Linkage disequilibrium (D) vs. time Linkage disequilibrium (D) vs. time 0.2 0.2 0.15 0.15 D 5c. (5) You observe the following pattern of linkage disequilibrium in a population (see graph). Suggest one explanation for the increase in D (1 sentence), and estimate r. Show your work. 9 D Biology 336 0.1 0.1 0.05 0.05 0 0 0 5 10 Time (generations) 0 5 Time (generations) D could increase due to migration from another population, selection on a new linked mutation, or selection on a multi-locus genotype. D will decrease due to r: D’ = (1-r)D. For A: 0.12 = (1-r)(0.15); r = 0.2. For B: 0.14 = (1-r)(0.15); r = 1/15 = 0.07. (more accurately, 0.1 = (1-r)4(0.15); ln (0.67) = 4 ln(1-r); -0.1 = ln(1-r); 1-r = exp(-0.1) = 0.90; r = 0.1. 2 marks for a correct explanation for an increase in D, 3 marks for a complete solution for estimating r. Common errors: - stating that D increases due a decrease in recombination without giving a potential explanation / cause of it. - not realizing that r will decrease d and so calculating r from time 0. (Recombination cannot explain an increase in D!) Average marks: 10.7 / 20. Nine individuals scored 20 / 20. 10