Examples of Deviation from Hardy-Weinberg Equilibrium Generation 1 Generation 2 Generation 3 Generation 4 AA 0.64 0.63 0.64 0.65 Aa 0.32 0.33 0.315 0.31 aa 0.04 0.04 0.045 0.04 Is this population in HW equilibrium? If not, how does it deviate? What could be the reason? 1 Testing for Deviaton from HardyWeinberg Expectations • A c2 goodness-of-fit test can be used to determine if a population is significantly different from the expections of Hardy-Weinberg equilibrium. • If we have a series of genotype counts from a population, then we can compare these counts to the ones predicted by the Hardy-Weinberg model. • O = observed counts, E = expected counts, sum across genotypes 2 Testing for Deviaton from Hardy-Weinberg Expectations • O = observed counts, E = expected counts, sum across genotypes • We test our c2 value against the Chi-square distribution (sum of square of a normal distribution), which represents the theoretical distribution of sample values under HW equilibrium Less likely to get these values by chance • And determine how likely it is to get our result simply by chance (e.g. due to sampling error); i.e., do our Observed values differ from our Expected values more than what we would expect by chance (= significantly different)? 3 Test for Deviation from HW equilibrium • Genotype Count Generation 4: AA 65 Aa 31 aa 4 • Calculate the c2 value: Genotype AA Aa aa Total Observed Expected (O-E)2/E 65 64.8 0.00062 31 31.4 0.0051 4 3.8 0.0105 100 100 0.016 • Since c2 = 0.016 < 3.841 (from Chi-square table for critical values, alpha = 0.05), we conclude that the genotype frequencies in this population are not significantly different than what would be expected if the population were in Hardy-Weinberg equilibrium.4 • The chi-squared distribution is used because it is the sum of squared normal distributions • • • • Calculate Chi-squared test statistic Figure out degrees of freedom Select confidence interval (P-value) Compare your Chi-squared value to the theoretical distribution (from the table), and accept or reject the null hypothesis. – If the test statistic > than the critical value, the null hypothesis (H0 = there is no difference between the distributions) can be rejected with the selected level of confidence, and the alternative hypothesis (H1 = there is a difference between the distributions) can be accepted. – If the test statistic < than the critical value, the null hypothesis 5 cannot be rejected Test for Significance of Deviation from HW Equilibrium Degrees of Freedom is n – 1 = 2 alleles (p, q) -1 = 1 6 Testing for significance • The results come out not significantly different from HW equilibrium • This does not necessarily mean that genetic drift is not happening, but that we cannot conclude that genetic drift is happening • Either we do not have enough power (not enough data, small sample size), or genetic drift is not happening • Sometimes it is difficult to test whether evolution is happening, even when it is happening... The signal needs to be sufficiently large to be sure that you can’t get the results by chance (like by sampling error) 7 Test for Deviation from HW equilibrium • Genotype Count Generation 4 increase sample size AA 65000 Aa 31000 aa 4000 • Calculate the c2 value: Genotype AA Aa aa Total Observed Expected 65000 64800 31000 31400 4000 3800 100,000 100,000 (O-E)2/E 0.617 5.10 10.32 16.04 • Since c2 = 16.04 > 3.841 (from Chi-square table for critical values, alpha = 0.05), we conclude that the genotype frequencies in this population ARE significantly different than what would be expected if the population were in Hardy-Weinberg equilibrium.8 Test for Significance of Deviation from HW Equilibrium Degrees of Freedom is n – 1 = 2 alleles (p, q) -1 = 1 9 Evolutionary Mechanisms Genetic Drift (and Inbreeding) 10 DEVIATION from Hardy-Weinberg Equilibrium Indicates that EVOLUTION Is happening 11 Evolutionary Mechanisms (will put population out of HW Equilibrium): • • • • Genetic Drift Natural Selection Mutation Migration *Epigenetic modifications change expression of alleles but not the frequency of alleles themselves, so they won’t affect the actual inheritance of alleles However, if you count the phenotype frequencies, and not the genotype frequencies , you might see phenotypic frequencies out of HW Equilibrium due to epigenetic silencing of alleles. (epigenetic modifications can change phenotype, not genotype) Epigenetic modification could affect which alleles are exposed to natural selection (no effect of Genetic Drift) 12 Sources of Genetic Variation Mutation generates genetic variation Natural Selection Natural Selection acts on genetic or epigenetic variation in a population Without genetic or epigenetic variation, Natural Selection cannot occur Epigenetic modification changes expression of genes Genetic Drift causes fluctuations in allele frequencies and can reduce genetic variation 13 Sources of Genetic Variation Mutation generates genetic variation Natural Selection Natural Selection acts on genetic or epigenetic variation in a population Without genetic or epigenetic variation, Natural Selection cannot occur Epigenetic modification changes expression of genes Genetic Drift causes fluctuations in allele frequencies and can reduce genetic variation 14 Today’s OUTLINE: (1) Migration (2) Genetic Drift (3) Effective Population Size (4) Model of Genetic Drift (5) Heterozygosity (6) A Consequence of Genetic Drift: Inbreeding 15 Migration (fairly obvious) Can act as a Homogenizing force (If two populations are different, migration between them can reduce the differences) A population could go out of HW equilibrium with a lot of migration 16 Impact of Migration Immigration: could introduce genetic variation into a population Emigration: could reduce genetic variation in a population 17 Random Genetic Drift 18 Sewall Wright (1889-1988) • Sewall Wright worked on agricultural stocks (e.g. cows), and was consequently interested in small, inbred populations • Thus, he regarded Inbreeding and Genetic Drift as particularly important genetic mechanisms • Genetic Drift and Inbreeding could generate new gene interactions • These new gene interactions (epistasis caused by new recombinations) are the main substrate for selection 19 “Null Model” No Evolution: Null Model to test if no evolution is happening should simply be a population in Hardy-Weinberg Equilibrium No Selection: Null Model to test whether Natural Selection is occurring should have no selection, but should include Genetic Drift This is because Genetic Drift is operating even when there is no Natural Selection 20 “Null Model” No Selection: A model that tests for the presence of Natural Selection should include Genetic Drift This is because Genetic Drift is operating even when there is no Natural Selection So the null model that tests for natural selection should include demography, population growth and decline, as population size directly affects the level of genetic drift acting on a population 21 Even after the synthesis the relative importance of Natural Selection and Genetic Drift were debated • During the Evolutionary Synthesis, Sewall Wright focused more on importance of Genetic Drift, whereas Fisher focused on Natural Selection • Shortly after the Evolutionary Synthesis many focused on selection to the point of assuming that most phenotypes were the result of Natural Selection • Emphasis on Genetic Drift resurged in the 1970s, 80s with Kimura’s “Neutral Theory” • Then in the 2000s and 2010s interest in Selection increased with the ability to detect signatures of Natural Selection in genome sequence data 22 Motoo Kimura (1924-1994) The Neutral Theory of Molecular Evolution Classic Paper: Kimura, Motoo. 1968. Evolutionary rate at the molecular level. Nature. 217: 624–626. Classic Book: Kimura, Motoo (1983). The neutral theory of molecular evolution. Cambridge University Press. 23 Molecular Clock Observations of amino acid changes that occurred during the divergence between species, show that molecular evolution (mutations) takes place at a roughly constant rate. This suggests that molecular evolution is constant enough to provide a “molecular clock” of evolution, and that the amount of molecular change between two species measures how long ago they shared a common ancestor. From this, Kimura concluded that not much selection is going on (because the pattern of regular mutations is not obscured by selection), and that most evolution is influenced by Genetic Drift. Figure: the rate of evolution of hemoglobin. Each point on the graph is for a pair of species, or groups of species. From Kimura (1983). 24 The Neutral Theory of Molecular Evolution • The Neutral theory posits that the vast majority of evolutionary change at the molecular level is caused by random genetic drift rather than natural selection. Motoo Kimura • Neutral theory is not incompatible with Darwin's theory of evolution by natural selection: adaptive changes are acknowledged as present and important, but hypothesized to be a small minority of evolutionary change. • Recent tests of selection have found that in many cases evolution is not neutral, even in non-coding regions of the genome. • Nevertheless, the neutral theory is useful as the null hypothesis, for testing whether natural selection is occurring. 25 I. Random Genetic Drift Definition: Changes in allele frequency from one generation to the next simply due to chance (sampling error). This is a NON ADAPTATIVE evolutionary force. Darwin did not consider genetic drift as an evolutionary force, only natural selection. 26 Genetic Drift Genetic Drift happens when populations are limited in size, violating HW assumption of infinite population size When population is large, chance events cancel each other out When population is small, random differences in reproductive success begin to matter much more 27 Effective Population Size In Evolution, when we talk about population size, we mean effective population size 28 Effective Population Size The concept of effective population size Ne was introduced by Sewall Wright, who wrote two landmark papers on it (Wright 1931, 1938). Sewall Wright defined it as "the number of breeding individuals in an idealized population that would show the same amount of dispersion of allele frequencies under random genetic drift or the same amount of inbreeding as the population under consideration” OK, easier definition: Effective population size is the number of individuals in a population that actually contribute offspring to the next generation. 29 Effective Population Size The effective population size is usually smaller than the real census population size because not everyone breeds and leaves offspring The effective population size is always either equal to or less than the census population size (N). Unequal sex ratio, variation in number of offspring, overlapping generations, fluctuations in population size, nonrandom mating could lead to an effective population size that is smaller than the census size. Ne = 4Nm Nf (Nm + Nf ) where Nm = number of males, Nf = number of females from this equation, you can see that unequal sex ratio would lead to lower Ne 30 Why do we care about effective population size Ne? Because Ne is the actual unit of evolution, rather than the census size N Why? Because only the alleles that actually get passed onto the next generation count in terms of evolution… the individuals that do not mate or have offspring are evolutionary dead ends… 31 Chance Events (no Selection) There is always an element of chance, in: Who leaves Offspring # of Offspring Which Offspring survive (which gametes, which alleles) 32 Example of sampling error Green fur: Blue fur: G (dominant) b (recessive) If Gb x Gb mate, the next generation is expected to have: 3:1 Ratio of Green to blue (GG, Gb, bG, bb) But this might not happen One family could get this unusual frequency just due to chance: You might get: bb, bb, bb, bb And, so you might accidentally lose the G allele not for any reason, but just due to chance The larger the population, the more these effects average out, and frequencies approach HW equilibrium 33 Why is this not Selection? Selection happens when some survive for a reason: better adapted. Genetic Drift is just a numbers game. Which gamete gets fertilized, which allele gets passed on is RANDOM 34 Consequences of Genetic Drift Random fluctuations in allele frequency If population size is reduced: 1. At the Allelic level: Random fixation of Alleles (loss of alleles) 2. At the Genotypic level: Loss of Heterozygosity (because of fewer alleles) 35 Random Shift in Allele Frequencies across Generations 36 Probability of loss of alleles is greater in smaller populations For example, if there are 50 different alleles in population …and a new population is founded by only 10 individuals Then the new population will be unable to capture all 50 alleles, and many of the alleles will be lost 37 Bottleneck Effect 38 Random fixation of alleles FIXATION: When an allele frequency becomes 100% The other alleles are lost by chance Fluctuations are much larger in smaller populations 39 Probability of fixation of an allele Probability of fixation of an allele = the allele’s starting frequency (Sewall Wright 1931) That is, if the frequency of an allele is 0.10 (10% in the population), then its probability (chances) of fixation (going to 100% in the population) is 0.10, or 10% 40 Probability of fixation of an allele Probability of fixation of an allele = the allele’s starting frequency (Sewall Wright 1931) That is, if the frequency of an allele is 0.10 (10% in the population), then its probability (chances) of fixation (going to 100% in the population) is 10% WHY? 41 Probability of fixation of an allele If a population has 2N alleles The probability of each allele being fixed = 1 2N If there are X # copies of that allele in the population, then the probability of fixation for that allele is X 2N This is the proportion (%) of alleles in the population 42 Genetic Drift: Random Fixation 43 Genetic Drift: Random Fixation After 7 generations, all allelic diversity was lost… the population became fixed for allele #1 just due to chance 44 Genetic Drift: Random Fixation As populations get smaller, the probability of fixation goes up 45 46 Heterozygosity Heterozygosity: frequency of heterozygotes in a population (% of heterozygotes) Often used as an estimate of genetic variation in a population HW expected frequency of heterozygotes in a population = 2p(1-p) As genetic drift drives alleles toward fixation or loss, the reduction in number of alleles causes the frequency of heterozygotes to go down 47 Heterozygosity The Expected Heterozygosity following a population bottleneck (frequency of Heterozygotes in the next generation = Hg+1): g+1 g e Note in this equation that as population size 2Ne gets small, heterozygosity in the next generation Hg+1, goes down 48 Grey line = theoretical expected heterozygosity: g+1 g e Blue line = observed heterozygosity (% heterozygotes in population) 49 Heterozygosity declines faster in smaller populations 50 Loss of Variation Allele frequencies 0.36 Lots of Alleles: 0.16 0.20 0.24 0.04 51 Lots of alleles --> Lots of genotypes: Possible genotype combinations 52 Loss of Allelic Variation Allele frequencies Due to Genetic Drift 0.53 0.067 0.20 0.20 0 53 Loss of Allelic Variation due to Genetic Drift results in Increased Homozygosity Fewer possible genotype combinations Increase in homozygosity 54 No Genetic Variation = No Natural Selection 55 No Genetic Variation = No Natural Selection Selection has nothing to act on 56 Genetic Drift and Natural Selection Because of the randomness introduced by Genetic Drift, Natural Selection is less efficient when there is genetic drift Thus, Natural Selection is more efficient in larger populations, and less effective in smaller populations 57 Allele A1 Demo • Impact of Effective Population Size • Impact of starting allele frequency • Interaction between Natural Selection and Genetic Drift 58 How do you detect Genetic Drift? Random fluctuations in allele frequencies Fluctuations in non-coding and non-functional regions of the genome The same pattern of fluctuations across these regions of the genome (same demographic factors would act across the genome) - but mtDNA not same Fluctuations in allele frequencies correspond to demography of the population (population size) 59 Consequences of Genetic Drift ---> Random fluctutations in allele frequency If population size is reduced and Drift acts more intensely, we get: 1. At the Allelic level: Random fixation of Alleles (loss of alleles) 2. At the Genotypic level: Inbreeding, Reduction in Heterozygosity (because of fewer alleles) 60 Dilemma for Conservationists • It is easier to remove genetic diversity than create it… especially variation that is potentially adaptive Census size: Roughly 2000-3000 cheetahs live in Namibia today The effective population size is much lower 61 2. Genetic Drift often leads to Inbreeding Inbreeding is a consequence of Genetic drift in small populations, resulting from loss of alleles Due to loss of alleles, there is an increase in homozygosity 62 With small population size, heterozygosity goes down, homozygosity goes up 63 Inbreeding Mating among genetic relatives, often because of small population size Alleles at a locus will more likely become homozygous A consequence of loss of alleles due to genetic drift (reduction in population size) 64 Calculations like these are used in genetic counseling and animal breeding to calculate F, the coefficient of inbreeding (developed by Sewall Wright) Show popgen pedigree F = fixation index, a measure of homozygosity; the probability that two genes at any locus in an individual are identical by descent from the common ancestor(s) of the two parents (homozygous, rather than heterozygous). 65 grandmother Probability of having same allele (either triangle or circle) = 1/2 Other grandparent Other grandparent 1/2 mother 1/2 father 1/2 Calculating the Inbreeding Coefficient F Probability of getting two circles or two triangles, identical by descent = ½ x ½ x ½ = 1/8 66 grandmother Probability of having same allele (either triangle or circle) = 1/2 Other grandparent Other grandparent 1/2 mother 1/2 father 1/2 Probability of getting two circles or two triangles, identical by descent = ½ x ½ x ½ = 1/8 The inbreeding coefficient is the measure of relatedness between two individuals… used a lot in agriculture, genetic counseling, 67 etc. small population size genetic drift loss of alleles decrease in heterozygosity increase in homozygosity increase of exposure of deleterious alleles 68 Consequences of Inbreeding Increase in Homozygosity: Exposure of recessive alleles (that could be subjected to selection) Inbreeding Depression (reduction in survival and fitness) Lower genotypic diversity (poor response to natural selection) 69 Consequences of Inbreeding • Deleterious recessive alleles are exposed as homozygotes (and less masked in the heterozygous state) • These deleterious recessive alleles get removed out of an inbred population more rapidly, because they are exposed to natural selection ---> this process is aided by Sex, discussed in Next Lecture (on Variation) 70 Mechanism: Most deleterious alleles are recessive Alleles On average each of us carries 35 lethal recessive alleles Locus1 A a These deleterious alleles are expressed in inbred individuals Locus2 B b Locus3 C c In Heterozygotes, these deleterious recessives are masked, and not exposed to selection 71 Increasing genetic distance Fitness Inbreeding Depression Hybrid Vigor Outbreeding Depression 72 Examples of Inbreeding 73 74 Inbreeding and Conservation Vexing Problem: How do we prevent extinctions? How do we maximize genetic variance in a population? What should we preserve? (species, populations, habitats, ecosystems???) 75 Inbreeding and Agriculture: Because we have focused excessively on a few breeds, effective population sizes of many crops are incredibly small 600 breeds of livestock have disappeared ~78 breeds lost each year 1000-1500 breeds of livestock are considered at risk of extinction (30% of current breeds) 76 8 million Holstein cows are descendents of 37 individuals!!! Neurological disorders Autoimmune diseases Fertility problems 77 Recent efforts have focused on outcrossing to increase fitness in agricultural stocks Duvick, D. N. 2002. Biotechnology in the 1930s: the development of hybrid maize. Nature Reviews Genetics. 2: 69-74. Bruce, W. B., G. O. Edmeades, and T. C. Barkre. 2002. Review Article: Molecular and physiological approaches to maize improvement for drought tolerance. Journal 78 of Experimental Botany. 53: 13-25. Inbreeding depression in humans 79 Genetic Diseases due to Inbreeding have afflicted many Royal Families George III Insane due to Porphyria Examples: Porphyria: accumulation of porphyrin precursors, causes insanity; Dominant, but more intense in the homozygous form Mary, Queen of Scots George III, loss of American colony Acromegaly: Overproduction of GH by the pituitary gland; recessive Charles II, King of Spain (1665-1700) Hemophilia: “Victoria’s Secret” Charles II “Hapsburg Jaw” X-linked, shows up in males Cognitively disabled Daughters spread it among the royals, impotent downfall of Russian monarchy 80 Porphyria in the Royal Families of Europe George III Dominant, but more intense in the homozygous form 81 Royal Pedigree of the “Hapsburg Jaw” In small populations, individuals tend to mate with relatives = expression of the disorder 82 Royal Pedigree of Hapsburg Jaw = expression of the disorder = marriage with close relative Inbreeding within the Royal Families of Europe was genetically “disasterous” 83 Inbreeding within Royal Families has been reduced in recent years Diana was a 7th cousin of Charles, rather than a first or second cousin (which was the normal practice previously) Kate Middleton is neither an aristocrat nor a royal, but the marriage was approved given the modern recognition of the perils of inbreeding… this marriage is an example of genetic immigration (Migration) 84 into an enclosed population Articles on British Royalty and Inbreeding: News Article: http://www.telegraph.co.uk/news/worldnews/e urope/spain/5158513/Inbreeding-causeddemise-of-the-Spanish-Habsburg-dynastynew-study-reveals.html Original Scientific Article: http://www.plosone.org/article/info:doi/10.1371/jou rnal.pone.0005174 85 Amish in Lancaster County, PA Descendents of ~200 Swiss who emigrated mid-1700s. Closed genetic population for 12+ generations. Host to 50+ identified inherited disorders (many more uncharacterized) 86 Examples of Diseases that Afflict Amish Populations High levels of infant mortality High Incidence of brain damage Large number of metabolic disorders, such as PKU Ellis-van Creveld (dwarfism) syndrome 1/14 in Amish Glutaric aciduria type I 1/200 in Amish High incidence of bipolar disorder Polydactyly (6 fingers) 87 There is nothing special about the Amish, genetically speaking (or Royal Families). • There are about ~200 of you here, similar to the size of the founding Amish population in Lancaster county, Pennsylvania. • If all of you were stranded on a desert island, eventually your descendents would be related and mating with one another. • Each of us carries 3-5 lethal recessive alleles. • If 200 people each had 3-5 different recessive lethals and started forming a closed population, eventually, hundreds of genetic disorders would emerge. 88 Dating Service FIND Your Genetically Most Distant Mate!!!! (For example, Scientificmatch.com matches people that differ in their MHC loci; people that differ at these loci tend to be more attracted to one another) 89 Summary (1) Genetic Drift occurs when a population is small, and leads to random loss of alleles due to sampling error (2) Inbreeding, an extreme consequence of Genetic Drift, results in increases in homozygosity at many loci, including recessive lethals 90 CONCEPTS Genetic Drift Effective population size Fixation Heterozygosity Inbreeding Inbreeding Depression Heterozygote advantage 91 1. When is Genetic Drift LEAST likely to operate? (A) When there are migrants between populations (B) When populations are fragmented and isolated (C) When population size is small (D) When population size is very very large 92 2. Which of the following is FALSE regarding inbreeding? (A) Inbreeding could result as a consequence after genetic drift has acted on a population (B) Populations with lower allelic diversity tend to have lower genotypic diversity (more homozygous) (C) Selection acts more slowly in inbred populations to remove deleterious recessive alleles (D) One way to reduce inbreeding in a population is to bring in migrants from another population 93 3. Which of the following is FALSE regarding Genetic Drift? (A) Genetic Drift occurs more rapidly in smaller populations (B) In populations of finite size, random changes in allele frequency occur at each generation (C) Genetic Drift could lead to inbreeding due to loss of alleles (D) Genetic Drift occasionally leads to the generation of favorable alleles 94 Answers to sample exam questions: 1D 2C 3D 95