Chapter 6. Multi-locus coevolution, epistasis, and linkage

advertisement
Chapter 6. Multi-locus coevolution, epistasis, and linkage disequilibrium
Biological Motivation
Obviously, more than a dsingle locus is involved. Here we develop a basic framework for studying two
locus systems introducing the concepts of epistasis, recombination, and linkage disequilibrium. After
studying how coevolution proceeds in a simple two locus system (motivated by???) we move on to
explore ??? INTRODUCE EPISTASIS AND LINKAGE DISEQUILIBRIUM
Key Questions:
ο‚·
ο‚·
ο‚·
What patterns of epistasis are likely to be generated by species interactions?
How do these patterns of epistasis influence the dynamics and outcome of coevolution?
What patterns of linkage disequilibrium do we expect to emerge in coevolving systems?
Building a 2-locus model of coevolution
Our goal is to develop the simplest possible model that captures the potentially important
consequences of the multi-locus gene-for-gene interactions for coevolution between X and X. Clearly,
the simplest starting point is to focus on only a single pair of loci and haploid sexual species. Within
haploid sexuals, recombination occurs in a transient diploid phase but selection occurs in the haploid
phase. Thus, we avoid the complexities of diploidy that we struggled with in the previous chapter. Of
course, ignoring diploidy also comes at the cost of reduced realism since both XX and XX are, indeed,
diploid species.
We imagine that rusts and flax’s run into each other at random, and that this has negative fitness
consequences for the flax and posoitive fitness consequences for the rust… Assuming random
encounters and that the probability of infection depends upon the two locus genotypes of flax and rust,
the fitness of the four possible Flax genotypes is given by:
π‘Šπ‘‹,𝐴𝐡 = 1 − 𝑠𝑋 (π‘Œπ΄π΅ 𝛼𝐴𝐡,𝐴𝐡 + π‘Œπ΄π‘ 𝛼𝐴𝐡,𝐴𝑏 + π‘Œπ‘Žπ΅ 𝛼𝐴𝐡,π‘Žπ΅ + π‘Œπ‘Žπ‘ 𝛼𝐴𝐡,π‘Žπ‘ )
(1a)
π‘Šπ‘‹,𝐴𝑏 = 1 − 𝑠𝑋 (π‘Œπ΄π΅ 𝛼𝐴𝑏,𝐴𝐡 + π‘Œπ΄π‘ 𝛼𝐴𝑏,𝐴𝑏 + π‘Œπ‘Žπ΅ 𝛼𝐴𝑏,π‘Žπ΅ + π‘Œπ‘Žπ‘ 𝛼𝐴𝑏,π‘Žπ‘ )
(1b)
π‘Šπ‘‹,π‘Žπ΅ = 1 − 𝑠𝑋 (π‘Œπ΄π΅ π›Όπ‘Žπ΅,𝐴𝐡 + π‘Œπ΄π‘ π›Όπ‘Žπ΅,𝐴𝑏 + π‘Œπ‘Žπ΅ π›Όπ‘Žπ΅,π‘Žπ΅ + π‘Œπ‘Žπ‘ π›Όπ‘Žπ΅,π‘Žπ‘ )
(1c)
π‘Šπ‘‹,π‘Žπ‘ = 1 − 𝑠𝑋 (π‘Œπ΄π΅ π›Όπ‘Žπ‘,𝐴𝐡 + π‘Œπ΄π‘ π›Όπ‘Žπ‘,𝐴𝑏 + π‘Œπ‘Žπ΅ π›Όπ‘Žπ‘,π‘Žπ΅ + π‘Œπ‘Žπ‘ π›Όπ‘Žπ‘,π‘Žπ‘ )
(1d)
Similarly, the fitness of the four possible Rust genotypes is given by:
π‘Šπ‘Œ,𝐴𝐡 = 1 − π‘ π‘Œ (1 − 𝑋𝐴𝐡 𝛼𝐴𝐡,𝐴𝐡 − 𝑋𝐴𝑏 𝛼𝐴𝑏,𝐴𝐡 − π‘‹π‘Žπ΅ π›Όπ‘Žπ΅,𝐴𝐡 − π‘‹π‘Žπ‘ π›Όπ‘Žπ‘,𝐴𝐡 )
(2a)
π‘Šπ‘Œ,𝐴𝑏 = 1 − π‘ π‘Œ (1 − 𝑋𝐴𝐡 𝛼𝐴𝐡,𝐴𝑏 − 𝑋𝐴𝑏 𝛼𝐴𝑏,𝐴𝑏 − π‘‹π‘Žπ΅ π›Όπ‘Žπ΅,𝐴𝑏 − π‘‹π‘Žπ‘ π›Όπ‘Žπ‘,𝐴𝑏 )
(2b)
Mathematica Resources: http://www.webpages.uidaho.edu/~snuismer/Nuismer_Lab/the_theory_of_coevolution.htm
π‘Šπ‘Œ,π‘Žπ΅ = 1 − π‘ π‘Œ (1 − 𝑋𝐴𝐡 𝛼𝐴𝐡,π‘Žπ΅ − 𝑋𝐴𝑏 𝛼𝐴𝑏,π‘Žπ΅ − π‘‹π‘Žπ΅ π›Όπ‘Žπ΅,π‘Žπ΅ − π‘‹π‘Žπ‘ π›Όπ‘Žπ‘,π‘Žπ΅ )
(2c)
π‘Šπ‘Œ,π‘Žπ‘ = 1 − π‘ π‘Œ (1 − 𝑋𝐴𝐡 𝛼𝐴𝐡,π‘Žπ‘ − 𝑋𝐴𝑏 𝛼𝐴𝑏,π‘Žπ‘ − π‘‹π‘Žπ΅ π›Όπ‘Žπ΅,π‘Žπ‘ − π‘‹π‘Žπ‘ π›Όπ‘Žπ‘,π‘Žπ‘ )
(2d)
Now, if we assume that the probability of survival to mating for the various Flax and Rust genotypes
depends on these fitnesses, we can calculate the frequency of each genotype after selection but prior to
random mating. As before, we can calculate these frequencies by multiplying the current frequency by
its relative fitness. For the Flax, this yields the following expressions:
′
𝑋𝐴𝐡
=
𝑋𝐴𝐡 π‘Šπ‘‹,𝐴𝐡
̅𝑋
π‘Š
(3a)
′
𝑋𝐴𝑏
=
𝑋𝐴𝑏 π‘Šπ‘‹,𝐴𝑏
̅𝑋
π‘Š
(3b)
′
π‘‹π‘Žπ΅
=
π‘‹π‘Žπ΅ π‘Šπ‘‹,π‘Žπ΅
̅𝑋
π‘Š
(3c)
′
π‘‹π‘Žπ‘
=
π‘‹π‘Žπ‘ π‘Šπ‘‹,π‘Žπ‘
̅𝑋
π‘Š
(3d)
̅𝑋 is the population mean fitness of species X and is given by:
where, as usual, the symbol π‘Š
̅𝑋 = 𝑋𝐴𝐡 π‘Šπ‘‹,𝐴𝐡 + 𝑋𝐴𝑏 π‘Šπ‘‹,𝐴𝑏 + π‘‹π‘Žπ΅ π‘Šπ‘‹,π‘Žπ΅ + π‘‹π‘Žπ‘ π‘Šπ‘‹,π‘Žπ‘
π‘Š
(3e)
The same procedure can now be applied to the rust population to calculate the frequency of two-locus
genotypes there after selection but prior to mating:
′
π‘Œπ΄π΅
=
π‘Œπ΄π΅ π‘Šπ‘Œ,𝐴𝐡
Μ…π‘Œ
π‘Š
(4a)
′
π‘Œπ΄π‘
=
𝑋𝐴𝑏 π‘Šπ‘Œ,𝐴𝑏
Μ…π‘Œ
π‘Š
(4b)
′
π‘Œπ‘Žπ΅
=
π‘Œπ‘Žπ΅ π‘Šπ‘Œ,π‘Žπ΅
Μ…π‘Œ
π‘Š
(4c)
′
π‘Œπ‘Žπ‘
=
π‘Œπ‘Žπ‘ π‘Šπ‘Œ,π‘Žπ‘
Μ…π‘Œ
π‘Š
(4d)
̅𝑋 is the population mean fitness of species X and is given by:
where, as usual, the symbol π‘Š
Μ…π‘Œ = π‘Œπ΄π΅ π‘Šπ‘Œ,𝐴𝐡 + π‘Œπ΄π‘ π‘Šπ‘Œ,𝐴𝑏 + π‘Œπ‘Žπ΅ π‘Šπ‘Œ,π‘Žπ΅ + π‘Œπ‘Žπ‘ π‘Šπ‘Œ,π‘Žπ‘
π‘Š
(4e)
OK, so now we know what the frequencies of the various genotypes are just before mating ensues. How
can we now move forward to incorporate changes to genotype frequencies that accrue during the
process of mating?
If we are willing to assume that both Flax and Rust mate at random and have quite large
population sizes, we can derive basic expressions for changes in genotype frequencies. The long and
2
tedious way to go about this is to first tabulate the frequency of offspring with various genotypes that
are produced by all possible combinations of parents (Table 1). RECOMBINATION! INTRODUCE IT HERE
Table 1. Genotype frequencies produced by random matings
Maternal|Paternal
genotypes
AB|AB
AB|Ab
AB|aB
AB|ab
Ab|AB
Ab|Ab
Ab|aB
Ab|ab
aB|AB
aB|Ab
aB|aB
aB|ab
ab|AB
ab|Ab
ab|aB
ab|ab
Frequency of mating
AB
𝑋𝐴𝐡 𝑋𝐴𝐡
𝑋𝐴𝐡 𝑋𝐴𝑏
𝑋𝐴𝐡 π‘‹π‘Žπ΅
𝑋𝐴𝐡 π‘‹π‘Žπ‘
𝑋𝐴𝐡 𝑋𝐴𝐡
𝑋𝐴𝐡 𝑋𝐴𝑏
𝑋𝐴𝐡 π‘‹π‘Žπ΅
𝑋𝐴𝐡 π‘‹π‘Žπ‘
𝑋𝐴𝐡 𝑋𝐴𝐡
𝑋𝐴𝐡 𝑋𝐴𝑏
𝑋𝐴𝐡 π‘‹π‘Žπ΅
𝑋𝐴𝐡 π‘‹π‘Žπ‘
𝑋𝐴𝐡 𝑋𝐴𝐡
𝑋𝐴𝐡 𝑋𝐴𝑏
𝑋𝐴𝐡 π‘‹π‘Žπ΅
𝑋𝐴𝐡 π‘‹π‘Žπ‘
1
1/2
1/2
(1 − π‘Ÿ)/2
1/2
0
π‘Ÿ/2
0
1/2
π‘Ÿ/2
0
0
(1 − π‘Ÿ)/2
0
0
0
Offspring genotype
Ab
aB
0
1/2
0
π‘Ÿ/2
1/2
1
(1 − π‘Ÿ)/2
1/2
0
(1 − π‘Ÿ)/2
0
0
π‘Ÿ/2
1/2
0
0
0
0
1/2
π‘Ÿ/2
0
0
(1 − π‘Ÿ)/2
0
1/2
(1 − π‘Ÿ)/2
1
1/2
π‘Ÿ/2
0
1/2
0
ab
0
0
0
(1 − π‘Ÿ)/2
0
0
π‘Ÿ/2
1/2
0
π‘Ÿ/2
0
1/2
(1 − π‘Ÿ)/2
1/2
1/2
1
What Table 1 provides us with is the raw material for calculating the frequency of the various genotypes
in the offspring generation. All we need to do now is sum up the entries in each column, weighting each
entry by the frequency with which the two relevant parental genotypes encounter one another at
random and mate. Mathematically, this amounts to evaluating the following expression for each of the
four possible offspring genotypes, i:
𝑋𝑖′′ = ∑4𝑗=1 ∑4π‘˜=1 𝑋𝑗′ π‘‹π‘˜′ Π𝑋,𝑗+π‘˜→𝑖
(5a)
and the following expression for the four possible offspring genotype in Rust:
π‘Œπ‘–′′ = ∑4𝑗=1 ∑4π‘˜=1 π‘Œπ‘—′ π‘Œπ‘˜′ Ππ‘Œ,𝑗+π‘˜→𝑖
(5b)
where Π𝑋,𝑗+π‘˜→𝑖 and Ππ‘Œ,𝑗+π‘˜→𝑖 are the probability that two parents with genotypes j and k produce an
offspring of genotype i within the Flax and Rust populations, respectively.
Although equations (5) help to see, mechanistically speaking, how the genotype frequencies
within one generation are translated into those of the next through the process of segregation and
recombination, they are quite clunky and not terribly insightful. Fortunately, these equations can be
greatly simplified and re-expressed in a way that is much easier to implement from a practical
standpoint, and also much more biologically insightful. Specifically, plugging away algebraically allows
equations (5) to be re-written as:
3
′′
′
𝑋𝐴𝐡
= 𝑋𝐴𝐡
+ π‘Ÿπ‘‹ 𝐷𝑋′
(6a)
′′
′
𝑋𝐴𝑏
= 𝑋𝐴𝑏
− π‘Ÿπ‘‹ 𝐷𝑋′
(6b)
′′
′
π‘‹π‘Žπ΅
= π‘‹π‘Žπ΅
− π‘Ÿπ‘‹ 𝐷𝑋′
(6c)
′′
′
π‘‹π‘Žπ‘
= π‘‹π‘Žπ‘
+ π‘Ÿπ‘‹ 𝐷𝑋′
(6d)
in the Flax and as
′′
′
π‘Œπ΄π΅
= π‘Œπ΄π΅
+ π‘Ÿπ‘Œ π·π‘Œ′
(7a)
′′
′
π‘Œπ΄π‘
= π‘Œπ΄π‘
− π‘Ÿπ‘Œ π·π‘Œ′
(7b)
′′
′
π‘Œπ‘Žπ΅
= π‘Œπ‘Žπ΅
− π‘Ÿπ‘Œ π·π‘Œ′
(7c)
′′
′
π‘Œπ‘Žπ‘
= π‘Œπ‘Žπ‘
+ π‘Ÿπ‘Œ π·π‘Œ′
(7d)
in the rust. In these equations, DX and DY quantify linkage disequilibrium, a measure of the statistical
′
′
′
′
association or covariance between alleles at the A and B loci. Specifically, 𝐷𝑋′ = 𝑋𝐴𝐡
π‘‹π‘Žπ‘
− 𝑋𝐴𝑏
π‘‹π‘Žπ΅
and
′
′
′
′
′
π·π‘Œ = π‘Œπ΄π΅ π‘Œπ‘Žπ‘ − π‘Œπ΄π‘ π‘Œπ‘Žπ΅ such that linkage disequilibrium is positive if there is an excess of AB and ab
genotypes within a population and negative if it is, instead, the Ab and aB genotypes that are in excess.
A key insight illuminated by equations (6-7) is that the change in genotype frequencies that occurs in
response to random mating depends entirely on the rate of recombination. If no recombination occurs,
genotype frequencies within the offspring population remain identical to those within the parental
population. If, instead, recombination occurs, genotype frequencies in the offspring generation differ
from those in the parental generation by an amount proportional to linkage disequilibrium. Clearly,
then, recombination can influence coevolution only in cases where coevolutionary selection, or some
other evolutionary force, acts to create linkage disequilibrium within populations of interacting species.
We are now at a point where we have successfully described how genotype frequencies change
over the course of a single generation. To maintain some generality, let’s wait to subsititute in the
specific values for fitness corresponding to our GFG model, and simply express how genotype
frequencies change in terms of arbitrary fitness values, W. Specifically, subsitututing (3) into (6) and
changing from recursion equations to difference equations, yields the following expressions for the
change in host genotype frequencies that occurs over the course of a single generation:
βˆ†π‘‹π΄π΅ =
Μ… 𝑋 )π‘Š
̅𝑋
π‘Ÿπ‘‹ (π‘Šπ‘‹,π‘Žπ΅ π‘Šπ‘‹,𝐴𝑏 π‘‹π‘Žπ΅ 𝑋𝐴𝑏 −π‘Šπ‘‹,π‘Žπ‘ π‘Šπ‘‹,𝐴𝐡 π‘‹π‘Žπ‘ 𝑋𝐴𝐡 )+𝑋𝐴𝐡 (π‘Šπ‘‹,𝐴𝐡 −π‘Š
2
Μ…
π‘Šπ‘‹
(8a)
βˆ†π‘‹π΄π‘ =
Μ… 𝑋 )π‘Š
̅𝑋
π‘Ÿπ‘‹ (π‘Šπ‘‹,π‘Žπ‘ π‘Šπ‘‹,𝐴𝐡 π‘‹π‘Žπ‘ 𝑋𝐴𝐡 −π‘Šπ‘‹,π‘Žπ΅ π‘Šπ‘‹,𝐴𝑏 π‘‹π‘Žπ΅ 𝑋𝐴𝑏 )+𝑋𝐴𝑏 (π‘Šπ‘‹,𝐴𝑏 −π‘Š
Μ… 𝑋2
π‘Š
(8b)
βˆ†π‘‹π‘Žπ΅ =
Μ… 𝑋 )π‘Š
̅𝑋
π‘Ÿπ‘‹ (π‘Šπ‘‹,π‘Žπ‘ π‘Šπ‘‹,𝐴𝐡 π‘‹π‘Žπ‘ 𝑋𝐴𝐡 −π‘Šπ‘‹,π‘Žπ΅ π‘Šπ‘‹,𝐴𝑏 π‘‹π‘Žπ΅ 𝑋𝐴𝑏 )+π‘‹π‘Žπ΅ (π‘Šπ‘‹,π‘Žπ΅ −π‘Š
Μ… 𝑋2
π‘Š
(8c)
4
βˆ†π‘‹π‘Žπ‘ =
Μ… 𝑋 )π‘Š
̅𝑋
π‘Ÿπ‘‹ (π‘Šπ‘‹,π‘Žπ΅ π‘Šπ‘‹,𝐴𝑏 π‘‹π‘Žπ΅ 𝑋𝐴𝑏 −π‘Šπ‘‹,π‘Žπ‘ π‘Šπ‘‹,𝐴𝐡 π‘‹π‘Žπ‘ 𝑋𝐴𝐡 )+π‘‹π‘Žπ‘ (π‘Šπ‘‹,π‘Žπ‘ −π‘Š
2
Μ…
π‘Šπ‘‹
(8d)
Equations for the pathogen species, Y, are essentially identical and so are not shown. We are now to a
point where we could, if we wished, simply simulate the process of coevolution by plugging in the values
for fitness we derived previously for the GFG system (EQUSTIONS X) and iterating equations (X).
Although this approach would surely provide us with some insights into the process of a coevolution, a
much more insightful and elegant approach is to first make a change of variables (Appendix 3) that
allows us to focus on allele frequencies and linkage disequilibrium rather than genotype frequencies. In
addition to facilitating biological interpretation and intuition, this change of variables simplifies our
model by reducing the number of variables we follow from four in equations (X) to three, which is the
actual number of degrees of freedom in the system.
In order to make the change of variables from genotype frequencies to allele frequencies and
linkage disequilibrium, we first need to clearly define the new variables. Specifically, we define allele
frequencies:
𝑝𝑋,𝐴 = 𝑋𝐴𝐡 + 𝑋𝐴𝑏
(9a)
𝑝𝑋,𝐡 = 𝑋𝐴𝐡 + π‘‹π‘Žπ΅
(9b)
π‘π‘Œ,𝐴 = π‘Œπ΄π΅ + π‘Œπ΄π‘
(9c)
π‘π‘Œ,𝐡 = π‘Œπ΄π΅ + π‘Œπ‘Žπ΅
(9d)
and linkage disequilibrium:
𝐷𝑋 = 𝑋𝐴𝐡 π‘‹π‘Žπ‘ − 𝑋𝐴𝑏 π‘‹π‘Žπ΅
(10a)
π·π‘Œ = π‘Œπ΄π΅ π‘Œπ‘Žπ‘ − π‘Œπ΄π‘ π‘Œπ‘Žπ΅
(10b)
for both of the interacting species. The next step in our change of variables is to write down new
recursions that capture the way in which our new variables change over the course of a single
generation. The easiest way to do this is to just substitute the predicted values for the genotype
′′
′′
frequencies in the next generation (e.g., 𝑋𝐴𝐡
, 𝑋𝐴𝑏,
etc.) into expressions (9-10), yielding:
′′
𝑝𝑋,𝐴
=
π‘Šπ‘‹,𝐴𝑏 𝑋𝐴𝑏 +π‘Šπ‘‹,𝐴𝐡 𝑋𝐴𝐡
̅𝑋
π‘Š
(11a)
′′
𝑝𝑋,𝐡
=
π‘Šπ‘‹,π‘Žπ΅ π‘‹π‘Žπ΅ +π‘Šπ‘‹,𝐴𝐡 𝑋𝐴𝐡
̅𝑋
π‘Š
(11b)
𝐷𝑋′′ =
(π‘Šπ‘‹,π‘Žπ΅ π‘Šπ‘‹,𝐴𝑏 π‘‹π‘Žπ΅ 𝑋𝐴𝑏 −π‘Šπ‘‹,π‘Žπ‘ π‘Šπ‘‹,𝐴𝐡 π‘‹π‘Žπ‘ 𝑋𝐴𝐡 )(π‘Ÿπ‘‹ −1)
Μ… 𝑋2
π‘Š
′′
π‘π‘Œ,𝐴
=
π‘Šπ‘Œ,𝐴𝑏 π‘Œπ΄π‘ +π‘Šπ‘Œ,𝐴𝐡 π‘Œπ΄π΅
Μ…π‘Œ
π‘Š
(11c)
(12a)
5
′′
π‘π‘Œ,𝐡
=
π·π‘Œ′′ =
π‘Šπ‘Œ,π‘Žπ΅ π‘Œπ‘Žπ΅ +π‘Šπ‘Œ,𝐴𝐡 π‘Œπ΄π΅
Μ…π‘Œ
π‘Š
(12b)
(π‘Šπ‘Œ,π‘Žπ΅ π‘Šπ‘Œ,𝐴𝑏 π‘Œπ‘Žπ΅ π‘Œπ΄π‘ −π‘Šπ‘Œ,π‘Žπ‘ π‘Šπ‘Œ,𝐴𝐡 π‘Œπ‘Žπ‘ π‘Œπ΄π΅ )(π‘Ÿπ‘Œ −1)
Μ… π‘Œ2
π‘Š
(12c)
Obviously, we still have a bit of a problem! Our equations now contain a mix of old and new variables
which can never be a good thing. The way to move forward is to recognize that the genotype
frequencies appearing in the right hand sides of the equations can be re-written using definitions (9-10)
in the following way:
𝑋𝐴𝐡 = 𝑝𝑋,𝐴 𝑝𝑋,𝐡 + 𝐷𝑋
(13a)
𝑋𝐴𝑏 = 𝑝𝑋,𝐴 π‘žπ‘‹,𝐡 − 𝐷𝑋
(13b)
π‘‹π‘Žπ΅ = π‘žπ‘‹,𝐴 𝑝𝑋,𝐡 − 𝐷𝑋
(13c)
π‘‹π‘Žπ‘ = π‘žπ‘‹,𝐴 π‘žπ‘‹,𝐡 + 𝐷𝑋
(13d)
π‘Œπ΄π΅ = π‘π‘Œ,𝐴 π‘π‘Œ,𝐡 + π·π‘Œ
(14a)
π‘Œπ΄π‘ = π‘π‘Œ,𝐴 π‘žπ‘Œ,𝐡 − π·π‘Œ
(14b)
π‘Œπ‘Žπ΅ = π‘žπ‘Œ,𝐴 π‘π‘Œ,𝐡 − π·π‘Œ
(14c)
π‘Œπ‘Žπ‘ = π‘žπ‘Œ,𝐴 π‘žπ‘Œ,𝐡 + π·π‘Œ
(14d)
Substituting (13 and 14) into (11 and 12) and doing a bit of algebra allows us to finally complete our
change of variables and arrive at a set of equations expressed entirely in terms of the new variables.
′′
𝑝𝑋,𝐴
=
𝑝𝑋,𝐴 (π‘žπ‘‹,𝐡 π‘Šπ‘‹,𝐴𝑏 +𝑝𝑋,𝐡 π‘Šπ‘‹,𝐴𝐡 )+(π‘Šπ‘‹,𝐴𝐡 −π‘Šπ‘‹,𝐴𝑏 )𝐷𝑋
′′
𝑝𝑋,𝐡
=
𝑝𝑋,𝐡 (π‘žπ‘‹,𝐴 π‘Šπ‘‹,π‘Žπ΅ +𝑝𝑋,𝐴 π‘Šπ‘‹,𝐴𝐡 )+(π‘Šπ‘‹,𝐴𝐡 −π‘Šπ‘‹,π‘Žπ΅ )𝐷𝑋
𝐷𝑋′′ =
_
π‘Šπ‘‹
_
π‘Šπ‘‹
(π‘Šπ‘‹,π‘Žπ΅ π‘Šπ‘‹,𝐴𝑏 (π‘žπ‘‹,𝐴 𝑝𝑋,𝐡 −𝐷𝑋 )(𝑝𝑋,𝐴 π‘žπ‘‹,𝐡 −𝐷𝑋 )−π‘Šπ‘‹,π‘Žπ‘ π‘Šπ‘‹,𝐴𝐡 (π‘žπ‘‹,𝐴 π‘žπ‘‹,𝐡 +𝐷𝑋 )(𝑝𝑋,𝐴 𝑝𝑋,𝐡 +𝐷𝑋 ))(π‘Ÿπ‘‹ −1)
Μ… 𝑋2
π‘Š
(15a)
(15b)
(15c)
and,
′′
π‘π‘Œ,𝐴
=
π‘π‘Œ,𝐴 (π‘žπ‘Œ,𝐡 π‘Šπ‘Œ,𝐴𝑏 +π‘π‘Œ,𝐡 π‘Šπ‘Œ,𝐴𝐡 )+(π‘Šπ‘Œ,𝐴𝐡 −π‘Šπ‘Œ,𝐴𝑏 )π·π‘Œ
Μ…π‘Œ
π‘Š
(16a)
′′
π‘π‘Œ,𝐡
=
π‘π‘Œ,𝐡 (π‘žπ‘Œ,𝐴 π‘Šπ‘Œ,π‘Žπ΅ +π‘π‘Œ,𝐴 π‘Šπ‘Œ,𝐴𝐡 )+(π‘Šπ‘Œ,𝐴𝐡 −π‘Šπ‘Œ,π‘Žπ΅ )π·π‘Œ
Μ…π‘Œ
π‘Š
(16b)
π·π‘Œ′′ =
(π‘Šπ‘Œ,π‘Žπ΅ π‘Šπ‘Œ,𝐴𝑏 (π‘žπ‘Œ,𝐴 π‘π‘Œ,𝐡 −π·π‘Œ )(π‘π‘Œ,𝐴 π‘žπ‘Œ,𝐡 −π·π‘Œ )−π‘Šπ‘Œ,π‘Žπ‘ π‘Šπ‘Œ,𝐴𝐡 (π‘žπ‘Œ,𝐴 π‘žπ‘Œ,𝐡 +π·π‘Œ )(π‘π‘Œ,𝐴 π‘π‘Œ,𝐡 +π·π‘Œ ))(π‘Ÿπ‘Œ −1)
Μ… π‘Œ2
π‘Š
6
(16c)
PHHHEEEEWWWWYYYY! We have done it. We now have a set of equations describing how allele
frequencies and linkage disequilibrium evolve in response to natural selection and random mating over
the course of a single generation. With the bulk of the mathematical tedium behind us, we can now
move on to use these equations to gain insights into the process of coevolution between rust and flax.
Analyzing the model
With our general model in hand, we can return to flax and flax rust and begin to answer the
questions we posed at the beginning of the chapter. In order to do so effectively, however, we are going
to need to take one final mathematical leap. Specifically, we are going to introduce the idea of a quasilinkage equilibrium (QLE) approximation that will allow us to better understand coevolution both here,
and in later chapters. Although frequently misunderstood, this approximation assumes only that
selection is relatively weak and that recombination is relatively frequent. Often, the idea of weak
selection causes a knee jerk reaction that the approximation does not allow us to understand cases of
“real world” coevolution. In my opinion, this knee jerk reaction is generally misguided. When we say
“weak selection” in the context of a QLE approximation, we mean as a rule of thumb that selection
should be less than 5% per generation. Although this qualifies as weak mathematically, it certainly is
quiet strong, and not often observed, within natural populations. The second misconception about the
QLE is that it assume linkage disequilibrium is zero. Again, this is untrue.
βˆ†π‘π‘‹,𝐴 ≈ 𝑠𝑋 𝑝𝑋,𝐴 π‘žπ‘‹,𝐴 π‘žπ‘Œπ΄ (1 − 𝑝𝑋𝐡 π‘žπ‘Œπ΅ )
(17a)
βˆ†π‘π‘‹,𝐡 ≈ 𝑠𝑋 𝑝𝑋𝐡 π‘žπ‘‹π΅ π‘žπ‘Œπ΅ (1 − 𝑝𝑋𝐴 π‘žπ‘Œπ΄ )
(17b)
βˆ†π·π‘‹ ≈ −𝑠𝑋 (1 − π‘Ÿπ‘‹ )𝑝XA π‘žXA 𝑝XB π‘žXB π‘žYA π‘žYB − π‘Ÿπ‘‹ 𝐷𝑋
(17c)
βˆ†π‘π‘Œ,𝐴 ≈ π‘ π‘Œ 𝑝YA π‘žYA 𝑝XA (1 − 𝑝XB π‘žYB )
(18a)
βˆ†π‘π‘Œ,𝐡 ≈ π‘ π‘Œ 𝑝YB π‘žYB 𝑝XB (1 − 𝑝XA π‘žYA )
(18b)
βˆ†π·π‘Œ ≈ π‘ π‘Œ (1 − π‘Ÿπ‘Œ )𝑝YA π‘žYA 𝑝YB π‘žYB 𝑝XA 𝑝XB − π‘Ÿπ‘Œ π·π‘Œ
(18c)
Now that is a pretty set of equations! The beauty of the QLE approximation, and the primary reason for
using it, is that it allows us to “see” things about the biology of a system that we might otherwise spend
hours upon hours simulating and still never pick up on. For instance, here we can immediately see that
coevolutionary change in allele frequencies is independent of linkage disequilibrium. As a result, we can
solve for the quasi-equilibrium values of linkage disequilibrium very easily by simply setting (17c and
18c) equal to zero and solving for D and D:
̃𝑋 ≈ − 𝑠𝑋 (1−π‘Ÿπ‘‹ )𝑝XA π‘žXA 𝑝XB π‘žXB π‘žYA π‘žYB
𝐷
π‘Ÿ
(19a)
𝑋
7
Μƒπ‘Œ ≈ π‘ π‘Œ (1−π‘Ÿπ‘Œ )𝑝YA π‘žYA 𝑝YB π‘žYB 𝑝XA 𝑝XB
𝐷
π‘Ÿ
(19b)
π‘Œ
A quick inspection of (19) shows that the sign of linkage disequilribium should be different in Flax and
Flax rust. Specifically, linkage disequilibrium between resistance genes within the flax should always be
negative whereas linkage disequilbrium between virulence genes in the rust should always be positive.
Why does this consistent pattern arise? MUST HAVE SOMETHING TO DO WITH THE FACT THAT THE
HOST NEEDS ONLY ONE R GENE TO “WIN” WHEREAS THE PATHOGEN NEEDS TWO V GENES TO “WIN”.
SIGN OF EPISTASIS SHOULD BE DIFFERENT IN THE TWO SPECIES. IS IT? Yup…
Our QLE approximation has already unearthed a valuable insight about our expectations for the form of
epistasis and sign of linkage disequilibrium that we expect to emerge from GFG coevolution. Can we
push our QLE approximation further to learn about the dynamics and outcomes of coevolution? The
place to start is with an analysis of allele frequency change.
Answers to key questions
New Questions Arising:
Extensions
Extension 1: Snails and schistosomes
Extension 2: Quantitative traits
Conclusions and Synthesis
Like dominance before, epistasis plays an important role in the dynamics and outcome of coevolution.
Yet, here too, we know so very little about actual patterns of epistasis within real systems it is almost
shocking; certainly humbling. This, along with dominance, is the frontier of coevolutionary genetics.
8
References
Figure Legends
Dybdahl, M. F., C. E. Jenkins, and S. L. Nuismer. 2014. Identifying the Molecular Basis of Host-Parasite
Coevolution: Merging Models and Mechanisms. AMERICAN NATURALIST 184:1-13.
Mitta, G., C. M. Adema, B. Gourbal, E. S. Loker, and A. Theron. 2012. Compatibility polymorphism in
snail/schistosome interactions: From field to theory to molecular mechanisms. Developmental
and Comparative Immunology 37:1-8.
9
Download