Gene-Environment Interaction AGES workshop, Fall 2012 Lindon Eaves – Introduction Tim York – Lots of slides Sarah Medland – Making it work Genotype x Environment Interaction • Genetic control of sensitivity to the environment • Environmental control of gene expression The Biometrical Genetic Approach: GxE = Genetic Control of “Sensitivity” to environment See e.g. Mather and Jinks. Like any other phenotype – individual differences in sensitivity may be genetic Alternative Approach GxE = “Environmental modulation” of path from genotype to phenotype See e.g. Purcell “a,c,e modulated by covariate” CARE! Results depend on scale – often remove GxE (and other “interactions” – e.g. epistasis) by change in scale Often try to choose units so model is additive/linear and simple. “Simplicity is truth” (?) “Environment” • Micro-environment (unmeasured, random) “E” and “C” in twin models • Macro-environment (measured, ?”fixed”) SES, life events, exposure, smoking • “Independent” (of genotype) • “Correlated with genotype – of individual (“active/evocative”) or relative (“passive”) Animals, Plants and Microorgansims • GxE common property of genetic systems • Ranking of lines/strains changes over environments: ?“genetic correlation across environments” • Polygenic: genes affecting response to E widely distributed across genome (Caligari and Mather) • GxE usually small relative to main effects of G and E (“power”?) • Can select separately for means and slopes (“different genes cause GxE”) • Some genes affect response to specific environmental features (Na, K, rain, diet, separation) • Some genes affect response to overall quality of environment (“stress”) • GxE adaptive – own genetic architecture (A,D etc) – genetic vulnerability, resilience, sensation-seeking, harm avoidance (rGE) • GxE scale dependent – choose units of measurement Genotype x Environment Interaction Where does GxE go in “ACE” model? • AxE -> “E” • AxC -> “A” See Jinks, JL and Fulker, DW (1970) Psych. Bulletin Contributions of Genetic, Shared Environment, Genotype x Shared Environment Interaction Effects to Twin/Sib Resemblance Shared Environment Additive Genetic Effects Genotype x Shared Environment Interaction MZ Pairs 1 1 1x1=1 DZ Pairs/Full Sibs 1 ½ 1x½=½ In other words—if gene-(shared) environment interaction is not explicitly modeled, it will be subsumed into the A term in the classic twin model. Contributions of Genetic, Unshared Environment, Genotype x Unshared Environment Interaction Effects to Twin/Sib Resemblance Unshared (Unique) Environment Additive Genetic Effects Genotype x Unshared Environment Interaction MZ Pairs 0 1 0x1=0 DZ Pairs/Full Sibs 0 ½ 0x½=0 If gene-(unshared) environment interaction is not explicitly modeled, it will be subsumed into the E term in the classic twin model. Testing for GxE and GxC • Plot absolute within MZ pair differences against pair means (CARE!!) • Plot absolute within DZ pair differences against shared measured environment Basic Model (1960’s) Yi |E = gi + biE + di Yi |E = phenotype of ith genotype in environment E gi = main effect (“intercept”) of ith genotype bi = slope (“sensitivity”) of ith genotype in response to environment (gi , bi) ~ N[m,G] GxE: A biometrical genetic model A A 1 2 M g2 g1 b a E e P b=m + g1A1 + g2A2 ith phenotype in kth level of measured environment Pi|Mk= aA1i + (m + g1A1i + g2A2i)Mk+ eEik = aA1i + (m + g1A1i + g2A2i)Mk+ eEik = ck+ (a + g1Mk) A1i + g2MkA2i+ eEik Expected twin covariance conditional on environments of first and second twins Cov(Pi1|Mk ,Pi2|Ml ) = [a2 + ag1(Mk+Ml)+ (g12+g22)MkMl]r + Cov(Ei1k,Ei2l) r = genetic correlation (1 in MZs) No GxE: Genetic covariances and correlations No G x E: Genetic Covariance as Function of Environment No G x E: Genetic Correlation as Function of Environment viro nm e nt alpha=0.5, gamma1=0.0, gamma2=0.0 me nt viro n En Tw in 1 En viro nm e Tw in 2 En Tw in 2 En viro n me nt nce varia c Co ion rrelat tic Co Gene ti Gene Tw in 1 nt alpha=0.5, gamma1=0.0, gamma2=0.0 Scalar GxE: Genetic covariances and correlations Scalar G x E: Genetic Covariance as Function of Environment Scalar G x E: Genetic Correlation as Function of Environment me n t alpha=0.5, gamma1=0.4, gamma2=0.0 En viro n Tw in 1E n vi ron me nt Tw in 2 1E nvi ron Tw in 2 En vi ron m ent me nt lation Corre nce varia tic Gene tic Co Gene Tw in alpha=0.5, gamma1=0.4, gamma2=0.0 Non-Scalar GxE: Genetic covariances and correlations Non-scalar G x E: Genetic Covariance as Function of Environment Non-scalar G x E: Genetic Correlation as Function of Environment tic Gene tic Gene la Corre nm e nt alpha=0.5, gamma1=0.0, gamma2=0.4 viro n En Tw in 1 En viro nm en t Tw in 2 En viro Tw in 2 En vi ron m ent me nt tion e rianc Cova Tw in 1 alpha=0.5, gamma1=0.0, gamma2=0.4 Mixed GxE: Genetic covariances and correlations Mixed G x E: Genetic Covariance as Function of Environment Mixed G x E: Genetic Correlation as Function of Environment tic Gene viro nm e nt alpha=0.5, gamma1=0.4, gamma2=0.3 En viro n Tw in 1 En viro n me n Tw in 2 En Tw in 2 En viro n me nt me nt lation Corre nce varia tic Co Gene Tw in 1 t alpha=0.5, gamma1=0.4, gamma2=0.3 Note: Can’t separate components of “mixed” GxE unless you have same or related genotypes (MZ and/or DZ pairs) in concordant and discordant environments. [c.f. Analysis of sex limited gene effects]. Modulation of gene expression by measured environment [and environmental modulation of non-genetic paths] Ways to Model Gene-Environment Interaction in Twin Data • Multiple Group Models – (parallel to testing for sex effects using multiple groups) Sex Effects Females 1.0 (M Z ) / .5 (D Z ) A1 aF C1 cF P1 E1 eF Males 1.0 1 .0 (M Z ) / .5 (D Z ) A2 aF C2 cF P2 E2 eF A1 aM C1 cM P1 E1 eM 1 .0 A2 aM C2 cM P2 E2 eM Sex Effects Females 1.0 (M Z ) / .5 (D Z ) A1 aF C1 cF E1 eF P1 aF = aM ? Males 1.0 1 .0 (M Z ) / .5 (D Z ) A2 aF C2 cF E2 A1 eF P2 aM C1 cM E1 eM 1 .0 A2 aM P1 cF = cM ? C2 cM P2 eF = eM ? E2 eM GxE Effects Urban 1.0 (M Z ) / .5 (D Z ) A1 aF C1 cF E1 eF P1 aU = aR ? Rural 1.0 1 .0 (M Z ) / .5 (D Z ) A2 aF C2 cF E2 A1 eF P2 aM C1 cM E1 eM 1 .0 A2 aM P1 cU = cR ? C2 cM P2 eU = eR ? E2 eM Problem: • Many environments of interest do not fall into groups – Regional alcohol sales – Parental warmth – Parental monitoring – Socioeconomic status • Grouping these variables into high/low categories loses a lot of information Standard model • Means vector (m m • Covariance matrix a2 + c2 + e2 Za 2 + c 2 2 2 2 a +c +e Model-fitting approach to GxE A C a c E e A C a c E e m M m Twin 1 Twin 2 M Model-fitting approach to GxE A a+bXM m M C c E e Twin 1 A a+bXM C c E e Twin 2 m M Individual specific moderators A a+bXM1 C c E e A a+bXM2 C c E e m M m Twin 1 Twin 2 M E x E interactions A a+bXM1 C E c+bYM1 e+bZM1 A a+bXM2 C c+bYM2 e+bZM2 m M E m Twin 1 Twin 2 M ‘Definition variables’ in Mx • General definition: Definition variables are variables that may vary per subject and that are not dependent variables • In Mx: The specific value of the def var for a specific individual is read into a matrix in Mx when analyzing the data of that particular individual ‘Definition variables’ in Mx create dynamic var/cov structure • Common uses: 1. To model changes in variance components as function of some variable (e.g., age, SES, etc) 2. As covariates/effects on the means (e.g. age and sex) 1.0 (MZ) / .5 (DZ) • Classic Twin Model: Var (T) = a2 + c2 + e2 A1 a C1 c E1 e 1.0 A2 C2 a c Twin 1 E2 e Twin 2 • Moderation Model: Var (T) = (a + βXM)2 + (c + βYM)2 + (e + βZM)2 A a + βXM C e + βZM c + βyM m + βMM Purcell 2002, Twin Research E T Var (T) = (a + βXM)2 + (c + βYM)2 (e + βZM)2 Where M is the value of the moderator and Significance of βX indicates genetic moderation Significance of βY indicates common environmental moderation Significance of βZ indicates unique environmental moderation BM indicates a main effect of the moderator on the mean A a + βXM C E e + βZM c + βyM m + βMM T Additional Things to Consider • Unstandardized versus standardized effects Additional Things to Consider Unstandardized versus standardized effects ENVIRONMENT 1 ENVIRONMENT 2 Unstandardized Variance Standardized Variance Unstandardized Variance Standardized Variance Genetic 60 0.60 60 0.30 Common environmental 35 0.35 70 0.35 Unique environmental 5 0.05 70 0.35 Total variance 100 200 Environment may not modulate all the genes C.f. Biometrical genetic model – Different genes may control main effect and sensitivity/slope AS aM AU aS + βXSM aU + βXUM βXS indicates moderation of shared genetic effects M T BXU indicates moderation of unique genetic effects on trait of interest Genotype-Environment Covariance/Correlation (rGE) see e.g. RB Cattell (1965) Gene-environment Interaction • Genetic control of sensitivity to the environment • Environmental control of gene expression Gene-environment Correlation • Genetic control of exposure to the environment • Different genotypes select or create different environments • Different genotypes are exposed to correlated environments (e.g. sibling effects, maternal effects) • Environments select on basis of genotype (Stratification, Mate choice) This complicates interpretation of GxE effects • If there is a correlation between the moderator (environment) of interest and the outcome, and you find a GxE effect, it’s not clear if: – The environment is moderating the effects of genes or – Trait-influencing genes are simply more likely to be present in that environment Ways to deal with rGE • Limit study to moderators that aren’t correlated with outcome – Pro: easy – Con: not very satisfying • Moderator in means model will remove from the covariance genetic effects shared by trait and moderator – Pro: Any interaction detected will be moderation of the trait specific genetic effects – Con: Will fail to detect GxE interaction if the moderated genetic component is shared by the outcome and moderator • Explicitly model rGE using a bivariate framework – Pro: explicitly models rGE – Con: Power to detect BXU decreases with increasing rGE; difficulty converging Getting it to work SARAH!!!! Practical (1): Using Multiple Group Models to test for GxE Adding Covariates to Means Model A C a c E e A C a c E e m+bMM1 M m+bMM2 Twin 1 Twin 2 M Matrix Letters as Specified in Mx Script A C E a+bXM1 c+bYM1 a +aM*D1 e+eM*D1 Twin 1 C a+bXM2 c+cM*D2 e+b M Z 2 a+aM*D2 e+eM*D2 Twin 2 M m+bMM2 m+bMM1 mu+b*D1 E c+bYM2 c+cM*D1 e+b M Z 1 M A Main effects and moderating effects mu+b*D2 Practical (2): Using Definition Variables to test for GxE Matrix Letters as Specified in Mx Script A C E a+bXM1 c+bYM1 a +aM*D1 c+cM*D1 e+b M Z 1 Twin 1 C E c+bYM2 e+eM*D1 M A a+bXM2 c+cM*D2 e+b M Z 2 a+aM*D2 e+eM*D2 Twin 2 M m m mu mu ‘Definition variables’ in Mx create dynamic var/cov structure • Common uses: 1. To model changes in variance components as function of some variable (e.g., age, SES, etc) 2. As covariates/effects on the means (e.g. age and sex) Definition variables used as covariates General model with age and sex as covariates: yi = a + b1(agei) + b2 (sexi) + e Where yi is the observed score of individual i, a is the intercept or grand mean, b1 is the regression weight of age, agei is the age of individual i, b2 is the deviation of males (if sex is coded 0= female; 1=male), sexi is the sex of individual i, and e is the residual that is not explained by the covariates (and can be decomposed further into ACE etc). Allowing for a main effect of X • Means vector ( m + b X 1i m + b X 2i • Covariance matrix a2 + c2 + e2 Za 2 + c 2 2 2 2 a +c +e Common uses of definition variables in the means model • Incorporating covariates (sex, age, etc) • Testing the effect of SNPs (association) • In the context of GxE, controlling for rGE