Twin studies

advertisement
Principles of Genetic
Epidemiology
Kirsten Ohm Kyvik
Genetic epidemiology
Genetic epidemiology deals with the etiology,
distribution, and control of disease
(epidemiology) in groups of relatives and with
inherited causes of disease (genetics) in
populations (adapted from Morton and Chung
1978)
Steps in genetic epidemiology
•
•
•

Evidence for familial aggregation
Is familial aggregation due to genes or environment?
Specific genetic mechanisms
Taking advantage of designs involving
 Families
 Twins
 Adoptees and their families
Fundamentals
 Definition of phenotype
 Classification of phenotype
 Natural history of phenotype
Adaptation of concept of causation
 Family status changes risk profile
 Observations on family members not independent
 Boundary between cohort and case-control studies is
blurred
Multifactorial inheritance
Monogenic
Mød en forsker
Quantitativ
T
R
E
S
H
O
L
D
M
O
D
E
L
Family studies
Design of familiestudies

Identify probands – ”ascertainment probability”

Information on phenotype in relatives (1.degree, 2.
degree etc.)

Compare groups of relatives

Compare with background population
Familial aggregation = genetic aetiology?
Against:
 Effect of:
Groups of relatives
Risk of siblings compared to risk in parent-offspring
• RR(sib) = RR(par)
• RR(sib) >> RR(par
• RR(sib) and RR(par) small, but bigger than
population risk
Expected risk pattern
70
60
Risiko %
50
40
Family risk
30
20
10
0
MZ twins
1-degr
2-degr
3-degr
Parkinson’s disease in Iceland
(Sveinbjørnsdottir et al. NEJM, 2000)
Relatives
Risk ratio
p
(family vs
population)
Sibling
6.3
<0.001
Children
3.0
0.001
Nephew/niece
2.4
<0.001
Cousin
2.4
0.1
Spouse
1.9
0.16
Genetic epidemiology of
infantile hypertrophic pyloric stenosis
The IHPS register
•
•
•
•
•
Funen based
Cases from 1950 to 2004
A total of 892 cases, 870 identified in CPR
Questionnaire send to all cases
Reply from 65%
Smoothed prevalence
7
6
5
4
3
2
1
0
1950
1960
1970
1980
Year
All
Girls
1990
Boys
2000
Recurrence risk in relatives
Recurrence risk % (95% Confidence Interval)
Group
Female
Population
1.degree
0.11
Male
(0.060.15)
0.43
All
(0.400.46)
0.27
(0.240.30)
5.7
(3.9-9.5)
4.4
(3.4-6.1)
4.8
(4.1-7.0)
Parent
4.5
(1.4-7.4)
3.9
(2.4-5.7)
4.0
(2.9-6.2)
Offspring
4.5
(0.145.3)
4.5
(0.108.3)
4.5
(0.248.3)
11.4
(4.017.5)
5.1
(3.010.8)
6.6
(4.7-9.8)
Siblings
2. degree
Grandpa
rents
0.76
(-0.131.5)
0.51
(0.101.1)
0.57
(0.201.0)
Twin studies
Aims
•
•
•
•
•
•
What is the risk/recurrence risk in twins
Is a phenotype genetically determined
Aetiological models
Size of genetic variation / heritability
Genes, markers, chromosomal regions
Environmental determinants
DESIGNS
 Classical
twin study
 Classical
twin study with separated
MZ twins
 Twin
family studies
 Twin-control
studies
Classical twin study
MZ pairs:
DZ pairs:
DESIGNS
 Classical
twin study
 Classical
twin study with separated
MZ twins
 Twin
family studies
 Twin-control
studies
Is a phenotype genetically determined?
• Categorical data
• Continous data
Types of concordance
Pairwise: Probability that both in a pair is
affected:
Casewise/probandwise: Probability that a twin
is diseased given that the twin partner is
diseased:
Probandwise concordance
Estimate of the casewise probability by the
proband method.
2C1 + C2
-----------------
2C1 + C2 + D
Concordance
CMZ = CDZ
CMZ > CDZ
CMZ <1.0 (100%)
Solutions to problems with age at diagnosis

Survival analysis
Actuarial/Kaplan Meier methodology
Frailty models
Newer models

Others?
Correction methods
Concordance type 1 diabetes
Zygosity
Pairs
(probands)
Conc
Disc
Concordance
Pairwise* Probandwise
MZ
10(18)
16
0.38
[0.20-0.59]
0.53
[0.33-0.73]
0.70
[0.45-0.95)
DZ
4 (8)
65
0.06
[0.02-0.14]
0.11
[0.05-0.21]
0.13
[0.04-0.21]
( ) Number of probands; [ ] 95% confidence limits.
* Chi21d.f. = 10.93, p < 0.001
Cumulated
Cumulative concordance type 1 diabetes
Interpretable as cumulative risk from
birth
%
0-100
MZ
0.70
DZ
0.13
Age
0-40
Correlations
Twin-twin correlations
rMZ = rDZ
rMZ > rDZ
rMZ < 1.0 (100%)
INTRACLASS CORRELATIONS
lnTSH (Pia Skov Hansen)
MZ n=284 pairs
2
DZ n=285 pairs
rMZ=0.64 (CI 0.56-0.70)
2
1
ln T S H in T win 2
1
ln T S H in T win 2
rDZ=0.29 (CI 0.18-0.39)
0
0
-1
-1
-2
-2
-2
-1
0
lnTSH in Twin 1
1
2
p<0.00005
-2
-1
0
lnTSH in Twin 1
1
2
INTRACLASS CORRELATIONS
lnTSH
1
0,9
0,8
0,7
0,6
rMZ
0,5
rDZ
0,4
0,3
0,6625
0,6373
0,6358
0,2
0,3577
0,2915
0,1
0,1814
0
All
Male
Female
Aetiological components
 Additive genetic variance
 Dominant genetic variance/epistasis
 Common environmental variance
 Unique environmental variance
Inheritance Models in Single Gene Trait
Genotype Group
Model
A is
Dominant
A is
Recessive
A is
Co-Dominant
AA
Aa
aa
Inheritance Models in Quantitative Trait
Model
-x
A is Completely
Dominant
aa
A is Partially
Dominant
aa
A is Not
Dominant
aa
Population Mean
0
+x
AA
Aa
Aa
Aa
AA
AA
Heritability




V (total) = VG + VE
V (total) = VA + VD + VI + VC + VE
h2narrow = VA/VA + VD + VI + VC + VE
h2broad = VA + VD + VI/VA + VD + VI + VC + VE
Heritability
 Function of population, NOT a constant
 Does not apply to individuals
 Biased if mean and variance not the same in MZ
and DZ
 Greater MZ covariance will inflate h2
Correlations and aetiological models
rMZ < 1
rMZ = rDZ = 0
rMZ = rDZ > 0
rMZ = 2rDZ > 0
rMZ > 2rDZ
rMZ < 2rDZ
Aetiological models and genetic variation
 Variance analysis
 Regression analysis
 Structural equation modelling
Path model for twin analysis
Pleiotrophy
RESULTS TSH-LEVEL
Unique
Environmental
effect
0.36
Genetic
effect
0.64
The genetic effects account for 64% of the variation
Multivariate ACE Model
BMI
BMI
Waist
Gluc12
0
Ins0
SBP
DBP
HDL
TG
0.86
(0.01)
-0.13
(0.06)
0.48
(0.04)
0.29
(0.04)
0.27
(0.04)
-0.18
(0.05)
0.20
(0.06)
-0.16
(0.06)
0.51
(0.05)
0.30
(0.05)
0.26
(0.05)
-0.19
(0.06)
0.26
(0.06)
0.09
(0.08)
0.12
(0.07)
0.11
(0.07)
-0.02
(0.08)
0.23
(0.08)
0.31
(0.06)
0.29
(0.06)
-0.17
(0.07)
0.31
(0.07)
0.71
(0.03)
-0.09
(0.06)
0.28
(0.06)
-0.01
(0.06)
0.27
(0.06)
Waist
0.85
(0.01)
Gluc12
0
0.02
0.03
(0.03) (0.03)
0.46
Ins0
0.46
(0.02)
SBP
0.28
(0.03)
0.26
(0.03)
0.14
(0.03)
0.23
(0.03)
DBP
0.26
(0.03)
0.23
(0.03)
0.13
(0.03)
0.23
(0.03)
0.69
(0.02)
HDL
-0.17
(0.03)
-0.19
(0.03)
-0.04
(0.03)
-0.14
(0.03)
-0.01
(0.03)
-0.03
(0.03)
TG
0.22
(0.03)
0.27
(0.03)
0.20
(0.03)
0.35
(0.02)
0.20
(0.03)
0.20
(0.03)
(0.02)
0.13
(0.03)
-0.24
(0.07)
-0.22
(0.03)
Important assumptions
• Biology of twinning
• ”True” zygosity
• Equal environment assumption
•
true or not true?
• Generalisability
Adoption studies
Adoption design
Adoptees are expected to
Early death in adoptees
Cause of death
Parent dead < 50 yrs
Parent dead < 70 yrs
Natural
Bio
Ado
1.98*
0.96
1.49
0.8
Infection
Bio
Ado
5.81*
0.73
5*
1
Vasculær
Bio
Ado
4.52*
3.02
1.92
1.5
Cancer
Bio
Ado
1.19
5.16*
0.87
1.49
Assumptions and problems
 Early adoption
 Non-familial adoption
 Comparable environment in biological and adoptive
family
 Contact to biological family
 Intra-uterine environment
 Transcultural adoptions
Comparison of correlations
Correlation
Twin studies
MZ
DZ
MZA
0.7
0.36
0.7
Family studies
PO
Sib
0.27
0.25
Adoption studies
Bio
Ado
0.17
0.1
Comparison heritability
Heritability
Twin studies
MZA
50-90%
60-70%
Family studies
20-80%
Adoption studies
20-60%
Download