Document 13553454

advertisement
17th Annual Mee-ng of the Organiza-on for Human Brain Mapping Introduc-on to Imaging Gene-cs Quebec City, Canada – June 25-­‐10, 2010 Structure and Analysis
of Genetic Variation
Fabio Macciardi MD PhD
Dept of Psychiatry and Human Behavior
Lab of Molecular Psychiatry
University of California, Irvine
Structure & Analysis of
Genetic Variation
•  What we will look into:
–  Genes, promoters, micro RNA
–  SNP, CNV, microsatellites
–  methylation
–  Cis - trans-acting & epistasis *
–  transcriptome to Genes
–  Genes to Pathways
Let s see first how we
have modified our idea
of what a genetic
disorder looks like …
The old paradigm of genetics …
G
Disease
Cystic Fibrosis
Key concepts in genetic epidemiology of complex traits
… but even simple mendelian disorders are not
THAT simple …
… and how now we think genetics works..
Multiple gene variants interacting with each other,
and with multiple environmental factors
G
G
G
G
E
E E
E
Phenotype
Key concepts in genetic epidemiology of complex traits
Variability in human
Variability is the law of life, and
as no two faces are the same,
so no two bodies are alike,
and no two individuals react alike,
and behave alike under the
abnormal conditions which we
know as disease.
Sir William Osler
(1849-1919)
What is a gene? A real example, the DRD4
Regulatory regions
coding region
not coding region
12bp repeat
(+61 to +85)
48 bp
VNTR
C-11T
(SmaI)
120bp
duplication
(PstI)
5’
Exon I
Exon II
(G)n repeat
Gly11Arg
(G+31C)
Exon III
G+492T
(high freq)
Exon IV
Val194Gly
(T+581G)
(low freq)
(low freq)
Single Nucleotide Polymorphism
3’
C+870A
(high freq)
Classes of human genetic variants
Frazer et al., 2009 Nature Rev Genet. 10:241-51.
Key concepts in genetic epidemiology of complex traits
Key-words in genetics
Single Nucleotide
SNPs
A B C D E F G H
Polymorphisms
chr 1
at different loci
alleles
C A A C A T G C
chr 1
alleles
T
C A T C T T
genotype
C
haplotype
an “allele” …
t
en
Example = gene . ic – g .. etic
r
am
… and a haplotype
We found variable letter SNPs in all individuals..
general population
genomic
sequences
... A
C
T
T
T
G
A ...!
T
G
A ...!
!
... A
T
T
T
= SNP
(Single Nucleotide
Polymorphism)
… today we know almost 15 - 18 million common SNPs
(and many more not so common)
…and identify the genetic signature of each of us
Person A
Person B
Person C
Person D
Person A
Person B
Person C
Person D
Key concepts in genetic epidemiology of complex traits
A new technology: DNA MICROARRAYS
Allow us to detect these SNPs ….
Population-based designs: what does practically means?
Population-based :Cases and unrelated population controls from the same study
base
Affected Individuals (CASES)
A
a
656
879
CONTROLS 525
471
CASES
A
a
606
929
CONTROLS 555
441
CASES
p-value = 0.3
Not affected Individuals (CONTROLS)
p-value = 0.1
A
a
856
679
CONTROLS 325
671
CASES
p-value = 0.01 !!!
SIGNIFICANT !! …and so on!
WHOLE GENOME SCAN ASSOCIATION
1
2
3
Known Gene
New Gene
4
5
6
How many genes?
•  In complex traits, there are genes acting together and we
must understand “how” if we want to understand the biology
of disease:
modeling gene^gene interactions – the Epistasis effect
Gene A
Gene B
+
+ +
+ + +
+ +
+
+ + +++++
PNAS 105: 12387-92, 2008
GWAS..here we are..
Outcome of a Genome wide association study
What are CNVs?
Stretches of DNA larger than 1 kb that display copy
number differences in comparison to a reference genome
- Copy number variation (germline, inherited)
- inherited: also present in parents genome
- de novo: absent in parents genome
- Copy number alteration (somatic, e.g. in cancer cells)
- Copy number polymorphism (relatively common CNV,
with a fixed starting/ending position)
- Copy number difference (between-species copy number
differences, e.g chimpanzees and humans)
Types of Genomic Structural Changes Affecting Segments of DNA
Leading to Deletions, Duplications, Inversions, and CNV Changes
(biallelic, Multiallelic, and Complex
The current map of human structural variation is far from
complete….
http://humanparalogy.gs.washington.edu/structuralvariation/
http://projects.tcag.ca/variation/
Am J Hum Genet. 2009;84:148-61
~16% of the
genome
Low
frequency
High
frequency
… genomic burden of rare variants
Science 320: 539-543, 2008
Nat Rev Genet 40: 8881-885, 2008
Genetics of complex disorders:
what has been achieved so far
Traditional genetic methods (eg, association)
in a genomic perspective point to extreme
alternatives:
•  GWAS
CVCD
•  CNV/CNP
RVCD
Dissecting the genetic basis of disease risk requires
measuring all forms of genetic variation, including
SNPs and copy number variants (CNVs) ……..
…….. Most common, diallelic CNPs were in strong
linkage disequilibrium with SNPs, and most lowfrequency CNVs segregated on specific SNP
haplotypes
Nature Genetics 40: 1168-74, 2008
Looking at individual structural variants with sequencing
technologies
PLoS Biol. 2007 4;5:e254.
Looking at individual structural variants with sequencing
technologies
PLoS Biol. 2007 4;5:e254.
4.1 million DNA variants (~12.3 Mb)
1.288.319 (~30%) novel!
• 
• 
• 
3,213,401 single nucleotide polymorphisms (SNPs)
53,823 block substitutions (2–206 bp)
559,473 homozygous indels (1–82,711 bp)
• 
• 
• 
• 
62 CNVs
292,102 heterozygous insertion/deletion events (indels)(1–571 bp)
90 inversions
Non-SNP DNA variation accounts for 22% of all events identified in the donor, however they
involve 74% of all variant bases
We have uncovered only the tip of the iceberg…
…each genomic region contributes a
modest effect, and collectively all
associated region for a given trait
explain only a small fraction (5-10%) of
the observed phenotypic variation
attributed to genetic elements…
Outcome of a Genome wide association study
We have uncovered only the tip of the iceberg…
…each genomic region contributes a
modest effect, and collectively all
associated region for a given trait
explain only a small fraction (5-10%) of
the observed phenotypic variation
attributed to genetic elements…
Where is the rest of the missing heritability?
•  Incomplete marker coverage
• Allelic heterogeneity at a given locus
• Contribution of rare variants, including
structural (CNVs and smaller) variants
• Epistatic interactions
• GxE interactions
• Epigenetic modifications
• Overestimation of heritability
Outcome of a Genome wide association study
Once a gene has been mapped:
understanding the function …
Outcome of a Genome wide association study
Where is the culprit?
Functional variants and affected genes
Ioannidis et al., 2009 Nature Rev Genet. 10:318-29.
Key concepts in genetic epidemiology of complex traits
ON
OFF
ATTCGGTCTTACCGATATTCGG
From S. Beck 2008
Integrated genomic approach
phen MVPs
X
X
BisSeq
DMRmap
Genome
HapMap
tag SNPs
WGAmap
DeepSeq
X
X
X
phen SNPs
candidate (dys)-functional gene
From S. Beck 2008
From Genes to Pathways:
toward a systemic understanding
of disease
Human GWAS legacy data SNP
Disease Phenotype In Genome-Wide Association Studies
(GWAS) our goal is to find out the
relationship between a Single Nucleotide
Polymorphism (SNP, as a proxy for a
gene) and the Disease Phenotype of
interest
Human GWAS legacy data Disease Phenotype SNP Gene
Protein Neuronal func-on Neural
circuitry
Human GWAS legacy data Animal models SNP
Gene
Protein Neuronal Disease Phenotype Neural
circuitry
func-on Bioinforma-cs Systems Biology addresses links between
SNPs and human phenotype originally
identified by GWAS …..
Human GWAS legacy data Animal models SNP
Genes Genome Non-­‐
coding RNA PROTEIN Neuronal Disease Phenotype Neural
circuitry
func-on Bioinforma-cs ….. including information related to
WHOLE genomic complexity
From single genes to networks
Genes associated with asthma
Leukemia disease network
Inferred Networks based on mouse PFC expression data
Gene interaction network inferred from prefrontal cortex gene expression in 42 different
inbred mouse strains. Schizophrenia candidate genes from GWAS are in yellow. Some
unexpected connections: DACT3 (circled), encodes regulator of Wnt signaling that has
been linking to schizophrenia41-43.
Fig
atl
fro
thi
an
cu
mR
ad
sid
cre
10
ne
cu
cri
spa
an
seg
da
tex
da
AHI1 exon expression in brain, LCL and
immortalized cell-lines (187 subjects):
a TE-derived TSS effect?
Sequencing the genome of schizophrenic patients
Download