Accelerated Permutation Inference for Pedigree Heritability Inference with Family-Based Neuroimaging data

advertisement
Accelerated Permutation Inference for Pedigree Heritability Inference with
Family-Based Neuroimaging data
Habib Ganjgahi1, Anderson Winkler2, David C. Glahn3, John Blangero4, Peter Kochunov5, and Thomas Nichols1
1University
of Warwick
2
University of Oxford
3
Yale University
4
5
University of Texas Rio Grande Valley
University of Maryland
Results
• WLS and ML heritability estimators are compareable for large samples.
• The parametric inference is conservative.
• False positive rates with P2 permutation almost exact, P1 conservative..
PP plot for Uncorrected P-values
0.2
0.6
0.15
0.4
0.1
0.2
0.05
0
0
=
nS
8
13
6
62
=
nS
8
85
=
nS
=1
7
49
8
13
bias h2: ML vs WLS
0.02
0.03
0
0.025
-0.02
0.02
-0.04
0.015
-0.06
0.01
-0.08
0.005
-0.1
=
nS
6
62
nS
8
85
1
S=
n
26
n
6
S=
8
S=
58
=
n
n
nS
1
S=
58
6
S=
8
S=
n
n
100
80
80
40
4
=1
TL,ML TL,WLS TW,ML TW,WLS
Figure 1b: h2 Test FP (top left) and Power
FWE Rates
Observed -log(P-values)
2
0.04
0.03
1.5
0.02
• Var(ε ) = Σ =
8
2.5
2
1.5
1
0.5
1
1.5
2
2.5
3
3.5
TS
4
2
1.5
1
0.5
0
0
0
2.5
TW,WLS
0.5
1
1.5
2
2.5
3
Theoretical -log(P-values)
3.5
0
0.5
1
1.5
2
2.5
3
3.5
Theoretical -log(P-values)
Test Statistics
Figure 2a.
Figure 2b.
Figure 2c.
• Voxel-wise FWE p-values (2a), Cluster-wise FWE rates (2b) and cluster-wise FWE p-values (2c) for different
cluster forming thresholds. Rates are nominal except for the higher cluster forming thresholds of the Wald
test (Fig 2b,c).
(2)
σA2Dg
• MLE estimation of
ML invariance property.
6
3
0.5
0.01
Theoretical -log(P-values)
• S: kinship matrix eigenvalue decomposition Φ = SDgS 0 where, N×N matrix S and diagonal
Dg are eigenvector and eigenvalue matrices respectively.
+ σE2 I: transformed data variance, a diagonal matrix.
β, σA2 and σE2 are same in the original and transformed
4
PP plot for T W,WLS Max Cluster Size
3.5
P=0.05
P=0.01
P=0.005
P=0.001
3
0.05
2.5
PP plot for T S Max Cluster Size
3.5
Observed -log(P-values)
P=0.05
P=0.01
P=0.005
P=0.001
0.06
W,WLS
3
0
⇒ Y ∗ = X ∗β + ε ∗,
2
Figure 1c: Uncorrected P-value
Performance
0
• Y ∗ and X ∗: transformed trait and covariates respectively.
∗
0
Theoretical -log(P-values)
Observed -log(P-values)
T
0
Eigen-simplified polygenic model: Applying an orthogonal transformation based on the
eigenvectors of the kinship matrix to the Eq (1) gives
∗
0
TL,ML TL,WLS TW,ML TW,WLS
Test Statisics, h2 =0.6
0.07
0.5
• Φ: N × N kinship matrix; 2Φij is the genetic relatedness of subjects i and j.
S 0Y = S 0X β + S 0g + S 0ε
1
TS
1
• σA2 and σE2 : additive genetic and environmental effects, respectively.
3
Cluster-wise Inference
PP plot for Maximum Statistic
3.5
(1)
4
2
Test Statisics, h2 =0.4
nS
5
40
0
TS
6
Image-wise Simulations
Methods
Var(ε) = Σ = σA2(2Φ) + σE2 I
60
20
TS
Y = Xβ + ε
TL,ML TL,WLS TW,ML TW,WLS
Test Statisics, h2 =0.2
100
97
Figure 1a: h2 Estimation Accuracy
4
Polygenic model:
40
Test Statisics, h2 =0
60
7
Parametric TW,WLS
TS
0
26
Parametric TS
0
=1
8
Permutation T W,WLS
60
TL,ML TL,WLS TW,ML TW,WLS
20
38
Permutation T S
P1
P2
Parametric
20
nS
ML: H2=0
ML: H2=0.2
ML: H2=0.4
ML: H2=0.6
ML: H2=0.8
WLS: H2=0
WLS: H2=0.2
WLS: H2=0.4
WLS: H2=0.6
WLS: H2=0.8
97
4
=1
2
7
49
0
38
3
TS
MSE h2: ML vs WLS
0.035
80
0
=
0.04
4
1
nS
nS
100
Rejection Rates
0.8
5
Observed -log(P-values)
sd h2: ML vs WLS
Rejection Rates
Mean h2: ML vs WLS
Rejection Rates
The prerequisite of any genetic analysis is establishing the heritability of your trait of
interest. In neuroimaging genetics, the large number of voxel-wise measurements present
a challenge both in terms of computational intensity, and the need to control false positive
risk over multiple tests. There is a gap in existing tools, as standard neuroimaging software
cannot estimate heritability, and yet standard quantitative genetics tools cannot provide
essential neuroimaging inferences, like FWE corrected voxel- or cluster-wise P-values.
Moreover, available heritability tools rely on P-values that can be inaccurate due to use
of asymptotic inference methods. Hence, there is a need for alternative, computationally
efficient inference procedures that make fewer assumptions. Permutation methods can
provide exact control of false positive rates and FWE corrected voxel and cluster-wise
inferences. Blangero (2013) introduced a method to accelerate maximum likelihood heritability estimation. However, this advance doesn’t eliminate iterative optimization, possible
convergence problems, nor the use of asymptotic P-values. In the present work (Ganjgahi
2015), we expanded upon Blangero (2013) to derive approximate, non-iterative estimates
and test statistics based on the first iteration of Newton’s method, develop corresponding
Wald and Score tests, and find P-values from a permutation test.
Rejection Rates
Introduction
Real Data Analysis
• ML and WLS heritability estimators have same distribution (Fig 3b)
• WLS estimator showing a slight but consistent trend towards underestimation relative to ML (Fig 3a).
• Score Test is slightly less sensitive than the other tests (Fig 3c).
model due to
Histogram of non-zero h 2 (89.1% of h 2 > 0)
4
Parameter Estimation
3
2
• A non-iterative heritability estimator can be created based on 1-step optimization of the
likelihood, equivalent to a WLS regression of squared residuals on the kinship matrix
eigenvalues.
1
• Amemiya (1977) showed that such estimator is asymptotically normal and consistent.
n
o
−1 0 ∗
∗0 ∗ −1 ∗0 ∗
0
β̂OLS = X X
X Y , θ̂OLS = max 0, U U
U fOLS , ε̂OLS = Y ∗ − X ∗β̂OLS;
1
β̂WLS = X
∗0
(Σ̂∗OLS)−1X ∗
−1
∗0
X (Σ̂∗OLS)−1Y ∗,
0
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0.9
1
ML
Histogram of non-zero h 2 (89.1% of h 2 > 0)
4
3
2
0
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
WLS
Figure 3a
Figure 3b
Figure 3c
−1
−1
0 ∗2 −1 ∗
θ̂WLS = max 0, U 0(Σ̂∗2
)
U
U
(Σ̂OLS) fOLS ,
OLS
• β and θ = (σA2, σE2) are the transformed model fixed and random parameters.
∗
: squared OLS residuals.
• fOLS
• U = [1, λg]: N ×2 auxiliary model design matrix, where 1 is a vector of ones and λg = {λgi}
is a vector of kinship matrix eigenvalues.
2
2
2
2
2
• Σ̂∗OLS is formed by θ̂OLS = (σA,OLS
, σE,OLS
). ĥ2WLS = σ̂A,WLS
/ (σ̂A,WLS
+ σ̂E,WLS
).
Test Statistics:
• The Wald test
ã Fully converge ML estimator.
ã One-step WLS estimator (corresponds to a generalized sums of squares):
−1
1 2
0
∗−1
∗−1 0 ∗−1
0 ∗−1
∗−1
TW,WLS = (σ̂A,WLS)2(Σ̂∗−1
λ
)
I
−
Σ̂
1
(
Σ̂
1)
(
Σ̂
1)
1
Σ̂
Σ̂
OLS g
OLS
OLS
OLS
OLS
OLSλg.
2
Figure 3e
Figure 3f
• Real data results, voxel-wise heritability estimates for ML
(top) and WLS (bottom) (Fig 3d).
• Likelihood Ratio Test (LRT):
• Voxel-wise 5% FWE significant heritability, for LRT (top),
WLS Wald (middle) and the score (bottom) tests (Fig 3e).
ã TL,ML: based on fully converged estimator.
ã TL,WLS: based on WLS estimator.
• Cluster-wise 5% FWE inference results for uncorrected
P=0.01% cluster forming threshold, LRT (top row), WLS
Wald (middle) and the score test (bottom) (Fig 3f).
• The Score test: Takes the form of an auxiliary model explained sums of square.
!2 2
0
1 σ̂A,OLS
11
0
TS =
λg I −
λg ,
2
2 σ̂OLS
N
Figure 3d
Conclusion
Hypothesis Testing
• Parametic Test: Asymptotic 50 : 50 mixture χ with 1 and 0 degrees of freedom.
In this work we presented fast and powerful method to estimate heritability that enables standard spatial inference for genetic analysis tools optimized for imaging research, such as the SOLAR/SOLAReclipse
(Kochunov2013).
2
• Permutation Test: two variants
ã P1: Permute the kinship structure, refitting the model with Φ∗ = PΦP 0.
∗
ã P2: Ỹ ∗ = X ∗β̂OLS + Pε̂OLS
.
where P is a permutation matrix, β̂OLS is the fixed effect parameter estimation and
the residulas under the null hypothesis.
References
∗
ε̂OLS
is
T. Amemiya. A note on a heteroscedastic model. Journal of Econometrics, 6(3):365-370, 1977.
J. Blangero et al. A kernel of truth: statistical advances in polygenic variance component models for complex human pedigrees., volume 81. Academic Press, 2013.
H Ganjgahi et al, (2015). Fast and powerful heritability inference for family-based neuroimaging studies. NeuroImage, In press. P. Kochunov, T.E, Nichols (2013). SOLAR-Eclipse computational tools for imaging genetic and
mega-genetic analysis. 19th Annual Meeting of the Organization for Human Brain Mapping.
Electronic copy of this poster: http://warwick.ac.uk/tenichols/ohbm
Download