basics.fmri.inference

advertisement
Basics of fMRI Inference
Douglas N. Greve
Overview
• Inference
• False Positives and False Negatives
• Problem of Multiple Comparisons
• Bonferroni Correction
• Cluster Correction (voxel-wise threshold)
• False Discovery Rate
• Selection Bias
Statistical Inference
• Can your conclusions be extended to data you have
not seen?
– Subjects, Time Points, Groups, Scanners
• Or are your results the product of a chance
occurrence that is unlikely to be repeated?
• Generalizability, Repeatability,
Reproducibility, Predictability
• Uncertainty
• Beyond good Experimental Design
Group Population
(All members)
Hundreds?
Thousands?
Billions?
Sample
18 Subjects
3
Truth Table
Reality
Conclusion
Effect Is Not Effect Is
Present (Neg) Present (Pos)
True Negative False Positive
Effect Is Not
Present (Neg)
Effect Is
False Negative True Positive
Present (Pos)
4
Error Rate
Conclusion
Reality
Effect Is Not
Present (Neg)
Effect Is Not
Present (Neg)
Effect Is
Present (Pos)
Effect Is
Present (Pos)
True Negative False Positive
TNR=1-a
FPR = a
False Negative True Positive
FNR = b
TPR = 1-b (Power)
False Positive Rate (FPR) – probability that you declare an effect to be
present when there is no effect
False Negative Rate (FNR) - probability that you declare no effect to be
present when there is an effect
5
How Do You Draw Conclusions?
Protocol: reduce all your data to one number (the “test
statistic” T).
• If T is greater than some threshold (q) then conclude that
an effect is present (ie, a positive)
• Otherwise conclude that an effect is not present (ie, a
negative).
Every protocol has some FPR and some FNR, though it is
not always easy to figure out!
6
Noise Causes Uncertainty
Voxel 1
Voxel 2
7
GLM Inference

bOFF
2
OFF
bON
2
 ON
T=8
T=
T=1
b ON  b OFF
2
2
( N ON  1) ON
 ( N OFF  1) OFF
( N ON  N OFF  2) 2
2
b ON ,  ON
, N ON  Mean,Var, N in ON
2
b OFF ,  OFF
, N OFF  Mean,Var, N in OFF
8
Example Protocol
• Collect data
• Motion Correct
• Smooth by 5mm FWHM
• Extract Voxel 1 (throw away rest of data)
• Compute Mean and StdDev of ON time points
• Compute Mean and StdDev of OFF time points
• Compute test statistic T
• If T > 3.41, Conclude that the voxel is active
Test Statistic (T) is the t-ratio
Threshold (q) is 3.41
What is the FPR (a) and FNR (b) for this protocol?
9
Example Protocol: False Positive Rate
b b
• “NULL” Distribution Student’s t-Distribution T =
( N  1)  ( N  1)
• p-value is area under curve to the right of T
( N  N  2)
DOF = N  N  2
• For T = 3.4, FPR = p =.01
• For T=8, FPR = p = 10-11
• For T=1, FPR = p = 0.1
FPR=area
Student’s t Distribution
under curve
• Assumptions:
to the right
• Gaussian noise
of line
(p-value)
• Independent noise
• Homoskedastic (equal variances)
• Violation of assumptions change FPR
ON
ON
2
ON
OFF
2
OFF
OFF
2
ON
ON
OFF
OFF
10
Example Protocol: False Negative Rate
• Need to know what the effect size is
• Previous data
• Guess
• Power Analysis
• Grants require a power analysis!
11
Trade Off of Error Rates
FPR=.10
FPR=.01
• Inverse relationship between error rates
• As False Positives (a) are reduced, the False Negatives
(b) increase
• Increase sample size decreases b, does not affect a
• Which Error is more important? Depends ..
• Science? FPR=.05ish, TNR<0.2
• Pre-operative surgery?
FPR=10-7
12
What conclusions to draw from this?
• Brain is activated?
• Visual Cortex?
• Auditory Cortex?
• False Positive Rate?
Need a protocol!
13
Possible Protocol
• First Level Analysis
• Compute t-ratio for each voxel
• Compute p-value for each voxel
• If any brain voxel has p < .01, declare a positive
• Same as
• Test Statistic: T = max(Ti)
• Threshold: q=3.4
What is the False Positive Rate for this protocol?
14
What does p<.01 mean?
Rand(0,1)
100x100
10,000 vox
p < 0.1
1000 vox
p < 0.01
100 vox
p < 0.001
10 vox
• p<.01 means one expects 1% of voxels will be active purely
by chance
• Protocol gives a False Positive any time even a single voxel
has p<.01
• What is the probability that at least one voxel has p<.01?
15
The “Problem of Multiple Comparisons”
a FWE = 1  (1  aVox )
N
N = 10,000
aVox
• aVox = voxel-wise threshold (p< aVox)
• aFWE = Protocol False Positive Rate
(FWE = Family-wise Error)
• N = Number of voxels (“Search Space”)
aVox =.10
aVox =.01
aVox =10-7
aFWE
0.00001 0.095
0.0001
0.632
0.001
1.000
0.01
1.000
16
Bonferroni Correction
aVox = 1  1  a FWE 
N
a FWE
N
= f (a FWE , N )
Compute Voxel-wise threshold needed to achieve
a desired Family-wise FPR.
To achieve aFWE = 0.01 with N = 10,000 voxels
Need aVox = 0.000001 (10-6)
17
Search Space
• Set of voxels over which positives are searched
• Severity of correction increases with size of search
space (regardless of method)
• Reduce Search Space
• Reduce the area to a ROI (eg, superior temp gyrus)
• Increase voxel size (cover same volume with fewer voxels)
• Spatial Smoothing
18
Spatial Smoothing
• Spatially convolve image with Gaussian kernel.
• Kernel sums to 1
• Full-Width/Half-max: FWHM = /sqrt(log(256))
 = standard deviation of the Gaussian
0 FWHM
5 FWHM
10 FWHM
Full-Width/Half-max
Full Max
2mm FWHM
Half Max
5mm FWHM
Smoothing causes irreversible loss of
information (resolution)
10mm FWHM
Spatial Smoothing
Smoothing causes irreversible loss of information (resolution),
similar to increasing voxel size.
0mm
5mm
10mm
Smoothing
1mm
Increased
Voxel Size
4mm
8mm
Resel
• Pixel = picture element
• Voxel = volume element
• Resel = resolution element (depends on smoothing level)
Resel = (FWHM)3 for volumes
Resel = (FWHM)2 for surfaces
If FWHM>Voxel Size, fewer Resels than Voxels.
Correct based on the number of Resels instead of number of
voxels (math is more complicated, need Random Field Theory)
aVox = f (a FWE , N , FWHM )
Bonferroni
aVox = 1  N 1  a FWE 
aVox
N
Clusters
aVox =.10
aVox =.01
aVox =10-7
• True signal tends to be clustered
• False Positives tend to be randomly distributed in space
• Cluster – set of spatially contiguous voxels that are above a
given threshold.
Cluster-wise Correction
• Cluster – set of spatially contiguous voxels that are
above a given threshold.
• Protocol
• Perform 1st level analysis.
• Threshold volume at aVox
• Find clusters.
• If Cluster Size > Threshold (q), Declare a Positive
• Test Statistic: Cluster Size
• What is the FPR (aFWE) for this protocol?
Random Field Theory
aFWE = f(aVox,N,FWHM,ClusterSize)
p=.05
p=.05
Smoothing increases size of random clusters
FWHM 0
Z
Z>2.3
p<.01
FWHM 2
FWHM 4
FWHM 6
Cluster Images
Sig Map
pVox < .001
Cluster Map
pCluster < .05
Some small clusters do not “survive”
Cluster Table
MNI305
Size Cluster Atlas
Cluster X Y Z (mm3) p-value Location
1
40 -67 -11 41368
~0 Right Lateral
Occipital
2
3
4
-40 -85 -13 51184
-6 17
-50
7
~0
Left Lateral
Occipital
45
2784 .00026
Left Superior
Frontal
23
3768 .00002
Left
Precentral
R
L
Radiological
Orientation
ROI Atlas
Cluster Data Extraction
• Spatial average over cluster of each subject’s contrast
• Can correlate with other measures (age, test score, etc)
• Be careful of “Selection Bias” (“Voodoo Correlations”)
Cluster Correction Summary
• Cluster – set of supra-threshold voxels (size)
• Critical Size Threshold given by Random Field Theory
• Search Space
• Voxel-wise threshold (arbitrary)
• FWHM (smoothing level)
• Assumptions on each
• Loose small clusters (False Negatives)
False Discovery Rate (FDR)
p < 0.1
1000 vox
p < 0.01
100 vox
p < 0.001
10 vox
• Given the voxel-wise threshold, know expected
number of False Positives
• If there are more Positives than this, then some of
them must be True Positives
False Discovery Rate (FDR)
Number of False P ositives
Number of False P ositives Number of T rue P ositives
Number of False P ositives
=
T otalNumber P ositives
FDR =
• Number of False Positives = N*aVox
• Total Number of Positives = Count from image
• aVox = f(FDR,N,Data)
False Discovery Rate (FDR)
• FDR = .05 means that 5% of Positives are False Positives
• Which 5%, no one knows
• How to interpret?
FDR = .05
aVox = .0070
FDR = .01
aVox = .0070
False Discovery Rate (FDR)
• FDR = .05 means that 5% of Positives are False Positives
• Which 5%, no one knows
• How to interpret?
FDR = .05
aVox = .0070
FDR = .01
aVox = .0070
Would you change your
opinion of this blob if 50
of the voxels were False
Positives?
False Discovery Rate (FDR)
• FDR = .05 means that 5% of Positives are False Positives
• Which 5%, no one knows
• How to interpret?
FDR = .05
aVox = .0070
FDR = .01
aVox = .0070
Would you change your
opinion of this blob if 50
of the voxels were False
Positives?
False Discovery Rate Summary
• False Discoveries
• FDR does not control FPR (False Positive Rate)
• Careful when interpreting
• Voxel-wise threshold is Data Dependent
Summary
• Can your conclusions be extended to data you have not seen?
• Truth Table: False Positives (a) and False Negatives (b)
• Protocol – describes how you will draw conclusions
• Problem of Multiple Comparisons (Family-wise Error)
• Search Space, Search Space reduction
• Larger voxels (less resolution)
• Smoothing (Resels)
• Bonferroni Correction
• Cluster Correction (voxel-wise threshold)
• False Discovery Rate
• Selection Bias
37
Download