response5

advertisement
Here we are detailing our concerns on the validity of the bootstrap method
proposed in BWC. Let Aijk denote the psudovalue for the ROC curve area of the
modality i, reader j, and case sample k, and let Yijkt denote the value of the
decision variable for the modality i, reader j, case k, and the true disease
state t. Then, we can write Aijk as a function of Yij11 , , Yijn1 , Yij12 , , Yijm 2 ,
Aijk  hi (Yij11 ,
, Yijm 2 ) .
, Yijn1 , Yij12 ,
Because Yij11 ,
, Yijn1 , Yij12 ,
, Yijm 2 are not independent, to derive valid bootstrap
estimates we have to modify the standard bootstrap method in the i.i.d case to
account for the correlation structure of the data. To understand the issues
associated with the BWC’s bootstrap method, we first need to understand what is
the definition of a bootstrap estimator for the variance of Aijk when
Yij11 ,
, Yijm 2 are correlated. (For a more detailed description on this,
, Yijn1 , Yij12 ,
see Shao and Tu (1995)).
Let Fij ( y11,
, yn1 , y12 , ym 2 ) be the joint distribution of Yij11 ,
estimator for Fij ( y11,
,Yijn1 ,Yij12 ,
, yn1 , y12 , ym 2 ) using the data Yij11 ,
, Yijm 2 , and F n ,m be an
, Yijn1 , Yij12 ,
, Yijm 2 .
Then, the bootstrap estimator for the variance of Aijk is given by the following
formula:
VarBOOT  Var*[hi (Yij*11 ,
where
{Yij*11 ,
, Yijn* 1, Yij*12 ,
, Yijn* 1 , Yij*12 ,
variance given Yij11 ,
*
, Yijm
2 )],
(3)
*
, Yijm
2 } is a sample from F n , m , and Var* is the conditional
, Yijm 2 . Since we cannot directly calculate the
, Yijn1 , Yij12 ,
bootstrap variance estimator given in (3), we need to use a Monte Carlo method
for approximating VarBOOT . If we can generate B independent samples
{Yij*11b ,
, Yijn*b1, Yij*12b ,
Am*b,n  hi (Yij*11b ,
*b
, Yijm
2 } ,b=1,…,B, from the estimated F n , m , we calculate
, Yijn*b1 , Yij*12b ,
*b
, Yijm
2 )], and approximate VarBOOT by
B
B
b 1
b 1
B
VarBOOT
 (1/ B) ( An*,bm  (1/ B) An*,bm ) 2 .
From this definition for the bootstrap variance estimator, we see that
the validity of the bootstrap variance estimator depends on two closely related
assumptions: (1) we can have a consistent estimator F n , m for the joint
distribution Fij ( y11,
, yn1 , y12 ,
, ym 2 ) , and (2) we can effectively sample from the
estimated joint distribution F n , m .
In the case where Yij11 ,
Fij ( y11,
, yn1 , y12 ,
, Yijn1 , Yij12 ,
, ym 2 )  Fij ( y11 )
, Yijm 2 are i.i.d., we can write
Fij ( ym 2 ) and estimate the univariate distribution
function Fij(.) by its empirical distribution. We can also easily generate a
bootstrap sample
{Yij*11 ,
replacement from Yij11 ,
, Yijn* 1 , Yij*12 ,
, Yijn1 , Yij12 ,
*
, Yijm
2 } by a simple random sampling with
, Yijm 2 . However, when Yij11 ,
, Yijn1 , Yij12 ,
, Yijm 2 are
correlated, how to deal with these two issues is not obvious due to the
multivariate nature of Fij ( y11, , yn1 , y12 , , ym 2 ) . The method taken by BWC for
estimating this multivariate distribution and generating a bootstrap sample from
it was to treat reader and cases as either random or fixed. However, it is not
clear to us what is the property of the resulting bootstrap procedure.
In addition, the proposed re-sampling method seems to treat diseased and nondiseased patients in the same way. However, as we know the distribution of
diseased patients is different from that of a non-diseased patient, they should
be treated differently.
Finally, we must point out that, despite our concerns, the BWC methods seem to
have undergone extensive development, testing and use without serious problems
arising. Thus the method appears to be robust enough that the concerns raised
in our book and in this letter do not seem to constitute "fatal flaws".
References:
1.
S. V. Beiden, R. F. Wagner, G. Campbell. Components-of-variance models and multiplebootstrap experiments. Acad Radiol 2000; 7: 341-349.
2.
J. Shao and D. Tu (1995). The Jackknife and Bootstrap. Springer, New
York.
Download