Complements

advertisement
Complements
As explained in paragraphs 5.3 and 5.4, there are three quantities which
you should know for interpreting the results of a diagnostic test. These are
Sensitivity, Specificity and Prevalence.
In a test there are two dichotomic random variable of interest.
1. The result of the test. We code its possible outcomes with Neg for
Negative and Pos for positive.
2. The presence or absence of the characteristic which we are testing,
for example a disease. We code its possible outcomes by Y if the
characteristic is present and N if the characteristic is absent.
In terms of conditional probabilities of the possible outcomes, the definitions
of the quantities above are as follows
Sensitivity = p(P os|Y )
Specificity = p(N eg|N )
Prevalence = p(Y )
For patients or doctors which receive the result of the test, the quantities of
interests are instead:
p(Y |P os)
p(N |P os)
p(Y |N eg)
p(N |N eg)
Since
p(N |P os) = 1 − p(Y |P os)
e
p(Y |N eg) = 1 − p(N |N eg)
we need only two formulas for the above quantities. These are
p(Y |P os) =
Sensitivity · Prevalence
Sensitivity · Prevalence + (1 − Prevalence)(1 − Specificity)
and
p(N |N eg) =
Specificity · (1 − Prevalence)
Specificity · (1 − Prevalence) + Prevalence(1 − Sensitivity)
Proof of the first identity
p(Y |P os) =
p(Y ∩ P os)
p(P os|Y )p(Y )
=
=
p(P os)
p(Y ∩ P os) + p(N ∩ P os)
p(P os|Y )p(Y )
=
p(Y ) · p(P os|Y ) + p(N )p(P os|N )
p(P os|Y )p(Y )
(1)
p(Y ) · p(P os|Y ) + (1 − p(S))(1 − p(N eg|N ))
1
The first equality comes from the definition of conditional probabilities.
The second equality comes from N = S c , hence P os = P os ∩ Ω = P os ∩
(S ∪ S c ) = (P os ∩ S) ∪ (P os ∩ S c ).
The third is just for the definition of conditional probabilities
The last, which ends the proof of the result, follows from the general fact
that p(A|B) + p(Ac |B) = 1 as follows immediately from the definition of
conditional probabilities.
2
The second equality can be proved in a completely analogous way.
In the proof of the identity, we have also proved the following useful
formulas
p(P os) = Sensitivity · Prevalence + (1 − Prevalence)(1 − Specificity)
and
p(N eg) = Specificity · (1 − Prevalence) + Prevalence(1 − Sensitivity).
The two formulas are not independent, since their sum is one.
2
Download