Decision making as a model 4. Signal detection: models and measures Nice theorems, but how to proceed in practice? 1. Hard work Get several points on ROC-curve by inducing several criteria (pay-off, signal frequency) Compute hit rate and false alarm rate at every criterion Many trials for every point! Measure or compute A using graphical methods Certainly no signal 0 1 2 3 4 5 Certainly a signal Variant: numeric (un)certainty scale: implies multiple criteria – consumes many trials too 2. Rough approximation Area measure for one point: A' Average of those two areas: Hits 1-H F A' = 1 - ¼ ------- + ----1-F H (H-F)(1+H-F) = ½ + ¼ ----------------H(1-F) H F False Alarms Iff H>F B''= -.4 if H = 1, F≠0, F≠1, then B'' = -1 B''= .07 F B''=.4 HIT RATE B''= -.07 H Comparable measure for criterion/bias: Grier’s B'' Isobias curves FALSE ALARM RATE if H = 1 - F then B'' = 0 if F = 0, H≠ 0, H≠1 then B'' = 1 H(1 - H) – F(1 – F) B'' = sign(H - F) -----------------------H(1 - H) + F(1 – F) 3. Introducing assumptions Even when several points ara available, they may not lie on a nice curve Then you might fit a curve, but which one? Every curve reflects some (implicit) assumptions about distributions Save labor: more assumptions less measurement (but the assumptions may not be justified) Normal distributions are popular (there are other models!) 0,45 0,4 0,35 0,3 0,25 0,2 0,15 0,1 0,05 0 -4 -3 -2 -1 0 1 2 3 4 Simplest model: noise and signal distributions normal with equal variance One point (PH, PFA pair) is sufficient 5 Example: in an experiment with noise trials and signal trials these results were obtained: Hit rate: .933, False Alarm rate .309 (.067 misses and .691 correct rejections) Normal distributions: via corresponding z-scores the complete model can be reconstructed: distance: d´ = 2 measure for “sensitivity” z.933 = - 1.5 z.309 = .5 f h .933 .309 h β = ---- = .37 f Measure for bias/criterion -3 -2 -1 0 1 2 3 4 5 6 1 0,9 0,8 d'=0 0,7 proportion hits -4 several d'-s and corresponding ROC-curves d'=0.5 0,6 d'=1 0,5 d'=1.5 d'=2 0,4 d=2.5 0,3 d'=3 0,2 0,1 0 0 0,2 0,4 0,6 proportion false alarm s 0,8 1 Gaussian models: preliminary Standaard normal curve M=0, sd = 1 P z z Φ(z) = -∞ φdx φ(z)= 1 -z2/2 e √2π Transformations: Φ(z) P Φ-1(P) or: Z(P) z see tabels and standard software Roc-curve PH = f(PFA) PH PFA λ Z-transformation ROC-curve Pz zH - zH = f(zFA) Nice way to plot several (PFA, PH) points zFA Plotting with regression line? Regression line zH = a1zFA+ b1, Minimize (squared) deviations zH : Underestimation of a Regresionline zFA= a2zH + b2 Minimize (squared) deviations zFA : Overestimation of a Compromise: average of regression lines ZH = ½(a1+1/a2)ZFA + ½(b1+b2/a2) Equal variance model: PFA = 1- Φ(λ), = Φ(-λ), PH = 1 – Φ(-(d' - λ)) = Φ(d' – λ), zFA = -λ zH = d' – λ zH = zFA + d' d' = zH –zFA z-plot ROC 45° 0 λ zH d' 45° d' zFA Criterion/bias: β = h/f = φ(zH)/φ(zF) f h 1 -z2/2 φ(z) = ------ e (standard-normal) √(2π) 1 -zH2/2 zFA2 – zH2 φ(zH) = ------ e -----------2 √ (2π) Divide: -------------------- = e 1 -zFA2/2 φ(zFA) = ------ e √(2π) To get symmety a log transformation is often applied: log β = log h – log f = ½(z2FA – z2H ) f h λ c Alternatve: c (aka λcenter), distance (in sd) between middle (were h=f) and criterion c = -(d'/2 – λ) zFA = -λ d' = zH - zFA zH – zFA 2zFA c = - ---------+ ----2 2 zH + zFA c = - ---------2 β Isobias curves for β en c c