Comparing Expert Fingerprint Examiners and Novices on a Visual Working Memory Task: What Makes Experts So Good? Technical Report Thomas Busey, Ph.D. Indiana University, Bloomington busey@indiana.edu The following is a summary of an experiment that addresses the information used by novice and fingerprint experts when performing a tasks that abstracts many of the relevant factors from real fingerprint comparisons. While there remain important methodological differences between a real comparison and the present study, the results bear on the nature of the differences between experts examiners and novices. Methods The goal of the experiment is to determine what information novices and fingerprint examiners might use when comparing fingerprint features. In the experiment, a single feature was presented for 1400 ms (or 1.4 seconds): It was then replaced by a mask, which was on the screen for either 200 ms or 5200 ms: and then two features were shown: The subject then indicated which of the two test prints corresponded to the studied print. In this example, the feature on the left matches the studied print. The images remained on the screen until the observer responded, which was not time-limited. Note that we have altered the orientation of the images, and slightly changed the brightness levels to prevent observers from using low-level cues to make a match. These types of cues would be irrelevant in an inked print/latent print comparison. We also removed images that had artifacts such as lint flakes that were specific to the inking procedure and would not also be present in a latent print (such as a scar). These procedures were designed to make the testing conditions rely on many of the same sources of information that an examiner would use while matching latent to inked prints. However, the testing procedures (limited viewing time) were designed to probe the results of perceptual expertise rather than mimic actual viewing conditions, which usually are not timelimited. Three different factors were included in the experiment. First, the mask could be shown for a short delay (a fifth of a second) or a long delay (5200 ms). Comparison across the delays will tell us whether, for example, experts are better at holding the features in memory. The second factor was the inclusion of visual noise at the test images. Latent prints will rarely be as clearly as inked prints, and thus we added visual noise to the prints to test whether experts could better tolerate the degraded viewing conditions than novices. The images below demonstrate the visual noise manipulation: The third factor was a manipulation meant to mimic the fact that latent prints are rarely complete copies of the inked versions. Usually latent prints are partial prints, and may be patchy due to textures on the contact surface. To produce testing conditions that were similar to partial prints, we constructed masks that reduced or eliminated portions of the fingerprint feature. An example is shown below. Sometimes the compliment masks were used, which allows testing for configural properties as described below. All three factors were factorially combined, so that the partial image could be shown in noise or free of noise. An example of a partial print shown in noise is shown below: There were a total of 8 conditions in experiment, which were factorial combinations of delay (short vs. long), noise (no noise or noise added) and partial (full image or just partial displayed). To further mimic conditions present during a fingerprint match process, the features were rotated at test up to 25 degrees and the brightness was altered. This forces observers to rely on features that survive such transformations and ignore smaller features like a piece of lint on the inked print which would not be part of a latent print. Each subject performed 18 replications in each of the 8 conditions for a total of 144 trials. Results The results are presented below. Where significant effects or interactions are reported, these are significant at the .05 level in a repeated measures analysis of variance (ANOVA) with the three within-subject factors (delay, noise and partial) and one betweensubject factor (novice vs. expert). Complete statistical analyses are available upon request from Tom Busey (busey@indiana.edu). Not surprisingly, fingerprint experts performed much better than novices. Several conditions produced near-ceiling performance, while some conditions for the novices produced performance at chance. To deal with the potential compression that can occur at the extremes of the scale, we converted performance to d', which is a measure of the observer's sensitivity in the task i. All differences reported below are for the d' data, but all effects were consistent in the percent correct data as well at the .05 level. Experts: Experts show reduced performance when the feature is presented in noise, as well as when the feature is partially masked. Somewhat surprisingly, however, is the result that experts are not affected by the delay. Thus they appear to be able to hold features in memory for longer periods of time. It is perhaps not surprising that partially masking the feature or presenting it in noise reduces performance. However, the experts show an interaction between these two factors, such that presented the partially masked image in noise causes a much greater reduction in performance than would be expected on the basis of the effects of noise and partial masking. Novices: Unlike Experts, Novices are affected by the longer delay. Thus they seem to be less able to hold each fingerprint feature in memory as well as experts can. Novices also show the same main effects of partial masking and noise as experts do, but no interaction between partial masking and noise. In fact, the novices show no interactions at all. Comparisons across groups: The different patterns of results described above leads to a number of interesting interactions with group membership. All three main variables (delay, noise and partial masking) show interactions with group type, which means that the three manipulations affect the two groups differently (the partial masking result is only marginally significant). The interaction between partial masking and added noise was significant for experts but not novices, and this produces an interaction between the two factors and group membership. Discussion What factors underlie the strong performance demonstrated by Experts? While they demonstrate better ability to retain images in memory over longer delays, it may also be the case that Experts process the fingerprint features in a different way from novices. One possible strategy adopted by experts might be that of configural or holistic processing. Holistic processing The partial masks were carefully constructed so that each one presented only half of the information available in the intact fingerprint. This was done by sometimes inverting the mask so that its compliment would be used to mask the fingerprint. Given this design it is possible to predict how well observers will do in the intact condition based on how well they have done on the partially masked condition. There are some statistical issues to consider that relate to issues like guessing, but basically the logic boils down to this: in the partially masked case, the observer has only one chance to get enough information from the test images in order to make the correct decision. In the intact case, however, the observer has two chances, essentially one chance from each of the two partial images that make up an intact image. Are the experts and novices making use of all the information in the intact image? Surprisingly, the answer depends on whether the images are presented in noise or not. Without noise, both novices and experts seem to be underutilizing the information in the intact image. That is, they do worse on the intact image than we would have expected given their performance on partial images. For some reason, both novices and experts seem to be underperforming when it comes to intact images. There is a difference, however, when the partial and intact images are presented in noise. This condition is particularly interesting, since latent prints are often very noisy, and while the visual noise we add may not be exactly the same as the noise caused by environmental factors in a latent print, one might expect that if there is a difference between novices and experts it may be revealed under high-noise testing conditions. When noise is added to both the intact and partial test prints, novices perform about as well as expected on the intact images. However, and this is the most interesting finding, experts perform much better on the intact images than one would predict based on their partial print performance. Somehow for experts, the whole is greater than the sum of the parts, and this is consistent with some form of holistic processing that may come about when experts talk about 'using the totality of the image' rather than focusing on individual features. This suggests that features are not processed individually, but affect each other, and this occurs more often in the intact print. Summary In summary, there are a number of factors that differentiate Expert Examiners from Novices. Experts show overall much better performance than novices. In addition, Experts are unaffected by long delays imposed between the study image and the test image, while novices show a marked decrease in performance. Both Experts and Novices are affected by adding noise and partially masking the test images, but Experts show an interaction between the added noise and partial masking procedures. This suggests that they change how they view images embedded in noise. One possibility is that experts adopt a strategy of doing more holistic processing when viewing images in noise, which works for intact but not partial images. i For conversion to d', perfect scores were converted to a value equal to (1-(1/(2N)) where N is the number of trials in that condition, usually 18. This is not an ideal solution, but performance for novices was in danger of reaching chance in some conditions, and so we feel that we chose stimulus durations that were appropriate for both sets of observers. We could reduce the duration for the experts, but at 200 ms it is already fairly brief, and we run the risk of changing the nature of the information they acquire by going to shorter displays. Thus at this point we are content to analyze both the d' and percent correct data and if they give the same result assume that ceiling effects are not a major determinant of our interactions.