J. P. Rosenfeld, Ph.D.
jp-rosenfeld@northwestern.edu
We should spend an equal
time on phenomenology and
MECHANISMS of deception….
…as we do on DETECTION of
deception. But there is a lot more
research-based (real) knowledge
about the latter, so most of the
time, that’s what we will cover.
Regarding detection; there are
two approaches:
1. Behavioral
(a) non-verbal
(b) Verbal
2. Physiological: Nervous
system activity.
….We start with Behavioral:
Are the following notions True?
1. People typically reveal their lies by fidgeting, acting
 nervous, avoiding eye contact, etc.
2. Therefore, we are rather good lie detectors (unless we are
very stupid).
3. This is especially true when we detect lies in those close to
us.
4. Criminals, con-men, professional crooks, and such,
however, are harder to spot for us lay people.
5. Fortunately, trained professionals (police, FBI) are superior
lie detectors, so they protect us against pros.
We wish we
had a
Pinocchio's
nose
indicator
But we don’t…
…NONE of those assertions
is true!
People think they are
great lie detectors.
But they miss many
lies. Why?
 (1a) If someone thinks
you look great, why
argue? (Vrij: Ostrich
effect)
(1b) I’ll ask no
questions……(and hope
you tell no lies!)
Bill to Hillary and to an
aide: “Ah did not have
sex with that woman…..”
(2) Lie detection is
tough!
 There is no behavioral or
physiological specific index of
deception, though things are not
as bad a Vrij suggests: P300, fMRI
Beware of internet & Media claims!
 Pavlidis’ thermal imaging not so
great…
 BEOS Brain Electrical Oscillations
Signature is Bogus!
 “Brain Fingerprinting?!” (See my
critique….)
How common is lying?
 A) once a year?
 B) once a month?
 C) once a week?
 D) More than daily?
Definition of lying:

OK, give one…
Intentional, without
warning, misinform or
mislead another.
Liar must believe
information is false.
Lie could fail.
(Types of Lies)
From Bad-est to least bad:
 (1) Outright Lie– (total B.S.)
 John Edwards: “I am not the
father of that child, no way, I’ll
happily take a DNA test….”
 He was stating “I didn’t do it”
while knowing he did.
(2)Exaggerations (overstatements)
or Minimizations(understatements)
 “I am the most eminently famous
researcher in the Psych.
Department.” (but almost…)
 “This is an important discovery of
mine not yet published” (which
was actually rejected by one
journal and now in review at
another.)
(3) Subtle Lies (Omit details)
 Clinton: “Ah did not have sex
with that woman, Miss
Lewinski…”
 (Not according to the usual
definition: intercourse.)
(4) Concealment…
 “How was your day?…mine
was as usual…”
 [Heaven forbid she asks me
about my day!]
Many reasons:
 (1) Material Gain
 (2) Avoid loss/punishment
 (3) Avoid embarrassment
 (4) Make good impression (&
get the job)
All these involve
gain for oneself.
The other kind of (Otherdirected) Lie:
Altruistic Lies:
“You made a great impression…”
Telling a gravely ill person (child)
that she will be fine..
Why is it hard to know for sure
how often people lie?
Self report fails
because (1) people
don’t want to recall &
(2) don’t remember.
But 2+ times a day is a
fair estimate.
Whom do people lie to?
Strangers more than
close contacts. (Why?)
(1) Gender:
 Men tell more self-oriented lies.
 Women tell more other oriented
lies
(2) Age
 What is definite minimum age for
lies in children? (2,4,6,8,10) See
Vrij, P. 27.
 What motivates youngest
children’s lies? (Gain of say
cookies? Avoid punishment?
(earliest lies). Make a good
impression?)
(3) Personality Type
 In romantic relations:
 (a) attachment style: avoidant
(lack of trust, keep people at
distance)…..these folks lie to keep
others at a distance.
Versus
 anxious attachments: lack of selfesteem. These people lie to look
good as they feel bad.
(b) Psychopathy
 The types have no empathy or
sympathy for others. They regard
others as pieces of themselves,
and objects of manipulation—like
their own limbs. These are big
time liars to everyone they might
wish to manipulate. Superficially
charming, they get away with it
for a while…
(c) Extraverts and introverts
 Outward/sociable vs. reserved.
 Which are the bigger liars?
(d) Self-consciousness:
 These are very concerned with
impression they make, so how do
they behave with respect to
deception?
In this connection, one speaks also
of social anxiety: they lack self
confidence, so they tell tales.
(Are we all on same page? What
are these?)
We wish we
had a
Pinocchio's
nose
indicator
DiPaolo’s group emphasizes 3
theoretical perspectives:
 During deception, 3 phenomena
occur that may yield “tells”:
 1) emotional reactions: guilt, fear---and delight. (delight?)
 Guilt  low eye contact
 Fear  stuttering, blinks, squeals.
 Delight (of duper) excitement,
inappropriate laughter. (Ekman)
What else does liar experience?
 2) mental (task) demand, also
called cognitive load. Liars will
also tend to look away, so as to
concentrate on made up details.
Except when they monitor you to
see if you believe.
 3) Attempted self control of
behavior to avoid tells. Not easy.
Voices break, facial expressions
occur: Ekman’s micro-expressions.
These are theories of what
should happen. Moreover…
…these effects are affected by
certain other factors, e.g.,
 GUILT—The degree of guilt felt
depends on personality. Consider
Psychopathy. & Consider Stakes.
& Consider sense of righteousness
or duty felt by CIA agent.
 FEAR– Experienced liars of
whatever breed have less fear.
 EXCITEMENT– helped by type of
audience. Also, if it’s a big
conquest, or else, no.
More…
 COGNITIVE LOAD--- also depends on
verbal skill-intelligence and experience.
(Vrij is wrong about Psychopaths
feeling less load.) Degree of rehearsal
also makes a difference (practice
makes perfect!). No transparency
illusion. So more monitoring of other.
 SELF CONTROL depends on
Psychopathy.
Vrij is a pain, but best we
have..(E.g.Box 3.1 p. 47)….
 …and his best statement is on the
top of p. 49. “ …the relationship
between lying and non-verbal
behavior is complex.” The
implications of this statement
regarding development of nonverbal “tells” is what?! (Life is not
Poker.)
Field Studies.(Define.)
 We want to compare liars vs.
truth-tellers, but…
Difficulties:
1)Getting videos: (Are cameras
always running?)
2) Establishing “Ground Truth.”
3) Controls: only possible in lab to
have unconfounded conditions—so
not many field studies re:
nonverbal cues are out there…
What’s a Confound ?
Consider our deception
feedback study:
OK, how about lab studies?
 Pluses: Ground truth easy to get
by design. Likewise, unconfounded control conditions.
 Minuses: Lies are instructed. So
what happens to guilt? fear? What
happens regarding stakes? You
can offer lots of money, but not
punishment (as in real life).
 DECEPTION HARD TO STUDY!
Bottom Line:
 Vrij’s Appendix 3.1(p. 90) makes
very clear that there is NO reliable
non-verbal sign of deception.
There may be trends and even
significant group effects (explain),
which support theoretical views
noted above…..
**But in D of D, the key statistic is
INDIVIDUAL HIT RATE (as utilized
in d’ or AUC from SDT).
Dramatically:
 Is there such a thing as “lyin’
eyes?” Are gaze or eye contact
helpful? (p. 60) Vrij says no, but
there is new research: N. Cohen,
Frank.
*Are liars more emotional than
truth tellers? Why or why not? (p.
61) [HINT: Is it only liars who feel
emotion?]
Table 3.1 (p. 57) in Vrij is about
effect size.
 J. Cohen's d = M1 - M2 / spooled
where spooled = �[(s 1�+ s 2�) / 2]
 In Table 3.1, only 3/18 d values are
> .30 (moderate) in the big meta
analysis of 100 behaviors.
 Remember, d is a GROUP
measure!
Vrij’s “Group Differences”
(p.67)….
Very suspect!
 As is rest of chapter about
Clinton, Saddam Hussein, Huntley.
 Amazing that he says (p.89) look
for the cluster of c(l)ues to
deception, not single c(l)ues.
Which cluster? “There is no
straightforward answer….”
That is….speech signs…
We wish we
had a
Pinocchio's
nose
indicator
Vrij description of possible
verbal signs (“cues”)…
 ….as in his box 4.1 list (p. 101) is
mostly pathetic, i.e., wrong (imho).
 On the other hand, sometimes one
has to say, “duhhh!” as when he
states that direct statements are
more credible than evasive ones.
 Obviously plausibility helps! As
does consistency (lack of
contradiction).
But Vrij best statement is…
 “As I will demonstrate, a verbal
cue uniquely related to deception,
akin to Pinocchio's growing nose,
does not exist.” (p. 103)
 Again, an appendix (4.1 yuk!) shows
some trends & group effects as
supporting some views in Box 4.1.
 Again, clusters> singles. But which
cluster? No one knows.
Vrij indulges himself with
many “I think..” type ideas.
 That’s fine. Be sure you do not
confuse these hypotheses for
facts!
 Note: people & researchers
always start out with what
appears obvious, like twitches &
speech signs. These all are
ultimately controllable, unlike
physiology (looking ahead).
Henceforth….
 We will be looking at more
specialized and developed
approaches to detecting
deception. Vrij still likes “more
immediate” (non-physiological)
tools, because they are allegedly
quick, easy, & cheap. Imho, he
stretches things by emphasizing
how laborious some methods are...
OK,
 fMRI requires a $1M+ machine and an
annual budget of >$1M…
 But when Vrij states (p. 189) that
EEG/ERP recording usually requires 10
or more scalp electrodes, he
exaggerates: 2 (actually 1) will do, and
it takes only 5 minutes to hook up.
 But Vrij & we will go on with SVA, RM &
possibly SCAN, imho, unphysiological
extensions of what we just did…
Reality Monitoring, Statement Validity
Assessment, & Scientific Content
Analysis.
For SVA, go to power point on
323 web site called…..
 “Statement Validity Analysis”
and read Vrij chapter 8. I will
not lecture on it. But I will
lecture on the Implicit
Association Test. Go to this
web site and take a demo test:
 https://implicit.harvard.edu/im
plicit/demo/
Reality Monitoring…
 RM started not as a DoD method,
but rather as a discipline in
Cognitive Psychology to study on a
theoretical basis, the differences
between perceived versus
imagined experience; as with false
(but honestly believed) memories.
What are the characteristics of
Real versus False Memories…..
 …with regard to:
 Sensory Information?
 Contextual Information?
 Affective information?
 Think of dimensions of clarity
versus vagueness?
 Give examples of what you might
actually remember about an event.
This Q & A system of M.
Johnson sorta works, but…
What happens as time
passes?
So what do you remember
better, last week or when
you were 3?
Actually…
 Actually, it turns out that
Physiology (ERPs) come to the
rescue again regarding false
memories… which is good
because RM tool isn’t so great..(p.
265)
 …as we will see later.
If RM is not a perfect false
memory detector, then why..
…would we think this strategy
would be a good deception
detector?
Are there differences between
false memories and deceptions?
HINT: What do subjects believe
about veracity of f.m. and lies?
RM Criteria (Sporer ‘97) some overlap with
CBCA: based on interviews:nonstandardized
 From Vrij Table 9.1. The first 7 should
be there in true statements.
1. Clarity—clear sharp, vivid statements
2. Perceptual info.—details of smell,
sound, etc.
3. Spatial info.—details of location,
seating arrangement, object location.
4. Temporal Info– When things happened,
and in what order.
(continued…)
Four more…
5. Affect detail—How subject felt:” I was
disgusted when I saw the body…”
6. Reconstructability of the testimony in
detail.
7. Realism: Is the story plausible,
realistic, logical?
8. Cognitive operations. Should be less
in true story: It shouldn’t be necessary
to make inferences: “He appeared
nervous.”
The RM method usually
involves 2+ raters scoring tape.
 Inter-rater reliability should be and
is found by research to be high.
Good as CBCA, anyway.
 Vrij thinks it’s easier to learn and
teach, because…
 There are fewer criteria which are
more concrete and thus less
subject to interpretation.
Vrij’s review of RM research in
Appendix 9.1 has limitations:
 Not all reviewed studies are peer-
reviewed publications.
 Not all studies are given!
 Scoring not standardized.
 Some listings lack inter-rater data.
 Only recent events are studied.
 RM is problematic in children.
 Etc, etc. maybe not important ….
...because:
 Appendix 9.1 shows a mostly
mixed pattern.
 Average accuracy is 68.8 %
overall, which mean many false
alarms (about 29% in truth
tellers) and misses (about 34% in
liars). I.e., group but not individual
results.Yes, RM beats flipping a
coin, but not by much. More work
is needed. (Physiology wins.)
End of RM
“SCAN”
Israeli tool:
Vrij: Widely used, though
Not much researched, nor
endorsed by researchers.
Basic assumption:
 A memory based on actual
experience will differ in content
and “quality” (?) from an invented
or fantasized memory. (Duhhhh!)
 Also problematic: Not all lies are
invented out of whole cloth. Some
are substituted actual memory, as
in most alibis.
Proceedure:
 Suspect writes in handwriting a
statement of events in absence of
investigator (to avoid
contamination).
 Statement is then scored on
several criteria:
The Criteria (+ = seen in truth
tellers. V means vs. CBCA)
1. Direct denials +
2. Specific identifications +
3. Cross-outs(against instructs.) – V
4. Vague memories – V
5. 20-50-30 structure +
6. Emotions +
7. Subj/obj time correspondence +
More….
8. Events in chronological order +
9. Missing info: :”A bit later..” 10. 1st person, present tense +
11. Pronouns “I” Ownership. +
12. Change in language(???):
Change from “conversation” to
“Discussion.” - if not justified. Too
subjective (imho)
Research results with SCAN:
1. Field study: .77 TTs, .88 Liars at
best with Denial criterion.
….But, as with almost all field
studies, no ground truth!!!!! So,
fuggedaboudid!
2. Lab Study(So no ground truth
issue): Three crirteria utterly
failed to discriminate liars and
truth tellers!! So, fuggedaboudid!
Final field study…
 Again no ground truth!!
 Though they claim75% TT
accuracy, 80% liar accuracy.
 Great lack of consistency among
SCAN users: So, fuggedaboudid!
 Bottom SCAN line: Little empirical
support, little standardization.
 Thus the widespread use is stupid.
Based for openers
on Polygraph.
Polygraph:
 Does not mean “lie detector.”
 Means a machine with many
(poly) channels of visually
presented (graphical) information;
in inked paper or laptop display
format.
 A Polygraph machine can be and
is used in many fields of
Psychophysiology
2 Major Protocols currently
used :
 Comparison (“Control”) Question Test
(CQT)-- beloved by enforcers, ‘cuz it’s
easy to use (“Didja do it?”) and tends to
elicit confessions—but hated by
academics who prefer the
 Guilty Knowledge Test (GKT better
known as the Concealed Information
Test (CIT), since knowledge can’t be
guilty (just perps.)
 Both utilize autonomic arousal.
Why do us professors frown on
CQT?
 Pre-test interview cannot be
standardized.
 It compares arousal responses to
relevants. & “controls.”There can
never be true comparison control
questions in the scientific sense
of control.(Yet there could be
better versions.)
 High false positive rate (which CIA
et al. don’t care about….)
Why do we love CIT?
 It compares arousal (?) responses to critical
(probe) and non-critical (irrelevant) stimuli.
 There are usually 6 or more multiple choice
questions with 1 + 5 choices per question.
(Throw away 1st.) Thus for each item, the
P(chance hit) = 1/5=.2
 For 6 INDEPENDENT items, probability of
subject hitting on all 6 by chance =P = .2^6 =
.2 x .2 x .2 x .2 x .2 x .2. Use Bernoulli for
chance hits on < 6. Thus you can reduce the
false positive P as low as you like by adding
independent items (not always so easy).
……and bootstrapping
…..from 4 sites on the scalp:
the class to the
movie…called, “ERP
Averaging”
Since averages are so much
cleaner than single sweeps,
…and show the true stimulus-evoked
event that is time-locked to the
eliciting event, and are more noise free,
….it obviously makes sense to compare
averages rather than single sweeps,
that is, to do analysis, like t-tests on
averages.
People did that in comparing group
ERPs or grand averages.
For example,
 The schizophrenic group average
versus the normal average
 or the well-trained group average
P300 vs. that of the untrained
group.
 Remember, in a group, each
subject has an average ERP.
 ….but within a single subject,
there are only single sweeps to
compare
In Bootsrapping…
 …..the original set of single
sweeps is repeatedly randomly
sampled –but with replacement—
…yielding multiple averages in a
single subject.
 Let’s say there are 6 repetitions of
sampling of 18 single sweeps:
….that look like real average of
original set but with variations
Correlation Example
Let's assume that we want to look at the relationship between two variables
Person
Height
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
68
71
62
75
58
60
67
68
71
69
68
67
63
62
60
63
65
67
63
Self
Esteem
4.1
4.6
3.8
4.4
3.2
3.1
3.8
4.1
4.3
3.7
3.5
3.2
3.7
3.3
3.4
4.0
4.1
3.8
3.4
Now we're ready to compute the correlation value. The formula for the correlation is:
Remember 3ST protocol:
 Probe (P) is the guilty item (stolen item)
 Irrelevants (Ir) are non-stolen items
 Target (T)is a IR with unique response
How P300 amplitude is
supposed to catch Liars:
1)P>I (‘BAD’)
2)P-TR corr
>P-I corr(‘BC-AD’)
1)P=I
2)P-I corr
>P-TR corr