Speech Perception: Models of Spoken Word Recognition

advertisement
1
Speech Perception: Models of Spoken Word Recognition
Today’s class intends to:
1. discuss general phenomena in spoken word recognition; and
2. introduce different models of speech recognition:
a. Motor Theory
b. Cohort Model
c. Trace
Factors in spoken word recognition
1. Frequency Effect: High frequency words are recognized more easily than lowfrequency words (e.g. ophthalmologist vs. eye doctor)
2. Phonological neighborhood effect:
a. measure: number of different phoneme in a word
b. word recognition is more difficult for words for which there are many
similar sounding words in the lexicon (e.g. mail: rail, bail, wail, hail...)
3. Frequency * phonological neighborhood interaction:
 A high frequency word with low frequency neighbors is easy to recognize
(e.g. dog vs. {log, bog, ...}.
 A low-frequency word with high-frequency neighbors is more difficult to
recognize (e.g. wail vs. {tail, mail, rail, sail, ...}
4. Context effect (easier to identify word in context vs. by itself)
5. Familiarity effects
a. meaningful sentence context (easier to identify word in meaningful context
vs. meaningless context)
b. speaking rate information (better recognition of words spoken at the same
rate vs. varied rate)
c. characteristics of individual speakers (easier to recognize words in noise
when spoken by a familiar voice)
Motor Theory
1. Liberman and colleagues at Haskins Lab (1967, 1970)
2. Background assumption: speech is a special type of auditory stimulus for human
beings; when we are exposed to it, we evaluate speech, as opposed to music or other
nonspeech sounds (i.e. speech perception is innate and species specific – it is a
uniquely human property)
3. Idea: At some point in the speech perception process, speech signals are interpreted
by reference to motor speech movements
2
4. Criticism: babies can perceive speech sounds while they cannot properly produce
them; mute people can understand speech perfectly well; gap between perception
and production exists.
5. McGurk Effect: integrating visual and auditory information
Cohort Model
1. Marslen-Wilson and Welsh (1978)
2. Motivation:
a. shadowing – fluent restorations
b. cross-modal priming
3. Three stages of word recognition:
“At the end of the drinking competition, the lorrydriver was the last one to sta...”
Access stage
perceptual representation used to activate lexical
items, thus generating a candidate set of items;
the cohort

Selection stage
one item only is chosen from this set

Integration stage
in which the semantic and syntactic properties of
the chosen word are utilized
4. All possible candidate words in sound sequence (i.e. the cohort) become active early
in word presentation
5. Elimination from cohort:
a. because words don’t match (bottom-up) or
b. because they are inconsistent with context (top-down)
6. Processing continues until all but one of the words are eliminated (uniqueness point)
and the survivor recognized (recognition point)
7. Criticism:
a. Human’s perception system is resilient to errors – it recovers from errors;
the current cohort model is not compatible with the recovery from errors
(once a lexical item is eliminated, it stays eliminated until the end)
8. Later development of the theory:
a. all-or-none (or on or off) elimination is replaced by gradual elimination (to
account for the ability of the system to recover from errors)
b. Role of context is less emphasized; in fact, context effect used only in the
integration level (each level is autonomous)
Download