Acoustic Characteris..

advertisement
Acoustic Characteristics of
Consonants
Robert A. Prosek, Ph.D.
CSD 301
Consonants
• Consonant articulations are more complex than vowel
articulations
• consonants are usually described in groups according
to their significant acoustic and articulatory properties
• stops
• fricatives
• affricates
• nasals
• glides
• liquids
Stop Consonants (1)
• Stop consonants are characterized by a complete
closure somewhere in the vocal tract
• Three phases
• closure
• release
• transition
• reverse the steps for postvocalic stops
Stop Consonants (2)
• Stop gap
• this event corresponds to the complete closure of the
vocal tract
• minimum radiated acoustic energy
• silence for voiceless stops
• voice bar for voiced stops
• 50 - 150 ms
Stop Consonants (3)
• Stop release (burst)
• pressure has been rising behind the obstruction
• rapid release produces a transient
• 20 - 30 ms
• thus, suitable temporal resolution is needed
• voiceless stops follow the burst with frication
• noise generated at the place of articulation
• low frequency for /p/ (500 - 1500 Hz) (falling spectrum)
• high frequency for /t/ (above 4 kHz) (rising spectrum)
• mid-frequency for /k/ (1.5 - 4 kHz) (peaked spectrum)
Stop Consonants (4)
• Cues for voicing
• /p t k/ are phonetically distinguished from /b d g/ by
voicing
• VOT is the interval between the release of the stop
and the onset of vocal fold vibration
• for /b d g/ VOT from -20 to +20 ms with a mean of
10 ms
• for /p t k/ VOT from 25 to 80 ms with a mean of 45
ms
• voice bar for intervocalic stops
Stop Consonants (5)
• Formant transitions
• articulatory movement from stop to vowel entails a
formant movement
• as the resonating chamber of the vocal tract changes,
the formant frequencies change
• formant transitions are important for perception
• formant transitions are approximately 50 ms in
duration
Stop Consonants (6)
• Formant transitions (continued)
• F1 usually rises for the stop consonants
• F2 and F3 are not so simple
• for /p b/ F2 and F3 rise slightly
• for /t d/ F2 falls and F3 rises slightly
• for /k g/ F2 and F3 separate steeply and rapidly
• However, a given stop is associated with a variety of
transitions (see Fig. 5-14)
• there is no fixed transition pattern for perception
• Cue trading in stop consonants
Fricatives
• Articulation
• Narrow constriction in the vocal tract
• When air flow rate is high, turbulence results
• Turbulence is complex, unpredictable air flow
• Turbulent airflow is perceived as turbulent noise
• Fricatives have a relatively long duration
• Fricatives are divided into
• sibilants (stridents)
• nonsibilants (nonstridents)
Fricatives (2)
•
Sibilants
•
Intense noise
•
Differentiated among themselves by
•
•
•
•
voicing
•
noise spectrum
Voicing
•
pulses (glottal closures) for /z ʒ/
•
no pulses for /s ʃ/
Noise spectrum
•
Alveolar sibilants have higher frequency energy
•
Palatal sibilants have energy down to 3 kHz
•
Spectral irregularities aren’t important in perception
Formant transitions
•
Formant transition locations depend on the articulation, but the transitions are not important perceptually for sibilants
•
Fricatives (3)
Nonsibilants /f v θ ð h/
•
Less noise energy than sibilants
•
Voiced nonsibilants will have quasi-periodic pulses
•
Noise spectra are
•
fairly flat
•
diffuse
•
The relationship between noise spectrum nonsibilant
identification is not known
•
Formant transitions play the primary role in perception
•
Noise spectrum may play a secondary role
Fricatives (4)
• Acoustical needs for fricatives
• Measures that are economical
• Economical
• Valid
• Reliable
• Problems
• Ambient noise
• Filtering values
Affricates
• Described as a combination of stop and fricative
• /ʧ ʤ/
• Articulation
• complete obstruction in the vocal tract
• intraoral pressure builds up
• release to generate fricative noise
• Acoustic features
• rise time
• duration of frication
• relative amplitude in third formant region
• stop gap
Nasals
•
•
Articulation
•
complete closure in vocal tract
•
sound radiated through nasal cavities
•
sometimes called nasal stops
•
/m n ŋ/
Acoustics
•
Nasal murmur - sound of a nasal
•
associated strictly with nasal radiation of sound
•
there are many spectral peaks, but most have low amplitude
•
antiformants
•
nasal formant
•
low frequency (~300 Hz)
•
highest energy
Nasals (2)
• Nasal formant (continued)
• consonant energy, overall, is reduced because
• higher formants have reduced energy
• Other acoustic features
• highly damped formants (broad bandwidths)
• formant transitions in connected speech
Glide Consonants
• Also called approximants and semivowels
• /w ʲ/
• Articulation
• gradual articulatory motion
• narrow, but not closed, vocal tract
• Acoustics
• Formants
• for /w/
• F1 and F2 are both low
• for /ʲ/
• low F1 and high F2
Liquid Consonants
• Also included as semivowels
• /ɹ l/
• Characterized by
• rapid movements
• formant structure
• F3 is the main difference
• Antiformants for /l/
Download