Introduction & Acoustics Review

advertisement
SPPA 4030
Speech Science
Stephen M. Tasko Ph.D. CCC-SLP
Topic 1: The Speech Chain
Learning Objectives
• Outline the general sequence of
biological/physical events that occur from speech
formulation to speech perception.
• Describe the different types of information
content embedded within the speech signal.
• Know and describe the different branches of
physics and biology used to inform basic
mechanisms of speech production and
perception.
The Speech Chain (Denes & Pinson, 1993)
What information is embedded in the
speech signal?
•
•
•
•
•
Phonetic information
Affective information
Personal information
Transmittal information
Diagnostic Information
Branches of science employed to
understand speech communication
Physics
• Acoustics
• Aerodynamics
• Kinematics
• Dynamics
Biology
• Anatomy
–
–
–
–
Gross anatomy
Microscopic anatomy
Molecular biology
Neuroimaging
• Physiology
– Electrophysiology
Physical Quantities Review
An Independent Learning Activity
Learning Objectives
• Distinguish between basic and derived units
• Distinguish between scalar and vector
quantities
• Define a range of derived quantities with
special emphasis on displacement, velocity,
acceleration, force, pressure, intensity,
resistance and their physical relationship
Assignment 1
• See Assignments section of course website
• Due September 12, 2013
Topic 2: The Source-Filter Theory of
Speech Production: An Introduction
Learning Objectives
• Outline the key assumptions of the source filter
theory of speech production
• Distinguish between the source signal, filter
characteristics, and the output signal
• Use a range of examples to demonstrate
understanding of the source filter theory
• Distinguish to role that different vocal tract
structures play in speech sound generation and
speech sound filtering
Producing Speech
• The vocal tract can be
conceived as a set of
interconnecting tubes
and valves.
• Speech production is
achieved through the
systematic regulation of
air pressures and flows
within the vocal tract.
Source-Filter Theory of
Speech Production
• The sounds we hear as speech is the product
of a sound source that has undergone filtering
by the vocal tract
• source and the filter may be considered to be
independent of each other
Vocal tract is a tube that can vary its shape
From Titze (1994)
Source Filter Theory
Source
(Phonation)
Filter
(Resonator)
Speech
(What We Hear)
Input Spectrum
Frequency Response
Curve
Output Spectrum
Same Source, Different Filter
Different Source, Same Filters
White Noise
Different Source, Same Filters
(Human)
burp
Different Source, Same Filters
(Human)
snore
Different Source, Same Filters
(Human)
Lip buzz
Different Source, Same Filters
(Human)
?
Different Source, Same Filters
(Non-Human)
sheep
Different Source, Same Filters
(Non-Human)
accordion
Different Source, Same Filters
(Non-Human)
If it quacks like a duck…
Source Filter Theory Applied:
Alaryngeal Speech
Source-Filter Theory Applied:
Esophageal Insufflation Test
Source-Filter Theory Applied:
Tracheoesophageal (TE) Speech
Source Filter Theory Applied:
The Talkbox
Source-Filter Theory Applied:
The Talkbox
http://www.youtube.com/watch?v=YS3gAVNlceg
Topic 3: A Brief Review of
Physical Acoustics
Learning Objectives
• Outline the physical processes underlying simple harmonic motion
using the mass-spring model
• Describe the molecular basis of sound wave propagation
• Define the key characteristics of sinusoidal motion including
– Amplitude: instantaneous, peak, peak-to-peak, root-mean-square
(RMS), the decibel scale
– Frequency/period including units of measure
– Phase
– Wavelength
• Briefly describe the relation between the sine wave and uniform
circular motion
• Outline the relationship between the frequency and wavelength of
a sound wave
Spring Mass Model
• Mass (inertia)
– Newton’s first law of motion
– Opposition to
acceleration/deceleration
• Elasticity
– Opposition to displacement
– Rest position
– Recoil force
• Friction
http://phet.colorado.edu/en/simulation/mass-spring-lab
What is sound?
• It may be defined as the propagation of a
pressure wave in space and time.
• Sound must propagate through a medium
Sound-conducting media
• Medium is composed of
molecules
• Molecules have “wiggle room”
• Molecules exhibit random
motion
• Molecules can exert pressure
A
B
Model of air molecule vibration
(Time 1)
Air molecules sitting side by side
Rest positions
Model of air molecule vibration
(Time 2)
Model of air molecule vibration
(Time 3)
Model of air molecule vibration
(Time 4)
Model of air molecule vibration
(Time 5)
Model of air molecule vibration
a
Time
1
2
3
4
5
Distance
b
c
d
Wave action of molecular
motion
Time
1
2
3
4
5
Distance
Amplitude waveform
Position
Time
Amplitude waveform
Amplitude
Time
Where is pressure in this model?
Time
1
2
3
4
5
Pressure measuring device at
a specific location
Pressure waveform
Sound Pressure
+
0
Ambient Pressure
Time
Measuring Sound
•
•
•
•
Amplitude
Frequency
Phase
Wavelength
Measuring Sound: Signal Amplitude
Ways to measure it
• Instantaneous
• Peak
• Peak-to-peak
• Root mean square
(RMS)
• Decibel –see later
Sound Pressure
+
0
Time
Measuring Sound: Signal Amplitude
• Root mean square (RMS)
What units do we use to measure
signal amplitude?
• Pressure: Force/area
• Intensity = Power/area where
power=work/time & work=Force*distance
• Intensity is proportionate to Pressure2
Brief Review: The decibel scale
• decibel scale typically used to represent signal
amplitude
• Many common measurement scales are
absolute and linear
• However, the decibel scale is relative and
logarithmic
Absolute vs. relative measurement
• Relative measures are a ratio of a measure to
some reference
• Relative scales can be referenced to anything
you want.
• decibel scale doesn’t measure amplitude
(intensity or pressure) absolutely, but as a ratio
of some reference value.
Typical reference values
• Intensity
– 10-12 watts/m2
– Threshold for normal hearing at 1000 Hz
• Sound Pressure Level (SPL)
– 20 micropascals
However…
• You can reference intensity/pressure to
anything you want
For example,
• Post therapy to pre therapy
• Sick people to healthy people
• Sound A to sound B
Linear vs. logarithmic
• Linear scale: 1,2,3…
• For example, the difference between 2 and
4 is the same as the difference between 8
and 10.
• We say these are additive
Linear vs. logarithmic
• Logarithmic scales are multiplicative
• Recall from high school math and hearing science
10 = 101 = 10 x 1
100 = 102 = 10 x 10
1000= 103 = 10 x 10 x 10
0.1 = 10-1 = 1/10 x 1
Logarithmic scales use the exponents for the number scale
log1010 = 1
log10100 = 2
log 101000=3
log 100.1 = -1
Logarithmic Scale
• base doesn’t have to be 10
• In the natural sciences, the base is often 2.7…
or e
Logarithmic Scale
• Why use such a complicated scale?
– logarithmic scale squeezes a very wide range of
magnitudes into a relatively compact scale
– this is roughly how our hearing works in that a
logarithmic scales matches our perception of
loudness change
Combining the idea of logarithmic and
relative…
bel= log 10(Im/ Ir)
Im –measured intensity
Ir – reference intensity
A bel is pretty big, so we tend to use decibel where
deci is 1/10. So 10 decibels makes one bel
dBIL = 10log 10(Im/ Ir)
Intensity vs. Pressure
• Intensity is difficult to measure.
• Pressure is easy to measure – a microphone is
a pressure measuring device.
• Intensity is proportionate to Pressure2
Extending the formula to pressure
Using some logrithmic tricks, this translates our
equation for the decibel to
dBSPL= (2)(10)log 10(Pm/ Pr) = 20log 10(Pm/ Pr)
Measuring Sound: Frequency/period
Sound Pressure
+
0
-
Period (T)
Time
Period (T): duration
of a single cycle
Frequency (F):
rate that cycle
repeats itself (1/T)
Measuring Sound: Frequency/period
• Absolute measure
– Cycles-per-second: Hertz (Hz)
• Relative measure
– Octave (double or halving of frequency)
– Semitones (12 semitones = 1 octave)
Phase: Uniform Circular Motion
Initiating a sound waves that differ
only in phase
A force is applied to molecule at frequency f and time t
same force applied at frequency f at time t+a where a < the period of vibration
Spatial variation in pressure
wave
wavelength () is the
distance covering
adjacent high and low
pressure regions
Spatial variation in pressure wave
Time
1
2
3
4
5
Spatial variation in pressure
wave
Relation between frequency and
wavelength
=c/F where
: wavelength
F: is the frequency
c: is sound speed in medium (35,000 cm/sec)
Learning Objectives
• Draw and describe time-domain and frequency-domain
representation of sound
• Distinguish between simple and complex sound sounds with
regard to physical characteristics and graphical
representations
• Distinguish between periodic and aperiodic sounds with
specific emphasis on terms such as fundamental
frequency/period, harmonics, and overtones
• Distinguish between continuous and transient sounds
• Describe how waves sum, define Fourier's theorem and be
able to describe the basics of Fourier analysis
Graphic representation of
sound
• Time domain
– Called a waveform
– Amplitude plotted as a
function of time
• Frequency domain
– Called a spectrum
– Amplitude spectrum
• amplitude vs. frequency
– Phase spectrum
• phase vs. frequency
– May be measured using a
variety of “window” sizes
Same sound, different graphs
Time domain
Frequency domain
From Hillenbrand
Classification of sounds
• Number of frequency components
– Simple
– Complex
• Relationship of frequency components
– Periodic
– Aperiodic
• Duration
– Continuous
– Transient
Simple periodic sound
• Simple: one frequency component
• Periodic: repeating pattern
• Completely characterized by
– amplitude
– period (frequency)
– phase
• Other names: sinusoid, simple harmonic
motion, pure tone
Simple periodic sound: Graphic
appearance
From Hillenbrand
Complex periodic sounds
•
•
•
•
Complex: > one frequency component
Periodic: repeating pattern
Continuous
Frequencies components have a special relation
– Lowest frequency: fundamental frequency
• Symbol: fo
• Frequency component with longest period
– Higher frequency components: harmonics
• integer (whole number) multiples of the fo
Complex periodic sounds: Graphic
appearance
• Time domain:
– repeating pattern of pressure change
– within the cycle, things look complex
• Frequency domain:
– spectral peaks at evenly spaced frequency
intervals
• Auditory impression: sounds ‘musical’
Complex periodic sounds: Graphic
appearance
From Hillenbrand
Amplitude
Amplitude
Glottal Source
Time
Frequency
Amplitude vs. Phase
Spectrum
Amplitude spectrum:
different
Phase spectrum:
same
From Hillenbrand
Amplitude vs. Phase
Spectrum
Amplitude spectrum:
same
Phase spectrum:
different
From Hillenbrand
(Complex) Aperiodic sounds
• Complex: > one frequency component
• Aperiodic: Does not repeat itself
• Frequency components are not systematically
related
• May be
– Continuous
– Transient
Aperiodic sounds: Graphic
appearance
• Time domain:
– no repeating pattern of pressure change
• Frequency domain:
– the spectrum is dense
– No “picket fence”
• Auditory impression: sounds ‘noisy’
Aperiodic sounds: Graphic
appearance
From Hillenbrand
Analysis of complex waves
• Waves can be summed
• Complex waves are the sum of simple waves
• Fourier: French Mathematician:
– Any complex waveform may be formed by summing sinusoids of
various frequency, amplitude and phase
• Fourier Analysis
– Provides a unique (only one) solution for a given sound signal
– Is reflected in the amplitude and phase spectrum of the signal
– Reveals the building blocks of complex waves, which are sinusoids
Learning Objectives
• Draw and differentiate the waveform and the
waveform envelope
• Draw and differentiate the amplitude
spectrum, the phase spectrum and the
spectrum envelope
• Differential between short-term spectra and
long-term average spectra.
The “envelope” of a sound
wave
• Waveform envelope:
– imaginary smooth line that follows the peak of the
amplitude of a sound pressure waveform
• Spectrum envelope:
– Imaginary smooth line drawn on top of the
amplitude spectrum
Waveform envelope
From Hillenbrand
Waveform envelope
Time
Amplitude
Spectrum envelope
Frequency
Thought Question
Can an aperiodic and complex
periodic sound have identical
spectrum envelopes?
Amplitude Spectrum:
Window Size
• “short-term” vs. “long-term average”
amplitude spectrum
“Instantaneous” Amplitude
Spectra
(Long Term) Average Amplitude Spectrum
Learning Objectives
• Describe how the amplitude spectrum and the spectrogram
are related.
• Identify the axis units of the spectrogram.
• Provide some advantages of the spectrogram over the
amplitude spectrum.
• Distinguish between a wide band and narrow band
spectrogram and outline the different information each
provides.
• Distinguish between a harmonic and a formant on a
spectrogram.
• Be able to draw stylized (highly simplified) spectrograms
based on spectra and spectrum envelopes.
The Spectrogram
Building a spectrogram
F
A
Rotate
90 degrees
F
A
Building a spectrogram
F
Rotate it so that
The amplitude is
Coming out of the
page
F
A
Time
This is really narrow because it is a slice in time
Building a spectrogram
Frequency
Dark bands
= amplitude
Peaks
Time
Two main types of
spectrograms
• Narrow-band spectrograms
– Akin to amplitude spectrums “lined up”
– Frequency resolution is really sharp
• Wide-band spectrograms
– Akin to spectrum envelopes “lined up”
– Frequency resolution not so sharp
Wide vs. Narrow Band
Spectrograms
Highlights harmonic structure
Highlights spectrum envelope
Learning Objectives
• Define an acoustic filter
• Draw and label a frequency response curve
• Draw and differentiate different types of acoustic
filters
• Define terms such as cutoff frequency, center
frequency, roll off rate, gain, and bandwidth
• Define and draw a basic filter system and relate that
to the source-filter theory of speech production
What is an “Acoustic” Filter
• holds back (attenuates) certain sounds and lets other
sounds through - selective.
Why might we be interested in
filters?
• Human vocal tract acts like a frequency selective
acoustic filter
• Human auditory system behaves as a frequency
selective filter
• helps us understand how speech is produced and
perceived.
Kinds of frequency selective
filters
Low-pass filters
– Lets low frequencies “pass through” and attenuates high
frequencies
High-pass filters
– Lets high frequencies “pass through” and attenuates low
frequencies
Band-pass filters
– Lets a particular frequency range “pass through” and
attenuates other frequencies
Low Pass Filters
Gain
+
low
Frequency
high
High Pass Filters
Gain
+
low
Frequency
high
Band Pass Filter
Gain
+
low
Frequency
high
Frequency Response Curve (FRC)
Center frequency
+
3 dB
Gain
passband
lower cutoff
frequency
upper cutoff
frequency
low
high
Frequency
Operation of a filter on a
signal
NOTE:
Amplitude spectrum describes a sound
Frequency response curve describes a filter
Learning Objectives
• Define resonance, free and forced vibration
• Describe how the pendulum and spring mass models
can help explain resonance.
• Outline how mass and stiffness influences the resonant
frequency of a mass spring system.
• Outline how acoustic resonators behave like acoustic
filters.
• Calculate resonant frequencies of a uniform tube based
on its physical dimensions.
• Describe how the wavelength of the sound determines
the resonant frequency of tube.
Free vibration
• objects tend to vibrate at a characteristic or
resonant frequency (RF)
Forced vibration
• A vibrating system can force a nearby system
into vibration
• The efficiency with which this is accomplished
is related to the similarity in the resonant
frequency (RF) of the two systems
Forced vibration
• If the RF of the two systems are the same, the
amplitude of forced vibration will be large
• If the RF of the two systems are quite
different, the amplitude of forced vibration
will be small or nonexistent
Resonance refers to
• The tendency for an object to vibrate at a
particular frequency or frequencies.
• The ability of a vibrating system to force
another system into vibration.
Back to the mass spring
model
• Vibratory frequency of
the mass spring
determined by
– Mass
– Stiffness of the spring
http://phet.colorado.edu/en/simulation/mass-spring-lab
Acoustic Resonance
• Ideas from mechanical resonance also applies
to acoustic systems
• Acoustic chambers will transmit sound
frequencies with more or less efficiency,
depending upon the physical characteristics
• Therefore, they act as filters, passing through
(and even amplifying) some frequencies and
attentuating others.
Acoustic Resonance
• And since they act as filters, they have most of
the same features of a filter, even though we
might use different names.
• Center frequency is often termed the resonant
frequency.
• Frequency response curve often termed the
resonance curve.
Helmholtz Resonator
Actions of a Helmholtz
Resonator
Other Acoustic Resonators: Tube
Resonators
• Uniform tubes: Factors that influence resonance
– Length.
– Cross-sectional area along its length.
– Whether it is closed at either or both ends.
Stephen M. Tasko
Uniform tube, closed at one end
Uniform tube, closed at one end
Uniform tube closed at one end
First resonance or formant Higher resonant/formant
frequencies are odd
multiples of F1
F1 = c/4l
For example,
Where
• F1 = (c/4l )*1
c=speed of sound
(35,000 cm/sec)
• F2 = (c/4l )*3
l = length of the tube
• F3 = (c/4l )*5
males ~ 17.5 cm
females ~ 14 cm
Stephen M. Tasko
Comparing Helmholtz and
tube resonators
Resonator Features
Sharply tuned
Broadly tuned
Resonator Features
Gain
Frequency
An example of the resonance characteristics
of the human vocal tract
Download