I t- -- >- - -

advertisement
--
6.541J Handout
3/11/04
-/
6.i
Y
ok
Ckxahmpbae~
S0ynzp-e
OVrt P
Ce.AUx
vdHA
%-
DHA
VoeA as i'oteA
cT
_
I
_
1--
I.
e If-
t--- >-- -
L.0]+81#1
S
lW
o- 2M--L
4
3
I
- fS
v
stop .
2
Sin
( hiai
500
naFM
I... -... (mn
,
-30 Lt5
2
52
100
b
I
150
200
TIME (ms)
250
Figure 8.74 Acoustic data from the syllable /dha/ produced by an adult male Bengali speaker.
The spectrogram is displayed at the top left. To the right of the spectrogram are spectra sampled
at five points throughout the utterance: (1) in the prevoicing, (2) at the consonant release, (3)
during the aspiration, (4) near the onset of glottal vibration, and (5) 30 ms after spectrum 4.
Different time windows were used to calculate the spectra, as shown on the waveforms. and in
the case of (2) and (3) the spectra are averages over the times indicated beside the spectra. Also
shown (below the spectrogram) are plots of the first- and second-formant frequencies vs. time
and the amplitudes of the peaks in the F2 and F3 regions, for the time interval following the
vertical lines indicate the times of release and of onset of glottal vibration
consonant release. -The
-I
fi
?-i . !
DI
,
Courtesy of MIT Press. Used with permission.
0.2 s
' Figure 8.73 The curve gives a measure of the glottal opening (as sensed using a photoelectric
glottograph device) during the voiced aspirated stop consonant /bh/ in the context fibhi/, produced by a Hindi speaker. The lower horizontal bar shows the regions of voicing (solid), consonant closure (open), and frication and aspiration (stippled). (Adapted from Dixit, 1975.)
Courtesy of MIT Press. Used with permission.
--------
�-
rY4
OcYV
Courtesy of MIT Press. Used with permission.
Courtesy of MIT Press. Used with permission.
Tones On Single Vowels
A
Arc
ne
,'
{T
Tones (Thai)
Frequency in CPS
140
c,
a
130
0
z
z
120
w
aL
LL.
110
100
90
DURATION
0FIG. 3.5 TONES ON 50SINGLE VOWELS 100%
Duration
muttu
7
6 P
-5
annh
aplra\
I
-4
U
z
wq
6entA
( Ma
i
i
0
stDpCs
ol
Y')
L
10
Jsel
l0 msec
FIG. 20.5. These spectrograms and waveforms show the distinction between dental It] and apical [t] in Malayalam. The waveforms at the bottom
illustrate the difference in abruptness of the release for the two consonants.
The spectrograms above illustrate differences in F2 immediately preceding and following the consonantal closure, indicating a difference in tongue
body position for the two classes of consonants.
ore-In=--b
wo
A
4320 ~
B
4000 ~
3960 ~
C
3600 ~
2400 ~
3600 ~
3240 ~
3000 ~
2880 ~
1200 ~
2720
0~
2520 ~
2200
2160 ~
2000 ~
1800 ~
0 0.1 0.2 0.3 0.4
1830
Time (Seconds)
1320
1440 ~
1080 ~
960
1000 ~
720 ~
360 ~
270
1
As In
Bee
0.015 Sec.
360
e
As In
Rate
720
540
�
As In
Yet
540
a
G
As In As In
Ask
Jaw
(Eastern)
720
360
o
As In
Go
600
270
u
As In
Tooth
Perception of Stop Bursts
Showing frequency-position (A) of the schematic stops; (B) of the vowel-formats;
and (C) one of the schematic syllables.
4320
T
3960
3400
Frequency
3240
2880
2520
2160
K
1800
P
P
1440
1080
720
360
P
i
e
�
a
G
o
Map of the areas in which the judgment p, t, or k predominates.
u
'_Concept of Formant "Locus"
Synthetic
spectrograms
showing second-formant
"\ocWt s
t o- (cOA&Vt
ConCLep
transitions that produce the voiced stops before various vowels.
24001
o
bec
be
61
b1
z 2400
Frequency in CPS
-
4
0
_
600
0a 3000oo
0
M/
>
b
t
3000
2400
1200]
1800
60
1200
b�
be
A'
b
boa
bo
ba
bG
bo
bu
bu
dd
800
a
3000
z
r200
2400
0J 600
W
1800
1200
\4w m
N
'0' N-
h
_
di
de
de
0
do
da
.
do
du
-O3 Se _
600
2400
0 800-
N%~
_
di
d�
de
da
do
dG
du
g
r200
14I
N·
3000
-.
A
~ '. - *
0.3 sec
2400
1800
Figure 5.1
1200
Synthetic spectrograms showing second-formant transitions that produce the voiced stops before various
600
vowels.
0
g�
g1
ge
ga
go
gu
gG
300
A
.
I
0
1tx.L
I
_J~~~-
B
L
-z*
24C)0-
I. , :
0.
0
1.:'I-,
-----
Con
;
1 W==
Isee
C>.
Z t80
Stimulus patterns
(shown schematically) and identifications with and
without a silent
- 8440
z
D
12C
interval between the second-formant locus and the onset of the transition.
-
i~~~~~~
§ (B)'I~
60
3000
(A)
0$
O'
Frequency in CPS
0
2400
1800
1200
600
0
0.1
o0.
}
o3
o
}
fe.
,coep
Silent
'Interval
0.
720
o.z
o0.3
F2
2520 CPS
2400
b
2280
Figure 5.4
2100
2040 the
Stimulus patterns (shown schematically) and identifications with and without a silent interval between
at the d
second-formant locus and the onset of the transition. (A) Second-formant transitions that originate1920
1900
d together with the first formant with which eachdwas paired.
locus and go to various steady-state levels,
1660 (B)
and the
The same patterns, except that a silent interval of 50 msec has been introduced between the locus1640
1440
is no silent interval in the first formant, but that it has been displaced
start of the transition. Note that there g
1320
Similar
along the time scale so that its onset is, as in (A), simultaneous with that of the second formant.1200
1080
introadjustments in time of onset were made for all the silent intervals tested in this experiment. The
960
d formant always weakened the consonant, but did not affect
duction of a silent interval into the first
840 its
720
identity.
TIME IN SECONDS
}
}
}
0
0.1
0.2
0.3
0
0.1
0.2
0.3
Time in Seconds
--- ·----------------
-·---�--·--··--·-�-�
F1
400 CPS
Voice Onset Time (VOT)
Illustrations of the patterns used to study the effects of the first-formant
time of onset on the perceived distinction between /d/ and /t/. The numbers
above the patterns show the extent of cutback in msec.
foice onet
400U,
Frequency in CPS
aO
z
z
Uo
4800
'.
10
o
o20
10
2400-
20
II
2400
1~~~~~~~200-~~1
1200
0
.
0
3600
C,
Cc'
IlI
0
(VODT
tgie
,
-,%
0
0
100 o200
,
.
00
400
-
"
0
100 200 300
400 IN MSEC.0
TIME
100
40 in msec.
Time
30
30
1
40
260
0
0
100
100 200
200
0
so0
50
6
200
100 200
60
-~~~~~~AE
1600 26o00
6
tbo z6o
6
60o 6o
Figure 8.4
Illustrations of the patterns used to study the effects of the first-formant time of onset on the perceived
0 100 200
0 100 200
0 100 200
0 100 200
distinction between /d/ and /t/. The numbers above the patterns show the extent of cutback in msec.
100
U,
w
z
0
80
.L
Iz
gC
laJ
Percent d Responses
U,
W
60
/d/ vs /t/
40
20
0
FIRST-FORMANT
0
10TIME 20DELAY30 IN MSEC.
40
50
60
Figure 8.5
First-Formant
Time Delay
msec.
Responses of 27 phonetically naive
listeners to the patterns
in whichintime
of onset of the first formant was
varied (see figure 8.4). Each listener judged each stimulus six times, making a total of 162 judgments per
stimulus. The data are shown as the percentage of times each stimulus was heard as /d/.
Responses of 27 phonetically naive listeners to the patterns in which time of onset of the first-formant was
varied. Each listener judged each stimulus six times, making a total of 162 judgments per stimulus.
The data are shown as the percentage of times each stimulus was heard as /d/.
-��---------�
Rate of formant movement /w-b/
Illustrations of the spectrographic patterns used to produce the stimuli of Experiment I.
The first four patterns in each row show how the tempo of the transitions was varied. At the extreme right
of each row is a complete stimulus pattern, i.e, transition plus steady-state vowel, for the longest
duration of transition tested.
2400
10
20
b �- w� -u�series
30
40
300 msec.
Frequency in CPS
1800
1200
600
0
g �- j �- i�series
2400
1800
1200
10
20
30
300 msec.
40
600
0
Judgments in percent
Time in msec.
100
100
80
80
b
g
60
60
40
40
20
20
g �vs. j �
b � vs. w�
0
0
30
60
90
120
0
0
30
60
90
120
Transition duration in msec.
The distinction between stop consonant and semivowel as a function of the tempo of transition.
These curves show the percentage of stop consonant responses, and are based on the judgments of
separate groups of 59 and 60 listeners for the b-w and g-j distinctions, respectively. Tempo is here
expressed in terms of duration.
Example of categorical perception
Illustrations of the spectrographic patterns from which the stimuli of the experiment
were produced. Pattern 14, at the lower right, is complete in all respects.
Efple o
Frequency in CPS
1
&teocrcal prepton
2
I
3
4
5
6
7
6
5
4
3
2
7
2400
C 2400-
1800
r~
r
/PE
)t=
'00M
Am
AM
IAm
r_
___
r_
_
I_
I
z 1800-
1200
O 1200'
z
w 600
n
600-
0
I
n.-
IL.
8
_
8
9
9
10
10
11
11
12
12
13
~~~~~~~~~~~~I_
s-I
13
14
·
14
-I
r
TIME IN MSEC.
Time in msec.
V
-2400
2400
-1800
1800
-1200
1200
-600
600
__
U
,{4S-
0
300
Figure 10.1
Illustrations of the spectrographic patterns from which the stimuli of the experiment were produced. Pattern 14, at the lower right, is complete in all respects.
Identification Data
75
b
d
g
50
25
0
100
Percent Correct
Percent Identification
100
75
50
0
1 2 3 4 5 6 7 8 9 10 11 12 13 14
Two-Step
Discrimination
25
1 2 3 4 5 6 7 8 9 10 11 12 13 14
Stimulus Value
Value of "A" Stimulus
100
100
75
75
50
25
0
One-Step
Discrimination
1 2 3 4 5 6 7 8 9 10 1112 13
Value of "A" Stimulus
Percent Correct
Percent Correct
Obtained
Predicted
50
25
0
Three-Step
Discrimination
1 2 3 4 5 6 7 8 9 10 11
Value of "A" Stimulus
Labeling and discrimination data for a single S, CD.
The values given on the ordinates in terms of percentage are based on 32 and 42
judgments for the labeling and discrimination data, respectively.
�_____
-IV\ecau
adift-L
o
e
tlIWP--
IOAAu,
1
(
FREQUENCY IN CYCLES PER SECOND
Figure 4.30 The upper curve plots the width AF of the critical band as a function of the center
frequency of the band, as measured in masking experiments that manipulate the bandwidth of
the masker. The lower curve gives the bandwidths calculated from the critical ratio, as in figure
4.25. (From Greenwood, 1961a.)
Courtesy of MIT Press. Used with permission.
kHz
4
nMado
A 2-
L4aift-cT
-
3
r-1
-
=
-
2
C
=
o
1
e
y
ae
-t-
o
o
FORMANT OF REFERENCE
MEAN MATCHED F2
u
VOWEL
Figure 4.34 The horizontal lines give the first four formant frequencies of a set of Swedish
vowels used in a matching experiment. The boxes show the mean frequency of the second
formant (F2') of a two-formant stimulus that gave a best match to the full vowel. Fl of the
two-formant matching stimulus was set to Fl of the vowel. (From Carlson et al, 1970.)
Courtesy of MIT Press. Used with permission.
D CM&M'V0V,)&J e
DChhtSb
FREQUENCY
Figure 4.32 Schematization of the spectrum envelope of a two-formant sound used in the
matching experiments of Chistovich and Lublinskaya (1979). The frequency spacing Fb - Fa and
the relative amplitudes of the spectral prominences were manipulated.
Courtesy of MIT Press. Used with permission.
F'
kHz
1.5
10
iiIaxv
0.5
0
I
I I
I
I
I
I
I
I
I
I
I
C* I's
1.5
--------- ----------------------------
____ _
__ ____ -----1.0
0.5
0
_ I
-20
I
-10
-
I
I
I
I
0
10
20
30
dB
_ I
-30
I
-20
I
-10
I
0
I
10
I
20
dB
Figure 4.33 Results of the matching experiments in which the frequency P of a one-formant
vowel is adjusted to match a two-formant vowel. The spacing between the two formant frequencies (shown by dashed lines) is less than 3.5 bark for the left column and more than 3.5 bark
for the right column. The points give the mean and a measure of variability for the judgments as
a function of the relative amplitude A2/Ai of the formant peaks for two subjects: L.C. (top) and
VL. (bottom). (From Chistovich and Lublinskaya, 1979.)
Courtesy of MIT Press. Used with permission.
---------~~~~~~~~~~~~~~~~~~~~~~~~~~
-
-3
­
LO
CO
i
t)
E
a,
o
E
I0
Cn
E
CD
'­F:
40
Co
c,
oQ
0
c..
C)
CL
0C)
C
)
wn.
1
cj
0
0
000
o
)
X
L-.I I
(ZH)
(9P) dVJV
O3Hd
_~~~~~I__
~~~~~~~~~~~
I~~~~C~~~~~I~~~~~C~~~~~
__
I1_~~~~~~~~~~~~~~~~E
0
~
~
~
­
LO
CO
C
C
C
C
Ea)
a)
C
C
O
0
T
C
C
C
c.
*1*
I
Cl
C
Co
Co
CM
,U)
E
L/
2
0
I:
E
C
..
C
C'
n'-a
c
c
09
c
CD
LL
C)
0­
or)
-j
I
w
-j
C:)
o
L--j
,
(]P) dWV
(ZHl) 03-1
----------�-s�---
O
�
U
C)
to
cn
CL2u)
LL~ 1
. E aM
c
>.Si
r 't 'It
·E
·
·
l
;
iI
p­
I - (, ll
I I
I
f 1
I
, (-41
N
C,
0)
-
E
0
o
It
Irl",
I C
C,
co
N
I
4­
w
Lo
LL
r
-.
cu
cr
I
k
r
I
o
O)
C)
C)
a
_
I., C
x
-i
C
,­CL
Co
a)
U)
t8P) DW
Nm
Nm
14)
LO
.o
cl,
Q~ c\
II
cn 6 11 0c
c1
0E
. LL C
CB
i
Co
.. * CO
"
L
c)
E
e,
(1)
N
CM
o
o
C)
o
0
N
0
0
11 F
N
LO
cnO11,u
4­
4
CO
L1
,...--
C)
-- _.
~
ct-i
C
_
o)
=,~
i
-)
(gP) DVNV
C)
(D
E
U-
2r
__
I
9
CD
_
_
I
a.
v,
(BP) !DVW
a)
6
LO
cn
n
COU)
LL
Ucl
,
' I't
'.D FE
.O N'MO
>:
)
-D
co
/ _!~~~I
- a
I
i
O
¢
N
E
w
O0
LL
I
w
\1 =5
rO
o6
O
IN
a)
o
u)
Cl
(BP)
!WvvI
LO
to
c)
cO
E
LJ
oI
E
o
0
0)
o
00
a:
Q
n
co
._.
a)
. .
U)
(0
n
X
:3
a
',,
a)
)
0')
0u
-J
v
---------------
-----
--- ��-
LO
C3)
LO
CO
(b
E
U0
E
LO
t­
O
0
4-.
Ol
o0
0)
I4­
0
n-
cr
0)
LU
1.7
cu
a)
a)
a)
x
4-
a)
4­
x
0
a)
w_
le,
a.
0)
0Y
to
CO
x
a)
LO
CO
cn
E
E
CM]
oo
0
C'
0
D.
o0
(D1 ..
a) 0)
a cn
oi
0
w
3
a)
cn
de
tpJ UvtVW
IV
o0
.4­
3
x
a)
Gr
Download