Ultrasound-based tongue root imaging and measurement James M Scobbie QMU With thanks to collaborators Jane Stuart-Smith, Marianne Pouplier, Alan Wrench, Eleanor Lawson, Olga Gordeeva • Pros and cons of Ultrasound Tongue Imaging • EPG/UTI experiment on English /l/ – Alveolar contact or vocalisation – Light and dark allophones of /l/ • The ECB08 UTI corpus – Scottish derhoticisation and articulation of /r/ – Vowel system – A handful of /l/ again… • Demo of AAA software Introduction 2 • From qualitative “transcription” to quantitative laboratory-based studies with stabilisation UTI 3 • Pro – Tongue root to blade in one image – Instant, real-time, easy, safe, cheap – Qualitative and quantitative analysis – Can be combined with other techniques • Con – Image quality is variable – Hardly any constriction or info on passive articulator – Frame rate of video output is only ~30Hz (~33ms) – Synchronisation with acoustics is problematic – Quantitative analysis is time-consuming and as yet poorly developed… what to measure? Pros and cons of UTI 4 Future: corrected high speed data 5 • /l/ is lighter in onset, darker in coda • Many accents have “vocalisation” in coda • EPG + UTI study of 10 speakers – UTI image quality uniformly awful – EPG results very interesting – Context was /i/+/l/ (+ {/b/, /h/, /l/}) +/i/ • Pee leewards, peel beavers, peel heaps of, etc. • EPG results – Reduction or loss of alveolar contact in codas – Reduced palatal contact (compared to /i/) due to /l/ English /l/ 6 • Alveolar contact in orange, palatal in green • S2 typical in losing palatal contact in onset (can we pee leeward in a gentle breeze) Example onset 7 • No alveolar contact, more palatal contact (can we peel BBC advertising from the shop window) Example coda_b retraction 8 • • • • E + S1: light onset and dark coda in palatality Scots S2,3,4 show darker (less [i]-like) onset Question 1: what about intergestural timing? Question 2: what about the pharyngeal aspect of darkness rather than loss of palatality? Onset ambi Coda_b 100% 60% 40% 40% 20% 0% 0% E2 E3 Speaker E4 E5 gem Coda_h 60% 20% E1 Onset ambi Coda_b 80% Contact i-Zone Contact i-Zone 80% 100% gem Coda_h S1 S2 S3 S4 Speaker EPG results: loss of palatal contact 9 • Relatively simultaneous alveolar contact and loss of palatality in onset • Alveolar contact is delayed in coda (or missing) and loss of palatality occurs earlier 120 100 80 Lag (ms) 60 40 S E 20 0 -20 onset gem ambi coda_h coda_b -40 -60 context EPG results: timing 10 Coda = vocalised and darker 11 • Measurement of Tongue root retraction in [i] and in [l] for a single sample speaker S2 – Coping with terrible quality UTI • Find frames of maximum advancement and maximum retraction of root just above hyoid shadow) – Typical problems in measuring images UTI: Scottish pharyngealisation? 12 • Poor image quality • Time and location of root: top of hyoid shadow Example onset 13 • This is only a bit better than guessing, but impression is of slight pharyngealisation Example onset 14 • Tongue root retracts earlier in coda_b (p<0.01) – Max advancement appears to be near end of [i] vowel in onset condition and mid-way through [i] in coda_b condition – Max retraction apparently at end of [l] in onset condition and towards the end of [b] in coda_b condition • [i] is less advanced in coda_b than onset (p<0.005) – There is a n.s. trend for greater pharyngealisation in coda [l] retraction distance 200 5 150 4.5 mm ms retraction duration 100 4 50 3.5 0 3 onset coda_b Results (S2, n = 18) onset coda_b i l 15 • Darkness as measured by decrease in palatality in /i/ context shows onset/coda differences for only some speakers – Probably dialectal: Scots /l/ is less [i]-like in onset • “All” speakers show a strong timing difference – Front and back gestures dissociate in coda so that posterior gesture is earlier and alveolar (if present at all in coda) is later (“gestural dissociation”) • Qualitative (and quantitative) analysis of UTI data probably shows greater pharyngealisation for all speakers’ coda than onset. Conclusions 16 • Ultrasound/acoustic corpus – 15 teenagers (12-14) in friendship pairs (+4 11yrs) – Wordlist and some spontaneous discourse – Half from a WC and half from a MC school – Main purpose to test effect of use of UTI on vernacular speech variables • Secondary purpose – Derhoticisation of coda /r/ - pharyngealisation? – Vowel space • But sadly not much room for – Vocalisation of coda /l/ - pharyngealisation? ECB08 17 Hiya my name's Kaj McInally My company's FinesseDecor (Scotland) Ltd I'm not a manager. I'm a painter and decorator to trade, first and foremost who just so happened to start work for myself, and then we’ve been that... kinda... successful that we've had to take on people Derhoticised coda /r/ 18 • Since the 1970s coda /r/-“loss” has been reported in working class speech – Not the RP-like middle-class non-rhoticity • Stuart-Smith (2003) in a Glasgow corpus including 1415 year old children showed that WC girls have no overtly rhotic consonant for coda /r/ in approximately 90% of cases, boys in about 80% – Middle class children and older adults are rhotic, so the stratified derhoticisation is indicative of change in progress. – /r/ seems to be turning into a vowel right now – Strong impression of pharyngealisation offglide on vowels with monophthongal pharyngealisation on low back ones Losing /r/ in Scotland 19 F3 F2 rain, with an anterior approximant, usually described as being “retroflex” (note low F3) ferry, with a tap (an approximant is more common) Typically rhotic tokens of Scottish /r/ 20 F3 F2 F3 F2 Derhoticised ear (above) car (below) Rhotic ear (above) car (below) F3 F2 Word-final derhoticisation in ECB08 F3 F2 21 • Lexical sets BIRD WORD HERD merged (8/11) – Earth, verb, berth, (err) = third, word, surf, birth, fur – Could be a rhotic vowel /ɚ/ • No /a/ split (Pam/palm are homophones) • /ʉ/ is central and not very high i ʉ o iɹ ʉɹ oɹ e ı ɔ eɹ ɚ ɛ a ʌ ɑɹ Rhotic (MC) speakers ɔɹ 22 300 o i e u F1 (Hz) Dimension 1 ɪ ɔ ɛ ʌ a 1300 3300 MC Edinburgh F2 (Hz) Dimension 2 800 23 300 o i ɪ F1 (Hz) ɛ Dimension 1 ɔ u e ʌ a 1300 3300 F2 (Hz) Dimension 2 WC West Lothian 800 24 • Phonologically, only /ɛ I / are “lax” Articulation of vowels (EF4) 25 • Tipup (LM17 onset) or tipdown (LM15 onset) Sample ultrasound images of /r/ 26 Tongue blade raising [he] [ɹ] Tongue root retraction [ɹ] [he] [ə] Waterfall time sequence: hair 27 • More vowels (and environments) with weak /r/ – No merger of /ɛr/ and /ʌr/ (8/8) – /a/ split (hat/heart) [a] vs. [ɑ] for the most derhotic – /ʌr/ is short without compensatory lengthening – High vowels create diphthongs – Pre-pausal /r/ tends to devoice • Potential /ʌ/ merger (hut/hurt, bud/bird) i ʉ o iə ʉə oʌ e ı ɔ eə ʌ ɛˤ ɑː ɛ a ɔˤ (ʕ) Derhoticising (WC) speakers 28 Pre-pausal /r/ may have late (covert?) tip • Low vowels sound derhoticised, acoustically lack F2/F3 approximation, and are near-monophthongs. • Articulatorily a clear rhotic gesture was retained car 29 Covert rhoticity occurs even in weak syllables and in spontaneous speech 30 • What about /l/? – If dark, is it pharyngealised? – If vocalised, is it a pharyngeal? – How are derhoticised /r/ & vocalised /l/ kept apart? – Hip hum hut – Fur/fir hurt – Pill film – Mull bulb cult • Clear difference between /r/ and /l/ in open and closed syllables /l/ in a derhoticising (WC) speaker 31 – Red = // mull (cons) & bulb (vocalised) – Blue = /ı/ film (cons) & pill (vocalised) • Pharyngealisation vs. velarisation? UTI of laterals 32 – Red = cult (cons /lt/) – Green = hurt (cons /t/) • /l/ pharyngealised + velarised? • Pharyngealised postalveolar /r/ with saddle UTI of laterals 33 • Pharyngealisation and velarisation more extreme than in vowels /l/ compared to /o/ and /ɔ/ 34 • Onset/coda differences in /l/ in a high vowel context are well-known to involve loss of palatality and a greater pharyngeal constriction (Sproat and Fujimura 1993), plus subtle loss of alveolar contact (eg Giles & Moll) • Scottish speakers who have no onset/coda difference in palatality do show increased pharyngealisation in coda (and may show very strong vocalisation, not gestural undershoot) • Vocalised /l/ may be velarised while pharyngealisation occurs for consonantal /l/ Conclusions 35 • Derhoticisation often sounds like pharyngealisation • But in prepausal and other masking contexts there can be delayed covert post-alveolar constriction, due to “gestural dissociation” • WC /r/ seems to be changing from consonant into vowel, with some increase in vowel space • Meanwhile, MC rhotic speakers merge vowels • WC /l/ and /r/ seem to be keeping distinct – Is the pharyngealised /l/ also velarised? – Is the difference purely anterior? Conclusions 36 • Let’s look at pharyngealisation in a derhoticising speaker – Hut vs. hurt – Bud vs. bird – Far vs. fir AAA demo 37 Who says you need ultrasound? 38