Audibility & Speech Recognition Chapter 6 Perry C. Hanavan, Au.D. Speech Test Applications obtain information for counseling illustrate benefits of visual cues determine hearing aid candidacy – or determine candidacy for CI or ALDs predict hearing aid benefit determine when binaural aids might NOT be appropriate – 80%, AD and 20% AS determine amplification characteristics and features – whether high frequency beneficial demonstrate advantage of special hearing aid features demonstrate aided performance is better than undaided determine whether cognitive or APD exists – older person may have cognitive problems, anyone may have APD demonstrate that understanding of speech is impaired Audiologic Evaluation Information helpful to providing AR Audibility Dynamic Range Frequency Resolution Temporal Resolution Audiologic Evaluation Critical consideration in helping to plan AR: Time of Onset of loss Degree of Loss Etiology Type of Loss And wealth of other factors Audiologic Evaluation Observations Interviews Questionnaires Otoscopic examination Pure tone results Speech recognition Immittance OAE Electrophysiologic (ECochG, ABR, MLR, LR, etc.) Pure Tone Results Degree of loss – normal, minimal, mild, moderate, moderately severe, severe, profound Type of loss – conductive – sensorineural – mixed Tactile Only SNR Many Items Stimulus Familiarity Equivalent Lists Audition Plus Vision Stimulus Units Distance/ Intensity Learning Effects Stimulus Context Stimulus Mode Stimulus Elements WithinSubject Statistical Procedure Audition Only Audition Plus Tactile Vision Only Open Set Speech Test Factors Clinical Significance Auditory Skill Response Format Closed Set Paired t-statistic Stimulus Format Listener Variables Cognitive Ability Communication Mode Test-Restest Variability Linguistic Ability Hearing Loss Live Synthesized Speech Test Reliability Test Conditions Altered Speech Recorded Stimulus Mode Tactile Only auditory alone vision alone Audition Plus Vision tactile alone auditory and vision auditory and tactile tactile and vision auditory, vision and tactile Audition Only Audition Plus Tactile Vision Only Auditory Alone phoneme, syllable, word, phrase, sentence open, closed set high, low context cues quiet, noise – signal to noise ratio (SNR) recorded, live Auditory Plus Vision Speechreading enhancement – speechreading enhancement – speechreading enhancement ratio • vision only / auditory plus vision = SE – Children –Craig Sentence and Craig Words and CHIVE – Adults –Iowa Sentence Test and CUNY Sentences – Erber (vision and hearing assessment) Auditory Plus Visual Information Sumby and Pollack (1954) demonstrated that the addition of visual speech information could significantly improve speech perception performance and that the importance of visual speech information increased as the listening situation became more difficult. Auditory Plus Vision Tye-Murray – CHIVE (adult) – CAVET (children) – Audition plus vision – Vision only – Audition alone Eber – Sent-Ident Speechreading Enhancement/ Visual Enhancement Benefit from adding a visual signal to an auditory signal Refers to the benefit obtained from seeing and hearing a speaker compared with auditory alone – Difference or ratio between speech recognition performance in an vision-only condition and an audition plus-vision condition – Difference Formula (AV-V) – Normalized Score Formula (AV-V/100-V) Patient A (V score=50% AV score = 75%) – 75-50=25% enhancement (difference score) – (75-50)/(100-50)=50% (normalized difference score) Patient B (V score = 10% AV score = 55%) – 55-10=45% visual enhancement – (55-10)/(100-10)=50% (normalized difference score) Auditory Enhancement Benefit from adding an auditory signal to a visual only signal Difference Formula (AV-A) Normalized Score Formula (AV-A/100-A) Patient A (A score=50% AV score = 75%) – 75-50=25% enhancement (difference score) – (75-50)/(100-50)=50% (normalized difference score) Patient B (A score = 10% AV score = 55%) – 55-10=45% auditory enhancement – (55-10)/(100-10)=50% (normalized difference score) Integration Enhancement Measure of ability to integrate auditory and visual information AV-[100-(100-A)+(100-V)]/100-[100(100-V)+(100-A)] THE AUDITORY SANDWICH Visual cues: • lip-reading • printed word • cued speech • signs HEAR SEE - SAY HEAR-UNDERSTAND First, listen. Then, if need be, watch or say it. Then, listen again (no visual cues) Always put it back into hearing! Back to “choices” Auditory -Verbal Auditory Cued -Oral Speech AUDITORY Total Communication Language Communication Cognition ASL VISUAL Choices reframed Fully Auditory Mostly Auditory Mostly Visual Fully Visual Communicator Communicator Communicator Communicator A Av AV VA V Flexibility is essential Fully Auditory Mostly Auditory Mostly Visual Fully Visual Communicator Communicator Communicator Communicator A Av AV VA V Stimulus Elements SNR Units: phoneme, syllable, word, phrase, sentence, non-sense Distance/ stimuli Intensity Proximity: distant, near Intensity: soft, comfortable, loud Context: high, low context cues, SNR: quiet, background sounds-signal to noise ratio Familiarity: high, low familiarity with material, nonsense material Stimulus Familiarity Stimulus Units Stimulus Context Learning Effects Learning effect: familiarity with items, procedures…not change in ability – Repeated Frame Test – Cinderella-Brahman Speech Recognition Test Equivalent lists: lists that contain items that are presumed to be equally difficult to recognize – PB, sentences – Repeated Frame Sentences Test Numerous stimuli – Full lists rather than half lists Many Items Equivalent Lists Stimulus Items Live vs. Recorded – – – – – Voicing frequency Intonation Speech rate Clarity of articulation Physical characteristics Synthesized speech Altered speech – Time-compressed – Expanded – Filtered Live Synthesized Speech Altered Speech Recorded Significance Clinical significance When a small change in performance is clinically significant When comparison between two test results is clinically significant Statistical design Paired t-statistic Within subject statistical procedure WithinSubject Statistical Procedure Paired t-statistic Significant Difference for NU-6 recorded speech tests Listener Variables Degree of hearing loss Cognitive abilities – Thinking, reasoning, remembering, imagining, or learning words Linguistic abilities – Knowledge of language Communication mode used Multicultural Cognitive Ability Communication Mode Linguistic Ability Hearing Loss Test Reliability Test reliability: the degree to which a single test score approximates the true score Test-retest variability: measure of consistency from one test presentation to the next Test conditions: Variables affecting test-retest variability: – mode of presentation—live vs recorded – location—test booth vs classroom – talker—familiar vs unfamiliar, male vs female – number times item repeated—once, twice, etc. leads to better performance Test Reliability Test Conditions Response Format Closed or limited set Open set Auditory skill – – – – Detection Discrimination Identification Comprehension Open Set Auditory Skill Closed Set Acclimatization and Brain Reorganization Plasticity Physiological changes in the CNS (and PNS – auditory nerve) that occurs from sensory experiences – Brain’s ability to reorganize space – Benefit from HA, CI, HATs may need to be measured at later date – Brain may continue to acclimate for several years following HA, CI, HAT, therapy Making Connections A child is born with over 100 billion neurons or brain cells. These neurons form connections, called synapses, which make up the wiring of the brain. Brain Development EARLY EXPERIENCES At age eight months an infant may have 1,000 trillion synapses. By age 10 the number of synapses decrease to about 500 trillion. The final number of synapses is largely determined by a child's early experiences, which can increase or decrease the number of synapses by as much as 25 percent. Brain Development "USE IT OR LOSE IT!" The brain operates on a "use it or lose it" principle: only those connections and pathways that are frequently activated are retained. Other connections that are not consistently used will be pruned or discarded so the active connections can become stronger. Brain Development DEFINING LANGUAGE SKILLS When an infant is three months old, his brain can distinguish several hundred different spoken sounds. Over the next several months, his brain will organize itself more efficiently so that it only recognizes those sounds that are part of the language he regularly hears. During early childhood, the brain retains the ability to relearn sounds it has discarded, so young children typically learn new languages easily and without an accent. Dendrites Dendrites are thin, branching fibers lined with receptors at which the dendrite receives information from other neurons. The greater the surface area, the greater the amount of information. Some dendrites are covered with spines which greatly increase its surface area. Plasticity, Adaptation, Acclimatization Many researchers think that training techniques sometimes can help those with the reading disability, dyslexia, because they modify brain networks. The images above hint that this is the case. The top images show the brain activity (litup areas) of a 10 year-old boy while he completes a task that requires the ability to identify the sounds of words. His reading level equaled that of an eight-year-old child. The bottom images show his brain activity while he completes the same task after receiving eight weeks of a type of special training. Following the intervention training, his reading level increased by three years and the images indicate that his brain activity changed as well. Researchers are conducting a very large, ongoing study to confirm this one example. Auditory Acclimatization Back in the 1940s, wideband high-fidelity phonograph consoles were just becoming available. Because of his interest in high-quality audio, Harvey Fletcher bought one for his home. Harvey enjoyed listening to this new high-fidelity system, but unfortunately, the enjoyment was not shared by his wife. After listening to a an old 78 rpm record, with the surface noise made particularly prominent by the extended bandwidth of this new high-fidelity system, she said: "That sounds awful. I don't really like having that screechy sound in my home." Always the creative thinker, the next day, when his wife was out of the house, Harvey went into the living room and soldered twenty 1 uF capacitors across the loudspeaker terminals, rolling off the high frequencies. (Remember that amplifiers were high impedance back then, so the trick worked.) That evening, when the music played, his wife was now happy. One night each week, while his wife was sleeping, Harvey would sneak downstairs and clip one capacitor. After twenty weeks, when the music played, they were both happy. Acclimatization Adapting to a new environment (in this case, auditory) or as defined by Darwin, the process of inuring to a new climate, or the state of being so inured. This seems to be a reasonable term, as it also is used to describe how the human body acclimates to temperature, altitude, and other environmental conditions. From an auditory standpoint, Gatehouse was one of the first to use the term acclimatization, explaining the speech processing capabilities of a group of people aided monaurally. In later research Gatehouse used the term acclimatization to describe an improvement in speech recognition over time. Today, the term is used widely to explain adaptation to hearing aid use in general, and is not limited to the Gatehouse definition. Adaptation The process of adapting to something, such as environmental conditions (in this case, auditory); the responsive adjustment of a sense organ. This too is a reasonable term, as it has long been used in reference to the eye—e.g., adaptation to varying light conditions. Sensory Reorganization When nerve stimulation changes, as with amputation, the brain reorganizes. In one theory, signals from a finger and thumb of an uninjured person travel independently to separate regions in the brain's thalamus (left). After amputation, however, neurons that formerly responded to signals from the finger respond to signals from the thumb (right). Auditory Reorganization/Plasticity Cochlear dead regions Brain reorganization will occur with damage to regions of the cochlea Speech Recognition Tests WIPI Word Intelligibility by Picture Identification (WIPI) closed-set picture-pointing (six pictures per plate) appropriate for children whose language age is between 5 and 10-11 comprised of four 25monosyllabic word lists contains 26 color plates (one for practice), six pictures per page. (A, V, A-V) NU-CHIPS NORTHWESTERN UNIVERSITY-CHILDREN’S PERCEPTION OF SPEECH (NUCHIPS) closed set picture pointing word recognition test children whose language age is as low as three 50 words familiar to three-year-old children in four randomizations includes two picture books with 50 monochrome plates, four pictures per plate. Book A is used for forms A & B, book B for C & D. There are two recordings, one male and one female talker. MAC The Minimal Auditory Capabilities (MAC) (2nd Edition) battery specifically targeted to CI patient Consists of a series of tasks which are graded in difficulty Most of the MAC battery sub-tests assume patient’s hearing loss has occurred post-lingual, but can be employed as a means of evaluating the hearing abilities of persons for whom traditional speech materials are too difficult Second edition has been standardized. The recorded materials include gross sound identification, inflection detection, contrast detection, accent discrimination, and word identification 14 sub-tests, 13 audio and one video SERT SOUND EFFECTS RECOGNITION TEST (SERT) developed for those instances where conventional word recognition measures are not appropriate, such as when language limitations due to hearing impairment Certain children who are unable to recognize even simple speech can perceive correctly environmental sounds to which they are exposed in their daily lives Under these circumstances, the SERT can provide valuable information about the integrity of the auditory system. Closed set, picture-pointing tasks 10 sounds plus a practice sound CID Every Day Sentences EVERYDAY SPEECH 10 sets of 10 sentences each with 50 "target" words in each set for word recognition assessment under contextual conditions Can be employed in auditory training Sentences vary in length and are spoken with minimal inflection Normative data on recording lacking BKB Sentences Bench, Koval, & Bamford (BKB) Open set sentences Appropriate for linguistic abilities of most 8-15 years of age with hearing loss Lists of 16 simple sentences, including 50 key words were devised to include vocabulary, grammar and sentence length for 8-15 year olds The sentences are presented in an open-set format and the child imitates as much of the sentence as possible. Responses are recorded word-for-word and scored by percent of key words correctly repeated. NU-6 NORTHWESTERN UNIVERSITY AUDITORY TEST NUMBER SIX (NU-6) Phonetically balanced CNC monosyllabic open set word recognition test Four lists of 50 words each recorded in four randomizations Talker has a General American dialect Standardized Auditory Numbers Test ANT Test helps identify tactile from auditory listeners Word closed set recognition auditory alone test Appropriate for 3-8 year old children with severe to profound hearing loss Simple auditory alone test to measure ability to perceive simple auditory cues – (Erber, 1980) Iowa Consonant Confusion Test Closed set consonant (phoneme) recognition test test can also be analyzed in terms of the listener's ability to identify phonetic features: – Chance performance for consonant voicing, manner, and place of articulation identification is 50%, 33%, and 20% respectively – Example: Mr. S achieved a total score of 79% correct, 96% on voicing, 94% on manner, and 85% on place 10 consonants presented 12 times in VCV context – – – – p, t, k, b, d, g, v, z, n, m Presentation examples: aba, ada, aga, etc Each consonant presented 12 times in random order Presented with carrier phrase “The next word is” • (Tyler et al, 1983) S t i m i l u s p t k b d g v z n m p 0 0 0 1 1 1 1 1 0 0 t 2 3 1 1 0 0 2 2 1 0 k 6 2 3 2 3 5 1 0 0 0 Response b d g v 2 1 0 0 1 1 0 1 0 0 3 1 2 0 3 1 2 0 0 1 0 2 0 0 0 3 0 2 2 0 0 3 0 0 0 0 0 0 0 0 z 0 4 2 0 2 0 1 0 0 0 n 0 0 1 1 3 0 2 2 4 4 m 1 0 1 1 0 4 0 2 7 8 Example of Iowa Consonant Confusion Test (example: auditory alone) Children’s Auditory Test CAT Auditory alone limited set test assessing ability to perceive stress patterns and word recognition Consists of 12 words – Monosyllabic words – Trochees – Spondees IMSPAC Imitative tests of Speech Pattern Contrast Perception Developed by Arthur Boothroyd Syllable level, 4 lists randomized Choose odd one of 3 (forced choice) Pointing, button-press or verbal response Age 7 years and up AB Short Word List Isophonemic Word List Developed in 1968 by Arthur Boothroyd Speech recognition open set test Each list consists of ten words, and each word is constructed as consonant - vowel – consonant 30 phonemes, 10 vowels and 20 consonants present in each list CNC words Score is based on the phonemes correct out of 30 Larsen Recorded Test Auditory word discrimination test Pairs of phonemes in words – few vs. chew – bill vs. mill – nice vs. vice Lists represent an attempt to present the phone in the initial, medial and final position of a word Limited choice—select one of two words by drawing line through printed word heard CAVET Children’s Audiovisual Enhancement Test Assesses speechreading enhancement in children within the vocabulary level of 7-9 year olds with profound prelingual hearing loss Designed to minimize ceiling and floor effects, eliminate syntactic factors, and minimize semantic factors 3 lists of 20 words each with half of words easy to recognize in a vision-only condition and half are less likely to be recognized in each list but presented in random order Each list is designated for auditory alone, visual alone, or auditory-visual only mode Test available in CD-ROM and VHS format – (Tye-Murray & Geers, 2002) CUNY Sentences Test The City University of New York (CUNY) Topic Related Sentence Sets consist of 25 equivalent sets of 12 sentences of varying length Each sentence is related to one of 12 topics Open set sentence recognition test Listener is presented with sentences in three listening conditions: A, V and AV The test is scored with the total number of words correctly identified (Boothroyd, Hannin, & Hnath, 1985) Lexical Neighborhood Test (LNT) Lexical neighbors--words that sound similar to a target item. Often defined as words that differ by a single phoneme from target word Open-set test that requires child to imitate stimulus words immediately after they are presented Each list consists of 50 monosyllabic words, 25 of which are high-frequency words with few lexical neighbors (easy) and 25 of which are lower frequency words with many lexical neighbors (hard) Words were selected to be familiar to children with limited vocabularies. – Easy Words: juice, good, drive, time, hard, gray, foot, orange, count – Hard Words: thumb, pie, wet, fight, toe, cut, pink, hi, song, fun, use, mine, Alternate version of test, the Multisyllabic Lexical Neighborhood Test (MLNT) consists of 50 words with two to three syllables. – (Kirk, Pisoni, & Osberger, 1995) QuickSIN Provides a one-minute estimate of SNR loss for adults A quick method to quantify patient’s ability to hear in noise Can determine if extended high frequency emphasis improves or degrades understanding of speech in noise Assist in choosing appropriate amplification and/or other hearing assistance technologies Demonstrates directional microphones may improve speech intelligibility in noise Open set test recognition test Consists of list of six sentences with five key words per sentence presented in four-talker babble noise. Sentences are presented at pre-recorded signal-to-noise ratios which decrease in 5dB steps from 25 (very easy) to 0 (extremely difficult). The SNR's used are 25, 20, 15, 10, 5, 0, encompassing normal to severely impaired performance in noise. BKB - SIN Similar to Quick-SIN but can be utilize with children Sentence recognition using BKB sentence material Open set sentence recognition test BKB SIN test Test score sheets HINT The HINT is a prerecorded test that measures sentence speech recognition abilities in quiet or in noise accurately, reliably and efficiently The HINT demonstrated the substantial role that binaural, directional hearing plays in a normal hearing individual's ability to communicate in noise Any degree of hearing impairment, therefore, reduces the benefits of directional hearing in noise and increases communication inadequacy Used to evaluate functional hearing capabilities of HI users, CI users and those in listening-critical jobs Copyrighted by the House Institute and is available for purchase on a compact disc recording to hearing clinics and research laboratories nationwide. A children's version of the HINT is also available DICHOTIC SENTENCE IDENTIFICATION (DSI) Developed in an attempt to conceive a dichotic listening task that would be only minimally affected by peripheral hearing loss Sentences selected from the Synthetic Sentence Identification (SSI) test presented dichotically Onsets and offsets of the sentences are aligned with an accuracy of 100 microseconds. Developers claim DSI less susceptible to hearing loss than SSW test Viable test of central auditory function with hearing loss Test applicable for auditory assessment of impaired ears through pure tone averages (PTAs) up to 50 dB 2 sets of 30 pairs of sentences (closed set identification test) Normative data are available Ling-6 Phoneme level detection and recognition test m, s, sh, e, a, u Procedure Detection and identification of phonemes Further Auditory Evaluations Mark C. Flynn—Evaluation of Individuals with hearing loss Assignment If you accept the challenge, assign each test discussed in this section on the following grid, thus, Be able to correctly indicate on the following template, each of the auditory tests discussed earlier Speech Paradigm Detection Non-Speech Sounds Phonemes Syllables Suprasegmentals Words Phrases Sentences Connected Speech Discrimination Recognition Comprehension