Hearing & Deafness (4) Pitch Perception 1. Pitch of pure tones 2. Pitch of complex tones Pitch of pure tones Place theory Place of maximum in basilar membrane excitation (excitation pattern) - which fibers excited Timing theory Temporal pattern of firing how are the fibers firing needs phase locking Phase Locking of Inner Hair Cells Auditory nerve connected to inner hair cell tends to fire at the same phase of the stimulating waveform. Phase-locking 1 0.5 0 -0.5 0 0.2 0.4 0.6 0.8 1 -1 Response to Low Frequency tones Inter-spike Intervals 2 periods 1 period time (t) Response to High Frequency tones > 5kHz Random intervals time (t) nerve spike Pure tones: place vs timing Low frequency tones Place & timing High frequency tones Place only 1. Phase locking only for tones below 4 kHz 2. Frequency difference threshold increases rapidly above 4 kHz. 3. Musical pitch becomes absent above 4-5 kHz (top of piano) Frequency thresholds increase above 4 kHz BCJ Moore (1973) JASA. Pitch of complex tones: fundamental & harmonics 4 pressure 3 2 1 0 1 -1 -2 amplitude Fundamental =200 Hz Harmonic spacing = 200 Hz 1.0 -3 -4 time (t) Period = 1/200 s = 5ms 200 400 600 800 frequency (Hz) Helmholtz’s place theory amplitude Fundamental =200 Hz Harmonic spacing = 200 Hz 1.0 high Peaks in excitation 200 400 600 800 frequency (Hz) Pitch = frequency of fundamental Coded by place of excitation low Arguments against Helmholtz 1. Fundamental not necessary for pitch (Seebeck) Missing fundamental Harmonic spacing = 200 Hz 3 pressure 1 0 1 -1 -2 -3 amplitude 1.0 2 time (t) 200 400 600 800 frequency (Hz) Period = 1/200 s = 5ms No fundamental but you still hear the pitch at 200 Hz Track 37 Same timbre - different pitch Distortion: Helmholtz fights back Sound stimulus Sound going into cochlea 1.0 amplitude Middle-ear distortion 200 400 600 800 frequency (Hz) Produces f2 - f1 600 - 400 amplitude Harmonic spacing = 200 Hz Fundamental =200 Hz Harmonic spacing = 200 Hz 1.0 200 400 600 800 frequency (Hz) Against Helmholtz: Masking the fundamental Unmasked complex still has a pitch of 200 Hz amplitude Fundamental =200 Hz Harmonic spacing = 200 Hz 1.0 Tracks 40-42 200 400 600 800 frequency (Hz) amadeus Against Helmholtz: Enharmonic sounds Middle-ear distortion gives difference tone (1050 - 850 = 200) 200 amp BUT Pitch heard is actually about 210 Tracks 38-39 200 850 1050 1250 Schouten’s theory Excitation pattern of complex tone on bm unresolved 1600 resolved 800 600 400 25.0 b m vibration 200 20.0 15.0 10.0 5.0 base Pitch due to beats of unresolved harmonics log (ish) frequency Output of 1600 Hz filter -5.0 1/200s = 5ms 1 0.8 1.5 0.6 1 0.4 0.5 0.2 0 -1 0.0 Output of 200 Hz filter 1/200s = 5ms 2 -0.5 apex 0 0 0.2 0.4 0.6 0.8 1 -0.2 0 -0.4 -0.6 -1.5 -0.8 -2 -1 0.2 0.4 0.6 0.8 1 Tracks 43-45 Problems with Schouten’s theory (1) 1. Resolved harmonics dominant in pitch perception not unresolved (Plomp) “down” “down” 1.0 200 400 600 800 frequency (Hz) 2000 2400 Problems with Schouten (2) 1. Musical pitch is weak for complex sounds consisting only of unresolved harmonics 2. Pitch difference harder to hear for unresolved than resolved complexes Against Schouten (3): Dichotic harmonics • Pitch of complex tone still heard with one harmonic to each ear (Houtsma & Goldstein, 1972, JASA) 200 Hz pitch 400 600 No chance of distortion tones or physical beats Goldstein’s theory • Pitch based on resolved harmonics • Brain estimates frequencies of resolved harmonics (eg 402 597 806) - could be by a place mechanism, but more likely through phaselocked timing information near appropriate place. • Then finds the best-fitting consecutive harmonic series to those numbers (eg 401 602 804) -> pitch of 200.5 Two pitch mechanisms ? • Goldstein has difficulty with the fact that unresolved harmonics have a pitch at all. • So: Goldstein’s mechanism could be good as the main pitch mechanism… • …with Schouten’s being a separate (weaker) mechanism for unresolved harmonics Schouten’s + Goldstein's theories unresolved 1600 resolved 800 600 400 25.0 200 20.0 15.0 10.0 5.0 base log (ish) frequency Output of 1600 Hz filter 0.4 0.5 -1.5 -2 1/200s = 5ms 0.8 0.6 1 -1 -5.0 1 1.5 0 -0.5 0 0.0 Output of 200 Hz filter 1/200s = 5ms 2 apex 0.2 0.2 0.4 0.6 0.8 1 0 -0.2 0 -0.4 -0.6 -0.8 -1 0.2 0.4 0.6 0.8 1 JCR Licklider Autocorrelator Meddis, R. & O'Mard, L. 1997 A unitary model of pitch perception. J. Acoust. Soc. Am. 102, 18111820. input Summary autocorrelogram Some other sounds that give pitch • SAM Noise: envelope timing - no spectral – Sinusoidally amplitude modulated noise • Rippled noise - envelope timing - spectral – Comb-filter (f(t) + f(t-T)) -> sinusoidal spectrum (high pass to remove resolved spectral structure) – Huygens @ the steps from a fountain – Quetzal @ Chichen Itza • Binaural interactions Huygen’s repetition pitch Christian Huygens in 1693 noted that the noise produced by a fountain at the chateau of Chantilly de la Cour was reflected by a stone staircase in such a way that it produced a musical tone. He correctly deduced that this was due to the successively longer time intervals taken for the reflections from each step to reach the listener's ear. Effect of SNHL • Wider bandwidths, so fewer resolved harmonics • Therefore more reliance on Schouten's mechanism - less musical pitch? Problem we haven’t addressed • What happens when you have two simultaneous pitches - as with two voices or two instruments - or just two notes on a piano? • How do you know which harmonic is from which pitch? Bach: Musical Offering (strings) Harmonic Sieve • Only consider frequencies that are close enough to harmonic. Useful as front-end to a Goldstein-type model of pitch perception. Duifhuis, Willems & Sluyter JASA (1982). blocked 200 Hz sieve spacing 0 400 800 1200 frequency (Hz) 1600 2000 2400 Mistuned harmonic’s contribution to pitch declines as Gaussian function of mistuning • Experimental evidence from mistuning expts: 1.5 ∆F = a - k ∆f exp(-∆f 1 Moore, Glasberg & Peters JASA (1985). (Hz) ∆F 0 800 2 /2s ) 0 -0.5 s = 19.9 k = 0.073 -1 400 2 0.5 Match low pitch 0 o 1200 1600 frequency (Hz) 2000 2400 -1.5 540 560 580 600 620 640 660 680 Frequency of mistuned harmonic f (Hz) Darwin (1992). In M. E. H. Schouten (Ed). The auditory processing of speech: from sounds to words Berlin: Mouton de Gruyter Is “harmonic sieve” necessary with autocorrelation models? • Autocorrelation could in principle explain mistuning effect – mistuned harmonic initially shifts autocorrelation peak – then produces its own peak • But the numbers do not work out. – Meddis & Hewitt model is too tolerant of mistuning. One pitch mechanism or two? experimental evidence • Ease of comparing two pitches: – both resolved (best) - Goldstein – both unresolved - Schouten – one resolved, one unresolved (worst) • So easier within than between mechanisms • Carlyon, R. P. and Shackleton, T. M. (1994). J. Acoust. Soc. Am. 95, 3541-3554. • But see later paper for alternative explanation: • Gockel, H., Carlyon, R. P. and Plack, C. J. (2004). J. Acoust. Soc. Am. 116, 1092-104. One pitch mechanism or two? Efficiency • Autocorrelation always detects evidence for periodicity at F0 in each frequency channel. • Goldstein uses evidence of periodicity at the harmonic frequency. May be more efficient. General problems • How to do deal with multiple sounds • What is the physiological instantiation of the long delays (c. 30 ms) necessary for autocorrelation or for Schouten?