Lecture 5: The Psychophysics of Sound: 3 Slide 1 Recap Two theories of Pitch Perception Place Theory Temporal Theory Place Theory Pattern of activation on basilar membrane disassembled into components Pattern of harmonics ‘predicts’ a lower note - the fundamental of the series Requires at least 2 resolvable harmonics to work Temporal Theory Pattern of activation on basilar membrane disassembled into components Pattern of periodic information intrinsic to wave form coincides at period of fundamental frequency Uses the unresolved harmonics Pattern of AM in beats of unresolved harmonics – phase locked Phase locking drops away > 5KHz Mixed models Combination of Place and Temporal Information Temporal Info refines initial rough analysis by place mechanism? Fits with fineness of JND over Critical Bandwidth CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 2 Pitch Perception Combination or DifferenceTones H&A 5.5.1 JND – The Difference Limen H&A 3.3.2 Loudness Perception Calculating dB Measurement & Range of Loudness H&A 2.4 Measurement of Subjective Loudness H&A 2.4 SPL, SL, Phons & Sons Loudness and Critical Bandwidth JND for loudness Intensity, Frequency & Duration Hearing Damage & Loss H&A 2.5 Tinnitus Ghost Harmonics Adaptation CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 3 Combination or Difference Tones A Combination Tone is a tone that is heard when two tones are sounded together, but is not present when the tones are presented separately. Several Combination/Difference tones are produced for two frequencies. where f2 > f1 Simple Difference Tone at f2 – f1 Cubic Difference Tone at f1 – K(f2 – f1 ) or K(f1) – K–1(f2) If the two tones are adjacent harmonics of a lower fundamental then the combination tone is that fundamental and the other difference tones are the harmonic series. CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 4 Combination or Difference Tones One theory is that Combination tones arise from interference between the two primary waves in the Basilar Membrane. The effect is dependent on the intensity of the sounds and is quite fragile. The combination tone can be suppressed by a continuous tone just below f1 and f2, or by masking with noise. The phenomena of the Difference Tones is used in making instruments, e.g. organs. If we want a low pitch (v. long pipe) if can use two shorter pipes in the ratio 2:3, this produces the lower frequency that is in the relationship 1:2:3, e.g. 55 Hz can be produced by playing two pipes of 110 & 165 Hz. CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 5 Just Noticeable Difference for Pitch Pitch, & loudness & duration (also other senses: colour, shape, size, texture, smell) are continuous, however we perceive the world discretely i.e. we split the continuum into points or chunks. The granularity of this discrimination is called a Difference Limen or Just Noticeable Difference (JND) – it is the difference there must be between two signals before we can recognise them as different. If two pure tones (of the same intensity) are alternated and the difference between them is slowly reduced, we get to a point where they are indistinguishable. f Or conversely if two pitches are gradually separated when the difference between them exceeds the JND, we hear two pitches instead of one JND is much finer than Critical Bandwidth JND is approx. 1/27th of a CB – Zwicker(1970) JND is much finer than pitches used in musical scales CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 6 Measuring Frequency JND There are 2 measures of frequency JND DLF 2 successive steady different tones – 2nd higher DLF is where 75% of correct discrimination into two tones DLF is smallest for low frequencies – larger for higher. FMDL – frequency modulation of a continuous tone – amount of modulation required for subject to recognise variation. FMDL smallest for middle frequencies larger for higher & lower Also, Frequency JND is blurred. There are two thresholds in the differences between pitches Difference below which change is never detected. This is called the Neural Quantum – NQ Difference above which change is always detected, This is (approx) NQ x 2 JND varies from person to person and is affected by musical training, It is also dependent on the frequency, intensity & duration of pitches CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 7 Variation in JND by frequency 1000 Hz ERB 100 Hz JND ^ ^ ^ B A N D W I D T H 10 Hz 1 Hz 100 Hz 1000 Hz 10000 Hz Frequency >>>>> Plot of JND for pure tones from 100 – 10,000 Hz at 80 dB < 1kHz, JND 2-4 Hz (at 30Hz – 10% = major 2nd) 1 kHz JND 0.30% (0.05 semitones, 5 cents) > 1kHz JND relatively constant 0.25% of CB >10 kHz discrimination of pitches deteriorates rapidly. CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 8 Variations in JND JND Varies with intensity Louder Pitches are easier to discriminate Louder = finer JND JND Varies with duration Longer Duration Pitches are easier to discriminate longer = finer JND (von Bekesy 1960), 2 Subjects; 800 Hz pure tone at 40 dB SL f/f (%) 1.0 0.9 0.6 0.4 0 0.1 0.2 0.3 0.4 ( secs) 0.2 Demonstration ASA Demo 17 0.0 CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 9 JND for Pitch You hear 10 groups of 4 pairs of tones (a,b,c,d) each tone is 500 ms long and each pair is separated by 250ms The base frequency f is 1000 Hz Tone pairs are either rising or falling f f + or f + f starts at 10 Hz and drops by 1 Hz in each subsequent group Group a b c d 1 2 3 4 5 6 7 8 9 10 Instructions: As you listen to the pairs of pitches write down in the box whether or not you hear the pitch rise or fall (R,F) or you cannot tell (C) or you think they are the same (S) CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 10 Loudness The loudness of sounds is measured in terms of pressure. There are various measures – e.g. Pascals = Newtons per m2 (1 Newton is force needed to accelerate 1 kg to 1 Meter per sec) This measures pressure or intensity - in sound called amplitude. Intensity = Pressure2 The range of intensity of sounds is very large and so amplitude is measured on a logarithmic scale. The most commonly used scale is the deciBel Scale (dB) Named after Alexander Graham Bell. dB SPL (SPL means Sound Pressure Level) dB SPL is the difference between a reference and a measured sound. The reference of 0 dB SPL = 20 micro pascals (20 Pa) It is the average threshold for a child’s hearing at 1 kHz CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 11 Calculating dB The difference in Decibels is calculated as 10 the log of the ratio of intensity of the measured (m) and reference (r) sound bel = log(Pm/Pr) dB = 10 log (Pm/Pr)2 dB = 20 log (Pm/Pr) (log x2 = 2 * log x) The pressure ratio of 10 to 1 is 20 decibels Pressure Intensity 0 dB 20 dB = 10 100 20 dB 40 dB = 10 100 40 dB 60 dB = 10 100 0120 dB is 6 increments of 20, i.e. 106, = 1 to 1,000,000 Normal hearing is 20 100 dB = 104 = 1 to 10,000 0dB = 120dB = threshold of hearing threshold of pain CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 12 Loudness Perception The threshold of hearing varies with frequency. Hearing is most sensitive in the frequency range - 1 to 4.5 kHz Outside this range higher sounds are easier to hear than lower. Two ears are more sensitive than one in the higher frequencies CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 13 dB SL A second measure - dB SL – in which SL = Sound Level. This is the sound level above the threshold of hearing at a frequency e.g. at 100 Hz the threshold of hearing is 32 dB SPL, therefore 10 dB SL equals 42 dB SPL. dB SL tries to calibrate the subjective level of sound intensity – by measuring relative to the threshold. But it is difficult to use, because you need to know what the threshold of sound is at a particular frequency. A tone of 40 dB SPL is audible at 1 kHz but is inaudible at 50 Hz Both SPL and SL are unsatisfactory because they are relative to the sensation of loudness. CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 14 Subjective Loudness – The Phon Scale One ‘objective’ scale of subjective loudness is the Phon Scale People judge loudness relative to a reference tone The reference tone is at 1 kHz at 0 dB SPL The intensity of other frequencies are varied until they sound as loud. (Fletcher & Munson, 1933) The measurements define contours of equal-loudness Subjective Loudness rises more slowly for frequencies below 500 Hz – (C4) – i.e. you need more energy to hear This is important for music reproduction systems. Because lower frequencies need more energy to be heard it follows that as volume is decreased that relatively speaking bass levels need to be boosted to remain at the same subjective level. e.g. – if there is a balance across frequencies at 80 dB SPL – if the sound level drops to 40 dB SPL frequencies < 100 Hz will be inaudible those at 1 kHz are still audible. The answer: reduce the intensity of lower less than higher frequencies. CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 15 Subjective Loudness – The Son Scale Another scale for subjective loudness is the Son scale (Stevens 1936,1955). Again this uses a reference tone – 1 kHz at 4 dB SPL = 1 Sone. The sound was changed in intensity and listeners were asked to judge how much louder the new sound was – e.g. 2, 1/3rd etc. The results were averaged and plotted as follows….. Stevens derived a Power Law for the relationship between the change in power and perceived loudness L = kI 0.3 Where k is a constant, and I is the Intensity of the sound. Steven’s Power Law gives (approx.) a two fold increase in perceived loudness for a 10 dB SPL change in pressure, 1/2 intensity means approx 0.81 of subjective loudness 1/3rd Intensity means approx 0.72 of subjective loudness CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 16 Perceived Loudness & Critical Bands Complex tones comprise pure tones components. The Critical Bandwidth contributes to what is heard. The general principles are: If a sound has components whose energy is constant and the components lie within one CB then it sounds softer than if the components lie in more than one CB. (Zwicker, Flottorn & Stevens 1957) Loudness is proportional to the total intensity of the sound in a CB If the intensity is constant then the perceived loudness is independent of the number of components and their bandwidths within the CB i.e. the sound is judged to be as loud as a pure tone of the same intensity at the Centre Frequency of the CB. If the bandwidth is > than a CB then the sound is louder (measured in phons). This is generally the case for noise and complex tones) e.g. loudness in phones for 1 kHz sounds of various bandwidths CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 17 Perceived Loudness & Critical Bands This can be understood using Steven’s Power Law Say we have a sound of Intensity 1 in one CB the loudness = 1 If a sound is distributed over 2 CB’s, each CB receives approximately 1/2 the energy intensity. 1/2 the intensity 80% of loudness. 2 x 0.81 = 1.62 rather than 1 or over 3 CB’s – 1/3rd intensity = 0.72% loudness 3 x 0.72 = 2.16 rather than 1 The physiological reason for this is thought to be that neuron firing rates at moderate to high intensities change relatively slowly as a function of intensity (they are saturating). So when all the energy is in one CB it is integrated and its net effect is decreased. Halving the energy delivered to a CB is offset by the increased mass of neuronal activity across multiple CBs. Overall - perceived loudness depends on the bandwidth of the sound CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 18 JND for loudness Our perception of changes in energy levels is measured in 2 ways Amplitude Modulation – when change is noticed = JND Ordered Pulses – L-S, S-L etc – judgement of order = JND Wideband Noise JND is a constant ratio of the difference in intensity and intensity I/I = constant This is Weber’s Fraction – the smallest detectable increment is proportional to the magnitude of amplitude. JND is 0.5 – 1 dB from 20 – 100 dB. Slightly higher when < 20 dB Pure Tones Weber’s Law is approximated JND for loudness varies slightly at different frequencies Discrimination is slightly better at higher energy levels, i.e. JND is finer with higher dB. CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 19 Frequency and Duration JND for Intensity becomes coarser as duration of pitches shortens L/L 12 10 500 Hz, 40 dB 1000 Hz, 40 8 6 500 Hz, 70 dB 0 250 500 750 duration ms Also, the perceived intensity of a sound is affected by its duration. 4 > 500 ms (1/2 sec) loudness appears constant < 2200 ms decreasing perception of loudness as length drops. This implies that energy is measured over time and if the time 0 period during which the sound occurs is less than the period used by the nervous system to ‘integrate’ the energy of an ongoing stimulus then it sounds softer. Other points Increase in dB = more nerve firing Phase locking increases with energy CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 20 Hearing Damage and Loss Hearing is permanently damaged by listening to sounds that are too loud and/or by listening to them for too long. It occurs most easily in the range that we are most sensitive to –1 to 4.5 kHz. Centred around 4 kHz Prolonged exposure to sounds of greater than 90 dB SPL leads to permanent damage. The damage potential of a sound level is calculated in terms of a ‘consolidated dose’ (borrowed from measuring exposure to radiation). The measurement is relative to 90 dB SPL for 8 hours. This is equivalent to 93 dB SPL for 4 hours, 96 dB SPL for 2 hours, etc…. Hearing loss takes two forms Loss of sensitivity Loss of acuity (accuracy) Loss of sensitivity Due to mechanical damage to hair cell – flattening Loss of acuity Probably due to an increase in Critical Bandwidth caused by “The Evoked Cochlear Mechanical Response” (Kemp 1978). This is feedback from the hair cells to the Basilar Membrane that amplifies the wave in the Cochlea – a self sustaining loop. If CB is larger then separation of components is cruder, hence degradation of ability to analyse and discriminate sounds. Implies Place mechanism CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 21 Hearing Loss The degrees of hearing loss induced by a series of intensities over time, measured in dB – i.e. the amount of additional energy required for sounds to be audible Reduction in Sensitivity 106 dB dB 40 100 dB 94 dB 30 88 dB 20 1 10 100 mins 10 0 CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 22 Tinnitus Tinnitus is where the ear spontaneously produces noise – hisses, pure tones, clicks etc. It is associated with hearing loss It is triggered by loud sounds – which also damage hearing People with hearing loss suffer from it People with it are more susceptible to hearing loss Tinnitus seems to be the result of combinations of two or three components 1) Activity in the Cochlea 2) Increase Auditory Nerve activity 3) Higher centres (brain stem and cortex) These three components operate differently in different people – including people who are deaf! Activity in the Cochlea Used to be thought cause was middle ear muscle contractions. Now it is thought to be due to “The Evoked Cochlear Mechanical Response”. In Tinnitus it occurs in the absence of a sound – i.e. no wave. This may be prominent in Basilar Membrane areas surrounding places for frequencies that have suffered damage CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 23 Tinnitus Tinnitus is not just an increase in Auditory Nerve activity Some hearing deficits decrease nerve activity This implies some contribution by higher centres (brain stem & cortex) If the auditory nerve is cut in some cases it does not reduce Tinnitus In some cases if there is increased stimulation of Auditory nerve then Tinnitus is reduced. CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 24 Ghost Harmonics When a pure tone is sounded (loudly) people can hear harmonics, as if the pure tone was complex tone – these overtones are produced by the auditory system. How much we hear them depends on how loud the pure tone is The louder the tone is the more harmonics we hear and the louder each harmonic sounds. The effect is that the pure tone sounds as if it is the fundamental of a complex tone. E n e r g y 3 9 Harmonics This may be related to Difference Tones CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 25 Hearing - Adaptation and Fatigue Our hearing changes as we listen to something over time. Two closely related phenomena are: Adaptation – the subjective magnitude of a stimulus decreases – it seems softer – most adaptation within a couple of minutes affects both loud and soft sounds, some studies indicate that adaptation is greatest for higher frequencies Fatigue – the energy needed to produce nerve firing is increased, i.e. the threshold to register a stimulus is raised. ‘Temporary Threshold Shift’ (TTS) increases markedly above 90100 dB SPL – also the point above which permanent damage occurs Recovery time can be as long as 16 hours (or never) TTS is difficult to measure – it seems to be affected by the intensity of the tone, its duration, its frequency, the frequency of the test stimulus and how long after the test tone occurs At low intensities only close frequencies are affected. At high intensities spread can be as large as ½ an octave e.g. levels above fatiguing tone for I kHz tone – greater for frequencies higher than fatiguing tone CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 26 Adaptation and Fatigue TTS dB 7 1400 Hz 6 5 1000 Hz 4 3 20 40 60 80 100 dB 2 Fatigue is highest 4-6 kHz – region of greatest hearing damage 1 0 CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 27 Adaptation The nervous system is designed to deal with change. In hearing auditory nerve neurones respond to the onset of stimuli and then their response declines rapidly over 2-5ms and more gradually until 40-50 ms. The drop is stronger for higher dB Onset and rapid adaptation tracks intensity. This is reflected in the ratio of max. firing to start firing. -5dB flat response: no initial high peak 47 dB variable response: high onset rapid decline For natural sounds onset is often gradual -up to 40 msecs. Thus, the net effect of this is that when a neurone fires it is less sensitive for a period of time – it becomes adapted, and is less able to respond to renewed stimulation. CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick Lecture 5: The Psychophysics of Sound: 3 Slide 28 Adaptation Adaptation is important when two tones are very close or identical in frequency and are simultaneous or follow each other rapidly. If we have two tones Background tone is presented Test tone is presented This causes Adaptation Firing rate drops - Background adapts - firing decreases Response to the test tone suppressed There is asymmetry in how long the adaptation takes to build up and dissipate. 45 msecs to drop by 63% of initial firing rate 90 msecs to recover from 63% reduction The implication is that in an ongoing musical or sonic universe firing rates are being continually modified by the effect of other preceding and simultaneous events adapting neuron activity. This means that over short time scales onset timing and stimulus intensity affect the way that we can distinguish different tones: Adaptation is also thought to contribute to Masking and interact with Critical Bandwidth e.g. for figure and ground, a slight dislocation of onsets is enough to separate sounds (Western & Japanese Music both use this). CS5611 - Psychoacoustics - Niall Griffith – Semester 1 Computer Science and Information Systems, University of Limerick