Music and the Mind Cognitive Elements of Music Listening Kevin D. Donohue Databeam Associate Professor Electrical and Computer Engineering University of Kentucky Is it Music? Identify which sounds are Music: A B C D E F G What is Music? 1 a : the science or art of ordering tones or sounds in succession, in combination, and in temporal relationships to produce a composition having unity and continuity b : vocal, instrumental, or mechanical sounds having rhythm, melody, or harmony Merrian-Webster Online Dictionary: http://www.m-w.com/dictionary/music Auditory Scene: Input Sensory organs (ears) separate acoustic energy into frequency bands and convert band energy into neural firings The auditory cortex receives the neural responses and abstracts an auditory scene. 1 2 0 0.0 5 3 0.1 Time 4 http://hyperphysics.phy-astr.gsu.edu/hbase/sound/hearcon.html Frequency Auditory Scene: Perception Perception derives a useful representation of reality from sensory input. Auditory Stream refers to a perceptual unit associated with a single happening (A.S. Bregman, 1990) . Acoustic to Neural Conversion Organize into Auditory Streams Representation of Reality Auditory Stream Experiment Bergman & Campbell (1971) Streams tend to form by grouping notes close in time and frequency (similarity and proximity). http://www.psych.mcgill.ca/labs/auditory/demo3.html http://www.psych.mcgill.ca/labs/auditory/demo2.html Circularity in Pitch Judgement Shepard’s Scale (1964) (Auditory Demonstrations CD, from the Acoustical Society of America) Perceptual Organization Organization properties: Belongingness – a sensory element belongs to an organization (or stream) of which is a part. Exclusive allocation – a sensory element cannot belong to more than one organization at a time. Bregman & Rudnicky (1975) Perceptual Organization Organization properties: Closure – perceived continuity, a tendency to close strong perceptual forms, response to missing evidence. Sequential and Spectral Integration Sequential Integration Grouping sensory elements over time or events at different times considered to be from the same source. Melody, rhythm Spectral Integration Fusing simultaneous sensory elements over frequency into one Timbre, harmony Timbre and Spectral Integration The time envelope and harmonic structure give rise the timbre of the sound. 0.5 Amplitude 0 dB -20 -40 -60 0 2000 4000 6000 Hertz 8000 10000 -0.5 12000 dB -20 -40 0 2000 4000 6000 Hertz 8000 10000 0.2 0.4 0.6 Seconds 0.8 1 0 0.2 0.4 0.6 Seconds 0.8 1 0 0.2 0.4 0.6 Seconds 0.8 1 0.5 0 -0.5 -1 12000 0 Amplitude 1 dB -20 -40 -60 0 1 Amplitude 0 -60 0 0 2000 4000 6000 Hertz 8000 10000 12000 0.5 0 -0.5 -1 Timbre and Spectral Integration Simultaneous tones grouped by timbre 2 Notes (F and A) 5000 5000 4000 4000 3000 3000 Hertz Hertz Same Note (A) 2000 2000 1000 1000 0 0.1 0.2 0.3 Seconds 0.4 0 0.1 0.2 0.3 Seconds 0.4 Auditory Scene Organization Primitive Stream Segregation Inherent constraints in auditory scene analysis (perceptual organization demonstrated by infants/children) Music: Organization of musical sensory units Schema-based segregation Learned constraints in auditory scene analysis (differences in perceptual organization resulting from training and culture) Music: Differences between musicians and non-musicians Music: Differences resulting from acculturation (A.S. Bregman, Auditory Scene Analysis, MIT Press 1990, pp. 1-45) Music Related Terms Pitch – Perceived frequency/fundamental tone (20Hz20kHz Range) Melody – Pattern of tones identified by the intervals between consecutive pitches Contour – Shape of the melody without regard to intervals Loudness – Perceived intensity of sound (0dB to 120dB) Timbre – Nature of a sound defined mostly by its harmonic structure and time envelope Rhythm – Repeated pattern of strong and weak sounds Tempo – Rate of the rhythm Melody Invariance A melody can typically be recognized over changes in pitch, loudness, timbre, tempo, spatial location, and reverberations. Contours are typically recalled better than actual melodies (intervals) for unfamiliar tunes. (Massaro, Kallman, and Kelly 1980). (Daniel J. Levitin, Memory for Musical Attributes, in Music Cognition and Computerized Sound, ed. P.R. Cook, MIT Press, 1999, pp. 209-227) Primitive Musical Perception Distinguish between cognitive components present at an early age and those resulting from acculturation. Infant: Grasp of musical structures Adult: Develop cognitive strategies for applying musical structures (W. Jay Dowling, The Development of Music Perception and Cognition, The Psychology of Music Academic Press, 1999, pp 603-625) Infant Perception Group rules for tones similar to that of adults (Bregman and Campbell 1971, and Demany 1982) Pitch octave equivalence and pitch constancy similar to that of adults (Kessen, Levine, and Wendrich 1979, Demany and Armand 1984) Contour changes mostly noticed. Adults are more perceptive to changes in melodies based on the tonal scale of the culture. Key constraints over different cultures: Octave equivalence 2:1 (Burns and Ward 1978) Perfect 5th 3:2 (most consonant interval, fundamental building block of multi-cultural scales) A limit of pitch classes within the octave (Miller 1956, argues 7 is the limit) Acculturation can be observed by 6 months and by 1 year different responses to non-diatonic and diatonic scales can be observed. Infant Perception Infants discriminate between different rhythmic patterns (Chang and Trehub 1977, Demany, McKenzie, and Vurpillot 1977) Melodies retain identity over transposition, likewise rhythm retains identity over changes in tempo (Monahan and Carterette 1985) and can notice changes in rhythm (Trehub and Thorpe 1989). Infants are sensitive to similar cues (pitch contour and note duration) as adults when identifying musical phrases (Jusczyk and Krumhansl 1993). Childhood Perception 9 to 18 months singing recognizable and repeatable songs. Within phrase accents and timing of phrases follow a regular beat (Dowling and Harwood 1986). Songs lack tonal stability with repeatable contours (Ostwald 1973). By 5 years holding a stable tonality over duration of song. Absolute pitch is rare in the general population of adults or children. Early learning hypothesis (Takeuchi and Hulse 1993). Childhood Perception Contour remains an important basis for melodic organization (Andrews and Dowling, 1991). 5-6 years recognize familiar version of tonal and atonal imitations. 9-10 distinguish familiar versions and same contour imitations. 5 years can distinguish near and far keys but not tonal imitations. 7 years can detect out-of-key and out-of-harmony changes like adults. 5 years can detect out-of-key changes (Trainor and Trehub, 1994). Childhood Perception Aspects of Musical Rhythm: Control attention in temporal sequence of events Remember and reproduce rhythm Hidden melodies in same pitch range can be identified by adults though following rhythms. 9-10 years regularly detect hidden melodies if distracter notes in different pitch range. 5-6 year olds can reproduce rhythms at 2 levelsof-organization (Drake 1993) Childhood Perception Rhythm Hierarchies: Beat or Meter are accents at regular time intervals Rhythmic accents can subdivide or extend over several beats Melodic changes can also create another level of rhythm 5 years can respond to more than one level of rhythmic hierarchy and integrate it with melody (Drake, Dowling, and Palmer, 1991) Little difference between 7 years and adult nonmusicians in rhythmic organization (Drake 1993) Childhood Perception Emotional states of music 4 years perform well in assigning affective labels to music: happy, sad, angry, afraid (Cunningham and Stearling, 1988, Dolgin and Adelson 1990) Summary Innate organization for separating sounds from different sources. Grouping by pitch, contour, rhythm (phrasing), and timbre are exhibited by infants. Acculturation refines melody distinctions and its relationship to harmonies and rhythms based on cultural scales and patterns. Melodic memory is enhanced for melodies following note of a known scale. Auditory scene analysis operations apply broadly to all sounds (speech, noise, music). Why some auditory streams become pleasurable/stimulating/interesting (music), and others are simply used to form a perception of reality is still not clear. How many streams are there? Tell Me Ma - Spectrogram in dB 8000 120 7000 100 6000 80 Hertz 5000 60 4000 3000 40 2000 20 1000 0 0 5 10 Seconds 15 Interesting Websites Mind, Music, and Machine http://www.nici.kun.nl/mmm/ Auditory Scene Analysis http://www.psych.mcgill.ca/labs/auditory/introASA.html Joe Wolfe’s Web Page http://www.phys.unsw.edu.au/~jw/Joe.html