Audio Signals

Audio Signals

Karanveer Mohan and Stephen Boyd

EE103

Stanford University

September 23, 2015

Acoustic pressure

I mean atmospheric pressure is around 10 5 N / m 2

I acoustic pressure p ( t ) is instantaneous pressure minus mean pressure

I we can only hear variations in p ( t ) on submillisecond and millisecond time scale

I rms ( p ) corresponds (roughly) to loudness of sound

I

I rms ( p ) = 1 N / m

2 is ear-splitting ( ∼ 120 dB SPL) rms ( p ) = 10

− 4 N / m 2 is barely audible ( ∼ 14 dB SPL)

I Sound Pressure Level (SPL) of acoustic pressure signal p is

20 log

10

( rms ( p ) /p ref

) , p ref

= 2 × 10 − 5 N / m 2

2

Vector representation of audio

I vector x ∈ R

N represents audio (sound) signal (or recording) over some time interval

I x i is (scaled) acoustic pressure at time t = hi : x i

= αp ( hi ) , i = 1 , . . . , N

I x i is called a sample

I h > 0 is the sample time; 1 /h is the sample rate

I typical sample rates are 1 /h = 44100/sec or 48000/sec ( h ≈ 20 µ sec)

I for a 3 -minute song, N ∼ 10

7

I α is scale factor

I stereophonic audio signal consists of a left and a right audio signal

3

0.2

0.1

0

−0.1

−0.2

0.15

0.1

0.05

0

−0.05

−0.1

Examples

Instrumental (play)

Speech (play)

4

Scaling audio signals

I if x is an audio signal, what does ax sound like? ( a is a number)

I answer: same as x but louder if | a | > 1 and quieter if | a | < 1

– 2 x sounds noticeably louder than x

– (1 / 2) x sounds noticeably quieter than x

– 10 x sounds much louder than x

– − x sounds the same as x

I a volume control simply scales an audio signal

I for this reason, the scale factor usually doesn’t matter

I example

– play x

– play 2 x

– play (1 / 2) x

– play − x

5

Linear combinations and mixing

I suppose x

1

, . . . , x k are k different audio signals with same length

I form linear combination y = a

1 x

1

+ a

2 x

2

+ · · · + a k x k

I y sounds like a mixture of the audio signals, with relative weights

| a

1

| , . . . , | a k

|

I forming y is called mixing , and x i are called tracks

I producers do this to produce finished recordings from separate tracks for vocals, instruments, drums, . . .

I coefficients a

1

, . . . , a k are adjusted (by ear) to give a good balance

I typical number of tracks: k = 48

6

Mixing example

I tracks

– drums (play)

– vocals (play)

– guitar (play)

– synthesizer (play)

I mix 1: a = (0 .

25 , 0 .

25 , 0 .

25 , 0 .

25) (play)

I mix 2: a = (0 , 0 .

7 , 0 .

1 , 0 .

3) (play)

I mix 3: a = (0 .

1 , 0 .

1 , 0 .

5 , 0 .

3) (play)

7

Musical tones

I suppose p ( t ) is an acoustic signal, with t in seconds

I it is periodic with period T if p ( t + T ) = p ( t ) for all t

(in practice, it’s good enough for p ( t + T ) ≈ p ( t ) for t in an interval at least 1/4 second or so)

I its frequency is f = 1 /T (in 1/sec of Hertz, Hz)

I for f in range 100 – 2000 , p is perceived as a musical tone

– frequency f determines pitch (or musical note)

– shape (a.k.a.

waveform ) of p determines timbre (quality of sound)

8

Musical notes

I f = 440 Hz is middle A

I one octave is doubling of frequency

I f = 880 Hz is A above middle A; f = 220 Hz is A below middle A

I each musical half step is a factor of 2

1 / 12 in frequency

I middle C is frequency f = 2 3 / 12 × 440 ≈ 523 .

2 Hz

(C is 3 half-steps above A)

I in Western music, certain consonant intervals have frequency ratios close to ratios of small integers

9

Frequency ratios and musical intervals

half steps

0

1

2

3

4

5

6 name unison frequency ratio

2 0 / 12 = 1

2 1 / 12

2 2 / 12

= 1

= 1 .

.

0595

1225 minor 3rd 2

3 / 12 major 3rd 2 4 / 12 perfect 4th 2 5 / 12

2 6 / 12

= 1

= 1

= 1

= 1 .

.

.

.

1892

2599

3348

4142

≈

≈

≈

6

5

4

/

/

/

5

4

3 play play

7

8

9

10

11

12 perfect 5th 2

7 / 12

2 8 / 12

= 1 .

4983

= 1 .

5974

2 9 / 12

2 10 / 12

= 1 .

6818

= 1 .

7818

≈ 3 / 2 play octave

2

11 / 12

2 12 / 12

= 1

= 2

.

8877 play

10

Periodic signals

I periodic signal p ( t ) =

K

X

( a k cos(2 πf kt ) + b k sin(2 πf kt )) k =1

I k is called harmonic or overtone

I f is frequency

I a k

, b k are harmonic coefficients

I any periodic signal can be approximated this way (Fourier series) with large enough K

11

Timbre

I timbre (quality of musical tone) is determined by harmonic amplitudes c

1

= q a 2

1

+ b 2

2

, . . .

c

K

= q a 2

K

+ b 2

K

I c = (1 , 0 , . . . , 0) (pure sine wave) is heard as pure, boring tone

I c = (0 .

3 , 0 .

4 , 0 .

2 , 0 .

3) has same pitch, but sounds ‘richer’

I with different harmonic amplitudes, can make sounds (sort of) like oboe, violin, horn, piano, . . .

12

0.5

Various timbres, same pitch

pure 220hz tone, c = 1 (play)

0

−0.5

0.5

0

−0.5

c = (0 .

7 , 0 .

6 , 0 .

3 , 0 .

04) (play)

13

Various timbres, same pitch

0.5

c = (0 .

21 , 0 .

4 , 0 .

9 , 0 .

05 , 0 .

05 , 0 .

05) (play)

0

0

−0.5

−0.5

0.5

c = (0 .

3 , . . . , 0 .

3) ∈ R

10

(play)

14

Audio Signals - Stanford University

Acoustic pressure

Vector representation of audio

Examples

Scaling audio signals

Linear combinations and mixing

Mixing example

Musical tones

Musical notes

Frequency ratios and musical intervals

Periodic signals

Timbre

Various timbres, same pitch

Various timbres, same pitch

Related documents

Products

Support

Audio Signals - Stanford University

Audio Signals

Acoustic pressure

Vector representation of audio

Examples

Scaling audio signals

Linear combinations and mixing

Mixing example

Musical tones

Musical notes

Frequency ratios and musical intervals

Periodic signals

Timbre

Various timbres, same pitch

Various timbres, same pitch

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib