Speech & Audio Processing - Part–II Digital Audio Signal Processing Marc Moonen Dept. E.E./ESAT-STADIUS, KU Leuven marc.moonen@esat.kuleuven.be homes.esat.kuleuven.be/~moonen/ Speech & Audio Processing • Part-I (H. Van hamme) speech recognition speech coding (+audio coding) speech synthesis (TTS) • Part-II (M. Moonen): Digital Audio Signal Processing microphone array processing noise- ,echo-, feedback- cancellation (de)reverberation active noise control, 3D audio PS: selection of topics Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 2 Digital Audio Signal Processing • • • • • • • Aims/scope Case study: Hearing instruments Overview Prerequisites Lectures/course material/literature Exercise sessions/project Exam Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 3 Aims/Scope Aim is 2-fold : • Speech & audio per se S & A industry in Belgium/Europe/… • Basic signal processing theory/principles : Optimal filters Adaptive filter algorithms (APA, Filtered-X LMS,..) Kalman filters (linear/nonlinear) etc... Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 4 Hearing Aids (HAs) • Audio input/audio output (`microphone-processing-loudspeaker’) • ‘Amplifier’, but so much more than an amplifier!! • History: Horns/trumpets/… `Desktop’ HAs (1900) Wearable HAs (1930) Digital HAs (1980) 2007 (Oticon) • • • • 1921 Case Study: Hearing Instruments 1/12 • State-of-the-art: • MHz’s clock speed • Millions of arithmetic operations/sec, … • Multiple microphones Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 5 • Audio input/electrode stimulation output • Stimulation strategy + preprocessing similar to HAs • History: Intra-cochlear • • • • Volta’s experiment… First implants (1960) Commercial CIs (1970-1980) Digital CIs (1980) electrode • State-of-the-art: • MHz’s clock speed, Mops/sec, … • Multiple microphones © Cochlear Ltd Cochlear Implants (CIs) Alessandro Volta 1745-1827 Case Study: Hearing Instruments 2/12 Other: Bone anchored HAs, middle ear implants, … Digital Audio Signal Processing: Introduction Electrical stimulation Electrical stimulation p. 6 Version 2013-2014 Lecture-1: Introduction for low frequency for high frequency Case Study: Hearing Instruments 3/12 • Hearing loss types: • conductive • sensorineural • mixed • One in six adults (Europe) …and still increasing • Typical causes: • aging • exposure to loud sounds • … Digital Audio Signal Processing: Introduction Version 2013-2014 [Source: Lapperre] Lecture-1: Introduction p. 7 Case Study: Hearing Instruments 4/12 Hearing impairment : Dynamic range & audibility Normal hearing subjects Hearing impaired subjects Level 100dB 0dB Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 8 Case Study: Hearing Instruments 5/12 Hearing impairment : Dynamic range & audibility Dynamic range compression (DRC) (…rather than `amplification’) 100dB Output Level (dB) Level 100dB 0dB 0dB 0dB 100dB Input Level (dB) Design: multiband DRC, attack time, release time, … Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 9 Case Study: Hearing Instruments 6/12 Hearing impairment : Audibility vs speech intelligibility • Audibility does not imply intelligibility SNR 20dB • Hearing impaired subjects need 5..10dB larger signal-to-noise ratio (SNR) for speech understanding in 0dB noisy environments 30 50 70 90 Hearing loss (dB, 3-freq-average) • Need for noise reduction (=speech enhancement) algorithms: • State-of-the-art: monaural 2-microphone adaptive noise reduction • Near future: binaural noise reduction (see below) • Not-so-near future: multi-node noise reduction (see below) Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 10 Case Study: Hearing Instruments 7/12 HA technology requirements • Small form factor (cfr. user acceptance) • Low power: 1…5mW (cfr. battery lifetime ≈ 1 week) • Low processing delay: 10msec (cfr. synchronization with lip reading) DSP challenges in hearing instruments • • • • Dynamic range compression (cfr supra) Dereverberation: undo filtering (`echo-ing’) by room acoustics Feedback cancellation Noise reduction Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 11 Case Study: Hearing Instruments 8/12 DSP Challenges: Feedback Cancellation • Problem statement: Loudspeaker signal is fed back into microphone, then amplified and played back again • Closed loop system may become unstable (howling) • Similar to feedback problem in public address systems (for the musicians amongst you) Model F Similar to echo cancellation in GSM handsets, Skype,… but more difficult due to signal correlation Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 12 Case Study: Hearing Instruments 9/12 DSP Challenges: Noise reduction Multimicrophone ‘beamforming’, typically with 2 microphones, e.g. ‘directional’ front microphone and ‘omnidirectional’ back microphone “filter-and-sum” the microphone signals Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 13 Case Study: Hearing Instruments 10/12 Binaural hearing: Binaural auditory cues • ITD (interaural time difference) • ILD (interaural level difference) signal ILD ITD • Binaural cues (ITD: f < 1500Hz, ILD: f > 2000Hz) used for • Sound localization • Noise reduction =`Binaural unmasking’ (‘cocktail party’ effect) 0-5dB Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 14 Case Study: Hearing Instruments 11/12 Binaural hearing aids • Two hearing aids (L&R) with wireless link & cooperation • Opportunities: • More signals (e.g. 2*2 microphones) • Better sensor spacing (17cm i.o. 1cm) • Constraints: power/bandwith/delay of wireless link • ..10kBit/s: coordinate program settings, parameters,… • ..300kBits/s: exchange 1 or more (compressed) audio signals • Challenges: • Improved localization through cue preservation • Improved noise reduction + benefit from binaural unmasking • Signal selection/filtering, audio coding, synchronisation, … Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 15 Case Study: Hearing Instruments 12/12 Future: Multi-node noise reduction – sensor networks Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 16 Overview General speech communication set-up : - background ‘noise’ noise suppression, source separation - far-end echoes acoustic echo cancellation - reverberation de-reverberation/deconvolution Applications : • teleconferencing/teleclassing • hands-free telephony • hearing aids, etc.. Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 17 Overview : Lecture-2 Microphone Array Processing Spatial filtering - Beamforming Fixed vs. adaptive beamforming Example filter-and-sum beamformer : S ( ) F1 ( ) F2 ( ) Z ( , ) Fm ( ) FM ( ) Y1 ( , ) Y2 ( , ) d m Y ( , ) 1 Y1 ( ,Ym)( , ) YM ( , ) d m cos Application: hearing aids Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 18 Overview : Lecture-3 Noise Reduction `microphone_signal[k] = speech[k] + noise[k]’ • Single-microphone noise reduction – Spectral Subtraction Methods (spectral filtering) – Iterative methods based on speech modeling (Wiener & Kalman Filters) • Multi-microphone noise reduction – Beamforming revisited – Optimal filtering approach : spectral+spatial filtering Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 19 Overview : Lecture-4 Acoustic Echo Cancellation Adaptive filtering problem: • non-stationary/wideband/… speech signals • non-stationary/long/… acoustic channels Adaptive filtering algorithms AEC Control AEC Post-processing Stereo AEC Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 20 Overview : Lecture-5 Acoustic Feedback Cancellation • Ex: Hearing aids • Ex: PA systems • correlation between filter input (`x ’) and near-end signal (‘ n ’) • fixes : noise injection, pitch shifting, notch filtering, ... amplifier Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 21 Overview : Lecture-6 Reverb & De-reverberation ` microphone_signal[k] = filter*speech[k] (+ noise[k]) ’ • Reverb = effect of acoustic channel in between speaker and microphone(s) • Reverb has an impact on coding, speech recognition, etc. • Single-microphone de-reverberation – Cepstrum techniques • Multi-microphone de-reverberation: – Estimation of acoustic impulse responses – Inverse-filtering method – Matched filtering Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 22 Overview : Lecture-7 Active Noise Control • Solution based on `filtered-X LMS’ • Application : active headsets/ear defenders Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 23 Overview : Lecture-7bis 3D Audio & Loudspeaker Arrays • Binaural synthesis …with headphones head related transfer functions (HRTF) …with 2+ loudspeakers (`sweet spot’) crosstalk cancellation Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 24 Overview : Lecture-8 Case Study: Signal Processing in Cochlear Implants 1Hr lecture by Cochlear LtD To be scheduled Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 25 Aims/Scope (revisited) Aim is 2-fold : • Speech & audio per se • Basic signal processing theory/principles : Optimal filtering / Kalman filters (linear/nonlinear) here : speech enhancement other : automatic control, spectral estimation, ... Advanced adaptive filter algorithms here : acoustic echo cancellation other : digital communications, ... Filtered-X LMS here : 3D audio other : active noise/vibration control Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 26 Lectures Lectures: 7*2hrs + 1*1hr – PS: Time budget = (15hrs)*4 = 60 hrs Course Material: Slides – Use version 2013-2014 ! – Download from DASP webpage http://homes.esat.kuleuven.be/~dspuser/dasp/ Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 27 Prerequisites • H197 Signals & Systems (JVDW) • HJ09 Digital Signal Processing (I) (PW) signal transforms, sampling, multi-rate, DFT, … • HC63 DSP-CIS (MM) filter design, filter banks, optimal & adaptive filters Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 28 Literature Literature (General) (available in DSP-CIS library) • Simon Haykin `Adaptive Filter Theory’ (Prentice Hall 1996) • P.P. Vaidyanathan `Multirate Systems and Filter Banks’ (Prentice Hall 1993) Literature (specialized) (some available in DSP-CIS library) • S.L. Gay & J. Benesty `Acoustic Signal Processing for Telecommunication’ (Kluwer 2000) • M. Kahrs & K. Brandenburg (Eds) `Applications of Digital Signal Processing to Audio and Acoustics’ (Kluwer1998) • B. Gold & N. Morgan `Speech and Audio Signal Processing’ (Wiley 2000) Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 29 Exercise Sessions/Project Direction-of-arrival θ Acoustic source localization – – – – Direction-of-arrival estimation Noise reduction Echo cancellation Simulated set-up Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 30 Acoustic Source Localization Project PS: groups of 2 • Runs over 4 weeks (non-consecutive) • Each week – 1 PC/Matlab session (supervised, 2.5hrs) – 2 ‘Homework’ sesions (unsupervised, 2*2.5hrs) PS: Time budget = 4*(2.5hrs+5hrs) = 30 hrs • ‘Deliverables’ after week 2 & 4 • Grading: based on deliverables, evaluated during sessions • TAs: guiliano.bernardi@esat (English+Italian) alexander.bertrand@esat (English+Dutch) Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 31 Acoustic Source Localization Project Work Plan – Week 1: Design Matlab simulation set-up – Week 2: Direction-of-arrival (DoA) estimation *deliverable* – Week 3: DoA estimation + noise reduction – Week 4: DoA estimation + echo cancellation *deliverable* ..be there ! Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 32 Exam • Oral exam, with preparation time • Open book • Grading 7 for question-1 7 for question-2 +6 for project ___ = 20 Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 33 September Retake Exam • Oral exam, with preparation time • Open book • Grading 7 for question-1 7 for question-2 +6 for question-3 ___ = 20 Digital Audio Signal Processing: Introduction (related to project work) Version 2013-2014 Lecture-1: Introduction p. 34 Website 1) TOLEDO 1) http://homes.esat.kuleuven.be/~dspuser/dasp/ • • • • • Contact: guiliano.bernardi@esat Slides (use `version 2013-2014’ !!) Schedule DSP-library FAQs (send questions to marc.moonen@esat) Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 35 Questions? 1) Ask teaching assistant (during exercises sessions) 2) E-mail questions to teaching assistant or marc.moonen@esat 3) Make appointment marc.moonen@esat ESAT Room 01.69 Digital Audio Signal Processing: Introduction Version 2013-2014 Lecture-1: Introduction p. 36