EE 8373 – Digital Speech Processing Spring 2004 Lecture 1 Reading •Quatieri: Ch. 1 •R&S: Ch. 1 •Feel free to do as many of the exercises as you want •Play with speech files. Experiment. •Fill out the feedback forms Note: A large part of the material in the notes has been borrowed from slides that Dr. Larry Rabiner put together, and they are intended for use only in this class. 1 EE 8373 Digital Speech Processing Instructor: Professor Panos Papamichalis Office: Room 349, Junkins Telephone: 214-768-4905 E-mail: panos@engr.smu.edu Office Hours: M-W-F 10:00-11:00, or by appointment (but you can send an e-mail any time) Course Description: A detailed treatment of theory and application of digital speech processing. The course provides a fundamental knowledge of speech signals and speech processing techniques. Topics include digital speech coding, speech synthesis, speech recognition, and speaker verification. Prerequisites: EE 7372, Digital Signal Processing Text: “Discrete-Time Signal Processing”, Thomas Quatieri Prentice-Hall, 2002 (Required) “Digital Processing of Speech Signals”, L. Rabiner & R. Schafer Prentice-Hall, 1978 (Optional) Grading: Homework Computer projects Final Exam 20% 50% 30% (Sat, May 1, 11:30-14:30) Homework & Computer Projects: •The homework will be more of the analytical type. •Computer projects will require you to process signals and then view and listen to them. The processing should be done using MATLAB, and the MATLAB files should accompany the project, to be able to duplicate your results. •Late homework: Solutions will be given on the lecture period after the homework is collected. No homework will be accepted after solutions are posted. •Late projects: The projects’ due date for distance education students will be one lecture period after the in-class students. The grade will be reduced by 10% for every lecture period you miss in turning it in. No projects will be accepted after the last day of classes. Feedback: After every lecture: •What stood out as most important in today’s lecture? •What are you confused about? •Other comments / complaints / suggestions? 2 Course Website use •Engr.smu.edu/ee/8373 •Place homework assignments and answers •Project assignments •Place lecture notes weekly •Other links like: •Cooledit 96 MATLAB use •Access on campus / off campus •Student MATLAB version 13 •Process speech files / view results / listen to results •.wav files read / modify / playback (wavread, wavwrite, wavplay) 3 Types of Speech Processing and Applications Speech Processing Coding •Store-and-forward •Applications: Cell phones Synthesis Enhancement •Word concatenation •Aids to the handicapped •Text to Speech •Helium speech •Cocktail-party effect Recognition •Speech recognition •Speaker verification •Speaker identification •Helicopter speech Another area: High-quality Audio processing 4 EE 8373 – Digital Speech Processing Spring 2004 Lecture 2 Reading •Quatieri: Ch. 2 •R&S: Ch. 2 •Fill out the feedback forms •Send me an e-mail to get your e-mail address •Feel free to do as many of the exercises as you want •Play with speech files. Experiment. 5 Speech Communication Mechanism Digital Processing of Speech Speech Source Speech Sink 6 7 “Which tea party did Baker go to?” 8 Digital Processing of Speech Signals 9 10 11 EE 8373 – Digital Speech Processing Spring 2004 Lecture 3 Reading •Quatieri: Ch. 2 (cont) •R&S: Ch. 2 (cont) •Fill out the feedback forms •Send me an e-mail to get your e-mail address •Feel free to do as many of the exercises as you want •Play with speech files. Experiment. 12 13 14 15 16 17