EE 8373 – Digital Speech Processing Spring 2004 Lecture 1

EE 8373 – Digital Speech Processing
Spring 2004
Lecture 1
Reading
•Quatieri: Ch. 1
•R&S: Ch. 1
•Feel free to do as many of the exercises as you want
•Play with speech files. Experiment.
•Fill out the feedback forms
Note: A large part of the material in the
notes has been borrowed from slides that
Dr. Larry Rabiner put together, and they are
intended for use only in this class.
EE 8373
Digital Speech Processing
Instructor:
Professor Panos Papamichalis
Office: Room 349, Junkins
Telephone: 214-768-4905
E-mail: panos@engr.smu.edu
Office Hours: M-W-F 10:00-11:00, or by appointment
(but you can send an e-mail any time)
Course Description:
A detailed treatment of the theory and applications of digital
speech processing. The course provides fundamental knowledge of speech signals
and speech processing techniques. Topics include digital speech coding, speech
synthesis, speech recognition, and speaker verification.
Prerequisites: EE 7372, Digital Signal Processing
Text:
“Discrete-Time Speech Signal Processing: Principles and Practice”, Thomas F. Quatieri
Prentice-Hall, 2002 (Required)
“Digital Processing of Speech Signals”, L. Rabiner & R. Schafer
Prentice-Hall, 1978 (Optional)
Grading:
Homework: 20%
Computer projects: 50%
Final Exam: 30% (Sat, May 1, 11:30-14:30)
Homework & Computer Projects:
•Homework will be mostly analytical.
•Computer projects will require you to process signals and then view and listen to them.
The processing should be done in MATLAB, and the MATLAB files should accompany
the project so that your results can be reproduced.
•Late homework: Solutions will be given out in the lecture period after the homework is
collected. No homework will be accepted after solutions are posted.
•Late projects: For distance education students, each project is due one lecture
period after the due date for in-class students. The grade will be reduced by 10% for
every lecture period a project is late. No projects will be accepted after the last day of classes.
Feedback:
After every lecture:
•What stood out as most important in today’s lecture?
•What are you confused about?
•Other comments / complaints / suggestions?
Course Website use
•Engr.smu.edu/ee/8373
•Place homework assignments and answers
•Project assignments
•Place lecture notes weekly
•Other links like:
•Cooledit 96
MATLAB use
•Access on campus / off campus
•Student MATLAB version 13
•Process speech files / view results / listen to results
•.wav files read / modify / playback (wavread, wavwrite, wavplay)
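The read / modify / write cycle in the last bullet can be sketched in a few lines. The course requires MATLAB, so the block below is only an illustrative analogue in Python, using the standard-library wave module in place of wavread and wavwrite. The helper names read_wav, write_wav, and scale are hypothetical, and the sketch assumes 16-bit mono files. Playback (the wavplay step) is left out because the Python standard library has no portable equivalent.

```python
import array
import math
import os
import tempfile
import wave


def write_wav(path, samples, sample_rate):
    """Write 16-bit mono PCM samples (ints in [-32768, 32767]) to a .wav file."""
    with wave.open(path, "wb") as wf:
        wf.setnchannels(1)            # mono
        wf.setsampwidth(2)            # 2 bytes = 16 bits per sample
        wf.setframerate(sample_rate)
        wf.writeframes(array.array("h", samples).tobytes())


def read_wav(path):
    """Read a 16-bit mono .wav file and return (samples, sample_rate)."""
    with wave.open(path, "rb") as wf:
        if wf.getnchannels() != 1 or wf.getsampwidth() != 2:
            raise ValueError("this sketch handles 16-bit mono files only")
        sample_rate = wf.getframerate()
        samples = array.array("h")
        samples.frombytes(wf.readframes(wf.getnframes()))
    return list(samples), sample_rate


def scale(samples, gain):
    """Multiply every sample by gain, clipping to the 16-bit range."""
    return [max(-32768, min(32767, round(s * gain))) for s in samples]


if __name__ == "__main__":
    fs = 8000  # sampling rate in Hz, chosen only for this demo
    # 0.1 s of a 440 Hz tone stands in for a recorded speech file.
    tone = [round(10000 * math.sin(2 * math.pi * 440 * n / fs))
            for n in range(fs // 10)]
    with tempfile.TemporaryDirectory() as tmp:
        original = os.path.join(tmp, "tone.wav")
        quieter = os.path.join(tmp, "tone_half.wav")
        write_wav(original, tone, fs)                   # create a test file
        samples, rate = read_wav(original)              # read
        write_wav(quieter, scale(samples, 0.5), rate)   # modify, then write
        print(len(samples), "samples at", rate, "Hz")
```

Any other sample-by-sample modification (filtering, adding noise, windowing) would slot in where scale is called; the read and write steps stay the same.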
Types of Speech Processing and Applications
Speech Processing
Coding
•Store-and-forward
•Applications: Cell phones
Synthesis
•Word concatenation
•Text to Speech
Enhancement
•Aids to the handicapped
•Helium speech
•Cocktail-party effect
Recognition
•Speech recognition
•Speaker verification
•Speaker identification
•Helicopter speech
Another area: High-quality Audio processing
EE 8373 – Digital Speech Processing
Spring 2004
Lecture 2
Reading
•Quatieri: Ch. 2
•R&S: Ch. 2
•Fill out the feedback forms
•Send me an e-mail so that I have your e-mail address
•Feel free to do as many of the exercises as you want
•Play with speech files. Experiment.
Speech Communication Mechanism
[Figure: block diagram labeled “Digital Processing of Speech”, “Speech Source”, and “Speech Sink”]
“Which tea party did Baker go to?”
Digital Processing of Speech Signals
EE 8373 – Digital Speech Processing
Spring 2004
Lecture 3
Reading
•Quatieri: Ch. 2 (cont)
•R&S: Ch. 2 (cont)
•Fill out the feedback forms
•Send me an e-mail so that I have your e-mail address
•Feel free to do as many of the exercises as you want
•Play with speech files. Experiment.