Download presentation
Presentation is loading. Please wait.
1
09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 1 ENEE408G: Capstone Design Project: Multimedia Signal Processing Design Project 1: Digital Speech Processing
2
09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 2 Outline of Design Project 1 Part I : Speech Analysis Part II : Speech Coding: Linear Predictive Vocoder Part III: Speech Recognition by IBM ViaVoice Part IV: Speech Synthesis Part V : Human Computer Interface Part VI: Mobile Computing and Pocket PC Programming
3
09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 3 Adjust the Microphone Device Use Sound Recorder By accessories entertainment sound recorder Select Line-In 2/Mic 2 By Edit audio properties sound recording Volume
4
09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 4 Part I. Speech Analysis (1) Human Vocal Apparatus
5
09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 5 Part I. Speech Analysis (2) Vocal Tract Model
6
09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 6 Part I. Speech Analysis (3) COLEA toolbox: Waveform on Time Domain Spectrogram Pitch and Formant Tracking LPC Spectra Record your own voice and analyze pitch and formants.
7
09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 7 Part I. Speech Analysis (4)
8
09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 8 Part I. Speech Analysis (5) Gender Identification: Use Auditory Toolbox to obtain Linear Predictive coefficients. Design your algorithm to identify the gender of samples in the training set. Test your algorithm on 9/26 by new samples.
9
09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 9 Pat II. Linear Predictive Vocoder: Encoder Encoder:
10
09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 10 Part II. Linear Predictive Vocoder:Decoder
11
09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 11 Part III. Speech Recognition IBM ViaVoice ViaVoice Training: Operate PC by ViaVoice
12
09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 12 Part III. IBM ViaVoice Training Start from BLUE word. Keep specking, the recognized words become GRAY. If you hear sounds or the BLUE sign stop in a specific word, return to the blue word and read the BLACK sentence again.
13
09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 13 Part III. IBM ViaVoice Dictation Speak Pad Menu Bar: 1. Menu Button 2. Microphone State 3. Status Area 4. ViaCenter Help 5. Current User
14
09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 14 Part IV. Speech Synthesis Text-To-Speech and Talking Head Vowel Synthesis Demo
15
09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 15 Part V. Human Computer Interface CSLU Human Computer Interface Rapid Application Developer (RAD) Start Speech Toolkit RAD MIT Galaxy System JUPITER: Weather Information System http://www.sls.lcs.mit.edu/sls/applications/jupiter.shtml TEL: 1-888-573-8255 PEGASUS: Airline Flight Planning System http://www.sls.lcs.mit.edu/sls/applications/pegasus.shtml TEL: 1-877-527-8255
16
09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 16 Part VI. Pocket PC Programming Apply what you learned from previous parts and design a simple application related to digital speech processing by Microsoft eMbedded Tools for Pocket PC.
17
09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 17 Announcement Matlab task: Part II C++ task: Part VI Check out Pocket PC
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.