Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 Speech Processing. 2 Speech Processing:  Review of DSP Concepts  Review of Probability and Stochastic Processes  Anatomy and Physiology of Speech.

Similar presentations


Presentation on theme: "1 Speech Processing. 2 Speech Processing:  Review of DSP Concepts  Review of Probability and Stochastic Processes  Anatomy and Physiology of Speech."— Presentation transcript:

1 1 Speech Processing

2 2 Speech Processing:  Review of DSP Concepts  Review of Probability and Stochastic Processes  Anatomy and Physiology of Speech Production System  Phonemics and Phonetics  Spectrogram Reading  Linear Prediction Analysis  Speech Coding and Compression  Speech Synthesis (Text to Speech)  Speech Quality Assessment (Subjective and Objective)  Speech Recognition (Speech to Text)  Speech Enhancement

3 3 Speech Processing: Marking Scheme:  Homeworks:10%  Projects : 15%  Quizzes:20%  Midterm: 25%  Final Exam: 30%

4 4 Speech Processing: Text:  Spoken language processing Huang, Acero, Hon, 2000  Introduction to Digital Speech Processing Lawrence R. Rabiner and Ronald W. Schafer, 2007  Discrete time processing of speech Signals Deller,Proakis,Hansen,1993  Fundamentals of speech recognition Rabiner,Juang,1993  Password for any documents for the course: 40967spring93

5 ارسطو:‌ انسان، حيوان ناطق است. 5

6 Old Speech Synthesizers –Speech organ of Wheatstone, based on a system proposed by Wolfgang von Kempelen in 1791

7 Old Speech Synthesizers (cont’d) –Speech organ of Joseph Faber (1830-40)

8 Old Speech Synthesizers (cont’d) –Voder demonstrated in 1939 Source: http://www.ling.su.se/staff/hartmut/kemplne.htmhttp://www.ling.su.se/staff/hartmut/kemplne.htm

9 More modern labs (ICP lab in Grenoble, France) –Study of the face movements to be included in speech synthesis (and recognition).

10 Communication via Spoken Language

11

12 Virtues of Spoken Language Natural: Requires no special training Flexible: Leaves hands and eyes free Efficient: Has high data rate Economical: Communicated inexpensively Expressive:Conveys more than just words Popular/preferred:Verbal-acoustic problem solving Much longer evolution, compared to written language

13 Virtues of Spoken Language Speech interfaces are ideal for information access and management when:  The information space is broad and complex,  The users are not allowed (or at ease or capable) to use their eyes to read text messages,  The users are technically naive, or  Only telephones are available.

14 Diverse Sources of Constraint for Spoken Language Communication Acoustic: human vocal tract Phonetic: let us pray lettuce spray Phonological: gas shortage fish sandwich Phonotactic: sprachst (german) Syntactic: I am flying to Chicago tomorrow tomorrow I flying Chicago am to Semantic: Is the baby crying Is the bay bee crying Contextual:It is easy to recognize speech It is easy to wreck a nice beach

15 A Conversational System Architecture

16 Demo: Conversational Interface Jupiter weather information system  Access through telephone  500 cities worldwide  Harvest weather information from the Web several times daily


Download ppt "1 Speech Processing. 2 Speech Processing:  Review of DSP Concepts  Review of Probability and Stochastic Processes  Anatomy and Physiology of Speech."

Similar presentations


Ads by Google