"Dude, Where's My... Signals and Systems Textbook?" Joseph Picone Inst. for Signal and Info. Processing Dept. Electrical and Computer Eng. Mississippi.

Slides:



Advertisements
Similar presentations
Introduction to Computational Linguistics
Advertisements

Introduction to Computational Linguistics
Lesson 1: Introduction to IT Business and Careers
Masters Presentation at Griffith University Master of Computer and Information Engineering Magnus Nilsson
ECE 8443 – Pattern Recognition Objectives: Acoustic Modeling Language Modeling Feature Extraction Search Pronunciation Modeling Resources: J.P.: Speech.
ECE 8443 – Pattern Recognition Objectives: Course Introduction Typical Applications Resources: Syllabus Internet Books and Notes D.H.S: Chapter 1 Glossary.
PHONEXIA Can I have it in writing?. Discuss and share your answers to the following questions: 1.When you have English lessons listening to spoken English,
Department of Mathematics and Computer Science
SPEECH RECOGNITION Kunal Shalia and Dima Smirnov.
Spoken Language Technologies: A review of application areas and research issues Analysis and synthesis of F0 contours Agnieszka Wagner Department of Phonetics,
Top Level System Block Diagram BSS Block Diagram Abstract In today's expanding business environment, conference call technology has become an integral.
4/25/2001ECE566 Philip Felber1 Speech Recognition A report of an Isolated Word experiment. By Philip Felber Illinois Institute of Technology April 25,
Feature vs. Model Based Vocal Tract Length Normalization for a Speech Recognition-based Interactive Toy Jacky CHAU Department of Computer Science and Engineering.
Real-Time Speech Recognition Thang Pham Advisor: Shane Cotter.
Biometrics: Voice Recognition
Track: Speech Technology Kishore Prahallad Assistant Professor, IIIT-Hyderabad 1Winter School, 2010, IIIT-H.
... NOT JUST ANOTHER PUBLIC DOMAIN SOFTWARE PROJECT ...
Introduction to Automatic Speech Recognition
Course Overview Course Requirements Knowledge and Skills Student Expectations.
Information Technology
1 ECOR 1010 Introduction to Engineering The Systems Engineering Tradition at Carleton U D.L. Bailey Department of Systems and Computer Engineering Carleton.
OverviewOverview – Preparation – Day in the Life – Earnings – Employment – Career Path Forecast – ResourcesPreparationDay in the LifeEarningsEmploymentCareer.
BravoBrava Mississippi State University Can Advances in Speech Recognition make Spoken Language as Convenient and as Accessible as Online Text? Joseph.
Applications of Risk Minimization to Speech Recognition Joseph Picone Inst. for Signal and Info. Processing Dept. Electrical and Computer Eng. Mississippi.
Knowledge Base approach for spoken digit recognition Vijetha Periyavaram.
Midterm Review Spoken Language Processing Prof. Andrew Rosenberg.
Supervisor: Dr. Eddie Jones Electronic Engineering Department Final Year Project 2008/09 Development of a Speaker Recognition/Verification System for Security.
Joseph Picone, PhD Human and Systems Engineering Professor, Electrical and Computer Engineering Bridging the Gap in Human and Machine Performance HUMAN.
Joseph Picone, PhD Department of Electrical and Computer Engineering Mississippi State University The Important Role of Mathematics in Human Language Technology.
Dr. Alireza Ghorshi Dr. Mohammad Mortazavi Dr. Mohammad Khansari Dr. Alireza Nemany Pour.
Beyond Call Recording: Speech Improves Quality Assurance Larry Mark Chief Technology Officer SER Solutions, Inc.
1 BILC SEMINAR 2009 Speech Recognition: Is It for Real? Tony Mirabito Defense Language Institute English Language Center (DLIELC) DLIELC.
12/28/08 - L1 Crs OvrvwCopyright Joanne DeGroat, ECE, OSU1 Course Overview Resume.
Temple University QUALITY ASSESSMENT OF SEARCH TERMS IN SPOKEN TERM DETECTION Amir Harati and Joseph Picone, PhD Department of Electrical and Computer.
Math 5 Professor Barnett Timothy G. McManus Anthony P. Pastoors.
IMPROVING RECOGNITION PERFORMANCE IN NOISY ENVIRONMENTS Joseph Picone 1 Inst. for Signal and Info. Processing Dept. Electrical and Computer Eng. Mississippi.
ELEC692/04 course_des 1 ELEC 692 Special Topic VLSI Signal Processing Architecture Fall 2004 Chi-ying Tsui Department of Electrical and Electronic Engineering.
DIALOG SYSTEMS FOR AUTOMOTIVE ENVIRONMENTS Presenter: Joseph Picone Inst. for Signal and Info. Processing Dept. Electrical and Computer Eng. Mississippi.
Signal Processing Emphasis Group Robert Moorhead Roger King Joe Picone Nick Younan Jim Fowler Lori Bruce Jenny Du.
THE DEPARTMENT OF ELECTRICAL AND COMPUTER ENGINEERING Joseph Picone, PhD Professor and Chair, Department of Electrical and Computer Engineering College.
Joseph Picone, PhD Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering An Overview of Statistical.
DIALOG SYSTEMS FOR AUTOMOTIVE ENVIRONMENTS Presenter: Joseph Picone Inst. for Signal and Info. Processing Dept. Electrical and Computer Eng. Mississippi.
INSTITUTE FOR SIGNAL AND INFORMATION PROCESSING Joseph Picone Inst. for Signal and Info. Processing Dept. Electrical and Computer Eng. Mississippi State.
Why You Should All Become Computer Engineers ECE 200 (Fall 2015) Saurabh Bagchi School of Electrical and Computer Engineering Purdue University.
S PEECH T ECHNOLOGY Answers to some Questions. S PEECH T ECHNOLOGY WHAT IS SPEECH TECHNOLOGY ABOUT ?? SPEECH TECHNOLOGY IS ABOUT PROCESSING HUMAN SPEECH.
© 2013 by Larson Technical Services
Automatic Speech Recognition A summary of contributions from multiple disciplines Mark D. Skowronski Computational Neuro-Engineering Lab Electrical and.
Automatic Discovery and Processing of EEG Cohorts from Clinical Records Mission: Enable comparative research by automatically uncovering clinical knowledge.
Basic structure of sphinx 4
Foundations of Information Systems in Business. System ® System  A system is an interrelated set of business procedures used within one business unit.
ARTIFICIAL INTELLIGENCE FOR SPEECH RECOGNITION. Introduction What is Speech Recognition?  also known as automatic speech recognition or computer speech.
Performance Analysis of Advanced Front Ends on the Aurora Large Vocabulary Evaluation Authors: Naveen Parihar and Joseph Picone Inst. for Signal and Info.
Chapter 7 Speech Recognition Framework  7.1 The main form and application of speech recognition  7.2 The main factors of speech recognition  7.3 The.
Natural Language and Speech (parts of Chapters 8 & 9)
Speaker Verification System Middle Term Presentation Performed by: Barak Benita & Daniel Adler Instructor: Erez Sabag.
Speech Recognition Created By : Kanjariya Hardik G.
Message Source Linguistic Channel Articulatory Channel Acoustic Channel Observable: MessageWordsSounds Features Bayesian formulation for speech recognition:
PREPARED BY MANOJ TALUKDAR MSC 4 TH SEM ROLL-NO 05 GUKC-2012 IN THE GUIDENCE OF DR. SANJIB KR KALITA.
Speaker Recognition UNIT -6. Introduction  Speaker recognition is the process of automatically recognizing who is speaking on the basis of information.
The Neural Engineering Data Consortium Mission: To focus the research community on a progression of research questions and to generate massive data sets.
SPEECH TECHNOLOGY An Overview Gopala Krishna. A
Artificial Intelligence for Speech Recognition
College of Engineering
HUMAN LANGUAGE TECHNOLOGY: From Bits to Blogs
Kocaeli University Introduction to Engineering Applications
EEG Recognition Using The Kaldi Speech Recognition Toolkit
TECHNOLOGICAL PROGRESS
HUMAN LANGUAGE TECHNOLOGY: From Bits to Blogs
HUMAN AND SYSTEMS ENGINEERING:
Network Training for Continuous Speech Recognition
Presentation transcript:

"Dude, Where's My... Signals and Systems Textbook?" Joseph Picone Inst. for Signal and Info. Processing Dept. Electrical and Computer Eng. Mississippi State University Contact Information: Box 9571 Mississippi State University Mississippi State, Mississippi Tel: Fax: IEEE: MSU STUDENT BRANCH URL:

INTRODUCTION ABSTRACT A funny thing happened on the way to an IEEE section talk at IIT in I found a career. I'll relate the rest of this story, including a couple of nice demos during this talk (hint: her name is Julie). I'll review some of the early history of digital signal processing and describe some of the speech applications that transformed this field into one of the single largest semiconductor markets today. I will show how signal processing research elegantly combines mathematics, software, and computer hardware to produce innovative technology such as cell phones and computers enhanced with speech recognition. Oh yeah, and I'll try to explain why we choose to torture you with a class called Signals and Systems. Joseph Picone is currently a Professor in the Department of Electrical and Computer Engineering at Mississippi State University, where he also directs the Institute for Signal and Information Processing. He has previously been employed by Texas Instruments and AT&T Bell Laboratories. Dr. Picone received his Ph.D. in Electrical Engineering from Illinois Institute of Technology in He is a Senior Member of the IEEE and a registered Professional Engineer.

CASE STUDIES Introduced at the Summer Consumer Electronics Show in Chicago First commercial speech synthesis consumer toy Based on linear prediction Contained a proprietary speech synthesis chip SPEAK & SPELL™ (JUNE 1978) Left to right: Gene Frantz, Richard Wiggins, Paul Breedlove and Larry Brantingham (1978)

CASE STUDIES Yes / no / true / false recognizer Answer questions about history Variety of learning modules Speaker independent recognition Microphone + children??? Won several industry design awards for the mechanical design VOYAGER™ (JUNE 1988)

CASE STUDIES Worlds of Wonder approach TI in September of Can you put this toy on the market by Thanksgiving? 10-word speaker dependent isolated word recognizer 100 sentences for synthesis “Transparent training” First large-scale consumer toy application for a DSP JULIE (DECEMBER 1988)

CASE STUDIES Voice verification for calling card security First wide-spread deployment of recognition technology in the telephone network Stimulated interest in voice dialing and other user-programmable features Original application was obsolete before wide- scale deployment WATSON (EARLY 1990’S)

CASE STUDIES Jack Deller said: model voice with a digital all-pole filter identify a person by their filter coefficients diagnose a disease by shifts in these coefficients recognize the words being spoken using pattern matching techniques maximum entropy, linear prediction, and a few other big words I didn’t understand... and I was hooked... WHY ATTEND IEEE TALKS?

CASE STUDIES Deller said you can replace this: DIGITAL FILTERS With this: What are the advantages?

HUMAN LANGUAGE TECHNOLOGY SPEECH RECOGNITION RESEARCH? Why do we work on speech recognition? “Language is the preeminent trait of the human species.” “I never met someone who wasn’t interested in language.” “I decided to work on language because it seemed to be the hardest problem to solve.” Why should we work on speech recognition? Antiterrorism, homeland security, military applications Telecommunications, mobile communications Education, learning tools, educational toys, enrichment Computing, intelligent systems, machine learning Commodity or liability? Fragile technology that is error prone

HUMAN LANGUAGE TECHNOLOGY FUNDAMENTAL CHALLENGES

HUMAN LANGUAGE TECHNOLOGY STATISTICAL APPROACHES

SPEECH RECOGNITION BLOCK DIAGRAM OVERVIEW Core components: transduction feature extraction acoustic modeling (hidden Markov models) language modeling (statistical N-grams) search (Viterbi beam) knowledge sources

SPEECH RECOGNITION FEATURE EXTRACTION

SPEECH RECOGNITION ACOUSTIC MODELING

SPEECH RECOGNITION LANGUAGE MODELING

SPEECH RECOGNITION VITERBI BEAM SEARCH breadth-first time synchronous beam pruning supervision word prediction natural language

Traditional Output: best word sequence time alignment of information Other Outputs: word graphs N-best sentences confidence measures metadata such as speaker identity, accent, and prosody SPEECH RECOGNITION APPLICATION OF INFORMATION RETRIEVAL

APPLICATIONS COMMON EVALUATIONS FUEL RESEARCH Only elite participate ISIP is competitive for same level of technology More advanced systems use more heuristics and incremental technology Common evaluations: –common data/task –WER metric Performance improves over time (annually) Resource intensive

TECHNOLOGY Speech recognition –State of the art –Continuous speech –Large vocabulary –Speaker independent Goal: Accelerate research –Flexibility, Extensibility –Efficient (C++) –Easy to Use –Toolkits, GUIs Benefit: Technology –Standard benchmarks –Conversational speech PUBLIC DOMAIN SOFTWARE

APPLICATIONS INFORMATION RETRIEVAL Metadata extraction from conversational speech Automatic gisting and intelligence gathering Speech to text is the core technology challenge Machines vs. humans Real-time audio indexing Time-varying channel Dynamic language model Multilingual and cross-lingual

APPLICATIONS In-vehicle dialog systems improve information access. Advanced user interfaces enhance workforce training and increase manufacturing efficiency. Noise robustness in both environments to improve recognition performance Advanced statistical models and machine learning technology Multidisciplinary team (IE, ECE, CS). CAVS: DIALOG SYSTEMS FOR THE CAR

SUMMARY THREE THINGS TO THINK ABOUT Speech Recognition: Application of machine learning technology Requires multidisciplinary background A grand challenge (long-term problem) Career Development: Broaden your horizons! Position yourself at the upper end of the job market (e.g. graduate school) Signals and Systems: Introduces you to a brave new world Abstraction is essential in engineering Competency in math and physics is critical