By: Nicole Cappella. Why I chose Speech Recognition  Always interested me  Dr. Phil Show Manti Teo Girlfriend Hoax  Three separate voice analysts proved.

Slides:



Advertisements
Similar presentations
Robust Speech recognition V. Barreaud LORIA. Mismatch Between Training and Testing n mismatch influences scores n causes of mismatch u Speech Variation.
Advertisements

An Analysis of Hamptonese Using Hidden Markov Models Ethan Le and Mark Stamp Department of Computer Science San Jose State University McNair Scholars Program.
Machine Learning Hidden Markov Model Darshana Pathak University of North Carolina at Chapel Hill Research Seminar – November 14, 2012.
HIDDEN MARKOV MODELS Prof. Navneet Goyal Department of Computer Science BITS, Pilani Presentation based on: & on presentation on HMM by Jianfeng Tang Old.
Speech in Multimedia Hao Jiang Computer Science Department Boston College Oct. 9, 2007.
December 2006 Cairo University Faculty of Computers and Information HMM Based Speech Synthesis Presented by Ossama Abdel-Hamid Mohamed.
Hidden Markov Models Bonnie Dorr Christof Monz CMSC 723: Introduction to Computational Linguistics Lecture 5 October 6, 2004.
Language Model based Information Retrieval: University of Saarland 1 A Hidden Markov Model Information Retrieval System Mahboob Alam Khalid.
Ch-9: Markov Models Prepared by Qaiser Abbas ( )
Hidden Markov Models Theory By Johan Walters (SR 2003)
Foundations of Statistical NLP Chapter 9. Markov Models 한 기 덕한 기 덕.
Hidden Markov Models in NLP
SPEECH RECOGNITION Kunal Shalia and Dima Smirnov.
Part II. Statistical NLP Advanced Artificial Intelligence (Hidden) Markov Models Wolfram Burgard, Luc De Raedt, Bernhard Nebel, Lars Schmidt-Thieme Most.
Hidden Markov Models in Bioinformatics Example Domain: Gene Finding Colin Cherry
Speech Translation on a PDA By: Santan Challa Instructor Dr. Christel Kemke.
1 Hidden Markov Model Instructor : Saeed Shiry  CHAPTER 13 ETHEM ALPAYDIN © The MIT Press, 2004.
Dynamic Time Warping Applications and Derivation
A PRESENTATION BY SHAMALEE DESHPANDE
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
Natural Language Understanding
CHAPTER 15 SECTION 3 – 4 Hidden Markov Models. Terminology.
Toshiba Update 04/09/2006 Data-Driven Prosody and Voice Quality Generation for Emotional Speech Zeynep Inanoglu & Steve Young Machine Intelligence Lab.
Some Voice Enable Component Group member: CHUAH SIONG YANG LIM CHUN HEAN Advisor: Professor MICHEAL Project Purpose: For the developers,
Introduction to Automatic Speech Recognition
Knowledge Base approach for spoken digit recognition Vijetha Periyavaram.
1 7-Speech Recognition (Cont’d) HMM Calculating Approaches Neural Components Three Basic HMM Problems Viterbi Algorithm State Duration Modeling Training.
Speaker Recognition By Afshan Hina.
Hidden Markov Models Applied to Information Extraction Part I: Concept Part I: Concept HMM Tutorial HMM Tutorial Part II: Sample Application Part II: Sample.
Graphical models for part of speech tagging
Chapter 14 Speaker Recognition 14.1 Introduction to speaker recognition 14.2 The basic problems for speaker recognition 14.3 Approaches and systems 14.4.
Speech, Perception, & AI Artificial Intelligence CMSC March 5, 2002.
Hidden Markov Models for Information Extraction CSE 454.
Online Arabic Handwriting Recognition Fadi Biadsy Jihad El-Sana Nizar Habash Abdul-Rahman Daud Done byPresented by KFUPM Information & Computer Science.
Speaker independent Digit Recognition System Suma Swamy Research Scholar Anna University, Chennai 10/22/2015 9:10 PM 1.
LML Speech Recognition Speech Recognition Introduction I E.M. Bakker.
22CS 338: Graphical User Interfaces. Dario Salvucci, Drexel University. Lecture 10: Advanced Input.
Advanced Topics in Speech Processing (IT60116) K Sreenivasa Rao School of Information Technology IIT Kharagpur.
Speech, Perception, & AI Artificial Intelligence CMSC February 13, 2003.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Reestimation Equations Continuous Distributions.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Reestimation Equations Continuous Distributions.
Speech Recognition with CMU Sphinx Srikar Nadipally Hareesh Lingareddy.
Performance Comparison of Speaker and Emotion Recognition
Introduction Part I Speech Representation, Models and Analysis Part II Speech Recognition Part III Speech Synthesis Part IV Speech Coding Part V Frontier.
ARTIFICIAL INTELLIGENCE FOR SPEECH RECOGNITION. Introduction What is Speech Recognition?  also known as automatic speech recognition or computer speech.
BY KALP SHAH Sentence Recognizer. Sphinx4 Sphinx4 is the best and versatile recognition system. Sphinx4 is a speech recognition system which is written.
Automated Interpretation of EEGs: Integrating Temporal and Spectral Modeling Christian Ward, Dr. Iyad Obeid and Dr. Joseph Picone Neural Engineering Data.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Elements of a Discrete Model Evaluation.
Statistical Models for Automatic Speech Recognition Lukáš Burget.
1 Hidden Markov Model: Overview and Applications in MIR MUMT 611, March 2005 Paul Kolesnik MUMT 611, March 2005 Paul Kolesnik.
EEL 6586: AUTOMATIC SPEECH PROCESSING Hidden Markov Model Lecture Mark D. Skowronski Computational Neuro-Engineering Lab University of Florida March 31,
Classification of melody by composer using hidden Markov models Greg Eustace MUMT 614: Music Information Acquisition, Preservation, and Retrieval.
Speech Recognition Created By : Kanjariya Hardik G.
Message Source Linguistic Channel Articulatory Channel Acoustic Channel Observable: MessageWordsSounds Features Bayesian formulation for speech recognition:
PREPARED BY MANOJ TALUKDAR MSC 4 TH SEM ROLL-NO 05 GUKC-2012 IN THE GUIDENCE OF DR. SANJIB KR KALITA.
Visual Recognition Tutorial1 Markov models Hidden Markov models Forward/Backward algorithm Viterbi algorithm Baum-Welch estimation algorithm Hidden.
Christoph Prinz / Automatic Speech Recognition Research Progress Hits the Road.
Speaker Recognition UNIT -6. Introduction  Speaker recognition is the process of automatically recognizing who is speaking on the basis of information.
BIOMETRICS VOICE RECOGNITION. Meaning Bios : LifeMetron : Measure Bios : LifeMetron : Measure Biometrics are used to identify the input sample when compared.
ARTIFICIAL NEURAL NETWORKS
Speech Recognition UNIT -5.
Artificial Intelligence for Speech Recognition
EEL 6586: AUTOMATIC SPEECH PROCESSING Hidden Markov Model Lecture
Statistical Models for Automatic Speech Recognition
An INTRODUCTION TO HIDDEN MARKOV MODEL
Computational NeuroEngineering Lab
HUMAN LANGUAGE TECHNOLOGY: From Bits to Blogs
Statistical Models for Automatic Speech Recognition
Command Me Specification
Speaking patterns -MAS.662J, Fall 2004
Presentation transcript:

By: Nicole Cappella

Why I chose Speech Recognition  Always interested me  Dr. Phil Show Manti Teo Girlfriend Hoax  Three separate voice analysts proved Roniaha was girlfriends voice

Roadmap  What is Speech Recognition? Voice Recognition?  Process from Speech Production to Speech Perception  How Speech is Represented Models of Speech Recognition Types of Speech Recognition  Hidden Markov Model  Why HMM used in Speech Recognition  Three Basic Problems of HMM

Voice Recognition  Aimed towards identifying the person who is speaking  How it works  Every individual has unique pattern of speech due to their anatomy and behavioral patterns  Speaker verification vs. Speaker identification

Speech Recognition  Also known as Automatic Speech Recognition or Computer Speech Recognition  Translation of spoken words into text Speaker Independent Speaker Dependent  Performance of speech: Accuracy Speed  Problem?

Speech Recognition Applications:  Voice User Interfaces  Call Routing  Domestic Appliance Control  Search  Simple Data Entry  Radiology Report  Speech-to-text Processing  Aircrafts

Diagram of the Speech Production/Perception Process

Speech Representation  Speech signal represented in two different domains: time and the frequency domain  Three speech representations: Able to use speech signal and interpret its characteristics ○ Three-state Representation ○ Spectral Representation ○ Parameterization of the Spectral Activity  Useful to label the speech waveform being analyzed in a linguistic sense

Basic Model of Speech Recognition  This is a diagram of the recognition process  Standard Approach P(W,Y)  Goal: Decode string

Types of Speech Recognition  Different classes based on types of utterances they are able to recognize 1. Isolated Words “Listen/Not-Listen” states 2. Connected Words “run-together” 3. Continuous Speech Natural speech 4. Spontaneous Speech “ums”, “ahs”, stutter

Approaches to Speech Recognition  3 different approaches: 1. Acoustic Phonetic Approach 2. Pattern Recognition Approach HMM 3. Artificial Intelligence Approach

Pattern Recognition Approach  2 steps: Pattern Training Pattern Comparison  Uses mathematical framework  Forms: Speech Template Statistical Model (HMM)  Goal to determine identity of unknown speech according to how well patterns match

Methods in Pattern Comparison Approach  Template Based Approach Patterns stored as dictionary of words Match unknown utterance with reference templates Select best matching pattern  Stochastic Approach (HMM) Probabilistic Models Uncertainty and Incompleteness

HMM  HMM is used in the technique to implement speech recognition systems  Characterized by finite state Markov Model and set of output distributions  Doubly stochastic Underlying stochastic process which is not observable

The “Hidden” Part of the Model  System being modeled is assumed to be a Markov process with unobserved states  States not visible output is visible  Each state has probability distribution  Hidden refers to the state sequence through which model passes

Diagram and Representation of HMM -Three Probability Densities -Least important -Most important

Why HMM’s Used in Speech Recognition  General purpose speech recognition systems are based on HMM  Used because speech signal can be viewed as: a piecewise stationary signal short-time stationary signal  Can be trained automatically Simple Computationally feasible

Problems with HMM  Three problems 1. Evaluation Problem How do we “score” or evaluate the model? 2. Estimation Problem How do we uncover state sequence? 3. Training Problem It adapts the model parameters to observed training data  will create the best models for real phenomena

How Solutions to HMM Problems select word:  Example: How use Problem 3 ( Training Problem) Get model parameters for each word model How use Problem 2 ( Estimation Problem) Understand the physical meaning of the model states How use Problem 1 (Evaluation Problem) To recognize an unknown word Score each word based on given test observation sequence and select word whose model scored the highest

Recap  Voice Recognition vs. Speech Recognition  Approaches to Speech Recognition  Pattern Recognition leading to HMM  How HMM works  Problems and Solutions to HMM

References  Thompson, Lawrence. "Key Differences Between Speech Recognition and Voice Recognition." Key Differences Between Speech Recognition and Voice Recognition. N.p., n.d. Web. 10 Feb  Nilssan, Mikael, and Marcus Ejnarsson. Speech Recognition Using Hidden Markov Model. Tech. N.p.: n.p., Print.  Stamp, Mark. A Revealing Introduction to Hidden Markov Models. Rep. San Jose State University: n.p., Sept Web. 9 Feb  Li, Jia. "Hidden Markov Model." Hidden Markov Model. N.p., Mar Web. 17 Feb  Rabiner, L. R., and B. H. Juang. IEEE ASSP MAGAZINE, Jan Web. 10 Feb

References  Young, Steve. "HMMs and Related Speech Recognition Technologies." N.p., n.d. Web. 11 Feb  Anusuya, M. A., and S. K. Kattie. "Speech Recognition by Machine: A Review." International Journal of Computer Science and Information Security, Web. 12 Feb  "Hidden Markov Model." Wikipedia. Wikimedia Foundation, 4 Feb Web. 11 Feb  Srinivasan, A. "Speech Recognition Using Hidden Markov Model." Applied Mathematical Sciences, Web. 9 Feb  Mori, Renato De, and Fabio Brugnara. "1.5: HMM Methods in Speech Recognition." HMM Methods in Speech Recognition. N.p., n.d. Web. 12 Feb  "Speech Recognition." Wikipedia. Wikimedia Foundation, 30 Jan Web. 12 Feb