Speech Recognition http://electronics.howstuffworks.com/gadgets/high-tech-gadgets/speech-recognition.htm/printable http://www.explainthatstuff.com/voicerecognition.html.

Slides:



Advertisements
Similar presentations
The Five Main Components of Reading Instruction
Advertisements

Created By: Lauren Snyder, Juliana Gerard, Dom Williams, and Ryan Holsopple.
For Friday No reading Homework –Chapter 23, exercises 1, 13, 14, 19 –Not as bad as it sounds –Do them IN ORDER – do not read ahead here.
SPEECH RECOGNITION Kunal Shalia and Dima Smirnov.
Natural Language Processing - Speech Processing -
Natural Language and Speech Processing Creation of computational models of the understanding and the generation of natural language. Different fields coming.
4/25/2001ECE566 Philip Felber1 Speech Recognition A report of an Isolated Word experiment. By Philip Felber Illinois Institute of Technology April 25,
ITCS 6010 Spoken Language Systems: Architecture. Elements of a Spoken Language System Endpointing Feature extraction Recognition Natural language understanding.
1 Phonetics Study of the sounds of Speech Articulatory Acoustic Experimental.
COMP 4060 Natural Language Processing Speech Processing.
Katie Shriver Danielle Tevlowitz Kristie Harris. Word recognition includes the following elements:  Recognizing words without conscious attention  Recognizing.
Assistive Technology By: Roxanne Majeski, Oscar Guerin, Tasha Reaves, Elias Luna.
Natural Language Understanding
ISSUES IN SPEECH RECOGNITION Shraddha Sharma
Introduction to Automatic Speech Recognition
Knowledge Base approach for spoken digit recognition Vijetha Periyavaram.
1 7-Speech Recognition (Cont’d) HMM Calculating Approaches Neural Components Three Basic HMM Problems Viterbi Algorithm State Duration Modeling Training.
Artificial Intelligence 2004 Speech & Natural Language Processing Natural Language Processing written text as input sentences (well-formed) Speech.
Speaker Recognition By Afshan Hina.
Supervisor: Dr. Eddie Jones Electronic Engineering Department Final Year Project 2008/09 Development of a Speaker Recognition/Verification System for Security.
7-Speech Recognition Speech Recognition Concepts
1 Computational Linguistics Ling 200 Spring 2006.
CHAPTER SEVEN ASSESSING AND TEACHING READING: PHONOLOGICAL AWARENESS, PHONICS, AND WORD RECOGNITION.
Machine Translation  Machine translation is of one of the earliest uses of AI  Two approaches:  Traditional approach using grammars, rewrite rules,
Math 5 Professor Barnett Timothy G. McManus Anthony P. Pastoors.
EcE 301- Research Seminar I Speech Recognition Paul M. Cabacungan ATENEO DE MANILA UNIVERSITY Image courtesy Amazon There are several software programs.
Artificial Intelligence 2004 Speech & Natural Language Processing Natural Language Processing written text as input sentences (well-formed) Speech.
Levels of Language 6 Levels of Language. Levels of Language Aspect of language are often referred to as 'language levels'. To look carefully at language.
Artificial Intelligence 2004 Speech & Natural Language Processing Speech Recognition acoustic signal as input conversion into written words Natural.
For Monday Read chapter 26 Last Homework –Chapter 23, exercise 7.
Speech Recognition Speech Recognition lets you speak into a microphone to control your computer. You can give commands that the computer will carry out.
For Friday Finish chapter 23 Homework –Chapter 23, exercise 15.
ARTIFICIAL INTELLIGENCE FOR SPEECH RECOGNITION. Introduction What is Speech Recognition?  also known as automatic speech recognition or computer speech.
A Primer on Reading Terminology. AUTOMATICITY Readers construct meaning through recognition of words and passages (strings of words). Proficient readers.
Reducing uncertainty in speech recognition Controlling mobile devices through voice activated commands Neil Gow, GWXNEI001 Stephen Breyer-Menke, BRYSTE003.
Language Language - a system for combining symbols (such as words) so that an unlimited number of meaningful statements can be made for the purpose of.
Distinctively Visual. Your task Define/describe what each symbol represents. Write down the first few things that pop into your mind.
Natural Language Processing (NLP)
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
Speech Recognition Created By : Kanjariya Hardik G.
A Hybrid Model of HMM and RBFN Model of Speech Recognition 길이만, 김수연, 김성호, 원윤정, 윤아림 한국과학기술원 응용수학전공.
1 7-Speech Recognition Speech Recognition Concepts Speech Recognition Approaches Recognition Theories Bayse Rule Simple Language Model P(A|W) Network Types.
TKT Tutoring Class Phonology.
Reading Information for Parents. How many sounds are in Manchester? How many syllables are in supercalifragilisticexpialido cious?
Christoph Prinz / Automatic Speech Recognition Research Progress Hits the Road.
Speech Recognition Xiaofeng Lai. What is speech recognition?  Speech recognition :  This is the ability of a machine or program to identify words and.
Speech Recognition through Neural Networks By Mohammad Usman Afzal Mohammad Waseem.
IIS for Speech Processing Michael J. Watts
Language in Use Pragmatics Natural and conventional signs.
Unit 3 Language Disabilities
ARTIFICIAL NEURAL NETWORKS
Speech Recognition UNIT -5.
Artificial Intelligence for Speech Recognition
A presentation on Basics of Speech Recognition Systems
Voice To Text Conversion
Phonics.
CS4705 Natural Language Processing
Oral Reading and the Development of Early Reading Ability
Natural Language Processing
Natural Language Processing
Command Me Specification
Languages – key stage 2 Subject content Key stage 2: Foreign language
Artificial Intelligence
Phoneme Recognition Using Neural Networks by Albert VanderMeulen
Personalized Speech Recognition for IoT Mahnoosh Mehrabani, Srinivas Bangalore, Benjamin interactions LLC Proceedings of IEEE 2nd World Forum on.
Scott Merkley & Jake Erikson
Lecture 21: Machine Learning Overview AP Computer Science Principles
Artificial Intelligence 2004 Speech & Natural Language Processing
Information Retrieval
Lecture 9: Machine Learning Overview AP Computer Science Principles
Presentation transcript:

Speech Recognition http://electronics.howstuffworks.com/gadgets/high-tech-gadgets/speech-recognition.htm/printable http://www.explainthatstuff.com/voicerecognition.html

Speech to Data analog-to-digital converter (ADC) translates an analog wave from your microphone into digital data that the computer can understand. It samples (digitizes) the sound by taking precise measurements of the wave at the recording sample frequency.

People pause their airflow when saying certain consonants like "p" or "t." The program then matches these segments to known phonemes in the appropriate language. A phoneme is the smallest element of a language -- a representation of the sounds we make and put together to form meaningful expressions. There are roughly 40 phonemes in the English language.

Hidden Markov Model 1. Has finite internal states that generate a set of external events (observations) 2. The internal state changes are invisible (hidden) to a viewer outside the system 3. The current state is always dependent on the immediate previous state only (Markov process) http://setosa.io/ev/markov-chains/

Interesting Examples http://cecas.clemson.edu/~ahoover/ece854/refs/Ramos-Intro-HMM.pdf

How to turn spoken sounds into written words Simple pattern matching (where each spoken word is recognized) Pattern and feature analysis (where each word is broken into bits and recognized from key features, such as the vowels it contains) Language modeling and statistical analysis (in which a knowledge of grammar and the probability of certain words or sounds following on from one another is used to speed up recognition and improve accuracy) Artificial neural networks (brain-like computer models that can reliably recognize patterns, such as word sounds, after exhaustive training).

Issues Homonyms Background noise Syntax (the grammatical structure of language) Semantics (the meaning of words)

Applications In-car systems – Apple CarPlay People with disabilities What others can you think of?

Amazon Alexa https://hackernoon.com/alexa-skills-and-intents-be8886645ff https://www.pluralsight.com/guides/node-js/amazon-alexa-skill-tutorial https://blog.kit.com/build-an-alexa-bot-e1342bff0465 https://developer.amazon.com/public/solutions/alexa/alexa-skills-kit/docs/supported-phrases-to-begin-a-conversation

Google Home https://www.programmableweb.com/news/how-to-get-started-google-actions/how-to/2017/01/31 https://docs.api.ai/docs/guidelines-slot-filling#section-managing-yes-no-unknown-answers-with-contexts