
Presentation transcript:

1 Speech Recognition

2 Introduction What is Speech Recognition? - Voice Recognition? Where can it be used? - Dictation - System control/navigation - Commercial/Industrial applications - Hand held digital recorders

3 Contents: Continuous/Discrete How does it work? Recent improvements Current software options Future of SR

4 Continuous or Discrete? Continuous speech - dictation Discrete speech - system controls

5 How does SR work? Recognition Training Correction Command/Control

6 Recognition (1) Voice Input, Analog to Digital, Acoustic Model, Language Model, Display, Speech Engine, Feedback

7 Recognition (2) Acoustic Modeling Spoken words: “I think there are…” Phonemes: ‘ay th-in-nk-kd dh-eh-r aa-r’ HMMs (hidden Markov models): 5-state representation Speech Engine
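
The five-state HMM representation on this slide can be illustrated with a small decoding sketch. The slides do not give the model's topology or probabilities, so everything here is an assumption: a left-to-right chain where each state either repeats or advances, made-up emission probabilities, and integer symbols standing in for acoustic frames. The hypothetical `viterbi` function finds the most likely state sequence for a run of observed symbols.

```python
import math

NEG_INF = float("-inf")


def safe_log(p):
    """Natural log that maps probability 0 to -infinity instead of raising."""
    return math.log(p) if p > 0 else NEG_INF


def viterbi(observations, log_trans, log_emit):
    """Return the most likely state path, assuming the path starts in state 0.

    log_trans[i][j] is the log probability of moving from state i to state j.
    log_emit[i][o] is the log probability that state i emits symbol o.
    """
    n_states = len(log_trans)
    # best[t][s]: best log score of any path that is in state s at time t
    best = [[NEG_INF] * n_states for _ in observations]
    # back[t][s]: the predecessor state on that best path
    back = [[0] * n_states for _ in observations]
    best[0][0] = log_emit[0][observations[0]]
    for t in range(1, len(observations)):
        for s in range(n_states):
            for prev in range(n_states):
                score = best[t - 1][prev] + log_trans[prev][s]
                if score > best[t][s]:
                    best[t][s] = score
                    back[t][s] = prev
            best[t][s] += log_emit[s][observations[t]]
    # Trace back from the best-scoring final state.
    state = max(range(n_states), key=lambda s: best[-1][s])
    path = [state]
    for t in range(len(observations) - 1, 0, -1):
        state = back[t][state]
        path.append(state)
    return path[::-1]


def build_left_to_right(n_states=5, stay=0.5, hit=0.8):
    """Build an illustrative left-to-right HMM (all numbers are assumptions)."""
    miss = (1.0 - hit) / (n_states - 1)
    log_trans = [[NEG_INF] * n_states for _ in range(n_states)]
    for s in range(n_states):
        if s + 1 < n_states:
            log_trans[s][s] = safe_log(stay)            # repeat the state
            log_trans[s][s + 1] = safe_log(1.0 - stay)  # advance one state
        else:
            log_trans[s][s] = safe_log(1.0)             # final state absorbs
    # State s prefers to emit symbol s; other symbols are unlikely.
    log_emit = [
        [safe_log(hit if o == s else miss) for o in range(n_states)]
        for s in range(n_states)
    ]
    return log_trans, log_emit


log_trans, log_emit = build_left_to_right()
path = viterbi([0, 0, 1, 2, 2, 3, 4], log_trans, log_emit)
print(path)
```

A real speech engine would score continuous acoustic features rather than integer symbols, and would chain many such phoneme models together; this sketch only shows the decoding step.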

8 Recognition (3) Language Modeling Word context Word frequency Transition possibilities
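
Word frequency and transition possibilities can be sketched with a bigram count model. This is a minimal illustration under assumptions, not the method of any product named in these slides: the tiny corpus, the function names, and the candidate words are all made up. The idea is that when two words sound alike, the word that more often follows the previous word is preferred.

```python
from collections import Counter, defaultdict


def train_bigrams(sentences):
    """Count word frequencies and word-to-word transitions."""
    unigrams = Counter()
    bigrams = defaultdict(Counter)
    for sentence in sentences:
        words = ["<s>"] + sentence.lower().split()  # <s> marks sentence start
        unigrams.update(words)
        for prev, word in zip(words, words[1:]):
            bigrams[prev][word] += 1
    return unigrams, bigrams


def transition_prob(bigrams, prev, word):
    """Estimate P(word | prev) from raw counts (no smoothing)."""
    total = sum(bigrams[prev].values())
    return bigrams[prev][word] / total if total else 0.0


def pick_word(bigrams, prev, candidates):
    """Choose the candidate most likely to follow the previous word."""
    return max(candidates, key=lambda w: transition_prob(bigrams, prev, w))


# Illustrative corpus (an assumption, not data from the presentation).
corpus = [
    "I think there are two",
    "there are many words",
    "our house is big",
]
unigrams, bigrams = train_bigrams(corpus)

# "are" and "our" can sound alike; word context decides between them.
choice = pick_word(bigrams, "there", ["are", "our"])
print(choice)
```

Production language models use far larger corpora and smooth the counts so that unseen word pairs do not get probability zero.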

9 Voice Training (1) Can be done by: Predetermined text segments Individual words Compares new acoustic data with the old and combines them More training = better recognition
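
The idea of combining new acoustic data with old can be pictured as a running average. The slides do not say how any product actually adapts its models, so this is only a stand-in: the hypothetical `update_profile` function treats the stored voice profile as the mean of the feature vectors seen so far and folds in one new sample.

```python
def update_profile(profile, count, new_sample):
    """Fold one new feature vector into the running mean of earlier samples.

    profile: mean feature vector of the `count` samples seen so far.
    new_sample: feature vector from the latest training utterance.
    Returns the updated mean and the new sample count.
    """
    if len(profile) != len(new_sample):
        raise ValueError("feature vectors must have the same length")
    updated = [
        (old * count + new) / (count + 1)
        for old, new in zip(profile, new_sample)
    ]
    return updated, count + 1


# Illustrative numbers only: one earlier sample, then a second one arrives.
profile, count = update_profile([1.0, 2.0], 1, [3.0, 4.0])
print(profile, count)
```

Each additional sample moves the profile less than the one before, which matches the slide's point that more training gives a more stable model of the user's voice.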

10 Voice Training (2) User specific Voice file Voice qualities Pronunciation Patterns of word use Preferred vocabulary

11 Making Corrections Move cursor by voice command Memorize edit commands List of possible alternatives Make correction manually

12 Command/Control Desktop grid Program or Link name/number URL name Memorized commands
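
A desktop grid lets the user steer the pointer by speaking cell numbers. The slide gives no details, so this sketch assumes a common design: the screen is split into a 3 x 3 grid numbered 1 to 9 in reading order, and each spoken number zooms into that cell and splits it again. The function name and the screen size are hypothetical.

```python
def grid_target(width, height, spoken_digits):
    """Return the (x, y) centre of the cell reached by a sequence of digits.

    Each digit 1-9 selects a cell of a 3 x 3 grid in reading order
    (1 = top left, 9 = bottom right) and zooms into it.
    """
    left, top = 0.0, 0.0
    cell_w, cell_h = float(width), float(height)
    for digit in spoken_digits:
        if not 1 <= digit <= 9:
            raise ValueError("grid cells are numbered 1 to 9")
        row, col = divmod(digit - 1, 3)
        cell_w /= 3
        cell_h /= 3
        left += col * cell_w
        top += row * cell_h
    return left + cell_w / 2, top + cell_h / 2


# Saying "five" selects the middle cell of a 900 x 900 area.
centre = grid_target(900, 900, [5])
# Saying "one", then "nine" selects the bottom-right part of the top-left cell.
corner = grid_target(900, 900, [1, 9])
print(centre, corner)
```

Each spoken digit shrinks the target area ninefold, so a few numbers are enough to reach any point on the screen.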

13 Recent Improvements in SR Faster training ~10 min. Better recognition ~95% More compatible software Better system control/command
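
A figure such as ~95% recognition is usually some form of word accuracy. The slide does not define the metric, so the sketch below assumes one common definition: count the fewest word substitutions, insertions, and deletions needed to turn the recognized text into the reference text, then divide by the reference length. The function names and sample sentences are illustrative.

```python
def word_errors(reference, hypothesis):
    """Minimum number of word edits between two texts (Levenshtein distance)."""
    ref = reference.split()
    hyp = hypothesis.split()
    # prev[j]: edits needed to match the reference words so far to hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word was dropped
                curr[j - 1] + 1,     # extra word was inserted
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def word_accuracy(reference, hypothesis):
    """Fraction of reference words recognized correctly, by edit distance."""
    n_words = len(reference.split())
    if n_words == 0:
        raise ValueError("reference text must contain at least one word")
    return 1.0 - word_errors(reference, hypothesis) / n_words


# A 20-word reference with one misrecognized word ("there" heard as "their").
reference = ("i think there are many good reasons to try speech recognition "
             "software for dictation at home or in the office")
hypothesis = reference.replace("there", "their")
accuracy = word_accuracy(reference, hypothesis)
print(round(accuracy, 2))
```

At 95% accuracy, a dictated page of 300 words would still contain about 15 errors to correct.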

14 Current Software Options for PC Dragon Systems – NaturallySpeaking Philips – FreeSpeech IBM – ViaVoice Lernout & Hauspie – Voice Xpress

15 How well do they work? Rated on: Training, Dictation, Correction, Application Integration, Command/Control. Dragon: Excellent, Good. Philips: Fair, Good. IBM: Excellent, Good, Excellent. L & H: Good.

16 Future of SR SUI – Speech-based User Interface Improvements needed: - Greater accuracy - Greater system control/command - More compatible software

17 Conclusion SR Uses How does it work? Current Software Problems of SR More SR coming soon….
