ROBOT CONTROL WITH VOICE

Slides:



Advertisements
Similar presentations
Improved ASR in noise using harmonic decomposition Introduction Pitch-Scaled Harmonic Filter Recognition Experiments Results Conclusion aperiodic contribution.
Advertisements

Speech Processing. References L.R. Rabiner and R.W. Schafer. Digital Processing of Speech Signals. Prentice-Hall, Lawrence Rabiner and Biing-Hwang.
Speech in Multimedia Hao Jiang Computer Science Department Boston College Oct. 9, 2007.
HCI : Speech /Speaker Recognition System
Speech Recognition Problem and Hidden Markov Model Ziba Rostamian CS Winter 2008.
Reduction of Additive Noise in the Digital Processing of Speech Avner Halevy AMSC 664 Final Presentation May 2009 Dr. Radu Balan Department of Mathematics.
SOME SIMPLE MANIPULATIONS OF SOUND USING DIGITAL SIGNAL PROCESSING Richard M. Stern demo August 31, 2004 Department of Electrical and Computer.
Top Level System Block Diagram BSS Block Diagram Abstract In today's expanding business environment, conference call technology has become an integral.
4/25/2001ECE566 Philip Felber1 Speech Recognition A report of an Isolated Word experiment. By Philip Felber Illinois Institute of Technology April 25,
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
EE491D Special Topics in Communications Adaptive Signal Processing Spring 2005 Prof. Anthony Kuh POST 205E Dept. of Elec. Eng. University of Hawaii Phone:
09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 1 ENEE408G: Capstone Design Project: Multimedia Signal Processing Design Project 1: Digital Speech.
Real-Time Speech Recognition Thang Pham Advisor: Shane Cotter.
Digital signal Processing Digital signal Processing ECI Semester /2004 Telecommunication and Internet Engineering, School of Engineering, South.
Pitch Prediction for Glottal Spectrum Estimation with Applications in Speaker Recognition Nengheng Zheng Supervised under Professor P.C. Ching Nov. 26,
A PRESENTATION BY SHAMALEE DESHPANDE
Ameneh Sadat Yazdaninik
ISSUES IN SPEECH RECOGNITION Shraddha Sharma
1 7-Speech Recognition (Cont’d) HMM Calculating Approaches Neural Components Three Basic HMM Problems Viterbi Algorithm State Duration Modeling Training.
Artificial Intelligence 2004 Speech & Natural Language Processing Natural Language Processing written text as input sentences (well-formed) Speech.
Supervisor: Dr. Eddie Jones Electronic Engineering Department Final Year Project 2008/09 Development of a Speaker Recognition/Verification System for Security.
1 Speech Perception 3/30/00. 2 Speech Perception How do we perceive speech? –Multifaceted process –Not fully understood –Models & theories attempt to.
Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission.
Page 0 of 23 MELP Vocoders Nima Moghadam SN#: Saeed Nari SN#: Supervisor Dr. Saameti April 2005 Sharif University of Technology.
By Sarita Jondhale1 Signal Processing And Analysis Methods For Speech Recognition.
Supervisor: Dr. Eddie Jones Co-supervisor: Dr Martin Glavin Electronic Engineering Department Final Year Project 2008/09 Development of a Speaker Recognition/Verification.
1 Linear Prediction. 2 Linear Prediction (Introduction) : The object of linear prediction is to estimate the output sequence from a linear combination.
Math 5 Professor Barnett Timothy G. McManus Anthony P. Pastoors.
Speaker independent Digit Recognition System Suma Swamy Research Scholar Anna University, Chennai 10/22/2015 9:10 PM 1.
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
Speaker Recognition by Habib ur Rehman Abdul Basit CENTER FOR ADVANCED STUDIES IN ENGINERING Digital Signal Processing ( Term Project )
MULTIMEDIA INPUT / OUTPUT TECHNOLOGIES INTRODUCTION 6/1/ A.Aruna, Assistant Professor, Faculty of Information Technology.
Recognition of Speech Using Representation in High-Dimensional Spaces University of Washington, Seattle, WA AT&T Labs (Retd), Florham Park, NJ Bishnu Atal.
VQ for ASR 張智星 多媒體資訊檢索實驗室 清華大學 資訊工程系.
Artificial Intelligence 2004 Speech & Natural Language Processing Speech Recognition acoustic signal as input conversion into written words Natural.
ECE 5525 Osama Saraireh Fall 2005 Dr. Veton Kepuska
Artificial Intelligence & Neural Network
Signals & Systems B-Tech (Hons). Signals & Systems Lecture # 1 Instructor Engr. Kashif Shahzad 2015.
WEEK INTRODUCTION IT440 ARTIFICIAL INTELLIGENCE.
Speech Recognition with CMU Sphinx Srikar Nadipally Hareesh Lingareddy.
ARTIFICIAL INTELLIGENCE FOR SPEECH RECOGNITION. Introduction What is Speech Recognition?  also known as automatic speech recognition or computer speech.
Chapter 20 Speech Encoding by Parameters 20.1 Linear Predictive Coding (LPC) 20.2 Linear Predictive Vocoder 20.3 Code Excited Linear Prediction (CELP)
By: Nicole Cappella. Why I chose Speech Recognition  Always interested me  Dr. Phil Show Manti Teo Girlfriend Hoax  Three separate voice analysts proved.
Course Name: Speech Recognition Course Number: Instructor: Hossein Sameti Department of Computer Engineering Room 706 Phone:
1 Speech Compression (after first coding) By Allam Mousa Department of Telecommunication Engineering An Najah University SP_3_Compression.
PREPARED BY MANOJ TALUKDAR MSC 4 TH SEM ROLL-NO 05 GUKC-2012 IN THE GUIDENCE OF DR. SANJIB KR KALITA.
Speaker Recognition UNIT -6. Introduction  Speaker recognition is the process of automatically recognizing who is speaking on the basis of information.
Acoustic phonetics: Concerned with describing the acoustics of speech. Also called speech acoustics. Big questions: (1) What are the relationships between.
Speech Recognition through Neural Networks By Mohammad Usman Afzal Mohammad Waseem.
Instructor: Mian Shahzad Iqbal
Voice Manipulator Department of Electrical & Computer Engineering
Total Physical Response (TPR)
Vocoders.
Artificial Intelligence for Speech Recognition
Overview Communication is the transfer of information from one place to another. This should be done - as efficiently as possible - with as much fidelity/reliability.
Linear Prediction.
Sharat.S.Chikkerur S.Anand Mantravadi Rajeev.K.Srinivasan
1 Vocoders. 2 The Channel Vocoder (analyzer) : The channel vocoder employs a bank of bandpass filters,  Each having a bandwidth between 100 HZ and 300.
HUMAN LANGUAGE TECHNOLOGY: From Bits to Blogs
Kocaeli University Introduction to Engineering Applications
Linear Predictive Coding Methods
Mobile Systems Workshop 1 Narrow band speech coding for mobile phones
The Vocoder and its related technology
Human Speech Perception and Feature Extraction
Richard M. Stern demo January 12, 2009
Digital Systems: Hardware Organization and Design
Linear Prediction.
Speech Processing Final Project
Keyword Spotting Dynamic Time Warping
Dr. Babasaheb Ambedkar Marathwada University, Aurangabad
Presentation transcript:

ROBOT CONTROL WITH VOICE ROBOTICS PROJECT ROBOT CONTROL WITH VOICE MEHMET ALİ ARABACI SAİT KAÇMAZ 8/14/2019

Aim of the Project Creating a command list that directs the robot to the desired location Sending these commands to the robot that uses an RF communication link by using a Speech Recognizer written in MATLAB 8/14/2019

Speech Recognizer Types Number of speakers Speaker Dependent Speaker Independent Nature of Utterance Isolated Word Recognition Continuous Word Recognition Spontaneous Recognition Vocabulary Size Small Size Recognizers Mid-Size Recognizers Large Vocabulary Size Recognizers 8/14/2019

What kind of Recognizer will be used in the project? Speaker Dependent Isolated Word Small Size 8/14/2019

Techniques used for Speech Recognition Acoustic-Phonetic Approach Pattern Recognition Approach Artificial Intelligence Approach 8/14/2019

Which technique is used in the project and why? Pattern Recognition Technique is used in the project Because, it is simplier to realize compared to Acoustic-Phonetic and AI Approach Acoustic-Phonetic Approach requires extensive knowledge of acoustic properties of phonetic units. AI requires an expertise work 8/14/2019

Disadvantages of Pattern Recognition Performance of the sensitivity of the system depends on the amount of training data available Sensitive to the enviroment 8/14/2019

Advantages of Pattern Recognition Simplier to realize compared to Acoustic-Phonetic and Artificial Intelligence Approach No specific knowledge of speech is required 8/14/2019

Time-Varying Digital Filter Pattern Recognition LPC (Linear Predictive Coding) Impulse Train Generator Pitch Period Random Noise Generator Unvoiced/Voiced Switch u(n) G Time-Varying Digital Filter Vocal Tract Parameters s(n) 8/14/2019

Block Diagram of the System Front-end Processor Comparison Decision Serial Link Robot RF Link Signal Input From Microphone A Simple Block Diagram of the System 8/14/2019

References L.R. Rabiner and B.H. Juang, Fundamentals of Speech Recognition, Prentice-Hall, Englewood Cliffs, N.J., 1993. “Remote Speaker and Speech Recognition”; Department of Electrical Engineering - University of California, Riverside; Prepared by Isaac Saldana and David Ginsberg http://murray.newcastle.edu.au/users/staff/speech/home_pages/tutorial_sr.html http://www.otolith.com/otolith/olt/lpc.html 8/14/2019