Speech Processing AEGIS RET All-Hands Meeting

Slides:



Advertisements
Similar presentations
Speech-to-Text Technology on Mac OS X Computer Access for Individuals with Disabilities.
Advertisements

Speech Processing AEGIS RET All-Hands Meeting University of Central Florida July 20, 2012 Applications of Images and Signals in High Schools.
Voiceprint System Development Design, implement, test unique voiceprint biometric system Research Day Presentation, May 3 rd 2013 Rahul Raj (Team Lead),
Speech Processing AEGIS RET All-Hands Meeting University of Central Florida July 20, 2012 Applications of Images and Signals in High Schools.
Quickfilter Pro Software Demonstration for QF1D512 The following slides will illustrate how you can design and verify a filter design in minutes! BEGIN.
Introduction The aim the project is to analyse non real time EEG (Electroencephalogram) signal using different mathematical models in Matlab to predict.
Reduction of Additive Noise in the Digital Processing of Speech Avner Halevy AMSC 664 Final Presentation May 2009 Dr. Radu Balan Department of Mathematics.
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
Introduction to Matlab II EE 2303 Lab. Basic Matlab Review Data file input/output string, char, double, struct  Types of variables load, save  directory/workspace.
DSP Implementation of a 1961 Fender Champ Amplifier James Siegle Advisor: Dr. Thomas L. Stewart March 11, 2003.
DSP Implementation of a 1961 Fender Champ Amplifier James Siegle Advisor: Dr. Thomas L. Stewart April 8, 2003.
Real-Time Speech Recognition Thang Pham Advisor: Shane Cotter.
Digital signal Processing Digital signal Processing ECI Semester /2004 Telecommunication and Internet Engineering, School of Engineering, South.
By Chance Berman and Clark Baumgartner. 1. Introduction 2. History 3. Modern Applications 4. Case Study 5. Ethical Analysis.
Representing Acoustic Information
Case Studies Dr Lee Nung Kion Faculty of Cognitive Sciences and Human Development UNIVERSITI MALAYSIA SARAWAK.
1 “ Speech ” EMPOWERED COMPUTING Greenfield Business Centre, 20 th September, 2006.
EE 701 Digital Signal Processing and Filtering Instructor: Dr. Ghazi Al Sukkar Dept. of Electrical Engineering The University of Jordan
Knowledge Base approach for spoken digit recognition Vijetha Periyavaram.
Modeling speech signals and recognizing a speaker.
Copyright ©2010, ©1999, ©1989 by Pearson Education, Inc. All rights reserved. Discrete-Time Signal Processing, Third Edition Alan V. Oppenheim Ronald W.
By: Meghal Bhatt.  Sphinx4 is a state of the art speaker independent, continuous speech recognition system written entirely in java programming language.
Activity 1 Record and edit your voice using Audacity 1.Download Audacity (a free and open source audio editing software from
Jacob Zurasky ECE5526 – Spring 2011
Supervisor: Dr. Eddie Jones Co-supervisor: Dr Martin Glavin Electronic Engineering Department Final Year Project 2008/09 Development of a Speaker Recognition/Verification.
Standard Grade Presentations & Multimedia. Presentation & Multimedia Software Allows the user to set up exciting and attractive documents which helps.
Dan Rosenbaum Nir Muchtar Yoav Yosipovich Faculty member : Prof. Daniel LehmannIndustry Representative : Music Genome.
Speech Recognition Feature Extraction. Speech recognition simplified block diagram Speech Capture Speech Capture Feature Extraction Feature Extraction.
Controlling Computer Using Speech Recognition (CCSR) Creative Masters Group Supervisor : Dr: Mounira Taileb.
Chapter 14 Multimedia Networking Cisco Learning Institute Network+ Fundamentals and Certification Copyright ©2005 by Pearson Education, Inc. Upper Saddle.
+ Assistive Technology By Lyndsay RHodes. + Screen Reader A screen reader is a software application for people with severe visual impairments. A screen.
Basic structure of sphinx 4
ARTIFICIAL INTELLIGENCE FOR SPEECH RECOGNITION. Introduction What is Speech Recognition?  also known as automatic speech recognition or computer speech.
Fourier and Wavelet Transformations Michael J. Watts
The first thing you need to do is log in. This is what the “Log In Screen” looks like. Remember to get teacher permission and login information prior.
Copyright ©2010, ©1999, ©1989 by Pearson Education, Inc. All rights reserved. Discrete-Time Signal Processing, Third Edition Alan V. Oppenheim Ronald W.
Chapter 7 Speech Recognition Framework  7.1 The main form and application of speech recognition  7.2 The main factors of speech recognition  7.3 The.
Copyright ©2010, ©1999, ©1989 by Pearson Education, Inc. All rights reserved. Discrete-Time Signal Processing, Third Edition Alan V. Oppenheim Ronald W.
EC1358 – DIGITAL SIGNAL PROCESSING
ADAPTIVE BABY MONITORING SYSTEM Team 56 Michael Qiu, Luis Ramirez, Yueyang Lin ECE 445 Senior Design May 3, 2016.
Bryant Tober. Problem Description  View the sound wave produced from a wav file  Apply different modulations to the wave file  Hear the effect of the.
 Signal: Physical quantity that varies with time, space or any other independent variable/s.  System: Physical device that performs an operation on.
Digital Signal Processing Rahil Mahdian LSV Lab, Saarland University, Germany.
Speech Recognition Xiaofeng Lai. What is speech recognition?  Speech recognition :  This is the ability of a machine or program to identify words and.
بسم الله الرحمن الرحيم Lecture (1) Introduction to DSP Dr. Iman Abuel Maaly University of Khartoum Department of Electrical and Electronic Engineering.
Speech Processing Dr. Veton Këpuska, FIT Jacob Zurasky, FIT.
Computer Graphics Lecture 1 Introduction to Computer Graphics
Topic: Waveforms in Noesis
Speech Processing AEGIS RET All-Hands Meeting
Voice selection on notes
Introduction to Digital Signal Processing
Automatic Speech Recognition
Speech recognition in mobile environment Robust ASR with dual Mic
ARTIFICIAL NEURAL NETWORKS
Speech Processing AEGIS RET All-Hands Meeting
Spoken Digit Recognition
Artificial Intelligence for Speech Recognition
Lecture 12 Linearity & Time-Invariance Convolution
ECE 3551 Microcomputer Systems
Fourier and Wavelet Transformations
Ch.1: Introduction to audio signal processing
Leigh Anne Clevenger Pace University, DPS ’16
VAD (Voice Activity Detector)
Ala’a Spaih Abeer Abu-Hantash Directed by Dr.Allam Mousa
Activity 1 Record and edit your voice using Audacity
Digital Systems: Hardware Organization and Design
John H.L. Hansen & Taufiq Al Babba Hasan
Interactive media.
Practical Hidden Voice Attacks against Speech and Speaker Recognition Systems NDSS 2019 Hadi Abdullah, Washington Garcia, Christian Peeters, Patrick.
Photo Story 3 for Windows
Presentation transcript:

Speech Processing AEGIS RET All-Hands Meeting Applications of Images and Signals in High Schools AEGIS RET All-Hands Meeting Florida Institute of Technology July 6, 2012

Contributors Dr. Veton Këpuska, Faculty Mentor, FIT vkepuska@fit.edu Jacob Zurasky, Graduate Student Mentor, FIT jzuraksy@my.fit.edu Becky Dowell, RET Teacher, BPS Titusville High dowell.jeanie@brevardschools.org

Motivation Timeline / Background – need to add this Difficulties Siri demo

Motivation Speech audio processing has increased in its usefulness. Applications Siri on iPhone 4S Automated telephone systems Voice transcription (e.g. dictation software) Hands-free computing (e.g., OnStar) Video games (e.g., XBOX Kinect) Military applications (e.g., aircraft control) Healthcare applications

Motivation Speech recognition requires speech to first be characterized by a set of “features”. Features are used to determine what words are spoken. Our project implements the feature extraction stage of a speech processing application.

Speech Recognition Front End: Pre-processing Back End: Recognition Speech Recognized speech Large amount of data. Ex: 256 samples Features Reduced data size. Ex: 13 features Front End – reduce amount of data for back end, but keep enough data to accurately describe the signal. Output is feature vector. 256 samples ------> 13 features Back End - statistical models used to classify feature vectors as a certain sound in speech

Front-End Processing of Speech Recognizer Pre-emphasis Window FFT Mel-Scale log IFFT

Speech Analysis Project Added GUI Allow user to record audio or input audio from a sound file Displays graph of the audio User can click on graph to select speech frame Processes speech frame and displays output for each state of processing Displays spectrogram

GUI Components

GUI Components Plotting Axes

Buttons GUI Components Plotting Axes

Future Work Improve GUI Audio Effects Noise Filtering

References Ingle, Vinay K., and John G. Proakis. Digital signal processing using MATLAB. 2nd ed. Toronto, Ont.: Nelson, 2007. Oppenheim, Alan V., and Ronald W. Schafer. Discrete-time signal processing. 3rd ed. Upper Saddle River: Pearson, 2010. Weeks, Michael. Digital signal processing using MATLAB and wavelets. Hingham,Mass.: Infinity Science Press, 2007.

Thank you! Questions?

Unit Plan