Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, 2008 1 Montgomery College Keyword Spotting Using Crosscorrelation Engineering Expo Banquet 2009.

Slides:



Advertisements
Similar presentations
Voiceprint System Development Design, implement, test unique voiceprint biometric system Research Day Presentation, May 3 rd 2013 Rahul Raj (Team Lead),
Advertisements

10 / 31 Outline Perception workshop groups Signal detection theory Scheduling meetings.
Sinusoidal Waves. Objective of Lecture Discuss the characteristics of a sinusoidal wave. Define the mathematical relationship between the period, frequency,
4-5 graphs of sine and cosine functions
Improvement of Audio Capture in Handheld Devices through Digital Filtering Problem Microphones in handheld devices are of low quality to reduce cost. This.
Stats 95.
PITCH DETECTION Shaan Patel HoangMinh Nguyen Richard King.
Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Speaker Identification Using Pitch Engineering Expo Banquet /08/09.
A Robust Algorithm for Pitch Tracking David Talkin Hsiao-Tsung Hung.
M. Emre Sargın, Ferda Ofli, Yelena Yasinnik, Oya Aran, Alexey Karpov, Stephen Wilson,Engin Erzin, Yücel Yemez, A. Murat Tekalp Combined Gesture- Speech.
CHAPTER 4: HARMONIC RESPONSE WITH A SINGLE DEGREE OF FREEDOM
MAE 552 – Heuristic Optimization Lecture 6 February 6, 2002.
Reliability and Channel Coding
Scaffold Download free viewer:
1 Live Sound Reinforcement Audio measurements. 2 Live Sound Reinforcement One of the most common terms you will come across when handling any type of.
Chemometrics Method comparison
Introduction to Frequency Selective Circuits
Saeed Ebrahimijam Fall Faculty of Business and Economics Department of Banking and Finance Doğu Akdeniz Üniversitesi FINA 417.
Data Selection In Ad-Hoc Wireless Sensor Networks Olawoye Oyeyele 11/24/2003.
SEQC Pipeline Comparison Using SEQC data to create performance metrics for RNA Quantification pipelines.
An automated image prescreening tool for a printer qualification process by † Du-Yong Ng and ‡ Jan P. Allebach † Lexmark International Inc. ‡ School of.
ELECTRICAL CIRCUIT ET 201 Define and explain characteristics of sinusoidal wave, phase relationships and phase shifting.
Knowledge Base approach for spoken digit recognition Vijetha Periyavaram.
This is the graph of y = sin xo
Results from TOBAs Results from TOBAs Cross correlation analysis to search for a Stochastic Gravitational Wave Background University of Tokyo Ayaka Shoda.
Speech Processing Laboratory
Acid-Base Titrations Introduction 3.)Overview  Titrations are Important tools in providing quantitative and qualitative data for a sample.  To best understand.
Do Now Make a table for –2 ≤ x ≤ 2 and draw the graph of: y = 2 x (Problem #1 from today’s packet)
Linear Functions 2 Sociology 5811 Lecture 18 Copyright © 2004 by Evan Schofer Do not copy or distribute without permission.
Paris, July 17, 2009 RECENT RESULTS OF THE IGEC2 COLLABORATION SEARCH FOR GRAVITATIONAL WAVE BURST Massimo Visco on behalf of the IGEC2 Collaboration.
Signal Detection Theory October 10, 2013 Some Psychometrics! Response data from a perception experiment is usually organized in the form of a confusion.
Sound Quality.
Learning Phase at Head Ends 1 Edge Events Appliance Table Input Output by Naoki ref: M. Baranski and V. Jurgen (2004) by Josh Implemented in Java with.
BRAID: Stream Mining through Group Lag Correlations Yasushi Sakurai Spiros Papadimitriou Christos Faloutsos SIGMOD 2005.
Breaks in Daily Climate Records Ralf Lindau University of Bonn Germany.
UNIVERSITY OF SOUTH CAROLINA College of Engineering & Information Technology Bioinformatics Algorithms and Data Structures Chapter 1: Exact String Matching.
Bids and Budgeting Quiz Review. How is Ad Rank determined?
Creating an Automated Blood Vessel Diameter Tracking Tool Peter McLachlan Department of Medical Biophysics The University of Western Ontario Supervisor:
Optimising Cuts for HLT George Talbot Supervisor: Stewart Martin-Haugh.
ECE 5525 Osama Saraireh Fall 2005 Dr. Veton Kepuska
Voice Activity Detection based on OptimallyWeighted Combination of Multiple Features Yusuke Kida and Tatsuya Kawahara School of Informatics, Kyoto University,
© Imperial College LondonPage 1 A method for estimating the cost of reducing the false alarm rate in multi- institution performance monitoring using CUSUM.
Archived Data Management System Study Advisory Committee Meeting May 14, 2003.
Chapter 6: Frequency Domain Anaysis
Image Segmentation Dr. Abdul Basit Siddiqui. Contents Today we will continue to look at the problem of segmentation, this time though in terms of thresholding.
Vector Quantization Vector quantization is used in many applications such as image and voice compression, voice recognition (in general statistical pattern.
LIGO- G D Experimental Upper Limit from LIGO on the Gravitational Waves from GRB Stan Whitcomb For the LIGO Scientific Collaboration Informal.
Speaker Change Detection using Support Vector Machines V.Kartik, D.Srikrishna Satish and C.Chandra Sekhar Speech and Vision Laboratory Department of Computer.
Presented by: Idan Aharoni
Did I say that?? Speech Lab Spring 2009 February 03, 09 1 Montgomery College Did I Say That? Did I Say That? Automatic Keyword Spotting Using Crosscorrelation.
Speech Graffiti Tutorial FlightLine version Fall 03.
Modern VLSI Design 3e: Chapter 3 Copyright  1998, 2002 Prentice Hall PTR Topics n Electrical properties of static combinational gates: –transfer characteristics;
DYNAMIC TIME WARPING IN KEY WORD SPOTTING. OUTLINE KWS and role of DTW in it. Brief outline of DTW What is training and why is it needed? DTW training.
The joint influence of break and noise variance on break detection Ralf Lindau & Victor Venema University of Bonn Germany.
© 2001 Dr. Laura Snodgrass, Ph.D.1 Psychophysics Mathematical formula for relationship between changes in the physical stimulus and changes in conscious.
Meeting 8: Features for Object Classification Ullman et al.
Introduction Sample surveys involve chance error. Here we will study how to find the likely size of the chance error in a percentage, for simple random.
Spectral subtraction algorithm and optimize Wanfeng Zou 7/3/2014.
Study Skills & Test taking Want SUCCESS? “Study is nothing else but a possession of the mind” - Thomas Hobbes, 1651.
COMP8330/7330/7336 Advanced Parallel and Distributed Computing Decomposition and Parallel Tasks (cont.) Dr. Xiao Qin Auburn University
What is a CAT? What is a CAT?.
Copyright © American Speech-Language-Hearing Association
Automatic Picking of First Arrivals
Results from TOBAs Cross correlation analysis to search for a Stochastic Gravitational Wave Background University of Tokyo Ayaka Shoda M. Ando, K. Okada,
Retrieval of audio testimonials via voice search
SLOPE: A MATLAB Revival
Pitch Estimation By Chih-Ti Shih 12/11/2006 Chih-Ti Shih.
Input-Output-Process Demo
6-2 Polynomials & Linear Factors
Auditory Morphing Weyni Clacken
Presentation transcript:

Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Keyword Spotting Using Crosscorrelation Engineering Expo Banquet /08/09 Keyword Spotting Using Crosscorrelation Presenters: Bathiya Senevirathna Roshan Rajeev Roshan RajeevAdvisor: Dr. Uchechukwu Abanulo Montgomery College Speech Processing Laboratory

Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Keyword Spotting Using Crosscorrelation Engineering Expo Banquet /08/09 Research Goal Applications of Research Method Results Demo Presentation Outline Presenters: Bathiya Senevirathna Roshan Rajeev Roshan RajeevAdvisor: Dr. Uchechukwu Abanulo Montgomery College Speech Processing Laboratory

Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Keyword Spotting Using Crosscorrelation Engineering Expo Banquet /08/09 Research Goal Keyword detection Did the speaker say ____? Keyword location Where did the speaker say ____? Research Goal Applications of Research Method Experiment Demo

Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Keyword Spotting Using Crosscorrelation Engineering Expo Banquet /08/09 Applications of Research Research Goal Applications of Research Method Experiment Demo

Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Keyword Spotting Using Crosscorrelation Engineering Expo Banquet /08/09 Research Goal Interactive Voice Response Telephone ticket booking National Security Conversation monitoring to identify words of interest Research Goal Applications of Research Method Experiment Demo

Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Keyword Spotting Using Crosscorrelation Engineering Expo Banquet /08/09 Method Research Goal Applications of Research Method Experiment Demo

Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Keyword Spotting Using Crosscorrelation Engineering Expo Banquet /08/09 Crosscorrelation Measure of similarity between two signals Two signals compared by Sliding one signal by a certain time lag Multiplying both the overlapping regions Repeating the process and adding the products until there is no more overlap If both signals are exactly the same, there’s a maximum peak at the time = 0, and the rest of the correlation signals tapers off to zero Research Goal Applications of Research Method Experiment Demo Area Overlap

Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Keyword Spotting Using Crosscorrelation Engineering Expo Banquet /08/09 Research Goal Applications of Research Method Experiment Demo Crosscorrelation

Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Keyword Spotting Using Crosscorrelation Engineering Expo Banquet /08/09 Research Goal Applications of Research Method Experiment Demo Typical Cross-Correlation Results Keyword Match No Match See any differences??

Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Keyword Spotting Using Crosscorrelation Engineering Expo Banquet /08/09 Research Goal Applications of Research Method Experiment Demo A Closer look… Much higher amplitude near zero-lag pointMuch higher amplitude near zero-lag point Rest of graph is almost zeroRest of graph is almost zero No clear maximum pointsNo clear maximum points Amplitude is generally the same throughoutAmplitude is generally the same throughout Keyword Match No Match

Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Keyword Spotting Using Crosscorrelation Engineering Expo Banquet /08/09 Research Goal Applications of Research Method Experiment Demo The Algorithm 1. 1.Find average of points in the first 10% of the samples 2. 2.Find average of points in the last 90% of the samples 3. 3.Compute the ratio of the two values. If n = the number of samples in the crosscorrelation graph: Ratio =

Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Keyword Spotting Using Crosscorrelation Engineering Expo Banquet /08/09 Research Goal Applications of Research Method Experiment Demo The Algorithm xcorr 3. Shift observed portion by a small amount and repeat process If a portion is reached where the calculated ratio is above a defined minimum threshold then mark the location of the indices 2. If a portion is reached where the calculated ratio is above a defined minimum threshold then mark the location of the indices 1. Let the length of the keyword or phrase be n. The cross correlation of the keyword and the first n samples of the utterance is computed.

Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Keyword Spotting Using Crosscorrelation Engineering Expo Banquet /08/09 The Experiment Research Goal Applications of Research Method Experiment Demo

Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Keyword Spotting Using Crosscorrelation Engineering Expo Banquet /08/09 Research Goal Applications of Research Method Experiment Demo Experiment Effectiveness of the algorithm in finding keywords in a speech utterance 8 speakers, mixed gender Threshold varied from 6 to 15 Criteria: Hit: >50% of keyword length found in correct location False Alarm: 2 x length of keyword found in wrong location

Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Keyword Spotting Using Crosscorrelation Engineering Expo Banquet /08/09 Research Goal Applications of Research Method Experiment Demo Criteria - Hits Actual Location Wrong Location Miss! <50% of Word Found Miss! >50% of Word Found Hit!

Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Keyword Spotting Using Crosscorrelation Engineering Expo Banquet /08/09 Research Goal Applications of Research Method Experiment Demo Criteria – False Alarms Actual Location (No Keyword) >2x Keyword Length Found False Alarm! <2x Keyword Length Found No False Alarm! No Keyword Found No False Alarm!

Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Keyword Spotting Using Crosscorrelation Engineering Expo Banquet /08/09 Research Goal Applications of Research Method Experiment Demo Results Performance of Algorithm at Different Threshold Levels At a threshold of 8: 4/8 keywords were found 1/8 utterances with false alarms At a threshold of 8: 4/8 keywords were found 1/8 utterances with false alarms

Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Keyword Spotting Using Crosscorrelation Engineering Expo Banquet /08/09 Research Goal Applications of Research Method Experiment Demo Summary Crosscorrelation is a versatile tool for keyword spotting This was just one example of a possible algorithm Further research to optimize performance

Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Keyword Spotting Using Crosscorrelation Engineering Expo Banquet /08/09 Demo Research Goal Applications of Research Method Experiment Demo