Did I say that?? Speech Lab Spring 2009 February 03, 09 1 Montgomery College Did I Say That? Did I Say That? Automatic Keyword Spotting Using Crosscorrelation.

Slides:



Advertisements
Similar presentations
Physics January
Advertisements

Correlation of Discrete-Time Signals Transmitted Signal, x(n) Reflected Signal, y(n) = x(n-D) + w(n) 0T.
1 Cross-Correlations and Cleaning Up Data Jessica Ferguson.
Voiceprint System Development Design, implement, test unique voiceprint biometric system Research Day Presentation, May 3 rd 2013 Rahul Raj (Team Lead),
Sampling and Pulse Code Modulation
Efficiency of Algorithms
Hierarchy of Design Voice Controlled Remote Voice Input Control Path Speech Processing IR Interface.
Introduction to Training and Learning in Neural Networks n CS/PY 399 Lab Presentation # 4 n February 1, 2001 n Mount Union College.
Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Speaker Identification Using Pitch Engineering Expo Banquet /08/09.
A Robust Algorithm for Pitch Tracking David Talkin Hsiao-Tsung Hung.
Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Keyword Spotting Using Crosscorrelation Engineering Expo Banquet 2009.
Itay Ben-Lulu & Uri Goldfeld Instructor : Dr. Yizhar Lavner Spring /9/2004.
Speech Classification Speech Lab Spring 2009 February 17, 09 1 Montgomery College Speech Classification Uche O. Abanulo Physics, Engineering And Geosciences.
Time-Scale Modification of Speech Signal
74 th EAGE Conference & Exhibition incorporating SPE EUROPEC 2012 Automated seismic-to-well ties? Roberto H. Herrera and Mirko van der Baan University.
1 Speech Parametrisation Compact encoding of information in speech Accentuates important info –Attempts to eliminate irrelevant information Accentuates.
Communications & Multimedia Signal Processing Analysis of the Effects of Train noise on Recognition Rate using Formants and MFCC Esfandiar Zavarehei Department.
MAE 552 – Heuristic Optimization Lecture 6 February 6, 2002.
CS 376b Introduction to Computer Vision 02 / 27 / 2008 Instructor: Michael Eckmann.
CS 376b Introduction to Computer Vision 02 / 18 / 2008 Instructor: Michael Eckmann.
On the Correlation between Energy and Pitch Accent in Read English Speech Andrew Rosenberg Weekly Speech Lab Talk 6/27/06.
Algorithms and Efficiency of Algorithms February 4th.
Characterization of Optical Devices Using Magnitude and Group Delay Measurement Haiqiao Lin, Undergraduate Student Dr. Christi Madsen, Professor Department.
1 Speech Parametrisation Compact encoding of information in speech Accentuates important info –Attempts to eliminate irrelevant information Accentuates.
Reasons for Teaching & Assessing Reading Fluency Reading Fluency.
Patrick-André Savard, Philippe Gournay and Roch Lefebvre Université de Sherbrooke, Québec, Canada.
V Telecommunications Industry AssociationTR XXX.
Performance of SPIROC chips in SKIROC mode
Speech Processing Laboratory
Project overview Face Recognition Security System Based on “Image Passport” Algorithm (FRSS)
Correlated and Uncorrelated Signals Problem: we have two signals and. How “close” are they to each other? Example: in a radar (or sonar) we transmit a.
1 BIEN425 – Lecture 8 By the end of the lecture, you should be able to: –Compute cross- /auto-correlation using matrix multiplication –Compute cross- /auto-correlation.
GoogleDictionary Paul Nepywoda Alla Rozovskaya. Goal Develop a tool for English that, given a word, will illustrate its usage.
BRAID: Stream Mining through Group Lag Correlations Yasushi Sakurai Spiros Papadimitriou Christos Faloutsos SIGMOD 2005.
6.2 Polynomials and Linear Functions
Efficient Elastic Burst Detection in Data Streams Yunyue Zhu and Dennis Shasha Department of Computer Science Courant Institute of Mathematical Sciences.
Uniform Quantization It was discussed in the previous lecture that the disadvantage of using uniform quantization is that low amplitude signals are drastically.
Bids and Budgeting Quiz Review. How is Ad Rank determined?
Creating an Automated Blood Vessel Diameter Tracking Tool Peter McLachlan Department of Medical Biophysics The University of Western Ontario Supervisor:
Experimental Results ■ Observations:  Overall detection accuracy increases as the length of observation window increases.  An observation window of 100.
Section 4 & 5 Review Google Adwords.  Contextual Targeting.
Temple University MASS SPECTROMETRY MATLAB SIMULATION Ilyana Mushaeva and Amber Moscato Department of Electrical and Computer Engineering Temple University.
Automated Summaries of Audio/Video Presentations Abigail Curden.
ILIAS WP1 – Cascina IGEC – First experience using the data of 5 bar detectors: ALLEGRO, AURIGA, EXPLORER NAUTILUS and NIOBE. – 1460.
ECE 5525 Osama Saraireh Fall 2005 Dr. Veton Kepuska
Audio processing methods on marine mammal vocalizations Xanadu Halkias Laboratory for the Recognition and Organization of Speech and Audio
Cognitive Systems Foresight Language and Speech. Cognitive Systems Foresight Language and Speech How does the human system organise itself, as a neuro-biological.
Engineering Students Opening Meeting. Montgomery College  Analyzing speech signals What did he say? Who is speaking Now?  Synthesizing speech  Improving.
YALE LAW SCHOOL POLICY SCIENCES CENTER ANNUAL INSTITUTE Using a New Method of Natural Language Intelligence for Performing Wiretap Analysis Amy Neustein,
HMM-Based Speech Synthesis Erica Cooper CS4706 Spring 2011.
Copyright © 2011 Pearson Education, Inc. Systems of Linear Equations in Two Variables Section 5.1 Systems of Equations and Inequalities.
Digital Computer Concept and Practice Copyright ©2012 by Jaejin Lee Control Unit.
Speaker Change Detection using Support Vector Machines V.Kartik, D.Srikrishna Satish and C.Chandra Sekhar Speech and Vision Laboratory Department of Computer.
LIGO-G Z r statistics for time-domain cross correlation on burst candidate events Laura Cadonati LIGO-MIT LSC collaboration meeting, LLO march.
LECTURE 13 QUALITY ASSURANCE METHOD VALIDATION
LIGO- G Z AJW, Caltech, LIGO Project1 A Coherence Function Statistic to Identify Coincident Bursts Surjeet Rajendran, Caltech SURF Alan Weinstein,
Speech Processing Laboratory, Temple University May 5, Structure-Based Speech Classification Using Nonlinear Embedding Techniques Uchechukwu Ofoegbu.
The information contained herein is CONFIDENTIAL and is not to be used or distributed in any manner without the express consent of Global Tel*Link Introducing.
AD insertion For commercial insertion. A MERP Approach Use an integrated database. Both planning and playout and media management.
SARVAJANIK COLLEGE OF ENGINEERING & TECHNOLOGY ANALOG ELECTRONICS Presentation Subject:-Class B & Class AB Power amplifiers.
Solar Image Recognition Workshop, Brussels, 23 & 24 Oct. The Detection of Filaments in Solar Images Dr. Rami Qahwaji Department of Electronic Imaging and.
Correlation Signal Processing ES & BM.
Control Unit Lecture 6.
Copyright © American Speech-Language-Hearing Association
ARTIFICIAL NEURAL NETWORKS
Lecture: Image manipulations (2) Filters and kernels
2/24/2019 Project: IEEE P Working Group for Wireless Personal Area Networks (WPANs) Submission Title: [Secure Ranging] Date Submitted: [15 August.
Anthor: Andreas Tsiartas, Prasanta Kumar Ghosh,
What kinds of graphs would be useful in calculating work and energy?
Keyword Spotting Dynamic Time Warping
Presentation transcript:

Did I say that?? Speech Lab Spring 2009 February 03, 09 1 Montgomery College Did I Say That? Did I Say That? Automatic Keyword Spotting Using Crosscorrelation Uchechukwu Abanulo In collaboration with Temple University Speech Lab Funded by the US Air Force Research Lab, Rome, NY Uche O. Abanulo Physics, Engineering And Geosciences

Did I say that?? Speech Lab Spring 2009 February 03, 09 2 Montgomery College Presentation Outline Uche O. Abanulo Physics, Engineering And Geosciences Research Goal Applications of Research Method Preliminary Results

Did I say that?? Speech Lab Spring 2009 February 03, 09 3 Montgomery College Research Goal Applications Method Preliminary Results

Did I say that?? Speech Lab Spring 2009 February 03, 09 4 Montgomery College Research Goal Keyword or Key-phrase detection – –Did the speaker say __________? Keyword or Key-phrase Identification – –What portion of the utterance contains ___________? Research Goal Applications Method Preliminary Results

Did I say that?? Speech Lab Spring 2009 February 03, 09 5 Montgomery College System Specifications Speaker independence Minimum utterance length – 2 seconds Noise/Interference Robustness Confidence Level Outputs Research Goal Applications Method Preliminary Results

Did I say that?? Speech Lab Spring 2009 February 03, 09 6 Montgomery College Illustration Listening Device Research Goal Applications Method Preliminary Results

Did I say that?? Speech Lab Spring 2009 February 03, 09 7 Montgomery College Research Goal Applications Method Preliminary Results Applications

Did I say that?? Speech Lab Spring 2009 February 03, 09 8 Montgomery College Eliminate manual listening to terrorist or ‘enemy’ conversations Homeland Security Automatically detect when targeted persons or groups utter certain flag words Research Goal Applications Method Preliminary Results

Did I say that?? Speech Lab Spring 2009 February 03, 09 9 Montgomery College Commercial Automatically search through speeches of important personnel for certain words or phrases Automated response systems Research Goal Applications Method Preliminary Results

Did I say that?? Speech Lab Spring 2009 February 03, Montgomery College Criminal Activity Detection Monitoring inmate conversations Detection of use of unpermitted words Automatic searches for flag words Research Goal Applications Method Preliminary Results

Did I say that?? Speech Lab Spring 2009 February 03, Montgomery College Method Research Goal Applications Method Preliminary Results

Did I say that?? Speech Lab Spring 2009 February 03, Montgomery College Cross correlation Measure of similarity between two signals Two signals compared by –Sliding one signal by a certain time lag –Multiplying both the overlapping regions –Repeating the process and adding the products until there is no more overlap If both signals are exactly the same, there’s a maximum peak at the time = 0, and the rest of the correlation signals tapers of to zero Research Goal Applications Method Preliminary Results

Did I say that?? Speech Lab Spring 2009 February 03, Montgomery College Cross correlation Research Goal Applications Method Preliminary Results

Did I say that?? Speech Lab Spring 2009 February 03, Montgomery College Approach Research Goal Applications Method Preliminary Results 1. Let the length of the keyword or phrase be n. The cross correlation of the keyword and the first n samples of the utterance is computed. xcorr Max power is not around zero lag – not position of keyword 2. Observe position of peak to see if it’s around the zero lag. Yes: Keyword No: Not keyword 3. Shift observed portion by a small amount and repeat process If a portion is reached where the peak is close to the zero lag, then that’s where the keyword is. If not, the utterance does not contain the keyword.

Did I say that?? Speech Lab Spring 2009 February 03, Montgomery College Demo Research Goal Applications Method Preliminary Results

Did I say that?? Speech Lab Spring 2009 February 03, Montgomery College Results Research Goal Applications Method Preliminary Results

Did I say that?? Speech Lab Spring 2009 February 03, Montgomery College Uche O. Abanulo Physics, Engineering And Geosciences