Madhulika Pannuri Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Correlation Dimension.

Slides:



Advertisements
Similar presentations
Machine Learning and its Application to Speech Impediment Therapy.
Advertisements

Acousteen Herman J.M. Steeneken Subjective Intelligibility Assessment Dr. Herman J.M. Steeneken.
Look Who’s Talking Now SEM Exchange, Fall 2008 October 9, Montgomery College Speaker Identification Using Pitch Engineering Expo Banquet /08/09.
CENTER FOR SPOKEN LANGUAGE UNDERSTANDING 1 PREDICTION AND SYNTHESIS OF PROSODIC EFFECTS ON SPECTRAL BALANCE OF VOWELS Jan P.H. van Santen and Xiaochuan.
Abstract Statistical or machine-learning techniques, such as Hidden Markov models and Gaussian mixture models, have dominated the signal processing and.
Acoustic / Lexical Model Derk Geene. Speech recognition  P(words|signal)= P(signal|words) P(words) / P(signal)  P(signal|words): Acoustic model  P(words):
Page 0 of 8 Time Series Classification – phoneme recognition in reconstructed phase space Sanjay Patil Intelligent Electronics Systems Human and Systems.
Speaker Clustering using MDL Principles Kofi Boakye Stat212A Project December 3, 2003.
Presenters: Guy Elazar, Eyal Shindler Supervised By: Pavel Kislov, Inna Rivkin המעבדה למערכות ספרתיות מהירות High speed digital systems laboratory הטכניון.
Communications & Multimedia Signal Processing Analysis of the Effects of Train noise on Recognition Rate using Formants and MFCC Esfandiar Zavarehei Department.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
Philip Jackson and Martin Russell Electronic Electrical and Computer Engineering Models of speech dynamics in a segmental-HMM recognizer using intermediate.
Speech Recognition in Noise
LORIA Irina Illina Dominique Fohr Chania Meeting May 9-10, 2007.
Optimal Adaptation for Statistical Classifiers Xiao Li.
1 Lab Preparation Initial focus on Speaker Verification –Tools –Expertise –Good example “Biometric technologies are automated methods of verifying or recognising.
SNR-Dependent Mixture of PLDA for Noise Robust Speaker Verification
Weighted Median Filters for Complex Array Signal Processing Yinbo Li - Gonzalo R. Arce Department of Electrical and Computer Engineering University of.
Communications & Multimedia Signal Processing Analysis of Effects of Train/Car noise in Formant Track Estimation Qin Yan Department of Electronic and Computer.
Natural Language Understanding
Approximating The Kullback- Leibler Divergence Between Gaussian Mixture Models ICASSP 2007 John R. Hershey and Peder A. Olsen IBM T. J. Watson Research.
Nonlinear Mixture Autoregressive Hidden Markov Models for Speech Recognition S. Srinivasan, T. Ma, D. May, G. Lazarou and J. Picone Department of Electrical.
IIT Bombay ICA 2004, Kyoto, Japan, April 4 - 9, 2004   Introdn HNM Methodology Results Conclusions IntrodnHNM MethodologyResults.
Age and Gender Classification using Modulation Cepstrum Jitendra Ajmera (presented by Christian Müller) Speaker Odyssey 2008.
Time-Domain Methods for Speech Processing 虞台文. Contents Introduction Time-Dependent Processing of Speech Short-Time Energy and Average Magnitude Short-Time.
Page 0 of 14 Dynamical Invariants of an Attractor and potential applications for speech data Saurabh Prasad Intelligent Electronic Systems Human and Systems.
Reconstructed Phase Space (RPS)
Compensating speaker-to-microphone playback system for robust speech recognition So-Young Jeong and Soo-Young Lee Brain Science Research Center and Department.
NONLINEAR DYNAMIC INVARIANTS FOR CONTINUOUS SPEECH RECOGNITION Author: Daniel May Mississippi State University Contact Information: 1255 Louisville St.
Daniel May Department of Electrical and Computer Engineering Mississippi State University Analysis of Correlation Dimension Across Phones.
ICASSP Speech Discrimination Based on Multiscale Spectro–Temporal Modulations Nima Mesgarani, Shihab Shamma, University of Maryland Malcolm Slaney.
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
Nonlinear Dynamical Invariants for Speech Recognition S. Prasad, S. Srinivasan, M. Pannuri, G. Lazarou and J. Picone Department of Electrical and Computer.
Seungchan Lee Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification Experiment.
Temple University Training Acoustic model using Sphinx Train Jaykrishna shukla,Mubin Amehed& cara Santin Department of Electrical and Computer Engineering.
Temple University Training Acoustic Models Using SphinxTrain Jaykrishna Shukla, Mubin Amehed, and Cara Santin Department of Electrical and Computer Engineering.
Seungchan Lee Institute for Signal and Information Processing Department of Electrical and Computer Engineering Research Updates.
Univ logo Fault Diagnosis for Power Transmission Line using Statistical Methods Yuanjun Guo Prof. Kang Li Queen’s University, Belfast UKACC PhD Presentation.
USE OF IMPROVED FEATURE VECTORS IN SPECTRAL SUBTRACTION METHOD Emrah Besci, Semih Ergin, M.Bilginer Gülmezoğlu, Atalay Barkana Osmangazi University, Electrical.
Page 0 of 8 Lyapunov Exponents – Theory and Implementation Sanjay Patil Intelligent Electronics Systems Human and Systems Engineering Center for Advanced.
Multi-Speaker Modeling with Shared Prior Distributions and Model Structures for Bayesian Speech Synthesis Kei Hashimoto, Yoshihiko Nankaku, and Keiichi.
Jun-Won Suh Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Speaker Verification System.
Speech Communication Lab, State University of New York at Binghamton Dimensionality Reduction Methods for HMM Phonetic Recognition Hongbing Hu, Stephen.
Philip Jackson, Boon-Hooi Lo and Martin Russell Electronic Electrical and Computer Engineering Models of speech dynamics for ASR, using intermediate linear.
Speech recognition Home Work 1. Problem 1 Problem 2 Here in this problem, all the phonemes are detected by using phoncode.doc There are several phonetics.
X = Y. Direct variation X 1 X = Y 1 Y 2.
1 Electrical and Computer Engineering Binghamton University, State University of New York Electrical and Computer Engineering Binghamton University, State.
Phone-Level Pronunciation Scoring and Assessment for Interactive Language Learning Speech Communication, 2000 Authors: S. M. Witt, S. J. Young Presenter:
SNR-Invariant PLDA Modeling for Robust Speaker Verification Na Li and Man-Wai Mak Department of Electronic and Information Engineering The Hong Kong Polytechnic.
Tao Ma Department of Electrical and Computer Engineering Mississippi State University Research Update September 1, 2006.
Motorola presents in collaboration with CNEL Introduction  Motivation: The limitation of traditional narrowband transmission channel  Advantage: Phone.
Chapter 15: Classification of Time- Embedded EEG Using Short-Time Principal Component Analysis by Nguyen Duc Thang 5/2009.
Date of download: 5/27/2016 Copyright © 2016 American Medical Association. All rights reserved. From: The Importance of High-Frequency Audibility in the.
Page 0 of 5 Dynamical Invariants of an Attractor and potential applications for speech data Saurabh Prasad Intelligent Electronic Systems Human and Systems.
Electrical Engineering
Spectral and Temporal Modulation Features for Phonetic Recognition Stephen A. Zahorian, Hongbing Hu, Zhengqing Chen, Jiang Wu Department of Electrical.
Quantized Angular Beamforming for mmWave Channels
Lecture Slides Elementary Statistics Twelfth Edition
Sampling Distribution
Sampling Distribution
Underwater Acoustic Channel Estimation and
Date, location, department
Giovanni M. Di Liberto, James A. O’Sullivan, Edmund C. Lalor 
Date, location, department
Lecture Slides Elementary Statistics Twelfth Edition
SNR-Invariant PLDA Modeling for Robust Speaker Verification
NAME OF EVENT Name of Speaker Date, location, department
Sustained Phones (/aa/, /ae/, /eh/, /sh/, /z/, /f/, /m/, /n/)
Speaker Recognition Experiment
Vaal university of technology
Presentation transcript:

Madhulika Pannuri Intelligent Electronic Systems Human and Systems Engineering Department of Electrical and Computer Engineering Correlation Dimension on speech (sustained phones)

Page 1 of 13 Madhulika Pannuri Analysis setup This analysis includes estimates of dimension for varying initial embedding dimension and data size. Variation in dimension estimates with SNR of the signal was studied. The analysis was performed on 3 vowels, 2 nasals and 2 fricatives. Statistical modelling with male model, female model and cross speaker model was performed.

Page 2 of 13 Madhulika Pannuri Varying initial embedding dimension (Vowels)

Page 3 of 13 Madhulika Pannuri Varying the initial embedding dimension (nasals)

Page 4 of 13 Madhulika Pannuri Varing the initial embedding dimension (fricatives)

Page 5 of 13 Madhulika Pannuri Varying data size (Vowels)

Page 6 of 13 Madhulika Pannuri Varying data size (nasals)

Page 7 of 13 Madhulika Pannuri Varying the data size (fricatives)

Page 8 of 13 Madhulika Pannuri Varying SNR (Vowels)

Page 9 of 13 Madhulika Pannuri Varying SNR (Nasals)

Page 10 of 13 Madhulika Pannuri Varying SNR (Fricatives)

Page 11 of 13 Madhulika Pannuri KL Divergence measures for various phonemes

Page 12 of 13 Madhulika Pannuri Statistics of dimension

Page 13 of 13 Madhulika Pannuri Aurora Plans: - Analyze speaker variability for sustained phones. - Add other nonlinear invariants to feature vector and analyze the improvement in performance for AURORA. Mixtures% WER % %