Missing feature theory

Slides:



Advertisements
Similar presentations
Current HOARSE related activities 6-7 Sept …include the following (+ more) Novel architectures 1.All-combinations HMM/ANN 2.Tandem HMM/ANN hybrid.
Advertisements

Robust Speech recognition V. Barreaud LORIA. Mismatch Between Training and Testing n mismatch influences scores n causes of mismatch u Speech Variation.
Histogram-based Quantization for Distributed / Robust Speech Recognition Chia-yu Wan, Lin-shan Lee College of EECS, National Taiwan University, R. O. C.
Distribution-Based Feature Normalization for Robust Speech Recognition Leveraging Context and Dynamics Cues Yu-Chen Kao and Berlin Chen Presenter : 張庭豪.
Second order cone programming approaches for handing missing and uncertain data P. K. Shivaswamy, C. Bhattacharyya and A. J. Smola Discussion led by Qi.
December 2006 Cairo University Faculty of Computers and Information HMM Based Speech Synthesis Presented by Ossama Abdel-Hamid Mohamed.
Hidden Markov Model based 2D Shape Classification Ninad Thakoor 1 and Jean Gao 2 1 Electrical Engineering, University of Texas at Arlington, TX-76013,
Model-Based Fusion of Bone and Air Sensors for Speech Enhancement and Robust Speech Recognition John Hershey, Trausti Kristjansson, Zhengyou Zhang, Alex.
MODULATION SPECTRUM EQUALIZATION FOR ROBUST SPEECH RECOGNITION Source: Automatic Speech Recognition & Understanding, ASRU. IEEE Workshop on Author.
HIWIRE MEETING CRETE, SEPTEMBER 23-24, 2004 JOSÉ C. SEGURA LUNA GSTC UGR.
Speech Recognition in Noise
LORIA Irina Illina Dominique Fohr Chania Meeting May 9-10, 2007.
1 Speech Enhancement Wiener Filtering: A linear estimation of clean signal from the noisy signal Using MMSE criterion.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
Robust Automatic Speech Recognition by Transforming Binary Uncertainties DeLiang Wang (Jointly with Dr. Soundar Srinivasan) Oticon A/S, Denmark (On leave.
Major Tasks in Data Preprocessing(Ref Chap 3) By Prof. Muhammad Amir Alam.
IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING MARCH 2010 Lan-Ying Yeh
Normalization of the Speech Modulation Spectra for Robust Speech Recognition Xiong Xiao, Eng Siong Chng, and Haizhou Li Wen-Yi Chu Department of Computer.
HCSNet December 2005 Auditory Scene Analysis and Automatic Speech Recognition in Adverse Conditions Phil Green Speech and Hearing Research Group, Department.
HMM-BASED PSEUDO-CLEAN SPEECH SYNTHESIS FOR SPLICE ALGORITHM Jun Du, Yu Hu, Li-Rong Dai, Ren-Hua Wang Wen-Yi Chu Department of Computer Science & Information.
A VOICE ACTIVITY DETECTOR USING THE CHI-SQUARE TEST
1 Robust HMM classification schemes for speaker recognition using integral decode Marie Roch Florida International University.
Cepstral Vector Normalization based On Stereo Data for Robust Speech Recognition Presenter: Shih-Hsiang Lin Luis Buera, Eduardo Lleida, Antonio Miguel,
International Conference on Intelligent and Advanced Systems 2007 Chee-Ming Ting Sh-Hussain Salleh Tian-Swee Tan A. K. Ariff. Jain-De,Lee.
Reporter: Shih-Hsiang( 士翔 ). Introduction Speech signal carries information from many sources –Not all information is relevant or important for speech.
REVISED CONTEXTUAL LRT FOR VOICE ACTIVITY DETECTION Javier Ram’ırez, Jos’e C. Segura and J.M. G’orriz Dept. of Signal Theory Networking and Communications.
Survey of ICASSP 2013 section: feature for robust automatic speech recognition Repoter: Yi-Ting Wang 2013/06/19.
LOG-ENERGY DYNAMIC RANGE NORMALIZATON FOR ROBUST SPEECH RECOGNITION Weizhong Zhu and Douglas O’Shaughnessy INRS-EMT, University of Quebec Montreal, Quebec,
Informing Multisource Decoding for Robust Speech Recognition Ning Ma and Phil Green Speech and Hearing Research Group The University of Sheffield 22/04/2005.
Hyperparameter Estimation for Speech Recognition Based on Variational Bayesian Approach Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee and Keiichi.
Authors: Sriram Ganapathy, Samuel Thomas, and Hynek Hermansky Temporal envelope compensation for robust phoneme recognition using modulation spectrum.
Yi-zhang Cai, Jeih-weih Hung 2012/08/17 報告者:汪逸婷 1.
Robust Entropy-based Endpoint Detection for Speech Recognition in Noisy Environments 張智星
Multi-Speaker Modeling with Shared Prior Distributions and Model Structures for Bayesian Speech Synthesis Kei Hashimoto, Yoshihiko Nankaku, and Keiichi.
Voice Activity Detection based on OptimallyWeighted Combination of Multiple Features Yusuke Kida and Tatsuya Kawahara School of Informatics, Kyoto University,
ICASSP 2006 Robustness Techniques Survey ShihHsiang 2006.
ICASSP 2007 Robustness Techniques Survey Presenter: Shih-Hsiang Lin.
RCC-Mean Subtraction Robust Feature and Compare Various Feature based Methods for Robust Speech Recognition in presence of Telephone Noise Amin Fazel Sharif.
Sampling Theory and Some Important Sampling Distributions.
6. Population Codes Presented by Rhee, Je-Keun © 2008, SNU Biointelligence Lab,
A. R. Jayan, P. C. Pandey, EE Dept., IIT Bombay 1 Abstract Perception of speech under adverse listening conditions may be improved by processing it to.
Statistical Models for Automatic Speech Recognition Lukáš Burget.
Machine Learning CUNY Graduate Center Lecture 6: Linear Regression II.
Survey of Robust Speech Techniques in ICASSP 2009 Shih-Hsiang Lin ( 林士翔 ) 1Survey of Robustness Techniques in ICASSP 2009.
Exploring the Use of Speech Features and Their Corresponding Distribution Characteristics for Robust Speech Recognition Shih-Hsiang Lin, Berlin Chen, Yao-Ming.
Feature Transformation and Normalization Present by Howard Reference : Springer Handbook of Speech Processing, 3.3 Environment Robustness (J. Droppo, A.
Voice Activity Detection Based on Sequential Gaussian Mixture Model Zhan Shen, Jianguo Wei, Wenhuan Lu, Jianwu Dang Tianjin Key Laboratory of Cognitive.
1 LOW-RESOURCE NOISE-ROBUST FEATURE POST-PROCESSING ON AURORA 2.0 Chia-Ping Chen, Jeff Bilmes and Katrin Kirchhoff SSLI Lab Department of Electrical Engineering.
Confidence Intervals Cont.
Speech Enhancement Summer 2009
Fundamentals of estimation and detection in signals and images
Statistical Models for Automatic Speech Recognition
Special Topics In Scientific Computing
Dynamical Statistical Shape Priors for Level Set Based Tracking
Two-Stage Mel-Warped Wiener Filter SNR-Dependent Waveform Processing
Statistical Models for Automatic Speech Recognition
朝陽科技大學 資訊工程系 謝政勳 Application of GM(1,1) Model to Speech Enhancement and Voice Activity Detection 朝陽科技大學 資訊工程系 謝政勳
A Tutorial on Bayesian Speech Feature Enhancement
DeLiang Wang (Jointly with Dr. Soundar Srinivasan) Oticon A/S, Denmark
EM for Inference in MV Data
LECTURE 15: REESTIMATION, EM AND MIXTURES
Finding Periodic Discrete Events in Noisy Streams
EM for Inference in MV Data
Wiener Filtering: A linear estimation of clean signal from the noisy signal Using MMSE criterion.
Improved Spread Spectrum: A New Modulation Technique for Robust Watermarking IEEE Trans. On Signal Processing, April 2003 Multimedia Security.
Speech / Non-speech Detection
Visual Recognition of American Sign Language Using Hidden Markov Models 문현구 문현구.
Presented by Chen-Wei Liu
Presenter: Shih-Hsiang(士翔)
Combination of Feature and Channel Compensation (1/2)
Presentation transcript:

Missing feature theory Statistical estimation of unreliable features for robust speech recognition 2) Missing feature theory and probabilistic estimation of clean speech components for robust speech recognition 3) State based imputation of missing data for robust speech recognition and speech enhancement 4) Missing data theory,spectral subtraction and signal-to-noise estimation for robust ASR: an integrated study

Parameters used in speech recognition can be Introduction Parameters used in speech recognition can be divided in two subsets 1) reliable or present parameters 2) unreliable or missing parameters

Introduction There are 2 problems in the application of missing data in robust ASR 1) identification of the reliable regions 2) recognition techniques that can deal with incomplete data

Detection of unreliable feature Method: (1) negative energy criterion (2) SNR criterion or

Detection of unreliable feature (3) statistical approach : noise is considered as normally distributed

Noise estimation method in [4] simple estimation weighted average estimation C) second order method D) Histogram method

Accuracy for the three detection methods

Recognition with incomplete data Method (1) Marginalization : unreliable data are ignored for a single state model ,the probability to emit vector is

Marginalization

Marginalization =1 bounded marginalization

Marginalization In Philippe’s another paper [2] ,the clean parameters are represented as pdfs and missing parameters are considered as being uniformly if 0<x<|Y(w)| otherwise

Recognition with incomplete data Method (2) GMM based Imputation : unreliable data are estimated advantages of the approach are that can be followed by conventional techniques like cepstral,RASTA In the estimation process,the GMM means are used to replace the unreliable features the means and variances of GMM are compensated with the additive noise,as in PMC

Imputation using inverse log-normal approximation

Imputation transformed into log-spectral domain :

Imputation using the noisy GMM,the weighting factor associated with each distribution is computed as follows:

Imputation Finally,the reliable data are enhanced using a spectral subtraction and the unreliable data are replaced by a weighted sum of the GMM means

features spetra

Discussion Why using GMM in this paper? A HMM based data imputation has been proposed in [3], when using time-dependent statistical models,if an error in the decoding sequence occurs,it can influence the recognition in the second feature domain therefore , GMM instead of HMM,but sufficient and computationally efficient for data imputation

Experiments results

Experiments results

Experiments results

Experiments results

Experiments results

Experiments results

Experiments results

Experiments results