Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application.

Slides:



Advertisements
Similar presentations
1 12. Principles of Parameter Estimation The purpose of this lecture is to illustrate the usefulness of the various concepts introduced and studied in.
Advertisements

Digital Audio Processing Lab, Dept. of EEThursday, June 17 th Data-Adaptive Source Separation for Audio Spatialization Supervisors: Prof. Preeti Rao and.
Dimension reduction (1)
Microphone Array Post-filter based on Spatially- Correlated Noise Measurements for Distant Speech Recognition Kenichi Kumatani, Disney Research, Pittsburgh.
Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
2008 SIAM Conference on Imaging Science July 7, 2008 Jason A. Palmer
Maximum likelihood separation of spatially autocorrelated images using a Markov model Shahram Hosseini 1, Rima Guidara 1, Yannick Deville 1 and Christian.
Bayesian Robust Principal Component Analysis Presenter: Raghu Ranganathan ECE / CMR Tennessee Technological University January 21, 2011 Reading Group (Xinghao.
Visual Recognition Tutorial
Paper Discussion: “Simultaneous Localization and Environmental Mapping with a Sensor Network”, Marinakis et. al. ICRA 2011.
Parameter Estimation: Maximum Likelihood Estimation Chapter 3 (Duda et al.) – Sections CS479/679 Pattern Recognition Dr. George Bebis.
Independent Component Analysis (ICA)
Multichannel Phonocardiogram Source Separation PGBIOMED University of Reading 20 th July 2005 Conor Fearon and Scott Rickard University College Dublin.
Dimensional reduction, PCA
Multiple Pitch Tracking for Blind Source Separation Using a Single Microphone Joseph Tabrikian Dept. of Electrical and Computer Engineering Ben-Gurion.
A gentle introduction to Gaussian distribution. Review Random variable Coin flip experiment X = 0X = 1 X: Random variable.
3/24/2006Lecture notes for Speech Communications Multi-channel speech enhancement Chunjian Li DICOM, Aalborg University.
Subband-based Independent Component Analysis Y. Qi, P.S. Krishnaprasad, and S.A. Shamma ECE Department University of Maryland, College Park.
Independent Component Analysis (ICA) and Factor Analysis (FA)
Audio Source Separation And ICA by Mike Davies & Nikolaos Mitianoudis Digital Signal Processing Lab Queen Mary, University of London.
© 2005, it - instituto de telecomunicações. Todos os direitos reservados. Gerhard Maierbacher Scalable Coding Solutions for Wireless Sensor Networks IT.
HIWIRE Progress Report – July 2006 Technical University of Crete Speech Processing and Dialog Systems Group Presenter: Alex Potamianos Technical University.
1 Blind Separation of Audio Mixtures Using Direct Estimation of Delays Arie Yeredor Dept. of Elect. Eng. – Systems School of Electrical Engineering Tel-Aviv.
Bayesian Learning, cont’d. Administrivia Homework 1 returned today (details in a second) Reading 2 assigned today S. Thrun, Learning occupancy grids with.
Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
Binary Variables (1) Coin flipping: heads=1, tails=0 Bernoulli Distribution.
1 Linear Methods for Classification Lecture Notes for CMPUT 466/551 Nilanjan Ray.
A VOICE ACTIVITY DETECTOR USING THE CHI-SQUARE TEST
Machine Learning CUNY Graduate Center Lecture 3: Linear Regression.
1 HMM - Part 2 Review of the last lecture The EM algorithm Continuous density HMM.
November 1, 2012 Presented by Marwan M. Alkhweldi Co-authors Natalia A. Schmid and Matthew C. Valenti Distributed Estimation of a Parametric Field Using.
© 2009 IBM Corporation 1 Improving Consolidation of Virtual Machines with Risk-aware Bandwidth Oversubscription in Compute Clouds Amir Epstein Joint work.
“A fast method for Underdetermined Sparse Component Analysis (SCA) based on Iterative Detection- Estimation (IDE)” Arash Ali-Amini 1 Massoud BABAIE-ZADEH.
Blind Separation of Speech Mixtures Vaninirappuputhenpurayil Gopalan REJU School of Electrical and Electronic Engineering Nanyang Technological University.
May 3 rd, 2010 Update Outline Monday, May 3 rd 2  Audio spatialization  Performance evaluation (source separation)  Source separation  System overview.
SUPA Advanced Data Analysis Course, Jan 6th – 7th 2009 Advanced Data Analysis for the Physical Sciences Dr Martin Hendry Dept of Physics and Astronomy.
Progress in identification of damping: Energy-based method with incomplete and noisy data Marco Prandina University of Liverpool.
ECE 8443 – Pattern Recognition LECTURE 10: HETEROSCEDASTIC LINEAR DISCRIMINANT ANALYSIS AND INDEPENDENT COMPONENT ANALYSIS Objectives: Generalization of.
Parameter estimation. 2D homography Given a set of (x i,x i ’), compute H (x i ’=Hx i ) 3D to 2D camera projection Given a set of (X i,x i ), compute.
PROBABILITY AND STATISTICS FOR ENGINEERING Hossein Sameti Department of Computer Engineering Sharif University of Technology Principles of Parameter Estimation.
CHAPTER 5 SIGNAL SPACE ANALYSIS
HMM - Part 2 The EM algorithm Continuous density HMM.
Full-rank Gaussian modeling of convolutive audio mixtures applied to source separation Ngoc Q. K. Duong, Supervisor: R. Gribonval and E. Vincent METISS.
Endmember Extraction from Highly Mixed Data Using MVC-NMF Lidan Miao AICIP Group Meeting Apr. 6, 2006 Lidan Miao AICIP Group Meeting Apr. 6, 2006.
Review of Spectral Unmixing for Hyperspectral Imagery Lidan Miao Sept. 29, 2005.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 12: Advanced Discriminant Analysis Objectives:
Independent Component Analysis Independent Component Analysis.
Introduction to Independent Component Analysis Math 285 project Fall 2015 Jingmei Lu Xixi Lu 12/10/2015.
An Introduction of Independent Component Analysis (ICA) Xiaoling Wang Jan. 28, 2003.
MINUET Musical Interference Unmixing Estimation Technique Scott Rickard, Conor Fearon Department of Electronic & Electrical Engineering University College.
Comparative Analysis of Spectral Unmixing Algorithms Lidan Miao Nov. 10, 2005.
Spatial Covariance Models For Under- Determined Reverberant Audio Source Separation N. Duong, E. Vincent and R. Gribonval METISS project team, IRISA/INRIA,
Spatial vs. Blind Approaches for Speaker Separation: Structural Differences and Beyond Julien Bourgeois RIC/AD.
HST.582J/6.555J/16.456J Gari D. Clifford Associate Director, Centre for Doctoral Training, IBME, University of Oxford
Dimension reduction (1) Overview PCA Factor Analysis Projection persuit ICA.
Learning to Align: a Statistical Approach
CS479/679 Pattern Recognition Dr. George Bebis
Sparsity Based Poisson Denoising and Inpainting
Approaches of Interest in Blind Source Separation of Speech
12. Principles of Parameter Estimation
LECTURE 11: Advanced Discriminant Analysis
Brain Electrophysiological Signal Processing: Preprocessing
Lecture 14 PCA, pPCA, ICA.
Unsupervised Learning II: Soft Clustering with Gaussian Mixture Models
Generally Discriminant Analysis
Independent Factor Analysis
Biointelligence Laboratory, Seoul National University
12. Principles of Parameter Estimation
A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models Jeff A. Bilmes International.
A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models Jeff A. Bilmes International.
Presentation transcript:

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application to Noisy Blind Source Separation Justinian Rosca Christian Borss # Radu Balan Siemens Corporate Research, Princeton, USA # Presently: University of Brawnschweig

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 ICA/BSS Scenario Signal Processor x1x1 x2x2 xDxD S 1, S 2,…,S L s1s1 s2s2 s L-1 sLsL n L sources D microphones

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Motivation Solve ICA problem in realistic scenarios In the presence of noise. Is this really feasible? When A is “fat” (degenerate) Successful DUET/Time-frequency masking - approach and implementation Can we do better if we relax the DUET assumption about number of sources “active” at any time- frequency point? [Rickard et al. 2000,2001]

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Sparseness in TF DUET assumption: the maximum number of sources active at any time - frequency point in a mixture of signals is one

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Example Voice Signal and TF Representation Siemens Corporate ResearchJ.Rosca et al. – Scalable BSS under Noise – DAGA, Aachen 2003

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Sparseness in TF Sources hop from one set of frequencies to another over time, with no collisions (at most one source active at any time-freq. point) s1s1 s2s2 s3s3

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Generalized Sparseness in TF Sources hop from one set of frequencies to another over time, with collisions (at most N sources active at any time-freq. point) s1s1 s2s2 s3s3 N=2, L=3

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 The Independence Assumption Assume the TF coefficient S(k,  ) is modeled as a product of a Bernoulli (0/1) r.v., V, and a continuous r.v. G: The p.d.f. of S becomes: For L independent signals the joint source pdf becomes: W-Disjoint Orthogonality (DUET): q very small→ retain first two terms; at most one source is active at any time-freq. point Generalized W-Disj.Orth.: q very small→ retain first N+1 terms; at most N sources are active at any time-freq. point

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Signal Model (1) Assumptions: –L sources, D sensors –Far-field –Direct-path –Noises iid, Gaussian (0,σ 2 ) 1 2 … D

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Signal Model (2) –Mixing model: –Source sparseness in TF –Let those be:

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Example Let N=2, two sources active at any time-freq. point

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 BSS Problem Given measurements {x(t)} 1<=t<=T, D sensors Determine estimate of parameters  : Note: L>D, degenerate BSS problem Mixing parameters Mapping and Source signals

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Approach: Two Steps 1.Estimate mixing parameters, e.g. using the stronger constraint of W-disjoint orthogonality 2.Estimate the source signals under the generalized W-disjoint orthogonality assumption

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Solution Sketch (Ad-Hoc) Employ principle of coherence (e.g. N=2) –Given a pair of sources S a and S b active at some time-freq. point, then what we know what we should measure at all microphones pairs! –S a and S b are the true ones if they result in minimum variance across all microphone pairs, i.e. coherent measurements –Note: For N=2 and L=4 there are 6 pair of sources to be tested! (1,2),(1,3),(1,4),(2,3),(2,4),(3,4) i j

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Solution Sketch (ML-1) Maximize likelihood function L( ,R)=p(X| ,R)

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Solution Sketch (ML-2) max L( ,R), after taking log

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Solution Sketch (ML-3) After substituting R:

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Interpretation of Solution Criterion: –projection of X onto the span of columns of M Solution  (“coherent” measurements) –N-dim subspace of C D closest to X among all L-choose-N subspaces spanned by different combinations of N columns of the matrix M Existence iff N≤D-1 CDCD

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Experimental Results (1) Algorithm applied to realistic synthetic mixtures From anechoic, low echoic, echoic to strongly echoic 16kHz data, 256 sample window, 50% overlap, coherent noise, SIR (-5dB,10dB), 30 gradient steps/iteration (Step 2), 5 iterations Evaluation: SIRGain, SegmentalSNR, Distortion

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Example Sources L=4, Mics D=2, N=2 Mixing Sources Estimates

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Discussion: L=4 sources, D=2 mics is a case too simple? In some simulations, the N=2 assumption helps Conjecture: approach is useful when N is a small fraction of L

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Conclusion Contribution: ML approach to noisy BSS problem under generalized sparseness assumptions, addressing degenerate case D<L Estimation problem can be addressed using sparse decomposition techniques: progress is needed

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Thank you! Real speech separation demo for those interested after session!

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Outline Generalized sparseness assumption Signal model and assumptions BSS problem definition Solution sketch: Ad-hoc and ML estimators Geometrical interpretation of solution Experimental results Conclusion