Spatial vs. Blind Approaches for Speaker Separation: Structural Differences and Beyond Julien Bourgeois RIC/AD.

Slides:

Advertisements

Similar presentations

Applications of one-class classification

Advertisements

Cooperative Transmit Power Estimation under Wireless Fading Murtaza Zafer (IBM US), Bongjun Ko (IBM US), Ivan W. Ho (Imperial College, UK) and Chatschik.

Change-Point Detection Techniques for Piecewise Locally Stationary Time Series Michael Last National Institute of Statistical Sciences Talk for Midyear.

Independent Component Analysis

Robust Speech recognition V. Barreaud LORIA. Mismatch Between Training and Testing n mismatch influences scores n causes of mismatch u Speech Variation.

Implicit Speaker Separation DaimlerChrysler Research and Technology.

Speech Enhancement through Noise Reduction By Yating & Kundan.

Authors: David N.C. Tse, Ofer Zeitouni. Presented By Sai C. Chadalapaka.

Microphone Array Post-filter based on Spatially- Correlated Noise Measurements for Distant Speech Recognition Kenichi Kumatani, Disney Research, Pittsburgh.

Optimizing Flocking Controllers using Gradient Descent

Manifold Sparse Beamforming

A gentle introduction to fluid and diffusion limits for queues Presented by: Varun Gupta April 12, 2006.

3/24/2006Lecture notes for Speech Communications Multi-channel speech enhancement Chunjian Li DICOM, Aalborg University.

Subband-based Independent Component Analysis Y. Qi, P.S. Krishnaprasad, and S.A. Shamma ECE Department University of Maryland, College Park.

HIWIRE MEETING CRETE, SEPTEMBER 23-24, 2004 JOSÉ C. SEGURA LUNA GSTC UGR.

Independent Component Analysis (ICA) and Factor Analysis (FA)

Goals of Adaptive Signal Processing Design algorithms that learn from training data Algorithms must have good properties: attain good solutions, simple.

Speech Recognition in Noise

Audio Source Separation And ICA by Mike Davies & Nikolaos Mitianoudis Digital Signal Processing Lab Queen Mary, University of London.

A Multipath Sparse Beamforming Method

Adaptive Signal Processing

Dept. E.E./ESAT-STADIUS, KU Leuven homes.esat.kuleuven.be/~moonen/

Chapter 5ELE Adaptive Signal Processing 1 Least Mean-Square Adaptive Filtering.

Survey on ICA Technical Report, Aapo Hyvärinen, 1999.

1 Patch Complexity, Finite Pixel Correlations and Optimal Denoising Anat Levin, Boaz Nadler, Fredo Durand and Bill Freeman Weizmann Institute, MIT CSAIL.

Introduction to estimation theory Seoul Nat’l Univ.

Introduction to Adaptive Digital Filters Algorithms

Heart Sound Background Noise Removal Haim Appleboim Biomedical Seminar February 2007.

Acoustic impulse response measurement using speech and music signals John Usher Barcelona Media – Innovation Centre | Av. Diagonal, 177, planta 9,

Multiuser Detection (MUD) Combined with array signal processing in current wireless communication environments Wed. 박사 3학기 구 정 회.

ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.

Adaptive Methods for Speaker Separation in Cars DaimlerChrysler Research and Technology Julien Bourgeois

Nico De Clercq Pieter Gijsenbergh.  Problem  Solutions  Single-channel approach  Multichannel approach  Our assignment Overview.

EDGE DETECTION IN COMPUTER VISION SYSTEMS PRESENTATION BY : ATUL CHOPRA JUNE EE-6358 COMPUTER VISION UNIVERSITY OF TEXAS AT ARLINGTON.

Signal Processing Algorithms for Wireless Acoustic Sensor Networks Alexander Bertrand Electrical Engineering Department (ESAT) Katholieke Universiteit.

A note about gradient descent: Consider the function f(x)=(x-x 0 ) 2 Its derivative is: By gradient descent (If f(x) is more complex we usually cannot.

An Introduction to Blind Source Separation Kenny Hild Sept. 19, 2001.

Image Denoising Using Wavelets

Image cryptosystems based on PottsNICA algorithms Meng-Hong Chen Jiann-Ming Wu Department of Applied Mathematics National Donghwa University.

EE565 Advanced Image Processing Copyright Xin Li Image Denoising Theory of linear estimation Spatial domain denoising techniques Conventional Wiener.

LEAST MEAN-SQUARE (LMS) ADAPTIVE FILTERING. Steepest Descent The update rule for SD is where or SD is a deterministic algorithm, in the sense that p and.

CHAPTER 10 Widrow-Hoff Learning Ming-Feng Yeh.

Professors: Eng. Diego Barral Eng. Mariano Llamedo Soria Julian Bruno

3.7 Adaptive filtering Joonas Vanninen Antonio Palomino Alarcos.

Present document contains informations proprietary to France Telecom. Accepting this document means for its recipient he or she recognizes the confidential.

ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 12: Advanced Discriminant Analysis Objectives:

Autoregressive (AR) Spectral Estimation

Dongxu Yang, Meng Cao Supervisor: Prabin.  Review of the Beamformer  Realization of the Beamforming Data Independent Beamforming Statistically Optimum.

Discrete-time Random Signals

NTU & MSRA Ming-Feng Tsai

Independent Component Analysis Independent Component Analysis.

1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.

An Introduction of Independent Component Analysis (ICA) Xiaoling Wang Jan. 28, 2003.

Spatial Covariance Models For Under- Determined Reverberant Audio Source Separation N. Duong, E. Vincent and R. Gribonval METISS project team, IRISA/INRIA,

Siemens Corporate Research Rosca et al. – Generalized Sparse Mixing Model & BSS – ICASSP, Montreal 2004 Generalized Sparse Signal Mixing Model and Application.

Variable Step-Size Adaptive Filters for Acoustic Echo Cancellation Constantin Paleologu Department of Telecommunications

HST.582J/6.555J/16.456J Gari D. Clifford Associate Director, Centre for Doctoral Training, IBME, University of Oxford

08/10/ High Performance Parallel Implementation of Adaptive Beamforming Using Sinusoidal Dithers High Performance Embedded Computing Workshop Peter.

Deep Learning and Deep Reinforcement Learning. Topics 1.Deep learning with convolutional neural networks 2.Learning to play Atari video games with Deep.

Tree and Forest Classification and Regression Tree Bagging of trees Boosting trees Random Forest.

By: Soroosh Mariooryad Advisor: Dr.Sameti 1 BSS & ICA Speech Recognition - Spring 2008.

Speech Enhancement Summer 2009

Approaches of Interest in Blind Source Separation of Speech

LECTURE 11: Advanced Discriminant Analysis

Application of Independent Component Analysis (ICA) to Beam Diagnosis

Instructor :Dr. Aamer Iqbal Bhatti

Independent Factor Analysis

Presenter: Shih-Hsiang(士翔)

Presentation transcript:

Spatial vs. Blind Approaches for Speaker Separation: Structural Differences and Beyond Julien Bourgeois RIC/AD

2 x 1 (t)x 4 (t) Array Processor Recover clean individual speech flows: separate and denoise the sources Microphone Array get mixtures of the sources and noise Individual speech flows s 1 (t ) s 2 (t) Road Noise spatially diffuse Several simultaneous speakers (sources) spatially located Problem Context

3 “Spatial” vs. “Statistical” Techniques Spatial Filte r + - Min Power Filter s Min Dependence Statistical “Cocooning”

4 Spatial technique (Beamforming) s1s1 s2s2 + h2h2 h 1 w2w2 y1y1 x 1 (signal ref) x 2 (noise ref) + + High cross-talk levels : cancellation of the target signal (leakage). Solution : Voice Activity Detector. unknow n weak

5 Blind Source Separation (BSS) s1s1 s2s2 h1h1 h2h2 w1w1 w2w2 y1y1 y2y2 x1x1 x2x2 w 1 and w 2 are jointly optimized such that the outputs are independent. Sources are assumed to be independent. unknow n Dependence measure

6 BSS - Second Order Criteria There are plenty independence measures... We choose a decorrelation criterion. Other separation criteria include Higher Order Statistics, that are difficult to estimate. Second Order Statistics are easier to estimate...

7 BSS - Second Order Criteria Specifically Set (hyperbolas) of decorrelators (not all are separators) We need more info. Non-stationary sources: “non stationary hyperbolas” They intersect at the solution:.... but they do not determine w 1 and w 2 uniquely.

8 BSS - Graphically... D2(t2)D2(t2) D 2 (t 1 ) D 2 (t 1 ) + D 2 (t 2 ) Non-stationary sources generates hyperbolas that intersect at the separation point -(h 1, h 2 ) and at -(1/h 2, 1/h 1 ).

9 Beamforming vs. B SS Weak cross-talk levels or Voice Activity Detector. Leakage problem. 1D Search. Independence prior on (s 1,s 2 ) Permutation ambiguity. 2D Search. Asymptotic performances of BSS are more “robust” than Beamforming.

10 Adaptive Behavior: Comparison Framework s 1 = 0 s2s2 h2h2 w2w2 y1y1 y2y2 x1x1 x2x2 Comparison framework: only one source s 2 stationary Gaussian s 1 = h 1 = 0 (no leakage) Avoid structural differences between the two criterions. Both criteria are minimized with a STOCHASTIC gradient descent. Q: How well estimated is this gradient with finite length signals ? ++

11 Estimation Error on the Gradient At the starting point w 2 = 0, numerical evaluation of the variance of the estimation error. BSS converges more slowly because its gradient is more “random”. In noisy condition, BSS does not bring any gain if the cross-talk is below a certain threshold. This threshold is smaller for MV (beamforming) BSS Beamforming

12 Conclusion Beamforming is based on power minimization principle. In practice: weak cross-talk levels or needs a Voice Activity Detector (VAD) Asymptotic performances depends on the quality of the VAD. Robust stochastic behavior. Blind Source Separation based on independence of the sources. Asymptotic performances: exact separation. Stochastic behavior: needs a longer signals to estimate the gradient. Moreover sources on a finite (short) time scale are not exactly independent. Both methods cannot reduce diffuse background noise.