3/24/2006Lecture notes for Speech Communications Multi-channel speech enhancement Chunjian Li DICOM, Aalborg University.

Slides:



Advertisements
Similar presentations
Speech Enhancement through Noise Reduction By Yating & Kundan.
Advertisements

Adaptive Filters S.B.Rabet In the Name of GOD Class Presentation For The Course : Custom Implementation of DSP Systems University of Tehran 2010 Pages.
Microphone Array Post-filter based on Spatially- Correlated Noise Measurements for Distant Speech Recognition Kenichi Kumatani, Disney Research, Pittsburgh.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: The FIR Adaptive Filter The LMS Adaptive Filter Stability and Convergence.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Newton’s Method Application to LMS Recursive Least Squares Exponentially-Weighted.
Blind Source Separation of Acoustic Signals Based on Multistage Independent Component Analysis Hiroshi SARUWATARI, Tsuyoki NISHIKAWA, and Kiyohiro SHIKANO.
1/44 1. ZAHRA NAGHSH JULY 2009 BEAM-FORMING 2/44 2.
Independent Component Analysis (ICA)
Application of Statistical Techniques to Neural Data Analysis Aniket Kaloti 03/07/2006.
Independent Component Analysis (ICA) and Factor Analysis (FA)
Goals of Adaptive Signal Processing Design algorithms that learn from training data Algorithms must have good properties: attain good solutions, simple.
Project Presentation: March 9, 2006
Audio Source Separation And ICA by Mike Davies & Nikolaos Mitianoudis Digital Signal Processing Lab Queen Mary, University of London.
Noise Supression Techniques for Speech Enhancement Using Adaptive Filtering Derek Shiell 03/09/2006 ECE 463: Project Presentation Professor Michael Honig.
Laurent Itti: CS599 – Computational Architectures in Biological Vision, USC Lecture 7: Coding and Representation 1 Computational Architectures in.
1 Blind Separation of Audio Mixtures Using Direct Estimation of Delays Arie Yeredor Dept. of Elect. Eng. – Systems School of Electrical Engineering Tel-Aviv.
Adaptive Signal Processing
Normalised Least Mean-Square Adaptive Filtering
Dept. E.E./ESAT-STADIUS, KU Leuven homes.esat.kuleuven.be/~moonen/
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Adaptive Noise Cancellation ANC W/O External Reference Adaptive Line Enhancement.
Survey on ICA Technical Report, Aapo Hyvärinen, 1999.
Acoustic Echo Cancellation Using Digital Signal Processing. Presented by :- A.Manigandan( ) B.Naveen Raj ( ) Parikshit Dujari ( )
For 3-G Systems Tara Larzelere EE 497A Semester Project.
Equalization in a wideband TDMA system
Algorithm Taxonomy Thus far we have focused on:
Introduction to Adaptive Digital Filters Algorithms
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Introduction SNR Gain Patterns Beam Steering Shading Resources: Wiki:
1 Techniques to control noise and fading l Noise and fading are the primary sources of distortion in communication channels l Techniques to reduce noise.
By Asst.Prof.Dr.Thamer M.Jamel Department of Electrical Engineering University of Technology Baghdad – Iraq.
Eigenstructure Methods for Noise Covariance Estimation Olawoye Oyeyele AICIP Group Presentation April 29th, 2003.
Nico De Clercq Pieter Gijsenbergh Noise reduction in hearing aids: Generalised Sidelobe Canceller.
Particle Filtering (Sequential Monte Carlo)
Heart Sound Background Noise Removal Haim Appleboim Biomedical Seminar February 2007.
Shifted Independent Component Analysis Morten Mørup, Kristoffer Hougaard Madsen and Lars Kai Hansen The shift problem Informatics and Mathematical Modelling.
Independent Component Analysis Zhen Wei, Li Jin, Yuxue Jin Department of Statistics Stanford University An Introduction.
2010/12/11 Frequency Domain Blind Source Separation Based Noise Suppression to Hearing Aids (Part 1) Presenter: Cian-Bei Hong Advisor: Dr. Yeou-Jiunn Chen.
Multiuser Detection (MUD) Combined with array signal processing in current wireless communication environments Wed. 박사 3학기 구 정 회.
Blind Separation of Speech Mixtures Vaninirappuputhenpurayil Gopalan REJU School of Electrical and Electronic Engineering Nanyang Technological University.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Signal and Noise Models SNIR Maximization Least-Squares Minimization MMSE.
CHAPTER 4 Adaptive Tapped-delay-line Filters Using the Least Squares Adaptive Filtering.
Nico De Clercq Pieter Gijsenbergh.  Problem  Solutions  Single-channel approach  Multichannel approach  Our assignment Overview.
Unit-V DSP APPLICATIONS. UNIT V -SYLLABUS DSP APPLICATIONS Multirate signal processing: Decimation Interpolation Sampling rate conversion by a rational.
ECE 8443 – Pattern Recognition LECTURE 10: HETEROSCEDASTIC LINEAR DISCRIMINANT ANALYSIS AND INDEPENDENT COMPONENT ANALYSIS Objectives: Generalization of.
An Introduction to Blind Source Separation Kenny Hild Sept. 19, 2001.
CCN COMPLEX COMPUTING NETWORKS1 This research has been supported in part by European Commission FP6 IYTE-Wireless Project (Contract No: )
2010/12/11 Frequency Domain Blind Source Separation Based Noise Suppression to Hearing Aids (Part 2) Presenter: Cian-Bei Hong Advisor: Dr. Yeou-Jiunn Chen.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Derivation Computational Simplifications Stability Lattice Structures.
Full-rank Gaussian modeling of convolutive audio mixtures applied to source separation Ngoc Q. K. Duong, Supervisor: R. Gribonval and E. Vincent METISS.
Professors: Eng. Diego Barral Eng. Mariano Llamedo Soria Julian Bruno
3.7 Adaptive filtering Joonas Vanninen Antonio Palomino Alarcos.
Equalization Techniques By: Mohamed Osman Ahmed Mahgoub.
PCA vs ICA vs LDA. How to represent images? Why representation methods are needed?? –Curse of dimensionality – width x height x channels –Noise reduction.
Smart antenna Smart antennas use an array of low gain antenna elements which are connected by a combining network. Smart antennas provide enhanced coverage.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 12: Advanced Discriminant Analysis Objectives:
Dongxu Yang, Meng Cao Supervisor: Prabin.  Review of the Beamformer  Realization of the Beamforming Data Independent Beamforming Statistically Optimum.
Independent Component Analysis Independent Component Analysis.
Yi Jiang MS Thesis 1 Yi Jiang Dept. Of Electrical and Computer Engineering University of Florida, Gainesville, FL 32611, USA Array Signal Processing in.
Introduction to Independent Component Analysis Math 285 project Fall 2015 Jingmei Lu Xixi Lu 12/10/2015.
An Introduction of Independent Component Analysis (ICA) Xiaoling Wang Jan. 28, 2003.
Spatial vs. Blind Approaches for Speaker Separation: Structural Differences and Beyond Julien Bourgeois RIC/AD.
ARENA08 Roma June 2008 Francesco Simeone (Francesco Simeone INFN Roma) Beam-forming and matched filter techniques.
Part 3: Estimation of Parameters. Estimation of Parameters Most of the time, we have random samples but not the densities given. If the parametric form.
Estimation Techniques for High Resolution and Multi-Dimensional Array Signal Processing EMS Group – Fh IIS and TU IL Electronic Measurements and Signal.
LECTURE 11: Advanced Discriminant Analysis
Techniques to control noise and fading
Brain Electrophysiological Signal Processing: Preprocessing
Equalization in a wideband TDMA system
PCA vs ICA vs LDA.
A Fast Fixed-Point Algorithm for Independent Component Analysis
Independent Factor Analysis
Presentation transcript:

3/24/2006Lecture notes for Speech Communications Multi-channel speech enhancement Chunjian Li DICOM, Aalborg University

3/24/2006Lecture notes for Speech Communications Methods & applied fields Dual-channel spectral subtraction - noise reduction in speech Adaptive Noise Canceling (ANC) - noise reduction and interference elimination - echo canceling - adaptive beamforming Blind Source Separation (BSS) Blind Source Extraction (BSE)

3/24/2006Lecture notes for Speech Communications Dual-channel spectral subtraction - Hanson and Wong, ICASSP84.

3/24/2006Lecture notes for Speech Communications The method The exponent is chosen to be a=1 based on listening test and spectral distortion measure. The noisy phase is used in the reconstruction of signal. The estimate of noise spectrum is either obtained from a reference channel or estimated from the noisy signal assuming the SNR is very low (about -12 dB).

3/24/2006Lecture notes for Speech Communications Revisiting the phase issue To see the dependency of magnitude on phase: where is the phase difference between the two signals. It is clear that the estimate of signal magnitude spectrum depends on both the SNR and the phase difference. But phase is not estimated in this method because the enhanced quality is acceptable.

3/24/2006Lecture notes for Speech Communications Comments The simplest (and a bit unrealistic) form of exploiting multi-channel. Aims at improving intelligibility. Significant intel. gains only at very low SNR (-12dB). Unvoiced speech is not processed.

3/24/2006Lecture notes for Speech Communications Adaptive Noise Canceling First proposed by Widrow et al. [1] in It is adaptive because of the use of adaptive filter such as the LMS algorithm. The objective: estimate the noise in the primary channel using the noise recorded in the secondary channel, and subtract the estimate from the primary channel recordings. [ 1] B. Widrow, J. R. Grover, J. M. McCool et al. ”Adaptive noise canceling: Principles and applications,” Proceedings of the IEEE, vol.63, pp , Dec

3/24/2006Lecture notes for Speech Communications Signal model

3/24/2006Lecture notes for Speech Communications Signal estimation The optimization criterion: The estimated signal:

3/24/2006Lecture notes for Speech Communications Signal estimation The minimization can be solved by applying the orthogonality principle: This can be solved in the same way as solving the normal equations. But it is usually solved by sequential algorithms such as the LMS algorithm. The advantages of the LMS are: -No matrix inversion, low complexity -Fully adaptive, suitable to non-stationary signal and noise -Low delay

3/24/2006Lecture notes for Speech Communications LMS -It is a sequential, gradient descent minimization method, - The estimate of the weights is updated each time a new sample is available: Where the element of the gradient vector:

3/24/2006Lecture notes for Speech Communications LMS The most important trick is, in this sequential implementation, to approximate the correlation matrix and cross-correlation vector by The instantaneous estimates. Or, in matrix form:

3/24/2006Lecture notes for Speech Communications LMS The step size is often chosen empirically, as long as the following condition is satisfied for stability reason: whereis the largest eigenvalue of the matrix The larger the step-size, the faster the convergence, but also the larger estimation variance.

3/24/2006Lecture notes for Speech Communications Comments The LMS belongs to the stochastic gradient algorithm. The algorithm is based on the instantaneous estimates of correlation function, which are of high variance. But the algorithm works well because of its iterative nature, which averages the estimate over time. Low complexity: O ( M ), where M is the filter order. Although the derivation is based on WSS assumption, the algorithm is applicable to stationary signals, due to the sequential implementation.

3/24/2006Lecture notes for Speech Communications Implementation issues of ANC Microphones must be sufficiently separated in space or contain acoustic barriers. Typically 1500 taps are needed => large misadjustment => pronounced echo => must use small step-size => long convergence time. Different delays from the sources to the two microphones must be taken care of. Frequency domain LMS can reduces the number of taps needed. ANC can be generalizes to a multi-channel system, which can be seen as a generalized beamforming system.

3/24/2006Lecture notes for Speech Communications Eliminating cross-talk Cross-talk: If the signal is also captured in the reference channel, the ANC will suppress part of the signal. Cross-talk can be reduced by employing two adaptive filter within a feedback loop.

3/24/2006Lecture notes for Speech Communications Beamforming Compared to ANC, beamforming is truly a spatial filtering technique. First, locate the source direction; then form a beam directing to the source. The source location problem is a analogy of the spectral analysis problem, with the frequency domain replaced by the spatial domain.

3/24/2006Lecture notes for Speech Communications A simple array model Planar wave Uniform linear array Sensors responses are identical and LTI Sensors are omni directional One parameter to estimate: DOA

3/24/2006Lecture notes for Speech Communications ULA

3/24/2006Lecture notes for Speech Communications ULA The signal model: where the array transfer vector : Where is the delay with reference to the first sensor, and is the center frequency of the signal. By defining the spatial frequency as: we can write the array transfer vector as:

3/24/2006Lecture notes for Speech Communications ULA A direct analogy between frequency analysis and spatial analysis using the spatial frequency. To avoid spatial aliasing: All frequency analysis techniques can be applied to the DOA estimation problem.

3/24/2006Lecture notes for Speech Communications Spatial filtering Analogy between spatial filter and temporal filter

3/24/2006Lecture notes for Speech Communications Spatial filtering The spatially filtered signal: Objective: find the filter that passes undistorted the signals with a given DOA; and attenuates all the other DOAs as much as possible.

3/24/2006Lecture notes for Speech Communications The beam pattern

3/24/2006Lecture notes for Speech Communications Restrictions to beamforming Very sensitive to array geometry, need good calibration Has only directivity, no selectivity in range or other location parameters Frequency response is not flat Ambient noises are assumed to be spatially white Beam width (or selectivity) depends on the size of the array Spatial aliasing problem

3/24/2006Lecture notes for Speech Communications Blind Source Separation (BSS) MIMO systems Spatial processing techniques with no knowledge of array geometry Invisible beam Arbitrarily high spatial resolution Do not depend on signal frequency Spatial noise is not assumed to be white Not a spatial sampling system

3/24/2006Lecture notes for Speech Communications Solutions to BSS Independent Component Analysis (ICA) [2] Independent Factor Analysis (IFA) [3] [2] A. Hyvarinen, J. Karhunen, and E. Oja, Independent Component Analysis, John Wiley & Sons, Inc [3] H. Attias, “Independent factor analysis”, Neural Computation, 1999.

3/24/2006Lecture notes for Speech Communications Independent component analysis (ICA) Instantaneous mixing The number of sensors is greater than or equal to the number of sources No system noise The sources (components) are independent of each other The sources are non-Gaussian processes

3/24/2006Lecture notes for Speech Communications ICA model Cocktail party problem. Three sources, three sensors: Neither s nor A are known. Can not be solved by linear algebra. If the sources are independent non-Gaussian, the A matrix can be found by maximizing the non-Gaussianity of the sources. Or, in matrix form

3/24/2006Lecture notes for Speech Communications Contrast function An iterative gradient method. First initialize the A matrix. If the mixing matrix A is square and non-singular, move it to the left: Calculate the non-Gaussianity of s, and find the next estimate of A that gives a higher non-Gaussianity. Iterate until convergence. The contrast function is the objective function to maximize or minimize.

3/24/2006Lecture notes for Speech Communications Maximizing non-Gaussianity Non-Gaussian is independent Measuring non-Gaussianity - by kurtosis - by negentropy

3/24/2006Lecture notes for Speech Communications ICA methods ICA by maximizing non-Gaussianity ICA by Maximum Likelihood ICA by minimizing mutual information ICA by nonlinear decorrelation

3/24/2006Lecture notes for Speech Communications Extensions to ICA Noisy ICA ICA with non-square mixing matrix Independent Factor Analysis Convolutive mixture Methods using time structure

3/24/2006Lecture notes for Speech Communications Blind Source Extraction Only interested in one or a few sources out of many (feature extraction) Save computation Don’t know the exact number of sources

3/24/2006Lecture notes for Speech Communications BSE D. Mandic and A. Cichocki, An Online Algorithm For Blind Extraction Of Sources With Different Dynamical Structures.