Signal Processing Algorithms for Wireless Acoustic Sensor Networks Alexander Bertrand Electrical Engineering Department (ESAT) Katholieke Universiteit.

Slides:



Advertisements
Similar presentations
Bayesian Belief Propagation
Advertisements

Acoustic Echo Cancellation for Low Cost Applications
1 Closed-Form MSE Performance of the Distributed LMS Algorithm Gonzalo Mateos, Ioannis Schizas and Georgios B. Giannakis ECE Department, University of.
Air Force Technical Applications Center 1 Subspace Based Three- Component Array Processing Gregory Wagner Nuclear Treaty Monitoring Geophysics Division.
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Beamforming Issues in Modern MIMO Radars with Doppler
Speech Enhancement through Noise Reduction By Yating & Kundan.
Cooperative Multiple Input Multiple Output Communication in Wireless Sensor Network: An Error Correcting Code approach using LDPC Code Goutham Kumar Kandukuri.
1 Wireless Communication Low Complexity Multiuser Detection Rami Abdallah University of Illinois at Urbana Champaign 12/06/2007.
BYU Auxiliary Antenna Assisted Interference Cancellation for Radio Astronomy Imaging Arrays Brian Jeffs and Karl Warnick August 21, 2002.
Independent Component Analysis (ICA)
3/24/2006Lecture notes for Speech Communications Multi-channel speech enhancement Chunjian Li DICOM, Aalborg University.
Zhengyou Zhang, Qin Cai, Jay Stokes
EE 685 presentation Optimization Flow Control, I: Basic Algorithm and Convergence By Steven Low and David Lapsley Asynchronous Distributed Algorithm Proof.
ICA Alphan Altinok. Outline  PCA  ICA  Foundation  Ambiguities  Algorithms  Examples  Papers.
Laurent Itti: CS599 – Computational Architectures in Biological Vision, USC Lecture 7: Coding and Representation 1 Computational Architectures in.
Weighted Median Filters for Complex Array Signal Processing Yinbo Li - Gonzalo R. Arce Department of Electrical and Computer Engineering University of.
Adaptive Signal Processing
For 3-G Systems Tara Larzelere EE 497A Semester Project.
Wireless Communication Elec 534 Set IV October 23, 2007
International Technology Alliance In Network & Information Sciences International Technology Alliance In Network & Information Sciences 1 Cooperative Wireless.
Sensor Positioning in Wireless Ad-hoc Sensor Networks Using Multidimensional Scaling Xiang Ji and Hongyuan Zha Dept. of Computer Science and Engineering,
1 Sequential Acoustic Energy Based Source Localization Using Particle Filter in a Distributed Sensor Network Xiaohong Sheng, Yu-Hen Hu University of Wisconsin.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Introduction SNR Gain Patterns Beam Steering Shading Resources: Wiki:
-1- ICA Based Blind Adaptive MAI Suppression in DS-CDMA Systems Malay Gupta and Balu Santhanam SPCOM Laboratory Department of E.C.E. The University of.
A Shaft Sensorless Control for PMSM Using Direct Neural Network Adaptive Observer Authors: Guo Qingding Luo Ruifu Wang Limei IEEE IECON 22 nd International.
1 Techniques to control noise and fading l Noise and fading are the primary sources of distortion in communication channels l Techniques to reduce noise.
Dynamic Clustering for Acoustic Target Tracking in Wireless Sensor Network Wei-Peng Chen, Jennifer C. Hou, Lui Sha.
Capacity Scaling with Multiple Radios and Multiple Channels in Wireless Mesh Networks Oguz GOKER.
Shape Matching for Model Alignment 3D Scan Matching and Registration, Part I ICCV 2005 Short Course Michael Kazhdan Johns Hopkins University.
Eigenstructure Methods for Noise Covariance Estimation Olawoye Oyeyele AICIP Group Presentation April 29th, 2003.
Nico De Clercq Pieter Gijsenbergh Noise reduction in hearing aids: Generalised Sidelobe Canceller.
Heart Sound Background Noise Removal Haim Appleboim Biomedical Seminar February 2007.
Tracking with Unreliable Node Sequences Ziguo Zhong, Ting Zhu, Dan Wang and Tian He Computer Science and Engineering, University of Minnesota Infocom 2009.
REVISED CONTEXTUAL LRT FOR VOICE ACTIVITY DETECTION Javier Ram’ırez, Jos’e C. Segura and J.M. G’orriz Dept. of Signal Theory Networking and Communications.
2010/12/11 Frequency Domain Blind Source Separation Based Noise Suppression to Hearing Aids (Part 1) Presenter: Cian-Bei Hong Advisor: Dr. Yeou-Jiunn Chen.
Blind speech dereverberation using multiple microphones Inseon JANG, Seungjin CHOI Intelligent Multimedia Lab Department of Computer Science and Engineering,
Multiuser Detection (MUD) Combined with array signal processing in current wireless communication environments Wed. 박사 3학기 구 정 회.
Image Restoration using Iterative Wiener Filter --- ECE533 Project Report Jing Liu, Yan Wu.
Tufts University. EE194-WIR Wireless Sensor Networks. April 21, 2005 Increased QoS through a Degraded Channel using a Diverse, Cross-Layered Protocol Elliot.
Adaptive Methods for Speaker Separation in Cars DaimlerChrysler Research and Technology Julien Bourgeois
Signal Processing & Communication for Smart Dust Networks Haralabos (Babis) Papadopoulos ECE Department Institute for Systems Research University of Maryland,
Nico De Clercq Pieter Gijsenbergh.  Problem  Solutions  Single-channel approach  Multichannel approach  Our assignment Overview.
Reduced-bandwidth and distributed MWF-based noise reduction algorithms Simon Doclo, Tim Van den Bogaert, Jan Wouters, Marc Moonen Dept. of Electrical Engineering.
Energy-Efficient Signal Processing and Communication Algorithms for Scalable Distributed Fusion.
1 Optical Packet Switching Techniques Walter Picco MS Thesis Defense December 2001 Fabio Neri, Marco Ajmone Marsan Telecommunication Networks Group
An Introduction to Blind Source Separation Kenny Hild Sept. 19, 2001.
1 Blind Channel Identification and Equalization in Dense Wireless Sensor Networks with Distributed Transmissions Xiaohua (Edward) Li Department of Electrical.
College of Engineering Anchor Nodes Placement for Effective Passive Localization Karthikeyan Pasupathy Major Advisor: Dr. Robert Akl Department of Computer.
Wireless Multiple Access Schemes in a Class of Frequency Selective Channels with Uncertain Channel State Information Christopher Steger February 2, 2004.
EE 685 presentation Optimization Flow Control, I: Basic Algorithm and Convergence By Steven Low and David Lapsley.
Study of Broadband Postbeamformer Interference Canceler Antenna Array Processor using Orthogonal Interference Beamformer Lal C. Godara and Presila Israt.
A Semi-Blind Technique for MIMO Channel Matrix Estimation Aditya Jagannatham and Bhaskar D. Rao The proposed algorithm performs well compared to its training.
Laboratory for Experimental ORL K.U.Leuven, Belgium Dept. of Electrotechn. Eng. ESAT/SISTA K.U.Leuven, Belgium Combining noise reduction and binaural cue.
Performance of Adaptive Beam Nulling in Multihop Ad Hoc Networks Under Jamming Suman Bhunia, Vahid Behzadan, Paulo Alexandre Regis, Shamik Sengupta.
Energy-Efficient Signal Processing and Communication Algorithms for Scalable Distributed Fusion.
Smart Sleeping Policies for Wireless Sensor Networks Venu Veeravalli ECE Department & Coordinated Science Lab University of Illinois at Urbana-Champaign.
Yi Jiang MS Thesis 1 Yi Jiang Dept. Of Electrical and Computer Engineering University of Florida, Gainesville, FL 32611, USA Array Signal Processing in.
Introduction to Independent Component Analysis Math 285 project Fall 2015 Jingmei Lu Xixi Lu 12/10/2015.
An Introduction of Independent Component Analysis (ICA) Xiaoling Wang Jan. 28, 2003.
Spatial vs. Blind Approaches for Speaker Separation: Structural Differences and Beyond Julien Bourgeois RIC/AD.
Distributed Signal Processing Woye Oyeyele March 4, 2003.
Optimization-based Cross-Layer Design in Networked Control Systems Jia Bai, Emeka P. Eyisi Yuan Xue and Xenofon D. Koutsoukos.
Flexible Speaker Adaptation using Maximum Likelihood Linear Regression Authors: C. J. Leggetter P. C. Woodland Presenter: 陳亮宇 Proc. ARPA Spoken Language.
Sridhar Rajagopal Bryan A. Jones and Joseph R. Cavallaro
Speech Enhancement Summer 2009
Advanced Wireless Networks
Two-Stage Mel-Warped Wiener Filter SNR-Dependent Waveform Processing
Information Sciences and Systems Lab
Chenhui Zheng/Communication Laboratory
Presentation transcript:

Signal Processing Algorithms for Wireless Acoustic Sensor Networks Alexander Bertrand Electrical Engineering Department (ESAT) Katholieke Universiteit Leuven , University of Oldenburg, MEDI-AKU-SIGNAL Kolloquium

Outline 1.Introduction 2.Multi-channel Wiener filter (MWF) 3.Example: distributed MWF in binaural hearing aids 4.DANSE in fully connected WASN 5.Tree-DANSE 6.Multi-speaker VAD Tracking of speech power Noise reduction

Outline 1.Introduction 2.Multi-channel Wiener filter (MWF) 3.Example: distributed MWF in binaural hearing aids 4.DANSE in fully connected WASN 5.Tree-DANSE 6.Multi-speaker VAD

4 Traditional sensor array DSP centralized processing known / fixed sensor positions Sensor array DSP  Long distance (SNR drops 6dB for each doubling of distance)  Sharp angle  #microphones is limited

5 Distributed sensor arrays Wireless acoustic sensor network (WASN) More spatial information More sensors Subset: high SNR recordings

6 Challenges 3) Distributed processing 1) Unknown/changing positions, link failure  ADAPTIVE 2) Bandwidth efficiency 4) Subset selection Distributed sensor arrays

Outline 1.Introduction 2.Multi-channel Wiener filter (MWF) 3.Example: distributed MWF in binaural hearing aids 4.DANSE in fully connected WASN 5.Tree-DANSE 6.Multi-speaker VAD

Multi-channel Wiener Filtering (MWF) - Goal: estimate speech component in 1 of the N microphones - Output = sum of filtered microphone signals: W1 W2 W3 W4 + Clean speech

Multi-channel Wiener Filtering (MWF) - Goal: estimate speech component in 1 of the N microphones - Output = sum of filtered microphone signals: W1 W2 W3 W4 + Clean speech

Multi-channel Wiener Filtering (MWF) - Goal: estimate speech component in 1 of the N microphones - Output = sum of filtered microphone signals: - Needs: - N x N noise+speech correlation matrix R yy - N x 1 clean speech correlation (column of R dd ) - R dd can be estimated using R dd = R yy - R nn using voice activity detection (VAD) mechanism W1 W2 W3 W4 + Clean speech

Multi-channel Wiener Filtering (MWF) RECAP - Given: N microphone signals - Choose one (arbitrary) reference microphone - MWF computes optimal filters such that sum of outputs is as close as possible to speech component in target microphone

Noise frame: destructive interference  Noise = electro music F1 F2 F3 F4 +

 Noise = electro music F1 F2 F3 F4 + Speech frame: constructive interference

Outline 1.Introduction 2.Multi-channel Wiener filter (MWF) 3.Example: distributed MWF in binaural hearing aids 4.DANSE in fully connected WASN 5.Tree-DANSE 6.Multi-speaker VAD 7.Subset selection 8.Conclusions

15 Example: binaural hearing aids MWF leftMWF right Binaural link large bandwidth needed full matrix inversion = 2-node WASN

16 Example: binaural hearing aids w 11 Binaural link g 12 + g 21 w 22 + Converges to optimum if single desired source (Doclo et al., 2007)

17 Motivation for DANSE > 2 nodes ? e.g. supporting external sensor nodes or multiple hearing aid users.

18 Motivation for DANSE > 2 nodes ? e.g. supporting external sensor nodes or multiple hearing aid users.

19 Motivation for DANSE > 2 nodes ? e.g. supporting external sensor nodes or multiple hearing aid users.

20 Motivation for DANSE > 2 nodes ? e.g. supporting external sensor nodes or multiple hearing aid users.

21 Motivation for DANSE > 2 nodes Multiple desired sources e.g. conversation monitoring.

22 Motivation for DANSE > 2 nodes Multiple desired sources e.g. conversation monitoring.

Outline 1.Introduction 2.Multi-channel Wiener filter (MWF) 3.Example: distributed MWF in binaural hearing aids 4.DANSE in fully connected WASN 5.Tree-DANSE 6.Multi-speaker VAD

24 DANSE Previous requires more general framework: Distributed adaptive node-specific signal estimation (DANSE) Allows for multiple nodes (fully connected topology) Allows for multiple target sources: Estimating K sources requires communication of K-channel signals (DANSE K )

DANSE Considered here: Fully connected WSN Multi-channel sensor signal observations Goal: each node estimates node-specific signal, but common latent signal subspace (dimension= # targets)

26 3 nodes, fully connected

27 Binaural hearing aids (revisited) w 11 Binaural link g 12 + g 21 w 22 +

28 w 11 (2) Binaural link g 12 (2) ++ w 11 (1) g 12 (1) w 22 (2)g 21 (2) w 22 (1) g 21 (1) Converges to optimum if #desired sources ≤ 2 J=2, DANSE 2 (K=2) auxiliary channels (capture signal space) Binaural hearing aids (revisited)

29 Binaural link ++ J=2, DANSE K Converges to optimum if K= # desired sources Binaural hearing aids (revisited)

Sequential updating Sequential round-robin update

31 DANSE with simultaneous updating - Simultaneous updating: parallel computing - Sometimes convergence to optimal solution, but not always - Solution: relaxation yields convergence and optimality:

32 Without relaxation (S-DANSE) 4 nodes, 3-6 sensors/node DANSE with simultaneous updating

33 With relaxation (rS-DANSE) 4 nodes, 3-6 sensors/node DANSE with simultaneous updating

34 DANSE audio demo (tracking omitted) Unfiltered rS-DANSE Centralized MWF

35 Robust DANSE - Theory: DANSE == centralized MWF, but…

36 Robust DANSE - Numerical errors due to: -Estimation errors in R dd (especially at low SNR nodes)  ripple effect -Reference microphones are close to each other  ill-conditioned basis for signal subspace - Solution: estimate speech component in communicated signals, preferably from high SNR nodes (= Robust DANSE or R-DANSE) - Convergence is proven under certain dependency conditions

Outline 1.Introduction 2.Multi-channel Wiener filter (MWF) 3.Example: distributed MWF in binaural hearing aids 4.DANSE in fully connected WASN 5.Tree-DANSE 6.Multi-speaker VAD

What if not fully connected?

Nodes must pass on information from other nodes 1) Nodes act as relays (virtually fully connected): - huge increase in bandwidth if limited connections - routing problem 2) Nodes broadcast the sum of all filtered inputs: - no increase in bandwidth - no routing problem (?)

40 What if not fully connected?

FEEDBACK !! What if not fully connected?

- Intuition - Theoretical analysis - Conclusion: feedback causes major problems - Direct feedback (one edge) vs. indirect feedback (loops)

Direct feedback cancellation Transmitter feedback cancellation

Receiver feedback cancellation Direct feedback cancellation

What if not fully connected? - Intuition - Theoretical analysis - Conclusion: feedback causes major problems - Direct feedback (one edge) vs. indirect feedback (loops) - Prune to tree topology  T-DANSE (= still optimal output!!)

Outline 1.Introduction 2.Multi-channel Wiener filter (MWF) 3.Example: distributed MWF in binaural hearing aids 4.DANSE in fully connected WASN 5.Tree-DANSE 6.Multi-speaker VAD

47 Multi-speaker VAD - Goal : Track individual speech power of multiple simultaneous speakers or other non-stationary sources (  VAD) - Exploit spatial diversity from WASN speaker microphone

48 Multi-speaker VAD Ad-hoc microphone array Assumptions: 1.Speakers in near-field 2.Speakers are independent 3.Limited noise/reverberance 4.Sources to track are well-grounded (= they attain zero-values) Advantages: Array geometry unknown Speaker positions unknown Energy-based  low data rate  synchronization not crucial WASN’s !

Data model

Non-negative blind source separation - Theorem (Plumbley, 2002): “An orthogonal mixture of non-negative, well-grounded source signals, that preserves non-negativity, is a permutation of the original signals.”

Exploiting non-negativity and well- groundedness (J=N=2 example) s1s1 s2s2 s1s1 s2s2 y=As

Exploiting non-negativity and well- groundedness (J=N=2 example) s1s1 s2s2 Orthogonal transformation preserves uncorrelatedness  simple decorrelation (whitening) of measurements gives original up to a rotation whiten s1s1 s2s2 ?

Exploiting non-negativity and well- groundedness (J=N=2 example) - Well-grounded source signals y=As s1s1 s2s2 s1s1 s2s2

Exploiting non-negativity and well- groundedness (J=N=2 example) - Well-grounded source signals s1s1 s2s2 whiten s1s1 s2s2 !

Exploiting non-negativity and well- groundedness (J=N=2 example) - Well-grounded source signals s1s1 s2s2 s1s1 s2s2

Non-negative blind source separation - Theorem (Plumbley, 2002): “An orthogonal mixture of non-negative, well-grounded source signals, that preserves non-negativity, is a permutation of the original signals.” - Two different techniques: 1.- Whitening, ignoring non-negativity constraints (=easy) - Search for rotation matrix that restores non-negativity (=hard) 2. Whitening with non-negativity constraints (=hard) - 1 st approach (Oja & Plumbley) = NPCA (Non-negative principal component analysis) - 2 nd approach (Bertrand & Moonen) = MNICA (Multiplicative non- negative independent component analysis)

MNICA: results