Project-Final Presentation Blind Dereverberation Algorithm for Speech Signals Based on Multi-channel Linear Prediction Supervisor: Alexander Bertrand Authors:

Slides:



Advertisements
Similar presentations
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: The Linear Prediction Model The Autocorrelation Method Levinson and Durbin.
Advertisements

Liner Predictive Pitch Synchronization Voiced speech detection, analysis and synthesis Jim Bryan Florida Institute of Technology ECE5525 Final Project.
Use of Kalman filters in time and frequency analysis John Davis 1st May 2011.
Improvement of Audio Capture in Handheld Devices through Digital Filtering Problem Microphones in handheld devices are of low quality to reduce cost. This.
Adaptive Filters S.B.Rabet In the Name of GOD Class Presentation For The Course : Custom Implementation of DSP Systems University of Tehran 2010 Pages.
OPTIMUM FILTERING.
Itay Ben-Lulu & Uri Goldfeld Instructor : Dr. Yizhar Lavner Spring /9/2004.
ELE Adaptive Signal Processing
AGC DSP AGC DSP Professor A G Constantinides©1 A Prediction Problem Problem: Given a sample set of a stationary processes to predict the value of the process.
6/3/20151 Voice Transformation : Speech Morphing Gidon Porat and Yizhar Lavner SIPL – Technion IIT December
Blind Source Separation of Acoustic Signals Based on Multistage Independent Component Analysis Hiroshi SARUWATARI, Tsuyoki NISHIKAWA, and Kiyohiro SHIKANO.
Speech & Audio Processing
Overview of Adaptive Multi-Rate Narrow Band (AMR-NB) Speech Codec
Top Level System Block Diagram BSS Block Diagram Abstract In today's expanding business environment, conference call technology has become an integral.
Virtualized Audio as a Distributed Interactive Application Peter A. Dinda Northwestern University Access Grid Retreat, 1/30/01.
Paper Title Your Name CMSC 838 Presentation. CMSC 838T – Presentation Motivation u Problem paper is trying to solve  Characteristics of problem  … u.
Pole Zero Speech Models Speech is nonstationary. It can approximately be considered stationary over short intervals (20-40 ms). Over thisinterval the source.
Single-Channel Speech Enhancement in Both White and Colored Noise Xin Lei Xiao Li Han Yan June 5, 2002.
Why is ASR Hard? Natural speech is continuous
Linear Prediction Problem: Forward Prediction Backward Prediction
EE513 Audio Signals and Systems Noise Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
1 A Matched Filter for Cosmic Ray Detection from Eletromagnetic Wave Reflection Luciano Andrade Thiago Ciodaro José Seixas Federal University of Rio de.
Internet Engineering Czesław Smutnicki Discrete Mathematics – Discrete Convolution.
Page 0 of 23 MELP Vocoders Nima Moghadam SN#: Saeed Nari SN#: Supervisor Dr. Saameti April 2005 Sharif University of Technology.
Acoustic impulse response measurement using speech and music signals John Usher Barcelona Media – Innovation Centre | Av. Diagonal, 177, planta 9,
LIGO- G Z August 19, 2004Penn State University 1 Extracting Signals via Blind Deconvolution Tiffany Summerscales Penn State University.
Blind speech dereverberation using multiple microphones Inseon JANG, Seungjin CHOI Intelligent Multimedia Lab Department of Computer Science and Engineering,
T – Biomedical Signal Processing Chapters
1 Linear Prediction. 2 Linear Prediction (Introduction) : The object of linear prediction is to estimate the output sequence from a linear combination.
1 Linear Prediction. Outline Windowing LPC Introduction to Vocoders Excitation modeling  Pitch Detection.
Supervisor: Dr. Boaz Rafaely Student: Limor Eger Dept. of Electrical and Computer Engineering, Ben-Gurion University Goal Directional analysis of sound.
Name : Arum Tri Iswari Purwanti NPM :
Basics of Neural Networks Neural Network Topologies.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Definitions Random Signal Analysis (Review) Discrete Random Signals Random.
Communication Group Course Multidimensional DSP DoA Estimation Methods Pejman Taslimi – Spring 2009 Course Presentation – Amirkabir Univ Title: Acoustic.
Experimental research in noise influence on estimation precision for polyharmonic model frequencies Natalia Visotska.
Unit-V DSP APPLICATIONS. UNIT V -SYLLABUS DSP APPLICATIONS Multirate signal processing: Decimation Interpolation Sampling rate conversion by a rational.
Open-Loop Dereverberation of Multichannel Room Impulse Responses Bowon Lee, Mark A. Hasegawa-Johnson, and Camille Goudeseune Department of Electrical and.
Chapter 6 Linear Predictive Coding (LPC) of Speech Signals 6.1 Basic Concepts of LPC 6.2 Auto-Correlated Solution of LPC 6.3 Covariance Solution of LPC.
EE Audio Signals and Systems Linear Prediction Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
0 - 1 © 2007 Texas Instruments Inc, Content developed in partnership with Tel-Aviv University From MATLAB ® and Simulink ® to Real Time with TI DSPs Spectrum.
Professor Walter W. Olson Department of Mechanical, Industrial and Manufacturing Engineering University of Toledo Observers/Estimators …  bnbn b n-1 b2b2.
PhD Candidate: Tao Ma Advised by: Dr. Joseph Picone Institute for Signal and Information Processing (ISIP) Mississippi State University Linear Dynamic.
Dr. Galal Nadim.  The root-MUltiple SIgnal Classification (root- MUSIC) super resolution algorithm is used for indoor channel characterization (estimate.
Automatic Equalization for Live Venue Sound Systems Damien Dooley, Final Year ECE Progress To Date, Monday 21 st January 2008.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Normal Equations The Orthogonality Principle Solution of the Normal Equations.
Signal Processing in Neuroinformatics EEG Signal Processing Yongnan Ji.
Single Correlator Based UWB Receiver Implementation through Channel Shortening Equalizer By Syed Imtiaz Husain and Jinho Choi School of Electrical Engineering.
Impulse Response Measurement and Equalization Digital Signal Processing LPP Erasmus Program Aveiro 2012 Digital Signal Processing LPP Erasmus Program Aveiro.
Institut für Nachrichtengeräte und Datenverarbeitung Prof. Dr.-Ing. P. Vary On the Use of Artificial Bandwidth Extension Techniques in Wideband Speech.
Spatial Covariance Models For Under- Determined Reverberant Audio Source Separation N. Duong, E. Vincent and R. Gribonval METISS project team, IRISA/INRIA,
BRAIN TISSUE IMPEDANCE ESTIMATION Improve the Brain’s Evoked Potential’s source Temporal and Spatial Inverse Problem Improve the Brain Tissue Impedance.
Linear Prediction.
State-Space Recursive Least Squares with Adaptive Memory College of Electrical & Mechanical Engineering National University of Sciences & Technology (NUST)
Bayesian Enhancement of Speech Signals Jeremy Reed.
Locating a Shift in the Mean of a Time Series Melvin J. Hinich Applied Research Laboratories University of Texas at Austin
Opracowanie językowe dr inż. J. Jarnicki
Figure 11.1 Linear system model for a signal s[n].
Linear Prediction Simple first- and second-order systems
Equalization in a wideband TDMA system
Linear Prediction.
Convolution and Deconvolution
Chapter 6 Discrete-Time System
Whitening-Rotation Based MIMO Channel Estimation
A maximum likelihood estimation and training on the fly approach
Homomorphic Speech Processing
Wiener Filtering: A linear estimation of clean signal from the noisy signal Using MMSE criterion.
Fixed-point Analysis of Digital Filters
16. Mean Square Estimation
Real-time Uncertainty Output for MBES Systems
Presentation transcript:

Project-Final Presentation Blind Dereverberation Algorithm for Speech Signals Based on Multi-channel Linear Prediction Supervisor: Alexander Bertrand Authors: Dusko Karaklajic Kong Fanxiao Dec.2008

Review of Previous Our Goals :  To solve the reverberation problem  Automatic Speech Recognition (ASR) problem  Room transfer function?  Title “Blind” –why?  Whitening of a signal?

Review of Previous Basic Principle  Input signal generation  AR (Auto Regressive) process  Prediction filters  Prediction error  Estimated AR process

Review of Previous Something about Mathematics  Transfer functions  Prediction Error  Size of the matrix H full row-rank matrix =>> (m+L)x2L and2L≥m+L  The AR polynomial

Review of Previous Why use multi-channel  Necessary and sufficient condition for existing of generalized inverse matrix

Basic Algorithm  Input signal generation  Prediction error  From the same eigenvalue λ of matrix C & Q Here C is the companion matrix and

Basic Algorithm  Calculate Q with the signal received at the microphone  The first column of the matrix Q give us the prediction filter coefficients as matrix  Calculate the prediction error

Basic Algorithm  Calculate the characteristic polynomial of Q to estimated AR  Recover the input signal by filtering the prediction error with the estimated AR parameters We can prove λ(Q)=λ(C) λ is the eigenvalue

Simulation Environment  Room Transfer functions  Simulation conditions Length of impulse response 50 taps Number of input signal samples 45,000 Length of generating AR process 21 taps Sampling frequency 16 kHz Length of prediction filters 50 taps Length of estimated AR process 101 taps

Simulation Room Transfer Function  Reflection coefficient of the walls 0.8  Different positions of microphones->different impulse response  Length of the impulse response?

Simulation Speech Simulation  AR process  Sound “U” is used for the estimation of AR parameters

Simulation Speech Simulation

Simulation Intermediate Results  “reverberated “ signal  Red-spectrum of the microphone signal  Blue-spectrum of the input speech signal

Simulation Final Result  whitening of the signal-output white noise  estimated AR coefficients

Simulation Final Result  Good “blind” AR parameter estimation!  Dereverberated signal

Simulation Statistics Results: SDR Before =3.22 SDR After =51.48

Simulation For Real speech signal

Simulation Intermediate result  “reverberated ” signal  Red-spectrum of the microphone signal  Blue-spectrum of the input speech signal

Simulation Final result  Good “blind” AR parameter estimation!  Dereverberated signal

Simulation Statistics Results: SDR Before =3.97 SDR After =20.33

Conclusions  Expected results vs. practical results  non-whitened output signal  Limitations?  Length of impulse response vs. Q matrix length vs. computational time  Simulated vs. Real Speech Signal  Noise free environment- not realistic