Communications & Multimedia Signal Processing Meeting 7 Esfandiar Zavarehei Department of Electronic and Computer Engineering Brunel University 23 November,

Slides:



Advertisements
Similar presentations
Learning Introductory Signal Processing Using Multimedia 1 Outline Overview of Information and Communications Some signal processing concepts Tools available.
Advertisements

Speech Coding Workshop 2000 Jean-Marc Valin, Roch Lefebvre 1 IEEE Speech Coding Workshop Sept 17–20, 2000 Lake Lawn Resort Delavan, WI Jean-Marc Valin,
Liner Predictive Pitch Synchronization Voiced speech detection, analysis and synthesis Jim Bryan Florida Institute of Technology ECE5525 Final Project.
Page 0 of 34 MBE Vocoder. Page 1 of 34 Outline Introduction to vocoders MBE vocoder –MBE Parameters –Parameter estimation –Analysis and synthesis algorithm.
2004 COMP.DSP CONFERENCE Survey of Noise Reduction Techniques Maurice Givens.
Analysis and Synthesis of Shouted Speech Tuomo Raitio Jouni Pohjalainen Manu Airaksinen Paavo Alku Antti Suni Martti Vainio.
AGC DSP AGC DSP Professor A G Constantinides©1 A Prediction Problem Problem: Given a sample set of a stationary processes to predict the value of the process.
Speech Coding Nicola Orio Dipartimento di Ingegneria dell’Informazione IV Scuola estiva AISV, 8-12 settembre 2008.
Zhiyao Duan, Gautham J. Mysore, Paris Smaragdis 1. EECS Department, Northwestern University 2. Advanced Technology Labs, Adobe Systems Inc. 3. University.
Overview of Adaptive Multi-Rate Narrow Band (AMR-NB) Speech Codec
Communications & Multimedia Signal Processing Frequency Kalman Noise Reduction Esfandiar Zavarehei Department of Electronic and Computer Engineering Brunel.
Communications & Multimedia Signal Processing Meeting 6 Esfandiar Zavarehei Department of Electronic and Computer Engineering Brunel University 6 July,
Communications & Multimedia Signal Processing Report of Work on Formant Tracking LP Models and Plans on Integration with Harmonic Plus Noise Model Qin.
Communications & Multimedia Signal Processing Analysis of the Effects of Train noise on Recognition Rate using Formants and MFCC Esfandiar Zavarehei Department.
Single-Channel Speech Enhancement in Both White and Colored Noise Xin Lei Xiao Li Han Yan June 5, 2002.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
7th IEEE Technical Exchange Meeting 2000 Hybrid Wavelet-SVD based Filtering of Noise in Harmonics By Prof. Maamar Bettayeb and Syed Faisal Ali Shah King.
Communications & Multimedia Signal Processing Formant Based Synthesizer Qin Yan Communication & Multimedia Signal Processing Group Dept of Electronic.
Tracking using the Kalman Filter. Point Tracking Estimate the location of a given point along a sequence of images. (x 0,y 0 ) (x n,y n )
Communications & Multimedia Signal Processing Formant Track Restoration in Train Noisy Speech Qin Yan Communication & Multimedia Signal Processing Group.
Communications & Multimedia Signal Processing 1 Speech Communication for Mobile and Hands-Free Devices in Noisy Environments EPSRC Project GR/S30238/01.
Speech Recognition in Noise
Communications & Multimedia Signal Processing Formant Tracking LP with Harmonic Plus Noise Model of Excitation for Speech Enhancement Qin Yan Communication.
Communications & Multimedia Signal Processing Refinement in FTLP-HNM system for Speech Enhancement Qin Yan Communication & Multimedia Signal Processing.
1 Speech Enhancement Wiener Filtering: A linear estimation of clean signal from the noisy signal Using MMSE criterion.
Communications & Multimedia Signal Processing Analysis of Effects of Train/Car noise in Formant Track Estimation Qin Yan Department of Electronic and Computer.
IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING MARCH 2010 Lan-Ying Yeh
Normalization of the Speech Modulation Spectra for Robust Speech Recognition Xiong Xiao, Eng Siong Chng, and Haizhou Li Wen-Yi Chu Department of Computer.
EE513 Audio Signals and Systems Digital Signal Processing (Systems) Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
A VOICE ACTIVITY DETECTOR USING THE CHI-SQUARE TEST
„Bandwidth Extension of Speech Signals“ 2nd Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction 22nd and 23rd June.
Sensitivity System sensitivity is defined as the available input signal level Si for a given (SNR)O Si is called the minimum detectable signal An expression.
AUDIO COMPRESSION msccomputerscience.com. The process of digitizing audio signals is called PCM PCM involves sampling audio signal at minimum rate which.
Computer Vision - Restoration Hanyang University Jong-Il Park.
Blind speech dereverberation using multiple microphones Inseon JANG, Seungjin CHOI Intelligent Multimedia Lab Department of Computer Science and Engineering,
Concepts of Multiplexing Many input signals to one transmission media Reduces the number of channels or conductors running from point A to point B Added.
Authors: Sriram Ganapathy, Samuel Thomas, and Hynek Hermansky Temporal envelope compensation for robust phoneme recognition using modulation spectrum.
School of Electrical & Computer Engineering Image Denoising Using Steerable Pyramids Alex Cunningham Ben Clarke Dy narath Eang ECE November 2008.
ECE 5525 Osama Saraireh Fall 2005 Dr. Veton Kepuska
VOCODERS. Vocoders Speech Coding Systems Implemented in the transmitter for analysis of the voice signal Complex than waveform coders High economy in.
8-1 Chapter 8: Image Restoration Image enhancement: Overlook degradation processes, deal with images intuitively Image restoration: Known degradation processes;
Noise Reduction Two Stage Mel-Warped Weiner Filter Approach.
Implementation of a speech Analysis-Synthesis Toolbox using Harmonic plus Noise Model Didier Cadic 1, engineering student supervised by Olivier Cappé.
CHAPTER 4 COMPLEX STIMULI. Types of Sounds So far we’ve talked a lot about sine waves =periodic =energy at one frequency But, not all sounds are like.
Digital Audio Signal Processing Lecture-3 Noise Reduction
RCC-Mean Subtraction Robust Feature and Compare Various Feature based Methods for Robust Speech Recognition in presence of Telephone Noise Amin Fazel Sharif.
Chapter 20 Speech Encoding by Parameters 20.1 Linear Predictive Coding (LPC) 20.2 Linear Predictive Vocoder 20.3 Code Excited Linear Prediction (CELP)
Page 0 of 7 Particle filter - IFC Implementation Particle filter – IFC implementation: Accept file (one frame at a time) Initial processing** Compute autocorrelations,
Cameron Rowe.  Introduction  Purpose  Implementation  Simple Example Problem  Extended Kalman Filters  Conclusion  Real World Examples.
Wavelet Thresholding for Multiple Noisy Image Copies S. Grace Chang, Bin Yu, and Martin Vetterli IEEE TRANSACTIONS
1 Speech Compression (after first coding) By Allam Mousa Department of Telecommunication Engineering An Najah University SP_3_Compression.
UNIT-IV. Introduction Speech signal is generated from a system. Generation is via excitation of system. Speech travels through various media. Nature of.
High Quality Voice Morphing
Speech Enhancement Summer 2009
P.Astone, S.D’Antonio, S.Frasca, C.Palomba
Vocoders.
Linear Prediction Simple first- and second-order systems
Speech Enhancement with Binaural Cues Derived from a Priori Codebook
1 Vocoders. 2 The Channel Vocoder (analyzer) : The channel vocoder employs a bank of bandpass filters,  Each having a bandwidth between 100 HZ and 300.
Speech and Audio Processing
ON THE ARCHITECTURE OF THE CDMA2000® VARIABLE-RATE MULTIMODE WIDEBAND (VMR-WB) SPEECH CODING STANDARD Milan Jelinek†, Redwan Salami‡, Sassan Ahmadi*, Bruno.
Two-Stage Mel-Warped Wiener Filter SNR-Dependent Waveform Processing
Chapter 3: PCM Noise and Companding
Vocoders.
Wiener Filtering: A linear estimation of clean signal from the noisy signal Using MMSE criterion.
Presenter: Shih-Hsiang(士翔)
Additive Manufacturing: Denoising and Particle Tracking
Combination of Feature and Channel Compensation (1/2)
Presentation transcript:

Communications & Multimedia Signal Processing Meeting 7 Esfandiar Zavarehei Department of Electronic and Computer Engineering Brunel University 23 November, 2005

Communications & Multimedia Signal Processing Contents Kalman Filter: Speech and noise tracking HNM Model: The degree of “Harmonicity” Bandwidth extension Future work: noise reduction using HNM

Communications & Multimedia Signal Processing Kalman Filter: Speech and noise tracking Previous method: Modelling speech with an AR model X : Noisy S : Speech D : Noise Noise Variance

Communications & Multimedia Signal Processing Kalman Filter: Speech and noise tracking (cont.) New method: Modelling speech AND noise with AR models

Communications & Multimedia Signal Processing Kalman Filter: Speech and noise tracking (cont.) The noise AR models are obtained from noise-only periods. Results from the new model sound more natural Clean Speech Noisy Speech Kalman Old Kalman New SNR Method CarCar DFTKUN DFTKCN MMSE Wiener PSS TrainTrain DFTKUN DFTKCN MMSE Wiener PSS WGNWGN DFTKUN DFTKCN MMSE Wiener PSS

Communications & Multimedia Signal Processing HNM Model Harmonic sub-bands are modelled as the sum of a Gaussian and some random noise R: Random Noise with Rayleigh Distribution

Communications & Multimedia Signal Processing HNM Model Sample Reconstructed Frame

Communications & Multimedia Signal Processing HNM Model Original Synthesized, PESQ:3.91

Communications & Multimedia Signal Processing HNM Model Noise severely affects the A k Pitch, Harmonicity and Harmonic frequencies are much less distorted by noise Simple analysis/synthesis of noisy speech improves its quality (SNR<10dB)

Communications & Multimedia Signal Processing LP-HNM Decompose the signal to an LP model (AR or LSF) and an HNM model of the residual (f k,A k,V k ) Amplitude can be assumed to be equal (whitened by inverse modelling) Frequencies also may be assumed to be multiples of the fundamental frequency (later displaced slightly by LP modelling) LP-HNM synthesized PESQ: 3.50

Communications & Multimedia Signal Processing Bandwidth Extension One application of the model is Bandwidth Extension for getting 16KHz speech quality from 8KHz Speech Trained LP-HNM Model LP-HNM Analysis 8KHz Speech Signal 16KHz Speech Signal

Communications & Multimedia Signal Processing Bandwidth Extension Codebook Mapping is used to obtain higher LPF coefficients from lower LPF coefficients extracted from 8KHz signal Similar method is used to obtain the harmonicity degree of higher sub- bands

Communications & Multimedia Signal Processing Bandwidth Extension A shadow codebook for LP gain ratio (G 8 /G 16 ) is used for gain mapping Phase is extrapolated assuming a linear phase for the harmonics, some random noise is added to unvoiced sub-bands The performance of the system deteriorates in noise

Communications & Multimedia Signal Processing Future Work Tracking the HNM parameters using Kalman filter, in other words, rather than tracking DFT trajectories in one frequency bin, it might be better to track only the harmonic bins (reduced computational complexity) along the harmonic frequencies (intuitively makes more sense!)

Communications & Multimedia Signal Processing Future Work Some harmonics proved very difficult to recover from noise (e.g. 1-3). Investigate the possibility of a similar model based approach as the BWE method for estimating parameters of those harmonics. The Harmonicity of the sub-bands and the reciprocal noise level at those frequencies may be used as weights in the mapping process. Clean Speech De-noised Speech

Communications & Multimedia Signal Processing Future Work A is a parameter vector, W is the weighting vector (e.g. reciprocal of normalized noise spectrum). B j is the j th entry of the codebook. The result can be used for reconstructing speech