Reduction of Additive Noise in the Digital Processing of Speech Avner Halevy AMSC 663 Mid Year Progress Report December 2008 Professor Radu Balan 1.

Slides:



Advertisements
Similar presentations
Background Noise Definition: an unwanted sound or an unwanted perturbation to a wanted signal Examples: – Clicks from microphone synchronization – Ambient.
Advertisements

Zhiyao Duan, Gautham J. Mysore, Paris Smaragdis 1. EECS Department, Northwestern University 2. Advanced Technology Labs, Adobe Systems Inc. 3. University.
Reduction of Additive Noise in the Digital Processing of Speech Avner Halevy AMSC 664 Final Presentation May 2009 Dr. Radu Balan Department of Mathematics.
Model-Based Fusion of Bone and Air Sensors for Speech Enhancement and Robust Speech Recognition John Hershey, Trausti Kristjansson, Zhengyou Zhang, Alex.
Advances in WP1 Nancy Meeting – 6-7 July
Communications & Multimedia Signal Processing Frequency Kalman Noise Reduction Esfandiar Zavarehei Department of Electronic and Computer Engineering Brunel.
Communications & Multimedia Signal Processing Meeting 6 Esfandiar Zavarehei Department of Electronic and Computer Engineering Brunel University 6 July,
Communications & Multimedia Signal Processing Report of Work on Formant Tracking LP Models and Plans on Integration with Harmonic Plus Noise Model Qin.
Communications & Multimedia Signal Processing Meeting 7 Esfandiar Zavarehei Department of Electronic and Computer Engineering Brunel University 23 November,
Single-Channel Speech Enhancement in Both White and Colored Noise Xin Lei Xiao Li Han Yan June 5, 2002.
Bayesian Methods for Speech Enhancement I. Andrianakis P. R. White Signal Processing and Control Group Institute of Sound and Vibration Research University.
Speech Enhancement Based on a Combination of Spectral Subtraction and MMSE Log-STSA Estimator in Wavelet Domain LATSI laboratory, Department of Electronic,
Communications & Multimedia Signal Processing Formant Track Restoration in Train Noisy Speech Qin Yan Communication & Multimedia Signal Processing Group.
Communications & Multimedia Signal Processing 1 Speech Communication for Mobile and Hands-Free Devices in Noisy Environments EPSRC Project GR/S30238/01.
Speech Recognition in Noise
Communications & Multimedia Signal Processing Formant Tracking LP with Harmonic Plus Noise Model of Excitation for Speech Enhancement Qin Yan Communication.
Communications & Multimedia Signal Processing Refinement in FTLP-HNM system for Speech Enhancement Qin Yan Communication & Multimedia Signal Processing.
Background Noise Definition: an unwanted sound or an unwanted perturbation to a wanted signal Examples: Clicks from microphone synchronization Ambient.
Advances in WP1 and WP2 Paris Meeting – 11 febr
1 Speech Enhancement Wiener Filtering: A linear estimation of clean signal from the noisy signal Using MMSE criterion.
1 New Technique for Improving Speech Intelligibility for the Hearing Impaired Miriam Furst-Yust School of Electrical Engineering Tel Aviv University.
Communications & Multimedia Signal Processing Analysis of Effects of Train/Car noise in Formant Track Estimation Qin Yan Department of Electronic and Computer.
Warped Linear Prediction Concept: Warp the spectrum to emulate human perception; then perform linear prediction on the result Approaches to warp the spectrum:
Normalization of the Speech Modulation Spectra for Robust Speech Recognition Xiong Xiao, Eng Siong Chng, and Haizhou Li Wen-Yi Chu Department of Computer.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
INTRODUCTION  Sibilant speech is aperiodic.  the fricatives /s/, / ʃ /, /z/ and / Ʒ / and the affricatives /t ʃ / and /d Ʒ /  we present a sibilant.
Digital Audio Signal Processing Lecture-4: Noise Reduction Marc Moonen/Alexander Bertrand Dept. E.E./ESAT-STADIUS, KU Leuven
Speech Enhancement Using Spectral Subtraction
Reporter: Shih-Hsiang( 士翔 ). Introduction Speech signal carries information from many sources –Not all information is relevant or important for speech.
Image Restoration using Iterative Wiener Filter --- ECE533 Project Report Jing Liu, Yan Wu.
Chapter 3: Image Restoration Introduction. Image restoration methods are used to improve the appearance of an image by applying a restoration process.
Authors: Sriram Ganapathy, Samuel Thomas, and Hynek Hermansky Temporal envelope compensation for robust phoneme recognition using modulation spectrum.
Yi-zhang Cai, Jeih-weih Hung 2012/08/17 報告者:汪逸婷 1.
Constrained Least Squares Filtering
Robust Entropy-based Endpoint Detection for Speech Recognition in Noisy Environments 張智星
Noise Reduction Two Stage Mel-Warped Weiner Filter Approach.
Speech Enhancement Using a Minimum Mean Square Error Short-Time Spectral Amplitude Estimation method.
USE OF IMPROVED FEATURE VECTORS IN SPECTRAL SUBTRACTION METHOD Emrah Besci, Semih Ergin, M.Bilginer Gülmezoğlu, Atalay Barkana Osmangazi University, Electrical.
Speech Enhancement for ASR by Hans Hwang 8/23/2000 Reference 1. Alan V. Oppenheim,etc., ” Multi-Channel Signal Separation by Decorrelation ”,IEEE Trans.
ICASSP 2006 Robustness Techniques Survey ShihHsiang 2006.
Digital Audio Signal Processing Lecture-3 Noise Reduction
RCC-Mean Subtraction Robust Feature and Compare Various Feature based Methods for Robust Speech Recognition in presence of Telephone Noise Amin Fazel Sharif.
A. R. Jayan, P. C. Pandey, EE Dept., IIT Bombay 1 Abstract Perception of speech under adverse listening conditions may be improved by processing it to.
Digital Image Processing CSC331 Image restoration 1.
MINUET Musical Interference Unmixing Estimation Technique Scott Rickard, Conor Fearon Department of Electronic & Electrical Engineering University College.
CS654: Digital Image Analysis Lecture 22: Image Restoration.
The Frequency Domain Digital Image Processing – Chapter 8.
UNIT-IV. Introduction Speech signal is generated from a system. Generation is via excitation of system. Speech travels through various media. Nature of.
PART II: TRANSIENT SUPPRESSION. IntroductionIntroduction Cohen, Gannot and Talmon\11 2 Transient Interference Suppression Transient Interference Suppression.
1 LOW-RESOURCE NOISE-ROBUST FEATURE POST-PROCESSING ON AURORA 2.0 Chia-Ping Chen, Jeff Bilmes and Katrin Kirchhoff SSLI Lab Department of Electrical Engineering.
Speech Enhancement Summer 2009
Fourier series With coefficients:.
Spectral Analysis Spectral analysis is concerned with the determination of the energy or power spectrum of a continuous-time signal It is assumed that.
Speech Processing AEGIS RET All-Hands Meeting
Image Analysis Image Restoration.
Outline Linear Shift-invariant system Linear filters
Two-Stage Mel-Warped Wiener Filter SNR-Dependent Waveform Processing
朝陽科技大學 資訊工程系 謝政勳 Application of GM(1,1) Model to Speech Enhancement and Voice Activity Detection 朝陽科技大學 資訊工程系 謝政勳
Ningping Fan, Radu Balan, Justinian Rosca
A Tutorial on Bayesian Speech Feature Enhancement
Gary Margrave and Michael Lamoureux
6. Time and Frequency Characterization of Signals and Systems
Wiener Filtering: A linear estimation of clean signal from the noisy signal Using MMSE criterion.
- Final project of digital signal processing
Lec.6:Discrete Fourier Transform and Signal Spectrum
Digital Image Processing Lecture 11: Image Restoration
INTRODUCTION TO ADVANCED DIGITAL SIGNAL PROCESSING
Presenter: Shih-Hsiang(士翔)
Combination of Feature and Channel Compensation (1/2)
Speech Enhancement Based on Nonparametric Factor Analysis
Presentation transcript:

Reduction of Additive Noise in the Digital Processing of Speech Avner Halevy AMSC 663 Mid Year Progress Report December 2008 Professor Radu Balan 1

Brief Reminder Goal: reduce additive white Gaussian noise degrading a speech signal Use short time analysis (frames) Algorithms: – Spectral subtraction (done) – Iterative Wiener filtering (next) Test objectively in lab conditions 2

Spectral Subtraction Estimate the magnitude of the noise spectrum when speech is absent Subtract the estimate from the magnitude of the spectrum of the noisy signal Keep the noisy phase Inverse Fourier to obtain enhanced signal in time domain Underlying statistical assumptions 3

Spectral Subtraction 4

SS – “Filtering” View 5

Spectral Subtraction Implemented Implementation issues – Choice of analysis and synthesis windows – Ensuring nonnegative magnitude – Estimation of noise spectrum – Choice of exponent Validation Variations 6

Choice of Windows 7

SS – Preliminary Results 8

Time (sec) Frequency (Hz) 9 Clean

SS – Preliminary Results Clean Noisy 10

SS – Preliminary Results 11

SS – Preliminary Results Noisy Enhanced 12

Evaluation of Exponent Values tested: p =.1,.5, 1, 1.5, 2, 3, 4, 5 13

Testing TIMIT – database of 6300 sentences for evaluation of speech processing algorithms Segmental SNR Segmental “Filtering” SNR Segmental “Filtering” distortion Future: Perceptual Evaluation of Speech Quality (PESQ) – telecomm standard 14

Shortcomings of SS Using the noisy phase (in low SNR) “musical noise” artifacts caused by – Error in noise spectrum estimation – Flooring of negative components – Fluctuations in signal spectrum Solutions: – Over subtraction – Smoothing of signal spectrum 15

Dealing with “Musical Noise” Over subtraction Smoothing signal spectrum 16

Next Steps Test more signals Evaluate using PESQ Implement Wiener filtering Further testing Compare performance 17

Bibliography [1] Deller, J., Hansen, J., and Proakis, J. (2000) Discrete Time Processing of Speech Signals, New York, NY: Institute of Electrical and Electronics Engineers [2] Quatieri, T. (2002) Discrete Time Speech Signal Processing, Upper Saddle River, NJ: Prentice Hall [3] Loizou, P. (2007) Speech Enhancement: Theory and Practice, Boca Raton, FL: Taylor & Francis Group [4] Rabiner, L., Schafer, R. (1978) Digital Processing of Speech Signals, Englewood Cliffs, NJ: Prentice Hall 18