Communications & Multimedia Signal Processing 1 Speech Communication for Mobile and Hands-Free Devices in Noisy Environments EPSRC Project GR/S30238/01.

Slides:



Advertisements
Similar presentations
Figures for Chapter 7 Advanced signal processing Dillon (2001) Hearing Aids.
Advertisements

Robust Speech recognition V. Barreaud LORIA. Mismatch Between Training and Testing n mismatch influences scores n causes of mismatch u Speech Variation.
AVQ Automatic Volume and eQqualization control Interactive White Paper v1.6.
Advanced Speech Enhancement in Noisy Environments
Topics Recognition results on Aurora noisy speech databaseRecognition results on Aurora noisy speech database Proposal of robust formant.
Page 0 of 34 MBE Vocoder. Page 1 of 34 Outline Introduction to vocoders MBE vocoder –MBE Parameters –Parameter estimation –Analysis and synthesis algorithm.
2004 COMP.DSP CONFERENCE Survey of Noise Reduction Techniques Maurice Givens.
Speaker Recognition Sharat.S.Chikkerur Center for Unified Biometrics and Sensors
Itay Ben-Lulu & Uri Goldfeld Instructor : Dr. Yizhar Lavner Spring /9/2004.
Xkl: A Tool For Speech Analysis Eric Truslow Adviser: Helen Hanson.
Zhiyao Duan, Gautham J. Mysore, Paris Smaragdis 1. EECS Department, Northwestern University 2. Advanced Technology Labs, Adobe Systems Inc. 3. University.
Advances in WP1 Turin Meeting – 9-10 March
Advances in WP1 Nancy Meeting – 6-7 July
Communications & Multimedia Signal Processing Frequency Kalman Noise Reduction Esfandiar Zavarehei Department of Electronic and Computer Engineering Brunel.
Communications & Multimedia Signal Processing Meeting 6 Esfandiar Zavarehei Department of Electronic and Computer Engineering Brunel University 6 July,
HIWIRE MEETING Nancy, July 6-7, 2006 José C. Segura, Ángel de la Torre.
HIWIRE MEETING Torino, March 9-10, 2006 José C. Segura, Javier Ramírez.
Communications & Multimedia Signal Processing Report of Work on Formant Tracking LP Models and Plans on Integration with Harmonic Plus Noise Model Qin.
Communications & Multimedia Signal Processing Meeting 7 Esfandiar Zavarehei Department of Electronic and Computer Engineering Brunel University 23 November,
Communications & Multimedia Signal Processing Analysis of the Effects of Train noise on Recognition Rate using Formants and MFCC Esfandiar Zavarehei Department.
Single-Channel Speech Enhancement in Both White and Colored Noise Xin Lei Xiao Li Han Yan June 5, 2002.
Speech Enhancement Based on a Combination of Spectral Subtraction and MMSE Log-STSA Estimator in Wavelet Domain LATSI laboratory, Department of Electronic,
HIWIRE MEETING CRETE, SEPTEMBER 23-24, 2004 JOSÉ C. SEGURA LUNA GSTC UGR.
Communications & Multimedia Signal Processing Formant Based Synthesizer Qin Yan Communication & Multimedia Signal Processing Group Dept of Electronic.
Communications & Multimedia Signal Processing Formant Track Restoration in Train Noisy Speech Qin Yan Communication & Multimedia Signal Processing Group.
Speech Recognition in Noise
Communications & Multimedia Signal Processing Formant Tracking LP with Harmonic Plus Noise Model of Excitation for Speech Enhancement Qin Yan Communication.
Communications & Multimedia Signal Processing Refinement in FTLP-HNM system for Speech Enhancement Qin Yan Communication & Multimedia Signal Processing.
Analysis & Synthesis The Vocoder and its related technology.
Advances in WP1 and WP2 Paris Meeting – 11 febr
1 Speech Enhancement Wiener Filtering: A linear estimation of clean signal from the noisy signal Using MMSE criterion.
Communications & Multimedia Signal Processing Analysis of Effects of Train/Car noise in Formant Track Estimation Qin Yan Department of Electronic and Computer.
Pitch Prediction for Glottal Spectrum Estimation with Applications in Speaker Recognition Nengheng Zheng Supervised under Professor P.C. Ching Nov. 26,
A VOICE ACTIVITY DETECTOR USING THE CHI-SQUARE TEST
„Bandwidth Extension of Speech Signals“ 2nd Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction 22nd and 23rd June.
Nico De Clercq Pieter Gijsenbergh Noise reduction in hearing aids: Generalised Sidelobe Canceller.
Page 0 of 23 MELP Vocoders Nima Moghadam SN#: Saeed Nari SN#: Supervisor Dr. Saameti April 2005 Sharif University of Technology.
Reporter: Shih-Hsiang( 士翔 ). Introduction Speech signal carries information from many sources –Not all information is relevant or important for speech.
Reduction of Additive Noise in the Digital Processing of Speech Avner Halevy AMSC 663 Mid Year Progress Report December 2008 Professor Radu Balan 1.
Ekapol Chuangsuwanich and James Glass MIT Computer Science and Artificial Intelligence Laboratory,Cambridge, Massachusetts 02139,USA 2012/07/2 汪逸婷.
Speech Signal Representations I Seminar Speech Recognition 2002 F.R. Verhage.
Authors: Sriram Ganapathy, Samuel Thomas, and Hynek Hermansky Temporal envelope compensation for robust phoneme recognition using modulation spectrum.
VOCODERS. Vocoders Speech Coding Systems Implemented in the transmitter for analysis of the voice signal Complex than waveform coders High economy in.
Noise Reduction Two Stage Mel-Warped Weiner Filter Approach.
In-car Speech Recognition Using Distributed Microphones Tetsuya Shinde Kazuya Takeda Fumitada Itakura Center for Integrated Acoustic Information Research.
[Advanced] Speech & Audio Signal Processing ES 157/257: Speech and Audio Processing Prof. Patrick Wolfe, Harvard DEAS 02 February 2006.
(Extremely) Simplified Model of Speech Production
IIT Bombay ICSCI 2004, Hyderabad, India, Feb’ 04 Introduction Analysis / synthesis Spec. Sub. Methodology Results Conclusion and.
EEL 6586: AUTOMATIC SPEECH PROCESSING Speech Features Lecture Mark D. Skowronski Computational Neuro-Engineering Lab University of Florida February 27,
RCC-Mean Subtraction Robust Feature and Compare Various Feature based Methods for Robust Speech Recognition in presence of Telephone Noise Amin Fazel Sharif.
A. R. Jayan, P. C. Pandey, EE Dept., IIT Bombay 1 Abstract Perception of speech under adverse listening conditions may be improved by processing it to.
EEL 6586: AUTOMATIC SPEECH PROCESSING Speech Features Lecture Mark D. Skowronski Computational Neuro-Engineering Lab University of Florida February 20,
1 Speech Compression (after first coding) By Allam Mousa Department of Telecommunication Engineering An Najah University SP_3_Compression.
语音与音频信号处理研究室 Speech and Audio Signal Processing Lab Multiplicative Update of AR gains in Codebook- driven Speech.
UNIT-IV. Introduction Speech signal is generated from a system. Generation is via excitation of system. Speech travels through various media. Nature of.
AVQ Automatic Volume and eQualization control
High Quality Voice Morphing
Speech Enhancement Summer 2009
Vocoders.
AVQ Automatic Volume and eQqualization control
Linear Prediction.
1 Vocoders. 2 The Channel Vocoder (analyzer) : The channel vocoder employs a bank of bandpass filters,  Each having a bandwidth between 100 HZ and 300.
ON THE ARCHITECTURE OF THE CDMA2000® VARIABLE-RATE MULTIMODE WIDEBAND (VMR-WB) SPEECH CODING STANDARD Milan Jelinek†, Redwan Salami‡, Sassan Ahmadi*, Bruno.
The Vocoder and its related technology
Two-Stage Mel-Warped Wiener Filter SNR-Dependent Waveform Processing
Vocoders.
Wiener Filtering: A linear estimation of clean signal from the noisy signal Using MMSE criterion.
Presenter: Shih-Hsiang(士翔)
Combination of Feature and Channel Compensation (1/2)
Presentation transcript:

Communications & Multimedia Signal Processing 1 Speech Communication for Mobile and Hands-Free Devices in Noisy Environments EPSRC Project GR/S30238/01 Brunel, Southampton, UEA, Queens, TTPCOM Project meeting 28 Jan 04

Communications & Multimedia Signal Processing 2 De-noising Speech Qin Yan, Vaseghi Analysis-synthesis restoration with performance indicators Speech Recognition in Noise Esi Zavaeri, Vaseghi Focus on robust features and minimum discrimination error in noise Project Sections Speaker Adaptation and Recognition in Noise Focus on Speaker features and adaptation in noise Jon Darch, Milner Modelling Non-stationary Multi-source Noise Environments Ioannis, Paul White Analysis, Modelling, Classification, Prediction of noise and Environment assessment

Communications & Multimedia Signal Processing 3 De-noising Speech: Main Features Noise Spectral-Time Tracker (Ioannis with Qin Yan) Estimates of noise tracks during speech inactive/active periods Noise tracks are dominant noise frequency channels Noisy Speech spectral-time Tracker Estimates of speech tracks in noise Estimates of speech signal to noise ratio (SNRs) across tracks Performance Measures, Monitors and SNR Guarantees De-noising LP model poles Pitch harmonics tracking in noise VAD based on spectral distance from noise, formants and pitch track Glottal Pulse Estimation (Emir Turajlic)

Communications & Multimedia Signal Processing 4 VAD Noise Reduction LP-SS Noise Model Templates Noise Formant Tracking LP pole Estimation Pitch Tracking Excitation Estimation Signal Re- composition Noisy Speech Enhanced Speech Noisy Model Perceptual Model A block Diagram of Noisy Speech Processing System Version 1.0 Speech and Noise Model Decomposition Work plan for Feb-May 04 Noise modelled with LP spectral template or GMM Speech LP spectra obtained from LP spectral subtraction Utilisation of LP predictions across formant tracks `Pitch tracking and excitation denoising Restoration via Wiener filtering or LP re-composition

Communications & Multimedia Signal Processing 5 VAD Noise Reduction LP-SS Noise Model Templates Noise Time-averaged Adaptive Noise LP Spectra (current work) Estimated from speech inactive periods (VAD) Statistics of update periods for mobiles, hands-free (with head phones and with loudspeakers on) (Ioannis) GMM of Noise LP-Spectra resulting in a several noise templates The issue of predictability of noise tracks and finite-state noise models Noise Models LP AnalysisLP Spectrum Averaging or GMM [a][a] N LP (f) Noise Models Noise

Communications & Multimedia Signal Processing 6 Spectrograms of Car Noise Spectrogram of Car Noise shows noise tracks LP Spectrogram of Car Noise

Communications & Multimedia Signal Processing 7 Spectrograms of Train Noise Spectrogram of train Noise LP Spectrogram of train Noise

Communications & Multimedia Signal Processing 8 VAD In Noise Noise Reduction LP-SS Noise Model Templates Noise LP-based Spectrum NL Spectral Subtraction LP spectrum Smoothing Input Noisy Speech Pole-Track Smoothing Noise Model Templates Speech Spectral Tracks LP Spectral Envelope LP Model Coeffs Illustration of spectral subtraction for removing noise from LP model spectrum Speech spectral tracks: Formant track and pitch track De-noising LP model spectrum

Communications & Multimedia Signal Processing 9 VAD Noise Reduction LP-SS Noise Model Templates Noise Formant Tracking LP pole Estimation Pitch Tracking Excitation Estimation Signal Re- composition Noisy Speech Enhanced Speech Noisy Model Perceptual Model Formant tracking in noise Predictive formant tracking to include dynamic parameters Speech state indicator of expected formant continuity LP pole estimation from formant tracks

Communications & Multimedia Signal Processing 10 Pitch Tracking Excitation Estimation Pitch tracking in noise Pitch tracking using adaptive band-pass filter centred on pitch estimate Pitch Harmonics tracking State-based excitation restoration Glottal pulse models

Communications & Multimedia Signal Processing 11 VAD In Noise Noise Reduction LP-SS Noise Model Templates Noise Formant Tracking LP pole Estimation Pitch Tracking Excitation Estimation Signal Re- composition Noisy Speech Enhanced Speech Noisy Model Perceptual Model Two alternative restoration methods: LP-based Wiener Filters Re-composition of de-noised LP model with de-noised Excitation Signal Restoration

Communications & Multimedia Signal Processing 12 Performance Indicators for de-noising Performance SNRs before and after de-noising Distortion measures and prediction Performance bounds and guarantees

Communications & Multimedia Signal Processing 13 Dissemination De-noising Website De-nosing software Database of noise with public access Invitation of industry to research meetings