Speech Enhancement Algorithm for Digital Hearing Aids

Slides:



Advertisements
Similar presentations
Noise in Radiographic Imaging
Advertisements

DCSP-13 Jianfeng Feng
Acoustic/Prosodic Features
Figures for Chapter 7 Advanced signal processing Dillon (2001) Hearing Aids.
Hearing relative phases for two harmonic components D. Timothy Ives 1, H. Martin Reimann 2, Ralph van Dinther 1 and Roy D. Patterson 1 1. Introduction.
1 A Spectral-Temporal Method for Pitch Tracking Stephen A. Zahorian*, Princy Dikshit, Hongbing Hu* Department of Electrical and Computer Engineering Old.
Digital Coding of Analog Signal Prepared By: Amit Degada Teaching Assistant Electronics Engineering Department, Sardar Vallabhbhai National Institute of.
Multipitch Tracking for Noisy Speech
Computational Rhythm and Beat Analysis Nick Berkner.
EE513 Audio Signals and Systems Digital Signal Processing (Synthesis) Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Page 0 of 34 MBE Vocoder. Page 1 of 34 Outline Introduction to vocoders MBE vocoder –MBE Parameters –Parameter estimation –Analysis and synthesis algorithm.
Source Localization in Complex Listening Situations: Selection of Binaural Cues Based on Interaural Coherence Christof Faller Mobile Terminals Division,
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Newton’s Method Application to LMS Recursive Least Squares Exponentially-Weighted.
Dual Tone Multi-Frequency System Michael Odion Okosun Farhan Mahmood Benjamin Boateng Project Participants: Dial PulseDTMF.
Communications & Multimedia Signal Processing Meeting 7 Esfandiar Zavarehei Department of Electronic and Computer Engineering Brunel University 23 November,
Single-Channel Speech Enhancement in Both White and Colored Noise Xin Lei Xiao Li Han Yan June 5, 2002.
Digital Voice Communication Link EE 413 – TEAM 2 April 21 st, 2005.
1 New Technique for Improving Speech Intelligibility for the Hearing Impaired Miriam Furst-Yust School of Electrical Engineering Tel Aviv University.
EE513 Audio Signals and Systems Wiener Inverse Filter Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
EE513 Audio Signals and Systems Noise Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Digital to Analogue Conversion Natural signals tend to be analogue Need to convert to digital.
EE513 Audio Signals and Systems Statistical Pattern Classification Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Speech Segregation Based on Sound Localization DeLiang Wang & Nicoleta Roman The Ohio State University, U.S.A. Guy J. Brown University of Sheffield, U.K.
Normalization of the Speech Modulation Spectra for Robust Speech Recognition Xiong Xiao, Eng Siong Chng, and Haizhou Li Wen-Yi Chu Department of Computer.
GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.
SINGLE CHANNEL SPEECH MUSIC SEPARATION USING NONNEGATIVE MATRIXFACTORIZATION AND SPECTRAL MASKS Jain-De,Lee Emad M. GraisHakan Erdogan 17 th International.
A VOICE ACTIVITY DETECTOR USING THE CHI-SQUARE TEST
Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission.
METHODOLOGY INTRODUCTION ACKNOWLEDGEMENTS LITERATURE Low frequency information via a hearing aid has been shown to increase speech intelligibility in noise.
EE Audio Signals and Systems Digital Signal Processing (Synthesis) Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Signals CY2G2/SE2A2 Information Theory and Signals Aims: To discuss further concepts in information theory and to introduce signal theory. Outcomes:
1 Information in Continuous Signals f(t) t 0 In practice, many signals are essentially analogue i.e. continuous. e.g. speech signal from microphone, radio.
Gammachirp Auditory Filter
Analysis of Optimal and Suboptimal Discrete-Time Digital Communications Receivers Clemson University SURE Program 2005 Justin Ingersoll, Prof. Michael.
Study of Broadband Postbeamformer Interference Canceler Antenna Array Processor using Orthogonal Interference Beamformer Lal C. Godara and Presila Israt.
P. N. Kulkarni, P. C. Pandey, and D. S. Jangamashetti / DSP 2009, Santorini, 5-7 July DSP 2009 (Santorini, Greece. 5-7 July 2009), Session: S4P,
Chapter 11 Filter Design 11.1 Introduction 11.2 Lowpass Filters
A Semi-Blind Technique for MIMO Channel Matrix Estimation Aditya Jagannatham and Bhaskar D. Rao The proposed algorithm performs well compared to its training.
ES97H Biomedical Signal Processing
Spectrum Sensing In Cognitive Radio Networks
A. R. Jayan, P. C. Pandey, EE Dept., IIT Bombay 1 Abstract Perception of speech under adverse listening conditions may be improved by processing it to.
By Sarita Jondhale 1 Signal preprocessor: “conditions” the speech signal s(n) to new form which is more suitable for the analysis Postprocessor: operate.
Performance of Digital Communications System
Statistical Signal Processing Research Laboratory(SSPRL) UT Acoustic Laboratory(UTAL) SINGLE CHANNEL SPEECH ENHANCEMENT TECHNIQUE FOR LOW SNR QUASI-PERIODIC.
语音与音频信号处理研究室 Speech and Audio Signal Processing Lab Multiplicative Update of AR gains in Codebook- driven Speech.
UNIT-IV. Introduction Speech signal is generated from a system. Generation is via excitation of system. Speech travels through various media. Nature of.
A diagnosis method of DC/DC converter aging based on the variation of parasitic resistor 1/11 1. Introduction The power converter system supplies power.
[1] National Institute of Science & Technology Technical Seminar Presentation 2004 Suresh Chandra Martha National Institute of Science & Technology Audio.
Speech Enhancement Summer 2009
4aPPa32. How Susceptibility To Noise Varies Across Speech Frequencies
3.1 Introduction Why do we need also a frequency domain analysis (also we need time domain convolution):- 1) Sinusoidal and exponential signals occur.
Digital Communications Chapter 13. Source Coding
Vocoders.
Speech Enhancement with Binaural Cues Derived from a Priori Codebook
Filtering Geophysical Data: Be careful!
Subject Name: Digital Communication Subject Code:10EC61
Uniform Linear Array based Spectrum Sensing from sub-Nyquist Samples
朝陽科技大學 資訊工程系 謝政勳 Application of GM(1,1) Model to Speech Enhancement and Voice Activity Detection 朝陽科技大學 資訊工程系 謝政勳
By Viput Subharngkasen
EE513 Audio Signals and Systems
On the Design of RAKE Receivers with Non-uniform Tap Spacing
Sampling Theorems- Nyquist Theorem and Shannon-Hartley Theorem
Chapter 7 Finite Impulse Response(FIR) Filter Design
9.5 Least-Squares Digital Filters
Tania Stathaki 811b LTI Discrete-Time Systems in Transform Domain Simple Filters Comb Filters (Optional reading) Allpass Transfer.
Dealing with Acoustic Noise Part 1: Spectral Estimation
Lec.6:Discrete Fourier Transform and Signal Spectrum
Emad M. Grais Hakan Erdogan
Presenter: Shih-Hsiang(士翔)
Combination of Feature and Channel Compensation (1/2)
Presentation transcript:

Speech Enhancement Algorithm for Digital Hearing Aids on the basis of Auditory Scene Analysis Masahiro Sunohara, Kiyoaki Terada, Takatoshi Okuno & Takashi Iwakura, RION CO., LTD gp,0 gp,1 gp,2 gp,3 F0 Computer Simulation Introduction Japanese female speech with white noise weighted by speech power spectrum (SN +10dB). Normal hearing people have an instinct ability to segregate complexly-mixed sound. That’s why they can communicate with others in a noisy environments. The ability has been summarized as the basis of psychoacoustical knowledge called Auditory Scene Analysis: ASA (Bregman 1990). Most of the hearing impaired people have a difficulty in understanding speech in a noisy environments even if they wear their hearing aids… It can be assumed that the ability of ASA for the hearing impaired people is getting worse compared with the normal listener. We propose a new speech enhancement algorithm which can help the impaired ability of ASA for better understandings of speech in noise. In our system, the fundamental frequency F0 and its harmonics for speech are enhanced with a low computational cost. Fig.3 A result of the F 0 extraction for speech plotted on the spectrogram. (Japanese male speech) However … Time-variant flexible comb filter Fig.8 Waveforms of input (blue) and enhanced signal (red). Frequency [Hz] Magnitude [dB] Time [sec] Fig.5 Spectrum of the comb filter and multi-level filterbank. To enhance the F0 and its harmonics, a time-variant flexible comb filter is designed to deliver a natural enhanced sound with a low computational cost. Fig.4 shows a block diagram of the filter. Controlling time-variant gains To improve the hearing ability … The optimal characteristic for the comb filter is reconsidered from the perspective of the Wiener filter shown in Fig.6. (Xp(z)) ① ② Fig.9 3D views of the response of comb filter (left) , spectrogram of input (center) and enhanced signal (right). (H(jω)) Fig.4 Block diagram of a flexible comb filter. Evaluation The N-band multi-level filterbank in Fig.4 consists of a tree structured allpass filters. The transfer functions HM(z) from Xp(z) to each Cp,n(z) is defined as follows. Fig.6 Block diagram of the speech enhancement method from the perspective of the Wiener filter. The prototype hearing aid with the algorithm are prepared. The word intelligibility test for 6 hearing impaired subjects with prototypes is carried out. Sound source is 20 test words (4 moras) with a multi-talker noise so that SN ratio is +10 dB. Fig.1 Block diagram of the speech enhancement system. This algorithm involves two processing sections such as … ① F0 extraction for speech ② Time-variant flexible comb filter The amplitude of the enhancement filter |H(jωp)| for the peak is estimated by minimizing the mean-squared error E[e2(n)] along with the Wiener-Hopf equation and Wiener-Khinchin theorem as (1) (4) F 0 extraction where Rl is an l×l Hadamard matrix, denotes the Kronecker product of matrices, Ak(z) is an allpass filter providing the doubly complementary filter pairs. The entire transfer function C(z) of the comb filter is defined as Since Eq.(4) is a simple theoretical equation, the gain fluctuates rapidly when SNR stays around 0dB. To prevent this, the amplitude characteristic is modified using the sigmoid function as follows Improvement score [%] An algorithm of the F0 extraction is proposed based on spectro- temporal approach. Computational cost of the method is rather small because of curtailing the use of the autocorrelation function. First, an input signal is filtered through the filterbank consisted of 4 bandpass filters whose center frequencies are 145, 197, 256 and 323 Hz. Next, the filtered signals are quantized to binary sequences. The candidates of F0 are obtained by detecting the period of rising edges from binary sequences. Finally, the most likely candidate is chosen via a conditional branching. Fig.3 shows a result of F0s. (2) (5) Fig.10 Improvement scores for each subject. The word intelligibility score for 5 out of 6 subjects are improved or almost same. However, score of sub.F is dropped. Some subjects complain of the whistle sound or echoic feeling. where L is the number of samples corresponding to a reciprocal of F0. The amplitude characteristic |C(ejωpT)| of the comb filter at each peak frequency is as follows, Fig.7 shows the above two gain functions calculated from Eq.(4) and (5). Conclusions (3) A speech enhancement algorithm for digital hearing aid is proposed. The F0 of speech can be estimated robustly by the algorithm. The F0 and its harmonics can be enhanced naturally by the use of the proposed comb filter. The evaluation results show that the word intelligibility scores are slightly improved in a noisy case. Eq.(3) indicates that the amplitude characteristics of each peak correspond to the one of the multi-level filterbank. The amplitudes are determined by the gain vector gp only, and is completely independent of the gd. Fig. 5 shows the amplitude spectrum of the comb filter derived from the Eq.(3) and multi-level filterbank. Fig.2 Block diagram of the F 0 extraction. Fig.7 Two gains as a function of the SNR .