PART II: TRANSIENT SUPPRESSION
IntroductionIntroduction Cohen, Gannot and Talmon\11 2 Transient Interference Suppression Transient Interference Suppression Transient is an abrupt or impulsive sound followed by decaying oscillations, e.g. keyboard typing and door knocking Common single-channel speech enhancement algorithms are not suitable for the abrupt nature of transients – For example: spectral subtraction [boll, 79’], decision directed [Ephraim & Malah, 84’], LSA [Ephraim & Malah, 85’], OM-LSA [Cohen & Berdugo, 01’] Noisy Speech Enhanced via OM-LSA
IntroductionIntroduction PART II: TRANSIENT SUPPRESSION Cohen, Gannot and Talmon\11 3 Problem Formulation Problem Formulation Let denote a speech signal and let and be contaminating transient interference and stationary noise The signal measured by a microphone is given by The transient interference is where – is a sequence of impulses of varying amplitudes – is an impulse response that characterizes the transient interference type.
IntroductionIntroduction PART II: TRANSIENT SUPPRESSION Cohen, Gannot and Talmon\11 4 Problem Formulation Problem Formulation Let denote the STFT of the measured signal in time frame and frequency bin where, and are the STFT of, and We use analysis and synthesis windows of length and time shift The transient interference in the STFT domain is where is the STFT of, and is the band-to-band filter of
Transient Interference Modeling PART II: TRANSIENT SUPPRESSION Cohen, Gannot and Talmon\11 5 Statistical Model Statistical Model We propose a statistical model of the band-to-band filters – denotes the decay rate of the filter – and are zero-mean mutually i.i.d. Gaussian random variables, representing the abrupt and the decaying parts of the transient Let and be the spectral variances The spectral variance of the filter
Transient Interference Modeling PART II: TRANSIENT SUPPRESSION Cohen, Gannot and Talmon\11 6 Statistical Model Statistical Model We model the spectral variance as a fixed value across the frequency bins, determined by the impulse amplitude Let denote the set of time frame indices that consist of an impulse where are i.i.d. positive random variables with mean and variance The speech is uncorrelated with the transient amplitude Thus, the spectral variance of the measurements is given by where, and
Transient Enhancement PART II: TRANSIENT SUPPRESSION Cohen, Gannot and Talmon\11 7 Transient Enhancement Using OM-LSA Transient Enhancement Using OM-LSA Employ the MCRA [Cohen & Berdugo, ’02] to estimate the PSD of the slower speech – Use short time frames ( ) to reduce the variations of the speech between sequential frames – Carry out temporal smoothing with small recursion parameter Use two sliding windows to capture speech phoneme onsets where As the stationary noise is slowly varying w.r.t speech, speech is slowly varying w.r.t transient interference.
Transient Enhancement PART II: TRANSIENT SUPPRESSION Cohen, Gannot and Talmon\11 8 Transient Enhancement Using OM-LSA Transient Enhancement Using OM-LSA Let be the spectral gain of the OM-LSA estimator, based on the two-sliding windows and fast recursion Initial estimation of the transient interference
Transient Enhancement PART II: TRANSIENT SUPPRESSION Cohen, Gannot and Talmon\11 9 Abrupt and Oscillatory Parts Estimation Abrupt and Oscillatory Parts Estimation Adapt a statistical model for room reverberation [Habets et al, 09’] – Exponentially decaying spectral envelope – Random oscillations Extract the spectral variance of the decaying part
Transient Enhancement PART II: TRANSIENT SUPPRESSION Cohen, Gannot and Talmon\11 10 Abrupt and Oscillatory Parts Estimation Abrupt and Oscillatory Parts Estimation Assume and are mutually independent It can be represented as with and [Habets et al., ’09]
Transient Enhancement PART II: TRANSIENT SUPPRESSION Cohen, Gannot and Talmon\11 11 Experimental Results Experimental Results Clean transient eventNoisy speech Enhanced transient Via OM-LSA Estimated decaying part