Download presentation
Presentation is loading. Please wait.
Published byHomer Neal Modified over 8 years ago
2
Statistical Signal Processing Research Laboratory(SSPRL) UT Acoustic Laboratory(UTAL) A TWO-STAGE DATA-DRIVEN SINGLE MICROPHONE SPEECH ENHANCEMENT WITH CEPSTRAL ANALYSIS PRE-PROCESSING Yu Rao, Chetan Vahanesa, Chandan K.A. Reddy, Issa M. S. Panahi
3
Statistical Signal Processing Research Laboratory(SSPRL) UT Acoustic Laboratory(UTAL) Outline of the presentation 1.Introduction 2.Review of temporal Cepstral smoothing method 3.Proposed method 4.Experimental results and performance evaluation 5.Real-time implementation 6.Conclusion 2 This research was supported by NIH-NIDCD Project No: 1R56DC014020-01
4
Statistical Signal Processing Research Laboratory(SSPRL) UT Acoustic Laboratory(UTAL) Problem Statement – We are living in the environment which is surrounded by different types of noise. Sometimes these noise will have negative effect in our daily lives. – Conventional single microphone speech enhancement methods do not perform well in all types of noise and may generate musical tones in some conditions. Sometimes this may degrade device’s performance. 3 1. Introduction PHOTO COURTESY: www.ttsrelief.com, www.volpe.dot.gov
5
Statistical Signal Processing Research Laboratory(SSPRL) UT Acoustic Laboratory(UTAL) 4 [1] C. Breithaupt, T. Gerkmann and R. Martin, “A novel a priori SNR estimation approach based on selective Cepstro-temporal smoothing,” in Proceeding IEEE International Conference on Acoustic, Speech and Signal Processing, ICASSP 2008, pp. 4897-4900, April. 2008. 2. Review of temporal Cepstral smoothing method [1]
6
Statistical Signal Processing Research Laboratory(SSPRL) UT Acoustic Laboratory(UTAL) 5 Figure 1. Block diagram of first stage 3. Proposed method 1. TCS 2.A-Priori & Posteriori SNR Estimation 4. MMSE-LSA Estimator 3. Lookup Table 1
7
Statistical Signal Processing Research Laboratory(SSPRL) UT Acoustic Laboratory(UTAL) 6 Figure 1. Block diagram of first stage 1. TCS 2.A-Priori & Posteriori SNR Estimation 4. MMSE-LSA Estimator 3. Lookup Table 1 [2] J. S. Erkelens and R. Heusdens, “Tracking of nonstationary noise based on data-driven recursive noise power estimation,” IEEE Trans., Audio, Speech and Lang. Process., vol. 16, no. 6, pp. 1112-1123, Aug, 2008 [3] Y. Ephraim and D. Malah, “Speech enhancement using a minimum mean-square error log-spectral amplitude estimator,” IEEE Trans., Acoust., Speech and Signal Process., vol.33, no. 2, pp. 443-445, Apr. 1985.
8
Statistical Signal Processing Research Laboratory(SSPRL) UT Acoustic Laboratory(UTAL) 7 Figure 2. Block diagram of second stage 1.A-Priori & Posteriori SNR Estimation 3. MMSE-LSA Estimator 2. Lookup Table 2 [2] J. S. Erkelens and R. Heusdens, “Tracking of nonstationary noise based on data-driven recursive noise power estimation,” IEEE Trans., Audio, Speech and Lang. Process., vol. 16, no. 6, pp. 1112-1123, Aug, 2008
9
Statistical Signal Processing Research Laboratory(SSPRL) UT Acoustic Laboratory(UTAL) 8 4. Experimental results and performance evaluation Driving-CarWhiteSpeech-Shaped Figure 3. PESQ and NAL comparison MMSE-LSA using VAD based decision-directed method (DD), MMSE-LSA using data-driven recursive noise power tracking method (RNPT), proposed two-stage speech enhancement (PP)
10
Statistical Signal Processing Research Laboratory(SSPRL) UT Acoustic Laboratory(UTAL) 9 4. Experimental results and performance evaluation
11
Statistical Signal Processing Research Laboratory(SSPRL) UT Acoustic Laboratory(UTAL) 10 Figure 4. Block diagram of the proposed method 5. Real-time implementation on smartphone Figure 5. Smartphone screenshot
12
Statistical Signal Processing Research Laboratory(SSPRL) UT Acoustic Laboratory(UTAL) The main contributions of this work are listed as follows: Proposing a two stage speech enhancement algorithm using Temporal Cepstral smoothing method as pre-processing. Comparing the objective measurement result with the well-known single microphone speech enhancement method Introducing a real-time frame work of the proposing method and its real-time implementation on smartphone 11 5. Conclusion
13
Statistical Signal Processing Research Laboratory(SSPRL) UT Acoustic Laboratory(UTAL) Thank you! For your time and participation 12
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.