2001/07/18Chin-Kai Wu, CS, NTHU1 A Voicing-Driven Packet Loss Recovery Algorithm for Analysis- by-Synthesis Predictive Speech Coders over Internet Jhing-Fa Wang, Fellow, IEEE, Jia-Ching Wang, Jar-Ferr Wang, and Jian-Jia Wang IEEE Transaction on Multimedia, VOL. 3, NO. 1, March 2001
2001/07/18Chin-Kai Wu, CS, NTHU2 Outline Introduction Analysis-by-Synthesis Predictive Coders Voicing-Driven Packet Loss Recovery Algorithm Multiresolution Excitation Generation Pulse Tracking Procedure Experimental Results Conclusion
2001/07/18Chin-Kai Wu, CS, NTHU3 Analysis-by-Synthesis Predictive Coders Codebook Index LPC parameter Total Excitation Gain aperiodic periodic
2001/07/18Chin-Kai Wu, CS, NTHU4 Voicing-Driven Packet Loss Recovery Algorithm nearest half previous frame
2001/07/18Chin-Kai Wu, CS, NTHU5 Multiresolution Excitation Generation Reason to use wavelet transform Good time-frequency localization Multiresolution characteristics Revealing some characteristics that other signal analysis techniques miss Two-stage two-band Wavelet analysis filter banks h(n): Lowpass filter coefficient g(n): Highpass filter coefficient c j : scaling coefficient d j : wavelet coefficient
2001/07/18Chin-Kai Wu, CS, NTHU6 Multiresolution Excitation Generation (Cont’d) voiced unvoiced
2001/07/18Chin-Kai Wu, CS, NTHU7 Multiresolution Excitation Generation (Cont’d)
2001/07/18Chin-Kai Wu, CS, NTHU8 Pulse Tracking Procedure Pulse Position Tracking Algorithm Pulse Amplitude Estimation Algorithm
2001/07/18Chin-Kai Wu, CS, NTHU9 Pulse Position Tracking Algorithm (Step 1) Find the maximum absolute value within the frame and denote its position as P (peak position) tracking Set all sample with opposite sign to zero
2001/07/18Chin-Kai Wu, CS, NTHU10 Pulse Position Tracking Algorithm (Step 2) Construct a clipper by defining the clipping level from the maximum absolute value Set samples below the clipping level to zero
2001/07/18Chin-Kai Wu, CS, NTHU11 Pulse Position Tracking Algorithm (Step 3) Record peak position P and set a clearing region to reset all the samples within the region to zero
2001/07/18Chin-Kai Wu, CS, NTHU12 Pulse Position Tracking Algorithm (Step 4) Repeat Step 1 to 3 to recorder all the recorded peaks according to the positions and denote them as {p 1, p 2, …, p Np }
2001/07/18Chin-Kai Wu, CS, NTHU13 Pulse Position Tracking Algorithm (Result)
2001/07/18Chin-Kai Wu, CS, NTHU14 Pulse Amplitude Estimation Algorithm (Step 1) Compute the trend slope associated with the envelope of the speech peaks where Np: Number of peaks Slope of
2001/07/18Chin-Kai Wu, CS, NTHU15 Pulse Amplitude Estimation Algorithm (Step 2) Estimate the pulse position of the missing frame. Let be the set of estimated pulses, where
2001/07/18Chin-Kai Wu, CS, NTHU16 Pulse Amplitude Estimation Algorithm (Step 3) Estimate the envelope of the speech peaks in the missing frame Track the amplitude of each excitation pulse
2001/07/18Chin-Kai Wu, CS, NTHU17 Pulse Amplitude Estimation Algorithm (Result)
2001/07/18Chin-Kai Wu, CS, NTHU18 Experimental Results
2001/07/18Chin-Kai Wu, CS, NTHU19 Experimental Results (Cont’d)
2001/07/18Chin-Kai Wu, CS, NTHU20 Conclusion Packet loss degrades the speech quality of the analysis-by-synthesis coders seriously since the loss parameters not only affect the current speech frame but also produce error propagation problem The proposed recovery algorithm estimates the excitation information in the missing frame more accurately due to the selection of different excitation generation models