2001/07/18Chin-Kai Wu, CS, NTHU1 A Voicing-Driven Packet Loss Recovery Algorithm for Analysis- by-Synthesis Predictive Speech Coders over Internet Jhing-Fa.

Slides:



Advertisements
Similar presentations
SURE-LET for Orthonormal Wavelet-Domain Video Denoising Florian Luisier, Member, IEEE, Thierry Blu, Senior Member, IEEE, and Michael Unser, Fellow, IEEE.
Advertisements

STQ Workshop, Sophia-Antipolis, February 11 th, 2003 Packet loss concealment using audio morphing Franck Bouteille¹ Pascal Scalart² Balazs Kövesi² ¹ PRESCOM.
A Robust Algorithm for Pitch Tracking David Talkin Hsiao-Tsung Hung.
Page 0 of 34 MBE Vocoder. Page 1 of 34 Outline Introduction to vocoders MBE vocoder –MBE Parameters –Parameter estimation –Analysis and synthesis algorithm.
2004 COMP.DSP CONFERENCE Survey of Noise Reduction Techniques Maurice Givens.
Itay Ben-Lulu & Uri Goldfeld Instructor : Dr. Yizhar Lavner Spring /9/2004.
Speech in Multimedia Hao Jiang Computer Science Department Boston College Oct. 9, 2007.
December 2006 Cairo University Faculty of Computers and Information HMM Based Speech Synthesis Presented by Ossama Abdel-Hamid Mohamed.
Haojie Li Jinhui Tang Si Wu Yongdong Zhang Shouxun Lin Automatic Detection and Analysis of Player Action in Moving Background Sports Video Sequences IEEE.
2001/02/15Chin-Kai Wu, CS, NTHU1 Adaptive Playout Mechanisms for Packetized Audio Applications in Wide-Area Networks Ramachandran Ramjee, Jim Kurose, Don.
Efficient Moving Object Segmentation Algorithm Using Background Registration Technique Shao-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Fellow, IEEE Hsin-Hua.
2000/10/31Chin-Kai Wu, CS, NTHU1 The Effect of Waveform Substitution on the Quality of PCM Packet Communications Ondria J. Wasem, David J. Goodman, Charles.
2002/04/18Chin-Kai Wu, CS, NTHU1 Jitter Control in QoS Network Yishay Mansour and Boaz Patt-Shamir IEEE/ACM TRANSACTIONS ON NETWORKING, VOL. 9, NO. 4,
Adaptive Rao-Blackwellized Particle Filter and It’s Evaluation for Tracking in Surveillance Xinyu Xu and Baoxin Li, Senior Member, IEEE.
Speech Enhancement Based on a Combination of Spectral Subtraction and MMSE Log-STSA Estimator in Wavelet Domain LATSI laboratory, Department of Electronic,
Wavelet Transform A very brief look.
2001/05/24Chin-Kai Wu, CS, NTHU1 Improved frame erasure concealment for CELP-based coders Juan Carlos De Martin, Takahiro Unno, Vishu Viswanathan DSPS.
Scalable Wavelet Video Coding Using Aliasing- Reduced Hierarchical Motion Compensation Xuguang Yang, Member, IEEE, and Kannan Ramchandran, Member, IEEE.
Communications & Multimedia Signal Processing Refinement in FTLP-HNM system for Speech Enhancement Qin Yan Communication & Multimedia Signal Processing.
A PRE-STUDY OF AUTOMATIC DETECTION OF LEP EVENTS ON THE VLF SİGNALS.
1 An Efficient Mode Decision Algorithm for H.264/AVC Encoding Optimization IEEE TRANSACTION ON MULTIMEDIA Hanli Wang, Student Member, IEEE, Sam Kwong,
1 An Efficient Method for DCT- Domain Image Resizing with Mixed Field/Frame-Mode Macroblocks Changhoon Yim and Michael A. Isnardi IEEE TRANSACTION ON CIRCUITS.
2001/11/29Chin-Kai Wu, CS, NTHU1 Characteristics of Network Delay and Delay Jitter and its Effect on Voice over IP Li Zheng, Liren Zhang, Dong Xu Communications,
2001/11/29Chin-Kai Wu, CS, NTHU1 Design and Simulation of an Efficient Real-Time Traffic Scheduler with Jitter and Delay Guarantees Fu-Ming Tsou, Hong-Bin.
2001/03/29Chin-Kai Wu, CS, NTHU1 Speech and Language Technologies for Audio Indexing and Retrieval JOHN MAKHOUL, FELLOW, IEEE, FRANCIS KUBALA, TIMOTHY.
2001/09/12Chin-Kai Wu, CS, NTHU1 Dynamic Resource Management Considering the Real Behavior of Aggregate Traffic José R. Gallardo, Dimitrios Makrakis, and.
A Concealment Method for Shape Information in MPEG-4 Coded Video Sequences Shahram Shirani, Berna Erol, and Faouzi Kossentini IEEE TRANSACTIONS ON MULTIMEDIA,
ACCURATE TELEMONITORING OF PARKINSON’S DISEASE SYMPTOM SEVERITY USING SPEECH SIGNALS Schematic representation of the UPDRS estimation process Athanasios.
The Wavelet Tutorial: Part3 The Discrete Wavelet Transform
1 A Portable Tele-Emergent System With ECG Discrimination in SCAN Devices Speaker : Ren-Guey Lee Date : 2004 Auguest 25 B.E. LAB National Taipei University.
Multimedia Specification Design and Production 2013 / Semester 2 / week 3 Lecturer: Dr. Nikos Gazepidis
1 CS 551/651: Structure of Spoken Language Lecture 8: Mathematical Descriptions of the Speech Signal John-Paul Hosom Fall 2008.
Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission.
UNIVERSITÉ DE SHERBROOKE - Philippe G OURNAY Senior Research Engineer VoiceAge Corporation University of Sherbrooke François R OUSSEAU, Roch L EFEBVRE.
Page 0 of 23 MELP Vocoders Nima Moghadam SN#: Saeed Nari SN#: Supervisor Dr. Saameti April 2005 Sharif University of Technology.
Chapter 16 Speech Synthesis Algorithms 16.1 Synthesis based on LPC 16.2 Synthesis based on formants 16.3 Synthesis based on homomorphic processing 16.4.
Speech Coding Submitted To: Dr. Mohab Mangoud Submitted By: Nidal Ismail.
Blind Pattern Matching Attack on Watermark Systems D. Kirovski and F. A. P. Petitcolas IEEE Transactions on Signal Processing, VOL. 51, NO. 4, April 2003.
Layered Coding Basic Overview. Outline Pyramidal Coding Scalability in the Standard Codecs Layered Coding with Wavelets Conclusion.
1 Linear Prediction. Outline Windowing LPC Introduction to Vocoders Excitation modeling  Pitch Detection.
Steganalysis of audio: attacking the Steghide
Scattered MIMO Pilot Allocation using cyclic shift IEEE Presentation Submission Template (Rev. 9) Document Number: IEEE S802.16m-08/068, Scattered.
Submitted By: Santosh Kumar Yadav (111432) M.E. Modular(2011) Under the Supervision of: Mrs. Shano Solanki Assistant Professor, C.S.E NITTTR, Chandigarh.
Wireless communications and mobile computing conference, p.p , July 2011.
ECE 5525 Osama Saraireh Fall 2005 Dr. Veton Kepuska
VOCODERS. Vocoders Speech Coding Systems Implemented in the transmitter for analysis of the voice signal Complex than waveform coders High economy in.
CCN COMPLEX COMPUTING NETWORKS1 This research has been supported in part by European Commission FP6 IYTE-Wireless Project (Contract No: )
ITU-T G.729 EE8873 Rungsun Munkong March 22, 2004.
by Mitchell D. Swanson, Bin Zhu, and Ahmed H. Tewfik
1.INTRODUCTION The use of the adaptive codebook (ACB) in CELP-like speech coders allows the achievement of high quality speech, especially for voiced segments.
Factors affecting radar range resolution EECS 725 Introduction to Radar Presented by Susobhan Das.
Inen 460 Lecture 2. Estimation (ch. 6,7) and Hypothesis Testing (ch.8) Two Important Aspects of Statistical Inference Point Estimation – Estimate an unknown.
1 Watermarking Scheme Capable of Resisting Sensitivity Attack IEEE signal processing letters, vol. 14, no. 2, February. 2007, pp Xinpeng Zhang.
Present document contains informations proprietary to France Telecom. Accepting this document means for its recipient he or she recognizes the confidential.
APPLICATION OF A WAVELET-BASED RECEIVER FOR THE COHERENT DETECTION OF FSK SIGNALS Dr. Robert Barsanti, Charles Lehman SSST March 2008, University of New.
Chapter 20 Speech Encoding by Parameters 20.1 Linear Predictive Coding (LPC) 20.2 Linear Predictive Vocoder 20.3 Code Excited Linear Prediction (CELP)
By Sarita Jondhale 1 Signal preprocessor: “conditions” the speech signal s(n) to new form which is more suitable for the analysis Postprocessor: operate.
CELP / FS-1016 – 4.8kbps Federal Standard in Voice Coding
Statistical Signal Processing Research Laboratory(SSPRL) UT Acoustic Laboratory(UTAL) SINGLE CHANNEL SPEECH ENHANCEMENT TECHNIQUE FOR LOW SNR QUASI-PERIODIC.
WAVELET NOISE REMOVAL FROM BASEBAND DIGITAL SIGNALS IN BANDLIMITED CHANNELS Dr. Robert Barsanti SSST March 2010, University of Texas At Tyler.
Smartphone-based Wi-Fi Pedestrian-Tracking System Tolerating the RSS Variance Problem Yungeun Kim, Hyojeong Shin, and Hojung Cha Yonsei University Bing.
1 Speech Compression (after first coding) By Allam Mousa Department of Telecommunication Engineering An Najah University SP_3_Compression.
HIGH-RESOLUTION SINUSOIDAL MODELING OF UNVOICED SPEECH GEORGE P. KAFENTZIS, YANNIS STYLIANOU MULTIMEDIA INFORMATICS LABORATORY DEPARTMENT OF COMPUTER SCIENCE.
PERFORMANCE OF A WAVELET-BASED RECEIVER FOR BPSK AND QPSK SIGNALS IN ADDITIVE WHITE GAUSSIAN NOISE CHANNELS Dr. Robert Barsanti, Timothy Smith, Robert.
Multiresolution Analysis (Chapter 7)
Vocoders.
1 Vocoders. 2 The Channel Vocoder (analyzer) : The channel vocoder employs a bank of bandpass filters,  Each having a bandwidth between 100 HZ and 300.
Mohamed Chibani, Roch Lefebvre and Philippe Gournay
Linear Prediction.
Presentation transcript:

2001/07/18Chin-Kai Wu, CS, NTHU1 A Voicing-Driven Packet Loss Recovery Algorithm for Analysis- by-Synthesis Predictive Speech Coders over Internet Jhing-Fa Wang, Fellow, IEEE, Jia-Ching Wang, Jar-Ferr Wang, and Jian-Jia Wang IEEE Transaction on Multimedia, VOL. 3, NO. 1, March 2001

2001/07/18Chin-Kai Wu, CS, NTHU2 Outline Introduction Analysis-by-Synthesis Predictive Coders Voicing-Driven Packet Loss Recovery Algorithm Multiresolution Excitation Generation Pulse Tracking Procedure Experimental Results Conclusion

2001/07/18Chin-Kai Wu, CS, NTHU3 Analysis-by-Synthesis Predictive Coders Codebook Index LPC parameter Total Excitation Gain aperiodic periodic

2001/07/18Chin-Kai Wu, CS, NTHU4 Voicing-Driven Packet Loss Recovery Algorithm nearest half previous frame

2001/07/18Chin-Kai Wu, CS, NTHU5 Multiresolution Excitation Generation Reason to use wavelet transform Good time-frequency localization Multiresolution characteristics Revealing some characteristics that other signal analysis techniques miss Two-stage two-band Wavelet analysis filter banks h(n): Lowpass filter coefficient g(n): Highpass filter coefficient c j : scaling coefficient d j : wavelet coefficient

2001/07/18Chin-Kai Wu, CS, NTHU6 Multiresolution Excitation Generation (Cont’d) voiced unvoiced

2001/07/18Chin-Kai Wu, CS, NTHU7 Multiresolution Excitation Generation (Cont’d)

2001/07/18Chin-Kai Wu, CS, NTHU8 Pulse Tracking Procedure Pulse Position Tracking Algorithm Pulse Amplitude Estimation Algorithm

2001/07/18Chin-Kai Wu, CS, NTHU9 Pulse Position Tracking Algorithm (Step 1) Find the maximum absolute value within the frame and denote its position as P (peak position) tracking Set all sample with opposite sign to zero

2001/07/18Chin-Kai Wu, CS, NTHU10 Pulse Position Tracking Algorithm (Step 2) Construct a clipper by defining the clipping level from the maximum absolute value Set samples below the clipping level to zero

2001/07/18Chin-Kai Wu, CS, NTHU11 Pulse Position Tracking Algorithm (Step 3) Record peak position P and set a clearing region to reset all the samples within the region to zero

2001/07/18Chin-Kai Wu, CS, NTHU12 Pulse Position Tracking Algorithm (Step 4) Repeat Step 1 to 3 to recorder all the recorded peaks according to the positions and denote them as {p 1, p 2, …, p Np }

2001/07/18Chin-Kai Wu, CS, NTHU13 Pulse Position Tracking Algorithm (Result)

2001/07/18Chin-Kai Wu, CS, NTHU14 Pulse Amplitude Estimation Algorithm (Step 1) Compute the trend slope associated with the envelope of the speech peaks where Np: Number of peaks Slope of

2001/07/18Chin-Kai Wu, CS, NTHU15 Pulse Amplitude Estimation Algorithm (Step 2) Estimate the pulse position of the missing frame. Let be the set of estimated pulses, where

2001/07/18Chin-Kai Wu, CS, NTHU16 Pulse Amplitude Estimation Algorithm (Step 3) Estimate the envelope of the speech peaks in the missing frame Track the amplitude of each excitation pulse

2001/07/18Chin-Kai Wu, CS, NTHU17 Pulse Amplitude Estimation Algorithm (Result)

2001/07/18Chin-Kai Wu, CS, NTHU18 Experimental Results

2001/07/18Chin-Kai Wu, CS, NTHU19 Experimental Results (Cont’d)

2001/07/18Chin-Kai Wu, CS, NTHU20 Conclusion Packet loss degrades the speech quality of the analysis-by-synthesis coders seriously since the loss parameters not only affect the current speech frame but also produce error propagation problem The proposed recovery algorithm estimates the excitation information in the missing frame more accurately due to the selection of different excitation generation models