In CELP coders, the past excitation signal used to build the adaptive codebook is the main source of error propagation when a frame is lost. We presents.

Slides:



Advertisements
Similar presentations
Wideband Speech Coding for CDMA2000® Systems
Advertisements

Packet Video Error Concealment With Auto Regressive Model Yongbing Zhang, Xinguang Xiang, Debin Zhao, Siwe Ma, Student Member, IEEE, and Wen Gao, Fellow,
STQ Workshop, Sophia-Antipolis, February 11 th, 2003 Packet loss concealment using audio morphing Franck Bouteille¹ Pascal Scalart² Balazs Kövesi² ¹ PRESCOM.
CHAPTER Modulation.
Speech & Audio Coding TSBK01 Image Coding and Data Compression Lecture 11, 2003 Jörgen Ahlberg.
Philippe Gournay, Bruno Bessette, Roch Lefebvre
Ranko Pinter Simoco Digital Systems
CELLULAR COMMUNICATIONS 5. Speech Coding. Low Bit-rate Voice Coding  Voice is an analogue signal  Needed to be transformed in a digital form (bits)
Speech Coding Nicola Orio Dipartimento di Ingegneria dell’Informazione IV Scuola estiva AISV, 8-12 settembre 2008.
A STUDY OF DESIGN COMPROMISES FOR SPEECH CODERS IN PACKET NETWORKS 1.INTRODUCTION In voice over packet networks, the coding gain achieved by prediction-based.
Understanding the Internet Low Bit Rate Coder Jan Linden Vice President of Engineering Global IP Sound Presented by Jan Skoglund Sr. Research Scientist.
Voice over the Internet (the basics) CS 7270 Networked Applications & Services Lecture-2.
Department of electrical and computer engineering An Equalization Technique for High Rate OFDM Systems Mehdi Basiri.
SCHOOL OF COMPUTING SCIENCE SIMON FRASER UNIVERSITY CMPT 820 : Error Mitigation Schaar and Chou, Multimedia over IP and Wireless Networks: Compression,
Recursive End-to-end Distortion Estimation with Model-based Cross-correlation Approximation Hua Yang, Kenneth Rose Signal Compression Lab University of.
Efficient Motion Vector Recovery Algorithm for H.264 Based on a Polynomial Model Jinghong Zheng and Lap-Pui Chau IEEE TRANSACTIONS ON MULTIMEDIA, June.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
2001/05/24Chin-Kai Wu, CS, NTHU1 Improved frame erasure concealment for CELP-based coders Juan Carlos De Martin, Takahiro Unno, Vishu Viswanathan DSPS.
Efficient Fine Granularity Scalability Using Adaptive Leaky Factor Yunlong Gao and Lap-Pui Chau, Senior Member, IEEE IEEE TRANSACTIONS ON BROADCASTING,
Video Streaming: An FEC-Based Novel Approach Jianfei Cai, Chang Wen Chen Electrical and Computer Engineering, Canadian Conference on.
2001/07/18Chin-Kai Wu, CS, NTHU1 A Voicing-Driven Packet Loss Recovery Algorithm for Analysis- by-Synthesis Predictive Speech Coders over Internet Jhing-Fa.
Using Redundancy and Interleaving to Ameliorate the Effects of Packet Loss in a Video Stream Yali Zhu, Mark Claypool and Yanlin Liu Department of Computer.
4/24/2002SCL UCSB1 Optimal End-to-end Distortion Estimation for Drift Management in Scalable Video Coding H. Yang, R. Zhang and K. Rose Signal Compression.
COMP 249 :: Spring 2005 Slide: 1 Audio Coding Ketan Mayer-Patel.
09/24/02ICIP20021 Drift Management and Adaptive Bit Rate Allocation in Scalable Video Coding H. Yang, R. Zhang and K. Rose Signal Compression Lab ECE Department.
CS :: Fall 2003 Audio Coding Ketan Mayer-Patel.
Patrick-André Savard, Philippe Gournay and Roch Lefebvre Université de Sherbrooke, Québec, Canada.
OCTOBER 23-24, 2012 VOCODER TECHNOLOGY
Computer Networks Digitization. Spring 2006Computer Networks2 Transfer of an Analog Signal  When analog data (voice, pictures, video) are transformed.
Secure Steganography in Audio using Inactive Frames of VoIP Streams
Voice Over Packet Networks Getting the most from your voice codec Philippe Gournay VoiceAge Corp. 750 Lucerne Road, Suite 250 Montreal (Quebec) H3R 2H6.
UNIVERSITÉ DE SHERBROOKE - Philippe G OURNAY Senior Research Engineer VoiceAge Corporation University of Sherbrooke François R OUSSEAU, Roch L EFEBVRE.
Speech Coding Submitted To: Dr. Mohab Mangoud Submitted By: Nidal Ismail.
Abhik Majumdar, Rohit Puri, Kannan Ramchandran, and Jim Chou /24 1 Distributed Video Coding and Its Application Presented by Lei Sun.
1 Adaptable applications Towards Balancing Network and Terminal Resources to Improve Video Quality D. Jarnikov.
8. 1 MPEG MPEG is Moving Picture Experts Group On 1992 MPEG-1 was the standard, but was replaced only a year after by MPEG-2. Nowadays, MPEG-2 is gradually.
Compression No. 1  Seattle Pacific University Data Compression Kevin Bolding Electrical Engineering Seattle Pacific University.
Rate-distortion Optimized Mode Selection Based on Multi-channel Realizations Markus Gärtner Davide Bertozzi Classroom Presentation 13 th March 2001.
1 JEG hybrid model Iñigo Sedano June, Three years working at Tecnalia Technology Corporation, Telecom Unit, Broadband networks group, Spain (
Week 7 Lecture 1+2 Digital Communications System Architecture + Signals basics.
Submitted By: Santosh Kumar Yadav (111432) M.E. Modular(2011) Under the Supervision of: Mrs. Shano Solanki Assistant Professor, C.S.E NITTTR, Chandigarh.
VOCODERS. Vocoders Speech Coding Systems Implemented in the transmitter for analysis of the voice signal Complex than waveform coders High economy in.
“Compensating for Packet Loss in Real-Time Applications“
1.INTRODUCTION The use of the adaptive codebook (ACB) in CELP-like speech coders allows the achievement of high quality speech, especially for voiced segments.
IPTEL'2001, New York, USA1 Lingfen Sun Graham Wade, Benn Lines Emmanuel Ifeachor University of Plymouth, U.K. Impact of Packet Loss Location on Perceived.
Synchronization of Turbo Codes Based on Online Statistics
1 Audio Coding. 2 Digitization Processing Signal encoder Signal decoder samplingquantization storage Analog signal Digital data.
Present document contains informations proprietary to France Telecom. Accepting this document means for its recipient he or she recognizes the confidential.
A Very Low Bit Rate Protection Layer to Increase the Robustness of the AMR- WB+ Codec against Bit Errors Philippe Gournay Université de Sherbrooke Département.
3GPP2 Evolution Workshop Multimedia Codecs and Protocols 3GPP2 TSG-C SWG1.2.
Chapter 20 Speech Encoding by Parameters 20.1 Linear Predictive Coding (LPC) 20.2 Linear Predictive Vocoder 20.3 Code Excited Linear Prediction (CELP)
CS294-9 :: Fall 2003 Joint Source/Channel Coding Ketan Mayer-Patel.
CELP / FS-1016 – 4.8kbps Federal Standard in Voice Coding
MEMORY-LESS GAIN QUANTIZATION IN THE EVS CODEC Vladimir Malenovsky Milan Jelinek University of Sherbrooke/VoiceAge Corp. CANADA.
A Distributed and Adaptive Signal Processing Approach to Reducing Energy Consumption in Sensor Networks Jim Chou, et al Univ. of Califonia at Berkeley.
1 Speech Compression (after first coding) By Allam Mousa Department of Telecommunication Engineering An Najah University SP_3_Compression.
COMBINATIONAL AND SEQUENTIAL CIRCUITS Guided By: Prof. P. B. Swadas Prepared By: BIRLA VISHVAKARMA MAHAVDYALAYA.
Speech over Packet Networks Variable Jitter Buffering Decoder-Based Time-Scaling Performance Analysis Performance Analysis of a Decoder-Based Time-Scaling.
Scalable Speech Coding for IP Networks: Beyond iLBC
Digital Communications Chapter 13. Source Coding
Vocoders.
Injong Rhee ICMCS’98 Presented by Wenyu Ren
1 Vocoders. 2 The Channel Vocoder (analyzer) : The channel vocoder employs a bank of bandpass filters,  Each having a bandwidth between 100 HZ and 300.
Mohamed Chibani, Roch Lefebvre and Philippe Gournay
ON THE ARCHITECTURE OF THE CDMA2000® VARIABLE-RATE MULTIMODE WIDEBAND (VMR-WB) SPEECH CODING STANDARD Milan Jelinek†, Redwan Salami‡, Sassan Ahmadi*, Bruno.
Understanding the Internet Low Bit Rate Coder
Scalable Speech Coding for IP Networks: Beyond iLBC
Packet loss concealment using audio morphing
PCM & DPCM & DM.
UWB Receiver Design Simplification through Channel Shortening
Presentation transcript:

In CELP coders, the past excitation signal used to build the adaptive codebook is the main source of error propagation when a frame is lost. We presents a novel resynchronization technique using very low bit rate side information to correct the past excitation signal after a frame erasure. The novelty of this technique is that the correction is computed at the encoder in a closed loop fashion, based on the actual error introduced by the concealment. Objective and subjective test results show that this approach is a promising area for future research on frame loss recovery. A frame loss is simulated at the encoder (concealment) in order to determine the correction that should be applied to the past excitation signal (adaptive codebook). T YPE B ( LOST ALIGN.): For stationary voiced signals, the correction consists in a gain (g) and a shift (  ) T YPE A ( LOST ONSETS ): Side information describes the position and amplitude of the largest pulse We have demonstrated a concept which can be applied to any CELP codec:  Very efficient for single frame loss  Very limited bit rate (13 bits per frame)  Minimum complexity overhead at the encoder, no overhead at the decoder Various improvements for errors of type A and B, and various solutions for errors of type C, are proposed in the paper. Analysis of actual “good” and “bad” past excitation signals shows that typical CELP concealment introduce 3 types of errors which are characterized by strong error propagation:  Type A: Lost onsets  Type B: Lost alignments  Type C: Waveform mismatch The determination of the correction is done in the LPC excitation domain. The correction information depends on a signal classification step. To demonstrate the concept, we have chosen to concentrate on errors of types A and B. (a) no frame lost; (b) standard decoder; (c) modified decoder using side information; (d) and (e) error signals for the standard and modified decoders. AB comparison test between the standard and a modified AMR-WB codec; one lost frame every 10 frames; 32 sentence pairs (4 speakers); 6 experienced listeners. S TANDARD M ODIFIED P REFERENCE University of Sherbrooke Faculté de Génie 2500, boul. de l’Université Sherbrooke (Québec) J1K 2R1 Canada IMPROVED FRAME LOSS RECOVERY USING CLOSED-LOOP ESTIMATION OF VERY LOW BIT RATE SIDE INFORMATION Philippe Gournay VoiceAge Corporation 750 Chemin Lucerne, Suite 250 Montreal (Quebec) H3R 2H6 Canada 1. Abstract Bitstream OutSide Information z -1 Audio In Encoder Internal State old “good” past exc. Standard Encoder Concealment Correction new “good” past exc. “Good” past excitation “Bad” past excitation g  T0T0 “Good” past excitation “Bad” past excitation T0T0  5. Conclusions and Perspectives 2. Modified Encoder 3. Estimation of the Correction 4. Performance evaluation T YPE A ( LOST ONSET ) E RROR SIGNALS SHOW THE EFFECT OF A RESTORED ONSET T YPE B ( LOST ALIGN.) E RROR SIGNALS SHOW A FASTER RECONVERGENCE L OST F RAME Interspeech 2008, Brisbane, Australia