Voice Over Packet Networks Getting the most from your voice codec Philippe Gournay VoiceAge Corp. 750 Lucerne Road, Suite 250 Montreal (Quebec) H3R 2H6.

Slides:



Advertisements
Similar presentations
Switching Techniques In large networks there might be multiple paths linking sender and receiver. Information may be switched as it travels through various.
Advertisements

© 2006 Cisco Systems, Inc. All rights reserved. Optimizing Converged Cisco Networks (ONT) Module 2: Cisco VoIP Implementations.
NUS.SOC.CS5248 Ooi Wei Tsang 1 Adaptive Playout. NUS.SOC.CS5248 Ooi Wei Tsang 2 You are Here Network Encoder Sender Middlebox Receiver Decoder.
Using FEC for Rate Adaptation of Multimedia Streams Marcin Nagy Supervised by: Jörg Ott Instructed by: Varun Singh Conducted at Comnet, School of Electrical.
NUS.SOC.CS Roger Zimmermann (based in part on slides by Ooi Wei Tsang) 1 Adaptive Playout.
Fall Forum -- Keystone, Colorado USA -- November, 1999.
Yi Liang Department of Electrical Engineering Stanford University April 19, 2000 Loss Recovery and Adaptive Playout Control for Packet Voice Communications.
Chapter 6 outline r 6.1 Multimedia Networking Applications r 6.2 Streaming stored audio and video m RTSP r 6.3 Real-time, Interactive Multimedia: Internet.
Speech codecs and DCCP with TFRC VoIP mode Magnus Westerlund
A STUDY OF DESIGN COMPROMISES FOR SPEECH CODERS IN PACKET NETWORKS 1.INTRODUCTION In voice over packet networks, the coding gain achieved by prediction-based.
Voice over the Internet (the basics) CS 7270 Networked Applications & Services Lecture-2.
PROJECT PRESENTATION “ Analyzing Factors that affect VoIP Call Quality ” Presented By: Vamsi Krishna Karnati 11/24/2014.
ACM Multimedia October 4, 2001 Real-time Voice Communication over the Internet Using Packet Path Diversity Yi Liang, Eckehard Steinbach, and Bernd Girod.
EE 4272Spring, 2003 Chapter 10 Packet Switching Packet Switching Principles  Switching Techniques  Packet Size  Comparison of Circuit Switching & Packet.
© 2006 Cisco Systems, Inc. All rights reserved. 2.3: Encapsulating Voice Packets for Transport.
Yi Liang Multi-stream Voice Communication with Path Diversity.
Adaptive Playout Scheduling Using Time-scale Modification Yi Liang, Nikolaus Färber Bernd Girod, Balaji Prabhakar.
Multimedia Applications r Multimedia requirements r Streaming r Phone over IP r Recovering from Jitter and Loss r RTP r Diff-serv, Int-serv, RSVP.
Nov. 3, 2000 Adaptive Playout Scheduling in Packet Voice Communications.
A Study on Quality of Service Issues in Internet Telephony  IP Telephony – Applications and Services  Advantages and benefits of Voice over IP  Technical.
K. Salah 1 Chapter 28 VoIP or IP Telephony. K. Salah 2 VoIP Architecture and Protocols Uses one of the two multimedia protocols SIP (Session Initiation.
CS640: Introduction to Computer Networks
Packet Loss Characterization in WiFi-based Long Distance Networks Authors : Anmol Sheth, Sergiu Nedevschi, Rabin Patra, Lakshminarayanan Subramanian [INFOCOM.
VoIP Packets In the Air and Over the Wire J. Scott Haugdahl CTO
1 © 1999, Cisco Systems, Inc. Course Number Presentation_ID Voice Data Integration Cisco do Brasil Jonio Cavalcanti VoIP Network Design.
Analyze Assure Accelerate Network Model for Evaluating Multimedia Transmission Performance Over Internet Protocol PN Will become TIA/EIA-921 Jack.
Improving Voice Quality in International Mobile-to-Mobile Calls Aram Falsafi, Seattle, WA PIMRC September 2008.
Advanced Multimedia University of Palestine University of Palestine Eng. Wisam Zaqoot Eng. Wisam Zaqoot October 2010 October 2010 Ref: Computer Networking:
Computer Networks: Multimedia Applications Ivan Marsic Rutgers University Chapter 3 – Multimedia & Real-time Applications.
UNIVERSITÉ DE SHERBROOKE - Philippe G OURNAY Senior Research Engineer VoiceAge Corporation University of Sherbrooke François R OUSSEAU, Roch L EFEBVRE.
Introduction to Quality of Service. Engineering Internet QoS2  What is QoS?  Why QoS?  Large Bandwidth vs QoS?  Networking Trends Leading to QoS 
An Empirical Evaluation of VoIP Playout Buffer Dimensioning in Skype, Google Talk, and MSN Messenger Chen-Chi Wu, Kuan-Ta Chen, Yu-Chun Chang, and Chin-Laung.
1 Multimedia Communication Multimedia Systems(Module 5 Lesson 2) Summary: r Internet Phone Example m Making the Best use of Internet’s Best-Effort Service.
1 Lecture 17 – March 21, 2002 Content-delivery services. Multimedia services Reminder  next week individual meetings and project status report are due.
TELE202 Lecture 5 Packet switching in WAN 1 Lecturer Dr Z. Huang Overview ¥Last Lectures »C programming »Source: ¥This Lecture »Packet switching in Wide.
Voice over IP Why Challenges/solutions Voice codec and packet delay.
Network Instruments VoIP Analysis. VoIP Basics  What is VoIP?  Packetized voice traffic sent over an IP network  Competes with other traffic on the.
Voice Over Internet Protocol (VoIP). Basic Components of a Telephony Network.
Code : STM#220 Samsung Electronics Co., Ltd. IP Telephony System Error Handling & Management IP Telephony System Error Handling & Management Distribution.
New Models for Perceived Voice Quality Prediction and their Applications in Playout Buffer Optimization for VoIP Networks University of Plymouth United.
University of Plymouth United Kingdom {L.Sun; ICC 2002, New York, USA1 Lingfen Sun Emmanuel Ifeachor Perceived Speech Quality.
Changing the Dynamics of Network Analysis J. Scott Haugdahl CTO, WildPackets, Inc.
Packet switching network Data is divided into packets. Transfer of information as payload in data packets Packets undergo random delays & possible loss.
LOG Objectives  Describe some of the VoIP implementation challenges such as Delay/Latency, Jitter, Echo, and Packet Loss  Describe the voice encoding.
1.INTRODUCTION The use of the adaptive codebook (ACB) in CELP-like speech coders allows the achievement of high quality speech, especially for voiced segments.
In CELP coders, the past excitation signal used to build the adaptive codebook is the main source of error propagation when a frame is lost. We presents.
V. Fodor and Gy. Dan, KTH - Marholmen 2002 End-to-end control for audio-visual communication Viktoria Fodor and György Dán Laboratory for Communication.
CSE5803 Advanced Internet Protocols and Applications (14) Introduction Developed in recent years, for low cost phone calls (long distance in particular).
Comparisons of FEC and Codec Robustness on VoIP Quality and Bandwidth Efficiency Wenyu Jiang Henning Schulzrinne Columbia University ICN 2002, Atlanta,
A Very Low Bit Rate Protection Layer to Increase the Robustness of the AMR- WB+ Codec against Bit Errors Philippe Gournay Université de Sherbrooke Département.
Queuing Delay 1. Access Delay Some protocols require a sender to “gain access” to the channel –The channel is shared and some time is used trying to determine.
3GPP2 Evolution Workshop Multimedia Codecs and Protocols 3GPP2 TSG-C SWG1.2.
Voice Sampling. Sampling Rate Nyquist’s theorem states that a signal can be reconstructed if it is sampled at twice the maximum frequency of the signal.
RTP and playout delay compensation Henning Schulzrinne Dept. of Computer Science Columbia University Fall 2003.
Fundamentals of Multimedia Chapter 17 Wireless Networks 건국대학교 인터넷미디어공학부 임 창 훈.
August 3-4, 2004 San Jose, CA Successfully Offering VoIP- Enabled Applications Services Jan Linden Vice President of Engineering.
Chi-Cheng Lin, Winona State University CS412 Introduction to Computer Networking & Telecommunication Data Link Layer Part II – Sliding Window Protocols.
1 Speech Compression (after first coding) By Allam Mousa Department of Telecommunication Engineering An Najah University SP_3_Compression.
A Brief Look at VoIP QoS. Why is VoIP QoS important? PSTNs have played the first role in voice communications for a long time. But future is going to.
The Internet and Real-Time Applications Identifying Voice Transmission and Switching Issues for Packet Data Networks S. Wood Nov Copyright 2006 Modern.
Multimedia Communication Systems Techniques, Standards, and Networks Chapter 6 Multimedia Communication Across Networks.
Networked Multimedia Basics. Network Characteristics.
Speech over Packet Networks Variable Jitter Buffering Decoder-Based Time-Scaling Performance Analysis Performance Analysis of a Decoder-Based Time-Scaling.
Scalable Speech Coding for IP Networks
Wenyu Jiang Henning Schulzrinne Columbia University
Multimedia networking: outline
Mohamed Chibani, Roch Lefebvre and Philippe Gournay
Multimedia networking: outline
Project proposal Multi-stream and multi-path audio transmission
Presentation transcript:

Voice Over Packet Networks Getting the most from your voice codec Philippe Gournay VoiceAge Corp. 750 Lucerne Road, Suite 250 Montreal (Quebec) H3R 2H6 Canada Phone: Fax:

August 8-10, 2006 Santa Clara, California Hyatt Regency Santa Clara Choosing a codec for VoIP VoIP over a converged voice and data network reduces service delivery costs and enables innovative value-added applications (e.g., multimedia conferencing) Delivering acceptable QoS and PSTN-grade voice quality are challenging in some real-world networks (especially over wireless links) While being able to analyze your network for acceptable QoS / QoE is essential, the choice of the appropriate speech codec and processing technologies will help to ensure that your network consistently performs within your specification

August 8-10, 2006 Santa Clara, California Hyatt Regency Santa Clara Different Voice Coding Technologies, Different Compromises Three factors are decisive when choosing a codec for VoIP: –Bit rate, delay and robustness to packet losses –Quality in clear channel and complexity are also important factors –One can trade off between these factors (e.g. increase bit rate, delay or complexity to increase robustness) Sample-based codecs –PCM (G.711), ADPCM –Higher quality, lower delay and complexity –Higher bit rate (64 kbits/s for PCM) Frame-based (CELP) codecs –G.729, GSM codecs –Lower bit rate (typically 6-24 kbits/s) –Higher delay and complexity –Quality highly dependent on bit rate and input conditions

August 8-10, 2006 Santa Clara, California Hyatt Regency Santa Clara There’s more than just PCM or G.729 Although this is a very successful family, the list of standard-based voice codecs is not limited to ACELP…

August 8-10, 2006 Santa Clara, California Hyatt Regency Santa Clara Wideband Speech Communication Substantially increases captured speech information –Delivers double the audio signal bandwidth Enables digital end-to-end packet-based services to deliver much better speech communication quality than traditional PSTN circuit- switched telephony – VoIP quality differentiator

August 8-10, 2006 Santa Clara, California Hyatt Regency Santa Clara VoIP Presents Very Specific Challenges Variable transmission time –Jitter –Delay spikes –Clock drift between sender and receiver Transmission impairments –Out-of-sequence packets –Late packets –Lost packets The receiver in general, and the codec in particular, must be prepared to handle those impairments

August 8-10, 2006 Santa Clara, California Hyatt Regency Santa Clara Various Delay and Error Profiles 1.Low-amplitude, static jitter 2.High-amplitude, static jitter 3.Low/high amplitude changing jitter 4.Varying jitter situation with high packet loss rate 5.Moderate basic static jitter characteristics with occasional moderate delay spikes and significant packet loss rate 6.Moderate basic static jitter characteristics with frequent and severe delay spikes Proposed by 3GPP for jitter buffer characterization

August 8-10, 2006 Santa Clara, California Hyatt Regency Santa Clara Jitter Buffering Voice communications over packet networks (VoIP) is characterized by a variable transmission time (jitter) VoIP receivers generally use a jitter buffer to control the effect of the jitter The jitter buffer works by introducing an additional playout delay The playout delay is chosen to minimize the number of late packets while keeping the total end-to-end delay within acceptable limits Packets that arrive before their playout time are temporarily stored in a reception buffer

August 8-10, 2006 Santa Clara, California Hyatt Regency Santa Clara Jitter Buffering Strategies Fixed jitter buffer –The playout delay is chosen at the beginning of the conversation Variable jitter buffer –Under the basic talk-spurt based strategy, the playout delay is changed at the beginning of each silence period –For quickly varying networks, better results are obtained when the playout delay is also adapted during active speech

August 8-10, 2006 Santa Clara, California Hyatt Regency Santa Clara Adaptive Jitter Buffering Requires a means for deciding when and how to change the playout delay –Decision based either on a prediction of the jitter, or on an histogram of its past values –Jitter spikes, clock drift, and lost & out-of-order packets must be handled properly Also requires a means of time scaling of speech –Playing out a longer frame increases the playout delay, while playing out a shorter frame decreases it –Can be done either outside the decoder in the PCM domain, or inside the decoder

August 8-10, 2006 Santa Clara, California Hyatt Regency Santa Clara Packet Loss Concealment (PLC) What to do when a packet is missing? –Compute a replacement speech frame Sample-based codecs –Require an external PLC –G.711 Appendix I: "A high quality low-complexity algorithm for packet loss concealment with G.711" Predictive codecs (G.729, AMR-WB/G & VMR-WB) –PLC generally built into their standards –Based on prediction / extrapolation of speech –Interpolation is more effective but increases delay

August 8-10, 2006 Santa Clara, California Hyatt Regency Santa Clara Packet Loss Recovery What to do when packets are received again? –Just smoothing for sample-based or non-predictive codecs –Resynchronization to limit error propagation for CELP codecs There are a number of proprietary solutions for resynchronization –Using side information or not –Interoperability with standards may be an issue

August 8-10, 2006 Santa Clara, California Hyatt Regency Santa Clara Late Packet Processing Late packets are most often considered as lost and discarded The concealment procedure does not correctly update the internal state of predictive voice decoders –Error propagates well after the packet loss Using information in late packets instead of simply discarding them substantially improves the recovery of the voice decoder –Enables updating of the internal state of the decoder In a VoIP environment, late packet processing: –Enhances robustness against jitter without increasing the delay –Lowers the buffering delay without degrading quality

August 8-10, 2006 Santa Clara, California Hyatt Regency Santa Clara Forward Error Correction (FEC) Not all missing speech frames can be concealed, especially when concealment uses only the past signal –Onsets, transients (i.e., abrupt/quick changes in the speech signal ) The concealment error can propagate over several frames, even frames received correctly –Because of the use of (long-term) prediction Add redundancy at the packet level to decrease frame loss probability –Increases the robustness of the codec –Increases the bit rate, and possibly the delay

August 8-10, 2006 Santa Clara, California Hyatt Regency Santa Clara Add Redundancy to G.729 –Basic configuration –R=8kbps, D=25ms –Delay and redundancy –R=12kbps, D=35ms –Partial redundancy –More or less delay –R=14.1kbps, D=25ms/45ms –Redundancy + delay –R=16kbps, D=45ms …… F 2k-2 F 2k-1 F 2k F 2k+1 F 2k+2 F 2k+3 F 2k+4 …… F 2k-2 F 2k-1 F’ 2k-3 F 2k F 2k+1 F’ 2k-1 F 2k+2 F 2k+3 F’ 2k+1 F’ 2k-4 F’ 2k-2 F’ 2k …… F 2k-2 F 2k-1 F 2k-3 F 2k F 2k+1 F 2k-1 F 2k+2 F 2k+3 F 2k+1 F 2k-4 F 2k-2 F 2k …… F 2k-2 F 2k-1 P k-1 PkPk P k+1 F 2k F 2k+1 F 2k+2 F 2k+3 G G G / 3 G *The G.729 RTP payload already supports solutions G and G ms packet (network) 10ms frames (codec)

August 8-10, 2006 Santa Clara, California Hyatt Regency Santa Clara FEC Listening Test Results R (kbits/s) D(ms)

August 8-10, 2006 Santa Clara, California Hyatt Regency Santa Clara Conclusion Packet switching is a very challenging environment for voice codecs: –Jitter, packet losses These performance problems can be mitigated by the choice of the appropriate voice codec and processing technologies: –Jitter buffering, late packet processing –Packet loss concealment and recovery –Forward error correction –Use wideband speech codecs? The overall VoIP quality of service is a combination of: –Pure voice quality (MOS score) especially in the presence of packet losses (robustness) –Bit rate, Delay, Complexity … but one can trade off between these factors to provide the best possible quality to as many users as possible.

August 8-10, 2006 Santa Clara, California Hyatt Regency Santa Clara Philippe Gournay VoiceAge Corp. 750 Lucerne Road, Suite 250 Montreal (Quebec) H3R 2H6 Canada Phone: Fax:

August 8-10, 2006 Santa Clara, California Hyatt Regency Santa Clara Fixed Transmission Time (no jitter) Transmission delay nn+1 012n Sender Receiver Playout delay A fixed playout delay is enough to produce a sustained flow of speech to the listener

August 8-10, 2006 Santa Clara, California Hyatt Regency Santa Clara Variable Transmission Time (some jitter), Fixed Playout Delay Transmission delay Playout delay 012nn+1  n-1 nn  n+1 Sender Receiver Playout Some packets (n+1) arrive too late to be decoded