Adaptive Delay Concealment for Internet Voice Applications with Packet-Based Time-Scale Modification Fang Liu, JongWon Kim, C.-C. Jay Kuo IEEE ICASSP 2001.

Slides:



Advertisements
Similar presentations
STQ Workshop, Sophia-Antipolis, February 11 th, 2003 Packet loss concealment using audio morphing Franck Bouteille¹ Pascal Scalart² Balazs Kövesi² ¹ PRESCOM.
Advertisements

Tuning Skype Redundancy Control Algorithm for User Satisfaction Te-Yuan Huang, Kuan-Ta Chen, Polly Huang Proceedings of the IEEE Infocom Conference Rio.
Comp Spring 2003 Delay Jitter Ketan Mayer-Patel.
NUS.SOC.CS5248 Ooi Wei Tsang 1 Adaptive Playout. NUS.SOC.CS5248 Ooi Wei Tsang 2 You are Here Network Encoder Sender Middlebox Receiver Decoder.
A System for Hybridizing Vocal Performance By Kim Hang Lau.
1 NUS.SOC.CS5248 OOI WEI TSANG Playout Buffer Management.
NUS.SOC.CS Roger Zimmermann (based in part on slides by Ooi Wei Tsang) 1 Adaptive Playout.
Yi Liang Department of Electrical Engineering Stanford University April 19, 2000 Loss Recovery and Adaptive Playout Control for Packet Voice Communications.
Chapter 6 outline r 6.1 Multimedia Networking Applications r 6.2 Streaming stored audio and video m RTSP r 6.3 Real-time, Interactive Multimedia: Internet.
Voice Over Internet- Issues Dr. Abhay Karandikar Department of Electrical Engineering Indian Institute of Technology, Bombay. Dr. Abhay Karandikar Department.
Speech codecs and DCCP with TFRC VoIP mode Magnus Westerlund
Voice over the Internet (the basics) CS 7270 Networked Applications & Services Lecture-2.
1 Voice Over IP (VoIP) is a common technology for performing voice calls over the Internet. Voice Over IP (VoIP) is a common technology for performing.
University of Illinois, Urbana-Champaign
ACM Multimedia October 4, 2001 Real-time Voice Communication over the Internet Using Packet Path Diversity Yi Liang, Eckehard Steinbach, and Bernd Girod.
2001/02/15Chin-Kai Wu, CS, NTHU1 Adaptive Playout Mechanisms for Packetized Audio Applications in Wide-Area Networks Ramachandran Ramjee, Jim Kurose, Don.
A New Adaptive FEC Loss Control Algorithm for Voice Over IP Applications Chinmay Padhye, Kenneth Christensen and Wilfirdo Moreno Department of Computer.
Service Differentiated Peer Selection An Incentive Mechanism for Peer-to-Peer Media Streaming Ahsan Habib, Member, IEEE, and John Chuang, Member, IEEE.
On improving the intelligibility of synchronized over-lap-and-add (SOLA) at low TSM factor Wong, P.H.W.; Au, O.C.; Wong, J.W.C.; Lau, W.H.B. TENCON '97.
Packet audio playout delay adjustment Performance bounds and algorithms Moon, Kurose, Towsley.
Adaptive Playout Scheduling Using Time-scale Modification Yi Liang, Nikolaus Färber Bernd Girod, Balaji Prabhakar.
Voice Quality Evaluation for Wireless Transmission with ROHC S. Rein and F.H.P. Fitzek and M. Reisslein Voice Quality Evaluation for Wireless Transmission.
Yi Liang July 12, 2000 Adaptive Playout Time Control with Time-scale Packet Modification.
Multiple Sender Distributed Video Streaming Thinh Nguyen (IEEE Member) Avideh Zakhor (IEEE Fellow) IEEE Transactions on multimedia 2004.
Oct. 18, 2000 Subjective Tests Results Yi Liang Degradation Category Rating of Scaled Speech Three short network traces with different jitter statistics.
Nov. 3, 2000 Adaptive Playout Scheduling in Packet Voice Communications.
Adaptive Playout Scheduling Using Time- scale Modification in Packet Voice Communications Yi J. Liang, Nikolaus Farber, Bernd Girod Information Systems.
H. Sanneck*, A. Stenger, K. Ben Younes, B. Girod
2001/11/29Chin-Kai Wu, CS, NTHU1 Characteristics of Network Delay and Delay Jitter and its Effect on Voice over IP Li Zheng, Liren Zhang, Dong Xu Communications,
Brian White CS529 SPEAK WITH FORWARD ERROR CORRECTION: IMPLEMENTATION AND EVALUATION.
Using Redundancy and Interleaving to Ameliorate the Effects of Packet Loss in a Video Stream Yali Zhu, Mark Claypool and Yanlin Liu Department of Computer.
CSE 124 Networked Services Fall 2009 B. S. Manoj, Ph.D 10/8/20091CSE 124 Networked Services Fall 2009 Some of.
A Survey of Packet Loss Recovery Techniques for Streaming Audio Colin Perkins, Orion Hodson, and Vicky Hardman University College London IEEE Network.
An Adaptive Predictor for Media Playout Buffering Phillip DeLeon New Mexico State University Cormac J. Sreenan AT&T Labs ICASSP 99’
Multiple Sender Distributed Video Streaming Nguyen, Zakhor IEEE Transactions on Multimedia April 2004.
CS640: Introduction to Computer Networks
Secure Steganography in Audio using Inactive Frames of VoIP Streams
An Error – Concealment Technique for Wireless Digital Audio Delivery N. Tatlas, A. Floros, T. Zarouchas and J. Mourjopoulos.
Voice Over Packet Networks Getting the most from your voice codec Philippe Gournay VoiceAge Corp. 750 Lucerne Road, Suite 250 Montreal (Quebec) H3R 2H6.
Computer Networks: Multimedia Applications Ivan Marsic Rutgers University Chapter 3 – Multimedia & Real-time Applications.
UNIVERSITÉ DE SHERBROOKE - Philippe G OURNAY Senior Research Engineer VoiceAge Corporation University of Sherbrooke François R OUSSEAU, Roch L EFEBVRE.
Evalvid overview. Contents Introduction Framework and Design Functionalities Tools.
An Empirical Evaluation of VoIP Playout Buffer Dimensioning in Skype, Google Talk, and MSN Messenger Chen-Chi Wu, Kuan-Ta Chen, Yu-Chun Chang, and Chin-Laung.
1 Multimedia Communication Multimedia Systems(Module 5 Lesson 2) Summary: r Internet Phone Example m Making the Best use of Internet’s Best-Effort Service.
1 Lecture 17 – March 21, 2002 Content-delivery services. Multimedia services Reminder  next week individual meetings and project status report are due.
On Optimizing the Backoff Interval for Random Access Scheme Zygmunt J. Hass and Jing Deng IEEE Transactions on Communications, Dec 2003.
New Models for Perceived Voice Quality Prediction and their Applications in Playout Buffer Optimization for VoIP Networks University of Plymouth United.
An Experimental Evaluation of Voice Quality over the Datagram Congestion Control Protocol H. Balan International Univeristy Bremen L. Eggert Nokia Research.
An optimal power-saving class II for VoIP traffic and its performance evaluations in IEEE e JungRyun Lee School of Electrical and Electronics Eng,Chung-Ang.
An Energy-Efficient MAC Protocol for Wireless Sensor Networks Qingchun Ren and Qilian Liang Department of Electrical Engineering, University of Texas at.
Wireless communications and mobile computing conference, p.p , July 2011.
IPTEL'2001, New York, USA1 Lingfen Sun Graham Wade, Benn Lines Emmanuel Ifeachor University of Plymouth, U.K. Impact of Packet Loss Location on Perceived.
Page 1 The department of Information & Communications Engineering Dong-uk, kim A Survey of Packet Loss Recovery Techniques for Streaming.
Internet multimedia: simplest approach audio, video not streamed: r no, “pipelining,” long delays until playout! r audio or video stored in file r files.
Present document contains informations proprietary to France Telecom. Accepting this document means for its recipient he or she recognizes the confidential.
Bluetooth: Quality of Service Reference: “QoS based scheduling for incorporating variable rate coded voice in Bluetooth”; Chawla, S.; Saran, H.; Singh,
Doc.: IEEE /0632r1 Submission July 2005 Clifford Tavares, Hitachi America Ltd.Slide MAC extensions for high rate Video Notice: This document.
RTP and playout delay compensation Henning Schulzrinne Dept. of Computer Science Columbia University Fall 2003.
Motion-JPEG2000 Video Transmission Over Active Network Ruibiao Qiu.
Tutorial 12 Solutions.
Experimental Study on Wireless Multicast Scalability using Merged Hybrid ARQ with Staggered Adaptive FEC S. Makharia, D. Raychaudhuri, M. Wu*, H. Liu*,
Speech over Packet Networks Variable Jitter Buffering Decoder-Based Time-Scaling Performance Analysis Performance Analysis of a Decoder-Based Time-Scaling.
Analysis of Delay and Jitter of Voice Traffic Over the Internet
Scalable Speech Coding for IP Networks: Beyond iLBC
Impact of Packet Loss Location on Perceived Speech Quality
Adaptive Playout.
Streaming Music with Media Specific FEC
Mohamed Chibani, Roch Lefebvre and Philippe Gournay
Scalable Speech Coding for IP Networks: Beyond iLBC
Project proposal Multi-stream and multi-path audio transmission
Presentation transcript:

Adaptive Delay Concealment for Internet Voice Applications with Packet-Based Time-Scale Modification Fang Liu, JongWon Kim, C.-C. Jay Kuo IEEE ICASSP 2001

Outline n Introduction n Adaptive Playout Framework n Time-Scale Modification with SOLA n Adaptive Playout with Time-Scale Modification n Experimental Results n Conclusions

Introduction n Research on Internet audio streaming has focused on error control and delay concealment in the presence of delay jitter and packet loss. n A delay spike happens: –when several consecutive packets arrive at the receiver almost simultaneously. n Packet droppings by network delay jitter results in degradation of speech playout.

Introduction n Recover from network packet loss –redundant FEC ( Forward Error Correction ) –time-domain stretching n Reduce the jitter/loss effect –adaptively adjust the silence length between talkspurts n Could the adaptive playout work? –Stable –the silence detection is effective.

Introduction n If a delay spike happens in the middle of a talk spurt? n Extend the silence interval-based adaptive playout by exploiting the time-scale modification scheme. –Every packet could contributed in adapting to the network delay jitter/spike. n SOLA scheme is adopted. ( synchronized overlap-and-add)

Adaptive Payout Framework The short-time energy En(i) the zero crossing rate Zn(i)

Adaptive Payout Framework n A silence segment –En(i) < /8 and Zn(i) < 30 n A transient segment –En(i) / En(i-1) > 1.6 and En(i) > * 2 –En(i) / En(i-1) * 2 n A general segment –otherwise = γ + (1- γ )En(i)

Timings

Delay Correlation

Time-Scale Modification (SOLA) n SAMD ( short-time average magnitude difference )

Adaptive Playout with Time-scale modification n Goal : –to detect a delay spike as soon as possible when it happens n the first packet of each talk-spurt n all packets within a talk-spurt

Adaptive Playout with Time-scale modification

n For each packet received, we have a i = t i + D prop + v i and calculate ^b i and ^vb i n Calculate q i for the first packet and all other packets of each talk-spurt. n Update q i. In this case, it is assumed that the sender-generated content category, is transmitted to the receiver in-band. Calculate the playout length l i (^P) = q i+1 - q i and the target stretching factor α= l i (^P) / l i (O).

Adaptive Playout with Time-scale modification n At scheduling time q i, if a i > q i, packet i will be dropped. Proceed to packet i+1 and restart the algorithm with i = i+1 n Based on the final playout length, update the playout time. p i = q i + D calc Decode and perform time-scale modification based on α n Proceed to packet i+1 with the same algorithm

Experimental Results n Sampling rate : 8 kHz n 16 bits/sample n inter-packetization interval l i (O) : 20ms n 2000 packets in total (40 seconds of speech)

Experimental Results

Conclusions n Time-domain stretching introduce audio artifacts that is within an acceptable range of quality. n By using content-adaptive stretching, we successfully preserve the pitch and the continuity of the original speech. n 1.5~8% improvement over the reference algorithm.