Robust Low-Latency Voice and Video Communication over Best-Effort Networks Department of Electrical Engineering Stanford University March 12, 2003

Slides:



Advertisements
Similar presentations
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
Advertisements

-1/20- MPEG 4, H.264 Compression Standards Presented by Dukhyun Chang
Yi Liang Department of Electrical Engineering Stanford University April 19, 2000 Loss Recovery and Adaptive Playout Control for Packet Voice Communications.
Chapter 6 outline r 6.1 Multimedia Networking Applications r 6.2 Streaming stored audio and video m RTSP r 6.3 Real-time, Interactive Multimedia: Internet.
Advances in Network-adaptive Video Streaming Bernd Girod J. Chakareski, M. Kalman, Y. J. Liang, E. Setton, R. Zhang Information Systems Laboratory Department.
June 3, A New Multipath Routing Protocol for Ad Hoc Wireless Networks Amit Gupta and Amit Vyas.
Bernd Girod. Joint Source-Network Coding for Real-time Media 1 Joint Source-Network Coding for Real-time Media Bernd Girod Information Systems Laboratory.
Rate Distortion Optimized Streaming Maryam Hamidirad CMPT 820 Simon Fraser Univerity 1.
SCHOOL OF COMPUTING SCIENCE SIMON FRASER UNIVERSITY CMPT 820 : Error Mitigation Schaar and Chou, Multimedia over IP and Wireless Networks: Compression,
Sang-Chun Han Hwangjun Song Jun Heo International Conference on Intelligent Hiding and Multimedia Signal Processing (IIH-MSP), Feb, /05 Feb 2009.
A Quality-Driven Decision Engine for Live Video Transmission under Service-Oriented Architecture DALEI WU, SONG CI, HAIYAN LUO, UNIVERSITY OF NEBRASKA-LINCOLN.
University of Illinois, Urbana-Champaign
ACM Multimedia October 4, 2001 Real-time Voice Communication over the Internet Using Packet Path Diversity Yi Liang, Eckehard Steinbach, and Bernd Girod.
Congestion-Distortion Optimized Peer-to-Peer Video Streaming Eric Setton*, Jeonghun Noh and Bernd Girod Information Systems Laboratory Stanford University.
End-to-End TCP-Friendly Streaming Protocol and Bit Allocation for Scalable Video Over Wireless Internet Fan Yang, Qian Zhang, Wenwu Zhu, and Ya-Qin Zhang.
Department of Electrical Engineering Stanford University Yi Liang, Eric Setton and Bernd Girod Channel-Adaptive Video Streaming Using Packet Path Diversity.
A New Adaptive FEC Loss Control Algorithm for Voice Over IP Applications Chinmay Padhye, Kenneth Christensen and Wilfirdo Moreno Department of Computer.
An Error-Resilient GOP Structure for Robust Video Transmission Tao Fang, Lap-Pui Chau Electrical and Electronic Engineering, Nanyan Techonological University.
Congestion-Aware Video Streaming over Peer-to-Peer Networks Eric Setton Information Systems Laboratory Stanford University.
Rate-Distortion Optimized Layered Coding with Unequal Error Protection for Robust Internet Video Michael Gallant, Member, IEEE, and Faouzi Kossentini,
Yi Liang Multi-stream Voice Communication with Path Diversity.
Adaptive Playout Scheduling Using Time-scale Modification Yi Liang, Nikolaus Färber Bernd Girod, Balaji Prabhakar.
Motion-compensation Fine-Granular-Scalability (MC-FGS) for wireless multimedia M. van der Schaar, H. Radha Proceedings of IEEE Symposium on Multimedia.
Real-time smoothing for network adaptive video streaming Kui Gao, Wen Gao, Simin He, Yuan Zhang J. Vis. Commun. Image R. 16 (2005)
1 WiSE Video: using in-band wireless loss notification to improve rate- controlled video streaming A. Markopoulou, E. Setton, M. Kalman, J. Apostolopoulos.
The Effectiveness of a QoE - Based Video Output Scheme for Audio- Video IP Transmission Shuji Tasaka, Hikaru Yoshimi, Akifumi Hirashima, Toshiro Nunome.
Yi Liang Aug. 2, 01 Low-latency Internet media streaming using packet path diversity - prior and future work.
Yi Liang July 12, 2000 Adaptive Playout Time Control with Time-scale Packet Modification.
Efficient Fine Granularity Scalability Using Adaptive Leaky Factor Yunlong Gao and Lap-Pui Chau, Senior Member, IEEE IEEE TRANSACTIONS ON BROADCASTING,
Multi-Path Multimedia Transmission in Ad-hoc Networks Related Work Marcin Michalak
Nov. 3, 2000 Adaptive Playout Scheduling in Packet Voice Communications.
Adaptive Playout Scheduling Using Time- scale Modification in Packet Voice Communications Yi J. Liang, Nikolaus Farber, Bernd Girod Information Systems.
Source-Channel Prediction in Error Resilient Video Coding Hua Yang and Kenneth Rose Signal Compression Laboratory ECE Department University of California,
Rate-Distortion Optimized Motion Estimation for Error Resilient Video Coding Hua Yang and Kenneth Rose Signal Compression Lab ECE Department University.
Transport of Real-Time Traffic over the Internet Bernd Girod Information Systems Laboratory Stanford University.
Using Redundancy and Interleaving to Ameliorate the Effects of Packet Loss in a Video Stream Yali Zhu, Mark Claypool and Yanlin Liu Department of Computer.
Multi-Path Transport of FGS Video Jian Zhou, Huai-Rong Shao, Chia Shen and Ming-Ting Sun ICME 2003.
H.264/AVC for Wireless Applications Thomas Stockhammer, and Thomas Wiegand Institute for Communications Engineering, Munich University of Technology, Germany.
4/24/2002SCL UCSB1 Optimal End-to-end Distortion Estimation for Drift Management in Scalable Video Coding H. Yang, R. Zhang and K. Rose Signal Compression.
09/24/02ICIP20021 Drift Management and Adaptive Bit Rate Allocation in Scalable Video Coding H. Yang, R. Zhang and K. Rose Signal Compression Lab ECE Department.
Low Latency Wireless Video Over Networks Using Path Diversity John Apostolopolous Wai-tian Tan Mitchell Trott Hewlett-Packard Laboratories Allen.
Variable Bit Rate Video Coding April 18, 2002 (Compressed Video over Networks: Chapter 9)
Computer Networks: Multimedia Applications Ivan Marsic Rutgers University Chapter 3 – Multimedia & Real-time Applications.
1 Dude, Where is My Packet?. NUS.SOC.CS5248 OOI WEI TSANG 2 Overview Characteristics of the Internet General techniques Error recovery for audio Effect.
1 Requirements for the Transmission of Streaming Video in Mobile Wireless Networks Vasos Vassiliou, Pavlos Antoniou, Iraklis Giannakou, and Andreas Pitsillides.
An Empirical Evaluation of VoIP Playout Buffer Dimensioning in Skype, Google Talk, and MSN Messenger Chen-Chi Wu, Kuan-Ta Chen, Yu-Chun Chang, and Chin-Laung.
Error control in video Streaming. Introduction Development of different types of n/ws such as internet, wireless and mobile networks has created new applications.
Adaptive Multi-path Prediction for Error Resilient H.264 Coding Xiaosong Zhou, C.-C. Jay Kuo University of Southern California Multimedia Signal Processing.
New Models for Perceived Voice Quality Prediction and their Applications in Playout Buffer Optimization for VoIP Networks University of Plymouth United.
Rate-distortion Optimized Mode Selection Based on Multi-channel Realizations Markus Gärtner Davide Bertozzi Classroom Presentation 13 th March 2001.
Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 1 Efficient Path Aggregation and Error Control for Video Streaming OMESH TICKOO, Shiv Kalyanaraman,
Compression of Real-Time Cardiac MRI Video Sequences EE 368B Final Project December 8, 2000 Neal K. Bangerter and Julie C. Sabataitis.
LOG Objectives  Describe some of the VoIP implementation challenges such as Delay/Latency, Jitter, Echo, and Packet Loss  Describe the voice encoding.
Scalable Video Coding and Transport Over Broad-band wireless networks Authors: D. Wu, Y. Hou, and Y.-Q. Zhang Source: Proceedings of the IEEE, Volume:
Fast motion estimation and mode decision for H.264 video coding in packet loss environment Li Liu, Xinhua Zhuang Computer Science Department, University.
Proxy-Based Reference Picture Selection for Error Resilient Conversational Video in Mobile Networks Wei Tu and Eckehard Steinbach, IEEE Transactions on.
Rate-distortion Optimized Mode Selection Based on Multi-path Channel Simulation Markus Gärtner Davide Bertozzi Project Proposal Classroom Presentation.
Overview of Digital Video Compression Multimedia Systems and Standards S2 IF Telkom University.
3GPP2 Evolution Workshop Multimedia Codecs and Protocols 3GPP2 TSG-C SWG1.2.
A Comparison of RaDiO and CoDiO over IEEE WLANs May 25 th Jeonghun Noh Deepesh Jain A Comparison of RaDiO and CoDiO over IEEE WLANs.
Path Diversity for Media Streaming The Use of Multiple Description Coding J. Apostolopoulos, M. Trott and W. Tan Presented by Xiaoyuan GUO.
Fundamentals of Multimedia Chapter 17 Wireless Networks 건국대학교 인터넷미디어공학부 임 창 훈.
PATH DIVERSITY WITH FORWARD ERROR CORRECTION SYSTEM FOR PACKET SWITCHED NETWORKS Thinh Nguyen and Avideh Zakhor IEEE INFOCOM 2003.
Networked Multimedia Basics. Network Characteristics.
Project Proposal Due Next Mon. Submit by
Limitations of Traditional Error-Resilience Methods
Standards Presentation ECE 8873 – Data Compression and Modeling
EE 368C Project Multi-stream Audio Transmission with Path Diversity
Project proposal Multi-stream and multi-path audio transmission
Presentation transcript:

Robust Low-Latency Voice and Video Communication over Best-Effort Networks Department of Electrical Engineering Stanford University March 12, Yi Liang

Liang: Robust Low-Latency Voice and Video Communication 2 Media Delivery over IP Networks Internet

Liang: Robust Low-Latency Voice and Video Communication 3 QoS Concerns and Challenges Communication over best-effort networks … Delay Impairs interactivity of conversational services Voice over IP: recommended one way delay < 150 ms [ITU-T G.114] Packet loss Impairs perceptual quality Delay jitter Obstructs sequential and continuous media output

Liang: Robust Low-Latency Voice and Video Communication 4 Outline of Contributions Server Client Packet network I. Client side II. Transport III. Network- adaptive coding I.Client side Adaptive playout scheduling for VoIP that reduces latency and packet loss II.Transport Packet path diversity and applications in low-latency communications III.Network-adaptive coding Low-latency video communication that does not require packet retransmission

Liang: Robust Low-Latency Voice and Video Communication 5 Outline I.Client side Adaptive playout scheduling for VoIP that reduces latency and packet loss II.Transport Packet path diversity and applications in low-latency communications III.Network-adaptive coding Low-latency video communication that does not require packet retransmission

Liang: Robust Low-Latency Voice and Video Communication 6 Delay Jitter and Buffering Avg. buffering delay (ms) Late loss rate (%) Late loss Buffering delay Fixed Playout Schedule

Liang: Robust Low-Latency Voice and Video Communication 7 Adaptive Playout Scheduling (1) Buffer. delay Adaptive Playout Schedule Fixed schedule

Liang: Robust Low-Latency Voice and Video Communication 8 Adaptive Playout Scheduling (2) Requires media scaling Slow down Speed up Sender Receiver Playout time Packetization time 1.How to set the playout schedule? 2.How to scale the media? 3.Quality of scaled voice?

Liang: Robust Low-Latency Voice and Video Communication 9 Determine the Playout Schedule Delay (ms) Probability Delay histogram Next packet: Given the acceptable loss rate, find the playout deadline History-based estimation using past w delays Deadline Loss prob.

Liang: Robust Low-Latency Voice and Video Communication 10 Voice Scaling Using Time-Scale Modification Based on WSOLA [Verhelst ‘93] Preserves pitch Improved to scale short individual voice packets; no delay Output 1/20/12/33 4 Input Pitch period Template segment Similar segment Packet expansion

Liang: Robust Low-Latency Voice and Video Communication 11 Examples of Time-Scale Modification Speech scaling Audio scaling Original130%70%

Liang: Robust Low-Latency Voice and Video Communication 12 Quality of Time-Scale Modified Voice Packets scaled: 18.4 % Scaling ratio: % DMOS: 4.5 out of 5 [ITU-T P.800] Adaptive Playout Schedule ORIGINALMODIFIED

Liang: Robust Low-Latency Voice and Video Communication 13 Results and Comparison Algorithms: 1.Fixed playout schedule 2.Only adjust playout schedule during silence periods [Ramjee ’94; Moon ‘98] 3.Adaptive playout scheduling 1 2 3

Liang: Robust Low-Latency Voice and Video Communication 14 Overall Performance Alg.Loss rate MOS Alg. 210%2.6 Alg. 34%3.7 Stanford  Chicago MOS scale : [ITU-T P.800] -50%

Liang: Robust Low-Latency Voice and Video Communication 15 Subjective Listening Test Results Trace MOS Stanford  1.Chicago 2.Germany 3.MIT 4.China Alg. 2 Alg. 3

Liang: Robust Low-Latency Voice and Video Communication 16 Summary Adaptive Playout Scheduling Improves the tradeoff between buffering delay and packet loss Time-scale modification-based speech processing does not impair speech quality Overall speech quality improves by 1 on a 5-point MOS scale The passive algorithm can be easily implemented on client Audacity T2, 8X8, Inc.

Liang: Robust Low-Latency Voice and Video Communication 17 Outline I.Client side Adaptive playout scheduling for VoIP that reduces latency and packet loss II.Transport Packet path diversity and applications in low-latency communications III.Network-adaptive coding Low-latency video communication that does not require packet retransmission

Liang: Robust Low-Latency Voice and Video Communication 18 Packet Path Diversity Motivation Typically better alternative path exists [Savage, SigComm ‘99] Uncorrelated packet loss on independent paths [Apostolopoulos ‘01] Low-latency requirement Sender Receiver 1 2 Relay server Relay server

Liang: Robust Low-Latency Voice and Video Communication 19 Internet Experiments Qwest Exodu s Comm. BBN Planet Santa Clara, CA MIT Harvard (5ms) (45ms) (40ms) (5ms) Sender Relay Server Receiver (delay incurred on a link or ISP network)

Liang: Robust Low-Latency Voice and Video Communication 20 Measured Packet Delay Trace

Liang: Robust Low-Latency Voice and Video Communication 21 Adaptive Playout Scheduling for Two-Stream

Liang: Robust Low-Latency Voice and Video Communication 22 Multiple Description Speech Coding Complementary and redundant descriptions of media Stream 1: Even samples: finer quantization Odd samples: coarser quantization Stream 2: Vice versa [Jiang, Ortega ‘00] E s1 s2 O E O E O O E O E O E  E O Packet length Time

Liang: Robust Low-Latency Voice and Video Communication 23 Determine the Playout Schedule To minimize the Lagrangian cost function Stream 1 p1p1 p2p2 Delay d Prob. Stream 2

Liang: Robust Low-Latency Voice and Video Communication 24 Overall Performance Avg end-to-end delay (ms) Loss rate (%) -35ms

Liang: Robust Low-Latency Voice and Video Communication 25 Summary Packet Path Diversity Exploitation of statistically uncorrelated delay jitter and packet loss behavior Adaptive playout scheduling for multiple streams provides lower latency and reduced distortion

Liang: Robust Low-Latency Voice and Video Communication 26 Outline I.Client side Adaptive playout scheduling for VoIP that reduces latency and packet loss II.Transport Packet path diversity and applications in low-latency communications III.Network-adaptive coding Low-latency video communication that does not require packet retransmission

Liang: Robust Low-Latency Voice and Video Communication 27 Low-Latency Video Communication Motivation for low-latency video Real-time conversational services Interactive video streaming Voice vs. Video Voice over IPTypical video streaming < 150 ms5 ~ 15 seconds pre-roll time Weak or no dependency across packets Strong dependency across packets due to motion- compensated coding

Liang: Robust Low-Latency Voice and Video Communication 28 Low-Latency Video — Challenges What the problems are Packet dependency due to hybrid motion-compensated coding Large receiver buffer and packet retransmission employed I P P P P P P P Interframe prediction Time Transmission error The “P-I” scheme

Liang: Robust Low-Latency Voice and Video Communication 29 Approaches Goal Achieve VoIP-like latency Approach Eliminate the need for retransmission Robust network-adaptive coding by optimal packet dependency management

Liang: Robust Low-Latency Voice and Video Communication 30 Coding Mode P1 P2 P5 INTRA … … Coding mode Increased error- resilience

Liang: Robust Low-Latency Voice and Video Communication 31 Error-Resilience vs. Compression Efficiency Foreman sequence coded at PSNR=35.9 dB (H.26L TML8.5, 30 fps, 270 frames) INTRA Coding mode Increased error-resilience Decreased compression efficiency Rate (Kbps)

Liang: Robust Low-Latency Voice and Video Communication 32 Determine R-D Optimized Coding Modes Select the prediction mode that minimizes the R-D cost … Long-Term Memory V P2 : (R 2, D 2 ) … PV: (R V, D V ) I : (R , D  ) … P1 : (R 1, D 1 ) v : coding mode

Liang: Robust Low-Latency Voice and Video Communication 33 Estimation of Distortion 1-p p p p … p D 11, p 11 =(1-p) 3 D 12, p 12 =(1-p) 2 p D 18, p 18 =p 3 ………… P1 D 21, p 21 =(1-p) 2 1-p pD 22, p 22 =(1-p)p D 23, p 23 =p(1-p) D 24, p 24 =p 2 P2 n-3 n-2 n-1 n Channel feedback utilized at the source coder

Liang: Robust Low-Latency Voice and Video Communication 34 Experimental Results Comparing 1.Rate-distortion optimized dependency management 2.Simple P-I 1 2 IPPPPP …

Liang: Robust Low-Latency Voice and Video Communication 35 R-D Performance (1) No retransmission; no algorithm delay channel loss rate=10% 1.2dB 36%

Liang: Robust Low-Latency Voice and Video Communication 36 R-D Performance (2) No retransmission; no algorithm delay channel loss rate=10%

Liang: Robust Low-Latency Voice and Video Communication 37 R-D Performance (3) Bitrate 200 Kbps, various channel loss rates

Liang: Robust Low-Latency Voice and Video Communication 38 Video Demo (1) R-D optimizedSimple P-I Foreman, 109Kbps, 10% channel loss No retransmission; no algorithm delay

Liang: Robust Low-Latency Voice and Video Communication 39 Video Demo (2) R-D optimized Simple P-I Mother-Daughter, 318Kbps, 10% channel loss No retransmission; no algorithm delay

Liang: Robust Low-Latency Voice and Video Communication 40 Summary Network-Adaptive Packet Dependency Management R-D optimization improves the tradeoff between error-resilience and compression efficiency Eliminated the need for packet retransmission; achieved VoIP-like low latency

Liang: Robust Low-Latency Voice and Video Communication 41 Summary of Contributions Server Client Packet network I. Client side II. Transport III. Network- adaptive coding I.Client side Adaptive playout scheduling that reduces latency and packet loss II.Transport Packet path diversity that further reduces communication delay and distortion III.Network-adaptive coding A video communication system that requires no packet retransmission, which allows VoIP-like low-latency

Liang: Robust Low-Latency Voice and Video Communication 42 Other Contributions Other contributions not covered in this presentation A low-latency loss concealment scheme Packet path diversity for robust low-latency video communication A layered coding structure to avoid mismatch error for streaming of pre-coded video An accurate model to quantify video distortion as a result of packet losses A prescient scheme that optimizes the dependency for a group of packets for video streaming

Liang: Robust Low-Latency Voice and Video Communication 43 Publications Journal publications: 3 IEEE Transactions on Multimedia Journal of Wireless Communication and Mobile Computing IEEE Transactions on Circuits and Systems for Video Technology Invited papers : 4 Papers in conference proceedings: 8 Proceedings ACM Multimedia (SigMM) …

Liang: Robust Low-Latency Voice and Video Communication 44 Media Delivery over IP Networks Internet

Liang: Robust Low-Latency Voice and Video Communication 45 Low-Latency Media Communication

Liang: Robust Low-Latency Voice and Video Communication 46 Acknowledgements Committee members, EE faculty My family members Our sponsors IVMS group members and alumni, and assistants Many friends, in ISL, EE, and Stanford

Liang: Robust Low-Latency Voice and Video Communication 47 Backup Slides The following backup slides may or may not be used …

Liang: Robust Low-Latency Voice and Video Communication 48 Determine the Playout Schedule Delay (ms) Percentage Delay histogram

Liang: Robust Low-Latency Voice and Video Communication 49 Likelihood Ratio Factor [Gibbon, Little, ‘96]

Liang: Robust Low-Latency Voice and Video Communication 50 More Samples for Time-Scale Modification Audio scaling OriginalExpanded by 20%Compressed by 20%

Liang: Robust Low-Latency Voice and Video Communication 51 Low-Latency Loss Concealment Earlier work [Stenger ‘96] Algorithm delay reduced to one packet time Nicely integrates into adaptive playout system 20% random packet loss: Original: Loss: Concealed: i-2i-1i+1i+2 i-2i+2 time i lost i-1 i+1 L LL 2L2L 1.3L Alignment found by correlation

Liang: Robust Low-Latency Voice and Video Communication 52 Speech Samples Alg.Loss rate MOS Alg. 210%2.6 Alg. 34%3.7 Original4.4

Liang: Robust Low-Latency Voice and Video Communication 53 Overall Performance Stanford -> 1.Chicago 2.Germany 3.MIT 4.China 12 34

Liang: Robust Low-Latency Voice and Video Communication 54 Multi-Stream Playout Scheduling  Time Sending on path 1 Receiving on path 1 Playout Sending on path 2 Receiving on path 2 Packet path diversity reduces effective delay jitter and therefore late loss rate

Liang: Robust Low-Latency Voice and Video Communication 55 Path Diversity – Voice Demo Original  Average total end-to-end delay: 84 ms  Error concealment: speech segment repetition  Average total end-to-end delay: 84 ms  Error concealment: speech segment repetition Path DiversitySingle-stream with FEC at same data rate

Liang: Robust Low-Latency Voice and Video Communication 56 More Experiment Results Results obtained by varying 2 while keeping 1 fixed With higher delay: better chances to play both descriptions Observed lower playout rate variation by using multiple streams Jitter averaged; lower STD of min(d i, d j )

Liang: Robust Low-Latency Voice and Video Communication 57 PESQ Results  Perceptual Evaluation of Speech Quality (ITU-T Rec. P.862, Feb. 2001)  PESQ can be used for end-to-end quality assessment  Ranges from –0.5 to 4.5 but usually produces MOS-like scores between 1.0 and 4.5

Liang: Robust Low-Latency Voice and Video Communication 58 Internet Experiment (2) VBNS IP Backbone Service DANTE Operations UUNE T Tech. Erlangen Harvard (7ms) (40ms) AT&T (5ms) (10ms) New Jersey  Path 1 (direct): N. J. – Erlangen  Path 2 (alternative): N. J. – Harvard – Erlangen

Liang: Robust Low-Latency Voice and Video Communication 59 Results (2) Path 1 (direct): N. J. – Germany Path 2 (alternative): N. J. – Harvard – Germany Mean delay 61.3/65.0 ms link loss 0.6% / 1.1% Significant reduction of late loss and end-to-end delay by packet path diversity

Liang: Robust Low-Latency Voice and Video Communication 60 Video Streaming Using Path Diversity Path 1 Path 2 n-5 n-4 n-3 n-2 n-1 n Next frame to encode and send: n Goal Minimize distortion under rate constraint (1) Path selection to minimize the loss probability of frame n and maximize the benefit of path diversity Alternate when both channels are good Send small probe packets over the channel in bad state [Setton, Liang, Girod, ICME’03, submitted] (2) Source coding

Liang: Robust Low-Latency Voice and Video Communication 61 Determine Prediction Mode Long-Term Memory V=5 n-5 n-4 n-3 n-2 n-1 n Prediction modes: v =1, 2, … V, I V=1 V=2 V=3 V=5 Path 1 Path 2

Liang: Robust Low-Latency Voice and Video Communication 62 Results (1) Channel loss rate_1 =loss rate 2 =15% Avg burst len=8 Feedback delay=6 Comparing to RPS-NACK [Lin, ICME’01] Video redundancy coding (VRC) [H.263++]

Liang: Robust Low-Latency Voice and Video Communication 63 Results (2) Channel loss rate_1 =loss rate 2 =15% Avg burst len=8 Feedback delay=6

Liang: Robust Low-Latency Voice and Video Communication 64 Path Diversity Gain with Shared Link

Liang: Robust Low-Latency Voice and Video Communication 65 TCP-Friendly Streaming [Mahdavi, Floyd, ‘97] [Floyd, Handley, Padhye, Widmer, ‘00]

Liang: Robust Low-Latency Voice and Video Communication 66 Long-Term Memory Prediction and Packet Dependency To manage prediction dependency Long-term Memory (LTM) prediction on macroblock level [Wiegand, Zhang, Färber, Girod, ’99, ‘00] Reference Picture Selection (RPS) [Annex N H.263+, Annex U H.263++, H.26L] NEWPRED [ISO/IEC MPEG-4] NACK

Liang: Robust Low-Latency Voice and Video Communication 67 R-D Optimization [H.26L TML 8.5] [Wiegand, Girod, ICIP’01 ]

Liang: Robust Low-Latency Voice and Video Communication 68 Dynamic PSNRs

Liang: Robust Low-Latency Voice and Video Communication 69 Streaming of Pre-Encoded Media Media pre-coded and pre-stored offline Bit-stream assembly at streaming times Pre-coded content benefits large number of users One potential problem …

Liang: Robust Low-Latency Voice and Video Communication 70 Potential Mismatch Error Transmitted PPPIPP … IIIIII … S1 S2 Encoded PPPPPP … Previous schemes using S-frame [Färber, ICIP’97 ], SP-frame [H. 26L] alleviate or solve the problem at the cost of higher bitrate Decoded PPPIPP … Mismatch

Liang: Robust Low-Latency Voice and Video Communication 71 Layered Coding Structure for Bitstream Assembly P5P5P5P5 LAYER II IP5P5P5P5 IP5P5P5… IP5P5… V=5 SYNC-frames: allow switching LAYER I I T GOP =25 P5 LAYER III

Liang: Robust Low-Latency Voice and Video Communication 72 P-I for Comparison I P P P P P P P P P I P … I P P P P P P P P P I …

Liang: Robust Low-Latency Voice and Video Communication 73 R-D Performance (1) No retransmission; no algorithm delay channel loss rate=10% 1.2dB 36%

Liang: Robust Low-Latency Voice and Video Communication 74 R-D Performance (2) No retransmission; no algorithm delay channel loss rate=10%

Liang: Robust Low-Latency Voice and Video Communication 75 R-D Performance (3) Bitrate 200 Kbps, various channel loss rates

Liang: Robust Low-Latency Voice and Video Communication 76 Cost of Error-Resilience (1) Error-resilience / low-latency is not free PSNR (dB) Bitrate increase for 5% loss Bitrate increase for 10% loss %39% %43% %35% Distortion at the encoder

Liang: Robust Low-Latency Voice and Video Communication 77 Cost of Error-Resilience (2) PSNR (dB)Bitrate increase for 5% loss Bitrate increase for 10% loss %52% %45% %46% %40% Distortion at the encoder

Liang: Robust Low-Latency Voice and Video Communication 78 Cost of Layered Coding Structure (1) 23% 25% 30% Lossless channel 32%

Liang: Robust Low-Latency Voice and Video Communication 79 Cost of Layered Coding Structure (2) Channel loss rate=5%

Liang: Robust Low-Latency Voice and Video Communication 80 Comparing Different Error-Resilience Schemes LatencyR-D costResilience to burst loss ARQHighLow FECMedium-lowMedium-highMedium-low, depending on delay Dependency control Very lowMedium-highHigh