Low Bit Rate H.263 + Video Coding: Efficiency, Scalability and Error Resilience Faouzi Kossentini Signal Processing and Multimedia Group Department of.

Slides:



Advertisements
Similar presentations
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
Advertisements

2005/01/191/14 Overview of Fine Granularity Scalability in MPEG-4 Video Standard Weiping Li Fellow, IEEE IEEE Transactions on Circuits and Systems for.
MPEG4 Natural Video Coding Functionalities: –Coding of arbitrary shaped objects –Efficient compression of video and images over wide range of bit rates.
A Performance Analysis of the ITU-T Draft H.26L Video Coding Standard Anthony Joch, Faouzi Kossentini, Panos Nasiopoulos Packetvideo Workshop 2002 Department.
Basics of MPEG Picture sizes: up to 4095 x 4095 Most algorithms are for the CCIR 601 format for video frames Y-Cb-Cr color space NTSC: 525 lines per frame.
-1/20- MPEG 4, H.264 Compression Standards Presented by Dukhyun Chang
Error Resilience for MPEG-4 Environment Nimrod Peleg Nov
Technion - IIT Dept. of Electrical Engineering Signal and Image Processing lab Transrating and Transcoding of Coded Video Signals David Malah Ran Bar-Sella.
VIPER DSPS 1998 Slide 1 A DSP Solution to Error Concealment in Digital Video Eduardo Asbun and Edward J. Delp Video and Image Processing Laboratory (VIPER)
Source Coding for Video Application
SWE 423: Multimedia Systems
H.264/AVC Baseline Profile Decoder Complexity Analysis Michael Horowitz, Anthony Joch, Faouzi Kossentini, and Antti Hallapuro IEEE TRANSACTIONS ON CIRCUITS.
SCHOOL OF COMPUTING SCIENCE SIMON FRASER UNIVERSITY CMPT 820 : Error Mitigation Schaar and Chou, Multimedia over IP and Wireless Networks: Compression,
Recursive End-to-end Distortion Estimation with Model-based Cross-correlation Approximation Hua Yang, Kenneth Rose Signal Compression Lab University of.
Overview of the H.264/AVC Video Coding Standard
H.264/Advanced Video Coding – A New Standard Song Jiqiang Oct 21, 2003.
An Error-Resilient GOP Structure for Robust Video Transmission Tao Fang, Lap-Pui Chau Electrical and Electronic Engineering, Nanyan Techonological University.
Rate-Distortion Optimized Layered Coding with Unequal Error Protection for Robust Internet Video Michael Gallant, Member, IEEE, and Faouzi Kossentini,
Overview of Fine Granularity Scalability in MPEG-4 Video Standard Weiping Li, Fellow, IEEE.
H.264 / MPEG-4 Part 10 Nimrod Peleg March 2003.
Scalable Wavelet Video Coding Using Aliasing- Reduced Hierarchical Motion Compensation Xuguang Yang, Member, IEEE, and Kannan Ramchandran, Member, IEEE.
School of Computing Science Simon Fraser University
Error Resilience in a Generic Compressed Video Stream Transmitted over a Wireless Channel Muhammad Bilal
Source-Channel Prediction in Error Resilient Video Coding Hua Yang and Kenneth Rose Signal Compression Laboratory ECE Department University of California,
Rate-Distortion Optimized Motion Estimation for Error Resilient Video Coding Hua Yang and Kenneth Rose Signal Compression Lab ECE Department University.
Fundamentals of Multimedia Chapter 11 MPEG Video Coding I MPEG-1 and 2
ITU-T Recommendation H.261
H.264/AVC for Wireless Applications Thomas Stockhammer, and Thomas Wiegand Institute for Communications Engineering, Munich University of Technology, Germany.
4/24/2002SCL UCSB1 Optimal End-to-end Distortion Estimation for Drift Management in Scalable Video Coding H. Yang, R. Zhang and K. Rose Signal Compression.
Error Resilience of Video Transmission By Rate-Distortion Optimization and Adaptive Packetization Yuxin Liu, Paul Salama and Edwad Delp ICME 2002.
09/24/02ICIP20021 Drift Management and Adaptive Bit Rate Allocation in Scalable Video Coding H. Yang, R. Zhang and K. Rose Signal Compression Lab ECE Department.
A Concealment Method for Shape Information in MPEG-4 Coded Video Sequences Shahram Shirani, Berna Erol, and Faouzi Kossentini IEEE TRANSACTIONS ON MULTIMEDIA,
An Introduction to H.264/AVC and 3D Video Coding.
January 26, Nick Feamster Development of a Transcoding Algorithm from MPEG to H.263.
MPEG-2 Digital Video Coding Standard
Kai-Chao Yang Hierarchical Prediction Structures in H.264/AVC.
MPEG-2 Standard By Rigoberto Fernandez. MPEG Standards MPEG (Moving Pictures Experts Group) is a group of people that meet under ISO (International Standards.
Electrical Engineering National Central University Video-Audio Processing Laboratory Data Error in (Networked) Video M.K.Tsai 04 / 08 / 2003.
 Coding efficiency/Compression ratio:  The loss of information or distortion measure:
Page 19/15/2015 CSE 40373/60373: Multimedia Systems 11.1 MPEG 1 and 2  MPEG: Moving Pictures Experts Group for the development of digital video  It is.
Video Coding. Introduction Video Coding The objective of video coding is to compress moving images. The MPEG (Moving Picture Experts Group) and H.26X.
Adaptive Multi-path Prediction for Error Resilient H.264 Coding Xiaosong Zhou, C.-C. Jay Kuo University of Southern California Multimedia Signal Processing.
Chapter 11 MPEG Video Coding I — MPEG-1 and 2
8. 1 MPEG MPEG is Moving Picture Experts Group On 1992 MPEG-1 was the standard, but was replaced only a year after by MPEG-2. Nowadays, MPEG-2 is gradually.
Compression video overview 演講者:林崇元. Outline Introduction Fundamentals of video compression Picture type Signal quality measure Video encoder and decoder.
- By Naveen Siddaraju - Under the guidance of Dr K R Rao Study and comparison between H.264.
Watermarking Part 2: Future Work Electrical and Computer Engineering Department Villanova University 18 August 2004 Robert J. Berger II Michael P. Marcinak.
Image Processing Architecture, © 2001, 2002 Oleh TretiakPage 1Lecture 15 ECEC-453 Image Processing Architecture 3/11/2004 Exam Review Oleh Tretiak Drexel.
Rate-distortion Optimized Mode Selection Based on Multi-channel Realizations Markus Gärtner Davide Bertozzi Classroom Presentation 13 th March 2001.
Image Processing Architecture, © Oleh TretiakPage 1Lecture 10 ECEC 453 Image Processing Architecture Lecture 10, 2/17/2004 MPEG-2, Industrial.
MPEG-4: Multimedia Coding Standard Supporting Mobile Multimedia System -MPEG-4 Natural Video Coding April, 2001.
Fast motion estimation and mode decision for H.264 video coding in packet loss environment Li Liu, Xinhua Zhuang Computer Science Department, University.
Chapter 10 Basic Video Compression Techniques 10.1 Introduction to Video Compression 10.2 Video Compression with Motion Compensation 10.3 Search for Motion.
Block-based coding Multimedia Systems and Standards S2 IF Telkom University.
Video Compression and Standards
Overview of Digital Video Compression Multimedia Systems and Standards S2 IF Telkom University.
COMPARATIVE STUDY OF HEVC and H.264 INTRA FRAME CODING AND JPEG2000 BY Under the Guidance of Harshdeep Brahmasury Jain Dr. K. R. RAO ID MS Electrical.
1 Yu Liu 1, Feng Wu 2 and King Ngi Ngan 1 1 Department of Electronic Engineering, The Chinese University of Hong Kong 2 Microsoft Research Asia, Beijing,
Fundamentals of Multimedia Chapter 17 Wireless Networks 건국대학교 인터넷미디어공학부 임 창 훈.
Motion Estimation Multimedia Systems and Standards S2 IF Telkom University.
Introduction to MPEG Video Coding Dr. S. M. N. Arosha Senanayake, Senior Member/IEEE Associate Professor in Artificial Intelligence Room No: M2.06
Principles of Video Compression Dr. S. M. N. Arosha Senanayake, Senior Member/IEEE Associate Professor in Artificial Intelligence Room No: M2.06
H. 261 Video Compression Techniques 1. H.261  H.261: An earlier digital video compression standard, its principle of MC-based compression is retained.
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
Present by 楊信弘 Advisor: 鄭芳炫
CSI-447: Multimedia Systems
VLIW DSP vs. SuperScalar Implementation of a Baseline H.263 Encoder
Standards Presentation ECE 8873 – Data Compression and Modeling
MPEG4 Natural Video Coding
Presentation transcript:

Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience Faouzi Kossentini Signal Processing and Multimedia Group Department of Electrical and Computer Engineering University of British Columbia

Outline  Introduction: Low bit rate video coding  The H.263/H.263+ standards and their optional modes  Efficiency: Performance and complexity of individual modes and combinations of modes  Efficiency: Real-time software-only encoding  Efficiency: Rate-distortion optimized video coding  Scalability: Description & Characteristics  Scalability: Rate-distortion optimized framework  Error resilience: Synchronization & error concealment  Error resilience: Multiple description video coding  Conclusions: H.263/H.263+, MPEG-4 and research directions Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 2 Copyright (C) 1998 Faouzi Kossentini

3 H.263 version 2 (H.263+): H.263 version 2 (H.263+): Higher coding efficiency, more flexibility, scalability support, error resilience support Video coding standards:  ITU-T H.261(1990), H.262 (1994), H.263 (1995), H.263+ (1998)  ISO/IEC MPEG1 (1992), MPEG2 (1994), MPEG4 (1999) Low Bit Rate Video coding Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience Video coding algorithms:  Waveform based coding: MC+DCT/wavelets, 3D subband, etc.  Object- and model-based coding : shape coding, wireframes, etc. Why?: Why?: Increasing demand for video conferencing and telephony applications, limited bandwidth in PSTN and wireless networks Copyright (C) 1998 Faouzi Kossentini

The H.263 Standard Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 4 Copyright (C) 1998 Faouzi Kossentini

5 The H.263 Standard Structure  Inter picture prediction to reduce temporal redundancies  DCT coding to reduce spatial redundancies in difference frames Major enhancements to H.261  More standardized input picture formats  Half pixel motion compensation  Optimized VLC tables  Better motion vector prediction  Four (optional) negotiable modes Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience Copyright (C) 1998 Faouzi Kossentini

6 Unrestricted Motion Vector (UMV) mode, Annex D: Motion vectors (MVs) allowed to point outside the picture, MV range extended to ±31.5 Syntax-based Arithmetic Coding (SAC) mode, Annex E : SAC used instead of VLCs Advanced Prediction (AP) mode, Annex F: Four MVs per macroblock, over-lapped motion compensation, MVs allowed to point outside the picture area PB-frames (PB) mode, Annex G: A Frame with a P-picture and a B- picture used as a unit Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience The H.263 Standard: Optional Modes Copyright (C) 1998 Faouzi Kossentini

The H.263+ Standard: Optional Modes Unrestricted Motion Vector (UMV) mode, Annex D: MV range extended to ±256 depending on the picture size, reversible VLCs used for MVs Advanced Intra Coding (AIC) mode, Annex I: Inter block prediction from neighboring intra coded blocks, modified quantization, optimized VLCs Deblocking Filter (DF) mode, Annex J: Deblocking filter inside the coding loop, four motion vectors per macroblock, MVs outside picture boundaries Slice Structured (SS) mode, Annex K: Slices used instead of GOBs Supplemental Enhancement Information (SEI) mode, Annex L: Supplemental information included in the stream to offer display capabilities Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 7 Copyright (C) 1998 Faouzi Kossentini

Improved PB-frames (IBP) mode, Annex M: Forward, backward and bi-directional prediction supported, delta vector not transmitted Reference Picture Selection (RPS) mode, Annex N: Reference picture selected for prediction to suppress temporal error propagation Temporal, SNR, and Spatial Scalability mode, Annex O: Syntax to support temporal, SNR, and spatial scalability Reference Picture Resampling (RPR) mode, Annex P: Warping of the reference picture prior to its use for prediction Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 8 The H.263+ Standard: Optional Modes Copyright (C) 1998 Faouzi Kossentini

The H.263+ Standard: Optional Modes Reduced Resolution Update (RRU) mode, Annex Q: Encoder allowed to send update information for a picture encoded at a lower resolution, while still maintaining a higher resolution for the reference picture Independently Segmented Decoding (ISD) mode, Annex R: Dependencies across the segment boundaries not allowed Alternative Inter VLC (AIV) mode, Annex S: The intra VLC table designed for encoding quantized intra DCT coefficients in the AIC mode used for inter coding Modified Quantization (MQ) mode, Annex T: Quantizer allowed to change at the macroblock layer, finer chrominance quantization employed, range of representable quantized DCT coefficients extended to [-127, +127] Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 9 Copyright (C) 1998 Faouzi Kossentini

Performance & Complexity: Compression efficiency for intra macroblocks increased, encoding time increased by ~5%, required memory slightly increased Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 10 Efficiency: Performance & Complexity of Individual Modes Advanced Intra Coding (AIC) mode, Annex I: Inter block prediction from neighboring intra coded blocks, modified quantization, optimized VLCs Copyright (C) 1998 Faouzi Kossentini

(a)(b) Without (a) and with (b) DF mode set on Performance: Subjective quality improved (blocking & mosquito artifacts removed), 5-10% additional encoding time Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 11 Efficiency: Performance & Complexity of Individual Modes Deblocking Filter (DF) mode, Annex J: Deblocking filter inside the coding loop, four motion vectors per macroblock, MVs outside picture boundaries Copyright (C) 1998 Faouzi Kossentini

Performance: Picture rate doubled, complexity increased, more memory required, one frame delay incurred Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 12 Efficiency: Performance & Complexity of Individual Modes Improved PB-frames (IBP) mode, Annex M: Forward, backward and bi-directional prediction supported, delta vector not transmitted Copyright (C) 1998 Faouzi Kossentini

Performance: Useful at high bit rates, bit savings of as much as 10% achieved, less than 2% additional encoding/decoding time required Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 13 Efficiency: Performance & Complexity of Individual Modes Alternative Inter VLC (AIV) mode, Annex S: The intra VLC table designed for encoding quantized intra DCT coefficients in the AIC mode used for inter coding Copyright (C) 1998 Faouzi Kossentini

Performance: Chrominance PSNR increased substantially at low bit rates, more flexible rate control supported, very little complexity and computation time added to the coder Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 14 Efficiency: Performance & Complexity of Individual Modes Modified Quantization (MQ) mode, Annex T: Quantizer allowed to change at the macroblock layer, finer chrominance quantization employed, range of representable quantized DCT coefficients extended to [-127, +127] Copyright (C) 1998 Faouzi Kossentini

H.263 PSNR = 32.44H.263+ PSNR = H.263+ modes: AIC, MQ, UMV, DF, AP H.261 PSNR = Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 15 Efficiency: Performance & Complexity of Combinations of Modes Copyright (C) 1998 Faouzi Kossentini

 Required for real-time video telephony and conferencing: encoding at a minimum of 8-10 fps for QCIF resolution Efficiency: Real-time Software-only Encoding Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 16 Copyright (C) 1998 Faouzi Kossentini

 Algorithmic approach: Developing efficient platform- independent video coding algorithms  Hardware dependent approach: Mapping SIMD oriented coding components onto Intel’s MMX architecture Efficiency: Real-time Software-only Encoding Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience Copyright (C) 1998 Faouzi Kossentini 17

18 Efficiency: Real-time Software-only Encoding Copyright (C) 1998 Faouzi Kossentini Efficient Video Coding Algorithms:  IDCT for blocks with all or most of the coefficients equal to zero avoided or efficiently computed (respectively)  SAD for most macroblocks computed efficiently via prediction  DCT & Quantization for most blocks avoided via prediction  Linear approximation methods used for half-pixel ME  Quantization: look-up tables employed Some Image and Video Processing Research the UBC Signal Processing and Multimedia Laboratory

Intel’s MMX Architecture:  Features :  SIMD structure  Four new data types  57 new instructions  Saturation arithmetic Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 19 Efficiency: Real-time Software-only Encoding Copyright (C) 1998 Faouzi Kossentini

MMX Mapping of SAD Computations:  Saturation arithmetic is used to perform absolute difference operations  Result: 4-5 times faster SAD computations  SAD computation function implemented in MMX for 16x16 blocks (baseline coding) and 8x8 blocks (optional modes) Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 20 Efficiency: Real-time Software-only Encoding Copyright (C) 1998 Faouzi Kossentini

DCT MMX Implementation:  Floating point arithmetic to fixed point arithmetic  Four rows processed at a time  Results: 4 times faster using fixed point arithmetic, 3 times faster using MMX as compared to the integer “C” implementation Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 21 Efficiency: Real-time Software-only Encoding Other MMX Implementations: Other MMX Implementations: Interleaved transfers for half pixel ME (7x), interp. (3x), motion comp. (2x), block data transfers (2x) Copyright (C) 1998 Faouzi Kossentini

Overall Performance:  fps baseline H.263+ encoding with fast ME  8-10 fps with all of the H.263+ optional modes  More than a 100% increase in speed using the efficient algorithms and more than a 25% additional increase in speed using MMX Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 22 Copyright (C) 1998 Faouzi Kossentini

Motivation:  Best possible tradeoffs between quality (distortion) and bit rate obtained via Rate-Distortion (RD) optimization, through the use of the Lagrangian cost measure Solution:  Motion vector selected that yields the best rate-quality tradeoff  Coding mode (Intra, Inter16, Inter8, Skipped) selected that yields the best rate-quality tradeoff based on the already determined motion vectors Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 23 Efficiency: Rate-Distortion Optimized Video Coding Copyright (C) 1998 Faouzi Kossentini

Result:  % savings in bit rate obtained using the RD optimized coder for most sequences over our reference software Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 24 Efficiency: Rate-Distortion Optimized Video Coding Copyright (C) 1998 Faouzi Kossentini

Description:  Multi-representation coding at different quality and resolution levels allowed  Three general types (SNR, spatial, temporal) of scalability supported Characteristics:  Each layer built incrementally on its previous layer, increasing the decoded picture quality, resolution, or both  Only the first (base) layer independently decoded  Information from temporally previous pictures in the same layer or temporally simultaneous pictures in a lower layer used Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 25 Scalability: Description & Characteristics Copyright (C) 1998 Faouzi Kossentini

Example:  Enhancement layer 1: Example of SNR scalability  Enhancement layer 2: Example of spatial and temporal scalability Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 26 Scalability: Description & Characteristics Copyright (C) 1998 Faouzi Kossentini

Motivation:  Compression efficiency of a higher layer dependent on that of all preceding (lower) layers  Higher compression efficiency and more flexible joint rate-distortion control achieved via an RD optimized framework Current research direction:  Employ (1) multiple rate constraints (explicit for each layer) OR (2) a single rate constraint with explicit distortion constraints for each layer.  Apply RD optimization to temporal, spatial and SNR scalable coding algorithms Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 27 Scalability: RD Optimized Framework Copyright (C) 1998 Faouzi Kossentini

Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 28 Error Resilience: Synchronization & Error Concealment Use of Synchronization Markers:  Different synchronization markers (between GOBs, slices, etc.) used for higher error resilience Error concealment:  Missing motion vectors for a macroblock received in error recovered from neighboring macroblocks (via spatial methods)  Texture information from the previous frame is compensated using the recovered motion vectors Copyright (C) 1998 Faouzi Kossentini

Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 29 Experimental Results:  Packetization: one packet consists of a complete GOB.  Using synchronization markers and error concealment yields PSNR gains of as much as 9 dB. Error Resilience: Synchronization & Error Concealment Copyright (C) 1998 Faouzi Kossentini

Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience 30 Use of Reference Picture Selection Mode: Use of different pictures (or picture segments) for prediction allowed by H.263+, hence multiple description video coding Multiple Description:  Different descriptions of the source sent on different channels, may be available at the decoder  Redundancy among descriptions minimized  Useful even if only one description is available Temporal Reference for Prediction:  Different temporal reference pictures employed for prediction of different pictures  Error propagation confined to those pictures dependent on the missing reference Error Resilience: Multiple Description Video Coding Copyright (C) 1998 Faouzi Kossentini

H.263+ versus H.263:  H.263+ backward compatible with the H.263 (same baseline)  Efficient ways provided by H.263+ of trading additional complexity for more compression efficiency, scalability, resilience and flexibility H.263/H.263+ versus MPEG-4:  H.263 output decodable by MPEG-4, H.263+/MPEG-4 sharing tools  Although somewhat available via chroma keying in H.263+, object- based functionality better provided by MPEG-4  H.263/H.263+ public-domain implementation: Future directions:  Still higher coding efficiency and better scalability/resilience possible  More object-based tools and capabilities, better (efficient) shape coding Low Bit Rate H Video Coding: Efficiency, Scalability and Error Resilience Copyright (C) 1998 Faouzi Kossentini 31 Conclusions