KIANOOSH MOKHTARIAN SCHOOL OF COMPUTING SCIENCE SIMON FRASER UNIVERSITY 6/24/2007 Overview of the Scalable Video Coding Extension of the H.264/AVC Standard.

Slides:



Advertisements
Similar presentations
AVC Compression Update: FRExt and future Matthew Goldman Vice President of Technology Compression Systems.
Advertisements

Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
2005/01/191/14 Overview of Fine Granularity Scalability in MPEG-4 Video Standard Weiping Li Fellow, IEEE IEEE Transactions on Circuits and Systems for.
MPEG-2 to H.264/AVC Transcoding Techniques Jun Xin Xilient Inc. Cupertino, CA.
A Performance Analysis of the ITU-T Draft H.26L Video Coding Standard Anthony Joch, Faouzi Kossentini, Panos Nasiopoulos Packetvideo Workshop 2002 Department.
Basics of MPEG Picture sizes: up to 4095 x 4095 Most algorithms are for the CCIR 601 format for video frames Y-Cb-Cr color space NTSC: 525 lines per frame.
Scalable ROI Algorithm for H.264/SVC-Based Video Streaming Jung-Hwan Lee and Chuck Yoo, Member, IEEE.
-1/20- MPEG 4, H.264 Compression Standards Presented by Dukhyun Chang
Technion - IIT Dept. of Electrical Engineering Signal and Image Processing lab Transrating and Transcoding of Coded Video Signals David Malah Ran Bar-Sella.
1 Video Coding Concept Kai-Chao Yang. 2 Video Sequence and Picture Video sequence Large amount of temporal redundancy Intra Picture/VOP/Slice (I-Picture)
Efficient Bit Allocation and CTU level Rate Control for HEVC Picture Coding Symposium, 2013, IEEE Junjun Si, Siwei Ma, Wen Gao Insitute of Digital Media,
SCHOOL OF COMPUTING SCIENCE SIMON FRASER UNIVERSITY CMPT 820 : Error Mitigation Schaar and Chou, Multimedia over IP and Wireless Networks: Compression,
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada End-to-End Secure Delivery of Scalable Video Streams Mohamed Hefeeda (Joint.
Overview of the Scalable Video Coding Extension of the H
Overview of the H.264/AVC Video Coding Standard
Video Transmission Adopting Scalable Video Coding over Time- varying Networks Chun-Su Park, Nam-Hyeong Kim, Sang-Hee Park, Goo-Rak Kwon, and Sung-Jea Ko,
Rate-Distortion Optimized Layered Coding with Unequal Error Protection for Robust Internet Video Michael Gallant, Member, IEEE, and Faouzi Kossentini,
Overview of Fine Granularity Scalability in MPEG-4 Video Standard Weiping Li, Fellow, IEEE.
Communication & Multimedia C. -Y. Tsai 2005/8/17 1 MCTF in Current Scalable Video Coding Schemes Student: Chia-Yang Tsai Advisor: Prof. Hsueh-Ming Hang.
Efficient Fine Granularity Scalability Using Adaptive Leaky Factor Yunlong Gao and Lap-Pui Chau, Senior Member, IEEE IEEE TRANSACTIONS ON BROADCASTING,
Block Partitioning Structure in the HEVC Standard
Adaptive Deblocking Filter in H.264 Ehsan Maani Course Project:
Institute of Electronics, National Chiao Tung University Scalable Extension of H.264/AVC Student: Hung-Chih Lin Advisor: Prof. Hsueh-Ming Hang.
Prof. V. M. Gadre Department of Electrical Engineering, IIT Bombay.
An Introduction to H.264/AVC and 3D Video Coding.
1. 1. Problem Statement 2. Overview of H.264/AVC Scalable Extension I. Temporal Scalability II. Spatial Scalability III. Complexity Reduction 3. Previous.
MPEG-2 Digital Video Coding Standard
Overview of the Scalable Video Coding Extension of the H.264/AVC Standard Kai-Chao Yang 12007/8Kai-Chao Yang, NTHU, Taiwan.
Heiko Schwarz, Detlev Marpe, and Thomas Wiegand CSVT, Sept. 2007
EE 5359 H.264 to VC 1 Transcoding Vidhya Vijayakumar Multimedia Processing Lab MSEE, University of Arlington Guided.
3D EXTENSION of HEVC: Multi-View plus Depth Parashar Nayana Karunakar Student Id: Department of Electrical Engineering.
3D EXTENSION of HEVC: Multi-View plus Depth Parashar Nayana Karunakar Student Id: Department of Electrical Engineering.
Kai-Chao Yang Hierarchical Prediction Structures in H.264/AVC.
-1/20- Scalable Video Coding Scalable Extension of H.264 / AVC.
Philipp Merkle, Aljoscha Smolic Karsten Müller, Thomas Wiegand CSVT 2007.
Farid Molazem Network Systems Lab Simon Fraser University Scalable Video Transmission for MobileTV.
Page 19/15/2015 CSE 40373/60373: Multimedia Systems 11.1 MPEG 1 and 2  MPEG: Moving Pictures Experts Group for the development of digital video  It is.
MPEG-1 and MPEG-2 Digital Video Coding Standards Author: Thomas Sikora Presenter: Chaojun Liang.
Profiles and levelstMyn1 Profiles and levels MPEG-2 is intended to be generic, supporting a diverse range of applications Different algorithmic elements.
Outline JVT/H.26L: History, Goals, Applications, Structure
FEC and RDO in SVC Thomas Wiegand 1. Outline Introduction SVC Bit-Stream Raptor Codes Layer-Aware FEC Simulation Results Linear Signal Model Description.
Adaptive Multi-path Prediction for Error Resilient H.264 Coding Xiaosong Zhou, C.-C. Jay Kuo University of Southern California Multimedia Signal Processing.
- By Naveen Siddaraju - Under the guidance of Dr K R Rao Study and comparison of H.264/MPEG4.
June, 1999 An Introduction to MPEG School of Computer Science, University of Central Florida, VLSI and M-5 Research Group Tao.
Scalable Video Coding Prof. V. M. Gadre Department of Electrical Engineering, IIT Bombay.
Power saving control for the mobile DVB-H receivers based on H.264/SVC standard Eugeny Belyaev, Vitaly Grinko, Ann Ukhanova Saint-Petersburg State University.
Compression video overview 演講者:林崇元. Outline Introduction Fundamentals of video compression Picture type Signal quality measure Video encoder and decoder.
- By Naveen Siddaraju - Under the guidance of Dr K R Rao Study and comparison between H.264.
Rate-distortion Optimized Mode Selection Based on Multi-channel Realizations Markus Gärtner Davide Bertozzi Classroom Presentation 13 th March 2001.
Figure 1.a AVS China encoder [3] Video Bit stream.
Guillaume Laroche, Joel Jung, Beatrice Pesquet-Popescu CSVT
Scalable Video Coding and Transport Over Broad-band wireless networks Authors: D. Wu, Y. Hou, and Y.-Q. Zhang Source: Proceedings of the IEEE, Volume:
High-efficiency video coding: tools and complexity Oct
Fine Granularity Scalability in MPEG-4 Video by Weiping Li Presentation by Warren Cheung.
Overview of Fine Granularity Scalability in MPEG-4 Video Standard Weiping Li Presented by : Brian Eriksson.
Video Compression—From Concepts to the H.264/AVC Standard
Video Compression and Standards
Flow Control in Compressed Video Communications #2 Multimedia Systems and Standards S2 IF ITTelkom.
COMPARATIVE STUDY OF HEVC and H.264 INTRA FRAME CODING AND JPEG2000 BY Under the Guidance of Harshdeep Brahmasury Jain Dr. K. R. RAO ID MS Electrical.
Introduction to MPEG Video Coding Dr. S. M. N. Arosha Senanayake, Senior Member/IEEE Associate Professor in Artificial Intelligence Room No: M2.06
Implementation and comparison study of H.264 and AVS china EE 5359 Multimedia Processing Spring 2012 Guidance : Prof K R Rao Pavan Kumar Reddy Gajjala.
MPEG Video Coding I: MPEG-1 1. Overview  MPEG: Moving Pictures Experts Group, established in 1988 for the development of digital video.  It is appropriately.
Computational Controlled Mode Selection for H.264/AVC June Computational Controlled Mode Selection for H.264/AVC Ariel Kit & Amir Nusboim Supervised.
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
Quality Evaluation and Comparison of SVC Encoders
Overview of the Scalable Video Coding
Research Topic Error Concealment Techniques in H.264/AVC for Wireless Video Transmission Vineeth Shetty Kolkeri EE Graduate,UTA.
Viewport-based 360 Video Streaming:
Viewport-based 360 Video Streaming:
MPEG4 Natural Video Coding
Presentation transcript:

KIANOOSH MOKHTARIAN SCHOOL OF COMPUTING SCIENCE SIMON FRASER UNIVERSITY 6/24/2007 Overview of the Scalable Video Coding Extension of the H.264/AVC Standard

Motivation High heterogeneity among receivers  Connection quality  Display resolution  Processing power

Motivation High heterogeneity among receivers  Connection quality  Display resolution  Processing power Simulcasting

Motivation High heterogeneity among receivers  Connection quality  Display resolution  Processing power Simulcasting Transcoding

Motivation High heterogeneity among receivers  Connection quality  Display resolution  Processing power Simulcasting Transcoding Scalability  H.262|MPEG-2, H.263, MPEG-4 Visual

Overview Background Temporal scalability Spatial scalability Quality scalability Conclusion

Background Scalability  Temporal  Spatial  Quality (fidelity or SNR)  Object-based and region-of-interest  Hybrid

Background Scalability  Temporal  Spatial  Quality (fidelity or SNR)  Object-based and region-of-interest  Hybrid Applications  Encode once, decode many ways  Unequal importance + unequal error protection  Archiving in surveillance applications

Background Requirements for a scalable video coding technique  Similar coding efficiency to single-layer coding  Little increase in decoding complexity  Support of temporal, spatial, quality scalability  Backward compatibility of the base layer  Support of simple bitstream adaptations after encoding

Overview Background Temporal scalability Spatial scalability Quality scalability Conclusion

Temporal Scalability Enabled by restricting motion-compensated prediction

Temporal Scalability Enabled by restricting motion-compensated prediction  Already provided by H.264/AVC

Temporal Scalability Enabled by restricting motion-compensated prediction  Already provided by H.264/AVC Hierarchical prediction structure  Pictures of temporal enhancement layers: typically B-pictures  Group of Pictures (GoP)

Temporal Scalability: Hierarchical Pred’ Struct’ Dyadic temporal enhancement layers

Temporal Scalability: Hierarchical Pred’ Struct’ Non-dyadic case

Temporal Scalability: Hierarchical Pred’ Struct’ Other flexibilities  Multiple reference picture concept of H.264/AVC  Reference picture can be in the same layer as the target frame  Hierarchical prediction structure can be modified over time

Temporal Scalability: Hierarchical Pred’ Struct’ Adjusting the structural delay

Temporal Scalability: Coding Efficiency Highly dependent on quantization parameters  Intuitively, higher fidelity for the temporal base layer pictures How to choose QPs  Expensive rate-distortion analysis  QP T = QP T  High PSNR fluctuations inside a GoP  Subjectively shown to be temporally smooth

Temporal Scalability: Coding Efficiency Dyadic hierarchical B-pictures, no delay constraint

Temporal Scalability: Coding Efficiency High-delay test set, CIF 30Hz, 34dB, compared to IPPP

Temporal Scalability: Coding Efficiency Low-delay test set, 365x288, 25-30Hz, 38dB, delay is constrained to be zero compared to IPPP

Temporal Scalability: Conclusion Typically no negative impact on coding efficiency  But also significant improvement, especially when higher delays are tolerable  Minor losses in coding efficiency are possible when low delay is required

Overview Background Temporal scalability Spatial scalability Quality scalability Conclusion

Spatial Scalability Motion-compensated prediction and intra-prediction in each spatial layer, as for single-layer coding

Spatial Scalability Motion-compensated prediction and intra-prediction in each spatial layer, as for single-layer coding  Inter-layer prediction

Spatial Scalability Motion-compensated prediction and intra-prediction in each spatial layer, as for single-layer coding  Inter-layer prediction  Same coding order for all layers

Spatial Scalability Motion-compensated prediction and intra-prediction in each spatial layer, as for single-layer coding  Inter-layer prediction  Same coding order for all layers Access units

Spatial Scalability: Inter-Layer Prediction Previous standards  Inter-layer prediction by upsampling the reconstructed samples of the lower layer signal  Prediction signal formed by:  Upsampled lower layer signal  Temporal prediction inside the enhancement layer  Averaging both

Spatial Scalability: Inter-Layer Prediction Previous standards  Inter-layer prediction by upsampling the reconstructed samples of the lower layer signal  Prediction signal formed by:  Upsampled lower layer signal  Temporal prediction inside the enhancement layer  Averaging both Lower layer samples not necessarily the most suitable data for inter-layer prediction

Spatial Scalability: Inter-Layer Prediction Previous standards  Inter-layer prediction by upsampling the reconstructed samples of the lower layer signal  Prediction signal formed by:  Upsampled lower layer signal  Temporal prediction inside the enhancement layer  Averaging both Lower layer samples not necessarily the most suitable data for inter-layer prediction  Prediction of macroblock modes and associated motion parameters  Prediction of the residual signal

Spatial Scalability: Inter-Layer Prediction A new macroblock type signalled by base mode flag  Only a residual signal is transmitted  No intra-prediction mode or motion parameter

Spatial Scalability: Inter-Layer Prediction A new macroblock type signalled by base mode flag  Only a residual signal is transmitted  No intra-prediction mode or motion parameter  If the corresponding block in the reference layer is:  Intra-coded  inter-layer intra prediction The reconstructed intra-signal of the reference layer is upsampled as a predictor  Inter-coded  inter-layer motion prediction Partitioning data are upsampled, reference indexes are copied, and motion vectors are scaled up

Spatial Scalability: Inter-Layer Prediction Inter-layer motion prediction (for a 16x16, 16x8, 8x16, or 8x8 macroblock partition)  Reference indexes are copied  Scaled motion vectors are used as motion vector predictors

Spatial Scalability: Inter-Layer Prediction Inter-layer motion prediction (for a 16x16, 16x8, 8x16, or 8x8 macroblock partition)  Reference indexes are copied  Scaled motion vectors are used as motion vector predictors Inter-layer residual prediction  Can be used for any inter-coded macroblock, regardless of its base mode flag or inter-layer motion prediction  The residual signal of the reference layer is upsampled as a predictor

Spatial Scalability: Inter-Layer Prediction For a 16x16 macroblock in an enhancement layer: 1 base mode flag 0 Inter-layer intra prediction (samples values are predicted) Inter-layer motion prediction (partitioning data, ref. indexes, and motion vectors are derived) Inter-layer motion prediction (ref. indexes are derived, motion vectors are predicted) No inter-layer motion prediction Inter-layer residual prediction No inter-layer residual prediction

Spatial Scalability: Generalizing Not only dyadic Enhancement layer may represent only a selected rectangular area of its reference layer picture Enhancement layer may contain additional parts beyond the borders of its reference layer picture Tools for spatial scalable coding of interlaced sources

Spatial Scalability: Complexity Constraints Inter-layer intra-prediction is restricted  Although coding efficiency is improved by generally allowing this prediction mode  Each layer can be decoded by a single motion compensation loop, unlike previous coding standards

Spatial Scalability: Coding Efficiency Comparison to single-layer coding and simulcast Base/enhancement layer at 352x288 / 704x576 Only the first frame is intra-coded Inter-layer prediction (ILP):  Intra (I), motion (M), residual (R)

Spatial Scalability: Coding Efficiency Comparison to single-layer coding and simulcast Base/enhancement layer at 352x288 / 704x576 Only the first frame is intra-coded Inter-layer prediction (ILP):  Intra (I), motion (M), residual (R)

Spatial Scalability: Coding Efficiency Comparison to single-layer coding and simulcast Base/enhancement layer at 352x288 / 704x576 Only the first frame is intra-coded Inter-layer prediction (ILP):  Intra (I), motion (M), residual (R)

Spatial Scalability: Coding Efficiency Comparison of fully featured SVC “single-loop ILP (I, M, R)” to scalable profiles of previous standards “multi-loop ILP (I)”

Spatial Scalability: Encoder Control JSVM software encoder control  Base layer coding parameters are optimized for that layer only  performance equal to single-layer H.264/AVC

Spatial Scalability: Encoder Control JSVM software encoder control  Base layer coding parameters are optimized for that layer only  performance equal to single-layer H.264/AVC  Not necessarily suitable for an efficient enhancement layer coding

Spatial Scalability: Encoder Control JSVM software encoder control  Base layer coding parameters are optimized for that layer only  performance equal to single-layer H.264/AVC  Not necessarily suitable for an efficient enhancement layer coding Improved multi-layer encoder control  Optimized for both layers

Spatial Scalability: Encoder Control QP enhancement layer = QP base layer + 4 Hierarchical B-pictures, GoP size = 16 Bit-rate increase relative to single-layer for the same quality is always less than or equal to 10% for both layers

Overview Background Temporal scalability Spatial scalability Quality scalability Conclusion

Quality Scalability Special case of spatial scalability with identical picture sizes  No upsampling for inter-layer predictions  Inter-layer intra- and residual-prediction are directly performed in transform domain Different qualities achieved by decreasing quantization step along the layers Coarse-Grained Scalability (CGS)  A few selected bitrates are supported in the scalable bitstream  Quality scalability becomes less efficient when bitrate difference between CGS layers gets smaller

Quality Scalability: MGS Medium-Grained Scalability (MGS) improves:  Flexibility of the stream  Packet-level quality scalability  Error robustness  Controlling drift propagation  Coding efficiency  Use of more information for temporal prediction

Quality Scalability: MGS MGS: flexibility of the stream  Enhancement layer transform coefficients can be distributed among several slices  Packet-level quality scalability

Quality Scalability: MGS MGS: error robustness vs. coding efficiency

Quality Scalability: MGS MGS: error robustness vs. coding efficiency  Pictures of the coarsest temporal layer are transmitted as key pictures  Only for them the base layer picture needs to be present in decoding buffer  Re-synchronization points for controlling drift propagation  All other pictures use the highest available quality picture of the reference frames for motion compensation  High coding efficiency

Quality Scalability: Encoding, Extracting Encoder does not known what quality will be available in the decoder  Better to use highest quality references  Should not be mistaken with open-loop coding Bitstream extraction  based on priority identifier of NAL units assigned by encoder

Quality Scalability: Coding Efficiency BL-/EL-only control: motion compensation loop is closed at the base/enhancement layer 2-loop control: one motion compensation loop in each layer adapt. BL/EL control: use of key pictures

Quality Scalability: Coding Efficiency MGS vs. CGS

Overview Background Temporal scalability Spatial scalability Quality scalability Conclusion

SVC outperforms previous scalable video coding standards  Hierarchical B-pictures  Inter-layer prediction  MGS  Key pictures

Thank You Any Questions?

References H. Schwarz, D. Marpe, and T. Wiegand, “Overview of the scalable video coding extension of the H.264/AVC standard,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no. 9, pp. 1103–1120, September T.Wiegand, G. Sullivan, J. Reichel, H. Schwarz, and M.Wien, "Joint Draft ITU-T Rec. H.264 | ISO/IEC / Amd.3 Scalable video coding," Joint Video Team, Doc. JVT-X201, July H. Kirchhoffer, H. Schwarz, and T. Wiegand, "CE1: Simplified FGS," Joint Video Team, Doc. JVT-W090, April 2007.