VIDEO COMPRESSION USING NESTED QUADTREE STRUCTURES, LEAF MERGING, AND IMPROVED TECHNIQUES FOR MOTION REPRESENTATION AND ENTROPY CODING Present by fakewen.

Slides:

Advertisements

Similar presentations

Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.

Advertisements

H.264 Intra Frame Coder System Design Özgür Taşdizen Microelectronics Program at Sabanci University 4/8/2005.

Time Optimization of HEVC Encoder over X86 Processors using SIMD

MPEG4 Natural Video Coding Functionalities: –Coding of arbitrary shaped objects –Efficient compression of video and images over wide range of bit rates.

Overview of the H. 264/AVC video coding standard.

Technion - IIT Dept. of Electrical Engineering Signal and Image Processing lab Transrating and Transcoding of Coded Video Signals David Malah Ran Bar-Sella.

SWE 423: Multimedia Systems

H.264/AVC Baseline Profile Decoder Complexity Analysis Michael Horowitz, Anthony Joch, Faouzi Kossentini, and Antti Hallapuro IEEE TRANSACTIONS ON CIRCUITS.

1 Adaptive slice-level parallelism for H.264/AVC encoding using pre macroblock mode selection Bongsoo Jung, Byeungwoo Jeon Journal of Visual Communication.

Li Liu, Robert Cohen, Huifang Sun, Anthony Vetro, Xinhua Zhuang BMSB

Reji Mathew and David S. Taubman CSVT  Introduction  Quad-tree representation  Quad-tree motion modeling  Motion vector prediction strategies.

Ch. 6- H.264/AVC Part I (pp.160~199) Sheng-kai Lin

Overview of the H.264/AVC Video Coding Standard

Rate-Distortion Optimized Layered Coding with Unequal Error Protection for Robust Internet Video Michael Gallant, Member, IEEE, and Faouzi Kossentini,

Spatial and Temporal Data Mining

Analysis, Fast Algorithm, and VLSI Architecture Design for H

H.264 / MPEG-4 Part 10 Nimrod Peleg March 2003.

Losslessy Compression of Multimedia Data Hao Jiang Computer Science Department Sept. 25, 2007.

Decision Trees for Error Concealment in Video Decoding Song Cen and Pamela C. Cosman, Senior Member, IEEE IEEE TRANSACTION ON MULTIMEDIA, VOL. 5, NO. 1,

Scalable Wavelet Video Coding Using Aliasing- Reduced Hierarchical Motion Compensation Xuguang Yang, Member, IEEE, and Kannan Ramchandran, Member, IEEE.

CS :: Fall 2003 MPEG-1 Video (Part 1) Ketan Mayer-Patel.

Fundamentals of Multimedia Chapter 7 Lossless Compression Algorithms Ze-Nian Li and Mark S. Drew 건국대학교 인터넷미디어공학부 임 창 훈.

Fundamentals of Multimedia Chapter 10 Basic Video Compression Techniques Ze-Nian Li & Mark S. Drew 건국대학교 인터넷미디어공학부 임 창 훈.

Xinqiao LiuRate constrained conditional replenishment1 Rate-Constrained Conditional Replenishment with Adaptive Change Detection Xinqiao Liu December 8,

1 Image and Video Compression: An Overview Jayanta Mukhopadhyay Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur,

Image Compression - JPEG. Video Compression MPEG –Audio compression Lossy / perceptually lossless / lossless 3 layers Models based on speech generation.

Lossy Compression Based on spatial redundancy Measure of spatial redundancy: 2D covariance Cov X (i,j)=  2 e -  (i*i+j*j) Vertical correlation   

Kai-Chao Yang Hierarchical Prediction Structures in H.264/AVC.

 Coding efficiency/Compression ratio:  The loss of information or distortion measure:

MPEG-1 and MPEG-2 Digital Video Coding Standards Author: Thomas Sikora Presenter: Chaojun Liang.

: Chapter 12: Image Compression 1 Montri Karnjanadecha ac.th/~montri Image Processing.

Adaptive Multi-path Prediction for Error Resilient H.264 Coding Xiaosong Zhou, C.-C. Jay Kuo University of Southern California Multimedia Signal Processing.

Codec structuretMyn1 Codec structure In an MPEG system, the DCT and motion- compensated interframe prediction are combined. The coder subtracts the motion-compensated.

June, 1999 An Introduction to MPEG School of Computer Science, University of Central Florida, VLSI and M-5 Research Group Tao.

8. 1 MPEG MPEG is Moving Picture Experts Group On 1992 MPEG-1 was the standard, but was replaced only a year after by MPEG-2. Nowadays, MPEG-2 is gradually.

TM Paramvir Bahl Microsoft Corporation Adaptive Region-Based Multi-Scaled Motion- Compensated Video Coding for Error Prone Communication.

By: Hitesh Yadav Supervising Professor: Dr. K. R. Rao Department of Electrical Engineering The University of Texas at Arlington Optimization of the Deblocking.

High Efficiency Video Coding Kiana Calagari CMPT 880: Large-scale Multimedia Systems and Cloud Computing.

Lossless Compression CIS 465 Multimedia. Compression Compression: the process of coding that will effectively reduce the total number of bits needed to.

Compression video overview 演講者：林崇元. Outline Introduction Fundamentals of video compression Picture type Signal quality measure Video encoder and decoder.

Rate-distortion Optimized Mode Selection Based on Multi-channel Realizations Markus Gärtner Davide Bertozzi Classroom Presentation 13 th March 2001.

Figure 1.a AVS China encoder [3] Video Bit stream.

Guillaume Laroche, Joel Jung, Beatrice Pesquet-Popescu CSVT

Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp

Fast Mode Decision Algorithm for Residual Quadtree Coding in HEVC Visual Communications and Image Processing (VCIP), 2011 IEEE.

High-efficiency video coding: tools and complexity Oct

Vamsi Krishna Vegunta University of Texas, Arlington

IEEE Transactions on Consumer Electronics, Vol. 58, No. 2, May 2012 Kyungmin Lim, Seongwan Kim, Jaeho Lee, Daehyun Pak and Sangyoun Lee, Member, IEEE 報告者：劉冠宇.

1 Modular Refinement of H.264 Kermin Fleming. 2 What is H.264? Mobile Devices Low bit-rate Video Decoder –Follow on to MPEG-2 and H.26x Operates on pixel.

Video Compression Using Nested Quadtree Structures, Leaf Merging, and Improved Techniques for Motion Representation and Entropy Coding Present by fakewen.

JPEG Image Compression Standard Introduction Lossless and Lossy Coding Schemes JPEG Standard Details Summary.

Unified Loop Filter for High-performance Video Coding Yu Liu and Yan Huo ICME2010, July 19-23, Singapore.

Video Compression—From Concepts to the H.264/AVC Standard

Overview of the High Efficiency Video Coding (HEVC) Standard

Chapter 8 Lossy Compression Algorithms. Fundamentals of Multimedia, Chapter Introduction Lossless compression algorithms do not deliver compression.

3-D WAVELET BASED VIDEO CODER By Nazia Assad Vyshali S.Kumar Supervisor Dr. Rajeev Srivastava.

Chapter 7 Lossless Compression Algorithms 7.1 Introduction 7.2 Basics of Information Theory 7.3 Run-Length Coding 7.4 Variable-Length Coding (VLC) 7.5.

Motion Estimation Multimedia Systems and Standards S2 IF Telkom University.

Time Optimization of HEVC Encoder over X86 Processors using SIMD Kushal Shah Advisor: Dr. K. R. Rao Spring 2013 Multimedia.

Highly Parallel Mode Decision Method for HEVC Jun Zhang, Feng Dai, Yike Ma, and Yongdong Zhang Picture Coding Symposium (PCS),

Hierarchical Systolic Array Design for Full-Search Block Matching Motion Estimation Noam Gur Arie,August 2005.

Complexity varying intra prediction in H.264 Supervisors: Dr. Ofer Hadar, Mr. Evgeny Kaminsky Students: Amit David, Yoav Galon.

Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.

Present by 楊信弘 Advisor: 鄭芳炫

Data Transformation: Normalization

Adaptive Block Coding Order for Intra Prediction in HEVC

Huffman Coding, Arithmetic Coding, and JBIG2

Quad-Tree Motion Modeling with Leaf Merging

Supplement, Chapters 6 MC Course, 2009.

Progress & schedule Presenter : YY Date : 2014/10/3.

Presentation transcript:

VIDEO COMPRESSION USING NESTED QUADTREE STRUCTURES, LEAF MERGING, AND IMPROVED TECHNIQUES FOR MOTION REPRESENTATION AND ENTROPY CODING Present by fakewen

introduction

Video Compression Using Nested Quadtree Structures  A video coding architecture is described that is based on nested and pre-conﬁgurable quadtree structures for ﬂexible and signal-adaptive picture partitioning.  partitioning concept is to provide a high degree of adaptability for both temporal and spatial prediction

Leaf merging  leaf merging mechanism is included in order to prevent excessive partitioning of a picture into prediction blocks and to reduce the amount of bits for signaling the prediction signal.

Improved Techniques for Motion Representation  For fractional-sample motion-compensated prediction, a fixed-point implementation of the maximal-order-minimum-support algorithm is presented that uses a combination of infinite impulse response and FIR filtering.

Entropy Coding  Entropy coding utilizes the concept of probability interval partitioning entropy codes that offers new ways for parallelization and enhanced throughput.

outline  Overview of the Video Coding Scheme  Picture Partitioning for Prediction and Residual Coding  Motion-Compensated Prediction  Spatial Intra Prediction  Variable Block-Size Spatial Transforms and Quantization  Internal Bit Depth Increase  In-Loop Filtering  Entropy Coding  Encoder Control  Coding Conditions and Results

outline  Overview of the Video Coding Scheme  Picture Partitioning for Prediction and Residual Coding  Motion-Compensated Prediction  Spatial Intra Prediction  Variable Block-Size Spatial Transforms and Quantization  Internal Bit Depth Increase  In-Loop Filtering  Entropy Coding  Encoder Control  Coding Conditions and Results

Overview of the Video Coding Scheme  Wide-range variable block-size prediction  Nested wide-range variable block-size residual coding  Merging of prediction blocks  Fractional-sample MOMS interpolation  Adaptive in-loop ﬁlter  PIPE coding

Wide-range variable block-size prediction  the size of prediction blocks can be adaptively chosen by using a quadtree-based partitioning.  Maximum (Nmax ) and minimum (Nmin ) admissible block edge length can be speciﬁed as a side information. Nmax = 64 and Nmin = 4.

Nested wide-range variable block- size residual coding  the block size used for discrete cosine transform (DCT)-based residual coding is adapted to the characteristics of the residual signal by using a nested quadtree-based partitioning of the corresponding prediction block.

Merging of prediction blocks  in order to reduce the side information required for signaling the prediction parameters, neighboring blocks can be merged into one region that is assigned only a single set of prediction parameters.

Fractional-sample MOMS interpolation  Interpolation of fractional-sample positions for motion-compensated prediction is based on a fixed-point implementation of the maximal-order- minimum-support (MOMS) algorithm using an infinite impulse response (IIR)/FIR filter

Adaptive in-loop filter  in addition to the deblocking filter, a separable 2-D Wiener filter is applied within the coding loop. The filter is adaptively applied to selected regions indicated by the use of quadtree- based partitioning

PIPE coding  the novel probability interval partitioning entropy (PIPE) coding scheme provides the coding efﬁciency and probability modeling capability of arithmetic coding at the complexity level of Huffman coding.

outline  Overview of the Video Coding Scheme  Picture Partitioning for Prediction and Residual Coding  Motion-Compensated Prediction  Spatial Intra Prediction  Variable Block-Size Spatial Transforms and Quantization  Internal Bit Depth Increase  In-Loop Filtering  Entropy Coding  Encoder Control  Coding Conditions and Results

Picture Partitioning for Prediction and Residual Coding  The concept of a macroblock as the basic processing unit in standardized video coding is generalized to what we call a coding tree block (CTB).

 Dividing each picture into CTBs and further recursively subdividing each CTB into square blocks of variable size allows to partition a given picture of a video signal in such a way that both the block sizes and the block coding parameters such as prediction or residual coding modes will be adapted to the speciﬁc characteristics of the signal at hand.

 Fig. 1 illustrates  an example, where transform blocks and their corresponding  RQTs are shown in dashed lines. Note that the transform  block size that corresponds to the root node of a given RQT  is identical to the size of the related prediction block, or  equivalently, the leaf of the prediction quadtree, to which the  RQT is associated.

outline  Overview of the Video Coding Scheme  Picture Partitioning for Prediction and Residual Coding  Motion-Compensated Prediction  Spatial Intra Prediction  Variable Block-Size Spatial Transforms and Quantization  Internal Bit Depth Increase  In-Loop Filtering  Entropy Coding  Encoder Control  Coding Conditions and Results

Motion-Compensated Prediction  Fractional-Sample Interpolation Using MOMS  Generalized interpolation using MOMS  Choice of MOMS basis functions  Implementation aspects of cubic and quintic O- MOMS  Interleaved Motion-Vector Prediction  Merging of Motion-Compensated Predicted Blocks

 each MCP block is associated with one or two sets of motion parameters, where each set of motion parameters consists of a picture reference index and a translational motion vector.

Generalized interpolation using MOMS  Generating the prediction signal for motion vectors not pointing to an integer-sample position requires the use of a fractional-sample interpolation method  8-tap or 12-tap ﬁlters

Choice of MOMS basis functions  For our application case of fractional-sample interpolation, we have considered two members of the family of so-called O-MOMS (optimal MOMS) with interpolation kernels of degree L = 3 (cubic) and L = 5 (quintic).

Implementation Aspects of Cubic and Quintic O-MOMS  The prefiltering step can be efficiently realized by separably applying a discrete 1-D IIR filter along rows and columns of the reconstructed picture. In the case of cubic or quintic O- MOMS, this IIR filter can be factorized into one or two sets of first-order causal and anti-causal recursive filters, respectively.

Interleaved Motion-Vector Prediction  In order to reduce the bit rate required for transmitting the motion vectors  ﬁrst step, the vertical motion vector component is predicted using conventional median prediction  Then, only those motion vectors of neighboring blocks for which the absolute difference between their vertical component and the vertical component for the current motion vector is minimized are used for the prediction of the horizontal motion- vector component

Merging of Motion-Compensated Predicted Blocks  However, in general, quadtree-based block partitioning may result in an over-segmentation due to the fact that, without any further provision, at each interior node of a quadtree, four subblocks are generated while merging of blocks is possible only by pruning complete branches consisting of at least four child nodes in the parent- child relationship within a quadtree.

outline  Overview of the Video Coding Scheme  Picture Partitioning for Prediction and Residual Coding  Motion-Compensated Prediction  Spatial Intra Prediction  Variable Block-Size Spatial Transforms and Quantization  Internal Bit Depth Increase  In-Loop Filtering  Entropy Coding  Encoder Control  Coding Conditions and Results

Spatial Intra Prediction  For all prediction block sizes, eight directional intra-prediction modes and one additional averaging (DC) mode are available.

outline  Overview of the Video Coding Scheme  Picture Partitioning for Prediction and Residual Coding  Motion-Compensated Prediction  Spatial Intra Prediction  Variable Block-Size Spatial Transforms and Quantization  Internal Bit Depth Increase  In-Loop Filtering  Entropy Coding  Encoder Control  Coding Conditions and Results

Variable Block-Size Spatial Transforms and Quantization  each prediction block can be further subdivided for the purpose of transform coding with the subdivision being determined by the corresponding RQT. Transform block sizes in the range of 4 × 4 to 64 × 64  The transform kernel for each supported transform block size is given by a separable integer approximation of the 2-D type-II discrete cosine transform of the corresponding block size

outline  Overview of the Video Coding Scheme  Picture Partitioning for Prediction and Residual Coding  Motion-Compensated Prediction  Spatial Intra Prediction  Variable Block-Size Spatial Transforms and Quantization  Internal Bit Depth Increase  In-Loop Filtering  Entropy Coding  Encoder Control  Coding Conditions and Results

Internal Bit Depth Increase  The internal bit depth d i for generating the prediction signal  bit depth do of luma and chroma samples of the original input video signal.  d s = d i − d o  increased internal bit depth d I Original input:do di bit for prediction do:in-loop filter

outline  Overview of the Video Coding Scheme  Picture Partitioning for Prediction and Residual Coding  Motion-Compensated Prediction  Spatial Intra Prediction  Variable Block-Size Spatial Transforms and Quantization  Internal Bit Depth Increase  In-Loop Filtering  Entropy Coding  Encoder Control  Coding Conditions and Results

In-Loop Filtering  Our proposed video coding scheme utilizes two types of cascaded in-loop filters:  Deblocking filter  The filtering operations are applied to samples at block boundaries of the reconstructed signal  Quadtree-Based Separable 2-D Wiener Filter  The quadtree-based Wiener filter as part of our proposed video coding approach, is designed as a separable filter with the advantage of providing a better tradeoff in computational cost versus rate-distortion (R-D) performance compared to nonseparable Wiener filters

outline  Overview of the Video Coding Scheme  Picture Partitioning for Prediction and Residual Coding  Motion-Compensated Prediction  Spatial Intra Prediction  Variable Block-Size Spatial Transforms and Quantization  Internal Bit Depth Increase  In-Loop Filtering  Entropy Coding  Encoder Control  Coding Conditions and Results

Entropy Coding  Probability Interval Partitioning Entropy Coding  PIPE Coding Using Arithmetic Codes  PIPE Coding Using V2V Codes

Entropy Coding(cont.)  Binary arithmetic decoding can be parallelized

outline  Overview of the Video Coding Scheme  Picture Partitioning for Prediction and Residual Coding  Motion-Compensated Prediction  Spatial Intra Prediction  Variable Block-Size Spatial Transforms and Quantization  Internal Bit Depth Increase  In-Loop Filtering  Entropy Coding  Encoder Control  Coding Conditions and Results

Encoder Control  the number of possible partitionings for prediction alone  exceeds 2 4 D − 1  and thus, for a more realistic conﬁguration with  an CTB size of 64 × 64 and D = 4, more than 2 64 partitionings  need to be considered for selecting the optimal one. At least  for this example, a brute force exhaustive search is clearly not  feasible.

 Application of a Fast Optimal Tree Pruning Algorithm  G-BFOS algorithm  Mode Decision Process  Residual Quadtree Pruning Process

outline  Overview of the Video Coding Scheme  Picture Partitioning for Prediction and Residual Coding  Motion-Compensated Prediction  Spatial Intra Prediction  Variable Block-Size Spatial Transforms and Quantization  Internal Bit Depth Increase  In-Loop Filtering  Entropy Coding  Encoder Control  Coding Conditions and Results

 we used for the generation  of our submitted CS 1 bitstreams a hierarchical B picture  coding structure [37] with 4 layers and a corresponding intra  frame period. For CS 2, a structural delay is not allowed  and random access capabilities are not required

 ﬁxed CTB size of  64 × 64 (for luma) and a maximum prediction quadtree depth  of D = 4.

Coding Conditions and Results