New Direction in Wyner-Ziv Video Coding: On the Importance of Modeling Virtual Correlation Channel (VCC)
Xin Li, LDCSEE, WVU
“If you can’t solve a problem, then there is an easier problem you can solve: find it.” - George Pólya

Formulation of a Simpler Problem
[Figure: frames x_{2t-2}, x_{2t-1}, x_{2t}, labeled as I frames and B frames for conventional video coding (≈ source coding) and as key frames and WZ frames for Wyner-Ziv video coding (≈ joint source-channel coding)]
Assuming I or key frames are coded by the same intra-frame encoder, can we achieve coding efficiency on WZ frames comparable to that of H.264 (the state-of-the-art technique for coding B frames)?

Outline of Our Attack
Motivating observations
– Characterizing the nonstationary virtual correlation channel (VCC) by a mixture model
Theoretical derivation
– Classification gain (dual to that in conventional source coding)
Classification-based DVC algorithm
– Approximate solution to the simplified problem
Experimental results
– R-D performance comparable to H.264 JM11.0 (for certain types of video sequences: slow motion)
Discussions and perspectives
– Dualities between conventional and distributed video coding
– DVC = video modeling + DSC (rate) + estimation (distortion)

Motivations
Learn from the conventional wisdom: what is the major factor contributing to the success of existing image/video coding standards such as JPEG2000 and H.264?
– It is the source classification principle and its subtle implications, rooted in earlier pioneering works such as EZW/SPIHT and multi-hypothesis MCP
Therefore, by following the duality, it is natural to consider classifying the virtual correlation channel in distributed source coding
– Unlike conventional video coding, motion estimation (ME) is done at the decoder instead of the encoder in WZ video coding (we have addressed this issue separately in a different context [1])
[1] X. Li, “Video processing via implicit and mixture motion model,” IEEE Trans. on Cir. Sys. for Video Tech., vol. 17, no. 8, Aug.

Modeling Non-stationary VCC
Why is the virtual correlation channel nonstationary?
– Misaligned edges, deformable motion, and illumination variations are all spatio-temporally varying phenomena
Mixture modeling of the virtual correlation channel
[Figure: WT of original WZ frames (e.g., significant vs. insignificant wavelet coefficients) plus additive errors (e.g., significant vs. insignificant temporal interpolation errors) gives WT of interpolated WZ frames (side information)]

Summary of Theoretical Results
Rate-distortion optimization problem formulation (a constrained minimization, "s.t.")
Conventional source coding vs. distributed source coding, compared on:
– R-D function
– Rate allocation
– Classification gain
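As a reference point for the three rows above, the standard high-rate results for independent zero-mean Gaussian subsources under MSE distortion can be written as follows. These are textbook forms, not necessarily the exact expressions on the slide; the class weights p_k, the class index k, and the additive model y = x + z with independent Gaussian x and z are assumptions made here for illustration.

```latex
% Conventional source coding: K classes with weights p_k and variances \sigma_k^2
D_k(R_k) = \sigma_k^2 \, 2^{-2R_k}, \qquad
\min_{\{R_k\}} \sum_{k=1}^{K} p_k D_k(R_k)
\quad \text{s.t.} \quad \sum_{k=1}^{K} p_k R_k = R

% High-rate optimum (all R_k > 0): rate allocation and classification gain
R_k = R + \tfrac{1}{2}\log_2 \frac{\sigma_k^2}{\prod_{j=1}^{K} \sigma_j^{2p_j}}, \qquad
G = \frac{\sum_{k=1}^{K} p_k \sigma_k^2}{\prod_{k=1}^{K} \sigma_k^{2p_k}} \ge 1

% Distributed source coding, y = x + z with x \sim N(0,\sigma_{x,k}^2), z \sim N(0,\sigma_{z,k}^2):
% the same expressions hold with the conditional variance in place of \sigma_k^2
\sigma_k^2 \;\longrightarrow\; \sigma_{x|y,k}^2
= \frac{\sigma_{x,k}^2 \, \sigma_{z,k}^2}{\sigma_{x,k}^2 + \sigma_{z,k}^2}
```

Under these assumptions the conditional variance is large only when both \sigma_{x,k}^2 and \sigma_{z,k}^2 are large, which is the dual of giving high-variance subsources priority in conventional coding.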

Implications for WZ Video Coding
In conventional source coding, classification gain implies that the subsource with larger variance should be given higher priority in rate allocation
In distributed source coding, a similar conclusion holds, except that the variance of the “subsource” is now determined by the virtual correlation channel

Conclusion: the class of significant coefficients that are poorly motion compensated has the largest R-D slope (these coefficients should be coded first: where are they, and what are they?)
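The conclusion above can be illustrated numerically. The following is a minimal sketch, assuming Gaussian classes, the additive model y = x + z, and reverse water-filling over the conditional variances; the function names, the four example classes, and the equal class weights are hypothetical and are not taken from the slides.

```python
import math

def conditional_variance(var_x, var_z):
    """Variance of x given y = x + z for independent zero-mean Gaussians."""
    return var_x * var_z / (var_x + var_z)

def allocate_rates(cond_vars, weights, avg_rate):
    """Reverse water-filling: R_k = max(0, 0.5*log2(v_k / theta)).

    theta is found by bisection on log2(theta) so that the weighted
    average of the per-class rates equals avg_rate (bits per sample).
    """
    log_max = math.log2(max(cond_vars))
    lo = log_max - 2.0 * avg_rate / min(weights)  # spends at least avg_rate
    hi = log_max                                  # spends nothing

    def spent(log_theta):
        return sum(p * max(0.0, 0.5 * (math.log2(v) - log_theta))
                   for p, v in zip(weights, cond_vars))

    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if spent(mid) > avg_rate:
            lo = mid   # too many bits: raise the water level
        else:
            hi = mid
    log_theta = 0.5 * (lo + hi)
    return [max(0.0, 0.5 * (math.log2(v) - log_theta)) for v in cond_vars]

# Hypothetical (var_x, var_z) pairs: significant/insignificant x and z.
classes = [(100.0, 100.0), (100.0, 1.0), (1.0, 100.0), (1.0, 1.0)]
cond_vars = [conditional_variance(vx, vz) for vx, vz in classes]
rates = allocate_rates(cond_vars, [0.25] * 4, avg_rate=0.5)
```

With these illustrative numbers, only the first class (both variances large) receives bits, about 2 bits per sample, while the other three classes receive none.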

Rate Control Dilemma
How can we estimate the second-order statistics of the VCC, σ_z^2 (the accuracy of the side information y_t generated by temporal interpolation)?
– At the encoder, we have access to x_t (original WZ frames) but not y_t (side information) [1]
– At the decoder, we have access to y_t (side information) but not x_t (original WZ frames) [2]
– We have adopted a decoder-based approach that relies on a feedback channel and a scale-invariance assumption about z_t (an approximate but tractable solution)
[1] Berkeley’s PRISM scheme allows simple temporal dependency estimation at the encoder.
[2] Stanford’s researchers suggested the use of a feedback channel for rate control.

Feedback via Scale Invariance of Interpolation Errors
[Figure: key frames x_{2t-2}, x_{2t}, x_{2t+2} and WZ frame x_{2t-1}; block-based significance maps of z_t, oracle vs. actual, for the hall and foreman sequences]
Fine resolution: oracle S.I.
Coarse resolution: interpolated key frame
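One way to picture the coarse-resolution step is sketched below: the decoder interpolates a key frame it already has from its two neighbouring key frames and thresholds the block energy of the resulting error, using that map as a stand-in for the significance map of z_t. This is only an illustration under strong assumptions: plain averaging replaces the motion-compensated interpolation used in the talk, and the function names, the threshold value, and the toy frames are hypothetical.

```python
def block_mean_square(frame, top, left, size):
    """Mean squared value of one size x size block of a 2-D list."""
    total = 0.0
    for r in range(top, top + size):
        for c in range(left, left + size):
            total += frame[r][c] ** 2
    return total / (size * size)

def significance_map_from_key_frames(prev_key, cur_key, next_key, size, th_z):
    """Flag blocks where interpolating cur_key from its neighbours fails.

    Assumption: averaging the two neighbouring key frames stands in for
    motion-compensated temporal interpolation.
    """
    rows, cols = len(cur_key), len(cur_key[0])
    error = [[cur_key[r][c] - 0.5 * (prev_key[r][c] + next_key[r][c])
              for c in range(cols)] for r in range(rows)]
    return [[block_mean_square(error, top, left, size) > th_z
             for left in range(0, cols - size + 1, size)]
            for top in range(0, rows - size + 1, size)]

# Toy 4x4 frames: only the top-left 2x2 block changes in the middle frame.
prev_key = [[0.0] * 4 for _ in range(4)]
next_key = [[0.0] * 4 for _ in range(4)]
cur_key = [[0.0] * 4 for _ in range(4)]
cur_key[0][0] = 10.0
z_map = significance_map_from_key_frames(prev_key, cur_key, next_key,
                                         size=2, th_z=4.0)
```

In this toy case only the top-left block is flagged, since its mean squared interpolation error (25) exceeds the threshold while the other blocks have zero error.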

Classification-based WZ Video Coding System
[Figure: block diagram. Encoder: WZ frames, wavelet transform (WT), block-based classification, SW lossless coding of the significance map, SW lossless coding of significant coefficients. Decoder: decoded I frames, advanced temporal interpolation and WT producing SI, joint exploitation of SI and CI, decoded WZ frames. A feedback channel connects decoder and encoder.]
In a nutshell, we only allocate bits to the class of poorly motion-compensated significant coefficients: both σ_x^2 and σ_z^2 are large
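The selection rule in the last sentence can be sketched as a per-block test. The sketch assumes the encoder has a per-block energy of the wavelet coefficients of x and receives a per-block significance flag for z over the feedback channel; the function name, the example energies, and the threshold are hypothetical.

```python
def select_blocks_to_code(x_energy, z_significant, th_x):
    """Code a block only if x is significant AND z is significant.

    x_energy:      2-D list of per-block coefficient energies (encoder side).
    z_significant: 2-D list of booleans fed back from the decoder.
    Returns the boolean coding map and the fraction of blocks coded.
    """
    coded = [[(e > th_x) and z for e, z in zip(e_row, z_row)]
             for e_row, z_row in zip(x_energy, z_significant)]
    total = sum(len(row) for row in coded)
    fraction = sum(sum(row) for row in coded) / total
    return coded, fraction

# Hypothetical 2x2 grid of blocks.
x_energy = [[50.0, 2.0], [60.0, 1.0]]
z_significant = [[True, True], [False, False]]
coded, fraction = select_blocks_to_code(x_energy, z_significant, th_x=10.0)
```

Here only one of the four blocks passes both tests, so a quarter of the blocks would receive bits; all other blocks are reconstructed from side information alone.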

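Joint Exploitation of Side and Coded Information at the Decoder
Model: y = x + z with z ~ N(0, σ_z^2); y is the side information (SI) and Q(x) is the coded information (CI)
Target of estimation: E[x|y,Q(x)]
Latent variable: z (we don’t know σ_z^2)
Iteration: initial guess, then alternate between updating the estimate of x and updating the estimate of σ_z^2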
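The alternation between estimating x and estimating σ_z^2 can be sketched as follows. This is a minimal illustration, not the author's algorithm: it assumes a flat prior on x, so that x given y and its quantization bin is a Gaussian centred at y and truncated to the bin, and it assumes a uniform quantizer whose bin k covers [k·Δ, (k+1)·Δ]. All function names and the sample values are hypothetical.

```python
import math

def _pdf(t):
    return math.exp(-0.5 * t * t) / math.sqrt(2.0 * math.pi)

def _cdf(t):
    return 0.5 * (1.0 + math.erf(t / math.sqrt(2.0)))

def truncated_moments(y, lo, hi, sigma):
    """Return (E[x], E[(x - y)^2]) for x ~ N(y, sigma^2) restricted to [lo, hi]."""
    a, b = (lo - y) / sigma, (hi - y) / sigma
    mass = _cdf(b) - _cdf(a)
    if mass < 1e-12:
        # y is far outside the bin: fall back to the nearest bin edge.
        x_hat = min(max(y, lo), hi)
        return x_hat, (x_hat - y) ** 2
    mean = y + sigma * (_pdf(a) - _pdf(b)) / mass
    mean = min(max(mean, lo), hi)  # guard against rounding error
    second = sigma * sigma * (1.0 + (a * _pdf(a) - b * _pdf(b)) / mass)
    return mean, max(second, 0.0)

def joint_estimate(side_info, bin_index, step=8.0, iters=20):
    """Alternate between estimating x and re-estimating sigma_z."""
    sigma = step  # initial guess for sigma_z (an arbitrary choice)
    x_hat = list(side_info)
    for _ in range(iters):
        moments = [truncated_moments(y, k * step, (k + 1) * step, sigma)
                   for y, k in zip(side_info, bin_index)]
        x_hat = [m[0] for m in moments]                     # update x
        mean_sq = sum(m[1] for m in moments) / len(moments)
        sigma = max(math.sqrt(mean_sq), 1e-6)               # update sigma_z
    return x_hat, sigma * sigma

# Three samples whose quantization index is 0, i.e. x lies in [0, 8].
x_hat, var_z = joint_estimate([4.0, 20.0, -3.0], [0, 0, 0])
```

When the side information falls at the centre of the bin the estimate stays at the side information; when it falls outside the bin, the estimate is pulled inside the bin toward the nearer edge, which is how the coded information corrects poor side information.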

Justification of Distortion Reduction
foreman-qcif, block size 16×16, 18.3% of blocks are coded
[Figure panels: SI alone; SI+CI]

Coding Experiments Setup
Parameter setting
– Block size: 16×16; WT: Daubechies’ 9-7; Slepian-Wolf lossless encoder: LDPC-based [1]; uniform quantizer (∆=8)
– Rate control: th_x and th_z, the significance thresholds for x and z respectively
– SI generation: Implicit MC vs. Explicit MC
Benchmark: H.264 JM11.0 implementation (QP of I frames is small and fixed)
[1] Liveris, A.D.; Zixiang Xiong; Georghiades, C.N., "Compression of binary sources with side information at the decoder using LDPC codes," IEEE Communications Letters, vol. 6, no. 10, Oct. 2002

Comparison of Temporal Interpolation
Foreman-qcif, ad-hoc fusion by simple averaging
[Figure panels: Implicit MC [1]; Explicit MC [2]]
[1] X. Li, “Video processing via implicit and mixture motion model,” IEEE Trans. on Cir. Sys. for Video Tech., vol. 17, no. 8, Aug.
[2] Tourapis, A.M.; Hye-Yeon Cheong; Liou, M.L.; Au, O.C., "Temporal interpolation of video sequences using zonal based algorithms," Proc. of ICIP, vol. 3, 2001

R-D Performance Comparison (I)
[Figure panels: Foreman-qcif, 30 frames; Hall-qcif, 30 frames]

R-D Performance Comparison (II)
[Figure panels: Container-qcif, 30 frames; Football-qcif, 30 frames]

Dualities between Conventional and WZ Video Coding
Exploitation of motion-related temporal dependency
– In traditional video coding, prediction is based on original frames (overhead is involved)
– In WZ video coding, interpolation is based on reconstructed key frames (no overhead)
– Importance of SI generation [1]
R-D optimization shifted from encoder to decoder
– In traditional video coding, the decoder is often fixed but the encoder enjoys considerable flexibility
– In WZ video coding, rate control through the feedback channel offers great flexibility to the decoder without touching the encoder [2]
– Importance of matching the SW lossless encoder with the statistics of the virtual correlation channel (UEP is desirable)
[1] L. Lu, D. He, A. Jagmohan, “Robust Multi-Frame Side Information Generation For Distributed Video Coding,” Proc. of ICIP
[2] Girod, B.; Aaron, A.M.; Rane, S.; Rebollo-Monedero, D., "Distributed Video Coding," Proceedings of the IEEE, vol. 93, no. 1, pp. 71-83, Jan. 2005

Acknowledgement
Ligang Lu and Dake He, for inviting me to participate in this special session
Zixiang Xiong, for sharing with me his students’ implementation of the LDPC-based Slepian-Wolf coding algorithm
E. Simoncelli, for stimulating discussions on distributed motion representations