Rate-distortion modeling of scalable video coders 指導教授:許子衡 教授 學生:王志嘉.

Slides:



Advertisements
Similar presentations
Capacity of MIMO Channels: Asymptotic Evaluation Under Correlated Fading Presented by: Zhou Yuan University of Houston 10/22/2009.
Advertisements

LOGO Video Packet Selection and Scheduling for Multipath Streaming IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 9, NO. 3, APRIL 2007 Dan Jurca, Student Member,
CELLULAR COMMUNICATIONS 5. Speech Coding. Low Bit-rate Voice Coding  Voice is an analogue signal  Needed to be transformed in a digital form (bits)
A Matlab Playground for JPEG Andy Pekarske Nikolay Kolev.
Efficient Bit Allocation and CTU level Rate Control for HEVC Picture Coding Symposium, 2013, IEEE Junjun Si, Siwei Ma, Wen Gao Insitute of Digital Media,
1 School of Computing Science Simon Fraser University, Canada Rate-Distortion Optimized Streaming of Fine-Grained Scalable Video Sequences Mohamed Hefeeda.
Novel Point-Oriented Inner Searches for Fast Block Motion Lai-Man Po, Chi-Wang Ting, Ka-Man Wong, and Ka-Ho Ng IEEE TRANSACTIONS ON MULTIMEDIA, VOL.9,
1 Robust Video Stabilization Based on Particle Filter Tracking of Projected Camera Motion (IEEE 2009) Junlan Yang University of Illinois,Chicago.
Recursive End-to-end Distortion Estimation with Model-based Cross-correlation Approximation Hua Yang, Kenneth Rose Signal Compression Lab University of.
1 School of Computing Science Simon Fraser University, Canada Rate-Distortion Optimized Streaming of Fine-Grained Scalable Video Sequences Mohamed Hefeeda.
A Layered Hybrid ARQ Scheme for Scalable Video Multicast over Wireless Networks Zhengye Liu, Joint work with Zhenyu Wu.
End-to-End TCP-Friendly Streaming Protocol and Bit Allocation for Scalable Video Over Wireless Internet Fan Yang, Qian Zhang, Wenwu Zhu, and Ya-Qin Zhang.
Real-time smoothing for network adaptive video streaming Kui Gao, Wen Gao, Simin He, Yuan Zhang J. Vis. Commun. Image R. 16 (2005)
Scalable Wavelet Video Coding Using Aliasing- Reduced Hierarchical Motion Compensation Xuguang Yang, Member, IEEE, and Kannan Ramchandran, Member, IEEE.
Frederic Payan, Marc Antonini
Scene-dependent Frequency Weighting for Subjective Quality Improvement of MPEG-4 Fine- Granularity-Scalability Sharon Peng and Mihaela van der Schaar Philips.
Efficient Fine Granularity Scalability Using Adaptive Leaky Factor Yunlong Gao and Lap-Pui Chau, Senior Member, IEEE IEEE TRANSACTIONS ON BROADCASTING,
1 A Unified Rate-Distortion Analysis Framework for Transform Coding Student : Ho-Chang Wu Student : Ho-Chang Wu Advisor : Prof. David W. Lin Advisor :
Wireless FGS video transmission using adaptive mode selection and unequal error protection Jianhua Wu and Jianfei Cai Nanyang Technological University.
Lattices for Distributed Source Coding - Reconstruction of a Linear function of Jointly Gaussian Sources -D. Krithivasan and S. Sandeep Pradhan - University.
Seamless Switching of Scalable Video Bitstreams for Efficient Streaming Xiaoyan Sun, Feng Wu, Shipeng Li, Wen, Gao, and Ya-Qin Zhang.
Source-Channel Prediction in Error Resilient Video Coding Hua Yang and Kenneth Rose Signal Compression Laboratory ECE Department University of California,
A New Rate-Complexity-QP Algorithm for HEVC Intra-Picture Rate Control LING TIAN, YIMIN ZHOU, AND XIAOJUN CAO 2014 INTERNATIONAL CONFERENCE ON COMPUTING,
09/24/02ICIP20021 Drift Management and Adaptive Bit Rate Allocation in Scalable Video Coding H. Yang, R. Zhang and K. Rose Signal Compression Lab ECE Department.
Compression Efficiency and Delay Tradeoffs for Hierarchical B-Pictures and Pulsed-Quality Frames Athanasios Leontaris, Pamela C. Cosman Univ. of California.
 Coding efficiency/Compression ratio:  The loss of information or distortion measure:
Frame by Frame Bit Allocation for Motion-Compensated Video Michael Ringenburg May 9, 2003.
1 Efficient Reference Frame Selector for H.264 Tien-Ying Kuo, Hsin-Ju Lu IEEE CSVT 2008.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
IMAGE COMPRESSION USING BTC Presented By: Akash Agrawal Guided By: Prof.R.Welekar.
REVISED CONTEXTUAL LRT FOR VOICE ACTIVITY DETECTION Javier Ram’ırez, Jos’e C. Segura and J.M. G’orriz Dept. of Signal Theory Networking and Communications.
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada Optimal Partitioning of Fine-Grained Scalable Video Streams Mohamed Hefeeda.
Video Compression: Performance evaluation of available codec software Sridhar Godavarthy.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.
Adaptive Multi-path Prediction for Error Resilient H.264 Coding Xiaosong Zhou, C.-C. Jay Kuo University of Southern California Multimedia Signal Processing.
8. 1 MPEG MPEG is Moving Picture Experts Group On 1992 MPEG-1 was the standard, but was replaced only a year after by MPEG-2. Nowadays, MPEG-2 is gradually.
Data Hiding in Image and Video Part I: Fundamental Issues and Solutions ECE 738 Class Presentation By Tanaphol Thaipanich
Low-Power H.264 Video Compression Architecture for Mobile Communication Student: Tai-Jung Huang Advisor: Jar-Ferr Yang Teacher: Jenn-Jier Lien.
Adaptive Rate Control for HEVC Visual Communications and Image Processing (VCIP), 2012 IEEE Junjun Si, Siwei Ma, Xinfeng Zhang, Wen Gao 1.
Compression video overview 演講者:林崇元. Outline Introduction Fundamentals of video compression Picture type Signal quality measure Video encoder and decoder.
Rate-GOP Based Rate Control for HEVC SHANSHE WANG, SIWEI MA, SHIQI WANG, DEBIN ZHAO, AND WEN GAO IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING,
Rate-distortion Optimized Mode Selection Based on Multi-channel Realizations Markus Gärtner Davide Bertozzi Classroom Presentation 13 th March 2001.
Compression of Real-Time Cardiac MRI Video Sequences EE 368B Final Project December 8, 2000 Neal K. Bangerter and Julie C. Sabataitis.
Chapter 17 Image Compression 17.1 Introduction Redundant and irrelevant information  “Your wife, Helen, will meet you at Logan Airport in Boston.
Guillaume Laroche, Joel Jung, Beatrice Pesquet-Popescu CSVT
Content Adaptation for Virtual Office Environment Using Scalable Video Coding 作者: C. T. E. R. Hewage, H. Kodikara Arachchi, T. Masterton, A. C. Yu, H.
Fast motion estimation and mode decision for H.264 video coding in packet loss environment Li Liu, Xinhua Zhuang Computer Science Department, University.
100 kbps. 100 kbps Current and Future Directions Class Histories –Are the significance probabilities of the coefficients.
Fine Granularity Scalability in MPEG-4 Video by Weiping Li Presentation by Warren Cheung.
1 Source Coding and Compression Dr.-Ing. Khaled Shawky Hassan Room: C3-222, ext: 1204, Lecture 10 Rate-Distortion.
Rate-Control in Video Codec Heejune AHN Embedded Communications Laboratory Seoul National Univ. of Technology Fall 2008 Last updated
MPEG4 Fine Grained Scalable Multi-Resolution Layered Video Encoding Authors from: University of Georgia Speaker: Chang-Kuan Lin.
Architecture and algorithms for an IEEE based multi-channel wireless mesh network 指導教授:許子衡 老師 學生:王志嘉.
Unified Loop Filter for High-performance Video Coding Yu Liu and Yan Huo ICME2010, July 19-23, Singapore.
Video Compression—From Concepts to the H.264/AVC Standard
Global MINMAX Interframe Bit Allocation for Embedded Video Coding Michael Ringenburg Qualifying Project Presentation Advisors: Richard Ladner (CSE) and.
Image Processing Architecture, © Oleh TretiakPage 1Lecture 4 ECE-C490 Winter 2004 Image Processing Architecture Lecture 4, 1/20/2004 Principles.
Chapter 8 Lossy Compression Algorithms. Fundamentals of Multimedia, Chapter Introduction Lossless compression algorithms do not deliver compression.
COMPARATIVE STUDY OF HEVC and H.264 INTRA FRAME CODING AND JPEG2000 BY Under the Guidance of Harshdeep Brahmasury Jain Dr. K. R. RAO ID MS Electrical.
A Frame-Level Rate Control Scheme Based on Texture and Nontexture Rate Models for HEVC IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY,
Implementation of Wavelet-Based Robust Differential Control for Electric Vehicle Application IEEE TRANSACTIONS ON POWER ELECTRONICS, VOL. 30, NO. 12, DECEMBER.
CORRELATION-REGULATION ANALYSIS Томский политехнический университет.
Chapter 8 Lossy Compression Algorithms
Quality Evaluation and Comparison of SVC Encoders
Adaptive Block Coding Order for Intra Prediction in HEVC
Distributed Compression For Still Images
Chapter 2 Minimum Variance Unbiased estimation
Presenter by : Mourad RAHALI
Scalable Speech Coding for IP Networks: Beyond iLBC
Foundation of Video Coding Part II: Scalar and Vector Quantization
Presentation transcript:

Rate-distortion modeling of scalable video coders 指導教授:許子衡 教授 學生:王志嘉

2 Introduction (i) R-D models can be classified into two categories based on the theory they apply: models based on Shannon's rate- distortion theory and those derived from high-rate quantization theory These two theories are complementary, converge to the same lower hound D~ e -αR when the input block size goes to infinity.

3 Introduction (ii) Block length cannot be infinite in real coding systems, it is widely recognized that classical rate-distortion theory is often not suitable for accurate modeling of actual R-D curves. Adjustable parameters are often incorporated into the theoretical R-D models to keep up with the complexity of coding systems and the diversity of video sources

4 Introduction (iii) Recall that most current R-D models are built for images or non-scalable video coders. In this paper, we complete the work and examine R-D models from a different perspective. We first derive a distortion model based on approximation theory and then incorporate the ρ-domain bitrate model into the final result.

5 Introduction (iv) We also show that the unifying ρ -domain model is very accurate in both Fine Granular Scalability (FGS) and Progressive FGS (PFGS) coders. Our work demonstrates that distortion D can be modeled by a function of function of both bitrat R its logarithm log R: variance of the source constants

6 Motivation (i) A typical scalable coder includes one base layer and one or more enhancement layers. We examine the accuracy of current R-D models for scalable coders. with R representing the bitrate of the enhancement layer. Without loss of generality, we use peak signal-to-noise ratio (PSNR) to measure the quality of video sequences.

7 Motivation (ii) With the PSNR measure, it is well-known that the classical model becomes a linear function of coding rate R: Fig. I shows that PSNR is linear with respect to R only when the bitrate is sufficiently high and also that model (6) has much higher convexity than the actual R-D curve. constants

8 Fig.1

9 Motivation (iii) This bound is specifically developed for wavelet-based coding schemes. Mallat extend it to transform-based low bitrate images: constant parameter

10 R-D Model For Scalable Coders— Preliminaries (i) Uniform quantizers are widely applied to video coders due to their asymptotic optimality. We show the lower bound on distortion in quantization theory assuming seminorm-based distortion measures and uniform quantizers. If X, are k-dimensional vectors and the distortion between X and is d(X, ) = || X- || τ, the minimum distortion for uniform quantizers is

11 R-D Model For Scalable Coders— Preliminaries (ii) 2 Gamma function △ is the quantization step.

12 R-D Model For Scalable Coders— Preliminaries (iii) When r= 2, k = 1. we obtain the popular MSE formula for uniform quantizers: β is 12 if the quantization step is much smaller than the signal variance

13 Distortion Analysis (i) In the transform domain, distortion D consists of two parts: 1) distortion D i from discarding the insignificant coefficients in (- △, △ ) 2) distortion D s from quantizing the significant coefficients Given this notation, we have the following lemma. Lemma 1: Assuming that the total number of transform coefficients U is N and the number of significant coefficients is M,MSE distortion D is:

14 Distortion Analysis (ii) In Fig. 2, the left side shows an example of actual distortion D and simulation results of model (10) for frame 3 in FGS-coded CIF Foreman, and the right side shows the average absolute error between model (10) and the actual distortion in FGS-coded CIF Foreman and Carphone sequences

15 Fig.2

16 R-D Modeling (i) To improve the unsatisfactory accuracy of current R-D models in scalable coders. we derive an accurate R-D model based on source statistical properties and a recent ρ domain model. Bitrate R is a linear function of the percentage of significant coefficients z in each video frame. We extensively examined the relationship between R and z in various video frames and found this linear model holds very well for scalable coders.

17 R-D Modeling (ii) Fig. 3 demonstrates two typical examples of the actual bitrate Rand its linear estiniation in FGS and PFGS video frames. Using the ρ-domain model, we have our main result as following. Theorem 1 : The distortion of scalable video coders is given by:

18 Experimental Results (i) We apply the proposed model (14) to various scalable video frames to evaluate its accuracy. Fig. 4 shows two examples of R-D curves for I (left) and P (right) frames of FGS-coded CIF Foreman. All results shown in this paper utilize videos in the CIF format with the base layer coded at 128 kb/s and 10 frames/s. We contrast the performance of the proposed model with that of the other two models in FGS-coded Foreman and Carphone in Fig. 5.

19 Fig.4

20 Fig.5

21 Experimental Results (ii) Additionally, Fig. 6 shows the same comparison in PFGS-coded Coastguard and Mobile.

22 Conclusion This paper analyzed the distortion of scalable coders and proposed a novel R-D model from the perspective of approximation theory. Given the lack of R-D modeling of scalable coders, we believe this work will benefit both Internet streaming applications and theoretical discussion in this area.