Aditya Mavlankar, Pierpaolo Baccichet, David Varodayan and Bernd Girod

Slides:



Advertisements
Similar presentations
KIANOOSH MOKHTARIAN SCHOOL OF COMPUTING SCIENCE SIMON FRASER UNIVERSITY 6/24/2007 Overview of the Scalable Video Coding Extension of the H.264/AVC Standard.
Advertisements

MPEG4 Natural Video Coding Functionalities: –Coding of arbitrary shaped objects –Efficient compression of video and images over wide range of bit rates.
Basics of MPEG Picture sizes: up to 4095 x 4095 Most algorithms are for the CCIR 601 format for video frames Y-Cb-Cr color space NTSC: 525 lines per frame.
Scalable ROI Algorithm for H.264/SVC-Based Video Streaming Jung-Hwan Lee and Chuck Yoo, Member, IEEE.
-1/20- MPEG 4, H.264 Compression Standards Presented by Dukhyun Chang
A presentation by Modupe Omueti For CMPT 820:Multimedia Systems
Chapter 7 End-to-End Data
H.264/AVC Baseline Profile Decoder Complexity Analysis Michael Horowitz, Anthony Joch, Faouzi Kossentini, and Antti Hallapuro IEEE TRANSACTIONS ON CIRCUITS.
MATCHSLIDE : INT contribution
1 Adaptive slice-level parallelism for H.264/AVC encoding using pre macroblock mode selection Bongsoo Jung, Byeungwoo Jeon Journal of Visual Communication.
Department of Computer Science, University of Maryland, College Park, USA TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.:
Fast Mode Decision for Multiview Video Coding Liquan Shen, Tao Yan, Zhi Liu, Zhaoyang Zhang, Ping An, Lei Yang ICIP
Spring 2003CS 4611 Multimedia Outline Compression RTP Scheduling.
Overview of Error Resiliency Schemes in H.264/AVC Standard Sunil Kumar, Liyang Xu, Mrinal K. Mandal, and Sethuraman Panchanathan Elsevier Journal of Visual.
Department of Electrical Engineering Stanford University Yi Liang, Eric Setton and Bernd Girod Channel-Adaptive Video Streaming Using Packet Path Diversity.
Reinventing Compression: The New Paradigm of Distributed Video Coding Bernd Girod Information Systems Laboratory Stanford University.
High-Quality Video View Interpolation
Error Concealment For Fine Granularity Scalable Video Transmission Hua Cai; Guobin Shen; Feng Wu; Shipeng Li; Bing Zeng; Multimedia and Expo, Proceedings.
Bernd Girod: Image Compression and Graphics 1 Image Compression and Graphics: More Than a Sum of Parts? Bernd Girod Collaborators: Peter Eisert, Marcus.
Image (and Video) Coding and Processing Lecture: Motion Compensation Wade Trappe Most of these slides are borrowed from Min Wu and KJR Liu of UMD.
Lattices for Distributed Source Coding - Reconstruction of a Linear function of Jointly Gaussian Sources -D. Krithivasan and S. Sandeep Pradhan - University.
Source-Channel Prediction in Error Resilient Video Coding Hua Yang and Kenneth Rose Signal Compression Laboratory ECE Department University of California,
` 1 Department of Electrical Engineering, Stanford University Anne Aaron, Prashant Ramanathan and Bernd Girod Wyner-Ziv Coding of Light Fields for Random.
Xinqiao LiuRate constrained conditional replenishment1 Rate-Constrained Conditional Replenishment with Adaptive Change Detection Xinqiao Liu December 8,
Error Resilience of Video Transmission By Rate-Distortion Optimization and Adaptive Packetization Yuxin Liu, Paul Salama and Edwad Delp ICME 2002.
An Introduction to H.264/AVC and 3D Video Coding.
1 CCTV SYSTEMS RESOLUTIONS USED IN CCTV. 2 CCTV SYSTEMS CCTV resolution is measured in vertical and horizontal pixel dimensions and typically limited.
Pre-fetching based on video analysis for interactive region-of- interest streaming of soccer sequences Authors: Aditya Mavlankar and Bernd Girod Information.
JPEG 2000 Image Type Image width and height: 1 to 2 32 – 1 Component depth: 1 to 32 bits Number of components: 1 to 255 Each component can have a different.
Kai-Chao Yang Hierarchical Prediction Structures in H.264/AVC.
-1/20- Scalable Video Coding Scalable Extension of H.264 / AVC.
Philipp Merkle, Aljoscha Smolic Karsten Müller, Thomas Wiegand CSVT 2007.
Introduction Compression Performance Conclusions Large Camera Arrays Capture multi-viewpoint images of a scene/object. Potential applications abound: surveillance,
High-Resolution Interactive Panoramas with MPEG-4 발표자 : 김영백 임베디드시스템연구실.
Sparse Matrix Factorizations for Hyperspectral Unmixing John Wright Visual Computing Group Microsoft Research Asia Sept. 30, 2010 TexPoint fonts used in.
High-Resolution Video Streaming with Interactive Region-of-Interest Aditya Mavlankar and Piyush Agrawal {maditya, Information Systems.
Adaptive Multi-path Prediction for Error Resilient H.264 Coding Xiaosong Zhou, C.-C. Jay Kuo University of Southern California Multimedia Signal Processing.
June, 1999 An Introduction to MPEG School of Computer Science, University of Central Florida, VLSI and M-5 Research Group Tao.
Image Compression Supervised By: Mr.Nael Alian Student: Anwaar Ahmed Abu-AlQomboz ID: IT College “Multimedia”
Compression video overview 演講者:林崇元. Outline Introduction Fundamentals of video compression Picture type Signal quality measure Video encoder and decoder.
Spring 2000CS 4611 Multimedia Outline Compression RTP Scheduling.
Digital Image Processing In The Name Of God Digital Image Processing Lecture2: Digital Image Fundamental M. Ghelich Oghli By: M. Ghelich Oghli
Time-Shifted Streaming in a P2P Video Multicast System Jeonghun Noh, Aditya Mavlankar, Pierpaolo Baccichet 1, and Bernd Girod Information Systems Laboratory.
MPEG4 Fine Grained Scalable Multi-Resolution Layered Video Encoding Authors from: University of Georgia Speaker: Chang-Kuan Lin.
Page 11/28/2016 CSE 40373/60373: Multimedia Systems Quantization  F(u, v) represents a DCT coefficient, Q(u, v) is a “quantization matrix” entry, and.
Video Compression and Standards
Fundamentals of Multimedia Chapter 17 Wireless Networks 건국대학교 인터넷미디어공학부 임 창 훈.
Fine-granular Motion Matching for Inter-view Motion Skip Mode in Multi-view Video Coding Haitao Yanh, Yilin Chang, Junyan Huo CSVT.
Date of download: 7/10/2016 Copyright © 2016 SPIE. All rights reserved. A graphical overview of the proposed compressed gated range sensing (CGRS) architecture.
ITEC2110, Digital Media Chapter 2 Fundamentals of Digital Imaging 1 GGC -- ITEC Digital Media.
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
Yimin Zhou, Hongyu Wang, Ling Tian and Ce Zhu
Multimedia Outline Compression RTP Scheduling Spring 2000 CS 461.
WP3: Visualization services
H.264/SVC Video Transmission Over P2P Networks
JPEG Image Coding Standard
Overview of the Scalable Video Coding
Aditya Mavlankar and David Varodayan
Streaming To Mobile Users In A Peer-to-Peer Network
Coding Approaches for End-to-End 3D TV Systems
Digital Multimedia Coding
Research Topic Error Concealment Techniques in H.264/AVC for Wireless Video Transmission Vineeth Shetty Kolkeri EE Graduate,UTA.
Fast Decision of Block size, Prediction Mode and Intra Block for H
Viewport-based 360 Video Streaming:
Viewport-based 360 Video Streaming:
MPEG4 Natural Video Coding
© 2016 Pearson Education, Inc., Hoboken, NJ. All rights reserved.
Foundation of Video Coding Part II: Scalar and Vector Quantization
VIDEO COMPRESSION FUNDAMENTALS
Bongsoo Jung, Byeungwoo Jeon
Presentation transcript:

Aditya Mavlankar, Pierpaolo Baccichet, David Varodayan and Bernd Girod Optimal Slice Size for Streaming Regions of High-Resolution Video with Virtual Pan/Tilt/Zoom Functionality Aditya Mavlankar, Pierpaolo Baccichet, David Varodayan and Bernd Girod Information Systems Laboratory Stanford University TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAAAA

Outline High-resolution video streaming with IROI Proposed coding scheme for IROI video streaming Analysis of optimal slice size selection Experimental results

High-Resolution Video Streaming with IROI Related work Interactive image browsing with JPEG-2000 [Taubman et al. 2003] Interactive streaming of lightfields [Ramanathan et al. 2004] Interactive streaming of panoramic videos [Heymann et al. 2005] ... Sources of high-resolution videos High-resolution digital imaging sensors (CMOS technology) High-resolution videos stitched from multiple cameras Application scenarios Surveillance Instructional videos Snow cams in ski resorts Interactive TV with virtual pan/tilt/zoom

Demo

H.264/AVC Based Coding Scheme ROI Resolution layer N - ↑ ROI Resolution layer 1 - Need enough random access. Should be able to stream portions of video from any resolution and any region. Temporal prediction only on the base layer. ↑ P slices Overview video Hierarchical B pictures

Tradeoff due to Slice Size Small slice size Entire scene takes more bits to encode Slice headers Lack of context continuation across slices for context adaptive coding Cannot exploit inter-pixel correlation across slices Less pixel overhead: Can adapt to ROI due to fine granularity of slice grid Pixel Overhead ROI

Tradeoff Observed for Pedestrian Area, layer 2 1 1.5 2 2.5 Number of pixels transmitted per rendered pixel 0.2 0.3 0.4 0.5 Bit per pixel for coding given layer Pedestrian Area zf 2. 160x160 128x128 64x64 32x32 Slice size in pixels [ ]

Tradeoff Observed for Pedestrian Area, layer 2 0.4 0.45 0.5 0.55 0.6 Bits transmitted per rendered pixel Pedestrian Area zf 2. 160x160 128x128 64x64 32x32 Slice size in pixels [ ]

Tradeoff Observed for Pedestrian Area, layer 3 1 1.5 2 2.5 Number of pixels transmitted per rendered pixel 0.1 0.2 0.3 0.4 Bit per pixel for coding given layer Pedestrian Area zf 3. 160x160 128x128 64x64 32x32 Slice size in pixels [ ]

Tradeoff Observed for Pedestrian Area, layer 3 0.28 0.3 0.32 0.34 0.36 0.38 0.4 Bits transmitted per rendered pixel Pedestrian Area zf 3. 160x160 128x128 64x64 32x32 Slice size in pixels [ ]

Pixel Overhead Analysis in 1-D segment index Imagine an infinitely long line of pixels. In this example, SOI SOI SOI SOI # pixels transmitted (random variable) To simplify the analysis, consider the 1D case.

Pixel Overhead Analysis in 2-D ROI Expected number of pixels transmitted

Optimization Criterion and Constraints Practical constraints narrow down the search: slice dimensions have to be multiples of macroblock width many values can be ruled out since they are likely to be suboptimal constraints due to display dimensions, e.g., restrictions on translation of ROI Bit per pixel for coding given layer Number of pixels transmitted per rendered pixel Goal is to minimize the bits transmitted per second. … for instance for a given resolution layer, if o_h,I is equal to the height of the display area then the the ROI cannot translate vertically. It can only move horizontally. Then you don’t need slices in the vertical direction.

Model Vs Experimental Results (Pedestrian Area, layer 2) 1 1.5 2 2.5 Number of pixels transmitted per rendered pixel 0.2 0.3 0.4 0.5 Bit per pixel for coding given layer Model Experiments 160x160 128x128 64x64 32x32 Slice size in pixels [ ]

Model Vs Experimental Results (Pedestrian Area, layer 2) 0.4 0.45 0.5 0.55 0.6 Bits transmitted per rendered pixel Model Experiments 160x160 128x128 64x64 32x32 Slice size in pixels [ ]

Model Vs Experimental Results (Pedestrian Area, layer 3) 1 1.5 2 2.5 Number of pixels transmitted per rendered pixel 0.1 0.2 0.3 0.4 Bit per pixel for coding given layer Model Experiments 160x160 128x128 64x64 32x32 Slice size in pixels [ ]

Model Vs Experimental Results (Pedestrian Area, layer 3) 0.28 0.3 0.32 0.34 0.36 0.38 0.4 Bits transmitted per rendered pixel Model Experiments 160x160 128x128 64x64 32x32 Slice size in pixels [ ]

Summary Coding scheme provides random access to arbitrary resolutions arbitrary spatial regions within every resolution Slice size is optimized given the video signal the QP the ROI display area dimensions Other coding parameters could be further optimized, for example, joint selection of the QP for the base layer and the enhancement layers

The End

Backup Slides Follow Hereafter

Parts of the Client’s Display Overview display area ROI display area Just to define the terminology: we call this the overview display and we call this the ROI display. The location of the ROI is shown by overlaying a rectangle on the overview video. You might have noticed that the size and color of the rectangle vary according to the zoom factor.

Region-of-Interest Trajectory ROI ROI ROI Original video is available in resolutions Now we define that the ROI trajectory as the path over which the ROI moves. Lets say these are the various resolutions or zoom factors possible. And the ROI is allowed to move about within any of these resolutions, e.g. this animation. by for and , i.e., highest resolution

Pixel Overhead Analysis in 1-D segment index Imagine an infinitely long line of pixels. In this example, SOI SOI SOI SOI Pixel Overhead Theorem: Given that , increases monotonically with is independent of To simplify the analysis, consider the 1D case.

Pixel Overhead Analysis in 2-D ROI Expected value of pixel overhead in 2-D Expected number of pixels to be transmitted