Viewport-based 360 Video Streaming:

Slides:



Advertisements
Similar presentations
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
Advertisements

KIANOOSH MOKHTARIAN SCHOOL OF COMPUTING SCIENCE SIMON FRASER UNIVERSITY 6/24/2007 Overview of the Scalable Video Coding Extension of the H.264/AVC Standard.
MPEG-1: A Standard for Digital Storage of Audio and Video Nimrod Peleg Update: Dec
MPEG4 Natural Video Coding Functionalities: –Coding of arbitrary shaped objects –Efficient compression of video and images over wide range of bit rates.
Basics of MPEG Picture sizes: up to 4095 x 4095 Most algorithms are for the CCIR 601 format for video frames Y-Cb-Cr color space NTSC: 525 lines per frame.
Scalable ROI Algorithm for H.264/SVC-Based Video Streaming Jung-Hwan Lee and Chuck Yoo, Member, IEEE.
2004 NTU CSIE 1 Ch.6 H.264/AVC Part2 (pp.200~222) Chun-Wei Hsieh.
-1/20- MPEG 4, H.264 Compression Standards Presented by Dukhyun Chang
Error Resilience for MPEG-4 Environment Nimrod Peleg Nov
Technion - IIT Dept. of Electrical Engineering Signal and Image Processing lab Transrating and Transcoding of Coded Video Signals David Malah Ran Bar-Sella.
MULTIMEDIA PROCESSING
A Highly Parallel Framework for HEVC Coding Unit Partitioning Tree Decision on Many-core Processors Chenggang Yan, Yongdong Zhang, Jizheng Xu, Feng Dai,
Software Architecture of High Efficiency Video Coding for Many-Core Systems with Power- Efficient Workload Balancing Muhammad Usman Karim Khan, Muhammad.
{ Fast Disparity Estimation Using Spatio- temporal Correlation of Disparity Field for Multiview Video Coding Wei Zhu, Xiang Tian, Fan Zhou and Yaowu Chen.
Fast Mode Decision for Multiview Video Coding Liquan Shen, Tao Yan, Zhi Liu, Zhaoyang Zhang, Ping An, Lei Yang ICIP
Yu-Han Chen, Tung-Chien Chen, Chuan-Yung Tsai, Sung-Fang Tsai, and Liang-Gee Chen, Fellow, IEEE IEEE CSVT
Shaobo Zhang, Xiaoyun Zhang, Zhiyong Gao
Ch. 6- H.264/AVC Part I (pp.160~199) Sheng-kai Lin
Overview of the Scalable Video Coding Extension of the H
Video Transmission Adopting Scalable Video Coding over Time- varying Networks Chun-Su Park, Nam-Hyeong Kim, Sang-Hee Park, Goo-Rak Kwon, and Sung-Jea Ko,
1 Single Reference Frame Multiple Current Macroblocks Scheme for Multiple Reference IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY Tung-Chien.
Error Concealment For Fine Granularity Scalable Video Transmission Hua Cai; Guobin Shen; Feng Wu; Shipeng Li; Bing Zeng; Multimedia and Expo, Proceedings.
Complexity Model Based Load- balancing Algorithm For Parallel Tools Of HEVC Yong-Jo Ahn, Tae-Jin Hwang, Dong-Gyu Sim, and Woo-Jin Han 2013 IEEE International.
H.264/AVC for Wireless Applications Thomas Stockhammer, and Thomas Wiegand Institute for Communications Engineering, Munich University of Technology, Germany.
Error Resilience of Video Transmission By Rate-Distortion Optimization and Adaptive Packetization Yuxin Liu, Paul Salama and Edwad Delp ICME 2002.
An Introduction to H.264/AVC and 3D Video Coding.
January 26, Nick Feamster Development of a Transcoding Algorithm from MPEG to H.263.
MPEG-2 Digital Video Coding Standard
Liquan Shen Zhi Liu Xinpeng Zhang Wenqiang Zhao Zhaoyang Zhang An Effective CU Size Decision Method for HEVC Encoders IEEE TRANSACTIONS ON MULTIMEDIA,
MPEG-2 Standard By Rigoberto Fernandez. MPEG Standards MPEG (Moving Pictures Experts Group) is a group of people that meet under ISO (International Standards.
Page 19/15/2015 CSE 40373/60373: Multimedia Systems 11.1 MPEG 1 and 2  MPEG: Moving Pictures Experts Group for the development of digital video  It is.
By Abhishek Hassan Thungaraj Supervisor- Dr. K. R. Rao.
Codec structuretMyn1 Codec structure In an MPEG system, the DCT and motion- compensated interframe prediction are combined. The coder subtracts the motion-compensated.
TM Paramvir Bahl Microsoft Corporation Adaptive Region-Based Multi-Scaled Motion- Compensated Video Coding for Error Prone Communication.
High Efficiency Video Coding Kiana Calagari CMPT 880: Large-scale Multimedia Systems and Cloud Computing.
- By Naveen Siddaraju - Under the guidance of Dr K R Rao Study and comparison between H.264.
Figure 1.a AVS China encoder [3] Video Bit stream.
Guillaume Laroche, Joel Jung, Beatrice Pesquet-Popescu CSVT
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
Fast motion estimation and mode decision for H.264 video coding in packet loss environment Li Liu, Xinhua Zhuang Computer Science Department, University.
MPEG-1Standard By Alejandro Mendoza. Introduction The major goal of video compression is to represent a video source with as few bits as possible while.
Overview of Fine Granularity Scalability in MPEG-4 Video Standard Weiping Li Presented by : Brian Eriksson.
Video Compression—From Concepts to the H.264/AVC Standard
Video Compression and Standards
COMPARATIVE STUDY OF HEVC and H.264 INTRA FRAME CODING AND JPEG2000 BY Under the Guidance of Harshdeep Brahmasury Jain Dr. K. R. RAO ID MS Electrical.
Time Optimization of HEVC Encoder over X86 Processors using SIMD Kushal Shah Advisor: Dr. K. R. Rao Spring 2013 Multimedia.
Outline  Introduction  Observations and analysis  Proposed algorithm  Experimental results 2.
Introduction to MPEG Video Coding Dr. S. M. N. Arosha Senanayake, Senior Member/IEEE Associate Professor in Artificial Intelligence Room No: M2.06
MPEG Video Coding I: MPEG-1 1. Overview  MPEG: Moving Pictures Experts Group, established in 1988 for the development of digital video.  It is appropriately.
Computational Controlled Mode Selection for H.264/AVC June Computational Controlled Mode Selection for H.264/AVC Ariel Kit & Amir Nusboim Supervised.
Complexity varying intra prediction in H.264 Supervisors: Dr. Ofer Hadar, Mr. Evgeny Kaminsky Students: Amit David, Yoav Galon.
Introduction to H.264 / AVC Video Coding Standard Multimedia Systems Sharif University of Technology November 2008.
A Measurement Study of Oculus 360 Degree Video Streaming
HEVC Intra Prediction Prepared by Shevach Riabtsev
H.264/SVC Video Transmission Over P2P Networks
Thomas Daede October 5, 2017 AV1 Update Thomas Daede October 5, 2017.
Overview of the Scalable Video Coding
Aditya Mavlankar, Pierpaolo Baccichet, David Varodayan and Bernd Girod
Future Video Coding Prof. Dr.-Ing. Jörn Ostermann.
User-Oriented Approach in Spatial and Temporal Domain Video Coding
Title: Bandwidth and Latency Requirements for Virtual Reality
Supplement, Chapters 6 MC Course, 2009.
ENEE 631 Project Video Codec and Shot Segmentation
/ Fast block partitioning method in HEVC Intra coding for UHD video /
Viewport-based 360 Video Streaming:
MPEG4 Natural Video Coding
MPEG-Immersive 3DoF+ Standard Work:
Progress & schedule Presenter : YY Date : 2014/10/3.
Terms for MPEG-Immersive 3DoF+ Standard Work
Scalable light field coding using weighted binary images
Presentation transcript:

Viewport-based 360 Video Streaming: Motion-Constrained Tile and Viewport Extraction Presenter: Eun-Seok Ryu (esryu@gachon.ac.kr) Dongmin Jang, Jong-Beom Jeong, Eun-Seok Ryu Multimedia Communications and Systems Lab. (MCSL) http://mcsl.gachon.ac.kr Department of Computer Engineering Gachon University

The emergence of various HMD Requirements for high quality VR Introduction High Bandwidth Requirement of VR Recently, various HMD devices are on the market Recommend 12K resolution for reducing nausea with high quality VR High Bandwidth and high computational complexity are huddle Need to reduce the required bandwidth! Requirement details pixels/degree 40 pix/deg video resolution 11520x6480 framerate 90 fps The emergence of various HMD (Gear VR, Oculus Rift, Daydream, PlayStation VR) Requirements for high quality VR Source:Technicolor, Oct. 2016 (m39532, MPEG 116th Meeting) 3079-18-0045-00-0003

Viewport Independent vs Viewport Dependent (Proposed Method) Viewport Independent Transmit whole picture Projection and packing Downsampling / adjusting QP Transmit viewport only Bitrate saving ecoding complexity reducing But, delay viewport Select a tile set corresponding to the viewport Decoder Viewpoint-based decoding 3079-18-0045-00-0003

Keypoint for Viewport Dependent Streaming Field of View(FOV) The field of view (FOV) in the HMDs : 96° to 110° Part of the 360° picture Tiles Parallelization Tools Divide into rectangular regions Flexible horizontal and vertical boundaries The user’s current viewport : high resolution Remaining part : low resolution Field of View (FOV) Spatially refers to only its own tile, but temporally refers to other tiles Decoding problems occurs when transmitting only some tiles A frame divided into 8 Tiles 3079-18-0045-00-0003

Proposed Motion Constraint Tile Sets Reference Picture Motion vectors can refer to anyware Original Motion Estimation before applying MCTS Modify motion vector range Motion Estimation after applying MCTS Reference Current Picture Motion vectors only refer to same position Tile Adopt our MCTS in MPEG (HM ver. 16.18) 3079-18-0045-00-0003

MCTS with HEVC and SHVC - Structure SHM Encoder HM Encoder When referring to the same tile, TIP is performed according to the considerations When referring to the same tile, TIP is performed according to the considerations PicEL t-1 PicEL t Pic t-1 Pic t x x When referring to the other tiles, ILP is performed using Upsampled BL When referring to the other tiles, Intra Prediction is performed EL: Enhancement Layer BL: Base Layer TIP: Temporal Inter Prediction ILP: Inter Layer Prediction Upsampled PicBL t Considerations : Interpolation, Temporal Candidate of AMVP and MERGE 3079-18-0045-00-0003

MCTS Considerations (1/2) Interpolation Use an eight-tap filter to interpolate luma prediction Use 3 pixels of left and top, 4 pixels of right and bottom for Interpolation Modify reference range of motion vectors 3 Pixels 4 Pixels Current Pixel Pixel used for interpolation The current pixel and the pixels used for interpolation Interpolation problem of referring to a tile at the same position in TIP 3079-18-0045-00-0003

MCTS Considerations (2/2) Temporal Candidate of AMVP and MERGE Temporal candidates : C3 and H block (right figure) Problem: cross the column boundary between tiles Exclude H block at the column boundary between tiles Tile 1 Tile 2 PU H Temporal candidate problem at column boundary between Tiles 3079-18-0045-00-0003

Extraction Information Sets (EIS) SEI Message Contains replacement parameter set (max: around 2000 MCTS sets) MCTS set contains a set of tiles to be extracted Extraction Information Set (1 ~ 2048) Parameter set Replacement Information VPS Level and Tier SPS Picture Resolution PPS Tile Partition MCTS Set ( 1 ~ 2048) Parameter Set (1) Tile MCTS index1 VPS, SPS, PPS Tile MCTS index2 Slice Reordering ( n ∝ MCTS Set) Replacement Information of Parameter Set … Slice address … Original VPS Original SPS Original PPS EIS SEI Slice / Tile Slice / Tile NAL Unit NAL Unit NAL Unit NAL Unit NAL Unit NAL Unit Bitstream before extraction 3079-18-0045-00-0003

NAL Bitstream Extractor: Fucntional Flow MCTS Bitstream Extracted Bitstream Target EIS Id Target MCTS set Id Target Highest Temporal Id Input Option Replacement VPS Original VPS Pharse Original PPS Number of Original Tiles Original SPS Replacement SPS Number of EISs, MCTS sets, MCTSs MCTS Id Slice Reordering Info Replacement Parameter set (VPS, SPS, PPS) Original PPS Replacement PPS Pharse EIS SEI Message EIS SEI Target Slice / Tile Slice / Tile Replace Original Parameter set With Replacement Parameter set Target Slice / Tile Slice / Tile Select Target Tile / Slice Corresponding to Input Option Slice / Tile Target Slice / Tile … … First Slice Segment in Pic Flag Slice Segment Address Adjust Slice Header 3079-18-0045-00-0003

Implemented Renderer and Player 3079-18-0045-00-0003

Experimental Results (1/2) Experimental Setup 8K test sequences defined in JVET CTC (common test condition) Random Access (RA) coding structure Uniform 3x3, 9 Tiles HM ver. 16.16 encoder / SHM ver. 12.3 encoder Coding Option SHM Parameter HM parameter Version 12.3 16.16 CTU size 64×64 Coding structure RA QP - Base Layer QP 22 Enhancement Layer QP Tile Uniformly 3x3 = 9 tiles Slice mode Disable all slice options WPP mode Disable all wpp options Name Resolution Frame Length Frame Rate KiteFlite 8192×4096 300 30 fps Harbor Trolley GasLamp 8K Test sequences Coding options 3079-18-0045-00-0003

Experimental Results (2/2) Bitrate savings in case of transmiting some Tiles only using proposed MCTS (among 9 Tiles) Various number of Tiles and eye-tracking with DL are under researching now; expecting NOSSDAV 2019. : ) Proposed SHM Proposed HM Name 4 tiles bitrate saving 1 tile bitrate saving KiteFlite 52% 88% 51% 87% Harbor 53% Trolley 50% 49% GasLamp 47% 86% Average bitrate saving 3079-18-0045-00-0003

Conclusion Motivation High Quality VR >= 12k resolution High BW, High computational complexity Viewport tile streaming for 360 VR Proposed method Motion Constrained Tile Sets (MCTS) Extraction Information Set SEI Message (EIS SEI) NAL Packet Extractor for Selected (ROI) Tiles Results Transmit Selected Tiles without Decoding Errors Save Bitrate, Reduce Computational Complexity at Decoder Side Future work Eye tracking for Accurate Viewport Extraction Deep Learning for ROI Estimation and Prefetching 3079-18-0045-00-0003