Coding Approaches for End-to-End 3D TV Systems

Slides:



Advertisements
Similar presentations
Basics of MPEG Picture sizes: up to 4095 x 4095 Most algorithms are for the CCIR 601 format for video frames Y-Cb-Cr color space NTSC: 525 lines per frame.
Advertisements

Motivation Application driven -- VoD, Information on Demand (WWW), education, telemedicine, videoconference, videophone Storage capacity Large capacity.
Anahita: A System for 3D Video Streaming with Depth Customization
Video enhances, dramatizes, and gives impact to your multimedia application. Your audience will better understand the message of your application.
Plenoptic Stitching: A Scalable Method for Reconstructing 3D Interactive Walkthroughs Daniel G. Aliaga Ingrid Carlbom
HK UST * Hong Kong University of Science and Technology HK UST Modeling Hair from Multiple Views Y. Wei, E. Ofek, L. Quan and H. Shum.
Overview of Multi-view Video Coding Yo-Sung Ho; Kwan-Jung Oh; Systems, Signals and Image Processing, 2007 and 6th EURASIP Conference focused on Speech.
Computational Photography Light Field Rendering Jinxiang Chai.
CSCE 641: Computer Graphics Image-based Rendering Jinxiang Chai.
1 Angel: Interactive Computer Graphics 4E © Addison-Wesley 2005 Models and Architectures Ed Angel Professor of Computer Science, Electrical and Computer.
2010/10/13VCLAB, National Tsing Hua University, Taiwan1 Multiview Video Kai-Chao Yang.
Image Formation and Digital Video
+ Video Compression Rudina Alhamzi, Danielle Guir, Scott Hansen, Joe Jiang, Jason Ostroski.
3D/Multview Video. Outline Introduction 3D Perception and HVS 3D Displays 3D Video Representation Compression.
Audio Compression Usha Sree CMSC 691M 10/12/04. Motivation Efficient Storage Streaming Interactive Multimedia Applications.
Efficient Editing of Aged Object Textures By: Olivier Clément Jocelyn Benoit Eric Paquette Multimedia Lab.
Profiles and levelstMyn1 Profiles and levels MPEG-2 is intended to be generic, supporting a diverse range of applications Different algorithmic elements.
© 2011 The McGraw-Hill Companies, Inc. All rights reserved Chapter 6: Video.
Video Video.
DIGITAL Video. Video Creation Video captures the real world therefore video cannot be created in the same sense that images can be created video must.
Advanced Computer Technology II FTV and 3DV KyungHee Univ. Master Course Kim Kyung Yong 10/10/2015.
High-Resolution Interactive Panoramas with MPEG-4 발표자 : 김영백 임베디드시스템연구실.
CCTV Camera Component and Technology Imaging sensor Optic - Lens Camera Technology IP vs analogue CCTV Uniview IPC features Fundamental of CCTV.
CSC 461: Lecture 3 1 CSC461 Lecture 3: Models and Architectures  Objectives –Learn the basic design of a graphics system –Introduce pipeline architecture.
A Camera-Projector System for Real-Time 3D Video Marcelo Bernardes, Luiz Velho, Asla Sá, Paulo Carvalho IMPA - VISGRAF Laboratory Procams 2005.
1 Introduction to Computer Graphics with WebGL Ed Angel Professor Emeritus of Computer Science Founding Director, Arts, Research, Technology and Science.
1Computer Graphics Lecture 4 - Models and Architectures John Shearer Culture Lab – space 2
MULTIMEDIA INPUT / OUTPUT TECHNOLOGIES
Scalable Video Coding and Transport Over Broad-band wireless networks Authors: D. Wu, Y. Hou, and Y.-Q. Zhang Source: Proceedings of the IEEE, Volume:
112/5/ :54 Graphics II Image Based Rendering Session 11.
UNDER THE GUIDANCE DR. K. R. RAO SUBMITTED BY SHAHEER AHMED ID : Encoding H.264 by Thread Level Parallelism.
IntroductiontMyn1 Introduction MPEG, Moving Picture Experts Group was started in 1988 as a working group within ISO/IEC with the aim of defining standards.
Distribution of 3D standard contents June Jun Okamoto (NTT)
Image-Based Rendering Geometry and light interaction may be difficult and expensive to model –Think of how hard radiosity is –Imagine the complexity of.
1 E. Angel and D. Shreiner: Interactive Computer Graphics 6E © Addison-Wesley 2012 Models and Architectures 靜宜大學 資訊工程系 蔡奇偉 副教授 2012.
Digital Video Representation Subject : Audio And Video Systems Name : Makwana Gaurav Er no.: : Class : Electronics & Communication.
Presenting: Shlomo Ben-Shoshan, Nir Straze Supervisors: Dr. Ofer Hadar, Dr. Evgeny Kaminsky.
NETWORK VIDEO SURVEILLANCE. CCTV Closed-Circuit Television (CCTV) is the use of video cameras to transmit signal to a specific place on a designated device.
INTERNET PROTOCOL TELEVISION (IP-TV)
UFOCaptureV2 Time-Shift-Motion-Detect-Video-Recording software for complete records of un-expected events April / 2010 UFOCapture.
Padmasri Dr.BV Raju Institute Of Technology
Prepared by jajal patel (09dit008.)
CSI-447 : Multimedia Systems
Jun Shimamura, Naokazu Yokoya, Haruo Takemura and Kazumasa Yamazawa
Paper – Stephen Se, David Lowe, Jim Little
Steps Towards the Convergence of Graphics, Vision, and Video
DCT IMAGE COMPRESSION.
H.264/SVC Video Transmission Over P2P Networks
Aditya Mavlankar, Pierpaolo Baccichet, David Varodayan and Bernd Girod
StreamNetTM Soft Decoder – VIEW Multimedia Receiver (VMR) 1/21/2013.
3D TV TECHNOLOGY.
Image Segmentation Classify pixels into groups having similar characteristics.
Compression of 3D images using disparity based coder
Data Compression.
Video Compression - MPEG
3D Graphics Rendering PPT By Ricardo Veguilla.
Chapter 6: Video.
INTERNET PROTOCOL TELEVISION (IP-TV)
Models and Architectures
Models and Architectures
Models and Architectures
Introduction to Computer Graphics with WebGL
Overview What is Multimedia? Characteristics of multimedia
Mipmapped High Resolution Video for Immersive Virtual 3D Environments
Digital Image Fundamentals
Models and Architectures
Models and Architectures
RADEON™ 9700 Architecture and 3D Performance
Scalable light field coding using weighted binary images
Presentation transcript:

Coding Approaches for End-to-End 3D TV Systems Seo, DongMahn 24th Jan., 2006 Anthony Vetro, Wojciech Matusik, Hanspeter Pfister, Jun Xin Mitsubishi Electric Research Laboratories, Cambridge

Contents Introduction System Overview Multi-View Video Coding Results Concluding Remarks 24 January 2005

Introduction Three-dimensional TV The next revolution in the history of television. Not viable in the early 1960’s until recently Requires capturing and displaying multi-view video of real-life dynamic scenes High processing and bandwidth requirements 24 January 2005

Introduction (cont.) Prototype 3D TV system End-to-End 3D TV Except for broadcasting over a digital channel All aspects of 3D TV : multi-view video acquisition, compression, and 3D display Distributed Architecture Distributed clusters of PCs To handle the large processing and bandwidth requirements of multi-view video. Real-Time Performance Acquired and displayed in real-time with only a minimal amount of lag 24 January 2005

Introduction (cont.) Scalability Projection-based 3D display completely scalable in the number of acquired, transmitted, and displayed views Projection-based 3D display array of 16 projectors to high-resolution display with 16×1024 ×768 pixels A lenticular screen provides auto-stereocopic images with horizontal parallax and 16 views Computational alignment image alignment and intensity adjustment of 3D display completely automatic using a camera in the loop 24 January 2005

System Overview All PCs 3 GHz Pentium 4 processors, 2 GB of RAM, and run WindowsXP 24 January 2005

System Overview (cont.) Acquisition Some systems 2D array of lenslets optical fibers in combination with a Fresnel lens in front of a high-definition camera to capture multiple views simultaneously multiple views per lenslet both horizontal and vertical directions limited resolution of the camera sensor (fixed at HDTV) very limited number of lenslets to acquire high-resolution multi-view video an array of synchronized cameras camera with cluster of PCs Stanford multi-camera array : up to 128 cameras this paper 16 high-resolution (1300×1030) cameras, progressive video at 12 frames per second pair of cameras are connected by IEEE 1394 bus to one of eight producer PCs. 24 January 2005

System Overview (cont.) to guarantee good image quality on the display side to interpolate dense virtual views from the sparse video data impossible without a scene model such as per-pixel depth maps a prior model of the acquired objects densely-spaced linear array of 17 cameras roughly perpendicular to a common camera plane the output views on the display correspond to the acquired views of the cameras impossible to align multiple cameras precisely using lightfield rendering on the display side to synthesize new views acquire or compute per-pixel depth maps to improve the view-interpolation quality 24 January 2005

System Overview (cont.) 3D Display 24 January 2005

System Overview (cont.) Parallax and Lenticular Displays 24 January 2005

System Overview (cont.) Multi-Projector Implementation 24 January 2005

System Overview (cont.) 24 January 2005

System Overview (cont.) View Interpolation one possible implementation of our 3D TV system one-to-one mapping of cameras to projectors without view-interpolation very simple and scales well not very flexible cameras and projectors need to be equally spaced, which is hard to achieve in practice cannot handle the case when the number of cameras and projectors is not the same not provide the ability to interactively control the viewpoint, a feature that has been termed free-viewpoint video 24 January 2005

System Overview (cont.) performance of our lightfield rendering implementation completely independent of the total number of transmitted views each virtual output view required only a small number of source frames maximum bandwidth on the network between consumers limited, which is important to provide scalability 24 January 2005

Multi-View Video Coding 3D TV broadcasting transmitted all views to multiple users simultaneously transmitting 16 uncompressed video streams with 1280×720 resolution (4:2:0 format) at 30 frames per second 5.3Gb/sec bandwidth efficient multi-view video coding to make 3D TV attractive to broadcasters and network operators simulcast temporally encode the individual video streams our current prototype system full camera resolution (1300×1030) encoded in real-time with MPEG-2 and decoded on the producer PCs advantage : commercially available codecs could be used coding efficiency is generally lower than with other multi-view coding approaches 24 January 2005

Multi-View Video Coding (cont.) Another approach to multi-view video compression :: European ATTEST project to reduce the data to a single view with per-pixel depth map compressed in real-time and broadcast as an MPEG-2 enhancement layer receiver side : stereo or multi-view images are generated using image-based rendering difficult to generate high-quality output occlusions or high disparity in the scene single view cannot capture view-dependent appearance effects such as reflections and specular highlights MPEG H.264/AVC 24 January 2005

Multi-View Video Coding (cont.) additional issues to consider to achieve optimal compression efficiency cameras are not expected to be perfectly aligned illumination and color between views is not consistent due to intrinsic parameters of each camera sampling theory minimal number of views required for transmission and rendering lower bound for the minimum number of samples required for lightfield/lumigraph rendering related to camera resolution and scene depth complexity the adaptive dropping of segments in the multi-view video entire frames or blocks interpolated or synthesized at the receiving end have strong potential to improve the coding efficiency in the rate-distortion sense 24 January 2005

Result top : viewer side of the front-projection display bottom : corresponding images of the camera array 24 January 2005

Concluding Remarks implemented the first real-time end-to-end 3D TV system with enough views and resolution to provide a truly immersive 3D experience 24 January 2005

Thank you! 24 January 2005