Anahita: A System for 3D Video Streaming with Depth Customization

Slides:



Advertisements
Similar presentations
Mobile TV Future Innovative Service from Broadcasters Perspective.
Advertisements

CAUSES & CURE OF LATENCY IN THE INTERNET TELEPHONY DR. OLUMIDE SUNDAY ADEWALE Dept of Industrial Math & Computer Science Federal University of Technology.
Fortune >80% deploying mobile clients Smartphones 289M in 2010 >900M in Slates 55M in 2011 >200M in Gartner Forecast: Mobile.
Panasonic Singapore Labs – Network Team QoS and Delivery Context in Rule-Based Edge Services Prepared for IWCW2002 By Ng Chan Wah
Pouya Ostovari and Jie Wu Computer and Information Sciences
Building Cloud-ready Video Transcoding System for Content Delivery Networks(CDNs) Zhenyun Zhuang and Chun Guo Speaker: 饒展榕.
NDN in Local Area Networks Junxiao Shi The University of Arizona
The Fundamentals of Stereoscopic 3D (S3D) Display Technologies for Virtual Reality, Film, and Video Games Mark Newburn Vizics Inc.
Doc.: IEEE /0604r1 Submission May 2014 Slide 1 Modeling and Evaluating Variable Bit rate Video Steaming for ax Date: Authors:
Time Slicing in Mobile TV Broadcast Networks with Arbitrary Channel Bit Rates Cheng-Hsin Hsu Joint work with Dr. Mohamed Hefeeda April 23, 2009 Simon Fraser.
Personalized Abstraction of Broadcasted American Football Video by Highlight Selection Noboru Babaguchi (Professor at Osaka Univ.) Yoshihiko Kawai and.
Implement and deploy mobile learning at open universities -- The big promise and many challenges ahead Harris Wang School of Computing and information.
3dtv.at Stereoscopic Player and Stereoscopic Multiplexer S3D-Today November 2006 Munich, Germany.
3D Displays Duncan Lindbo, Rebecca Brown, Bao Khang Nguyen.
ADVISE: Advanced Digital Video Information Segmentation Engine
1 Lecture 11 Scene Modeling. 2 Multiple Orthographic Views The easiest way is to project the scene with parallel orthographic projections. Fast rendering.
Analysis of compressed depth and image streaming on unreliable networks Pietro Zanuttigh, Andrea Zanella, Guido M. Cortelazzo.
V1.00 © 2009 Research In Motion Limited Introduction to Mobile Device Web Development Trainer name Date.
Android in the Cloud Chromebooks, BYOD and Wearables Joel Isaacson Copyright 2014 Joel Isaacson
Overview of Search Engines
Bhojan Anand‡, Karthik Thirugnanam†, Jeena Sebastian‡, Pravein G. Kannan‡, Akhihebbal L. Ananda‡, Mun Choon Chan‡ and Rajesh Krishna Balan† ‡ National.
Motion graphics. Film title sequence Film title sequence is a way of presenting a film title or television programs which includes the cast members, producer,
1 Motivation Video Communication over Heterogeneous Networks –Diverse client devices –Various network connection bandwidths Limitations of Scalable Video.
Speaker: Chi-Yu Hsu Advisor: Prof. Jian-Jung Ding Leveraging Stereopsis for Saliency Analysis, CVPR 2012.
3D Stereo Video Coding Heejune AHN Embedded Communications Laboratory Seoul National Univ. of Technology Fall 2013 Last updated
3D/Multview Video. Outline Introduction 3D Perception and HVS 3D Displays 3D Video Representation Compression.
New requirement of subjective video quality assessment methodologies for 3DTV Wei Chen(*) 1,2, Jérôme Fournier 1, Marcus Barkowsky 2, Patrick Le Callet.
A computer is a machine that manipulates data according to a list of instructions There are a lot of terms can be found over the internet related to computer.
Android in the Cloud Chromebooks, BYOD and Wearables Joel Isaacson Copyright 2014 Joel Isaacson
Stereoscopic Analyzer On-Set Assistance System for 3D Capturing Frederik Zilly.
Maria Grazia Albanesi, Riccardo Amadeo University of Pavia, Faculty of Engineering, Computer Department Impact of Fixation Time on Subjective Video Quality.
Eric Moore Computer Users Group of Greeley.  Linked to a local network or Internet  Networking may be built-in or an add-on (set-top box)  Examples.
By Qian Deng MobiUS: Enable Together-Viewing Video Experience across Two Mobile Devices.
New ITU-R Question 128/6: Digital three-dimensional (3D) TV broadcasting 1)What are the user requirements for digital 3D TV broadcasting systems? 2)What.
Exploiting Proxy-Based Transcoding to Increase the User Quality of Experience in Networked Applications Maarten Wijnants Patrick Monsieurs Peter Quax Wim.
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada Video Streaming over Cooperative Wireless Networks Mohamed Hefeeda (Joint.
Context-Aware Interactive Content Adaptation Iqbal Mohomed, Jim Cai, Sina Chavoshi, Eyal de Lara Department of Computer Science University of Toronto MobiSys2006.
Mark Claypool’s MQP Projects Network Games Streaming Media.
ACM NOSSDAV 2007, June 5, 2007 IPTV Experiments and Lessons Learned Panelist: Klara Nahrstedt Panel: Large Scale Peer-to-Peer Streaming & IPTV Technologies.
The Way Forward Factors Driving Video Conferencing Dr. Jan Linden, VP of Engineering Global IP Solutions.
Pillars Of FPD Growth Full HD 3D Smart TV (Internet Ready)
Technische Universität Berlin Communication Systems Group Director: Prof. Thomas Sikora Sebastian Knorr 21/08/2007 Super-Resolution.
CSE 185 Introduction to Computer Vision Stereo. Taken at the same time or sequential in time stereo vision structure from motion optical flow Multiple.
Standards and Protocols Slide show for WiFi Characteristics of a wireless local network. It was named after a group of people who invented. The.
For Wi-Fi The is internationally describing characteristics of a wireless local network. It was named after a group of people who invented.
Counting How Many Words You Read
Immersive Rendering. General Idea ► Head pose determines eye position  Why not track the eyes? ► Eye position determines perspective point ► Eye properties.
Vinay.R Mar Networked Media Lab GIST Multi-site Visualization Sharing.
MACHINE VISION GROUP MOBILE FEATURE-CLOUD PANORAMA CONSTRUCTION FOR IMAGE RECOGNITION APPLICATIONS Miguel Bordallo, Jari Hannuksela, Olli silvén Machine.
Saving Bitrate vs. Users: Where is the Break-Even Point in Mobile Video Quality? ACM MM’11 Presenter: Piggy Date:
Camera surface reference images desired ray ‘closest’ ray focal surface ‘closest’ camera Light Field Parameterization We take a non-traditional approach.
 Security issues that are often faced when using a Net connected desktop or laptop computer are to be anticipated on the mobile Web.  There are concerns.
Active learning assignment Subject :- AVS Topic:- 3-D TV Guided by :- Mr. Gunjan Jani Branch :- Electronics and communication Batch :- b2 Group members:-
Vidhyaeep Institute of Engineering & Technology. Name: Divyesh Kabariya Branch : E.C. Sem.: 5 th Enroll. No.: Subject : Audio Video System.
La Salle University – Fall 2013 INL 880 – Capstone Presentation Presented by: Loc Nguyen & Shweta Somalwar December 18, 2013.
A Framework for Perceptual Studies in Photorealistic Augmented Reality Martin Knecht 1, Andreas Dünser 2, Christoph Traxler 1, Michael Wimmer 1 and Raphael.
1 2D TO 3D IMAGE AND VIDEO CONVERSION. INTRODUCTION The goal is to take already existing 2D content, and artificially produce the left and right views.
Digital Video Representation Subject : Audio And Video Systems Name : Makwana Gaurav Er no.: : Class : Electronics & Communication.
Presenting: Shlomo Ben-Shoshan, Nir Straze Supervisors: Dr. Ofer Hadar, Dr. Evgeny Kaminsky.
A Measurement Study of Oculus 360 Degree Video Streaming
Padmasri Dr.BV Raju Institute Of Technology
Prepared by jajal patel (09dit008.)
A SEMINAR ON ROVER TECHNOLOGY
A Novel 2D-to-3D Conversion System Using Edge Information
3D TV TECHNOLOGY.
Distributed Multimedia Systems
Coding Approaches for End-to-End 3D TV Systems
GATES: A Grid-Based Middleware for Processing Distributed Data Streams
TIMING/VIDEO Remove auto-advancing after creating a video version:
Various mobile devices
Presentation transcript:

Anahita: A System for 3D Video Streaming with Depth Customization K. Calagari, K. Templin, T. Elgamal, K. Diab, P. Didyk, W. Matusik, M. Hefeeda QCR & MIT & SFU 5 November 2014 1 1

Motivations 3D content is popular especially in theaters  Controlled, homogenous environments 2 2

Internet 3D Video Streaming … … is a lot more challenging because of … complexity of perceiving and rendering depth on different displays & in dynamic network conditions 3 3

Sample 3D Displays 4

How do we see 3D? 5

Approximate Model for Depth [Holliman 04] Positive disparity Negative disparity z d d Left eye Left eye z e e p p Right eye Right eye 6 6

Depth & Comfort Zone [Shibata 11] Comfort zone depends on: Presented content Viewing condition Screen distance … This plot shows how the depth range that is comfortable for viewers depends on viewing distance. For large viewing distances we can reproduce depth up to infinity. This is why stereo works currently so well for cinema but not for desktop and mobile devices where the depth range is significantly reduced. “The zone of comfort: Predicting visual discomfort with stereo displays” by Shibata et al. 2011 7

Rendering 3D Content Many different display technologies From anaglyph to auto stereoscopic 8 8

3D Display: Active/Passive Glasses 9

3D Display: Glasses-Free Autostereoscopic Parallax Barriers: Blocks light in certain direction Lenticular Lens: Reflects light to certain direction 10

3D Display: Multi-views 11

Network Dynamics Dynamic conditions Bandwidth Loss rate Delay 12 12

In Summary … Depth perception heavily depends on Display size and technology User preferences Current 3D streaming systems are simple add-on to 2D streaming  Poor depth/visual perception and/or discomfort 13 13

3D Streaming Systems Need to .. Customize depth for different displays Big TV … to phone Serve different formats Stereo, V+D, MV + D, … Adapt to network dynamics Wired, wireless, …, varying bandwidth Manage many versions Compared to 1—3 versions for 2D videos Compared to 1-few versions for 2D videos 14

5 x 7 x 3 = 105 versions for each video Example Display Technology Network Bitrate Display Size Side By Side Top Bottom Frame Sequential Anaglyph Row-Interleaved Column-Interleaved Video Plus Depth High Quality TV Desktop Laptop Tablet Cell Phone Medium Quality Low Quality 5 x 7 x 3 = 105 versions for each video 15

Our Challenge Enable all devices to render best possible 3D videos From mobiles to big TVs 16

Our Solution … In ancient time, Anahita is the source of all clean water streams that flow through golden channels to all lands and oceans on Earth.  In Internet age, Anahita is the source of all high-quality 3D video streams that flow through network channels to all types of displays. 17

Our Contributions Show the need for depth optimization Subjective study on different displays Anahita: complete system for 3D streaming Supports most current displays/technologies Provides efficient management of 3D versions Enables personalized depth Is scalable and dynamic (using DASH) Method for depth expansion and compression Optimizes depth: from mobile phones to big TVs Preserves scene structure (important for sports) Does not introduce visual artifacts Is computationally inexpensive 18 18

The Need for Depth Customization Subjective study 10 subjects 2 displays (phone & 55” TV) Series of short 3D clips (from soccer) Show six different versions Original Depth compression Depth expansion (4 cases) Versions are shown in random to subjects Subjects are asked which version is preferred 19 19

Results Best version is NOT the original one Depth customization depends on display size 20 20

Depth Customization SPSS: Structure Preserving Scene Shifting Expands or compresses depth in stereo video Simple image processing operations Preserves lines and planes Suitable for field sports: soccer, football, tennis, … 21 21

SPSS: Basic Idea Control disparity of pixels Two operations  increase/decrease depth Two operations Slant (affects vertical component of disparity) Stretch (affects horizontal component of disparity) 22 22

Depth Gradient 2D+ depth - Depth map g = (gx, gy) 23

Depth Gradient 2D+ depth - Depth map g = (gx, gy) 24

SPSS: Slant Left eye image Right eye image We propose depth manipulation technique which uses simple operations which does not require dense depth information or image-based rendering. As a result the manipulartion are very efficient and do not produce noticable artifacts which are often for image-based rendering techniques. 25

SPSS: Slant gy is changed 26

SPSS: Stretch gx is changed 27

SPSS: Main Steps Calculate gradient g = (gx, gy) of scene disparity Compute slant and stretch factor Remap left/right views using: 28 28

SPSS: Sample Results Before e is the expansion value After 29

SPSS: Sample Results Before After e is the expansion value 30

SPSS: Coverage SPSS applies to scenes with planar depth  long shots in field sports Our analysis of multiple full games 60—70% of the game Close-ups and short shots do not benefit from SPSS Anahita: has automatic shot classifier 31 31

Anahita: System Architecture 32

Anahita: 3D Version Manager 33

Adaptive Streamer Based on DASH Versions are organized in segments Most suitable 3D version is chosen for each client Client can personalize: ask for more/less depth Served using DASH 34 34

Implementation: Whole System Server Deployed on Amazon Implements many 3D video processing operations, and our SPSS depth customization method Clients Mobile, tablet (mobile apps) Stereo TV (web) Desktop display (web) Auto stereoscopic 55” display (custom app) 35 35

Evaluation: Subjective Study 5 displays mobile, tablet, 15.6” laptop, 27” desktop, 55” TV 3 soccer 3D video clips from YouTube Man United vs. Wigan (60 sec) Chelsea vs. Wigan (24 sec) Chelsea vs. Plymouth (20 sec) 15 subjects viewed all clips (original & optimized) on all displays in random Ranked depth from 1-poor to 5-Excellent 36 36

Sample Results Significant improvements in all cases up to 35% 37 37

Conclusions 3D streaming systems are more complex than 2D Showed the need for depth adaptation Proposed method to customize depth Designed 3D streaming system Evaluation study shows large gains 38 38

Anahita: Meaning Ancient Persia: Anahita is source of all water, where warm and clear streams flow through golden channels to all lands and oceans on Earth  Internet age: Anahita is source of all 3D videos, where high-quality streams of 3D videos flow through network channels to all types of displays