Introduction Michael Bleyer LVA Stereo Vision. VU Stereo Vision (3.0 ECTS/2.0 WS)  Anrechenbarkeit: Wahlfach im Masterstudium “Computergraphik & Digitale.

Slides:



Advertisements
Similar presentations
How Do 3D Glasses Work? Caitlin Riddle Types Of Paper 3D Glasses b Anaglyph b Pulfrich b Polarized.
Advertisements

Pose Estimation and Segmentation of People in 3D Movies Karteek Alahari, Guillaume Seguin, Josef Sivic, Ivan Laptev Inria, Ecole Normale Superieure ICCV.
CS 376b Introduction to Computer Vision 04 / 21 / 2008 Instructor: Michael Eckmann.
Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923.
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image Where does the.
Stereo.
Boundary matting for view synthesis Samuel W. Hasinoff Sing Bing Kang Richard Szeliski Computer Vision and Image Understanding 103 (2006) 22–32.
Last Time Pinhole camera model, projection
CS6670: Computer Vision Noah Snavely Lecture 17: Stereo
Multi-view stereo Many slides adapted from S. Seitz.
Introduction to Computer Vision 3D Vision Topic 9 Stereo Vision (I) CMPSCI 591A/691A CMPSCI 570/670.
High-Quality Video View Interpolation
Stereopsis Mark Twain at Pool Table", no date, UCR Museum of Photography.
The plan for today Camera matrix
1 Lecture 11 Scene Modeling. 2 Multiple Orthographic Views The easiest way is to project the scene with parallel orthographic projections. Fast rendering.
Stockman MSU Fall Computing Motion from Images Chapter 9 of S&S plus otherwork.
Stereo Computation using Iterative Graph-Cuts
Lecture 11: Stereo and optical flow CS6670: Computer Vision Noah Snavely.
A Novel 2D To 3D Image Technique Based On Object- Oriented Conversion.
CSE473/573 – Stereo Correspondence
Announcements PS3 Due Thursday PS4 Available today, due 4/17. Quiz 2 4/24.
Visualization- Determining Depth From Stereo Saurav Basu BITS Pilani 2002.
COMP322/S2000/L271 Stereo Imaging Ref.V.S.Nalwa, A Guided Tour of Computer Vision, Addison Wesley, (ISBN ) Slides are adapted from CS641.
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2006 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
Stereo matching Class 10 Read Chapter 7 Tsukuba dataset.
Stereo vision A brief introduction Máté István MSc Informatics.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean 5403 T-R 3:00pm – 4:20pm Lecture #15.
Computer Vision Spring ,-685 Instructor: S. Narasimhan WH 5409 T-R 10:30am – 11:50am Lecture #15.
3D Stereo Video Coding Heejune AHN Embedded Communications Laboratory Seoul National Univ. of Technology Fall 2013 Last updated
Structure from images. Calibration Review: Pinhole Camera.
Michael Bleyer LVA Stereo Vision
Last Week Recognized the fact that the 2D image is a representation of a 3D scene thus contains a consistent interpretation –Labeled edges –Labeled vertices.
Surface Stereo with Soft Segmentation Michael Bleyer 1, Carsten Rother 2, Pushmeet Kohli 2 1 Vienna University of Technology, Austria 2 Microsoft Research.
Lecture 12 Stereo Reconstruction II Lecture 12 Stereo Reconstruction II Mata kuliah: T Computer Vision Tahun: 2010.
Integral University EC-024 Digital Image Processing.
Advanced Computer Technology II FTV and 3DV KyungHee Univ. Master Course Kim Kyung Yong 10/10/2015.
Recap from Monday Image Warping – Coordinate transforms – Linear transforms expressed in matrix form – Inverse transforms useful when synthesizing images.
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
Image-based rendering Michael F. Cohen Microsoft Research.
Stereo Readings Szeliski, Chapter 11 (through 11.5) Single image stereogram, by Niklas EenNiklas Een.
Stereo Many slides adapted from Steve Seitz.
Feature-Based Stereo Matching Using Graph Cuts Gorkem Saygili, Laurens van der Maaten, Emile A. Hendriks ASCI Conference 2011.
Project 2 code & artifact due Friday Midterm out tomorrow (check your ), due next Fri Announcements TexPoint fonts used in EMF. Read the TexPoint.
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image image 1image.
Computer Vision, Robert Pless
Michael Bleyer LVA Stereo Vision
Computer Vision Stereo Vision. Bahadir K. Gunturk2 Pinhole Camera.
1 Artificial Intelligence: Vision Stages of analysis Low level vision Surfaces and distance Object Matching.
Computer Vision Lecture #10 Hossam Abdelmunim 1 & Aly A. Farag 2 1 Computer & Systems Engineering Department, Ain Shams University, Cairo, Egypt 2 Electerical.
Bahadir K. Gunturk1 Phase Correlation Bahadir K. Gunturk2 Phase Correlation Take cross correlation Take inverse Fourier transform  Location of the impulse.
Lecture 16: Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.
stereo Outline : Remind class of 3d geometry Introduction
Digital Image Processing
776 Computer Vision Jan-Michael Frahm Spring 2012.
Segmentation- Based Stereo Michael Bleyer LVA Stereo Vision.
Journal of Visual Communication and Image Representation
Paper presentation topics 2. More on feature detection and descriptors 3. Shape and Matching 4. Indexing and Retrieval 5. More on 3D reconstruction 1.
Perception and VR MONT 104S, Fall 2008 Lecture 8 Seeing Depth
Project 2 due today Project 3 out today Announcements TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAAA.
John Morris Stereo Vision (continued) Iolanthe returns to the Waitemata Harbour.
Correspondence and Stereopsis. Introduction Disparity – Informally: difference between two pictures – Allows us to gain a strong sense of depth Stereopsis.
CSE 185 Introduction to Computer Vision Stereo 2.
Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.
Noah Snavely, Zhengqi Li
CS 6501: 3D Reconstruction and Understanding Stereo Cameras
A Plane-Based Approach to Mondrian Stereo Matching
Lecture 31: Graph-Based Image Segmentation
Computer Vision Stereo Vision.
Occlusion and smoothness probabilities in 3D cluttered scenes
Stereo vision Many slides adapted from Steve Seitz.
Presentation transcript:

Introduction Michael Bleyer LVA Stereo Vision

VU Stereo Vision (3.0 ECTS/2.0 WS)  Anrechenbarkeit: Wahlfach im Masterstudium “Computergraphik & Digitale Bildverarbeitung” Wahlfach im Masterstudium “Medieninformatik”  Webseite der LVA:

VU Stereo Vision (3.0 ECTS/2.0 WS)  Vorlesungstermine (9 Einheiten): Fr 4. März ( ) Fr 11. März ( ) Fr 18. März ( ) Fr 25. März ( ) Fr 01. April ( ) Fr 08. April ( ) Fr 15. April ( ) Fr 06. Mai ( ) Fr 13. Mai ( ) Mündliche Prüfung nach Vereinbarung  Ort: Seminarraum 188/2

Topics Covered in the Lecture (1)  Session 1 - Introduction 3D perception What is disparity? Applications  Session 2 – Basics 3D geometry Challenges in stereo matching Assumptions  Session 3 – Local methods Principle Adaptive windows  Section 4 – Global methods Stereo as energy minimization problem Dynamic programming

Topics Covered in the Lecture (2)  Session 5 - Graph ‐ Cuts Alpha-expansions Fusion-moves  Session 6 – Smoothness Term Belief propagation Different smoothness terms  Session 7 – Data Term Different match measures Role of color  Session 8 - Segmentation ‐ Based Stereo Occlusion handling An example algorithm Stereo and matting  Session 9 – Surface Stereo One of our recent paper Demo - Autostereoscopic display

Homework  Implement a block matching algorithm You will get more details in session 3. The algorithm is very simple. I do not care about programming languages. You have to present your algorithm as part of the oral exam.

My Promise  After attending the lecture you should: Know the basics of stereo visions Be able to understand the current state-of-the art Have understood several principles that you can also use for other vision problems:  Optical flow  Segmentation  Matting  Inpainting  Image Restoration  …

Michael Bleyer LVA Stereo Vision 3D Perception

Human-Eye Separation(~6.5cm) Left 2D ImageRight 2D Image Brain 3D View

3D Perception Human-Eye Separation(~6.5cm) Left 2D ImageRight 2D Image Brain 3D View If we ensure that the left eye sees a 2D image and the right eye sees another one, our brain will try to overlay the images to generate a 3D impression. How can we use this for watching 3D movies?

Anaglyphs  Two images of complementary color are overlaid to generate one image.  Glasses required (e.g. red/green)  Red filter cancels out red image component, green filter cancels out green component  Each eye gets one image => 3D impression  Current 3D cinemas use this principle. However, polarization filters are used instead of color filters. (Anaglyph Image) (Red/Green Glasses)

Shutter Glasses  Display flickers between left and right image (i.e. each even frame shows left image, each odd frame shows right image)  When left frame is shown, shutter glasses close right eye and vice versa.  Requires new displays of high frame rate (120Hz).  Currently pushed by Nvidea to address gaming market. (Shutter Glasses and 120 Hz Display) (Nvidea Artwork)

Autostereoscopic Displays  No glasses required!  Matrix of many transparent lenses put on the display.  Lenses distort pixels so that left eye gets a left image and right eye gets a right image (if you are standing in a sweet spot) => 3D impression  Novel viewpoint capability: You can walk in front of the display and get a perceptively correct depth impression depending on your current viewpoint.  You will get a demo soon (Philips Wowvx Display)

Free Viewing (No glasses required, but some practice)  The way how you usually look at the display (no 3D):

Free Viewing (No glasses required, but some practice)  Parallel Viewing: Left Image Right Image

Free Viewing (No glasses required, but some practice)  Cross Eye Viewing:  Most likely the simpler method. Right Image Left Image

Learning Cross Eye Viewing  Take a pencil and hold it in the middle of your eyes.  Look at the pencil and slowly change its distance to your eyes  If you found the right distance, you see a third image inbetween left and right images.  This third image is in 3D  Practise, it is worth the effort. Right 2D Image Left 2D Image

3D on YouTube

Michael Bleyer LVA Stereo Vision Computational Stereo

Left 2D ImageRight 2D Image Brain 3D View Replace human eyes with a pair of slightly displaced cameras.

Computational Stereo Left 2D ImageRight 2D Image Brain 3D View Displacement (Stereo Baseline) Replace human eyes with a pair of slightly displaced cameras.

Displacement (Stereo Baseline) Computational Stereo Left 2D ImageRight 2D Image Brain 3D View

Displacement (Stereo Baseline) Computational Stereo Left 2D ImageRight 2D Image Computer 3D View

Displacement (Stereo Baseline) Computational Stereo Left 2D ImageRight 2D Image Computer 3D View

Displacement (Stereo Baseline) Computational Stereo Left 2D ImageRight 2D Image Computer 3D View How can we accomplish a fully automatic 2D to 3D conversion?

What is Disparity?  The amount to which a single pixel is displaced in the two images is called disparity.  A pixel’s disparity is inversely proportional to its depth in the scene.

What is Disparity?  The amount to which a single pixel is displaced in the two images is called disparity.  A pixel’s disparity is inversely proportional to its depth in the scene. Background Disparity (Small)

What is Disparity?  The amount to which a single pixel is displaced in the two images is called disparity.  A pixel’s disparity is inversely proportional to its depth in the scene. Foreground Disparity (Large)

Disparity Encoding  The disparity of each pixel is encoded by a grey value.  High grey values represent high disparities (and low gray values small disparities).  The resulting image is called disparity map.

Disparity and Depth  The disparity map contains sufficient information for generating a 3D model.

Disparity and Depth  The disparity map contains sufficient information for generating a 3D model. The challenging part is to compute the disparity map. This task is known as the stereo matching problem. Stereo matching will be the topic of this lecture!!!

Applications (just a few examples)

3D Reconstruction from aerial images  Stereo cameras are mounted on an airplane to obtain a terrain map.  Images taken from

3D Reconstruction of Cities  City of Dubrovnik reconstructed from images taken from Flickr in a fully automatic way.  [S. Agarwal, N. Snavely, I. Simon, S. Seitz and R. Szeliski “Building Rome in a Day”, ICCV, 2009]

Driver Assistance / Autonomous driving cars  For example, use stereo to measure distance to other cars.  DARPA Grand Challenge  Image taken from

The Mars Rover  Reconstruct the surface of Mars using stereo vision

Human Motion Capture  Fit a 3D model of the human body to the computed point cloud.  [R. Plänkers and P. Fua, “Articulated Soft Objects for Multiview Shape and Motion Capture”, PAMI, 2003]

Bilayer Segmentation – Z-Keying  Goal: Divide image into a foreground and a background region.  Simple background subtraction will fail if there is motion in the background.  Solution: Compute depth map If the depth of a pixel is larger than a predefined threshold, pixels belongs to the foreground  [A. Criminisi, G. Cross, A. Blake and V. Kolmogorov, “Bilayer Segmentation of Live Video”, CVPR, 2006]

Novel View Generation  Given a 3D model of the scene, one can use a virtual camera to record new views from arbitrary viewpoints.  For example: Freeze frame effect from the movie Matrix.  [L. Zitnick, S. Kang, M. Uyttendaele, S. Winder, and R. Szeliski, "High-quality video view interpolation using a layered representation", SIGGRAPH, 2004] Left View (recorded) Right View (recorded) Virtual Interpolated View (not recorded)

MS Kinect

Understanding Human Vision  If we can teach the computer to see in 3D, we can also learn more about the way how the human perceives depth.

Summary  3D Perception Principle of human 3D vision Ways for watching movies in 3D  Computational Stereo Stereo Matching Problem  Applications