MultiView Stereo Steve Seitz CSE590SS: Vision for Graphics.

Slides:



Advertisements
Similar presentations
Multi-view Stereo via Volumetric Graph-cuts George Vogiatzis, Philip H. S. Torr Roberto Cipolla.
Advertisements

Multi-view Stereo via Volumetric Graph-cuts
The fundamental matrix F
CSE473/573 – Stereo and Multiple View Geometry
Stereo matching Class 7 Read Chapter 7 of tutorial Tsukuba dataset.
Multiview Reconstruction. Why More Than 2 Views? BaselineBaseline – Too short – low accuracy – Too long – matching becomes hard.
Some books on linear algebra Linear Algebra, Serge Lang, 2004 Finite Dimensional Vector Spaces, Paul R. Halmos, 1947 Matrix Computation, Gene H. Golub,
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image Where does the.
Stereo.
Multiview Reconstruction. Why More Than 2 Views? BaselineBaseline – Too short – low accuracy – Too long – matching becomes hard.
Silhouettes in Multiview Stereo Ian Simon. Multiview Stereo Problem Input: – a collection of images of a rigid object (or scene) – camera parameters for.
Structure from motion.
Last Time Pinhole camera model, projection
Project 3 extension: Wednesday at noon Final project proposal extension: Friday at noon >consult with Steve, Rick, and/or Ian now! Project 2 artifact winners...
Lecture 12: Structure from motion CS6670: Computer Vision Noah Snavely.
Computer Vision CSE576, Spring 2005 Richard Szeliski
Lecture 23: Structure from motion and multi-view stereo
Computational Photography: Image-based Modeling Jinxiang Chai.
CSCE 641 Computer Graphics: Image-based Modeling Jinxiang Chai.
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2005 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
Project 2 winners Think about Project 3 Guest lecture on Monday: Aseem Announcements.
Direct Methods for Visual Scene Reconstruction Paper by Richard Szeliski & Sing Bing Kang Presented by Kristin Branson November 7, 2002.
Structure from motion. Multiple-view geometry questions Scene geometry (structure): Given 2D point matches in two or more images, where are the corresponding.
Multiview stereo. Volumetric stereo Scene Volume V Input Images (Calibrated) Goal: Determine occupancy, “color” of points in V.
Multi-view stereo Many slides adapted from S. Seitz.
The plan for today Camera matrix
Stereo and Structure from Motion
Project 1 grades out Announcements. Multiview stereo Readings S. M. Seitz and C. R. Dyer, Photorealistic Scene Reconstruction by Voxel Coloring, International.
Multiview Reconstruction. Why More Than 2 Views? BaselineBaseline – Too short – low accuracy – Too long – matching becomes hard.
May 2004Stereo1 Introduction to Computer Vision CS / ECE 181B Tuesday, May 11, 2004  Multiple view geometry and stereo  Handout #6 available (check with.
Announcements Project Update Extension: due Friday, April 20 Create web page with description, results Present your project in class (~10min/each) on Friday,
Stereo Matching Computer Vision CSE576, Spring 2005 Richard Szeliski.
Announcements PS3 Due Thursday PS4 Available today, due 4/17. Quiz 2 4/24.
From Images to Voxels Steve Seitz Carnegie Mellon University University of Washington SIGGRAPH 2000 Course on 3D Photography.
Lecture 25: Multi-view stereo, continued
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2006 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
Accurate, Dense and Robust Multi-View Stereopsis Yasutaka Furukawa and Jean Ponce Presented by Rahul Garg and Ryan Kaminsky.
Manhattan-world Stereo Y. Furukawa, B. Curless, S. M. Seitz, and R. Szeliski 2009 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.
Review: Binocular stereo If necessary, rectify the two stereo images to transform epipolar lines into scanlines For each pixel x in the first image Find.
3-D Scene u u’u’ Study the mathematical relations between corresponding image points. “Corresponding” means originated from the same 3D point. Objective.
CSCE 641 Computer Graphics: Image-based Modeling Jinxiang Chai.
Computer Vision Spring ,-685 Instructor: S. Narasimhan WH 5409 T-R 10:30am – 11:50am Lecture #15.
Camera Calibration & Stereo Reconstruction Jinxiang Chai.
Reconstructing Relief Surfaces George Vogiatzis, Philip Torr, Steven Seitz and Roberto Cipolla BMVC 2004.
Some books on linear algebra Linear Algebra, Serge Lang, 2004 Finite Dimensional Vector Spaces, Paul R. Halmos, 1947 Matrix Computation, Gene H. Golub,
Structure from images. Calibration Review: Pinhole Camera.
Graph Cut Algorithms for Binocular Stereo with Occlusions
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
Stereo Many slides adapted from Steve Seitz.
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image image 1image.
Lec 22: Stereo CS4670 / 5670: Computer Vision Kavita Bala.
Computer Vision No. 7: Stereo part 2. Today’s topics u Projective geometry u Camera Projection Matrix u F Matrix u Multi View Shape Reconstruction.
Computer Vision Lecture #10 Hossam Abdelmunim 1 & Aly A. Farag 2 1 Computer & Systems Engineering Department, Ain Shams University, Cairo, Egypt 2 Electerical.
Bahadir K. Gunturk1 Phase Correlation Bahadir K. Gunturk2 Phase Correlation Take cross correlation Take inverse Fourier transform  Location of the impulse.
stereo Outline : Remind class of 3d geometry Introduction
875 Dynamic Scene Reconstruction
776 Computer Vision Jan-Michael Frahm & Enrique Dunn Spring 2013.
Paper presentation topics 2. More on feature detection and descriptors 3. Shape and Matching 4. Indexing and Retrieval 5. More on 3D reconstruction 1.
The University of Ontario From photohulls to photoflux optimization Yuri Boykov University of Western Ontario Victor Lempitsky Moscow State University.
Project 2 due today Project 3 out today Announcements TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAAA.
CSCE 641 Computer Graphics: Image-based Modeling Jinxiang Chai.
Photoconsistency constraint C2 q C1 p l = 2 l = 3 Depth labels If this 3D point is visible in both cameras, pixels p and q should have similar intensities.
CSE 185 Introduction to Computer Vision Stereo 2.
Robot Vision SS 2011 Matthias Rüther / Matthias Straka 1 ROBOT VISION Lesson 7: Volumetric Object Reconstruction Matthias Straka.
Lecture 11 Homogeneous coordinates. Stereo.
CS4670 / 5670: Computer Vision Kavita Bala Lec 27: Stereo.
Announcements Midterm due now Project 2 artifacts: vote today!
MSR Image based Reality Project
Computer Vision CSE576, Spring 2005 Richard Szeliski
Announcements Project 3 out today (help session at end of class)
Presentation transcript:

MultiView Stereo Steve Seitz CSE590SS: Vision for Graphics

Stereo Reconstruction Steps Calibrate cameras Rectify images Compute disparity Estimate depth f uu’ baseline z CC’ X f

Choosing the Baseline What’s the optimal baseline? Too small: large depth error Too large: difficult search problem Large Baseline Small Baseline

The Effect of Baseline on Depth Estimation

Multibaseline Stereo Basic Approach Choose a reference view Use your favorite stereo algorithm BUT >replace two-view SSD with SSD over all baselines Limitations Must choose a reference view (bad) Visibility!

Video

Epipolar-Plane Images [Bolles 87] Lesson: Beware of occlusions

Volumetric Stereo Scene Volume V Input Images (Calibrated) Goal: Determine transparency, radiance of points in V

Discrete Formulation: Voxel Coloring Discretized Scene Volume Input Images (Calibrated) Goal: Assign RGBA values to voxels in V photo-consistent with images

Complexity and Computability Discretized Scene Volume N voxels C colors 3 All Scenes ( C N 3 ) Photo-Consistent Scenes True Scene

Issues Theoretical Questions Identify class of all photo-consistent scenes Practical Questions How do we compute photo-consistent models?

1. C=2 (silhouettes) Volume intersection [Martin 81, Szeliski 93] 2. C unconstrained, viewpoint constraints Voxel coloring algorithm [Seitz & Dyer 97] 3. General Case Space carving [Kutulakos & Seitz 98] Voxel Coloring Solutions

Reconstruction from Silhouettes (C = 2) Binary Images Approach: Backproject each silhouette Intersect backprojected volumes

Volume Intersection Reconstruction Contains the True Scene But is generally not the same In the limit get visual hull >Complement of all lines that don’t intersect S

Voxel Algorithm for Volume Intersection Color voxel black if on silhouette in every image O(MN 3 ), for M images, N 3 voxels Don’t have to search 2 N 3 possible scenes!

Properties of Volume Intersection Pros Easy to implement, fast Accelerated via octrees [Szeliski 1993] Cons No concavities Reconstruction is not photo-consistent Requires identification of silhouettes

1. C=2 (silhouettes) Volume intersection [Martin 81, Szeliski 93] 2. C unconstrained, viewpoint constraints Voxel coloring algorithm [Seitz & Dyer 97] 3. General Case Space carving [Kutulakos & Seitz 98] Voxel Coloring Solutions

1. Choose voxel 2. Project and correlate 3.Color if consistent (standard deviation of pixel colors below threshold) Voxel Coloring Approach Visibility Problem: in which images is each voxel visible?

The Global Visibility Problem Inverse Visibility known images Unknown Scene Which points are visible in which images? Known Scene Forward Visibility known scene

Layers Depth Ordering: visit occluders first! SceneTraversal Condition: depth order is view-independent

For example: f = distance from separating plane f What is A View-Independent Depth Order? A function f over a scene S and a camera volume C C S p q v Such that for all p and q in S, v in C p occludes q from v only if f(p) < f(q)

Panoramic Depth Ordering Cameras oriented in many different directions Planar depth ordering does not apply

Panoramic Depth Ordering Layers radiate outwards from cameras

Panoramic Layering Layers radiate outwards from cameras

Panoramic Layering Layers radiate outwards from cameras

Compatible Camera Configurations Depth-Order Constraint Scene outside convex hull of camera centers Outward-Looking cameras inside scene Inward-Looking cameras above scene

Calibrated Image Acquisition Calibrated Turntable 360° rotation (21 images) Selected Dinosaur Images Selected Flower Images

Voxel Coloring Results (Video) Dinosaur Reconstruction 72 K voxels colored 7.6 M voxels tested 7 min. to compute on a 250MHz SGI Flower Reconstruction 70 K voxels colored 7.6 M voxels tested 7 min. to compute on a 250MHz SGI

Limitations of Depth Ordering A view-independent depth order may not exist pq Need more powerful general-case algorithms Unconstrained camera positions Unconstrained scene geometry/topology

1. C=2 (silhouettes) Volume intersection [Martin 81, Szeliski 93] 2. C unconstrained, viewpoint constraints Voxel coloring algorithm [Seitz & Dyer 97] 3. General Case Space carving [Kutulakos & Seitz 98] Voxel Coloring Solutions

Space Carving Algorithm Image 1 Image N …... Initialize to a volume V containing the true scene Repeat until convergence Choose a voxel on the current surface Carve if not photo-consistent Project to visible input images

Convergence Consistency Property The resulting shape is photo-consistent >all inconsistent points are removed Convergence Property Carving converges to a non-empty shape >a point on the true scene is never removed p

What is Computable? The Photo Hull is the UNION of all photo-consistent scenes in V It is a photo-consistent scene reconstruction Tightest possible bound on the true scene True Scene V Photo Hull V

Space Carving Algorithm The Basic Algorithm is Unwieldy Complex update procedure Alternative: Multi-Pass Plane Sweep Efficient, can use texture-mapping hardware Converges quickly in practice Easy to implement Results Algorithm

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence True SceneReconstruction

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence

Multi-Pass Plane Sweep Sweep plane in each of 6 principle directions Consider cameras on only one side of plane Repeat until convergence

Space Carving Results: African Violet Input Image (1 of 45)Reconstruction

Space Carving Results: Hand Input Image (1 of 100) Views of Reconstruction

House Walkthrough 24 rendered input views from inside and outside

Space Carving Results: House Input Image (true scene) Reconstruction 370,000 voxels

Space Carving Results: House Input Image (true scene) Reconstruction 370,000 voxels

New View (true scene) Space Carving Results: House Reconstruction New View (true scene) Reconstruction Reconstruction (with new input view)

Other Features Coarse-to-fine Reconstruction Represent scene as octree Reconstruct low-res model first, then refine Hardware-Acceleration Use texture-mapping to compute voxel projections Process voxels an entire plane at a time Limitations Need to acquire calibrated images Restriction to simple radiance models Bias toward maximal (fat) reconstructions Transparency not supported

Other Approaches Level-Set Methods [Faugeras & Keriven 1998] Evolve implicit function by solving PDE’s Transparency and Matting [Szeliski & Golland 1998] Compute voxels with alpha-channel Max Flow/Min Cut [Roy & Cox 1998] Graph theoretic formulation Mesh-Based Stereo [Fua & Leclerc 95] Mesh-based but similar consistency formulation Virtualized Reality [Narayan, Rander, Kanade 1998] Perform stereo 3 images at a time, merge results

Level Set Stereo Pose Stereo as Energy Minimization First idea: find best surface S(u,v) to match images This is a variational minimization problem >solved by deforming surface infinitesimally >deformation given by Euler-Lagrange equations Problem—how to handle case where object is not a single surface? Can use level-set formulation >represent the object as a function f(x,y,z) whose zero- set is the object’s surface >evolve f instead of S

Volume Intersection Martin & Aggarwal, “Volumetric description of objects from multiple views”, Trans. Pattern Analysis and Machine Intelligence, 5(2), 1991, pp Szeliski, “Rapid Octree Construction from Image Sequences”, Computer Vision, Graphics, and Image Processing: Image Understanding, 58(1), 1993, pp Voxel Coloring and Space Carving Seitz & Dyer, “Photorealistic Scene Reconstruction by Voxel Coloring”, Proc. Computer Vision and Pattern Recognition (CVPR), 1997, pp Seitz & Kutulakos, “Plenoptic Image Editing”, Proc. Int. Conf. on Computer Vision (ICCV), 1998, pp Kutulakos & Seitz, “A Theory of Shape by Space Carving”, Proc. ICCV, 1998, pp Bibliography

Related References Bolles, Baker, and Marimont, “Epipolar-Plane Image Analysis: An Approach to Determining Structure from Motion”, International Journal of Computer Vision, vol 1, no 1, 1987, pp Faugeras & Keriven, “Variational principles, surface evolution, PDE's, level set methods and the stereo problem", IEEE Trans. on Image Processing, 7(3), 1998, pp Szeliski & Golland, “Stereo Matching with Transparency and Matting”, Proc. Int. Conf. on Computer Vision (ICCV), 1998, Roy & Cox, “A Maximum-Flow Formulation of the N-camera Stereo Correspondence Problem”, Proc. ICCV, 1998, pp Fua & Leclerc, “Object-centered surface reconstruction: Combining multi-image stereo and shading", International Journal of Computer Vision, 16, 1995, pp Narayanan, Rander, & Kanade, “Constructing Virtual Worlds Using Dense Stereo”, Proc. ICCV, 1998, pp Bibliography