Recent work in image-based rendering from unstructured image collections and remaining challenges Sudipta N. Sinha Microsoft Research, Redmond, USA.

Slides:



Advertisements
Similar presentations
3D Model Matching with Viewpoint-Invariant Patches(VIP) Reporter :鄒嘉恆 Date : 10/06/2009.
Advertisements

The fundamental matrix F
Registration for Robotics Kurt Konolige Willow Garage Stanford University Patrick Mihelich JD Chen James Bowman Helen Oleynikova Freiburg TORO group: Giorgio.
For Internal Use Only. © CT T IN EM. All rights reserved. 3D Reconstruction Using Aerial Images A Dense Structure from Motion pipeline Ramakrishna Vedantam.
1. 2 An extreme occurrence of the missing data W I D E B A S E L I N E – no point in more than 2 images!
Face Alignment with Part-Based Modeling
MASKS © 2004 Invitation to 3D vision Lecture 7 Step-by-Step Model Buidling.
Multiple View Reconstruction Class 24 Multiple View Geometry Comp Marc Pollefeys.
Parallel Tracking and Mapping for Small AR Workspaces Vision Seminar
A Global Linear Method for Camera Pose Registration
Silhouette Lookup for Automatic Pose Tracking N ICK H OWE.
Discrete-Continuous Optimization for Large-scale Structure from Motion David Crandall, Andrew Owens, Noah Snavely, Dan Huttenlocher Presented by: Rahul.
Dana Cobzas-PhD thesis Image-Based Models with Applications in Robot Navigation Dana Cobzas Supervisor: Hong Zhang.
Reconstructing Building Interiors from Images Yasutaka Furukawa Brian Curless Steven M. Seitz University of Washington, Seattle, USA Richard Szeliski Microsoft.
Reconstructing Building Interiors from Images Yasutaka Furukawa Brian Curless Steven M. Seitz University of Washington, Seattle, USA 2011/01/16 蔡禹婷.
Computing F and rectification class 14 Multiple View Geometry Comp Marc Pollefeys.
Automatic Panoramic Image Stitching using Local Features Matthew Brown and David Lowe, University of British Columbia.
Multi-view stereo Many slides adapted from S. Seitz.
Many slides and illustrations from J. Ponce
Multiple View Geometry Marc Pollefeys University of North Carolina at Chapel Hill Modified by Philippos Mordohai.
Lecture 11: Structure from motion CS6670: Computer Vision Noah Snavely.
Triangulation and Multi-View Geometry Class 9 Read notes Section 3.3, , 5.1 (if interested, read Triggs’s paper on MVG using tensor notation, see.
Image-Based Rendering using Hardware Accelerated Dynamic Textures Keith Yerex Dana Cobzas Martin Jagersand.
Assignment 2 Compute F automatically from image pair (putative matches, 8-point, 7-point, iterative, RANSAC, guided matching) (due by Wednesday 19/03/03)
Multiple View Reconstruction Class 23 Multiple View Geometry Comp Marc Pollefeys.
Linearizing (assuming small (u,v)): Brightness Constancy Equation: The Brightness Constraint Where:),(),(yxJyxII t  Each pixel provides 1 equation in.
CSCE 641 Computer Graphics: Image-based Modeling (Cont.) Jinxiang Chai.
Sebastian Thrun CS223B Computer Vision, Winter Stanford CS223B Computer Vision, Winter 2005 Lecture 3 Advanced Features Sebastian Thrun, Stanford.
Constructing immersive virtual space for HAI with photos Shingo Mori Yoshimasa Ohmoto Toyoaki Nishida Graduate School of Informatics Kyoto University GrC2011.
Global Alignment and Structure from Motion Computer Vision CSE455, Winter 2008 Noah Snavely.
CSCE 641 Computer Graphics: Image-based Modeling (Cont.) Jinxiang Chai.
Lecture 12: Structure from motion CS6670: Computer Vision Noah Snavely.
Accurate, Dense and Robust Multi-View Stereopsis Yasutaka Furukawa and Jean Ponce Presented by Rahul Garg and Ryan Kaminsky.
Registration for Robotics Kurt Konolige Willow Garage Stanford University Patrick Mihelich JD Chen James Bowman Helen Oleynikova Freiburg TORO group: Giorgio.
Final Exam Review CS485/685 Computer Vision Prof. Bebis.
Linearizing (assuming small (u,v)): Brightness Constancy Equation: The Brightness Constraint Where:),(),(yxJyxII t  Each pixel provides 1 equation in.
The Brightness Constraint
Flow Separation for Fast and Robust Stereo Odometry [ICRA 2009]
Dynamic 3D Scene Analysis from a Moving Vehicle Young Ki Baik (CV Lab.) (Wed)
Multi-Output Learning for Camera Relocalization Abner Guzmán-Rivera UIUC Pushmeet Kohli Ben Glocker Jamie Shotton Toby Sharp Andrew Fitzgibbon Shahram.
Example: line fitting. n=2 Model fitting Measure distances.
CSCE 643 Computer Vision: Structure from Motion
© 2005 Martin Bujňák, Martin Bujňák Supervisor : RNDr.
HONGIK UNIVERSITY School of Radio Science & Communication Engineering Visual Information Processing Lab Hong-Ik University School of Radio Science & Communication.
Barcelonnette 10/2013 STAGE INSTRUMENTAIRE MASTER 2 3D SURFACE RECONSTRUCTION WITH MULTIVIEW-STEREO.
Communication Systems Group Technische Universität Berlin S. Knorr A Geometric Segmentation Approach for the 3D Reconstruction of Dynamic Scenes in 2D.
Epitomic Location Recognition A generative approach for location recognition K. Ni, A. Kannan, A. Criminisi and J. Winn In proc. CVPR Anchorage,
IIIT HYDERABAD Image-based walkthroughs from partial and incremental scene reconstructions Kumar Srijan Syed Ahsan Ishtiaque C. V. Jawahar Center for Visual.
Peter Henry1, Michael Krainin1, Evan Herbst1,
3D reconstruction from uncalibrated images
Fast Semi-Direct Monocular Visual Odometry
Visual Odometry David Nister, CVPR 2004
Visual Odometry for Ground Vehicle Applications David Nistér, Oleg Naroditsky, and James Bergen Sarnoff Corporation CN5300 Princeton, New Jersey
Linearizing (assuming small (u,v)): Brightness Constancy Equation: The Brightness Constraint Where:),(),(yxJyxII t  Each pixel provides 1 equation in.
Announcements No midterm Project 3 will be done in pairs same partners as for project 2.
Person Following with a Mobile Robot Using Binocular Feature-Based Tracking Zhichao Chen and Stanley T. Birchfield Dept. of Electrical and Computer Engineering.
Linearizing (assuming small (u,v)): Brightness Constancy Equation: The Brightness Constraint Where:),(),(yxJyxII t  Each pixel provides 1 equation in.
Lecture 22: Structure from motion CS6670: Computer Vision Noah Snavely.
IIIT HYDERABAD Techniques for Organization and Visualization of Community Photo Collections Kumar Srijan Faculty Advisor : Dr. C.V. Jawahar.
Discrete-Continuous Optimization for Large-scale Structure from Motion
The Brightness Constraint
Modeling the world with photos
3D Photography: Epipolar geometry
Structure from motion Input: Output: (Tomasi and Kanade)
The Brightness Constraint
The Brightness Constraint
Combining Geometric- and View-Based Approaches for Articulated Pose Estimation David Demirdjian MIT Computer Science and Artificial Intelligence Laboratory.
Noah Snavely.
Structure from motion Input: Output: (Tomasi and Kanade)
Lecture 15: Structure from motion
Presentation transcript:

Recent work in image-based rendering from unstructured image collections and remaining challenges Sudipta N. Sinha Microsoft Research, Redmond, USA

Image-based maps

Structure from motion (Sfm) Robust depth-map estimation Rendering Key Steps

Structure from motion (Sfm) Robust depth-map estimation Image-based navigation Recent results A multi-stage linear approach to structure from motion Sinha, Steedly & Szeliski, RMLE –ECCV workshop 2010 Piecewise planar stereo for image-based rendering Sinha, Steedly & Szeliski, ICCV 2009 Image-based walkthroughs from incremental and partial scene reconstructions Kumar, Ahsan, Sinha & Jawahar, BMVC 2010

Sequential Sfm Fitzgibbon’98, Pollefeys’98, Nister’01, Schaffalitzky’02, Vergauwen’06, Snavely’06, Snavely’07, Agarwal’09, Gherardi’10

Sequential Sfm Fitzgibbon’98, Pollefeys’98, Nister’01, Schaffalitzky’02, Vergauwen’06, Snavely’06, Snavely’07, Agarwal’09, Gherardi’10 Initial seed pair Pose estimation, triangulation Refinement

Sequential Sfm Fitzgibbon’98, Pollefeys’98, Nister’01, Schaffalitzky’02, Vergauwen’06, Snavely’06, Snavely’07, Agarwal’09, Gherardi’10 Initial seed pair Pose estimation, triangulation Refinement

Contributions Vanishing point (VP) constraints reduces drift in rotations – more accurate than [Govindu’04, Martinec’07] for urban scenes. – Faster pairwise matching + geometric verification New practical linear structure and translation estimation – more stable than the known linear method [Rother’03] – robust to outliers in 2D observations – easy to parallelize – faster than sequential Sfm – much faster than L ∞ - methods Linear multi-stage approach to structure from motion Sinha et. al (ECCV-RMLE workshop)

Vanishing Point (VP) Detection Pair Matching 2 – VP + 2 point RANSAC VP tracks relative rotations Feature Extraction VPs interest pts Images Global Rotation Estimation Linear Reconstruction 2-view Reconstruction Robust Alignment Global Scale & Translation Estimation VP tracks relative rotations global camera orientations relative pose estimates Full Sfm initialization Final Bundle Adjustment Linear multi-stage approach to structure from motion Sinha et. al (ECCV-RMLE workshop)

Results

Timings

Break-up of Timings

Comparison with sequential Sfm

STREET sequence HALLWAY sequence OURS (65 cams, 52K pts) before Bundle Adjustment BUNDLER (65 cams, 22K pts) BUNDLER (139 cams, 13K pts) OURS (184 cams, 27K pts) Comparison with sequential Sfm

Piecewise Planar Stereo for image-based rendering Feature matching Graph-cut based energy minimization Sinha et. al. ICCV 2009

Piecewise Planar Stereo for image-based rendering Sinha et. al. ICCV 2009 Planar Stereo Results

also handle non-planar scenes now... Piecewise Planar Stereo for image-based rendering

Skip global scene reconstruction (Sfm) step, Generate several overlapping, partial reconstructions instead. During navigation, jump between local coordinate frames. Scales easily, also parallelizable Incremental matching & reconstruction (images appear over time) Image-based walkthroughs from incremental and partial scene reconstructions Kumar et. al. BMVC 2010 Fort sequence (~5800 images)

Accuracy vs. Connectedness Reliable results from sparse, unstructured imagery – wide-baseline matching is still difficult Representations: – metric vs. topological reconstructions ? hybrid ? Reconstructing Indoors – Bottlenecks: doorways, corridors. – fewer features, non-Lambertian surfaces Existing issues in unstructured Sfm

Acquisition – Images vs. video – Short-term dynamics vs. long-term dynamics Need truly incremental Sfm – Start with scratch but keep going … ? – Interleaving matching, Sfm and dense stereo – Hybrid matching (2D—2D, 2D – 3D, 3D – 3D) Dynamic Image-based Maps: Challenges

Temporal appearance changes – Illumination: day/night, seasons, weather, lights on/off Cyclic, predictable – Albedo changes Store-fronts, ads-billboards, irreversible Geometric changes: – temporary vs. permanent Mid-level features for higher level recognition Dynamic Image-based Maps: Challenges

Questions ?