Modeling the world with photos

Slides:

Advertisements

Similar presentations

Distinctive Image Features from Scale-Invariant Keypoints

Advertisements

Feature Detection. Description Localization More Points Robust to occlusion Works with less texture More Repeatable Robust detection Precise localization.

Structure from motion.

Assignment 4 Cloning Yourself

The fundamental matrix F

Lecture 11: Two-view geometry

Presented by Xinyu Chang

For Internal Use Only. © CT T IN EM. All rights reserved. 3D Reconstruction Using Aerial Images A Dense Structure from Motion pipeline Ramakrishna Vedantam.

Building Rome in a Day Sameer Agarwal1 Noah Snavely2 Ian Simon1 Steven M. Seitz1 Richard Szeliski3 1University of Washington 2Cornell University 3Microsoft.

TP14 - Local features: detection and description Computer Vision, FCUP, 2014 Miguel Coimbra Slides by Prof. Kristen Grauman.

A Global Linear Method for Camera Pose Registration

Photosynth is an entirely new visual medium developed by Microsoft Live Labs. Photosynth is an entirely new visual medium developed by Microsoft Live.

Discrete-Continuous Optimization for Large-scale Structure from Motion David Crandall, Andrew Owens, Noah Snavely, Dan Huttenlocher Presented by: Rahul.

Structure from motion.

Lecture 23: Structure from motion and multi-view stereo

Lecture 11: Structure from motion, part 2 CS6670: Computer Vision Noah Snavely.

Direct Methods for Visual Scene Reconstruction Paper by Richard Szeliski & Sing Bing Kang Presented by Kristin Branson November 7, 2002.

Structure from motion. Multiple-view geometry questions Scene geometry (structure): Given 2D point matches in two or more images, where are the corresponding.

Automatic Panoramic Image Stitching using Local Features Matthew Brown and David Lowe, University of British Columbia.

Lecture 11: Structure from motion CS6670: Computer Vision Noah Snavely.

Lecture 22: Structure from motion CS6670: Computer Vision Noah Snavely.

Scale Invariant Feature Transform (SIFT)

CS664 Lecture #19: Layers, RANSAC, panoramas, epipolar geometry Some material taken from:  David Lowe, UBC  Jiri Matas, CMP Prague

CSCE 641 Computer Graphics: Image-based Modeling (Cont.) Jinxiang Chai.

Sebastian Thrun CS223B Computer Vision, Winter Stanford CS223B Computer Vision, Winter 2005 Lecture 3 Advanced Features Sebastian Thrun, Stanford.

Global Alignment and Structure from Motion Computer Vision CSE455, Winter 2008 Noah Snavely.

Lecture 6: Feature matching and alignment CS4670: Computer Vision Noah Snavely.

Scale-Invariant Feature Transform (SIFT) Jinxiang Chai.

CSCE 641 Computer Graphics: Image-based Modeling (Cont.) Jinxiang Chai.

Lecture 12: Structure from motion CS6670: Computer Vision Noah Snavely.

3-D Scene u u’u’ Study the mathematical relations between corresponding image points. “Corresponding” means originated from the same 3D point. Objective.

Mosaics CSE 455, Winter 2010 February 8, 2010 Neel Joshi, CSE 455, Winter Announcements  The Midterm went out Friday  See to the class.

Automatic Camera Calibration

Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.

Automatic Registration of Color Images to 3D Geometry Computer Graphics International 2009 Yunzhen Li and Kok-Lim Low School of Computing National University.

Lecture 4: Feature matching CS4670 / 5670: Computer Vision Noah Snavely.

CSCE 643 Computer Vision: Structure from Motion

Lecture 7: Features Part 2 CS4670/5670: Computer Vision Noah Snavely.

IIIT HYDERABAD Image-based walkthroughs from partial and incremental scene reconstructions Kumar Srijan Syed Ahsan Ishtiaque C. V. Jawahar Center for Visual.

Scene Reconstruction Seminar presented by Anton Jigalin Advanced Topics in Computer Vision ( )

Various camera configurations Single point of fixation where optical axes intersect Optical axes parallel (fixation at infinity) General case.

Lecture 8: Feature matching CS6670: Computer Vision Noah Snavely.

COS429 Computer Vision =++ Assignment 4 Cloning Yourself.

CSE 185 Introduction to Computer Vision Feature Matching.

A Tutorial on using SIFT Presented by Jimmy Huff (Slightly modified by Josiah Yoder for Winter )

Local features: detection and description

Announcements No midterm Project 3 will be done in pairs same partners as for project 2.

Lecture 22: Structure from motion CS6670: Computer Vision Noah Snavely.

IIIT HYDERABAD Techniques for Organization and Visualization of Community Photo Collections Kumar Srijan Faculty Advisor : Dr. C.V. Jawahar.

776 Computer Vision Jan-Michael Frahm Spring 2012.

Visual homing using PCA-SIFT

SIFT Scale-Invariant Feature Transform David Lowe

Lecture 07 13/12/2011 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.

Paper – Stephen Se, David Lowe, Jim Little

Scale Invariant Feature Transform (SIFT)

TP12 - Local features: detection and description

RANSAC and mosaic wrap-up

Structure from motion Input: Output: (Tomasi and Kanade)

Features Readings All is Vanity, by C. Allan Gilbert,

Object recognition Prof. Graeme Bailey

Lecture 5: Feature detection and matching

Lecture 23: Structure from motion 2

Structure from motion.

CSE 185 Introduction to Computer Vision

Computational Photography

Calibration and homographies

Structure from motion Input: Output: (Tomasi and Kanade)

Lecture 15: Structure from motion

1 Motivation & System Overview

Presentation transcript:

Modeling the world with photos Authors: Noah Snavely · Steven M. Seitz · Richard Szeliski

Introduction Large database of photos off the internet Navigation through photos

Concepts Feature detection Structure from Motion & Image Based Modeling Image Based Rendering

Feature detection SIFT (Scaled Invariant Feature Transform) 1. Constructing a scale space 2. LoG Approximation 3. Finding keypoints 4. Get rid of bad key points 5. Assigning an orientation to the keypoints 6. Generate SIFT features

Structure from motion Bundle adjustment Refine 3d coordinates and optical characteristics of the camera Minimize reprojection error Nonlinear least-squares algorithms Last step

Image Based Rendering Synthesizing new views Sparse 3d model Goal is to browse through the images

Annotation Assign information Transfer between all the images 2D annotation

Main challenge Match and reconstruct from large database of images uncontrolled

Other systems Other systems needs to have parameters of the camera (GPS, Focal length, geometry of the area) Snavely’s system mostly doesn’t need that. Instead uses computer vision.

Matching SIFT Key descriptors Approximate nearest neighbor (ANN), ratio threshold of 0.6 Organized by tracks Track is a connected set of points across multiple images

Image Connectivity graph Neato toolkit in Graphiz, neato mass spring system.

Mapping out the camera Start off with a single pair of cameras Triangulates, then bundle adjustment Add another camera repeat Pair of cameras need large number of matches

Camera Locations

Image Connectivity graph

Image Connectivity graph

Geo-Registration Determine absolute geocentric coordinates of the camera Optional Estimate ‘gravity’ vector, if correct simply align manually

Absolute coordinates

PHOTOEXPLORER Interface for navigating through photos Viewing through a virtual camera (can overlap images taken) Non-photorealistic rendering

PhotoEXplorer

Object-based navigation Feature matching Find’s best photo based on score. Visibility of points Angle Image resolution

Registering new images Mode for overview scene Drag and drop new image at approximate location SIFT keypoints matched with 20 nearest camera.

Results Notre Dame Mount Rushmore Trafalgar Square Yosemite Trevi Fountain Sphinx St. Basil’s Colosseum Prague Annecy Great Wall

RUNTIME

Code Bundler: Structure from Motion (SfM) for Unordered Image Collections Written by Noah Snavely Builds 3d space Inputs are image set, image features, and image matches Why no image matching included?

THINGS TO IMPROVE Scale Variability Online algorithm Operate on millions of images Variability More capabilities, such as matching ground view to satellite Online algorithm Annotate real time using phone camera

Sources http://phototour.cs.washington.edu/ by Noah Snavely