Registration for Robotics Kurt Konolige Willow Garage Stanford University Patrick Mihelich JD Chen James Bowman Helen Oleynikova Freiburg TORO group: Giorgio.

Slides:



Advertisements
Similar presentations
Presented by Xinyu Chang
Advertisements

For Internal Use Only. © CT T IN EM. All rights reserved. 3D Reconstruction Using Aerial Images A Dense Structure from Motion pipeline Ramakrishna Vedantam.
Summary of Friday A homography transforms one 3d plane to another 3d plane, under perspective projections. Those planes can be camera imaging planes or.
3D Object Recognition Pipeline Kurt Konolige, Radu Rusu, Victor Eruhmov, Suat Gedikli Willow Garage Stefan Holzer, Stefan Hinterstoisser TUM Morgan Quigley,
MASKS © 2004 Invitation to 3D vision Lecture 7 Step-by-Step Model Buidling.
Recent work in image-based rendering from unstructured image collections and remaining challenges Sudipta N. Sinha Microsoft Research, Redmond, USA.
Patch to the Future: Unsupervised Visual Prediction
Object Recognition using Invariant Local Features Applications l Mobile robots, driver assistance l Cell phone location or object recognition l Panoramas,
Low Complexity Keypoint Recognition and Pose Estimation Vincent Lepetit.
Nikolas Engelhard 1, Felix Endres 1, Jürgen Hess 1, Jürgen Sturm 2, Wolfram Burgard 1 1 University of Freiburg, Germany 2 Technical University Munich,
Probabilistic Robotics
Parallel Tracking and Mapping for Small AR Workspaces Vision Seminar
Computer vision for robotics
CS4670 / 5670: Computer Vision Bag-of-words models Noah Snavely Object
Localization in indoor environments by querying omnidirectional visual maps using perspective images Miguel Lourenco, V. Pedro and João P. Barreto ICRA.
Special Topic on Image Retrieval Local Feature Matching Verification.
Image alignment Image from
Object Recognition with Invariant Features n Definition: Identify objects or scenes and determine their pose and model parameters n Applications l Industrial.
Robust and large-scale alignment Image from
Object retrieval with large vocabularies and fast spatial matching
Adam Rachmielowski 615 Project: Real-time monocular vision-based SLAM.
Visual Odometry Michael Adams CS 223B Problem: Measure trajectory of a mobile platform using visual data Mobile Platform (Car) Calibrated Camera.
Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2005 with a lot of slides stolen from Steve Seitz and.
Visual Odometry for Ground Vehicle Applications David Nister, Oleg Naroditsky, James Bergen Sarnoff Corporation, CN5300 Princeton, NJ CPSC 643, Presentation.
Building Recognition Landry Huet Sung Hee Park DW Wheeler.
Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2006 with a lot of slides stolen from Steve Seitz and.
Sebastian Thrun CS223B Computer Vision, Winter Stanford CS223B Computer Vision, Winter 2005 Lecture 3 Advanced Features Sebastian Thrun, Stanford.
Lecture 12: Structure from motion CS6670: Computer Vision Noah Snavely.
Visual Odometry Chris Moore Mark Huetsch Firouzeh Jalilian.
Accurate, Dense and Robust Multi-View Stereopsis Yasutaka Furukawa and Jean Ponce Presented by Rahul Garg and Ryan Kaminsky.
Registration for Robotics Kurt Konolige Willow Garage Stanford University Patrick Mihelich JD Chen James Bowman Helen Oleynikova Freiburg TORO group: Giorgio.
SLAM (Simultaneously Localization and Mapping)
Object Tracking/Recognition using Invariant Local Features Applications l Mobile robots, driver assistance l Cell phone location or object recognition.
3D SLAM for Omni-directional Camera
Flow Separation for Fast and Robust Stereo Odometry [ICRA 2009]
Dynamic 3D Scene Analysis from a Moving Vehicle Young Ki Baik (CV Lab.) (Wed)
Multi-Output Learning for Camera Relocalization Abner Guzmán-Rivera UIUC Pushmeet Kohli Ben Glocker Jamie Shotton Toby Sharp Andrew Fitzgibbon Shahram.
Example: line fitting. n=2 Model fitting Measure distances.
CSCE 643 Computer Vision: Structure from Motion
Young Ki Baik, Computer Vision Lab.
Fast Similarity Search for Learned Metrics Prateek Jain, Brian Kulis, and Kristen Grauman Department of Computer Sciences University of Texas at Austin.
Video Google: A Text Retrieval Approach to Object Matching in Videos Josef Sivic and Andrew Zisserman.
Visual SLAM Visual SLAM SPL Seminar (Fri) Young Ki Baik Computer Vision Lab.
Peter Henry1, Michael Krainin1, Evan Herbst1,
Bundle Adjustment A Modern Synthesis Bill Triggs, Philip McLauchlan, Richard Hartley and Andrew Fitzgibbon Presentation by Marios Xanthidis 5 th of No.
CSE 185 Introduction to Computer Vision Feature Matching.
Visual Odometry David Nister, CVPR 2004
Lecture 9 Feature Extraction and Motion Estimation Slides by: Michael Black Clark F. Olson Jean Ponce.
Visual Odometry for Ground Vehicle Applications David Nistér, Oleg Naroditsky, and James Bergen Sarnoff Corporation CN5300 Princeton, New Jersey
776 Computer Vision Jan-Michael Frahm Spring 2012.
Lecture 22: Structure from motion CS6670: Computer Vision Noah Snavely.
Invariant Local Features Image content is transformed into local feature coordinates that are invariant to translation, rotation, scale, and other imaging.
IIIT HYDERABAD Techniques for Organization and Visualization of Community Photo Collections Kumar Srijan Faculty Advisor : Dr. C.V. Jawahar.
776 Computer Vision Jan-Michael Frahm Spring 2012.
CSE-473 Mobile Robot Mapping.
Paper – Stephen Se, David Lowe, Jim Little
Nearest-neighbor matching to feature database
+ SLAM with SIFT Se, Lowe, and Little Presented by Matt Loper
RANSAC and mosaic wrap-up
Modeling the world with photos
Features Readings All is Vanity, by C. Allan Gilbert,
Nearest-neighbor matching to feature database
Feature Matching and RANSAC
Noah Snavely.
SIFT.
CSE 185 Introduction to Computer Vision
Computational Photography
Lecture 15: Structure from motion
INDOOR DENSE DEPTH MAP AT DRONE HOVERING
Robust Feature Matching and Fast GMS Solution
Presentation transcript:

Registration for Robotics Kurt Konolige Willow Garage Stanford University Patrick Mihelich JD Chen James Bowman Helen Oleynikova Freiburg TORO group: Giorgio Grisetti Cyrill Stachness Rainer Kummerle

Outline Applications Overview of Registration Front end and image matching Visual Odometry Place recognition Global SBA Extensions

Applications of Registration Photo tourism [Snavely, Seitz, Szeliski 2006]

Applications of Registration Object Modeling [Lovi et al. 2010] [Newcombe and Davison 2010]

Applications of Registration Face Tracking [WATSON: Morency 2003]

Applications of Registration Visual Odometry [Willow Garage]

Applications of Registration Visual SLAM [Willow Garage]

Outline Applications Overview of Registration Front end, image matching Visual Odometry Place recognition Global SBA Extensions

Registration Elements Incremental Extract keypoints Compute features Match against local views Estimate pose Optimize Global Add new view to map Match against global views [Place recognition] Estimate pose Optimize

Mapping Systems MonoSLAM [Davison 2003] Parallel Tracking and Mapping [Klein and Murray 2007] View-based Maps [Konolige et al. 2009] Map type World points Views and world points Views [and implicit world points]

Map Representations Map type World points Views and world points Views [and implicit world points] Covariance Matrix over points p0 p1 p2 … Extended Kalman Filter update

MonoSLAM

Map Representations Map type World points Views and world points Views [and implicit world points] Information matrix over views and world points Nonlinear least squares update H  x = -g c0 c1 c2 … p0 p1 p2 … c0 c1 c2 … p0 p1 p2 …

Parallel Tracking and Mapping

Map Representations Map type World points Views and world points Views [and implicit world points] Information matrix over views c0 c1 c2 … Nonlinear least squares update H  x = -g

View-based Maps

Comparison of Mapping Methods Map type World points Useful in small spaces - World points for global matching - World point Covar matrix update Not as accurate [Strasdat et al. 2010] Views and world points Useful in small spaces - World points for global matching - View and point Info matrix update Highly accurate Views [and implicit world points] Useful in larger spaces - View matching - View Info matrix update Not as accurate

Outline Applications Overview of Registration Front end and image matching Visual Odometry Place recognition Global SBA Extensions

Interest points FAST [Rosten and Drummond 2006] Harris SIFT [Lowe 1999] SUSAN, CenSure, MSER, … Adaptive threshold Gridding for spatial diversity

Feature matching SIFT / SURF features Planar patches Random-tree signatures [Calonder et al. 2009] … Tracking with motion model Windowed brute-force KD tree nearest-neighbor 930 features354 matched183 inliers

Outlier detection and pose estimation RANSAC to estimate pose –3 points for stereo (3D-3D) –3-point PnP for monocular (2D-3D) Outlier rejection Refinement –Least-squares –Reprojection error Scavenging 930 features354 matched183 inliers

Outline Applications Overview of Registration Front end and image matching Visual Odometry Place recognition Global SBA Extensions

Visual Odometry – Sliding Window 2-view pose estimates are unreliable –Triangulation is narrow-baseline Multi-view estimates are more accurate –Triangulation is wide-baseline –More points, wider baseline => more accuracy Keyframes –Many close-together views don’t add much –Too far apart leads to low inlier count Sliding window of views –Track points as long as possible –~20 – 40 keyframes in window

Visual Odometry – Bundle Adjustment Measurement model: For Gaussian distributions ( is covariance) [Gauss-Newton / Normal] Sparseness of H H 20 camera views 5000 features H is (20x x3) 2 c0 c1 c2 … p0 p1 p2 … c0 c1 c2 … p0 p1 p2 …

Visual Odometry examples [courtesy Andrew Comport, INRIA] Outdoor sequence in Versailles 1 m stereo baseline, narrow FOV ~400 m sequence Average frame distance: 0.6 m Max frame distance: 1.1 m

26 Visual Odometry examples [courtesy Andrew Comport, INRIA] Indoor Willow Garage sequence 10 cm stereo, wide FOV ~100 m sequence Average frame distance: 0.3 m

Outline Applications Overview of Registration Front end and image matching Visual Odometry Place recognition Global SBA Extensions

Place Recognition K-nearest neighbor feature matching across database of images –KD tree [Lowe 1999, Eade and Drummond 2008, Williams et al. 2007] Bag-of-words –visual vocabulary [Stewenius and Nister 2006, Cummins and Newman 2008] DB test DB test

Place Recognition: Vocabulary Trees [Nister and Stewenius CVPR06] - “Bag of words” retrieval - Vocab tree created offline - For recognition: - Image keypoints extracted - Tree encodes approximate NN search - Inverted index of images at leaves [Image from Nister and Stewenius CVPR06] [Cummins and Newman ICRA07 Cullmer et al. ACRA08 Fraundorfer et al. IROS07]

Place Recognition: Vocabulary Trees Performance on Indoor dataset

View-based Maps [Konolige et al. 2009] Performance on Indoor dataset

View-based Maps [Konolige et al. 2009] Performance on Indoor dataset

Trajectory synthesis

Outline Applications Overview of Registration Front end and image matching Visual Odometry Place recognition Global SBA Extensions

Comparison of Mapping Methods Map type World points Useful in small spaces - World points for global matching - World point Covar matrix update Not as accurate [xxx and Davison 2010] Views and world points Useful in small spaces - World points for global matching - View and point Info matrix update Highly accurate Views [and implicit world points] Useful in larger spaces - View matching - View Info matrix update Not as accurate

Fast SBA [Konolige BMVC 2010, Jeong et al. CVPR 2010] Take advantage of sparse secondary structure of Hessian Use fast linear solvers –Davis’ CHOLMOD –Block preconditioned conjugate gradient

Fast SBA in VSLAM

Fast SBA in Reconstruction

Open-Source in ROS sba package for fast SBA frame_common and posest for front-end and pose estimation vocabulary_tree for place recognition vslam_system for Visual Odometry and VSLAM