SLAM (Simultaneously Localization and Mapping)

Slides:



Advertisements
Similar presentations
Mobile Robot Localization and Mapping using the Kalman Filter
Advertisements

Feature Based Image Mosaicing
The fundamental matrix F
Registration for Robotics Kurt Konolige Willow Garage Stanford University Patrick Mihelich JD Chen James Bowman Helen Oleynikova Freiburg TORO group: Giorgio.
For Internal Use Only. © CT T IN EM. All rights reserved. 3D Reconstruction Using Aerial Images A Dense Structure from Motion pipeline Ramakrishna Vedantam.
MASKS © 2004 Invitation to 3D vision Lecture 7 Step-by-Step Model Buidling.
Low Complexity Keypoint Recognition and Pose Estimation Vincent Lepetit.
Parallel Tracking and Mapping for Small AR Workspaces Vision Seminar
Silvina Rybnikov Supervisors: Prof. Ilan Shimshoni and Prof. Ehud Rivlin HomePage:
Chapter 6 Feature-based alignment Advanced Computer Vision.
Cambridge, Massachusetts Pose Estimation in Heavy Clutter using a Multi-Flash Camera Ming-Yu Liu, Oncel Tuzel, Ashok Veeraraghavan, Rama Chellappa, Amit.
3D M otion D etermination U sing µ IMU A nd V isual T racking 14 May 2010 Centre for Micro and Nano Systems The Chinese University of Hong Kong Supervised.
Daniel Shepard and Todd Humphreys
Two-view geometry.
Structure from motion.
Accurate Non-Iterative O( n ) Solution to the P n P Problem CVLab - Ecole Polytechnique Fédérale de Lausanne Francesc Moreno-Noguer Vincent Lepetit Pascal.
Adam Rachmielowski 615 Project: Real-time monocular vision-based SLAM.
Introduction to Kalman Filter and SLAM Ting-Wei Hsu 08/10/30.
Active Simultaneous Localization and Mapping Stephen Tully, : Robotic Motion Planning This project is to actively control the.
Epipolar geometry. (i)Correspondence geometry: Given an image point x in the first view, how does this constrain the position of the corresponding point.
Structure from motion. Multiple-view geometry questions Scene geometry (structure): Given 2D point matches in two or more images, where are the corresponding.
Lecture 11: Structure from motion CS6670: Computer Vision Noah Snavely.
Visual Odometry for Ground Vehicle Applications David Nister, Oleg Naroditsky, James Bergen Sarnoff Corporation, CN5300 Princeton, NJ CPSC 643, Presentation.
Multiple View Geometry Marc Pollefeys University of North Carolina at Chapel Hill Modified by Philippos Mordohai.
Multiple View Reconstruction Class 23 Multiple View Geometry Comp Marc Pollefeys.
CSCE 641 Computer Graphics: Image-based Modeling (Cont.) Jinxiang Chai.
Registration for Augmented Reality Neil Birkbeck 3/27/2006.
Global Alignment and Structure from Motion Computer Vision CSE455, Winter 2008 Noah Snavely.
CSCE 641 Computer Graphics: Image-based Modeling (Cont.) Jinxiang Chai.
Lecture 12: Structure from motion CS6670: Computer Vision Noah Snavely.
3-D Scene u u’u’ Study the mathematical relations between corresponding image points. “Corresponding” means originated from the same 3D point. Objective.
Registration for Robotics Kurt Konolige Willow Garage Stanford University Patrick Mihelich JD Chen James Bowman Helen Oleynikova Freiburg TORO group: Giorgio.
The 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems October 11-15, 2009 St. Louis, USA.
Multi-view geometry. Multi-view geometry problems Structure: Given projections of the same 3D point in two or more images, compute the 3D coordinates.
Automatic Camera Calibration
Computer vision: models, learning and inference
Kalman filter and SLAM problem
Final Exam Review CS485/685 Computer Vision Prof. Bebis.
KinectFusion : Real-Time Dense Surface Mapping and Tracking IEEE International Symposium on Mixed and Augmented Reality 2011 Science and Technology Proceedings.
3D SLAM for Omni-directional Camera
Flow Separation for Fast and Robust Stereo Odometry [ICRA 2009]
(Wed) Young Ki Baik Computer Vision Lab.
CSCE 643 Computer Vision: Structure from Motion
Young Ki Baik, Computer Vision Lab.
Multiview Geometry and Stereopsis. Inputs: two images of a scene (taken from 2 viewpoints). Output: Depth map. Inputs: multiple images of a scene. Output:
1 Formation et Analyse d’Images Session 7 Daniela Hall 25 November 2004.
Visual SLAM Visual SLAM SPL Seminar (Fri) Young Ki Baik Computer Vision Lab.
HONGIK UNIVERSITY School of Radio Science & Communication Engineering Visual Information Processing Lab Hong-Ik University School of Radio Science & Communication.
Computer Vision Lecture #10 Hossam Abdelmunim 1 & Aly A. Farag 2 1 Computer & Systems Engineering Department, Ain Shams University, Cairo, Egypt 2 Electerical.
Source: Computer Vision and Pattern Recognition Workshops (CVPRW), 2010 IEEE Computer Society Conference on Author: Paucher, R.; Turk, M.; Adviser: Chia-Nian.
Real-Time Simultaneous Localization and Mapping with a Single Camera (Mono SLAM) Young Ki Baik Computer Vision Lab. Seoul National University.
Raquel A. Romano 1 Scientific Computing Seminar May 12, 2004 Projective Geometry for Computer Vision Projective Geometry for Computer Vision Raquel A.
Vision and SLAM Ingeniería de Sistemas Integrados Departamento de Tecnología Electrónica Universidad de Málaga (Spain) Acción Integrada –’Visual-based.
Two-view geometry. Epipolar Plane – plane containing baseline (1D family) Epipoles = intersections of baseline with image planes = projections of the.
Feature Matching. Feature Space Outlier Rejection.
3D reconstruction from uncalibrated images
Bundle Adjustment A Modern Synthesis Bill Triggs, Philip McLauchlan, Richard Hartley and Andrew Fitzgibbon Presentation by Marios Xanthidis 5 th of No.
776 Computer Vision Jan-Michael Frahm & Enrique Dunn Spring 2013.
Visual Odometry David Nister, CVPR 2004
Vision-based SLAM Enhanced by Particle Swarm Optimization on the Euclidean Group Vision seminar : Dec Young Ki BAIK Computer Vision Lab.
Lecture 9 Feature Extraction and Motion Estimation Slides by: Michael Black Clark F. Olson Jean Ponce.
Visual Odometry for Ground Vehicle Applications David Nistér, Oleg Naroditsky, and James Bergen Sarnoff Corporation CN5300 Princeton, New Jersey
Robust Estimation Course web page: vision.cis.udel.edu/~cv April 23, 2003  Lecture 25.
Lecture 22: Structure from motion CS6670: Computer Vision Noah Snavely.
Correspondence and Stereopsis. Introduction Disparity – Informally: difference between two pictures – Allows us to gain a strong sense of depth Stereopsis.
11/25/03 3D Model Acquisition by Tracking 2D Wireframes Presenter: Jing Han Shiau M. Brown, T. Drummond and R. Cipolla Department of Engineering University.
SLAM : Simultaneous Localization and Mapping
Paper – Stephen Se, David Lowe, Jim Little
Florian Shkurti, Ioannis Rekleitis, Milena Scaccia and Gregory Dudek
Noah Snavely.
Presentation transcript:

SLAM (Simultaneously Localization and Mapping) Presenter : Jeongkyun Lee

Contents What is SLAM SfM-based SLAM Filter-based SLAM Comparison Other SLAMs Research topics

What is SLAM Simultaneously Localization and Mapping Unknown Environment Given only images Unknown Pose

What is SLAM How to localize & map Structure-from-Motion based Filtering based Pay attention to : Initialization Measures ( Matching features ) Localization & Mapping

Fundamentals Geometry Projection matrix Calibration matrix Rotation matrix Translation vector 3D homogeneous vector 2D image point Normalized point Fundamental matrix Essential matrix where * http://www.umiacs.umd.edu/~ramani/cmsc828d/lecture27.pdf * Multiple View Geometry in Computer Vision, R. Hartley and A. Zisserman, Cambridge, University Press, 2000

Fundamentals 5-point algorithm1) SfM-based SLAM Fundamentals 5-point algorithm1) Rotation matrix : 3 DoF (Rodrigues’ formula) Translation vector : 3 DoF Thus, is 5 DoF. ( 3 + 3 – 1, 1 DoF for scaling factor ) Given 5 pairs of points on the image planes, We can obtain . PnP problem (Perspective-n-Point problem) Given n 3D-to-2D point correspondences We can obtain . Grunert’s algorithm2) (P3P) EPnP3) Robust PnP4) ... Known Environment Unknown pose Corresponding image points 1) D. Nister, An Efficient Solution to the Five-Point Relative Pose Problem, IEEE Conference on Computer Vision and Pattern Recognition, Volume 2, pp. 195-202, 2003. 2) R. M. Haralick, C. N. Lee, K. Ottenberg and M. Nolle, Review and Analysis of Solutions of the Three Point Perspective Pose Estimation Problem, International Journal of Computer Vision, 1994. 3) F. Moreno-Noguer, V. Lepetit and P. Fua , Accurate Non-Iterative O(n) Solution to the PnP Problem, IEEE International Conference on Computer Vision, Rio de Janeiro, Brazil, October 2007. 4) S. Li, C. Xu, M. Xie, A Robust O(n) Solution to the Perspective-n-Point Problem, IEEE Transactions on Pattern Analysis and Machine Intelligence (2012) Volume 34, Issue 7, pp. 1444-1450

SfM-based SLAM Visual Odometry1) … Feature Detection : Harris corners Feature Matching : Normalized Corss Correlation (NCC) Only matches between detected features within a fixed distance. Procedure 5-point algorithm3),4) & RANSAC3) Given 3 frames P3P algorithm & RANSAC3) Triangulation Re-triangulation using first & last observations Given 1 frames … P3P 5P 1) D. Nister, O. Naroditsky, J. Bergen, Visual odometry, Computer Vision and Pattern Recognition, July 2004. 2) D. Nister, Preemptive RANSAC for Live Structure and Motion Estimation, IEEE International Conference on Computer Vision, 2003. 3) D. Nister, An Efficient Solution to the Five-Point Relative Pose Problem, IEEE Conference on Computer Vision and Pattern Recognition, Volume 2, pp. 195-202, 2003. 4) D. Nister, An Efficient Solution to the Five-Point Relative Pose Problem, IEEE Transactions on Pattern Analysis and Machine Intelligence, 2004.

SfM-based SLAM Real Time Localization and 3D Reconstruction1) LBA2) (Levenberg-Marquardt algorithm (LM)) Minimize where Visual odometry Local bundle adjustment (LBA) + : Extrinsic parameters : Projection matrix : The square of Euclidean distance : Estimated projection of point through the camera : Observation 1) E. Mouragnon, M. Lhuillier, M. Dhome, F. Dekeyser, P. Sayd, Real Time Localization and 3D Reconstruction, Computer Vision and Pattern Recognition, 2006. 2) B. Triggs, P. F. McLauchlan, R. I. Hartley & A. W. Fitzibbon, Bundle adjustment – A modern synthesis, in Vision Algorithms: Theory and Practice, LNCS, pp. 298-375, Springer Verlag, 2000.

SfM-based SLAM Real Time Localization and 3D Reconstruction n : number of optimized camera poses N : number of cameras used for reprojection criterion minimization

SfM-based SLAM Real Time Localization and 3D Reconstruction Key frame selection Number of matched points Uncertainty of camera pose ( Obtained from the hessian inverse ) Complexity : Experiments 512 x 384 pixes, 75 fps, 94 key frames from a series of 445.

Measurements Acquisition Filter-based SLAM MonoSLAM1) EKF-based Filter initialization Map management ( Generate & delete features ) Prediction Measurements acquisition Data association Update To easily explain…. Prediction Measurements Acquisition Update 1) A. J. Davison, I. D. Reid, N. D. Molton, O. Stasse, MonoSLAM: Real-Time Single Camera SLAM, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 29, no. 6, June 2007.

Measurements Acquisition Filter-based SLAM MonoSLAM Prediction Measurements Acquisition Update State Prediction Dynamic System Model (Constant Velocity Model) : 3D position vector : orientation quaternion : linear velocity vector : angular velocity vector : landmark position vector

Measurements Acquisition Filter-based SLAM MonoSLAM Prediction Measurements Acquisition Update For Matching the patch by NCC at Max NCC value at > threshold Measurement Prediction of measurements Find measurements Active search1),2) : a covariance matrix for the 2D position of i th landmark 1) A. J. Davison, Active Search for Real-Time Vision, International Conference Computer Vision, 2005. 2) M. Chli, A. J. Davison, Active Matching for Visual Tracking, Robotics and Autonomous Systems, 57(12):1173-1187, 2009.

Measurements Acquisition Filter-based SLAM MonoSLAM Prediction Measurements Acquisition Update Update : a Kalman gain at time t 1) A. J. Davison, Active Search for Real-Time Vision, International Conference Computer Vision, 2005. 2) M. Chli, A. J. Davison, Active Matching for Visual Tracking, Robotics and Autonomous Systems, 57(12):1173-1187, 2009.

Filter-based SLAM MonoSLAM Initialization of features Data association Delayed : SfM Undelayed : Inverse depth parameterization1) Data association 1-point RANSAC2) Joint Compatibility Branch and Bound3) (JCBB) Experiment 1.6GHz Pentium M processor 1) J. Civera, A. J. Davison, J. M. M. Montieal, Inverse Depth Parametrization for Monocular SLAM, IEEE Transactions on Robotics 24(5):932-945, 2008. 2) J. Civera, O. G. Grasa, A. J. Davison, J. M. M. Montiel, 1-Point RANSAC for EKF Filtering. Application to Real-Time Structure from Motion and Visual Odometry , Journal of Field Robotics, 2010 3) J. Neira, J. D. Tardos, Data association in stochastic mapping using the joint compatibility test, IEEE Transactions on Robotics and Automation, 17(6):890-897, Dec 2001.

Comparison SfM-based Filter-based Initialization 5-point algorithm Delayed : SfM Undelayed : Inverse depth parameterization Measurement NCC matching (from extracted feature points) KLT tracker Active search (prediction & templete matching) Estimation technique LBA (after p3p algorithm) Kalman filtering (prediction & update) Tracking 3~400 points in a frame Working in real time within 100 landmarks.

Searching a large number (1000) of the re-projected features Other SLAMs PTAM (Parallel Tracking and Mapping)1) Separate Tracking / Mapping Redundancy : use only key frames. Accuracy : available to optimization. Tracking Mapping Pose estimate & Map point projection Searching a small number (50) of the coarsest-scale features by pyramid Pose update Searching a large number (1000) of the re-projected features Final pose estimation 1) G. Klein, D. Murray, Parallel Tracking and Mapping for Small AR Workspaces, ACM International Symposium on Mixed and Augmented Reality, 2007.

Other SLAMs PTAM (Parallel Tracking and Mapping) Features Experiments Matching : zero mean SSD Key frame > 20 frames from the last key frame. Minimum distance away from the nearest key point. Point initialization Epipolar search Data association refinement Create new features in older keyframes. Re-measure outlier measurements. Experiments Intel Core 2 Duo 2.66GHz, 600x480 pixels 6000 point, 150 keyframes.

Topics Filter-based… Divergence Error accumulation Data association Beyond Spatial Pyramids: A New Feature Extraction Framework with Dense Spatial Sampling for Image Classification Topics Filter-based… Divergence Relocalization Multiple model Resilience Other filtering techniques Error accumulation Loop-closing Combining LBA, visual odometry Data association 1-point RANSAC ICNN, SCNN, JCBB Dynamic environment SLAMMOT (SLAM and Moving Object Tracking) Multi-view, Sensor fusion Application Dense 3d reconstruction AR Deblurring

Thank you!