René Vidal and Xiaodong Fan Center for Imaging Science

Slides:



Advertisements
Similar presentations
Epipolar Geometry.
Advertisements

Lecture 11: Two-view geometry
Two-View Geometry CS Sastry and Yang
Two-view geometry.
Camera calibration and epipolar geometry
Structure from motion.
MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.
Geometry of Images Pinhole camera, projection A taste of projective geometry Two view geometry:  Homography  Epipolar geometry, the essential matrix.
Epipolar geometry. (i)Correspondence geometry: Given an image point x in the first view, how does this constrain the position of the corresponding point.
Structure from motion. Multiple-view geometry questions Scene geometry (structure): Given 2D point matches in two or more images, where are the corresponding.
Uncalibrated Geometry & Stratification Sastry and Yang
Lecture 21: Multiple-view geometry and structure from motion
Multiple-view Reconstruction from Points and Lines
CAU Kiel DAGM 2001-Tutorial on Visual-Geometric 3-D Scene Reconstruction 1 The plan for today Leftovers and from last time Camera matrix Part A) Notation,
© 2003 by Davi GeigerComputer Vision October 2003 L1.1 Structure-from-EgoMotion (based on notes from David Jacobs, CS-Maryland) Determining the 3-D structure.
Previously Two view geometry: epipolar geometry Stereo vision: 3D reconstruction epipolar lines Baseline O O’ epipolar plane.
Lec 21: Fundamental Matrix
Camera parameters Extrinisic parameters define location and orientation of camera reference frame with respect to world frame Intrinsic parameters define.
Multiple View Geometry. THE GEOMETRY OF MULTIPLE VIEWS Reading: Chapter 10. Epipolar Geometry The Essential Matrix The Fundamental Matrix The Trifocal.
3-D Scene u u’u’ Study the mathematical relations between corresponding image points. “Corresponding” means originated from the same 3D point. Objective.
55:148 Digital Image Processing Chapter 11 3D Vision, Geometry Topics: Basics of projective geometry Points and hyperplanes in projective space Homography.
Multi-view geometry. Multi-view geometry problems Structure: Given projections of the same 3D point in two or more images, compute the 3D coordinates.
Automatic Camera Calibration
Computer vision: models, learning and inference
Epipolar geometry The fundamental matrix and the tensor
1 Preview At least two views are required to access the depth of a scene point and in turn to reconstruct scene structure Multiple views can be obtained.
Projective cameras Motivation Elements of Projective Geometry Projective structure from motion Planches : –
Lecture 04 22/11/2011 Shai Avidan הבהרה : החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
Announcements Project 3 due Thursday by 11:59pm Demos on Friday; signup on CMS Prelim to be distributed in class Friday, due Wednesday by the beginning.
Geometry of Multiple Views
Affine Structure from Motion
Raquel A. Romano 1 Scientific Computing Seminar May 12, 2004 Projective Geometry for Computer Vision Projective Geometry for Computer Vision Raquel A.
Two-view geometry. Epipolar Plane – plane containing baseline (1D family) Epipoles = intersections of baseline with image planes = projections of the.
EECS 274 Computer Vision Affine Structure from Motion.
Feature Matching. Feature Space Outlier Rejection.
Mixing Catadioptric and Perspective Cameras
55:148 Digital Image Processing Chapter 11 3D Vision, Geometry Topics: Basics of projective geometry Points and hyperplanes in projective space Homography.
Computer vision: models, learning and inference M Ahad Multiple Cameras
MASKS © 2004 Invitation to 3D vision Uncalibrated Camera Chapter 6 Reconstruction from Two Uncalibrated Views Modified by L A Rønningen Oct 2008.
Reconstruction from Two Calibrated Views Two-View Geometry
Uncalibrated reconstruction Calibration with a rig Uncalibrated epipolar geometry Ambiguities in image formation Stratified reconstruction Autocalibration.
Stereo March 8, 2007 Suggested Reading: Horn Chapter 13.
Structure from motion Multi-view geometry Affine structure from motion Projective structure from motion Planches : –
Lec 26: Fundamental Matrix CS4670 / 5670: Computer Vision Kavita Bala.
Multi-view geometry. Multi-view geometry problems Structure: Given projections of the same 3D point in two or more images, compute the 3D coordinates.
55:148 Digital Image Processing Chapter 11 3D Vision, Geometry
A Closed Form Solution to Direct Motion Segmentation
Motion Segmentation with Missing Data using PowerFactorization & GPCA
Segmentation of Dynamic Scenes
Part II Applications of GPCA in Computer Vision
L-infinity minimization in geometric vision problems.
René Vidal Time/Place: T-Th 4.30pm-6pm, Hodson 301
Segmentation of Dynamic Scenes
A Unified Algebraic Approach to 2D and 3D Motion Segmentation
Segmentation of Dynamic Scenes from Image Intensities
Optical Flow Estimation and Segmentation of Moving Dynamic Textures
The Brightness Constraint
Two-view geometry Computer Vision Spring 2018, Lecture 10
Epipolar geometry.
3D Photography: Epipolar geometry
Structure from motion Input: Output: (Tomasi and Kanade)
Omnidirectional epipolar geometry
Uncalibrated Geometry & Stratification
George Mason University
Segmentation of Dynamical Scenes
Two-view geometry.
Two-view geometry.
Multi-view geometry.
Single-view geometry Odilon Redon, Cyclops, 1914.
Structure from motion Input: Output: (Tomasi and Kanade)
Presentation transcript:

Two-View Motion Segmentation by Exploiting the Rank and Geometry of the Multibody Fundamental Matrix René Vidal and Xiaodong Fan Center for Imaging Science Johns Hopkins University Add d15.avi

Structure and motion recovery problem Input: Corresponding points in multiple images Output: camera motion, scene structure, camera calibration Structure = 3D surface Motion = camera position and orientation Copyright @ Rene Vidal

The fundamental matrix Input: point correspondences Output: rigid motion Rotation: Translation: Epipolar constraint Estimate fundamental matrix F linearly Eight-point algorithm Project onto the essential manifold for noise reduction Recover (R,T) from F Write Sym(F_1,…., F_n). Copyright @ Rene Vidal

Projection onto the essential manifold Characterizing essential manifold F Extend to multiple motions! Projection – minimizing the Frobenious norm: Copyright @ Rene Vidal

3-D motion segmentation problem Given a set of point correspondences in multiple views, determine Number of motion models Motion model: affine Segmentation: model to which each pixel belongs Mathematics of the problem depends on Number of frames (2, 3, multiple) Projection model (affine, perspective) Motion model (affine, translational, planar motion, rigid motion) 3-D structure (planar or not) Copyright @ Rene Vidal

Our approach to motion segmentation Estimation of multiple motion models equivalent to estimation of one multibody motion model Eliminate feature clustering: multiplication Estimate a single multibody motion model: polynomial fitting Segment multibody motion model: polynomial differentiation chicken-and-egg Copyright @ Rene Vidal

A unified approach to motion segmentation Applies to most motion models in computer vision All motion models can be segmented algebraically by Fitting multibody model: real or complex polynomial to all data Fitting individual model: differentiate polynomial at a data point Copyright @ Rene Vidal

Multibody epipolar constraint Rotation: Translation: Epipolar constraint Multiple motions Multibody epipolar constraint Satisfied by ALL points regardless of segmentation Segmentation is algebraically eliminated!!! Copyright @ Rene Vidal

Multibody fundamental matrix Lifting Embedding Bilinear on embedded data! Veronese map (polynomial embedding) Multibody fundamental matrix Copyright @ Rene Vidal

Multibody Fundamental Matrix n-body motion 1-body motion Copyright @ Rene Vidal

Epipoles in the nullspace of F Embedded epipoles are in the null space of the multibody fundamental matrix Are all vectors in Null(F) embedded epipoles, or their linear combinations? Are embedded epipoles linearly independent? F The dimension of Null(F) Copyright @ Rene Vidal

Rank constraint of F 1-body motion 2-body motion In general, we prove: # of distinct epipoles Key observation: repeated epipoles will enlarge the null(F) ! time each eipipole repeates Copyright @ Rene Vidal

Dimension of Null(F) Let be Different epipoles (up to scales), each of which repeats times Define as the span of the l-th order derivatives of Copyright @ Rene Vidal

Dimension of Null(F) Lemma 1: Lemma 2: when Lemma 3: Copyright @ Rene Vidal

Projection onto the essential manifold Characterizing essential manifold F Extend to multiple motions! Projection – minimizing the Frobenious norm: Copyright @ Rene Vidal

Multibody essential manifold with common rotations n-body motion 1-body motion Copyright @ Rene Vidal

SVD of When the number of motions is odd When there are two motions Theorem: The singular values of a skew-symmetric matrix are When there are two motions Theorem: The singular values satisfies Copyright @ Rene Vidal

Projection onto the multibody essential manifold Based on the proposed projection theorem, the projection process is: Linearly estimate F Compute SVD Replace by the “desired” singular values Satisfy the multi-body essential manifold constraint Copyright @ Rene Vidal

Compute the “desired” singular values Using Lagrangian multiplier method to solve the constraint optimization problem Two motions with a common rotation Odd number of motions Copyright @ Rene Vidal

Multibody 8-point algorithm Multibody epipolar constraint Multibody epipolar transfer Multibody epipole Fundamental matrices Given rank condition n linear system F Lifting Embedding Sym(F1,,,F_n) This can be solved using a collection of GPCA’s Copyright @ Rene Vidal

Experiments Compare three methods: GPCA (Vidal and Ma, 2004): specifically designed for the purely translational motions. Multibody epipolar constraint without projection (MEC-noprojection): using the proposed approach without projecting the estimated multibody fundamental matrix F onto the multibody essential manifold. Multibody epipolar constraint with projection (MEC-projection): project the estimated multibody fundamental matrix F onto the multibody essential manifold. Copyright @ Rene Vidal

Experimental results on synthesized data Error in the estimation of translational component of motion (in degree) as a function of noise in the image points (std in pixels) for two pure translational motions (left column); and as a function of rotation angle (in degree) for two independent motions with a common rotation (right column). (a) in translation Copyright @ Rene Vidal

Experimental results on synthesized data Error in the estimation of rotational component of motion (in degree) as a function of noise in the image points (std in pixels) for two pure translational motions (left column); and as a function of rotation angle (in degree) for two independent motions with a common rotation (right column). (b) in rotation Copyright @ Rene Vidal

Experimental results on synthesized data Error in the segmentation (in percentage) as a function of noise in the image points (std in pixels) for two pure translational motions (left column); and as a function of rotation angle (in degree) for two independent motions with a common rotation (right column). (c) in segmentation Copyright @ Rene Vidal

Experimental results on real images Experiment setup: The original sequence contains three independent motions, the first two of which are translational. Point correspondences, Compute the rotational component R of the third motion. Rotate the feature points in the first frame that belong to the first motion, Then undergo a two-body motion with a common rotation. Copyright @ Rene Vidal

Experimental results on real images Percentage of mis-segmentation on a real sequence with two independent motions (with a common rotation) for different pairs of frames. The x-axis indicates the index of frame pairs. Copyright @ Rene Vidal

Conclusions Unified algebraic approach to motion segmentation Fit a polynomial to all image data Differentiate the polynomial to obtain motion parameters Applies to most motion models in vision Two views 2-D: translational, similarity, affine 3-D: translational, fundamental matrices, homographies Three views Multibody trifocal tensor Multiple views Affine cameras Two perspective views Rank of multibody fundamental matrices Geometry of multibody fundamental matrices Copyright @ Rene Vidal