MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

Slides:



Advertisements
Similar presentations
Epipolar Geometry.
Advertisements

Motion.
Lecture 11: Two-view geometry
MASKS © 2004 Invitation to 3D vision Lecture 7 Step-by-Step Model Buidling.
Two-View Geometry CS Sastry and Yang
Computer Vision Optical Flow
Camera calibration and epipolar geometry
Structure from motion.
MASKS © 2004 Invitation to 3D vision Lecture 11 Vision-based Landing of an Unmanned Air Vehicle.
Multiple View Geometry
Parameter estimation class 5 Multiple View Geometry Comp Marc Pollefeys.
Epipolar geometry. (i)Correspondence geometry: Given an image point x in the first view, how does this constrain the position of the corresponding point.
Structure from motion. Multiple-view geometry questions Scene geometry (structure): Given 2D point matches in two or more images, where are the corresponding.
Uncalibrated Geometry & Stratification Sastry and Yang
Multiple View Geometry Marc Pollefeys University of North Carolina at Chapel Hill Modified by Philippos Mordohai.
Multiple-view Reconstruction from Points and Lines
CSc83029 – 3-D Computer Vision/ Ioannis Stamos 3-D Computational Vision CSc Optical Flow & Motion The Factorization Method.
Image Segmentation Some slides: courtesy of O. Capms, Penn State, J.Ponce and D. Fortsyth, Computer Vision Book.
Image-Based Rendering using Hardware Accelerated Dynamic Textures Keith Yerex Dana Cobzas Martin Jagersand.
Previously Two view geometry: epipolar geometry Stereo vision: 3D reconstruction epipolar lines Baseline O O’ epipolar plane.
Computer Vision Structure from motion Marc Pollefeys COMP 256 Some slides and illustrations from J. Ponce, A. Zisserman, R. Hartley, Luc Van Gool, …
Image Matching via Saliency Region Correspondences Alexander Toshev Jianbo Shi Kostas Daniilidis IEEE Conference on Computer Vision and Pattern Recognition.
Lec 21: Fundamental Matrix
Multi-view geometry. Multi-view geometry problems Structure: Given projections of the same 3D point in two or more images, compute the 3D coordinates.
Computing the Fundamental matrix Peter Praženica FMFI UK May 5, 2008.
Geometry and Algebra of Multiple Views
Epipolar geometry The fundamental matrix and the tensor
Projective cameras Motivation Elements of Projective Geometry Projective structure from motion Planches : –
The Brightness Constraint
CSCE 643 Computer Vision: Structure from Motion
Lec 22: Stereo CS4670 / 5670: Computer Vision Kavita Bala.
Announcements Project 3 due Thursday by 11:59pm Demos on Friday; signup on CMS Prelim to be distributed in class Friday, due Wednesday by the beginning.
Parameter estimation. 2D homography Given a set of (x i,x i ’), compute H (x i ’=Hx i ) 3D to 2D camera projection Given a set of (X i,x i ), compute.
Affine Structure from Motion
Raquel A. Romano 1 Scientific Computing Seminar May 12, 2004 Projective Geometry for Computer Vision Projective Geometry for Computer Vision Raquel A.
EECS 274 Computer Vision Affine Structure from Motion.
Reconstruction from Two Calibrated Views Two-View Geometry
Geometry Reconstruction March 22, Fundamental Matrix An important problem: Determine the epipolar geometry. That is, the correspondence between.
Uncalibrated reconstruction Calibration with a rig Uncalibrated epipolar geometry Ambiguities in image formation Stratified reconstruction Autocalibration.
Structure from motion Multi-view geometry Affine structure from motion Projective structure from motion Planches : –
Parameter estimation class 5 Multiple View Geometry CPSC 689 Slides modified from Marc Pollefeys’ Comp
MASKS © 2004 Invitation to 3D vision Lecture 3 Image Primitives andCorrespondence.
Motion Segmentation CAGD&CG Seminar Wanqiang Shen
Lec 26: Fundamental Matrix CS4670 / 5670: Computer Vision Kavita Bala.
11/25/03 3D Model Acquisition by Tracking 2D Wireframes Presenter: Jing Han Shiau M. Brown, T. Drummond and R. Cipolla Department of Engineering University.
8/16/99 Computer Vision: Vision and Modeling. 8/16/99 Lucas-Kanade Extensions Support Maps / Layers: Robust Norm, Layered Motion, Background Subtraction,
A Closed Form Solution to Direct Motion Segmentation
Motion Segmentation with Missing Data using PowerFactorization & GPCA
Part I 1 Title 2 Motivation 3 Problem statement 4 Brief review of PCA
Computer Vision, Robotics, Machine Learning and Control Lab
René Vidal and Xiaodong Fan Center for Imaging Science
Segmentation of Dynamic Scenes
Part II Applications of GPCA in Computer Vision
L-infinity minimization in geometric vision problems.
René Vidal Center for Imaging Science
Motion Detection And Analysis
René Vidal Time/Place: T-Th 4.30pm-6pm, Hodson 301
Segmentation of Dynamic Scenes
A Unified Algebraic Approach to 2D and 3D Motion Segmentation
Segmentation of Dynamic Scenes from Image Intensities
Optical Flow Estimation and Segmentation of Moving Dynamic Textures
Dynamic Scene Reconstruction using GPCA
Generalized Principal Component Analysis CVPR 2008
Parameter estimation class 5
Epipolar geometry.
Estimating 2-view relationships
Uncalibrated Geometry & Stratification
George Mason University
Segmentation of Dynamical Scenes
Chapter 11: Stereopsis Stereopsis: Fusing the pictures taken by two cameras and exploiting the difference (or disparity) between them to obtain the depth.
Presentation transcript:

MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes

MASKS © 2004 Invitation to 3D vision One-body two-views

MASKS © 2004 Invitation to 3D vision One-body multiple-views

MASKS © 2004 Invitation to 3D vision Motivation and problem statement A static scene: multiple 2D motion models A dynamic scene: multiple 3D motion models Given an image sequence, determine Number of motion models (affine, Euclidean, etc.) Motion model: affine (2D) or Euclidean (3D) Segmentation: model to which each pixel belongs

MASKS © 2004 Invitation to 3D vision Previous work on 2D motion segmentation Local methods (Wang-Adelson ’93) Estimate one model per pixel using data in a window Cluster models with K-means Iterate Aperture problem Motion across boundaries Global methods (Irani-Peleg ‘92) Dominant motion: fit one motion model to all pixels Look for misaligned pixels & fit a new model to them Iterate Normalized cuts (Shi- Malik ‘98) Similarity matrix based on motion profile Segment pixels using eigenvector

MASKS © 2004 Invitation to 3D vision Previous work on 3D motion segmentation Factorization techniques Orthographic/discrete: Costeira-Kanade ’98, Gear ‘98 Perspective/continuous: Vidal-Soatto-Sastry ’02 Omnidirectional/continuous: Shakernia-Vidal-Sastry ’03 Special cases: Points in a line (orth-discrete): Han and Kanade ’00 Points in a conic (perspective): Avidan-Shashua ’01 Points in a line (persp.-continuous): Levin-Shashua ’01 2-body case: Wolf-Shashua ‘01

MASKS © 2004 Invitation to 3D vision Previous work: probabilistic techniques Probabilistic approaches Generative model: data membership + motion model Obtain motion models using Expectation Maximization E-step: Given motion models, segment image data M-step: Given data segmentation, estimate motion models 2D Motion Segmentation Layered representation (Jepson-Black’93, Ayer-Sawhney ’95, Darrel-Pentland’95, Weiss-Adelson’96, Weiss’97, Torr-Szeliski- Anandan ’99) 3D Motion Segmentation EM+Reprojection Error: Feng-Perona’98 EM+Model Selection: Torr ’98 How to initialize iterative algorithms?

MASKS © 2004 Invitation to 3D vision Our approach to motion segmentation This work considers full perspective projection multiple objects general motions We show that Problem is equivalent to polynomial factorization There is a unique global closed form solution if n<5 Exact solution is obtained using linear algebra Can be used to initialize EM-based algorithms Image points Number of motions Multibody Fund. Matrix Epipolar lines Epipoles Fundamental Matrices Motion segmentation Multi epipolar lines Multi epipole

MASKS © 2004 Invitation to 3D vision One-dimensional segmentation Number of models?

MASKS © 2004 Invitation to 3D vision One-dimensional segmentation For n groups Number of groups Groups

MASKS © 2004 Invitation to 3D vision One-dimensional segmentation Solution is unique if Solution is closed form if and only if Solves the eigenvector segmentation problem e.g. normalized cuts

MASKS © 2004 Invitation to 3D vision Three-dimensional motion segmentation Generalized PCA (Vidal, Ma ’02) Solve for the roots of a polynomial of degree in one variable Solve for a linear system in variables

MASKS © 2004 Invitation to 3D vision The multibody epipolar constraint Rotation: Translation: Epipolar constraint Multiple motions Multibody epipolar constraint Satisfied by ALL points regardless of segmentation Segmentation is algebraically eliminated!!!

MASKS © 2004 Invitation to 3D vision The multibody fundamental matrix Lifting Embedding Bilinear on embedded data! Veronese map (polynomial embedding) Multibody fundamental matrix

MASKS © 2004 Invitation to 3D vision Estimation of the number of motions Theorem: Given image points corresponding to motions, if at least 8 points correspond to each object, then Minimum number of points

MASKS © 2004 Invitation to 3D vision Estimation of multibody fundamental matrix 1-body motion n-body motion

MASKS © 2004 Invitation to 3D vision Segmentation of fundamental matrices Given rank condition for n motions linear system F Multibody epipolar transfer Multibody epipole Fundamental matrices

MASKS © 2004 Invitation to 3D vision Multibody epipolar transfer Multibody epipolar line Lifting Polynomial factorization

MASKS © 2004 Invitation to 3D vision Multibody epipole The multibody epipole is the solution of the linear system Epipoles are obtained using polynomial factorization Lifting Number of distinct epipoles

MASKS © 2004 Invitation to 3D vision From images to epipoles

MASKS © 2004 Invitation to 3D vision Fundamental matrices Columns of are epipolar lines Polynomial factorization to compute them up to scale Scales can be computed linearly

MASKS © 2004 Invitation to 3D vision The multibody 8-point algorithm Image point Multibody epipolar line Multibody epipolar transfer Polynomial Factorization Linear system Polynomial Factorization Epipolar lines Multibody epipole Epipoles Embedded image point Veronese map Fundamental matrix Linear system

MASKS © 2004 Invitation to 3D vision Remarks about the algorithm Algebraically equivalent to polynomial factorization Requires solving for roots of polynomial of degree n in one variable There is a closed form solution if n<5 The algorithm is probably polynomial time It requires O(n 4 ) image points It neglects internal structure of the multibody fundamental matrix

MASKS © 2004 Invitation to 3D vision Optimal 3D motion segmentation Zero-mean Gaussian noise Constrained optimization problem on Optimal function for 1 motion Optimal function for n motions Solved using Riemanian Gradien Descent

MASKS © 2004 Invitation to 3D vision Comparison of 1 body and n bodies

MASKS © 2004 Invitation to 3D vision Other cases: linearly moving objects Multibody epipole Recovery of epipoles Fundamental matrices Feature segmentation Minimum number of points

MASKS © 2004 Invitation to 3D vision Other cases: affine flows Affine motion segmentation: constant brightness constraint 3D motion segmentation: epipolar constraint In linear motions, geometric constraints are linear Two-view motion constraints could be bilinear!!!

MASKS © 2004 Invitation to 3D vision Other cases: multiple views Compute optical flow for N pixels in F frames relative to reference frame Form optical flow matrix Similarity matrix is Apply GPCA to firstsingular vectors of to obtain the segmentation of the image using motion 0 F 2 1

MASKS © 2004 Invitation to 3D vision 3D motion segmentation results N = = 173

MASKS © 2004 Invitation to 3D vision Results: affine motion segmentation

MASKS © 2004 Invitation to 3D vision Results

MASKS © 2004 Invitation to 3D vision Results

MASKS © 2004 Invitation to 3D vision More results

MASKS © 2004 Invitation to 3D vision Conclusions There is an analytic solution to 3D motion segmentation based on Multibody epipolar constraint: it does not depend on the segmentation of the data Polynomial factorization: linear algebra Solution is closed form iff n<5 A similar technique also applies to Eigenvector segmentation: from similarity matrices Generalized PCA: mixtures of subspaces 2-D motion segmentation: of affine motions Future work Reduce data complexity, sensitivity analysis, robustness

MASKS © 2004 Invitation to 3D vision References R. Vidal, Y. Ma, S. Soatto and S. Sastry. Two-view multibody structure from motion, International Journal of Computer Vision, 2004 R. Vidal and S. Sastry. Optimal segmentation of dynamic scenes from two perspective views, International Conference on Computer Vision and Pattern Recognition, 2003 R. Vidal and S. Sastry. Segmentation of dynamic scenes from image intensities, IEEE Workshop on Vision and Motion Computing, 2002.