Segmentation of Dynamic Scenes

Slides:

Advertisements

Similar presentations

Lecture 11: Two-view geometry

Advertisements

Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.

MASKS © 2004 Invitation to 3D vision Lecture 7 Step-by-Step Model Buidling.

Two-View Geometry CS Sastry and Yang

On Constrained Optimization Approach To Object Segmentation Chia Han, Xun Wang, Feng Gao, Zhigang Peng, Xiaokun Li, Lei He, William Wee Artificial Intelligence.

Computer Vision Optical Flow

Structure from motion.

MASKS © 2004 Invitation to 3D vision Lecture 8 Segmentation of Dynamical Scenes.

Multiple View Geometry

Parameter estimation class 5 Multiple View Geometry Comp Marc Pollefeys.

Epipolar geometry. (i)Correspondence geometry: Given an image point x in the first view, how does this constrain the position of the corresponding point.

Structure from motion. Multiple-view geometry questions Scene geometry (structure): Given 2D point matches in two or more images, where are the corresponding.

Uncalibrated Geometry & Stratification Sastry and Yang

Multiple View Geometry Marc Pollefeys University of North Carolina at Chapel Hill Modified by Philippos Mordohai.

Computer Vision Structure from motion Marc Pollefeys COMP 256 Some slides and illustrations from J. Ponce, A. Zisserman, R. Hartley, Luc Van Gool, …

Image Matching via Saliency Region Correspondences Alexander Toshev Jianbo Shi Kostas Daniilidis IEEE Conference on Computer Vision and Pattern Recognition.

Lec 21: Fundamental Matrix

Computer vision: models, learning and inference

Computing the Fundamental matrix Peter Praženica FMFI UK May 5, 2008.

Epipolar geometry The fundamental matrix and the tensor

The Brightness Constraint

CSCE 643 Computer Vision: Structure from Motion

Non-Euclidean Example: The Unit Sphere. Differential Geometry Formal mathematical theory Work with small ‘patches’ –the ‘patches’ look Euclidean Do calculus.

Lec 22: Stereo CS4670 / 5670: Computer Vision Kavita Bala.

Parameter estimation. 2D homography Given a set of (x i,x i ’), compute H (x i ’=Hx i ) 3D to 2D camera projection Given a set of (X i,x i ), compute.

Affine Structure from Motion

Raquel A. Romano 1 Scientific Computing Seminar May 12, 2004 Projective Geometry for Computer Vision Projective Geometry for Computer Vision Raquel A.

EECS 274 Computer Vision Affine Structure from Motion.

Reconstruction from Two Calibrated Views Two-View Geometry

Affine Registration in R m 5. The matching function allows to define tentative correspondences and a RANSAC-like algorithm can be used to estimate the.

Structure from motion Multi-view geometry Affine structure from motion Projective structure from motion Planches : –

Parameter estimation class 5 Multiple View Geometry CPSC 689 Slides modified from Marc Pollefeys’ Comp

Motion Segmentation at Any Speed Shrinivas J. Pundlik Department of Electrical and Computer Engineering, Clemson University, Clemson, SC.

Motion Segmentation CAGD&CG Seminar Wanqiang Shen

Lec 26: Fundamental Matrix CS4670 / 5670: Computer Vision Kavita Bala.

8/16/99 Computer Vision: Vision and Modeling. 8/16/99 Lucas-Kanade Extensions Support Maps / Layers: Robust Norm, Layered Motion, Background Subtraction,

Reconstruction of a Scene with Multiple Linearly Moving Objects Mei Han and Takeo Kanade CISC 849.

A Closed Form Solution to Direct Motion Segmentation

Motion Segmentation with Missing Data using PowerFactorization & GPCA

Projective Factorization of Multiple Rigid-Body Motions

Part I 1 Title 2 Motivation 3 Problem statement 4 Brief review of PCA

Computer Vision, Robotics, Machine Learning and Control Lab

René Vidal and Xiaodong Fan Center for Imaging Science

Pursuit Evasion Games and Multiple View Geometry

Part II Applications of GPCA in Computer Vision

L-infinity minimization in geometric vision problems.

René Vidal Center for Imaging Science

Observability and Identification of Linear Hybrid Systems

René Vidal Time/Place: T-Th 4.30pm-6pm, Hodson 301

Segmentation of Dynamic Scenes

Vision Based Motion Estimation for UAV Landing

A Unified Algebraic Approach to 2D and 3D Motion Segmentation

Segmentation of Dynamic Scenes from Image Intensities

Pursuit Evasion Games and Multiple View Geometry

Optical Flow Estimation and Segmentation of Moving Dynamic Textures

Observability, Observer Design and Identification of Hybrid Systems

Dynamic Scene Reconstruction using GPCA

Generalized Principal Component Analysis CVPR 2008

Parameter estimation class 5

Epipolar geometry.

3D Photography: Epipolar geometry

EE 290A Generalized Principal Component Analysis

Omnidirectional epipolar geometry

Estimating 2-view relationships

Motion Segmentation at Any Speed

Uncalibrated Geometry & Stratification

George Mason University

Segmentation of Dynamical Scenes

Chapter 11: Stereopsis Stereopsis: Fusing the pictures taken by two cameras and exploiting the difference (or disparity) between them to obtain the depth.

Optical flow and keypoint tracking

Presentation transcript:

Segmentation of Dynamic Scenes René Vidal Department of EECS, UC Berkeley

Motivation and problem statement A static scene: multiple 2D motion models A dynamic scene: multiple 3D motion models Given an image sequence, determine Number of motion models (affine, Euclidean, etc.) Motion model: affine (2D) or Euclidean (3D) Segmentation: model to which each pixel belongs

Previous work on 2D motion segmentation Local methods (Wang-Adelson ’93) Estimate one model per pixel using a data in a window Cluster models with K-means Iterate Aperture problem Motion across boundaries Global methods (Irani-Peleg ‘92) Dominant motion: fit one motion model to all pixels Look for misaligned pixels & fit a new model to them Normalized cuts (Shi-Malik ‘98) Similarity matrix based on motion profile Segment pixels using eigenvector

Previous work on 3D motion segmentation Factorization techniques Orthographic/discrete: Costeira-Kanade ’98, Gear ‘98 Perspective/continuous: Vidal-Soatto-Sastry ’02 Omnidirectional/continuous: Shakernia-Vidal-Sastry ’03 Special cases: Points in a line (orth-discrete): Han and Kanade ’00 Points in a conic (perspective): Avidan-Shashua ’01 Points in a line (persp.-continuous): Levin-Shashua ’01 2-body case: Wolf-Shashua ‘01

Previous work: probabilistic techniques Probabilistic approaches Generative model: data membership + motion model Obtain motion models using Expectation Maximization E-step: Given motion models, segment image data M-step: Given data segmentation, estimate motion models 2D Motion Segmentation Layered representation (Jepson-Black’93, Ayer-Sawhney ’95, Darrel-Pentland’95, Weiss-Adelson’96, Weiss’97, Torr-Szeliski-Anandan ’99) 3D Motion Segmentation EM+Reprojection Error: Feng-Perona’98 EM+Model Selection: Torr ’98 How to initialize iterative algorithms?

Our approach to motion segmentation This work considers full perspective projection multiple objects general motion We show that Problem is equivalent to polynomial factorization There is a unique global closed form solution if n<5 Exact solution is obtained using linear algebra Can be used to initialize EM-based algorithms Image points Number of motions Multibody Fund. Matrix Multi epipolar lines Epipolar lines Multi epipole Epipoles Fundamental Matrices Motion segmentation

One-dimensional segmentation Number of models?

One-dimensional segmentation For n groups Number of groups Groups

Three-dimensional motion segmentation Generalized PCA (Vidal et.al. ‘02) Solve for the roots of a polynomial of degree in one variable Solve for a linear system in variables

The multibody epipolar constraint Rotation: Translation: Epipolar constraint Multiple motions Multibody epipolar constraint Satisfied by ALL points regardless of segmentation Segmentation is algebraically eliminated!!!

The multibody fundamental matrix Lifting Embedding Bilinear on embedded data! Veronese map (polynomial embedding) Multibody fundamental matrix

Estimation of the number of motions Theorem: Given image points corresponding to motions, if at least 8 points correspond to each object, then 1 2 4 3 8 35 99 225 Minimum number of points

Estimation of multibody fundamental matrix 1-body motion n-body motion

Segmentation of fundamental matrices Given rank condition for n motions linear system F Multibody epipolar transfer Multibody epipole Fundamental matrices

Multibody epipolar transfer Lifting Multibody epipolar line Polynomial factorization

Multibody epipole Lifting The multibody epipole is the solution of the linear system Number of distinct epipoles Epipoles are obtained using polynomial factorization

Fundamental matrices Columns of are epipolar lines Polynomial factorization to compute them up to scale Scales can be computed linearly

The multibody 8-point algorithm Image point Veronese map Embedded image point Multibody epipolar transfer Multibody epipolar line Polynomial Factorization Epipolar lines Linear system Multibody epipole Polynomial Factorization Epipoles Linear system Fundamental matrix

Optimal 3D motion segmentation Zero-mean Gaussian noise Constrained optimization problem on Optimal function for 1 motion Optimal function for n motions Solved using Riemanian Gradient Descent

Comparison of 1 and n bodies

Linearly moving objects 1 2 10 5 20 65 4 3 8 35 99 225 Minimum number of points Multibody epipole Recovery of epipoles Fundamental matrices Feature segmentation

3D motion segmentation results

Conclusions There is an analytic solution to 3D motion segmentation based on Multibody epipolar constraint: it does not depend on the segmentation of the data Polynomial factorization: linear algebra Solution is closed form iff n<5 A similar technique also applies to Eigenvector segmentation: from similarity matrices Generalized PCA: mixtures of subspaces 2-D motion segmentation: of affine motions Future work Reduce data complexity, sensitivity analysis, robustness

References R. Vidal, Y. Ma, S. Soatto and S. Sastry. Two-view multibody structure from motion, International Journal of Computer Vision, 2004 R. Vidal and S. Sastry. Optimal segmentation of dynamic scenes from two perspective views, International Conference on Computer Vision and Pattern Recognition, 2003 R. Vidal and S. Sastry. Segmentation of dynamic scenes from image intensities, IEEE Workshop on Vision and Motion Computing, 2002.