Structure from motion II

Slides:



Advertisements
Similar presentations
The fundamental matrix F
Advertisements

Structure from motion Digital Visual Effects Yung-Yu Chuang with slides by Richard Szeliski, Steve Seitz, Zhengyou Zhang and Marc Pollefyes.
Structure from motion Digital Visual Effects, Spring 2005 Yung-Yu Chuang 2005/4/20 with slides by Richard Szeliski, Steve Seitz, Zhengyou Zhang and Marc.
PCA + SVD.
3D Reconstruction – Factorization Method Seong-Wook Joo KG-VISA 3/10/2004.
N-view factorization and bundle adjustment CMPUT 613.
Structure from motion.
Algorithms & Applications in Computer Vision
Stanford CS223B Computer Vision, Winter 2005 Lecture 11: Structure From Motion 2 Sebastian Thrun, Stanford Rick Szeliski, Microsoft Hendrik Dahlkamp and.
Today Feature Tracking Structure from Motion on Monday (1/29)
Stanford CS223B Computer Vision, Winter 2007 Lecture 8 Structure From Motion Professors Sebastian Thrun and Jana Košecká CAs: Vaibhav Vaish and David Stavens.
Direct Methods for Visual Scene Reconstruction Paper by Richard Szeliski & Sing Bing Kang Presented by Kristin Branson November 7, 2002.
Epipolar geometry. (i)Correspondence geometry: Given an image point x in the first view, how does this constrain the position of the corresponding point.
Structure from motion. Multiple-view geometry questions Scene geometry (structure): Given 2D point matches in two or more images, where are the corresponding.
Uncalibrated Geometry & Stratification Sastry and Yang
Real-time Combined 2D+3D Active Appearance Models Jing Xiao, Simon Baker,Iain Matthew, and Takeo Kanade CVPR 2004 Presented by Pat Chan 23/11/2004.
Stanford CS223B Computer Vision, Winter 2006 Lecture 8 Structure From Motion Professor Sebastian Thrun CAs: Dan Maynes-Aminzade, Mitul Saha, Greg Corrado.
Lecture 11: Structure from motion CS6670: Computer Vision Noah Snavely.
Structure From Motion Sebastian Thrun, Gary Bradski, Daniel Russakoff
Previously Two view geometry: epipolar geometry Stereo vision: 3D reconstruction epipolar lines Baseline O O’ epipolar plane.
CSCE 641 Computer Graphics: Image-based Modeling (Cont.) Jinxiang Chai.
Camera parameters Extrinisic parameters define location and orientation of camera reference frame with respect to world frame Intrinsic parameters define.
CSCE 641 Computer Graphics: Image-based Modeling (Cont.) Jinxiang Chai.
Lecture 12: Structure from motion CS6670: Computer Vision Noah Snavely.
Computer vision: models, learning and inference
CSE554AlignmentSlide 1 CSE 554 Lecture 8: Alignment Fall 2014.
Camera Geometry and Calibration Thanks to Martial Hebert.
CSE554AlignmentSlide 1 CSE 554 Lecture 5: Alignment Fall 2011.
Course 12 Calibration. 1.Introduction In theoretic discussions, we have assumed: Camera is located at the origin of coordinate system of scene.
Structure from Motion Computer Vision CS 143, Brown James Hays 11/18/11 Many slides adapted from Derek Hoiem, Lana Lazebnik, Silvio Saverese, Steve Seitz,
Structure from Motion Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/06/12 Many slides adapted from Lana Lazebnik, Silvio Saverese,
EMIS 8381 – Spring Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.
Camera calibration Digital Visual Effects Yung-Yu Chuang with slides by Richard Szeliski, Steve Seitz,, Fred Pighin and Marc Pollefyes.
CSCE 643 Computer Vision: Structure from Motion
Lec 22: Stereo CS4670 / 5670: Computer Vision Kavita Bala.
Last lecture Passive Stereo Spacetime Stereo Multiple View Stereo.
Affine Structure from Motion
Advanced Computer Vision Structure from Motion1 Chapter 7 S TRUCTURE FROM M OTION.
A Flexible New Technique for Camera Calibration Zhengyou Zhang Sung Huh CSPS 643 Individual Presentation 1 February 25,
Last lecture Passive Stereo Spacetime Stereo.
EECS 274 Computer Vision Affine Structure from Motion.
Structure from motion Digital Visual Effects, Spring 2008 Yung-Yu Chuang 2008/4/22 with slides by Richard Szeliski, Steve Seitz, Zhengyou Zhang and Marc.
776 Computer Vision Jan-Michael Frahm & Enrique Dunn Spring 2013.
Large-Scale Matrix Factorization with Missing Data under Additional Constraints Kaushik Mitra University of Maryland, College Park, MD Sameer Sheoreyy.
Structure from Motion ECE 847: Digital Image Processing
Structure from Motion Paul Heckbert, Nov , Image-Based Modeling and Rendering.
Structure from motion Multi-view geometry Affine structure from motion Projective structure from motion Planches : –
Announcements No midterm Project 3 will be done in pairs same partners as for project 2.
EECS 274 Computer Vision Projective Structure from Motion.
Structure from Motion. For now, static scene and moving cameraFor now, static scene and moving camera – Equivalently, rigidly moving scene and static.
Camera Calibration Course web page: vision.cis.udel.edu/cv March 24, 2003  Lecture 17.
Structure from motion Digital Visual Effects, Spring 2007 Yung-Yu Chuang 2007/4/24 with slides by Richard Szeliski, Steve Seitz, Zhengyou Zhang and Marc.
Reconstruction of a Scene with Multiple Linearly Moving Objects Mei Han and Takeo Kanade CISC 849.
Structure from motion Digital Visual Effects Yung-Yu Chuang with slides by Richard Szeliski, Steve Seitz, Zhengyou Zhang and Marc Pollefyes.
CSE 554 Lecture 8: Alignment
CS4670 / 5670: Computer Vision Kavita Bala Lec 27: Stereo.
Motion Segmentation with Missing Data using PowerFactorization & GPCA
Digital Visual Effects, Spring 2007 Yung-Yu Chuang 2007/4/17
Digital Visual Effects, Spring 2006 Yung-Yu Chuang 2005/4/19
Digital Visual Effects, Spring 2008 Yung-Yu Chuang 2008/4/15
SFM under orthographic projection
Parameter estimation class 5
Two-view geometry Computer Vision Spring 2018, Lecture 10
Epipolar geometry.
Structure from motion Input: Output: (Tomasi and Kanade)
Digital Visual Effects Yung-Yu Chuang
Digital Visual Effects, Spring 2006 Yung-Yu Chuang 2005/4/26
Structure from motion.
Structure from motion Input: Output: (Tomasi and Kanade)
Presentation transcript:

Structure from motion II Digital Visual Effects, Spring 2005 Yung-Yu Chuang 2005/5/4 with slides by Richard Szeliski, Steve Seitz, Marc Pollefyes and Daniel Martinec

Announcements Project #2 artifacts voting. Project #3 will be online tomorrow, hopefully. Scribe schedule.

Outline Factorization methods Project #3 Orthogonal Missing data Projective Projective with missing data Project #3

Recap: epipolar geometry

Structure from motion

Structure from motion SFM pipeline 2D feature geometry optimization tracking 3D estimation optimization (bundle adjust) geometry fitting SFM pipeline

Factorization methods

Notations n 3D points are seen in m views q=(u,v,1): 2D image point p=(x,y,z,1): 3D scene point : projection matrix : projection function qij is the projection of the i-th point on image j ij projective depth of qij

SFM under orthographic projection matrix 2D image point 3D scene point image offset Trick Choose scene origin to be centroid of 3D points Choose image origins to be centroid of 2D points Allows us to drop the camera translation:

factorization (Tomasi & Kanade) projection of n features in one image: projection of n features in m images W measurement M motion S shape Key Observation: rank(W) <= 3

Factorization Factorization Technique known solve for Factorization Technique W is at most rank 3 (assuming no noise) We can use singular value decomposition to factor W: S’ differs from S by a linear transformation A: Solve for A by enforcing metric constraints on M

Metric constraints Orthographic Camera Enforcing “Metric” Constraints Rows of P are orthonormal: Enforcing “Metric” Constraints Compute A such that rows of M have these properties Trick (not in original Tomasi/Kanade paper, but in followup work) Constraints are linear in AAT : Solve for G first by writing equations for every Pi in M Then G = AAT by SVD (since U = V)

Factorization with noisy data SVD gives this solution Provides optimal rank 3 approximation W’ of W Approach Estimate W’, then use noise-free factorization of W’ as before Result minimizes the SSD between positions of image features and projection of the reconstruction

Factorization method with missing data

Why missing data? occlusions tracking failure W is only partially filled, factorization doesn’t work

Tomasi & Kanade Hallucination/propagation 4 points in 3 views determine structure and motion

Tomasi & Kanade

Tomasi & Kanade Solve for i4 and j4:

Tomasi & Kanade Alternatively, first apply factorization on

Tomasi & Kanade Disadvantages Finding the largest full submatrix of a matrix with missing elements is NP-hard. The data is not used symmetrically, these inaccuracies will propagate in the computation of additional missing elements.

Shum, Ikeuchi & Reddy Treat SVD as a PCA with missing data problem which is a weighted least square problem. Assume that W consists of n m-d points with mean t and covariance . If the rank of W is r, the problem of PCA is to find U,S,V such that is minimal. If W is incomplete, it becomes

Shum, Ikeuchi & Reddy To be solvable, the number of observable elements c in W must be larger than r(m+n-r) If we arrange W as an c-d vector w, we can rewrite it as To reach minimum, u and v satisfies:

Shum, Ikeuchi & Reddy Nonlinear, solved by iterating between “fixing v and solving u” and “fixing u and solving v” initialize v update stop if convergence, or go back to step 2 The above procedure can be further simplified by taking advantage of the sparse structure.

Shum, Ikeuchi & Reddy Disadvantages: sensitive to the starting point

Linear fitting Try to find a rank-r matrix so that is minimal. Each column of W is an m-d vector. SVD tries to find an r-d linear space L that is closest to these n m-d vectors and projects these vectors to L. A matrix describes a vector space.

Linear fitting Without noise, each triplet of columns of M exactly specifies L. When there is missing data, each triplet only forms a constraint. For SFM, r=3, We can combine constraints to find L

Linear fitting Let Nt denote a matrix representation of , that is, each column of Nt is a vector orthogonal to the space . If N=[N1, N2,… Nl], then L is the null space of N. Because of noise, the matrix N will typically have full rank. Taking the SVD of N, and find its three least significant components. If fourth smallest singular value of this matrix is less than 0.001, the result is unreliable. This method can be used as the initialization for Shum’s method.

Factorization method with projective projection

Factorization for projective projection depth W has rank at most 4. The problem is that we don’t know .

Sturm & Triggs For the p-th point, its projective depths for the i-th and j-th images are related by

Sturm & Triggs

Factorization method with projective projection and missing data

Mahamud et. al.

Mahamud et. al.

Project #3 Matchmove Assigned: 5/4 Due: 11:59pm 5/24 Work in pairs Implement Tomasi/Kanade factorization method. Some matlab implementations are provided as reference for implementation details.

Bells & whistles Tracking Extensions of factorization methods (Jacobs, Mahamud are recommended) Bundle adjustment Better graphics composition

Artifacts Take your own movie and insert some objects into it. Sony TRV900, progressive mode, 15fps Capturing machine in 219 Demo of how to capturing video

Submission You have to turn in your complete source, the executable, a html report and an artifact. Report page contains: description of the project, what do you learn, algorithm, implementation details, results, bells and whistles… Artifacts must be made using your own program. artifacts voting on forum.

Reference software Famous matchmove software include 3D-Equalizer, boujou, REALVIS MatchMover, PixelFarm PFTrack... Most are very expensive We will use Icarus, predecessor of PFTrack. It will be available at project’s page (id/password).

ICARUS Three main components: Capturing video Distortion Calibration Reconstruction Capturing video Enough depth variance Fixed zoom if possible Static scene if possible

Reference Heung-Yeung Shum, Katsushi Ikeuchi and Raj Reddy, Principal Component Analysis with Missing Data and Its Application to Polyhedral Object Modeling, PAMI 17(9), 1995. David Jacobs, Linear Fitting with Missing Data for Structure from Motion, Computer Vision and Image Understanding, 2001 Peter Sturm and Bill Triggs, A factorization Based Algorithm for Multi-Image Projective Structure and Motion, ECCV 1996. Shyjan Mahamud, Martial Hebert, Yasuhiro Omori and Jean Ponce, Provably-Convergent Iterative Methods for Projective Structure from Motion, ICCV 2001.