3D Reconstruction – Factorization Method Seong-Wook Joo KG-VISA 3/10/2004.

Slides:



Advertisements
Similar presentations
3D Geometry for Computer Graphics
Advertisements

Announcements. Structure-from-Motion Determining the 3-D structure of the world, and/or the motion of a camera using a sequence of images taken by a moving.
Invariants (continued).
3D reconstruction.
Robot Vision SS 2005 Matthias Rüther 1 ROBOT VISION Lesson 3: Projective Geometry Matthias Rüther Slides courtesy of Marc Pollefeys Department of Computer.
Projective Geometry- 3D
Two-View Geometry CS Sastry and Yang
Camera Models CMPUT 498/613 Richard Hartley and Andrew Zisserman, Multiple View Geometry, Cambridge University Publishers, 2000 Readings: HZ Ch 6, 7.
Camera Models A camera is a mapping between the 3D world and a 2D image The principal camera of interest is central projection.
Structure from motion.
Computer Graphics Recitation 5.
Linear Algebra and SVD (Some slides adapted from Octavia Camps)
3D Geometry for Computer Graphics
Multiple View Geometry Marc Pollefeys University of North Carolina at Chapel Hill Modified by Philippos Mordohai.
Structure-from-Motion Determining the 3-D structure of the world, and/or the motion of a camera using a sequence of images taken by a moving camera. –Equivalently,
Epipolar geometry. (i)Correspondence geometry: Given an image point x in the first view, how does this constrain the position of the corresponding point.
Structure from motion. Multiple-view geometry questions Scene geometry (structure): Given 2D point matches in two or more images, where are the corresponding.
Uncalibrated Geometry & Stratification Sastry and Yang
Real-time Combined 2D+3D Active Appearance Models Jing Xiao, Simon Baker,Iain Matthew, and Takeo Kanade CVPR 2004 Presented by Pat Chan 23/11/2004.
Multiple-view Reconstruction from Points and Lines
Uncalibrated Epipolar - Calibration
Multiple View Geometry Marc Pollefeys University of North Carolina at Chapel Hill Modified by Philippos Mordohai.
Cameras and Projections Dan Witzner Hansen Course web page:
© 2003 by Davi GeigerComputer Vision October 2003 L1.1 Structure-from-EgoMotion (based on notes from David Jacobs, CS-Maryland) Determining the 3-D structure.
Previously Two view geometry: epipolar geometry Stereo vision: 3D reconstruction epipolar lines Baseline O O’ epipolar plane.
Affine Structure-from-Motion: A lot of frames (1) ISP.
The Pinhole Camera Model
CSCE 641 Computer Graphics: Image-based Modeling (Cont.) Jinxiang Chai.
Linear Algebra Review By Tim K. Marks UCSD Borrows heavily from: Jana Kosecka Virginia de Sa (UCSD) Cogsci 108F Linear.
Alignment Introduction Notes courtesy of Funk et al., SIGGRAPH 2004.
Euclidean cameras and strong (Euclidean) calibration Intrinsic and extrinsic parameters Linear least-squares methods Linear calibration Degenerate point.
Epipolar geometry The fundamental matrix and the tensor
1 Preview At least two views are required to access the depth of a scene point and in turn to reconstruct scene structure Multiple views can be obtained.
© 2005 Yusuf Akgul Gebze Institute of Technology Department of Computer Engineering Computer Vision Geometric Camera Calibration.
Course 12 Calibration. 1.Introduction In theoretic discussions, we have assumed: Camera is located at the origin of coordinate system of scene.
Geometric Models & Camera Calibration
Structure from Motion Computer Vision CS 143, Brown James Hays 11/18/11 Many slides adapted from Derek Hoiem, Lana Lazebnik, Silvio Saverese, Steve Seitz,
视觉的三维运动理解 刘允才 上海交通大学 2002 年 11 月 16 日 Understanding 3D Motion from Images Yuncai Liu Shanghai Jiao Tong University November 16, 2002.
Multiview Geometry and Stereopsis. Inputs: two images of a scene (taken from 2 viewpoints). Output: Depth map. Inputs: multiple images of a scene. Output:
Scientific Computing Singular Value Decomposition SVD.
Parameter estimation. 2D homography Given a set of (x i,x i ’), compute H (x i ’=Hx i ) 3D to 2D camera projection Given a set of (X i,x i ), compute.
Affine Structure from Motion
Advanced Computer Vision Structure from Motion1 Chapter 7 S TRUCTURE FROM M OTION.
EECS 274 Computer Vision Affine Structure from Motion.
1 Chapter 2: Geometric Camera Models Objective: Formulate the geometrical relationships between image and scene measurements Scene: a 3-D function, g(x,y,z)
EIGENSYSTEMS, SVD, PCA Big Data Seminar, Dedi Gadot, December 14 th, 2014.
776 Computer Vision Jan-Michael Frahm & Enrique Dunn Spring 2013.
Structure from Motion ECE 847: Digital Image Processing
Image processing and computer vision
Reconstruction from Two Calibrated Views Two-View Geometry
Determining 3D Structure and Motion of Man-made Objects from Corners.
Uncalibrated reconstruction Calibration with a rig Uncalibrated epipolar geometry Ambiguities in image formation Stratified reconstruction Autocalibration.
Structure from Motion Paul Heckbert, Nov , Image-Based Modeling and Rendering.
Camera Models class 8 Multiple View Geometry Comp Marc Pollefeys.
Structure from Motion. For now, static scene and moving cameraFor now, static scene and moving camera – Equivalently, rigidly moving scene and static.
Parameter estimation class 5 Multiple View Geometry CPSC 689 Slides modified from Marc Pollefeys’ Comp
Reconstruction of a Scene with Multiple Linearly Moving Objects Mei Han and Takeo Kanade CISC 849.
CS246 Linear Algebra Review. A Brief Review of Linear Algebra Vector and a list of numbers Addition Scalar multiplication Dot product Dot product as a.
55:148 Digital Image Processing Chapter 11 3D Vision, Geometry
Structure from Motion ECE 847: Digital Image Processing
Epipolar geometry.
Structure from motion Input: Output: (Tomasi and Kanade)
Multiple View Geometry Comp Marc Pollefeys
Uncalibrated Geometry & Stratification
Principal Component Analysis (PCA)
George Mason University
Reconstruction.
Course 7 Motion.
Lecture 13: Singular Value Decomposition (SVD)
Structure from motion Input: Output: (Tomasi and Kanade)
Presentation transcript:

3D Reconstruction – Factorization Method Seong-Wook Joo KG-VISA 3/10/2004

Problem Setup P feature points (u p,v p ) from F frames Input: measurement matrix centroid = origin assumed for all frames Goal: Find motion and structure u 11 u 12    u 21 u 22    v 11 v 12    v 21 v 22    features frames

Camera model: orthographic camera i and j are unit vectors representing the x and y axis of the image plane in world coordinates. Camera matrix is essentially the rotation matrix with orthographic projection (no third row) No translation

Rotation matrix and shape matrix Measurement matrix W can be expressed as Where R, S represents rotation and shape

The Rank Theorem Since R is 2F  3 and S is 3  P, in the ideal case (without noise), W is at most of rank three. The rank theorem says the measurement matrix is highly redundant. In fact it resides in a 3-dimensional subspace.

The Factorization Algorithm SVD is used to decompose W into R and S. (Assuming 2F  P) Since the rank of W is at most 3, only the first three singular values (diagonal elements in  ) should be non-zero. But this does not hold in practice because of noise. Therefore the best rank-3 approximation W to the ideal W is obtained by taking the top 3 singular values.

Affine Reconstruction define so that e.g., However the decomposition is not unique. If Q is any invertible matrix, below is also a valid decomposition.

Euclidean Reconstruction Suppose the true R and S can be obtained by the linear transformation Q To find Q, We use the constraint that R consists of orthonormal vectors.

Shape Reconstruction Result

Extensions to Other Camera Models Affine camera –Scaled orthographic (weak perspective) Unknown scale factor f for each frame –Paraperspective camera matrix is still a 2x3 matrix (affine), with unknown offset m f and scale f for each frame –Same as orthographic case up to the affine reconstruction step –Use orthonormality of Rotation vectors to also solve for the additional unknowns Projective camera –Use depth( fp )-multiplied measurement matrix W –Depth estimation is another issue Reference –

Could we have used PCA? Measurement vectors A Px2F = [u 1 …u F v 1 …v F ] –Suppose we don’t know anything about camera geometry –Noisy measurements of unknown (hopefully linear) process We want –Invariant structure underlying the measurement data  shape –(variant) coefficients that gives a particular frame  motion PCA –Largest Eigenvectors of AA T : e 1, e 2, e 3  E AA T = E D E T –Comparing with the SVD A=W T =O 2 T  O 1 T AA T = O 2 T  O 1 T O 1  O 2 = O 2  2 O 2 T –E is essentially O 2, the “structure”

SVD output formats “Economy size” A mxn U mxn D nxn V T nxn m>n m<n Matlab default: D is the same size as A A mxn U mxm D mxn V T nxn A mxn U mxm D mxn V T nxn m>n m<n A mxn U mxm D mxm V T mxn (possible in theory, but Matlab doesn’t give this)