Large-Scale Matrix Factorization with Missing Data under Additional Constraints Kaushik Mitra University of Maryland, College Park, MD 20742 Sameer Sheoreyy.

Slides:

Advertisements

Similar presentations

Active Appearance Models

Advertisements

Factorial Mixture of Gaussians and the Marginal Independence Model Ricardo Silva Joint work-in-progress with Zoubin Ghahramani.

The fundamental matrix F

Invariants (continued).

Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.

3D reconstruction.

Globally Optimal Estimates for Geometric Reconstruction Problems Tom Gilat, Adi Lakritz Advanced Topics in Computer Vision Seminar Faculty of Mathematics.

Algebraic, transcendental (i.e., involving trigonometric and exponential functions), ordinary differential equations, or partial differential equations...

T HE POWER OF C ONVEX R ELAXATION : N EAR - OPTIMAL MATRIX COMPLETION E MMANUEL J. C ANDES AND T ERENCE T AO M ARCH, 2009 Presenter: Shujie Hou February,

Computer vision: models, learning and inference

by Rianto Adhy Sasongko Supervisor: Dr.J.C.Allwright

Graph Laplacian Regularization for Large-Scale Semidefinite Programming Kilian Weinberger et al. NIPS 2006 presented by Aggeliki Tsoli.

Data mining and statistical learning - lecture 6

Bayesian Robust Principal Component Analysis Presenter: Raghu Ranganathan ECE / CMR Tennessee Technological University January 21, 2011 Reading Group (Xinghao.

Visual Recognition Tutorial

Computer Vision Optical Flow

Structure from motion.

Active Calibration of Cameras: Theory and Implementation Anup Basu Sung Huh CPSC 643 Individual Presentation II March 4 th,

Probabilistic video stabilization using Kalman filtering and mosaicking.

Epipolar geometry. (i)Correspondence geometry: Given an image point x in the first view, how does this constrain the position of the corresponding point.

Structure from motion. Multiple-view geometry questions Scene geometry (structure): Given 2D point matches in two or more images, where are the corresponding.

Uncalibrated Geometry & Stratification Sastry and Yang

Visibility Subspaces: Uncalibrated Photometric Stereo with Shadows Kalyan Sunkavalli, Harvard University Joint work with Todd Zickler and Hanspeter Pfister.

Multiple-view Reconstruction from Points and Lines

Many slides and illustrations from J. Ponce

Previously Two view geometry: epipolar geometry Stereo vision: 3D reconstruction epipolar lines Baseline O O’ epipolar plane.

Course AE4-T40 Lecture 5: Control Apllication

Structure Computation. How to compute the position of a point in 3- space given its image in two views and the camera matrices of those two views Use.

Computer vision: models, learning and inference

Jinhui Tang †, Shuicheng Yan †, Richang Hong †, Guo-Jun Qi ‡, Tat-Seng Chua † † National University of Singapore ‡ University of Illinois at Urbana-Champaign.

Computing the Fundamental matrix Peter Praženica FMFI UK May 5, 2008.

Matrix Completion IT530 Lecture Notes.

1/20 Obtaining Shape from Scanning Electron Microscope Using Hopfield Neural Network Yuji Iwahori 1, Haruki Kawanaka 1, Shinji Fukui 2 and Kenji Funahashi.

Camera Geometry and Calibration Thanks to Martial Hebert.

Epipolar geometry The fundamental matrix and the tensor

© 2005 Yusuf Akgul Gebze Institute of Technology Department of Computer Engineering Computer Vision Geometric Camera Calibration.

Course 12 Calibration. 1.Introduction In theoretic discussions, we have assumed: Camera is located at the origin of coordinate system of scene.

Handling Outliers and Missing Data in Statistical Data Models Kaushik Mitra Date: 17/1/2011 ECSU Seminar, ISI.

Y. Moses 11 Combining Photometric and Geometric Constraints Yael Moses IDC, Herzliya Joint work with Ilan Shimshoni and Michael Lindenbaum, the Technion.

视觉的三维运动理解刘允才上海交通大学 2002 年 11 月 16 日 Understanding 3D Motion from Images Yuncai Liu Shanghai Jiao Tong University November 16, 2002.

Orthogonalization via Deflation By Achiya Dax Hydrological Service Jerusalem, Israel

Efficient computation of Robust Low-Rank Matrix Approximations in the Presence of Missing Data using the L 1 Norm Anders Eriksson and Anton van den Hengel.

Low-Rank Kernel Learning with Bregman Matrix Divergences Brian Kulis, Matyas A. Sustik and Inderjit S. Dhillon Journal of Machine Learning Research 10.

A Note on Rectangular Quotients By Achiya Dax Hydrological Service Jerusalem, Israel

E XACT MATRIX C OMPLETION VIA CONVEX OPTIMIZATION E MMANUEL J. C ANDES AND B ENJAMIN R ECHT M AY 2008 Presenter: Shujie Hou January, 28 th,2011 Department.

A Flexible New Technique for Camera Calibration Zhengyou Zhang Sung Huh CSPS 643 Individual Presentation 1 February 25,

EECS 274 Computer Vision Affine Structure from Motion.

Multi-area Nonlinear State Estimation using Distributed Semidefinite Programming Hao Zhu October 15, 2012 Acknowledgements: Prof. G.

Robust Principal Components Analysis IT530 Lecture Notes.

STATIC ANALYSIS OF UNCERTAIN STRUCTURES USING INTERVAL EIGENVALUE DECOMPOSITION Mehdi Modares Tufts University Robert L. Mullen Case Western Reserve University.

Tal Amir Advanced Topics in Computer Vision May 29 th, 2015 COUPLED MOTION- LIGHTING ANALYSIS.

Affine Registration in R m 5. The matching function allows to define tentative correspondences and a RANSAC-like algorithm can be used to estimate the.

Structure from motion Multi-view geometry Affine structure from motion Projective structure from motion Planches : –

Instructor: Mircea Nicolescu Lecture 9

EECS 274 Computer Vision Projective Structure from Motion.

Lec 26: Fundamental Matrix CS4670 / 5670: Computer Vision Kavita Bala.

MAN-522 Computer Vision Spring

Lecture 16: Image alignment

René Vidal and Xiaodong Fan Center for Imaging Science

Matrix Completion from a few entries

Parameter estimation class 5

Two-view geometry Computer Vision Spring 2018, Lecture 10

Epipolar geometry.

Structure from motion Input: Output: (Tomasi and Kanade)

Nuclear Norm Heuristic for Rank Minimization

Chapter 6. Large Scale Optimization

Reconstruction.

Structure from motion Input: Output: (Tomasi and Kanade)

Chapter 6. Large Scale Optimization

Alan Girling University of Birmingham, UK

Presentation transcript:

Large-Scale Matrix Factorization with Missing Data under Additional Constraints Kaushik Mitra University of Maryland, College Park, MD Sameer Sheoreyy Toyota Technological Institute, Chicago Rama Chellappa University of Maryland, College Park, MD 20742

Introduction Computer vision problems such as SfM, non-rigid SfM and photometric stereo can be formulated as a matrix factorization problem. In these problems, the measured data are observations of the elements of an m * n measurement matrix M of known rank r. The objective is to factorize this measurement matrix M into factors A and B of dimensions m * r and n * r, respectively such that the error ||M – AB T ||is minimized.

Introduction In real applications many of the elements of M will be missing and we need to solve a modified problem given by

Introduction Matrix factorization with missing data is a difficult non-convex problem with no known globally convergent algorithm. The damped Newton algorithm is one popular algorithms for solving this problem. But this algorithm has high computational complexity and memory requirements and so cannot be used for solving large scale problems. The paper formulates matrix factorization with missing data problem as a LRSDP, which is essentially a rank constrained semidefinite programming problem (SDP) to solve large SDP in an efficient way. Advantages of formulating the matrix factorization problem as a LRSDP : 1) It inherits the efficiency of the LRSDP algorithm. The LRSDP algorithm is based on a quasi-Newton method which has lower computational complexity and memory requirements than that of Newton's method, and so is ideally suited for solving large scale problems. 2) Many additional constraints, such as the ortho-normality constraints for the orthographic SfM, can be easily incorporated into the LRSDP-based factorization formulation.

Low-rank semidefinite programming (LRSDP) LRSDP was proposed to efficiently solve a large scale SDP. SDP is a subfield of convex optimization concerned with the optimization of a linear objective function over the intersection of the cone of positive semidefinite matrices with an affine space. The standard-form SDP is given by: standard-form SDP is given by:

Low-rank semidefinite programming (LRSDP) The LRSDP formulation is given by Where X = RR T and R is real n * r matrix with r <= n Intention is to choose small r while keeping the problem same. Advantage : removes non linear constraint X >= 0 Cost : No longer a convex problem A non-linear optimization technique called the augmented Lagrangian method is used for solving. The majority of the iterations in this algorithm involve the minimization of an augmented Lagrangian function with respect to the variable R.

Matrix factorization using LRSDP Matrix factorization with missing data as an LRSDP problem. Cases : 1.Noiseless case 2.Noisy case 3.Enforcing Additional Constraints

Noiseless Case

So the LRSDP form equation can be represented as, This completes formulation of matrix factorization problem as an LRDSP problem for noiseless case.

Noisy case When Matrix M is corrupted with noise, the cost function to minimize will be, To formulate this as an LRSDP problem, we introduce noise variables el, l = 1, 2, 3, … | Ω| which are defined as el = (M – (AB T )) l. So the equation will be,

Noisy case To formulate this as a LRSDP problem. We construct an augmented noise vector E = [e T 1] T and define R to be Then RR T will be So LRDSP can be expressed as, This completes formulation fo LRSDP problem in Noisy case.

Enforcing Additional Constraints To illustrate constraints can be easily incorporated in the LRSDP formulation authors have used example of orthographic SfM. SfM is the problem of reconstructing the scene structure (3-D point positions and camera parameters) from 2-D projections of the points in the cameras. Suppose that m/2 cameras are looking at n 3-D points, then under the affine camera model, the 2-D imaged points can be arranged as an m * n measurement matrix M with columns corresponding to the n 3-D points and rows corresponding to the m/2 cameras. M can be factorized as M = AB T, where A is a m * 4 camera matrix and B is a n * 4 structure matrix. Further, under the orthographic camera model, A has more structure (constraints): pair of 'rows' that corresponds to the same camera is ortho-normal. To state this constraints precisely, we decompose the A matrix as A = [P t] where P is a m * 3 sub-matrix consisting of the first three columns and t is the last column vector. We can now express the camera ortho-normality constraint through the PP T matrix. AB T = PX + t1T and the observation error can be expressed as el = (M - PX) l – t i for Ω (l) = (I, j).

Enforcing Additional Constraints Cost optimization function to solve would be to minimize the observational error subject to ortho-normality constraints : To formulate this as an LRSDP problem, we introduce the augmented translation variable T = [t T 1] T, and propose the following block-diagonal matrix R As following earlier steps we get LRSDP equation. This example shows how we can incorporate application-specific constraints into the LRSDP.

Matrix Completion Theory Discuss result of the matrix completion theory and its implications for the matrix factorization problem. Matrix completion theory considers the problem of recovering a low-rank matrix from a few samples of its entries: It considers the questions like : 1) when does a partially observed matrix have a unique low-rank solution? 2) How can this matrix be recovered? Conditions : If the matrix M, that we want to recover, has row and columns spaces incoherent with the standard basis and 2) we are given enough entries (>= O(rd 6/5 log d), where d = max(m, n)), then Low-rank the solution can be obtained by solving a convex relaxation of above equation given by

Relations with Matrix Factorization and its Implications Matrix completion and factorization problems are defined differently, but they are closely related as revealed by their very similar Lagrangian formulations. This fact has been used in solving the matrix completion problem via matrix factorization with an appropriate rank “When is missing data matrix factorization unique” - Conditions of the matrix completion theory are sufficient for guaranteeing us the required uniqueness. In experimental evaluations it is found that the LRSDP formulation, though a non-convex problem in general, converges to the global minimum solution under these conditions.

Experimental Evaluation We evaluate the performance of the proposed LRSDP-based factorization algorithm (MF-LRSDP) on both synthetic and real data and compare it against other algorithms such as alternation, damped Newton and OptSpace

Evaluation with Synthetic Data The important parameters in the matrix factorization with missing data problem are: the size of the matrix M characterized by m and n, rank r, fraction of missing data and the variance σ 2 of the observation noise. We evaluate the factorization algorithms by varying these parameters. Two cases to consider data without noise and data with noise.

Evaluation with Synthetic Data Exact Factorization: a first comparison. We study the reconstruction rate of different algorithms by varying the fraction of revealed entries per column | Ω|/n for noiseless matrices of rank 5. Reconstruction rate is defined as the fraction of trials for which the matrix was successfully reconstructed. In all the synthetic data experiments 10 trials were performed.

Evaluation with Synthetic Data

Exact Factorization: vary size. We study the reconstruction rate vs. fraction of revealed entries per column | Ω|/n for different sizes n of rank 5 square matrices. Exact Factorization: vary rank. We study the reconstruction rate vs. | Ω|/n as we vary the rank r of 500 * 500 matrices. Noisy Factorization: vary noise standard deviation. For noisy data, we use the root mean square error as a performance measure. We vary the standard deviation σ of the additive noise for rank 5, 200 * 200 matrices and study the performance.

Evaluation with Synthetic Data

Evaluation with Real Data Affine SfM. 1.M is generally an incomplete matrix because not all the points are visible in all the cameras 2.We evaluate the performance of MF-LRSDP on the `Dinosaur' sequence, for which M is a 72 * 319 matrix with 72% missing entries. We perform 25 trials. 3.MF-LRSDP gives the best performance followed by damped Newton, alternation and OptSpace. Non-rigid SfM. 1.In non-rigid SfM, non-rigid objects are expressed as a linear combination of b basis shapes. 2.We test the performance of the algorithms on the 'Giraffe' sequence for which M is a 240 * 167 matrix with 30% missing entries. 3.MF-LRSDP, alternation and damped Newton give good results

Evaluation with Real Data Photometric Stereo. 1.Photometric stereo is the problem of estimating the surface normals of an object by imaging that object under different lighting conditions. Suppose we have n images of the object under different lighting conditions with each image consisting of m pixels and we arrange them as an m * n measurement matrix M. 2.Some of the image pixels are likely to be affected by shadows and specularities and those pixels should not be included in the M matrix as they do not obey the Lambertian assumption. This makes M, an incomplete matrix 3.We test the algorithms on the `Face' sequence for which M is a 2944 * 20 matrix with 42% missing entries. 4.MF-LRSDP and damped Newton gives the best results followed by alternation and OptSpace.

Evaluation with Real Data

Additional constraints: Orthographic SfM. 1.Orthographic SfM is a special case of affine SfM, where the camera matrix A satisfies the additional constraint of ortho-normality. 2.Incorporating these constraints leads to a better solution and MR- LRSDP provides a very flexible framework for doing so.

Evaluation with Real Data

Conclusion Paper formulates the matrix factorization with missing data problem as a low-rank semidefinite programming problem, MF-LRSDP. MF-LRSDP is an efficient algorithm that can be used for solving large-scale factorization problems. MF-LRSDP is flexible for handling many additional constraints such as the ortho-normality constraints of orthographic SfM. Evaluations on synthetic data show that MF-LRSDP needs fewer observations for matrix factorization as compared to other algorithms and it gives very good results on the real problems of SfM, non-rigid SfM and photometric stereo. Though MF-LRSDP is a non-convex problem, it finds the global minimum under the conditions of matrix completion theory.

Thank You! Questions?