Distributed Nuclear Norm Minimization for Matrix Completion

Slides:



Advertisements
Similar presentations
1 Closed-Form MSE Performance of the Distributed LMS Algorithm Gonzalo Mateos, Ioannis Schizas and Georgios B. Giannakis ECE Department, University of.
Advertisements

Modeling Maze Navigation Consider the case of a stationary robot and a mobile robot moving towards a goal in a maze. We can model the utility of sharing.
Kick-off Meeting, July 28, 2008 ONR MURI: NexGeNetSci Distributed Coordination, Consensus, and Coverage in Networked Dynamic Systems Ali Jadbabaie Electrical.
Mobility assisted routing CS 218 F2008 Ad hoc mobility generally harmful Can mobility help in routing? –Mobility induced distributed route/directory tree.
Carnegie Mellon Distributed Inference: High Dimensional Consensus TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAA.
Javad Lavaei Department of Electrical Engineering Columbia University Graph-Theoretic Algorithm for Nonlinear Power Optimization Problems.
Probabilistic Graph and Hypergraph Matching
Brian Baingana, Gonzalo Mateos and Georgios B. Giannakis Dynamic Structural Equation Models for Tracking Cascades over Social Networks Acknowledgments:
Belief Propagation on Markov Random Fields Aggeliki Tsoli.
1 Morteza Mardani, Gonzalo Mateos and Georgios Giannakis ECE Department, University of Minnesota Acknowledgment: AFOSR MURI grant no. FA
ACCESS Group meeting Mikael Johansson Novel algorithms for peer-to-peer optimization in networked systems Björn Johansson and Mikael.
© University of Minnesota Data Mining for the Discovery of Ocean Climate Indices 1 CSci 8980: Data Mining (Fall 2002) Vipin Kumar Army High Performance.
Learning Low-Level Vision William T. Freeman Egon C. Pasztor Owen T. Carmichael.
1 Localization Technologies for Sensor Networks Craig Gotsman, Technion/Harvard Collaboration with: Yehuda Koren, AT&T Labs.
Research at Intel Distributed Localization of Modular Robot Ensembles Robotics: Science and Systems 25 June 2008 Stanislav Funiak, Michael Ashley-Rollman.
Jana van Greunen - 228a1 Analysis of Localization Algorithms for Sensor Networks Jana van Greunen.
The Role of Specialization in LDPC Codes Jeremy Thorpe Pizza Meeting Talk 2/12/03.
Group Norm for Learning Latent Structural SVMs Overview Daozheng Chen (UMD, College Park), Dhruv Batra (TTI Chicago), Bill Freeman (MIT), Micah K. Johnson.
Peter Richtárik (joint work with Martin Takáč) Distributed Coordinate Descent Method AmpLab All Hands Meeting - Berkeley - October 29, 2013.
Sparsity-Aware Adaptive Algorithms Based on Alternating Optimization and Shrinkage Rodrigo C. de Lamare* + and Raimundo Sampaio-Neto * + Communications.
כמה מהתעשייה? מבנה הקורס השתנה Computer vision.
Mathematical Programming in Support Vector Machines
Walter Hop Web-shop Order Prediction Using Machine Learning Master’s Thesis Computational Economics.
1 Sparsity Control for Robustness and Social Data Analysis Gonzalo Mateos ECE Department, University of Minnesota Acknowledgments: Profs. Georgios B. Giannakis,
1 Decentralized Jointly Sparse Optimization by Reweighted Lq Minimization Qing Ling Department of Automation University of Science and Technology of China.
A Distributed and Privacy Preserving Algorithm for Identifying Information Hubs in Social Networks M.U. Ilyas, Z Shafiq, Alex Liu, H Radha Michigan State.
Brian Baingana, Gonzalo Mateos and Georgios B. Giannakis A Proximal Gradient Algorithm for Tracking Cascades over Networks Acknowledgments: NSF ECCS Grant.
Principled Regularization for Probabilistic Matrix Factorization Robert Bell, Suhrid Balakrishnan AT&T Labs-Research Duke Workshop on Sensing and Analysis.
1 Unveiling Anomalies in Large-scale Networks via Sparsity and Low Rank Morteza Mardani, Gonzalo Mateos and Georgios Giannakis ECE Department, University.
Efficient Gathering of Correlated Data in Sensor Networks
1 Exact Recovery of Low-Rank Plus Compressed Sparse Matrices Morteza Mardani, Gonzalo Mateos and Georgios Giannakis ECE Department, University of Minnesota.
Consensus-based Distributed Estimation in Camera Networks - A. T. Kamal, J. A. Farrell, A. K. Roy-Chowdhury University of California, Riverside
“Study on Parallel SVM Based on MapReduce” Kuei-Ti Lu 03/12/2015.
Adaptive CSMA under the SINR Model: Fast convergence using the Bethe Approximation Krishna Jagannathan IIT Madras (Joint work with) Peruru Subrahmanya.
EMIS 8381 – Spring Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.
Recovering low rank and sparse matrices from compressive measurements Aswin C Sankaranarayanan Rice University Richard G. Baraniuk Andrew E. Waters.
Scalable and Fully Distributed Localization With Mere Connectivity.
BrainStorming 樊艳波 Outline Several papers on icml15 & cvpr15 PALM Information Theory Learning.
1 Sparsity Control for Robust Principal Component Analysis Gonzalo Mateos and Georgios B. Giannakis ECE Department, University of Minnesota Acknowledgments:
1 Self-stabilizing Algorithms and Frequency Assignment Problems.
Fast Maximum Margin Matrix Factorization for Collaborative Prediction Jason Rennie MIT Nati Srebro Univ. of Toronto.
Efficient computation of Robust Low-Rank Matrix Approximations in the Presence of Missing Data using the L 1 Norm Anders Eriksson and Anton van den Hengel.
Inference of Poisson Count Processes using Low-rank Tensor Data Juan Andrés Bazerque, Gonzalo Mateos, and Georgios B. Giannakis May 29, 2013 SPiNCOM, University.
Javad Lavaei Department of Electrical Engineering Columbia University Convex Relaxation for Polynomial Optimization: Application to Power Systems and Decentralized.
Multiuser Receiver Aware Multicast in CDMA-based Multihop Wireless Ad-hoc Networks Parmesh Ramanathan Department of ECE University of Wisconsin-Madison.
Multi-area Nonlinear State Estimation using Distributed Semidefinite Programming Hao Zhu October 15, 2012 Acknowledgements: Prof. G.
Network Lasso: Clustering and Optimization in Large Graphs
CoFi Rank : Maximum Margin Matrix Factorization for Collaborative Ranking Markus Weimer, Alexandros Karatzoglou, Quoc Viet Le and Alex Smola NIPS’07.
Rank Minimization for Subspace Tracking from Incomplete Data
Large-Scale Matrix Factorization with Missing Data under Additional Constraints Kaushik Mitra University of Maryland, College Park, MD Sameer Sheoreyy.
1 Robust Nonparametric Regression by Controlling Sparsity Gonzalo Mateos and Georgios B. Giannakis ECE Department, University of Minnesota Acknowledgments:
HASE: A Hybrid Approach to Selectivity Estimation for Conjunctive Queries Xiaohui Yu University of Toronto Joint work with Nick Koudas.
1 Consensus-Based Distributed Least-Mean Square Algorithm Using Wireless Ad Hoc Networks Gonzalo Mateos, Ioannis Schizas and Georgios B. Giannakis ECE.
Cameron Rowe.  Introduction  Purpose  Implementation  Simple Example Problem  Extended Kalman Filters  Conclusion  Real World Examples.
Optimal Reverse Prediction: Linli Xu, Martha White and Dale Schuurmans ICML 2009, Best Overall Paper Honorable Mention A Unified Perspective on Supervised,
Dimension reduction (2) EDR space Sliced inverse regression Multi-dimensional LDA Partial Least Squares Network Component analysis.
Zhu Han University of Houston Thanks for Dr. Mingyi Hong’s slides
Privacy and Fault-Tolerance in Distributed Optimization Nitin Vaidya University of Illinois at Urbana-Champaign.
Matrix Completion from a few entries
USPACOR: Universal Sparsity-Controlling Outlier Rejection
Degree and Eigenvector Centrality
Informed Non-convex Robust Principal Component Analysis with Features
Estimating Networks With Jumps
Topic models for corpora and for graphs
The European Conference on e-learing ,2017/10
Markov Random Fields Presented by: Vladan Radosavljevic.
Topic models for corpora and for graphs
Role of Stein’s Lemma in Guaranteed Training of Neural Networks
ADMM and DSO.
Sebastian Semper1 and Florian Roemer1,2
Presentation transcript:

Distributed Nuclear Norm Minimization for Matrix Completion Morteza Mardani, Gonzalo Mateos and Georgios Giannakis ECE Department, University of Minnesota Acknowledgments: MURI (AFOSR FA9550-10-1-0567) grant Cesme, Turkey June 19, 2012 1 1

Learning from “Big Data” `Data are widely available, what is scarce is the ability to extract wisdom from them’ Hal Varian, Google’s chief economist Fast BIG Ubiquitous Revealing Productive Smart Messy 2 K. Cukier, ``Harnessing the data deluge,'' Nov. 2011. 2

Context Imputation of network data Preference modeling Imputation of network data Smart metering Network cartography Goal: Given few incomplete rows per agent, impute missing entries in a distributed fashion by leveraging low-rank of the data matrix. 3 3 3 3

Low-rank matrix completion Consider matrix , set Sampling operator ? ? ? ? Given incomplete (noisy) data ? ? ? ? ? ? (as) has low rank Goal: denoise observed entries, impute missing ones ? ? Nuclear-norm minimization [Fazel’02],[Candes-Recht’09] Noisy Noise-free s.t. 4 4

Problem statement Network: undirected, connected graph ? ? ? ? ? ? n ? ? ? ? Goal: Given per node and single-hop exchanges, find (P1) Challenges Nuclear norm is not separable Global optimization variable 5 5

Separable regularization Key result [Recht et al’11] Lxρ ≥rank[X] New formulation equivalent to (P1) (P2) Nonconvex; reduces complexity: Proposition 1. If stationary pt. of (P2) and , then is a global optimum of (P1). 6 6

Distributed estimator (P3) Consensus with neighboring nodes Network connectivity (P2) (P3) Alternating-directions method of multipliers (ADMM) solver Method [Glowinski-Marrocco’75], [Gabay-Mercier’76] Learning over networks [Schizas et al’07] Primal variables per agent : n Message passing: 7 7

Distributed iterations 8 8

Attractive features Highly parallelizable with simple recursions Unconstrained QPs per agent No SVD per iteration Low overhead for message exchanges is and is small Comm. cost independent of network size Recap: (P1) (P2) (P3) Centralized Convex Sep. regul. Nonconvex Consensus Nonconvex Stationary (P3) Stationary (P2) Global (P1) 9 9

Optimality Proposition 2. If converges to and , then: i) ii) is the global optimum of (P1). ADMM can converge even for non-convex problems [Boyd et al’11] Simple distributed algorithm for optimal matrix imputation Centralized performance guarantees e.g., [Candes-Recht’09] carry over 10 10

Synthetic data Random network topology N=20, L=66, T=66 Data , 11 11

Real data Network distance prediction [Liau et al’12] Abilene network data (Aug 18-22,2011) End-to-end latency matrix N=9, L=T=N 80% missing data Figures: 1) ROC 2) 3D plot of the detected anomalies like the proposal Relative error: 10% 12 Data: http://internet2.edu/observatory/archive/data-collections.html 12