Submodular set functions Set function z on V is called submodular if For all A,B µ V: z(A)+z(B) ¸ z(A[B)+z(AÅB) Equivalent diminishing returns characterization:

Slides:



Advertisements
Similar presentations
C&O 355 Lecture 22 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A A A A A A A A.
Advertisements

Beyond Convexity – Submodularity in Machine Learning
Primal Dual Combinatorial Algorithms Qihui Zhu May 11, 2009.
1 LP Duality Lecture 13: Feb Min-Max Theorems In bipartite graph, Maximum matching = Minimum Vertex Cover In every graph, Maximum Flow = Minimum.
Solving IPs – Cutting Plane Algorithm General Idea: Begin by solving the LP relaxation of the IP problem. If the LP relaxation results in an integer solution,
C&O 355 Mathematical Programming Fall 2010 Lecture 22 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A.
Randomized Sensing in Adversarial Environments Andreas Krause Joint work with Daniel Golovin and Alex Roper International Joint Conference on Artificial.
Provable Submodular Minimization using Wolfe’s Algorithm Deeparnab Chakrabarty (Microsoft Research) Prateek Jain (Microsoft Research) Pravesh Kothari (U.
UMass Lowell Computer Science Analysis of Algorithms Prof. Karen Daniels Fall, 2002 Lecture 8 Tuesday, 11/19/02 Linear Programming.
C&O 355 Mathematical Programming Fall 2010 Lecture 21 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A.
C&O 355 Lecture 12 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A A A A A A A A.
How should we define corner points? Under any reasonable definition, point x should be considered a corner point x What is a corner point?
CSCI 3160 Design and Analysis of Algorithms Tutorial 6 Fei Chen.
UMass Lowell Computer Science Analysis of Algorithms Prof. Karen Daniels Fall, 2006 Lecture 9 Wednesday, 11/15/06 Linear Programming.
Totally Unimodular Matrices Lecture 11: Feb 23 Simplex Algorithm Elliposid Algorithm.
Introduction to Linear and Integer Programming Lecture 7: Feb 1.
Pushkar Tripathi Georgia Institute of Technology Approximability of Combinatorial Optimization Problems with Submodular Cost Functions Based on joint work.
Approximation Algorithm: Iterative Rounding Lecture 15: March 9.
Online Oblivious Routing Nikhil Bansal, Avrim Blum, Shuchi Chawla & Adam Meyerson Carnegie Mellon University 6/7/2003.
Computability and Complexity 24-1 Computability and Complexity Andrei Bulatov Approximation.
Unconstrained Optimization Problem
Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy under contract.
Polyhedral Optimization Lecture 3 – Part 3 M. Pawan Kumar Slides available online
1 Spanning Tree Polytope x1 x2 x3 Lecture 11: Feb 21.
Integrality Gaps for Sparsest Cut and Minimum Linear Arrangement Problems Nikhil R. Devanur Subhash A. Khot Rishi Saket Nisheeth K. Vishnoi.
Submodularity in Machine Learning
C&O 355 Lecture 2 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A.
Polyhedral Optimization Lecture 1 – Part 2 M. Pawan Kumar Slides available online
Online Oblivious Routing Nikhil Bansal, Avrim Blum, Shuchi Chawla & Adam Meyerson Carnegie Mellon University 6/7/2003.
June 10, 2003STOC 2003 Optimal Oblivious Routing in Polynomial Time Harald Räcke Paderborn Edith Cohen AT&T Labs-Research Yossi Azar Amos Fiat Haim Kaplan.
Linear Programming Piyush Kumar. Graphing 2-Dimensional LPs Example 1: x y Feasible Region x  0y  0 x + 2 y  2 y  4 x  3 Subject.
Polynomial Time Algorithms P = { computational problems that can be solved efficiently } i.e., solved in time · n c, for some constant c, where n=input.
Approximation algorithms for sequential testing of Boolean functions Lisa Hellerstein Polytechnic Institute of NYU Joint work with Devorah Kletenik (Polytechnic.
Minimizing general submodular functions
Martin Grötschel  Institute of Mathematics, Technische Universität Berlin (TUB)  DFG-Research Center “Mathematics for key technologies” (M ATHEON ) 
Linear Programming – Simplex Method
C&O 355 Mathematical Programming Fall 2010 Lecture 18 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A.
Chapter 2 Greedy Strategy I. Independent System Ding-Zhu Du.
Approximation Algorithms for Prize-Collecting Forest Problems with Submodular Penalty Functions Chaitanya Swamy University of Waterloo Joint work with.
C&O 355 Mathematical Programming Fall 2010 Lecture 8 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A.
§1.4 Algorithms and complexity For a given (optimization) problem, Questions: 1)how hard is the problem. 2)does there exist an efficient solution algorithm?
5 Maximizing submodular functions Minimizing convex functions: Polynomial time solvable! Minimizing submodular functions: Polynomial time solvable!
Optimization - Lecture 4, Part 1 M. Pawan Kumar Slides available online
Tractable Higher Order Models in Computer Vision (Part II) Slides from Carsten Rother, Sebastian Nowozin, Pusohmeet Khli Microsoft Research Cambridge Presented.
Implicit Hitting Set Problems Richard M. Karp Erick Moreno Centeno DIMACS 20 th Anniversary.
MAP Estimation in Binary MRFs using Bipartite Multi-Cuts Sashank J. Reddi Sunita Sarawagi Sundar Vishwanathan Indian Institute of Technology, Bombay TexPoint.
Submodularity Reading Group Submodular Function Minimization via Linear Programming M. Pawan Kumar
Common Intersection of Half-Planes in R 2 2 PROBLEM (Common Intersection of half- planes in R 2 ) Given n half-planes H 1, H 2,..., H n in R 2 compute.
Submodularity Reading Group Matroid Polytopes, Polymatroid M. Pawan Kumar
Lap Chi Lau we will only use slides 4 to 19
Monitoring rivers and lakes [IJCAI ‘07]
Topics in Algorithms Lap Chi Lau.
6.5 Stochastic Prog. and Benders’ decomposition
Proving that a Valid Inequality is Facet-defining
Distributed Submodular Maximization in Massive Datasets
Basic Algorithms Christina Gallner
Example: Feature selection Given random variables Y, X1, … Xn Want to predict Y from subset XA = (Xi1,…,Xik) Want k most informative features: A*
Chapter 6. Large Scale Optimization
Finding Subgraphs with Maximum Total Density and Limited Overlap
Chapter 6. Large Scale Optimization
Factoring distributions V X1 X2 X3 X4 X5 X6 X7 Given random variables X1,…,Xn Partition variables V into sets A and VnA as independent as possible.
Flow Feasibility Problems
Proving that a Valid Inequality is Facet-defining
Part 3. Linear Programming
6.5 Stochastic Prog. and Benders’ decomposition
BASIC FEASIBLE SOLUTIONS
1.2 Guidelines for strong formulations
Chapter 6. Large Scale Optimization
1.2 Guidelines for strong formulations
Guess Free Maximization of Submodular and Linear Sums
Presentation transcript:

Submodular set functions Set function z on V is called submodular if For all A,B µ V: z(A)+z(B) ¸ z(A[B)+z(AÅB) Equivalent diminishing returns characterization: + ¸ + A A [ B B AÅB B + S Large improvement Submodularity: A + S Small improvement For AµB, sB, z(A [ {s}) – z(A) ¸ z(B [ {s}) – z(B)

Submodularity and convexity For V = {1,…,n}, and A µ V, let 1A = (w1A,…,wnA) with wiA = 1 if i 2 A, 0 otherwise Key result [Lovasz ’83]: Every submodular set function z induces a function g on Rn+, such that (i) z(A) = g(1A) for all A µ V (ii) g(w) is convex (iii) minA z(A) = minw g(w) s.t. w 2 [0,1]n

The submodular polyhedron Pz Pz = {x 2 Rn: x(A) · z(A) for all A µ V} Example: V = {a,b} A z(A) ; {a} -1 {b} 2 {a,b} x(A) = i2 A xi -1 x{a} x{b} 1 2 -2 x({b}) · z({b}) Pz x({a,b}) · z({a,b}) x({a}) · z({a})

Evaluating g(w) g(w) = maxx2Pz wTx -1 x{a} x{b} 1 2 xw Evaluating g(w) w g(w) = maxx2Pz wTx -2 Pz = {x 2 Rn: x(A) · z(A) for all A µ V} Theorem [Edmonds ’71, Lovasz ‘83]: For any given w, can get optimal solution xw to the LP using the following greedy algorithm: Order V={e1,…,en} so that w(e1)¸ …¸ w(en) Let xw(ei) = z({e1,…,ei}) – z({e1,…,ei-1}) Then wT xw = g(w) = maxx2 Pz wT x

Ellipsoid algorithm [Grötschel, Lovasz, Schrijver ’81] Feasible region minc,w c s.t. c ¸ wT x for all x2Pz w2 [0,1]n Separation oracle: Find most violated constraint: maxx wT x – c s.t. x 2 Pz Can solve separation using the greedy algorithm!!  Ellipsoid algorithm minimizes SFs in poly-time!

Minimizing submodular functions Ellipsoid algorithm not very practical Want combinatorial algorithm for minimization! Theorem [Iwata (2001)] There is a fully combinatorial, strongly polynomial algorithm for minimizing SFs, that runs in time O(n8 log2 n) Polynomial-time = Practical ???

A more practical alternative? [Fujishige ’91, Fujishige et al ‘06] x({a,b})=z({a,b}) -1 x{a} x{b} 1 2 -2 A z(A) ; {a} -1 {b} 2 {a,b} x* Base polytope: Bz = Pz Å {x(V) = z(V)} [-1,1] M Minimum norm algorithm: Find x* = argmin ||x||2 s.t. x 2 Bz x*=[-1,1] Return A* = {i: x*(i) < 0} A*={a} Theorem [Fujishige ’91]: A* is an optimal solution! Note: Can solve 1. using Wolfe’s algorithm Runtime finite but unknown!! 

Empirical comparison [Fujishige et al ’06] Cut functions from DIMACS Challenge Running time (seconds) Lower is better (log-scale!) Minimum norm algorithm 64 128 256 512 1024 Problem size (log-scale!) Minimum norm algorithm orders of magnitude faster! Matlab implementation can solve n = 10k in < 6 minutes!