5 Maximizing submodular functions Minimizing convex functions: Polynomial time solvable! Minimizing submodular functions: Polynomial time solvable!

Slides:



Advertisements
Similar presentations
Algorithms for MAP estimation in Markov Random Fields Vladimir Kolmogorov University College London Tutorial at GDR (Optimisation Discrète, Graph Cuts.
Advertisements

Beyond Convexity – Submodularity in Machine Learning
Convex Programming Brookes Vision Reading Group. Huh? What is convex ??? What is programming ??? What is convex programming ???
Submodular Set Function Maximization A Mini-Survey Chandra Chekuri Univ. of Illinois, Urbana-Champaign.
Submodularity for Distributed Sensing Problems Zeyn Saigol IR Lab, School of Computer Science University of Birmingham 6 th July 2010.
Submodular Set Function Maximization via the Multilinear Relaxation & Dependent Rounding Chandra Chekuri Univ. of Illinois, Urbana-Champaign.
1 LP Duality Lecture 13: Feb Min-Max Theorems In bipartite graph, Maximum matching = Minimum Vertex Cover In every graph, Maximum Flow = Minimum.
Approximation Algorithms
1 Adaptive Submodularity: A New Approach to Active Learning and Stochastic Optimization Daniel Golovin and Andreas Krause.
Bilge Mutlu, Andreas Krause, Jodi Forlizzi, Carlos Guestrin, and Jessica Hodgins Human-Computer Interaction Institute, Carnegie Mellon University Robust,
Maximizing the Spread of Influence through a Social Network
Dependent Randomized Rounding in Matroid Polytopes (& Related Results) Chandra Chekuri Jan VondrakRico Zenklusen Univ. of Illinois IBM ResearchMIT.
Carnegie Mellon Selecting Observations against Adversarial Objectives Andreas Krause Brendan McMahan Carlos Guestrin Anupam Gupta TexPoint fonts used in.
Near-Optimal Sensor Placements in Gaussian Processes Carlos Guestrin Andreas KrauseAjit Singh Carnegie Mellon University.
Learning with Inference for Discrete Graphical Models Nikos Komodakis Pawan Kumar Nikos Paragios Ramin Zabih (presenter)
Efficient Informative Sensing using Multiple Robots
Algorithms for Max-min Optimization
Near-optimal Nonmyopic Value of Information in Graphical Models Andreas Krause, Carlos Guestrin Computer Science Department Carnegie Mellon University.
Sensor placement applications Monitoring of spatial phenomena Temperature Precipitation... Active learning, Experiment design Precipitation data from Pacific.
Non-myopic Informative Path Planning in Spatio-Temporal Models Alexandra Meliou Andreas Krause Carlos Guestrin Joe Hellerstein.
Approximation Algorithms
Optimal Nonmyopic Value of Information in Graphical Models Efficient Algorithms and Theoretical Limits Andreas Krause, Carlos Guestrin Computer Science.
Message Passing Algorithms for Optimization
Near-optimal Observation Selection using Submodular Functions Andreas Krause joint work with Carlos Guestrin (CMU)
Nonmyopic Active Learning of Gaussian Processes An Exploration – Exploitation Approach Andreas Krause, Carlos Guestrin Carnegie Mellon University TexPoint.
Computability and Complexity 24-1 Computability and Complexity Andrei Bulatov Approximation.
Integer Programming Difference from linear programming –Variables x i must take on integral values, not real values Lots of interesting problems can be.
Facility Location using Linear Programming Duality Yinyu Ye Department if Management Science and Engineering Stanford University.
Efficiently handling discrete structure in machine learning Stefanie Jegelka MADALGO summer school.
Approximation Algorithms: Bristol Summer School 2008 Seffi Naor Computer Science Dept. Technion Haifa, Israel TexPoint fonts used in EMF. Read the TexPoint.
LP formulation of Economic Dispatch
Active Learning for Probabilistic Models Lee Wee Sun Department of Computer Science National University of Singapore LARC-IMS Workshop.
Submodularity in Machine Learning
Computational Geometry Piyush Kumar (Lecture 5: Linear Programming) Welcome to CIS5930.
Minimizing general submodular functions
Martin Grötschel  Institute of Mathematics, Technische Universität Berlin (TUB)  DFG-Research Center “Mathematics for key technologies” (M ATHEON ) 
Approximation Algorithms
Carnegie Mellon Maximizing Submodular Functions and Applications in Machine Learning Andreas Krause, Carlos Guestrin Carnegie Mellon University.
Chapter 2 Greedy Strategy I. Independent System Ding-Zhu Du.
Randomized Composable Core-sets for Submodular Maximization Morteza Zadimoghaddam and Vahab Mirrokni Google Research New York.
Algorithms for MAP estimation in Markov Random Fields Vladimir Kolmogorov University College London.
Approximation Algorithms for Prize-Collecting Forest Problems with Submodular Penalty Functions Chaitanya Swamy University of Waterloo Joint work with.
Tractable Higher Order Models in Computer Vision (Part II) Slides from Carsten Rother, Sebastian Nowozin, Pusohmeet Khli Microsoft Research Cambridge Presented.
Implicit Hitting Set Problems Richard M. Karp Erick Moreno Centeno DIMACS 20 th Anniversary.
Vasilis Syrgkanis Cornell University
Submodular set functions Set function z on V is called submodular if For all A,B µ V: z(A)+z(B) ¸ z(A[B)+z(AÅB) Equivalent diminishing returns characterization:
Submodular Set Function Maximization A Mini-Survey Chandra Chekuri Univ. of Illinois, Urbana-Champaign.
Deterministic Algorithms for Submodular Maximization Problems Moran Feldman The Open University of Israel Joint work with Niv Buchbinder.
Maximizing Symmetric Submodular Functions Moran Feldman EPFL.
Efficient Point Coverage in Wireless Sensor Networks Jie Wang and Ning Zhong Department of Computer Science University of Massachusetts Journal of Combinatorial.
Approximation Algorithms Duality My T. UF.
Linear Programming Piyush Kumar Welcome to CIS5930.
Submodularity Reading Group Matroid Polytopes, Polymatroid M. Pawan Kumar
Submodularity Reading Group Matroids, Submodular Functions M. Pawan Kumar
Monitoring rivers and lakes [IJCAI ‘07]
Near-optimal Observation Selection using Submodular Functions
IE 635 Combinatorial Optimization
Polynomial Norms Amir Ali Ahmadi (Princeton University) Georgina Hall
Distributed Submodular Maximization in Massive Datasets
Example: Feature selection Given random variables Y, X1, … Xn Want to predict Y from subset XA = (Xi1,…,Xik) Want k most informative features: A*
Pseudo-Boolean Optimization
Coverage Approximation Algorithms
Linear Programming and Approximation
Cost-effective Outbreak Detection in Networks
Rule Selection as Submodular Function
Submodular Maximization Through the Lens of the Multilinear Relaxation
Flow Feasibility Problems
Near-Optimal Sensor Placements in Gaussian Processes
Submodular Maximization with Cardinality Constraints
Guess Free Maximization of Submodular and Linear Sums
Presentation transcript:

5 Maximizing submodular functions Minimizing convex functions: Polynomial time solvable! Minimizing submodular functions: Polynomial time solvable! Maximizing convex functions: NP hard! Maximizing submodular functions: NP hard! But can get approximation guarantees

6 Example: Set cover Node predicts values of positions with some radius For A µ V: z(A) = “area covered by sensors placed at A” Formally: W finite set, collection of n subsets S i µ W For A µ V={1,…,n} define z(A) = |  i2 A S i | Want to cover floorplan with discs Place sensors in building Possible locations V

7 Set cover is submodular S1S1 S2S2 S1S1 S2S2 S3S3 S4S4 S’ A={S 1,S 2 } B = {S 1,S 2,S 3,S 4 } z(A [ {S’})-z(A) z(B [ {S’})-z(B) ¸

8 Example: Feature selection Given random variables Y, X 1, … X n Want to predict Y from subset X A = (X i 1,…,X i k ) Want k most informative features: A* = argmax IG(X A ; Y) s.t. |A| · k where IG(X A ; Y) = H(Y) - H(Y | X A ) Y “Sick” X 1 “Fever” X 2 “Rash” X 3 “Male” Naïve Bayes Model Uncertainty before knowing X A Uncertainty after knowing X A

9 Example: Submodularity of info-gain Y 1,…,Y m, X 1, …, X n discrete RVs z(A) = IG(Y; X A ) = H(Y)-H(Y | X A ) z(A) is always monotonic However, NOT always submodular Theorem [Krause & Guestrin UAI’ 05] If X i are all conditionally independent given Y, then z(A) is submodular! Y1Y1 X1X1 Y2Y2 X2X2 Y3Y3 X4X4 X3X3 Hence, greedy algorithm works! In fact, NO algorithm can do better than (1-1/e) approximation!

10 People sit a lot Activity recognition in assistive technologies Seating pressure as user interface Equipped with 1 sensor per cm 2 ! Costs $16,000!  Can we get similar accuracy with fewer, cheaper sensors? Lean forward SlouchLean left 82% accuracy on 10 postures! [Tan et al] Building a Sensing Chair [Mutlu, Krause, Forlizzi, Guestrin, Hodgins UIST ‘07]

11 How to place sensors on a chair? Sensor readings at locations V as random variables Predict posture Y using probabilistic model P(Y,V) Pick sensor locations A* µ V to minimize entropy: Possible locations V AccuracyCost Before82%$16,000  After79%$100 Placed sensors, did a user study: Similar accuracy at <1% of the cost!

12 Bounds on optimal solution [Krause et al., J Wat Res Mgt ’08] Submodularity gives data-dependent bounds on the performance of any algorithm Sensing quality z(A) Higher is better Water networks data Offline (Nemhauser) bound Data-dependent bound Greedy solution Number of sensors placed

Summary (1) Minimization of submodular functions –Submodularity and convexity –Submodular Polyhedron –Symmetric submodular functions

Summary (2) Pseudo-boolean functions –Representation (polynomial, posiform, tableau, graph cut) –Reduction to quadratic polynomial –Necessary and sufficient conditions for submodularity –Minimization of quadratic and cubic submodular functions via graph cuts –Lower bound via roof duality LP via posiform representation LP via linear relaxation Max flow via symmetric graph construction

Further reading Combinatorial algorithms for submodular (and bisubmodular) function minimization More algorithms/bounds for maximizing submodular functions Linear and semidefinite relaxations Matroids, greedoids, intersection of matroids, polymatroids and more Generalized roof duality