Download presentation
1
Minimizing general submodular functions
CVPR 2015 Tutorial Stefanie Jegelka MIT
2
( ) = The set function view cost of buying items together, or
( ) = cost of buying items together, or utility, or probability, … We will assume: . black box “oracle” to evaluate F
3
Set functions and energy functions
any set function with … is a function on binary vectors! 1 a b c d A a b d c F: \{0,1\}^n \to \mathbb{R} binary labeling problems = subset selection problems!
4
Discrete Labeling sky tree house grass
TODO: also stereo, 3d segmentation?
5
Summarization
6
Influential subsets
7
Submodularity extra cost: extra cost: free refill one drink
\underbrace{\textcolor{white}{\hspace{25pt}.}} extra cost: one drink extra cost: free refill diminishing marginal costs
8
The big picture graph theory electrical networks game theory
(Frank 1993) electrical networks (Narayanan 1997) game theory (Shapley 1970) G. Choquet J. Edmonds combinatorial optimization submodular functions matroid theory (Whitney, 1935) computer vision & machine learning stochastic processes (Macchi 1975, Borodin 2009) L. Lovász L.S. Shapley
9
Examples sensing: F(S) = information gained from locations S
10
Example: cover
11
Maximizing Influence Kempe, Kleinberg & Tardos 2003
12
Submodular set functions
Diminishing gains: for all Union-Intersection: for all B A + e + e
13
Submodularity: boolean & sets
14
Graph cuts Cut for one edge: cut of one edge is submodular!
cut of one edge is submodular! large graph: sum of edges Useful property: sum of submodular functions is submodular
15
Other closedness properties
submodular on . The following are submodular: Restriction: ----- Meeting Notes (8/14/12 09:55) ----- illustrations S W V S V
16
Other closedness properties
submodular on . The following are submodular: Restriction: Conditioning: ----- Meeting Notes (8/14/12 09:55) ----- illustrations S W V S V
17
Closedness properties
submodular on . The following are submodular: Restriction: Conditioning: Reflection: ----- Meeting Notes (8/14/12 09:55) ----- illustrations S V
18
Submodular optimization
subset selection: min / max F(S) minimizing submodular functions: next maximizing submodular functions: afternoon convex … … and concave aspects!
19
Minimizing submodular functions
Why? energy minimization variational inference (marginals) structured sparse estimation … How? graph cuts – fast, not always possible convex relaxations – can be fast, always possible …
20
submodularity & convexity
… is a function on binary vectors! any set function with pseudo-boolean function A 1 a b c d a b d c F: \{0,1\}^n \to \mathbb{R}
21
Relaxation: idea
22
A relaxation (extension)
have want: extension ( ) + (0.5 – 0.2) + (0.2) x = \sum_{i=1}^k\; \alpha_i\, \mathbf{1}_{S_i}
23
The Lovász extension have want: extension
24
Examples truncation cut function “total variation”! 1.0 - 0.5
F(S) = \begin{cases} 1 &\text{ if } S = \{1\}, \,\{2\}\\ 0 &\text{ if } S = \emptyset,\, \{1,2\} \end{cases} “total variation”!
25
Alternative characterization
if F is submodular, this is equivalent to: Theorem (Lovász, 1983) Lovasz extension is convex F is submodular.
26
Submodular polyhedra submodular polyhedron: Base polytope
\mathcal{P}_F = \{ y\in \mathbb{R}^n \mid y(A) \leq F(A) \text{ for all } A \subseteq \mathcal{V}\} \mathcal{B}_F = \{y \in \mathcal{P}_F \mid y(\mathcal{V}) = F(\mathcal{V})\} \begin{tabular}{c|r} $A$ & $F(A)$\\ \hline $\emptyset$ & $0$\\ $a$ & $-1$ \\ $b$ & $2$\\ $\{a,b\}$ & $0$ \end{tabular}
27
Base polytope Base polytope Edmonds 1970: “magic”
exponentially many constraints! Edmonds 1970: “magic” compute argmax in O(n log n) basis of (almost all) optimization! -- separation oracle – subgradient --
28
Base polytopes Base polytope 2D (2 elements) 3D (3 elements)
29
Convex relaxation relaxation: convex optimization (non-smooth)
\min_{S \subseteq \mathcal{V}}\, F(S) \min_{x \in [0,1]^n}\; f(x) relaxation: convex optimization (non-smooth) relaxation is exact! submodular minimization in polynomial time! (Grötschel, Lovász, Schrijver 1981)
30
Submodular minimization
minimize subgradient descent smoothing (special cases) solve dual: combinatorial algorithms foundations: Edmonds, Cunningham first poly-time algorithms: (Iwata-Fujishige-Fleischer 2001, Schrijver 2000) many more after that …
31
Minimum-norm-point algorithm
Fujishige ‘91, Fujishige & Isotani ‘11 Lovász extension proximal problem dual: minimum norm problem -1 1 -1 a a minimizes F ! b \min_{x \in [0,1]^n} f(x) + \tfrac{1}{2}\|x\|^2 \min_{u \in B(F)} \tfrac{1}{2}\|u\|^2 A^* = \arg\min_{A \subseteq V} F(A) A^* = \{ i \mid u^*(i) \leq 0\}
32
Minimum-norm-point algorithm
1. optimization: find 2. rounding: -0.5 0.8 1.0 a b c d a b d c
33
The bigger story projection proximal parametric thresholding
TODO: refs divide-and-conquer (Fujishige & Isotani 11, Nagano, Gallo-Grigoriadis-Tarjan 06, Hochbaum 01, Chambolle & Darbon 09, …)
34
Minimum-norm-point algorithm
how solve? 1. optimization: find 2. rounding: Polytope has exponentially many inequalities / faces BUT: can do linear optimization over Frank-Wolfe or Fujishige-Wolfe algorithm a b d c -0.5 0.8 1.0
35
Frank-Wolfe: main idea
36
Empirically convergence of relaxation convergence of S min-norm point
(Figure from Bach, 2012)
37
Recap – links to convexity
submodular function F(S) convex extension f(x) can compute it! submodular minimization as convex optimization -- can solve it! What can we do with it?
38
Links to convexity What can we do with it?
MAP inference / energy minimization (out-of-the-box) variational inference (Djolonga & Krause 2014) structured sparsity (Bach 2010) decomposition & parallel algorithms
39
Structured sparsity and submodularity
40
Sparse reconstruction
Assumption: x is sparse subset selection: S = {1,3,4,7} discrete regularization on support S of x relax to convex envelope \Omega(x) = f(|x|) sparsity pattern often not random…
41
Structured sparsity Assumption: support of x has structure
express by set function!
42
Preference for trees Set function: if T is a tree and S not
|S| = |T| use as regularizer?
43
Sparsity x sparse x structured sparse
submodular function discrete regularization on support S of x relax to convex envelope \Omega(x) = f(|x|) Lovász extension Optimization: submodular minimization (min-norm) (Bach2010)
44
Special case minimize a sum of submodular functions
“easy” combinatorial algorithms (Kolmogorov 12, Fix-Joachims-Park-Zabih 13, Fix-Wang-Zabih 14) convex relaxations
45
Relaxation convex Lovász extension: tight relaxation
dual decomposition: parallel algorithms (Komodakis-Paragios-Tziritas 11, Savchynskyy-Schmidt-Kappes-Schnörr 11, J-Bach-Sra 13) \min_{S \subseteq \mathcal{V}}\; \sum\nolimits_{i} F_i(S) \;\; = \; \min_{x \in [0,1]^n}\; \sum\nolimits_i f_i(x)
46
Results: dual decomposition
relaxation I relax II convergence discrete problem smooth dual non-smooth dual faster parallel algorithms (Jegelka, Bach, Sra 2013; Nishihara, Jegelka, Jordan 2014)
47
Summary Submodular functions – diminishing returns/costs
convex relations: exact relaxation structured norms fast algorithms more soon: constraints maximization: diversity, information
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.