ICCV 2007 tutorial on Discrete Optimization Methods in Computer Vision part I Basic overview of graph cuts.

Slides:



Advertisements
Similar presentations
Primal-dual Algorithm for Convex Markov Random Fields Vladimir Kolmogorov University College London GDR (Optimisation Discrète, Graph Cuts et Analyse d'Images)
Advertisements

Algorithms for MAP estimation in Markov Random Fields Vladimir Kolmogorov University College London Tutorial at GDR (Optimisation Discrète, Graph Cuts.
Multi-view Stereo via Volumetric Graph-cuts
Graph Cut Algorithms for Computer Vision & Medical Imaging Ramin Zabih Computer Science & Radiology Cornell University Joint work with Y. Boykov, V. Kolmogorov,
Introduction to Markov Random Fields and Graph Cuts Simon Prince
I Images as graphs Fully-connected graph – node for every pixel – link between every pair of pixels, p,q – similarity w ij for each link j w ij c Source:
The University of Ontario CS 4487/9687 Algorithms for Image Analysis Multi-Label Image Analysis Problems.
GrabCut Interactive Image (and Stereo) Segmentation Carsten Rother Vladimir Kolmogorov Andrew Blake Antonio Criminisi Geoffrey Cross [based on Siggraph.
1 s-t Graph Cuts for Binary Energy Minimization  Now that we have an energy function, the big question is how do we minimize it? n Exhaustive search is.
GrabCut Interactive Image (and Stereo) Segmentation Joon Jae Lee Keimyung University Welcome. I will present Grabcut – an Interactive tool for foreground.
Learning with Inference for Discrete Graphical Models Nikos Komodakis Pawan Kumar Nikos Paragios Ramin Zabih (presenter)
Pseudo-Bound Optimization for Binary Energies Presenter: Meng Tang Joint work with Ismail Ben AyedYuri Boykov 1 / 27.
Corp. Research Princeton, NJ Computing geodesics and minimal surfaces via graph cuts Yuri Boykov, Siemens Research, Princeton, NJ joint work with Vladimir.
Silhouettes in Multiview Stereo Ian Simon. Multiview Stereo Problem Input: – a collection of images of a rigid object (or scene) – camera parameters for.
1 Can this be generalized?  NP-hard for Potts model [K/BVZ 01]  Two main approaches 1. Exact solution [Ishikawa 03] Large graph, convex V (arbitrary.
Robust Higher Order Potentials For Enforcing Label Consistency
The University of Ontario University of Bonn July 2008 Optimization of surface functionals using graph cut algorithms Yuri Boykov presenting joint work.
Improved Moves for Truncated Convex Models M. Pawan Kumar Philip Torr.
2010/5/171 Overview of graph cuts. 2010/5/172 Outline Introduction S-t Graph cuts Extension to multi-label problems Compare simulated annealing and alpha-
Stereo & Iterative Graph-Cuts Alex Rav-Acha Vision Course Hebrew University.
The plan for today Camera matrix
Announcements Readings for today:
Lecture 10: Stereo and Graph Cuts
Stereo Computation using Iterative Graph-Cuts
Comp 775: Graph Cuts and Continuous Maximal Flows Marc Niethammer, Stephen Pizer Department of Computer Science University of North Carolina, Chapel Hill.
What Energy Functions Can be Minimized Using Graph Cuts? Shai Bagon Advanced Topics in Computer Vision June 2010.
Relaxations and Moves for MAP Estimation in MRFs M. Pawan Kumar STANFORDSTANFORD Vladimir KolmogorovPhilip TorrDaphne Koller.
Measuring Uncertainty in Graph Cut Solutions Pushmeet Kohli Philip H.S. Torr Department of Computing Oxford Brookes University.
Computer vision: models, learning and inference
Extensions of submodularity and their application in computer vision
Multiplicative Bounds for Metric Labeling M. Pawan Kumar École Centrale Paris École des Ponts ParisTech INRIA Saclay, Île-de-France Joint work with Phil.
Graph-based Segmentation
Reconstructing Relief Surfaces George Vogiatzis, Philip Torr, Steven Seitz and Roberto Cipolla BMVC 2004.
Fast Approximate Energy Minimization via Graph Cuts
A Selective Overview of Graph Cut Energy Minimization Algorithms Ramin Zabih Computer Science Department Cornell University Joint work with Yuri Boykov,
Michael Bleyer LVA Stereo Vision
Graph Cut & Energy Minimization
MRFs and Segmentation with Graph Cuts Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 02/24/10.
Graph Cut Algorithms for Binocular Stereo with Occlusions
Graph Cut 韋弘 2010/2/22. Outline Background Graph cut Ford–Fulkerson algorithm Application Extended reading.
City University of Hong Kong 18 th Intl. Conf. Pattern Recognition Self-Validated and Spatially Coherent Clustering with NS-MRF and Graph Cuts Wei Feng.
CS774. Markov Random Field : Theory and Application Lecture 13 Kyomin Jung KAIST Oct
Interactive Graph Cuts for Optimal Boundary & Region Segmentation of Objects in N-D Images (Fri) Young Ki Baik, Computer Vision Lab.
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
Multiplicative Bounds for Metric Labeling M. Pawan Kumar École Centrale Paris Joint work with Phil Torr, Daphne Koller.
Lena Gorelick joint work with O. Veksler I. Ben Ayed A. Delong Y. Boykov.
The University of Ontario How to fit a surface to a point cloud? or optimization of surface functionals in computer vision Yuri Boykov TRICS seminar Computer.
Graph Cuts Marc Niethammer. Segmentation by Graph-Cuts A way to compute solutions to the optimization problems we looked at before. Example: Binary Segmentation.
CS 4487/6587 Algorithms for Image Analysis
Algorithms for MAP estimation in Markov Random Fields Vladimir Kolmogorov University College London.
1 Markov Random Fields with Efficient Approximations Yuri Boykov, Olga Veksler, Ramin Zabih Computer Science Department CORNELL UNIVERSITY.
Machine Learning – Lecture 15
Lecture 19: Solving the Correspondence Problem with Graph Cuts CAP 5415 Fall 2006.
Presenter : Kuang-Jui Hsu Date : 2011/3/24(Thur.).
The University of Ontario CS 4487/9687 Algorithms for Image Analysis Multi-Label Image Analysis Problems.
Gaussian Mixture Models and Expectation-Maximization Algorithm.
Machine Learning – Lecture 15
Yuri Boykov jointly with
Pseudo-Bound Optimization for Binary Energies Meng Tang 1 Ismail Ben Ayed 2 Yuri Boykov 1 1 University of Western Ontario, Canada 2 GE Healthcare Canada.
The University of Ontario From photohulls to photoflux optimization Yuri Boykov University of Western Ontario Victor Lempitsky Moscow State University.
A global approach Finding correspondence between a pair of epipolar lines for all pixels simultaneously Local method: no guarantee we will have one to.
Graph Algorithms for Vision Amy Gale November 5, 2002.
Photoconsistency constraint C2 q C1 p l = 2 l = 3 Depth labels If this 3D point is visible in both cameras, pixels p and q should have similar intensities.
Energy functions f(p)  {0,1} : Ising model Can solve fast with graph cuts V( ,  ) = T[  ] : Potts model NP-hard Closely related to Multiway Cut Problem.
Energy minimization Another global approach to improve quality of correspondences Assumption: disparities vary (mostly) smoothly Minimize energy function:
The University of Ontario CS 4487/9687 Algorithms for Image Analysis Combining Color and Boundary and other extensions (graph cuts)
Graph Cuts vs. Level Sets
Markov Random Fields with Efficient Approximations
Multi-view reconstruction
Discrete Optimization Methods Basic overview of graph cuts
Presentation transcript:

ICCV 2007 tutorial on Discrete Optimization Methods in Computer Vision part I Basic overview of graph cuts

Disclaimer n Can not possibly cover all discrete optimization methods widely used in computer vision in the last 30 years n We mainly concentrate on Discrete energy minimization methods that can be applied to Markov Random Fields with binary or n-labels –applicable to a wide spectrum of problems in vision Methods motivated by LP relaxations –good bounds on the solutions

Discrete Optimization Methods in Computer Vision n Part I: basic overview of graph cuts binary labeling –a few basic examples –energy optimization submodularity (discrete view) continuous functionals (geometric view) posterior MRF energy (statistical view) extensions to multi-label problems –interactions: convex, robust, metric –move-based optimization

Shortest paths approach (live wire, intelligent scissors) 2D Graph cut shortest path on a graph Example: find the shortest closed contour in a given domain of a graph Compute the shortest path p ->p for a point p. p Graph Cuts approach Compute the minimum cut that separates red region from blue region Repeat for all points on the gray line. Then choose the optimal contour.

Graph cuts for optimal boundary detection (B&J, ICCV’01) n-links s t a cut hard constraint hard constraint Minimum cost cut can be computed in polynomial time (max-flow/min-cut algorithms)

Standard minimum s-t cuts algorithms n Augmenting paths [Ford & Fulkerson, 1962] n Push-relabel [Goldberg-Tarjan, 1986] n Tree recycling (dynamic trees) [B&K, 2004] n Flow recycling (dynamic cuts) [Kohli & Torr, 2005] n Cut recycling (active cuts) [Juan & Boykov, 2006] n Hierarchical methods - in search space [Lombaert et al., CVPR 2005] - in edge weights (capacity scaling) [Juan et al., ICCV07] adapted to N-D grids used in computer vision

Optimal boundary in 2D “max-flow = min-cut”

Optimal boundary in 3D 3D bone segmentation (real time screen capture)

Graph cuts applied to multi-view reconstruction CVPR’05 slides from Vogiatzis, Torr, Cippola visual hull (silhouettes) surface of good photoconsistency

Adding regional properties (B&J, ICCV’01) n-links s t a cut t-link NOTE: hard constrains are not required, in general. regional bias example suppose are given “expected” intensities of object and background

NOTE: hard constrains are not required, in general. EM-style optimization of piece-vice constant Mumford-Shah model Adding regional properties (B&J, ICCV’01) n-links s t a cut t-link “expected” intensities of object and background can be re-estimated

Adding regional properties (B&J, ICCV’01) a cut given object and background intensity histograms s t More generally, regional bias can be based on any intensity models of object and background

Adding regional properties (B&J, ICCV’01)

Iterative learning of regional color-models n GMMRF cuts (Blake et al., ECCV04) n Grab-cut (Rother et al., SIGGRAPH 04) parametric regional model – Gaussian Mixture (GM) designed to guarantee convergence

At least three ways to look at energy of graph cuts I: Binary submodular energy II: Approximating continuous surface functionals III: Posterior energy (MAP-MRF)

Simple example of energy t-links n-links Boundary term Regional term binary object segmentation n-links s t a cut t-link

Graph cuts for minimization of submodular binary energies I n Characterization of binary energies that can be globally minimized by s-t graph cuts [Boros&Hummer, 2002, K&Z 2004] n Non-submodular cases can be addressed with some optimality guarantees, e.g. QPBO algorithm (see Boros&Hummer, 2002, Tavares et al. 06, Rother et al. 07) E(L) can be minimized by s-t graph cuts t-links n-links Boundary term Regional term Submodularity (“convexity”)

Graph cuts for minimization of continuous surface functionals II [K&B, ICCV 2005] [B&K, ICCV 2003] n Characterization of energies of binary cuts C as functionals of continuous surfaces Geometric length any convex, symmetric metric g e.g. Riemannian Flux any vector field v Regional bias any scalar function f

One extension using parametric max-flow methods n optimization of ratio functionals n In 2D can use DP [Cox et al’96, Jermyn&Ishikawa’01] n In 3D, see a poster on Tuesday (Kolmogorov, Boykov, Rother)

Graph cuts for minimization of posterior energy III n Greig at al. [IJRSS, 1989] Posterior energy (MRF, Ising model) Example: binary image restoration Spatial prior (regularization) Likelihood (data term)

Graph cuts algorithms can minimize multi-label energies as well

Multi-scan-line stereo with s-t graph cuts (Roy&Cox’98) s t cut L(p) p “cut” x y labels x y Disparity labels

s-t graph-cuts for multi-label energy minimization n Ishikawa 1998, 2000, 2003 n Generalization of construction by Roy&Cox 1998 V(dL) dL=Lp-Lq V(dL) dL=Lp-Lq Linear interactions “Convex” interactions

Pixel interactions V: “convex” vs. “discontinuity-preserving” V(dL) dL=Lp-Lq Potts model Robust or “discontinuity preserving” Interactions V V(dL) dL=Lp-Lq “Convex” Interactions V V(dL) dL=Lp-Lq V(dL) dL=Lp-Lq “linear” model (weak membrane models, see a book by Blake and Zisserman, 87)

Pixel interactions: “convex” vs. “discontinuity-preserving” “linear” V truncated “linear” V

Robust interactions n NP-hard problem (3 or more labels) two labels can be solved via s-t cuts (Greig at. al., 1989) n a-expansion approximation algorithm (Boykov, Veksler, Zabih 1998, 2001) guaranteed approximation quality (Veksler, thesis 2001) –within a factor of 2 from the global minima (Potts model) applies to a wide class of energies with robust interactions – Potts model (BVZ 1989) –“metric” interactions (BVZ 2001) –can be extended to arbitrary interactions with weaker guarantees truncation (Kolmogorov et al. 2005) QPBO (Boros and Hummer, 2002) n Other “move” algorithms (e.g. a-b swap, jump-moves) n More is coming later in this tutorial

a-expansion algorithm 1.Start with any initial solution 2.For each label “a” in any (e.g. random) order 1.Compute optimal a-expansion move (s-t graph cuts) 2.Decline the move if there is no energy decrease 3.Stop when no expansion move would decrease energy

other labels a a-expansion move Basic idea: break multi-way cut computation into a sequence of binary s-t cuts

a-expansion moves initial solution -expansion In each a-expansion a given label “a” grabs space from other labels For each move we choose expansion that gives the largest decrease in the energy: binary optimization problem

Metric interactions V(a,b)=0 iff a=b V(a,b) = V(b,a) >= 0 V(a,c) <= V(a,b)+V(b,c) Triangular inequality Implies that every expansion move (a binary problem) is submodular

a-expansions: examples of metric interactions Potts V “noisy diamond” “noisy shaded diamond” Truncated “linear” V

Multi-way graph cuts Multi-object Extraction

Multi-way graph cuts Stereo/Motion with slanted surfaces (Birchfield &Tomasi 1999) Labels = parameterized surfaces EM based: E step = compute surface boundaries M step = re-estimate surface parameters

Multi-way graph cuts stereo vision original pair of “stereo” images depth map ground truth BVZ 1998 KZ 2002

Multi-way graph cuts Graph-cut textures (Kwatra, Schodl, Essa, Bobick 2003) similar to “image-quilting” (Efros & Freeman, 2001) A B C D E F G H I J A B G D C F H I J E

normalized correlation, start for annealing, 24.7% err simulated annealing, 19 hours, 20.3% err a-expansions (BVZ 89,01) 90 seconds, 5.8% err a-expansions vs. simulated annealing