The M-Best Mode Problem

Slides:



Advertisements
Similar presentations
Mean-Field Theory and Its Applications In Computer Vision1 1.
Advertisements

Algorithms for MAP estimation in Markov Random Fields Vladimir Kolmogorov University College London Tutorial at GDR (Optimisation Discrète, Graph Cuts.
Bayesian Belief Propagation
O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.
Diverse M-Best Solutions in Markov Random Fields
The Layout Consistent Random Field for detecting and segmenting occluded objects CVPR, June 2006 John Winn Jamie Shotton.
Focused Inference with Local Primal-Dual Gaps
Pose Estimation and Segmentation of People in 3D Movies Karteek Alahari, Guillaume Seguin, Josef Sivic, Ivan Laptev Inria, Ecole Normale Superieure ICCV.
Constrained Approximate Maximum Entropy Learning (CAMEL) Varun Ganapathi, David Vickrey, John Duchi, Daphne Koller Stanford University TexPoint fonts used.
Joint Optimisation for Object Class Segmentation and Dense Stereo Reconstruction Ľubor Ladický, Paul Sturgess, Christopher Russell, Sunando Sengupta, Yalin.
Learning to Combine Bottom-Up and Top-Down Segmentation Anat Levin and Yair Weiss School of CS&Eng, The Hebrew University of Jerusalem, Israel.
Shape Sharing for Object Segmentation
Scene Labeling Using Beam Search Under Mutex Constraints ID: O-2B-6 Anirban Roy and Sinisa Todorovic Oregon State University 1.
Learning with Inference for Discrete Graphical Models Nikos Komodakis Pawan Kumar Nikos Paragios Ramin Zabih (presenter)
ICCV 2007 tutorial Part III Message-passing algorithms for energy minimization Vladimir Kolmogorov University College London.
Efficient Inference for Fully-Connected CRFs with Stationarity
Models for Scene Understanding – Global Energy models and a Style-Parameterized boosting algorithm (StyP-Boost) Jonathan Warrell, 1 Simon Prince, 2 Philip.
Learning to Detect A Salient Object Reporter: 鄭綱 (3/2)
Parsing Human Motion with Stretchable Models
Robust Higher Order Potentials For Enforcing Label Consistency
Schedule Introduction Models: small cliques and special potentials Tea break Inference: Relaxation techniques:
P 3 & Beyond Solving Energies with Higher Order Cliques Pushmeet Kohli Pawan Kumar Philip H. S. Torr Oxford Brookes University CVPR 2007.
Graph Cut based Inference with Co-occurrence Statistics Ľubor Ladický, Chris Russell, Pushmeet Kohli, Philip Torr.
Linear Solution to Scale and Rotation Invariant Object Matching Professor: 王聖智 教授 Student : 周 節.
What Energy Functions Can be Minimized Using Graph Cuts? Shai Bagon Advanced Topics in Computer Vision June 2010.
Relaxations and Moves for MAP Estimation in MRFs M. Pawan Kumar STANFORDSTANFORD Vladimir KolmogorovPhilip TorrDaphne Koller.
Measuring Uncertainty in Graph Cut Solutions Pushmeet Kohli Philip H.S. Torr Department of Computing Oxford Brookes University.
What, Where & How Many? Combining Object Detectors and CRFs
MAP Estimation Algorithms in M. Pawan Kumar, University of Oxford Pushmeet Kohli, Microsoft Research Computer Vision - Part I.
Multiplicative Bounds for Metric Labeling M. Pawan Kumar École Centrale Paris École des Ponts ParisTech INRIA Saclay, Île-de-France Joint work with Phil.
Learning Based Hierarchical Vessel Segmentation
MRFs and Segmentation with Graph Cuts Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 02/24/10.
City University of Hong Kong 18 th Intl. Conf. Pattern Recognition Self-Validated and Spatially Coherent Clustering with NS-MRF and Graph Cuts Wei Feng.
Planar Cycle Covering Graphs for inference in MRFS The Typhon Algorithm A New Variational Approach to Ground State Computation in Binary Planar Markov.
CVPR 2013 Diversity Tutorial Closing Remarks: What can we do with multiple diverse solutions? Dhruv Batra Virginia Tech.
Leo Zhu CSAIL MIT Joint work with Chen, Yuille, Freeman and Torralba 1.
Multiplicative Bounds for Metric Labeling M. Pawan Kumar École Centrale Paris Joint work with Phil Torr, Daphne Koller.
Lena Gorelick joint work with O. Veksler I. Ben Ayed A. Delong Y. Boykov.
An Efficient Message-Passing Algorithm for the M-Best MAP Problem Dhruv Batra (Currently) Research Assistant Professor TTI-Chicago (Spring 2013) Assistant.
Algorithms for MAP estimation in Markov Random Fields Vladimir Kolmogorov University College London.
Associative Hierarchical CRFs for Object Class Image Segmentation Ľubor Ladický 1 1 Oxford Brookes University 2 Microsoft Research Cambridge Based on the.
Discrete Optimization Lecture 3 – Part 1 M. Pawan Kumar Slides available online
Probabilistic Inference Lecture 5 M. Pawan Kumar Slides available online
Dynamic Tree Block Coordinate Ascent Daniel Tarlow 1, Dhruv Batra 2 Pushmeet Kohli 3, Vladimir Kolmogorov 4 1: University of Toronto3: Microsoft Research.
Update any set S of nodes simultaneously with step-size We show fixed point update is monotone for · 1/|S| Covering Trees and Lower-bounds on Quadratic.
O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.
CVPR 2013 Diversity Tutorial Beyond MAP: Making Multiple Predictions: Diversity, DPPs and more. Dhruv Batra Virginia Tech Alex Kulesza Univ. of Michigan.
Beyond Trees: MRF Inference via Outer-Planar Decomposition [(To appear) CVPR ‘10] Dhruv Batra Carnegie Mellon University Joint work: Andrew Gallagher (Eastman.
Inference for Learning Belief Propagation. So far... Exact methods for submodular energies Approximations for non-submodular energies Move-making ( N_Variables.
1 Scale and Rotation Invariant Matching Using Linearly Augmented Tree Hao Jiang Boston College Tai-peng Tian, Stan Sclaroff Boston University.
Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.
Tightening LP Relaxations for MAP using Message-Passing David Sontag Joint work with Talya Meltzer, Amir Globerson, Tommi Jaakkola, and Yair Weiss.
Linear Solution to Scale and Rotation Invariant Object Matching Hao Jiang and Stella X. Yu Computer Science Department Boston College.
IEEE 2015 Conference on Computer Vision and Pattern Recognition Active Learning for Structured Probabilistic Models with Histogram Approximation Qing SunAnkit.
Image segmentation.
Rounding-based Moves for Metric Labeling M. Pawan Kumar École Centrale Paris INRIA Saclay, Île-de-France.
Alexander Shekhovtsov and Václav Hlaváč
ECE 5424: Introduction to Machine Learning
Nonparametric Semantic Segmentation
Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT
Group Norm for Learning Latent Structural SVMs
Janardhan Rao (Jana) Doppa, Alan Fern, and Prasad Tadepalli
Efficient Graph Cut Optimization for Full CRFs with Quantized Edges
Rob Fergus Computer Vision
Learning to Combine Bottom-Up and Top-Down Segmentation
Integration and Graphical Models
Diversity meets Deep Networks: Inference, Ensembles, and Applications
Expectation-Maximization & Belief Propagation
Discrete Inference and Learning
“Traditional” image segmentation
Presentation transcript:

The M-Best Mode Problem Dhruv Batra Research Assistant Professor TTI-Chicago Joint work with: Abner Guzman-Rivera (UIUC), Greg Shakhnarovich (TTIC), Payman Yadollahpour (TTIC).

slide credit: Fei-Fei Li, Rob Fergus & Antonio Torralba Local Ambiguity (C) Dhruv Batra slide credit: Fei-Fei Li, Rob Fergus & Antonio Torralba

Local Ambiguity “While hunting in Africa, I shot an elephant in my pajamas. How an elephant got into my pajamas, I’ll never know!” Groucho Marx (1930) (C) Dhruv Batra

Output-Space Explosion +1, -1 k Classes all graph-labelings Exponentially Many Classes (C) Dhruv Batra

Structured Output Segmentation (#Labels)#Pixels (C) Dhruv Batra [Batra et al. CVPR ‘10, IJCV ’11] [Batra et al. CVPR ’08], [Batra ICML ‘11, CVPR ‘11] (#Labels)#Pixels sky cow grass (C) Dhruv Batra

Structured Output Object Detection: parts-based models (#Pixels)#Parts [Felzenszwalb et al. PAMI ‘10], [Yang and Ramanan, ICCV ‘11] (#Pixels)#Parts (C) Dhruv Batra

Structured Output Dependency parsing |Sentence-Length||Sentence-length|-2 (C) Dhruv Batra Figure courtesy Rush & Collins NIPS11

Conditional Random Fields Discrete random variables Factored-Exponential Model X1 X2 … Xn Xi kx1 1 1 10 0 kxk 10 Edge Energies / Distributed Prior Node Energies / Local Costs (C) Dhruv Batra

MAP Inference In general NP-hard [Shimony ‘94] Approximate Inference Heuristics: Loopy BP [Pearl, ‘88] Greedy: α-Expansion [Boykov ’01, Komodakis ‘05] LP Relaxations: [Schlesinger ‘76, Wainwright ’05, Sontag ’08, Batra ‘10] QP/SDP Relaxations: [Ravikumar ’06, Kumar ‘09] (C) Dhruv Batra

This is a job for Optimization Man MAP Inference This is a job for Optimization Man In general NP-hard [Shimony ‘94] Approximate Inference Heuristics: Loopy BP [Pearl, ‘88] Greedy: α-Expansion [Boykov ’01, Komodakis ‘05] LP Relaxations: [Schlesinger ‘76, Wainwright ’05, Sontag ’08, Batra ‘10] QP/SDP Relaxations: [Ravikumar ’06, Kumar ‘09] (C) Dhruv Batra

I have a new Fancy Approximate Inference Alg. Worship Me! (C) Dhruv Batra

MAP ≠ Ground-truth Large-scale studies “the global OPT does not solve many of the problems in the BP or Graph Cuts solutions.” [Meltzer, Yanover, Weiss ICCV05] “the ground truth has substantially lower score [than MAP]” [Szeliski et al. PAMI08] Implication: Models are inaccurate. Ground-Truth (C) Dhruv Batra

✓ Possible Solution Ask for more than MAP! Better Problem: Flerova et al., 2011 Rollon et al., 2011 Fromer et al., 2009 Yanover et al., 2003 Nilsson,1998 Seroussi et al., 1994 Lawler, 1972 M-Best MAP Problem Better Problem: M-Best Modes ✓ (C) Dhruv Batra

Formulation Over-Complete Representation Inconsistent (C) Dhruv Batra 1 kx1 1 1 Inconsistent 1000000000000000 0100000000000000 k2x1 (C) Dhruv Batra

Formulation Score = Dot Product kx1 k2x1 (C) Dhruv Batra

Formulation MAP Integer Program Black-Box (C) Dhruv Batra

Formulation 2nd-Best Mode MAP 2nd-Mode MAP (C) Dhruv Batra

Approach 2nd-Best Mode Lagrangian Relaxation Primal Diversity-Augmented Score 2nd-Best Mode Lagrangian Relaxation Convergence & other guarantees Large class of Delta-functions allowed See paper for details Primal Dualize Dual Binary Search in 1-D Subgradient Descent in N-D Primal-OPT Convex (Non-smooth) Upper-Bound on Primal-OPT (C) Dhruv Batra

Dot-Product Dissimilarity Diversity Augmented Inference: For integral solution, equivalent to Hamming! 1 Simply edit node-terms. Reuse MAP machinery! (C) Dhruv Batra

Theorem Statement Theorem [Batra et al ’12]: Lagrangian Dual corresponds to solving the Relaxed Primal: Based on result from [Geoffrion ‘74] Dual Relaxed Primal (C) Dhruv Batra

How Much Diversity? Empirical Solution: Cross-Val for More Efficient: Cross-Val for (C) Dhruv Batra

Experiment #1 Interactive Segmentation Model from [Batra et al. CVPR’10] Image + Scribbles MAP 2nd Best MAP 2nd Best Mode (C) Dhruv Batra

Experiment #1 Better MAP (C) Dhruv Batra

Experiment #2 Pose Estimation (C) Dhruv Batra

Experiment #2 Mixture of Parts Model Model from [Yang, Ramanan, ICCV ‘11] Tree of Parts Histogram of Oriented Gradient (HOG) Features (C) Dhruv Batra

Experiment #2 Pose Tracking w/ Chain CRF M-Modes (C) Dhruv Batra

Experiment #2 MAP M-Modes + Viterbi (C) Dhruv Batra

Experiment #2 Accuracy #Modes / Frame M-Modes 25% Better Baseline #1 (C) Dhruv Batra

Experiment #3 Pascal Segmentation Challenge 20 categories + background Competitive international challenge (2007-2012) (C) Dhruv Batra

Experiment #3 Hierarchical CRF model [Ladicky et al. ECCV ‘10, BMVC ’10, ICCV ‘09] Pixel potential: textons, color, HOG Pairwise potentials between pixels: Potts Segment potentials: histogram of pixel features Pairwise potentials between segments (C) Dhruv Batra

Examples: Test Set Input MAP Best Mode (C) Dhruv Batra

Experiment #3 Accuracy #Modes / Image M-Modes Baseline Better State of the art Accuracy Baseline MAP #Modes / Image (C) Dhruv Batra

Future Directions M-Best Modes More applications Object Detection, Medical Segmentation Cascaded Models with Modes passed on General Trick for Combinatorial Structures Step 1 Step 2 Step 3 Top M hypotheses (C) Dhruv Batra

Future Directions M-Best Modes Improved Learning with Modes Posterior Summaries with Modes (C) Dhruv Batra

Take-Away Message (Part #1) Think about YOUR problem. Are you or a loved one, tired of a single solution? If yes, then M-Modes might be right for you!* * M-Modes is not suited for everyone. People with perfect models, and love of continuous variables should not use M-Modes. Consult your local optimization expert before starting M-Modes. Please do not drive or operate heavy machinery while on M-Modes. (C) Dhruv Batra

Thank You! M-Best Modes Payman Yadollahpour (TTIC) Abner Guzman-Rivera (UIUC) Greg Shakhnarovich (TTIC)

(C) Dhruv Batra

slide credit: Andrew Gallagher Local Ambiguity [Smyth et al., 1994] (C) Dhruv Batra slide credit: Andrew Gallagher

|Patch-Dictionary|#Patches Structured Output Super-Resolution [Baker, Kanade, PAMI ‘02], [Freeman et al, IJCV ‘00] |Patch-Dictionary|#Patches (C) Dhruv Batra

Figure courtesy Yanover & Weiss NIPS02 Structured Output Protein Side-Chain Prediction (#Angles)#Sites (C) Dhruv Batra Figure courtesy Yanover & Weiss NIPS02

Applications What can we do with multiple solutions? More choices for “human/expert in the loop” (C) Dhruv Batra

Applications What can we do with multiple solutions? More choices for “human/expert in the loop” Input to next system in cascade Top M Top M Step 1 Step 2 Step 3 hypotheses hypotheses (C) Dhruv Batra

Applications What can we do with multiple solutions? More choices for “human in the loop” Rank solutions [Carreira and Sminchisescu, CVPR10] State-of-art segmentation on PASCAL Challenge 2011 ~10,000 (C) Dhruv Batra

Dissimilarity A number of special cases 0-1 Dissimilarity M-Best MAP Large class of Delta-functions allowed Hamming distance Higher-Order Dissimilarity (C) Dhruv Batra

Higher-Order Dissimilarity Cardinality Potential Efficient Inference Cardinality [Tarlow ‘10] Lower Linear envelop [Kohli ‘10] Pattern Potentials [Rother ‘10] (C) Dhruv Batra

Example Results (C) Dhruv Batra

Examples: Validation Set Input Ground-Truth MAP Best Mode (C) Dhruv Batra

Experiment #3 (C) Dhruv Batra

Experiment #3 (C) Dhruv Batra