Learning a Region-based Scene Segmentation Model

Slides:



Advertisements
Similar presentations
Self-Paced Learning for Semantic Segmentation
Advertisements

POSE–CUT Simultaneous Segmentation and 3D Pose Estimation of Humans using Dynamic Graph Cuts Mathieu Bray Pushmeet Kohli Philip H.S. Torr Department of.
O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.
Solving Markov Random Fields using Second Order Cone Programming Relaxations M. Pawan Kumar Philip Torr Andrew Zisserman.
Automatic Photo Pop-up Derek Hoiem Alexei A.Efros Martial Hebert Carnegie Mellon University.
Weakly supervised learning of MRF models for image region labeling Jakob Verbeek LEAR team, INRIA Rhône-Alpes.
Learning Specific-Class Segmentation from Diverse Data M. Pawan Kumar, Haitherm Turki, Dan Preston and Daphne Koller at ICCV 2011 VGG reading group, 29.
Top-Down & Bottom-Up Segmentation
Online Max-Margin Weight Learning for Markov Logic Networks Tuyen N. Huynh and Raymond J. Mooney Machine Learning Group Department of Computer Science.
Constrained Approximate Maximum Entropy Learning (CAMEL) Varun Ganapathi, David Vickrey, John Duchi, Daphne Koller Stanford University TexPoint fonts used.
Scene Labeling Using Beam Search Under Mutex Constraints ID: O-2B-6 Anirban Roy and Sinisa Todorovic Oregon State University 1.
Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.
Max-Margin Latent Variable Models M. Pawan Kumar.
1 Building a Dictionary of Image Fragments Zicheng Liao Ali Farhadi Yang Wang Ian Endres David Forsyth Department of Computer Science, University of Illinois.
GrabCut Interactive Image (and Stereo) Segmentation Carsten Rother Vladimir Kolmogorov Andrew Blake Antonio Criminisi Geoffrey Cross [based on Siggraph.
GrabCut Interactive Foreground Extraction using Iterated Graph Cuts Carsten Rother Vladimir Kolmogorov Andrew Blake Microsoft Research Cambridge-UK.
GrabCut Interactive Foreground Extraction using Iterated Graph Cuts Carsten Rother Vladimir Kolmogorov Andrew Blake Microsoft Research Cambridge-UK.
A Gimp Plugin that uses “GrabCut” to perform image segmentation
Stephen J. Guy 1. Photomontage Photomontage GrabCut – Interactive Foreground Extraction 1.
GrabCut Interactive Image (and Stereo) Segmentation Joon Jae Lee Keimyung University Welcome. I will present Grabcut – an Interactive tool for foreground.
Hierarchical Saliency Detection School of Electronic Information Engineering Tianjin University 1 Wang Bingren.
Optimization & Learning for Registration of Moving Dynamic Textures Junzhou Huang 1, Xiaolei Huang 2, Dimitris Metaxas 1 Rutgers University 1, Lehigh University.
Learning to Detect A Salient Object Reporter: 鄭綱 (3/2)
Robust Higher Order Potentials For Enforcing Label Consistency
Schedule Introduction Models: small cliques and special potentials Tea break Inference: Relaxation techniques:
P 3 & Beyond Solving Energies with Higher Order Cliques Pushmeet Kohli Pawan Kumar Philip H. S. Torr Oxford Brookes University CVPR 2007.
Motion Detail Preserving Optical Flow Estimation Li Xu 1, Jiaya Jia 1, Yasuyuki Matsushita 2 1 The Chinese University of Hong Kong 2 Microsoft Research.
The Layout Consistent Random Field for Recognizing and Segmenting Partially Occluded Objects By John Winn & Jamie Shotton CVPR 2006 presented by Tomasz.
Recovering Articulated Object Models from 3D Range Data Dragomir Anguelov Daphne Koller Hoi-Cheung Pang Praveen Srinivasan Sebastian Thrun Computer Science.
Graph Cut based Inference with Co-occurrence Statistics Ľubor Ladický, Chris Russell, Pushmeet Kohli, Philip Torr.
What Energy Functions Can be Minimized Using Graph Cuts? Shai Bagon Advanced Topics in Computer Vision June 2010.
Learning to Segment from Diverse Data M. Pawan Kumar Daphne KollerHaithem TurkiDan Preston.
Transfer Learning of Object Classes: From Cartoons to Photographs NIPS Workshop Inductive Transfer: 10 Years Later Geremy Heitz Gal Elidan Daphne Koller.
Hierarchical Graph Cuts for Semi-Metric Labeling M. Pawan Kumar Joint work with Daphne Koller.
Automatic Photo Popup Derek Hoiem Alexei A. Efros Martial Hebert Carnegie Mellon University.
3D Scene Models Object recognition and scene understanding Krista Ehinger.
Probabilistic Inference Lecture 4 – Part 2 M. Pawan Kumar Slides available online
MRFs and Segmentation with Graph Cuts Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 02/24/10.
Planar Cycle Covering Graphs for inference in MRFS The Typhon Algorithm A New Variational Approach to Ground State Computation in Binary Planar Markov.
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
Rounding-based Moves for Metric Labeling M. Pawan Kumar Center for Visual Computing Ecole Centrale Paris.
Learning a Small Mixture of Trees M. Pawan Kumar Daphne Koller Aim: To efficiently learn a.
#MOTION ESTIMATION AND OCCLUSION DETECTION #BLURRED VIDEO WITH LAYERS
Supervised Learning of Edges and Object Boundaries Piotr Dollár Zhuowen Tu Serge Belongie.
Markov Random Fields Probabilistic Models for Images
Discrete Optimization Lecture 3 – Part 1 M. Pawan Kumar Slides available online
Probabilistic Inference Lecture 5 M. Pawan Kumar Slides available online
Efficient Discriminative Learning of Parts-based Models M. Pawan Kumar Andrew Zisserman Philip Torr
Associative Hierarchical CRFs for Object Class Image Segmentation
O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.
Non-Ideal Iris Segmentation Using Graph Cuts
Learning from Big Data Lecture 5
CS654: Digital Image Analysis Lecture 28: Advanced topics in Image Segmentation Image courtesy: IEEE, IJCV.
Machine Vision ENT 273 Regions and Segmentation in Images Hema C.R. Lecture 4.
MAP Estimation of Semi-Metric MRFs via Hierarchical Graph Cuts M. Pawan Kumar Daphne Koller Aim: To obtain accurate, efficient maximum a posteriori (MAP)
Parsing Natural Scenes and Natural Language with Recursive Neural Networks INTERNATIONAL CONFERENCE ON MACHINE LEARNING (ICML 2011) RICHARD SOCHER CLIFF.
Machine Vision ENT 273 Lecture 4 Hema C.R.
Markov Random Fields with Efficient Approximations
Nonparametric Semantic Segmentation
Saliency detection Donghun Yeo CV Lab..
Computer Vision Lecture 12: Image Segmentation II
Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT
Efficiently Selecting Regions for Scene Understanding
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
Efficient Graph Cut Optimization for Full CRFs with Quantized Edges
Learning to Combine Bottom-Up and Top-Down Segmentation
CSSE463: Image Recognition Day 23
Tractable MAP Problems
“The Truth About Cats And Dogs”
“Traditional” image segmentation
Presentation transcript:

Learning a Region-based Scene Segmentation Model F O R D Learning a Region-based Scene Segmentation Model M. Pawan Kumar Daphne Koller

Aim To learn an accurate scene segmentation model Divide image into non-overlapping regions Assign each region to a semantic class Features extracted from each region Spatial prior

Why Regions? Black or Brown? Shape?

Why Regions? Black or Brown? Shape?

Why Regions? Two concentric circles (shape) Inner circle is metallic (texture) Outer circle is black (color)

Which Regions? Bottom-up Over-segmentation Mean-Shift, N-cuts Too small to capture useful cues Not faithful to boundaries between scene entities

Which Regions? Choose regions according to a global energy function Regions that give the best accuracy

Outline Region-based Segmentation Model Learning from Coarse Labels Inference Results

Region-based Segmentation Model G = (V(fP),E(fP)) fR: Regions  Classes fP: Pixels  Regions Unknown Number Gould et al., 2009

Region-based Segmentation Model E(f) = ∑w1(fR(r))T r(fP=r) + ∑w2T rs(fP=r,fP=s) Region Features Region Pairwise Features Boundary Contrast Shape, Texture, Color Internal Pixels Contrast Fraction of pixels above/below horizon w : Model parameters to be learnt Gould et al., 2009

Outline Region-based Segmentation Model Learning from Coarse Labels Inference Results

Ground Truth Labeling NO Amazon’s Mechanical Turk Are they the best regions? Car Top-half (transparent) Bottom-half (solid) Wheels (circular) Tree Crown (leafy) Trunk (brown)

Ground Truth Labeling Desired ground truth f* Coarser version fC

Learning with Coarse Labels Refined labeling fR faithful to coarse label fC ∑ iwTi* – log Zi(w) HUGE summation!! Marginalize over fR?? Very difficult!! MAP is bread-and-butter of Vision An accurate MAP estimation algorithm !!!

Learning with Coarse Labels Use max-margin learning framework At each iteration: Complete the ground truth Find the most violated constraint Approximate MAP Inference

Outline Region-based Segmentation Model Learning from Coarse Labels Inference Results

Region Selection Image Dictionary of Regions Human Annotation

Region Selection Image Dictionary of Regions Each super-pixel covered by exactly one selected region Super-Pixels

Integer Program miny ∑ r(i)yr(i) + ∑ rs(i,j)yrs(i,j) Binary yr(0) = 1 iff r is not selected Binary yr(1) = 1 iff r is selected miny ∑ r(i)yr(i) + ∑ rs(i,j)yrs(i,j) Minimize the energy s.t. yr(0) + yr(1) = 1 Assign one label to r from L yrs(i,0) + yrs(i,1) = yr(i) Ensure yrs(i,j) = yr(i)ys(j) yrs(0,j) + yrs(1,j) = ys(j) ∑r “covers” u yr(1) = 1 Each super-pixel is covered by exactly one selected region yr(i), yrs(i,j)  {0,1} Binary variables

Linear Program miny ∑ r(i)yr(i) + ∑ rs(i,j)yrs(i,j) Binary yr(0) = 1 iff r is not selected Binary yr(1) = 1 iff r is selected miny ∑ r(i)yr(i) + ∑ rs(i,j)yrs(i,j) s.t. yr(0) + yr(1) = 1 yrs(i,0) + yrs(i,1) = yr(i) yrs(0,j) + yrs(1,j) = ys(j) ∑r “covers” u yr(1) = 1 yr(i), yrs(i,j)  [0,1]

Linear Program Clique of overlapping and neighboring regions (non-submodular potentials) Mutual exclusivity Covering constraint Computationally expensive? Efficient Dual Decomposition

The Learning Approach At each iteration Given (i) fC (coarse labeling) (ii) current set of parameters Find fR (refined labeling) Region Partitioning Problem Most violated constraint Region Partitioning + Label assignment

Outline Region-based Segmentation Model Learning from Coarse Labels Inference Results

Stanford Background Dataset 715 outdoor scenes 7 background + 1 foreground Amazon’s Mechanical Turk Available for download: http://www.stanford.edu/~sgould

Completing the Ground Truth

Completing the Ground Truth

Completing the Ground Truth

Pixel-wise Accuracy Pixel-based Model: 67.71% Human Labeled Regions: 64.85% Our Approach: 72.49%

Examples

Examples

Examples

Summary Coarse labels are easy to obtain Refined labels are easy to train with Marginalization is not possible Approximate MAP (required for testing) is used to complete the labeling

Future Work Convergence analysis Region features + context Learning with Different Labelings

http://dags.stanford.edu Questions?