Learning Specific-Class Segmentation from Diverse Data. M. Pawan Kumar, Haithem Turki, Dan Preston and Daphne Koller, ICCV 2011. VGG reading group, 29 Nov 2011.

Presentation transcript:

Learning Specific-Class Segmentation from Diverse Data. M. Pawan Kumar, Haithem Turki, Dan Preston and Daphne Koller, ICCV 2011. VGG reading group, 29 Nov 2011, presented by Varun Gulshan.

Semantic image segmentation

Main idea. High level: fully labelled training data is expensive to obtain, so learn from other, easily available 'diverse' data as well (bounding boxes, image-level classification labels). Examples of such annotation: the image tags "Car, people"; a person bounding box.

Implementing the idea. The bounding box and image classification data are incomplete for segmentation, so the missing information is filled in using latent variables. Set up the training cost function using latent variables. Use the authors' self-paced learning algorithm for latent SVMs [NIPS 2010] to optimise the training cost function. While inferring latent variables, make sure the estimates are consistent with the weak annotation; the inference problems are set up to ensure this condition.

Energy function without latent variables. Notation: image; parameters to be trained; joint feature vector (essentially the terms of a CRF).
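The symbols on this slide did not survive in the transcript. As a sketch only, assuming the usual structured-prediction notation (image x, labelling y, parameters w, joint feature vector Ψ, all of which are assumed names here), a model that is linear in its parameters can be written as below. Whether it is read as a score to maximise or an energy to minimise is a sign convention the transcript does not settle.

```latex
% Assumed notation: x = image, y = labelling, w = parameters to be trained,
% \Psi(x, y) = joint feature vector collecting the CRF terms.
\begin{align}
  S(y; x, w) &= w^{\top} \Psi(x, y), \qquad E(y; x, w) = -S(y; x, w), \\
  y^{*} &= \arg\max_{y} S(y; x, w) = \arg\min_{y} E(y; x, w).
\end{align}
```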

Structured output training. Notation: ground truth labels; loss function.
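The training objective itself is not preserved here. A standard margin-rescaled structural SVM, given as a sketch under assumed notation (ground truth labels y_k for image x_k, loss function Δ, slack variables ξ_k, regularisation constant C) and using the score convention, would read:

```latex
% Assumed notation: (x_k, y_k) = k-th training image and its ground truth labels,
% \Delta(y_k, y) = loss of predicting y, \xi_k = slack, C = regularisation constant.
\begin{align}
  \min_{w,\; \xi \ge 0} \quad & \tfrac{1}{2} \lVert w \rVert^{2} + C \sum_{k} \xi_{k} \\
  \text{s.t.} \quad & w^{\top} \Psi(x_k, y_k) - w^{\top} \Psi(x_k, y)
      \;\ge\; \Delta(y_k, y) - \xi_{k} \qquad \forall\, y,\ \forall\, k .
\end{align}
```

Each constraint asks the ground truth to outscore every other labelling by a margin that grows with the loss of that labelling.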

Introducing latent variables

But we don't know what h_k is (it's latent), so maximise it out.
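As a sketch of what "maximise it out" could look like, assuming the joint feature vector is extended to Ψ(x, y, h) with latent variables h, the ground-truth side of each margin constraint takes the best latent completion h_k. The notation is assumed, in the style of a latent structural SVM; the slide's own equation is not preserved.

```latex
% Assumed notation: h = latent variables, h_k = latent completion for sample k.
\begin{align}
  \min_{w,\; \xi \ge 0} \quad & \tfrac{1}{2} \lVert w \rVert^{2} + C \sum_{k} \xi_{k} \\
  \text{s.t.} \quad & \max_{h_k} w^{\top} \Psi(x_k, y_k, h_k) - w^{\top} \Psi(x_k, y, h)
      \;\ge\; \Delta(y_k, y) - \xi_{k} \qquad \forall\, y,\ \forall\, h,\ \forall\, k .
\end{align}
```

The inner maximisation makes the problem non-convex in w, which is one reason the order in which samples are introduced during optimisation can matter.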

Introducing latent variables

Self-paced optimisation

Indicator variable to switch off the harder cases.
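A minimal sketch of how such an indicator could be set, assuming the self-paced objective adds a term of the form -(1/K) * sum(v_i) so that, with the model fixed, a sample is kept only when its weighted slack falls below 1/K. The function names, the constant C and the annealing factor mu are hypothetical and chosen for illustration.

```python
from typing import List


def select_easy(slacks: List[float], C: float, K: float) -> List[int]:
    """Return indicators v_i: 1 keeps sample i, 0 switches it off.

    Assumed rule: v_i = 1 exactly when C * slack_i < 1 / K.
    """
    threshold = 1.0 / K
    return [1 if C * s < threshold else 0 for s in slacks]


def anneal(K: float, mu: float = 1.3) -> float:
    """Shrink K so the threshold 1/K grows and harder samples are admitted.

    mu > 1 is a hypothetical annealing factor.
    """
    return K / mu


if __name__ == "__main__":
    slacks = [0.1, 0.5, 2.0]          # hypothetical per-sample slack values
    print(select_easy(slacks, C=1.0, K=2.0))   # [1, 0, 0]
    print(select_easy(slacks, C=1.0, K=0.4))   # [1, 1, 1]
```

In a full training loop the selection step would alternate with re-fitting the parameters on the selected samples, with K annealed between rounds until every sample is included.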

Second idea: latent variable estimation. The algorithm involves estimating annotation-consistent latent variables in the following equation: More precisely
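The equation on this slide is not preserved. To illustrate the notion of annotation consistency only, here is a toy sketch for image-level tags. It assumes one plausible reading of consistency (every tagged class appears in the labelling and no untagged class does) and uses exhaustive search over per-pixel unary scores. It is not the inference procedure of the paper, and all names and values are hypothetical.

```python
from itertools import product
from typing import Dict, List, Optional, Sequence, Set, Tuple


def is_consistent(labels: Sequence[str], tags: Set[str]) -> bool:
    """Assumed rule: the set of classes used equals the set of image tags."""
    return set(labels) == tags


def best_consistent_labelling(
    unary: List[Dict[str, float]], tags: Set[str]
) -> Tuple[Optional[Tuple[str, ...]], float]:
    """Highest-scoring labelling among the annotation-consistent ones.

    unary[i][c] is a hypothetical score for giving pixel i the class c.
    Brute force over all labellings, so only usable on tiny inputs.
    """
    classes = sorted(tags)
    best, best_score = None, float("-inf")
    for labels in product(classes, repeat=len(unary)):
        if not is_consistent(labels, tags):
            continue  # skip labellings that contradict the weak annotation
        score = sum(unary[i][c] for i, c in enumerate(labels))
        if score > best_score:
            best, best_score = labels, score
    return best, best_score


if __name__ == "__main__":
    unary = [
        {"car": 2.0, "person": 1.0},
        {"car": 3.0, "person": 0.5},
        {"car": 1.0, "person": 0.9},
    ]
    labels, score = best_consistent_labelling(unary, {"car", "person"})
    print(labels)  # ('car', 'car', 'person')
```

Without the constraint every pixel would be labelled "car"; the constraint forces the cheapest pixel to switch to "person", which is the kind of behaviour an annotation-consistent estimate is meant to have.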

Move to the whiteboard. [Cartoon with the labels "Me", "You" and "Beware of Equations"]