Layered Object Detection for Multi-Class Image Segmentation UC Irvine Yi Yang Sam Hallman Deva Ramanan Charless Fowlkes.

Slides:

Advertisements

Similar presentations

Semantic Contours from Inverse Detectors Bharath Hariharan et.al. (ICCV-11)

Advertisements

The Layout Consistent Random Field for detecting and segmenting occluded objects CVPR, June 2006 John Winn Jamie Shotton.

Presenter: Duan Tran (Part of slides are from Pedro’s)

Pose Estimation and Segmentation of People in 3D Movies Karteek Alahari, Guillaume Seguin, Josef Sivic, Ivan Laptev Inria, Ecole Normale Superieure ICCV.

Spectral graph reduction for image and streaming video segmentation Fabio Galasso 1 Margret Keuper 2 Thomas Brox 2 Bernt Schiele 1 1 Max Planck Institute.

Ľubor Ladický1 Phil Torr2 Andrew Zisserman1

Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.

Part 4: Combined segmentation and recognition by Rob Fergus (MIT)

Learning to Combine Bottom-Up and Top-Down Segmentation Anat Levin and Yair Weiss School of CS&Eng, The Hebrew University of Jerusalem, Israel.

Shape Sharing for Object Segmentation

Recovering Human Body Configurations: Combining Segmentation and Recognition Greg Mori, Xiaofeng Ren, and Jitentendra Malik (UC Berkeley) Alexei A. Efros.

Large Scale Visual Recognition Challenge (ILSVRC) 2013: Detection spotlights.

Many slides based on P. FelzenszwalbP. Felzenszwalb General object detection with deformable part-based models.

Parsing Clothing in Fashion Photographs

Steerable Part Models Hamed Pirsiavash and Deva Ramanan

Multi-Local Feature Manifolds for Object Detection Oscar Danielsson Stefan Carlsson Josephine Sullivan

Qualifying Exam: Contour Grouping Vida Movahedi Supervisor: James Elder Supervisory Committee: Minas Spetsakis, Jeff Edmonds York University Summer 2009.

LOCUS (Learning Object Classes with Unsupervised Segmentation) A variational approach to learning model- based segmentation. John Winn Microsoft Research.

1 P. Arbelaez, M. Maire, C. Fowlkes, J. Malik. Contour Detection and Hierarchical image Segmentation. IEEE Trans. on PAMI, Student: Hsin-Min Cheng.

Ghunhui Gu, Joseph J. Lim, Pablo Arbeláez, Jitendra Malik University of California at Berkeley Berkeley, CA

Contour Based Approaches for Visual Object Recognition Jamie Shotton University of Cambridge Joint work with Roberto Cipolla, Andrew Blake.

Biased Normalized Cuts 1 Subhransu Maji and Jithndra Malik University of California, Berkeley IEEE Conference on Computer Vision and Pattern Recognition.

More sliding window detection: Discriminative part-based models Many slides based on P. FelzenszwalbP. Felzenszwalb.

Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.

1. Introduction Humanising GrabCut: Learning to segment humans using the Kinect Varun Gulshan, Victor Lempitksy and Andrew Zisserman Dept. of Engineering.

Good morning, everyone, thank you for coming to my presentation.

Statistical Recognition Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Kristen Grauman.

Abstract We present a model of curvilinear grouping using piecewise linear representations of contours and a conditional random field to capture continuity.

Measuring the Ecological Statistics of Figure-Ground Charless Fowlkes, David Martin, Jitendra Malik.

TextonBoost : Joint Appearance, Shape and Context Modeling for Multi-Class Object Recognition and Segmentation J. Shotton*, J. Winn†, C. Rother†, and A.

1 Learning to Detect Natural Image Boundaries David Martin, Charless Fowlkes, Jitendra Malik Computer Science Division University of California at Berkeley.

Oxford Brookes Seminar Thursday 3 rd September, 2009 University College London1 Representing Object-level Knowledge for Segmentation and Image Parsing:

A Database of Human Segmented Natural Images and Two Applications David Martin, Charless Fowlkes, Doron Tal, Jitendra Malik UC Berkeley

Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.

Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.

Cue Integration in Figure/Ground Labeling Xiaofeng Ren, Charless Fowlkes and Jitendra Malik, U.C. Berkeley We present a model of edge and region grouping.

What, Where & How Many? Combining Object Detectors and CRFs

Generic object detection with deformable part-based models

Hamed Pirsiavash, Deva Ramanan, Charless Fowlkes

The Three R’s of Vision Jitendra Malik.

1 Mean shift and feature selection ECE 738 course project Zhaozheng Yin Spring 2005 Note: Figures and ideas are copyrighted by original authors.

Unsupervised Object Segmentation with a Hybrid Graph Model (HGM) Reporter: 鄭綱 (6/14)

A General Framework for Tracking Multiple People from a Moving Camera

“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)

Detecting Curved Symmetric Parts using a Deformable Disc Model Tom Sie Ho Lee, University of Toronto Sanja Fidler, TTI Chicago Sven Dickinson, University.

Detection, Segmentation and Fine-grained Localization

#MOTION ESTIMATION AND OCCLUSION DETECTION #BLURRED VIDEO WITH LAYERS

Object Detection with Discriminatively Trained Part Based Models

Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.

Announcements  Final exam: Friday Dec 19, 8am-11am, 1 Pimentel  Closed-book, closed-notes except 1 page of notes  Grading  Your midterm score is replaced.

Deformable Part Model Presenter ： Liu Changyu Advisor ： Prof. Alex Hauptmann Interest ： Multimedia Analysis April 11 st, 2013.

Deformable Part Models (DPM) Felzenswalb, Girshick, McAllester & Ramanan (2010) Slides drawn from a tutorial By R. Girshick AP 12% 27% 36% 45% 49% 2005.

Associative Hierarchical CRFs for Object Class Image Segmentation Ľubor Ladický 1 1 Oxford Brookes University 2 Microsoft Research Cambridge Based on the.

IIIT Hyderabad Learning Semantic Interaction among Graspable Objects Swagatika Panda, A.H. Abdul Hafez, C.V. Jawahar Center for Visual Information Technology,

O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.

CS 1699: Intro to Computer Vision Detection II: Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 12, 2015.

Recognition Using Visual Phrases

Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.

Part 4: combined segmentation and recognition Li Fei-Fei.

Rich feature hierarchies for accurate object detection and semantic segmentation 2014 IEEE Conference on Computer Vision and Pattern Recognition Ross Girshick,

Learning Hierarchical Features for Scene Labeling Cle’ment Farabet, Camille Couprie, Laurent Najman, and Yann LeCun by Dong Nie.

1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.

Object detection with deformable part-based models

Nonparametric Semantic Segmentation

R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.

Bilinear Classifiers for Visual Recognition

Learning to Combine Bottom-Up and Top-Down Segmentation

“The Truth About Cats And Dogs”

Outline Background Motivation Proposed Model Experimental Results

Presentation transcript:

Layered Object Detection for Multi-Class Image Segmentation UC Irvine Yi Yang Sam Hallman Deva Ramanan Charless Fowlkes

Introduction PASCAL competition 20 object categories + “background” Goal Desired result

Introduction

* We use part-based detectors from Felzenszwalb, Gorelick, McAllester, & Ramanan PAMI 09

Layered Representation Model each detection in “2.1D”: detections ordered by depth Desired result

Related work Related work combining segmentation and recognition – Biasing segmentation output based on object models [Yu et al. 03, Ramanan 06, Kumar et al. 05] – Iterating between bottom-up and top-down cues [Leibe et al. 04, Tu et al. 05] Related work on layers – Work in the video domain [Wang & Adelson 94, Jojic & Frey 01, Kumar et al. 04] – Some work in image domain [Gao et al. 07, Nitzberg et al. 93]

Issues Need detector thresholds Need comparable confidence scores

Detector calibration Find optimal threshold for pixel labeling

Model

Inference: coordinate descent Step 1: Step 2: (fit color models to segments and background) (per-pixel segmentation given shape prior & color likelihood) Color models are learned on-the-fly for each detection instance (e.g., people may wear blue or red shirts)

Use hierarchical segmentation of Arbelaez, Maire, Fowlkes, Malik (2009) at a threshold which generates ~200 segments per image. Bottom-up segmentation

Inference: coordinate descent Step 1: Step 2: (fit color models to segments and background) (per-pixel segmentation given shape prior & color likelihood) = set of pixel labelings consistent with superpixel map

Building P(z i |d π ) person detections true positives ground truth segmentations shape prior for the person class

Shape priors (cont’d)

Bicycle part-based priors

Motorcycle part-based priors

Horse part-based prior

Bottle part-based priors

Building P(z i |d π )

[ 1, 0, 0, 0, 0, 0 ] [ 0.1, 0.9, 0, 0, 0, 0 ] [ 0.099, 0.891, 0.01, 0, 0, 0 ]

Building P(z i |d π ) [ 1, 0, 0, 0, 0, 0 ] [ 0.1, 0.9, 0, 0, 0, 0 ]

Building P(z i |d π ) [ 1, 0, 0, 0, 0, 0 ] [ 0.1, 0.9, 0, 0, 0, 0 ] [ 0.099, 0.891, 0.01, 0, 0, 0 ]

Building P(z i |d π )

The algorithm There is no secret method for finding π… Just try all of them! Luckily, the number of permutations to test is usually small. :

Results!

PASCAL 2009 Segmentation Challenge Performs well where detector works well (objects with well defined, fairly rigid shapes)

Color helps person segmentation Superpixels really help bikes Parts provide small but noticeable improvement What aspects of the model are useful?

Does ordering help? Default ordering based on detector score works fairly well (after calibration) PASCAL segmentation benchmark limitations: – images often only have a single object – scoring is per-class rather than per-instance Ordering between same class detections doesn’t affect benchmark score We found ordering helped on a subset of PASCAL with overlapping objects

Conclusions A simple layered model for compiling multi- object detections into segmentations – Deformable shape prior built on part-based detector – Per-instance color model Provides a globally consistent 2.1D interpretation Good performance on PASCAL segmentation benchmark

Thanks!