Object-Graphs for Context-Aware Category Discovery Yong Jae Lee and Kristen Grauman University of Texas at Austin 1.

Slides:

Advertisements

Similar presentations

Automatic Photo Pop-up Derek Hoiem Alexei A.Efros Martial Hebert Carnegie Mellon University.

Advertisements

Weakly supervised learning of MRF models for image region labeling Jakob Verbeek LEAR team, INRIA Rhône-Alpes.

Context-based object-class recognition and retrieval by generalized correlograms by J. Amores, N. Sebe and P. Radeva Discussion led by Qi An Duke University.

Adding Unlabeled Samples to Categories by Learned Attributes Jonghyun Choi Mohammad Rastegari Ali Farhadi Larry S. Davis PPT Modified By Elliot Crowley.

Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.

Interactively Co-segmentating Topically Related Images with Intelligent Scribble Guidance Dhruv Batra, Carnegie Mellon University Adarsh Kowdle, Cornell.

Shape Sharing for Object Segmentation

Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.

Wrap Up. We talked about Filters Edges Corners Interest Points Descriptors Image Stitching Stereo SFM.

LARGE-SCALE IMAGE PARSING Joseph Tighe and Svetlana Lazebnik University of North Carolina at Chapel Hill road building car sky.

1 Part 1: Classical Image Classification Methods Kai Yu Dept. of Media Analytics NEC Laboratories America Andrew Ng Computer Science Dept. Stanford University.

INTRODUCTION Heesoo Myeong, Ju Yong Chang, and Kyoung Mu Lee Department of EECS, ASRI, Seoul National University, Seoul, Korea Learning.

Real-Time Human Pose Recognition in Parts from Single Depth Images Presented by: Mohammad A. Gowayyed.

Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.

Boundary Preserving Dense Local Regions

Watching Unlabeled Video Helps Learn New Human Actions from Very Few Labeled Snapshots Chao-Yeh Chen and Kristen Grauman University of Texas at Austin.

Bag-of-features models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Unsupervised Learning of Visual Taxonomies IEEE conference on CVPR 2008 Evgeniy Bart – Caltech Ian Porteous – UC Irvine Pietro Perona – Caltech Max Welling.

Recognition: A machine learning approach

Robust Higher Order Potentials For Enforcing Label Consistency

Lecture 28: Bag-of-words models

LARGE-SCALE NONPARAMETRIC IMAGE PARSING Joseph Tighe and Svetlana Lazebnik University of North Carolina at Chapel Hill CVPR 2011Workshop on Large-Scale.

Learning Spatial Context: Using stuff to find things Geremy Heitz Daphne Koller Stanford University October 13, 2008 ECCV 2008.

Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Unsupervised discovery of visual object class hierarchies Josef Sivic (INRIA / ENS), Bryan Russell (MIT), Andrew Zisserman (Oxford), Alyosha Efros (CMU)

WORD-PREDICTION AS A TOOL TO EVALUATE LOW-LEVEL VISION PROCESSES Prasad Gabbur, Kobus Barnard University of Arizona.

Learning Spatial Context: Can stuff help us find things? Geremy Heitz Daphne Koller April 14, 2008 DAGS Stuff (n): Material defined by a homogeneous or.

Discriminative and generative methods for bags of features

Unsupervised Learning of Categories from Sets of Partially Matching Image Features Kristen Grauman and Trevor Darrel CVPR 2006 Presented By Sovan Biswas.

Step 3: Classification Learn a decision rule (classifier) assigning bag-of-features representations of images to different classes Decision boundary Zebra.

CSE 185 Introduction to Computer Vision Pattern Recognition.

Object Detection Sliding Window Based Approach Context Helps

Watch, Listen and Learn Sonal Gupta, Joohyun Kim, Kristen Grauman and Raymond Mooney -Pratiksha Shah.

Computer Vision CS 776 Spring 2014 Recognition Machine Learning Prof. Alex Berg.

Recognition using Regions (Demo) Sudheendra V. Outline Generating multiple segmentations –Normalized cuts [Ren & Malik (2003)] Uniform regions –Watershed.

Interactive Discovery and Semantic Labeling of Patterns in Spatial Data Thomas Funkhouser, Adam Finkelstein, David Blei, and Christiane Fellbaum Princeton.

Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,

Reading Between The Lines: Object Localization Using Implicit Cues from Image Tags Sung Ju Hwang and Kristen Grauman University of Texas at Austin Jingnan.

Efficient Region Search for Object Detection Sudheendra Vijayanarasimhan and Kristen Grauman Department of Computer Science, University of Texas at Austin.

80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.

Supervised Learning of Edges and Object Boundaries Piotr Dollár Zhuowen Tu Serge Belongie.

INTRODUCTION Heesoo Myeong and Kyoung Mu Lee Department of ECE, ASRI, Seoul National University, Seoul, Korea Tensor-based High-order.

Unsupervised Learning of Visual Sense Models for Polysemous Words Kate Saenko Trevor Darrell Deepak.

Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.

Grouplet: A Structured Image Representation for Recognizing Human and Object Interactions Bangpeng Yao and Li Fei-Fei Computer Science Department, Stanford.

Category Independent Region Proposals Ian Endres and Derek Hoiem University of Illinois at Urbana-Champaign.

Discovering Objects and their Location in Images Josef Sivic 1, Bryan C. Russell 2, Alexei A. Efros 3, Andrew Zisserman 1 and William T. Freeman 2 Goal:

Towards Total Scene Understanding: Classiﬁcation, Annotation and Segmentation in an Automatic Framework N 工科所錢雅馨 2011/01/16 Li-Jia Li, Richard.

Context Neelima Chavali ECE /21/2013. Roadmap Introduction Paper1 – Motivation – Problem statement – Approach – Experiments & Results Paper 2 Experiments.

Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.

Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:

Sung Ju Hwang and Kristen Grauman University of Texas at Austin.

6.S093 Visual Recognition through Machine Learning Competition Image by kirkh.deviantart.com Joseph Lim and Aditya Khosla Acknowledgment: Many slides from.

Example Apply hierarchical clustering with d min to below data where c=3. Nearest neighbor clustering d min d max will form elongated clusters!

Learning Hierarchical Features for Scene Labeling

CS 2750: Machine Learning Clustering Prof. Adriana Kovashka University of Pittsburgh January 25, 2016.

Finding Clusters within a Class to Improve Classification Accuracy Literature Survey Yong Jae Lee 3/6/08.

BMVC 2010 Sung Ju Hwang and Kristen Grauman University of Texas at Austin.

The topic discovery models

Data Driven Attributes for Action Detection

Krishna Kumar Singh, Yong Jae Lee University of California, Davis

Nonparametric Semantic Segmentation

Paper Presentation: Shape and Matching

The topic discovery models

Finding Clusters within a Class to Improve Classification Accuracy

Object-Graphs for Context-Aware Category Discovery

CS 2750: Machine Learning Clustering

CS 1674: Intro to Computer Vision Scene Recognition

Brief Review of Recognition + Context

The topic discovery models

Presentation transcript:

Object-Graphs for Context-Aware Category Discovery Yong Jae Lee and Kristen Grauman University of Texas at Austin 1

Motivation Unlabeled Image DataDiscovered categories 1) reveal structure in very large image collections 2) greatly reduce annotation time and effort 3) training data is not always available 2

Existing approaches Previous work treats unsupervised visual discovery as an appearance-grouping problem. - Topic models e.g., pLSA, LDA. [Fergus et al. 2005], [Sivic et al. 2005], [Quelhas et al. 2005], [Fei-Fei & Perona 2005], [Liu & Chen 2007], [Russell et al. 2006] - Partitioning of the image data. [Grauman & Darrell 2006], [Dueck & Frey 2007], [Kim et al. 2008], [Lee & Grauman 2008], [Lee & Grauman 2009] 3

Existing approaches Previous work treats unsupervised visual discovery as an appearance-grouping problem Can you identify the recurring pattern?

How can seeing previously learned objects in novel images help to discover new categories? Our idea 5 Can you identify the recurring pattern?

Discover visual categories within unlabeled images by modeling interactions between the unfamiliar regions and familiar objects. Our idea Can you identify the recurring pattern?

drive- way sky house ? grass Context-aware visual discovery grass sky truck house ? drive- way grass sky house drive- way fence ? ? ?? 7 Context in supervised recognition: [Torralba 2003], [Hoiem et al. 2006], [He et al. 2004], [Shotton et al. 2006], [Heitz & Koller 2008], [Rabinovich et al. 2007], [Galleguillos et al. 2008], [Tu 2008], [Parikh et al. 2008], [Gould et al. 2009], [Malisiewicz & Efros 2009], [Lazebnik 2009]

Key Ideas Context-aware category discovery treating previously learned categories as object-level context. Object-Graph descriptor to encode surrounding object- level context. *Note: Different from semi-supervised learning – unlabeled data do not necessarily belong to categories of the labeled data. 8

Approach Overview 9 Learn category models for some classes Detect unknowns in unlabeled images Describe object-level context via Object-Graph Group regions to discover new categories

Learn “Known” Categories Offline: Train region-based classifiers for N “known” categories using labeled training data. sky road building tree 10 Detect Unknowns Object-level Context Discovery Learn Models

Identifying Unknown Objects Input: unlabeled pool of novel images Compute multiple-segmentations for each unlabeled image 11 Detect Unknowns Object-level Context Discovery Learn Models e.g., [Hoiem et al. 2006], [Russell et al. 2006], [Rabinovich et al. 2007]

P(class | region) bldg tree sky road P(class | region) bldg tree sky road P(class | region) bldg tree sky road P(class | region) bldg tree sky road Prediction: known High entropy → Prediction: unknown For all segments, use classifiers to compute posteriors for the N “known” categories. Deem each segment as “known” or “unknown” based on resulting entropy. 12 Identifying Unknown Objects Detect Unknowns Object-level Context Discovery Learn Models

Model the topology of category predictions relative to the unknown (unfamiliar) region. Incorporate uncertainty from classifiers. An unknown region within an image 0 13 Object-Graphs Detect Unknowns Object-level Context Discovery Learn Models

An unknown region within an image 0 Closest nodes in its object-graph 2a 2b 1b 1a 3a 3b Consider spatially near regions above and below, record distributions for each known class. S b t s r 1a above 1b below H 1 (s) b t s r H 0 (s) 0 self g(s) = [,,, ] H R (s) b t s r Ra above Rb below 1 st nearest regionout to R th nearest b t s r 0 self Object-Graphs Detect Unknowns Object-level Context Discovery Learn Models 14

Object-Graphs Average across segmentations N posterior prob.’s per pixel b t s r N posterior prob.’s per superpixel b t s r Obtain per-pixel measures of class posteriors on larger spatial extents. 15 Detect Unknowns Object-level Context Discovery Learn Models

g(s 1 ) = [ :,, : ] b t g r abovebelow H R (s)H 1 (s) abovebelow b t g r g(s 2 ) = [ :,, : ] b t g r abovebelow H R (s)H 1 (s) abovebelow b t g r Object-graphs are very similar  produces a strong match Known classes b: building t: tree g: grass r: road 16 Object-Graph matching Detect Unknowns Object-level Context Discovery Learn Models building ? road building / road building / road tree / road building ? road building / road

grass ? g(s 1 ) = [ :,, : ] b t g r abovebelow H R (s)H 1 (s) abovebelow b t g r g(s 2 ) = [ :,, : ] b t g r abovebelow H R (s)H 1 (s) abovebelow b t g r Object-graphs are partially similar  produces a fair match Known classes b: building t: tree g: grass r: road 17 Object-Graph matching Detect Unknowns Object-level Context Discovery Learn Models building ? road building / road building / road building road

Unknown Regions Clusters from region-region affinities 18 Detect Unknowns Object-level Context Discovery Learn Models

Object Discovery Accuracy Four datasets Multiple splits for each dataset; varying categories and number of knowns/unknowns Train 40% (for known categories), Test 60% of data Textons, Color histograms, and pHOG Features MSRC-v2 PASCAL 2008 Corel MSRC-v0 19

20 MSRC-v2 PASCAL 2008 Corel MSRC-v0 Object Discovery Accuracy

Comparison with State-of-the-art Russell et al., 2006: Topic model (LDA) to discover categories among multiple segmentations using appearance only. Significant improvement over existing state-of-the-art. 21 MSRC-v2

Example Object-Graphs buildingsky roadunknown 22 Color in superpixel nodes indicate the predicted known category.

Examples of Discovered Categories 23

Collect-Cut (poster Thursday) 24 Best Bottom-up (with multi-segs) Collect-Cut (ours) Discovered Ensemble from Unlabeled Multi-Object Images Unlabeled Images Use discovered shared top-down cues to refine both the segments and discovered categories with an energy function that can be minimized with graph cuts. Unsupervised Segmentation Examples

Conclusions Discover new categories in the context of those that have already been directly taught. Substantial improvement over traditional unsupervised appearance-based methods. Future work: Continuously expand the object- level context for future discoveries. 25

Category Retrieval Results 26

27 Impact of Known/Unknown Decisions Red star denotes the cutoff (half of max possible entropy value). Regions considered for discovery are almost all true unknowns (and vice versa), at some expense of misclassification.

Impact of Object-Graph Descriptor How does the object-graph descriptor compare to a simpler alternative that directly encodes the surrounding appearance features? 28 Appearance-level context Object-level context

29 Perfect Known/Unknown Separation Performance attainable were we able to perfectly separate segments according to whether they are known or unknown.

Random Splits of Known/Unknown 30

31 Previous Work: [Scholkopf 2000], [Markou & Singh 2003], [Weinshall et al. 2008] ImageGT known/unknown Multiple-Segmentation Entropy Maps unknowns building tree knowns sky road Identifying Unknown Objects Detect Unknowns Object-level Context Discovery Learn Models