Large-Scale Object Recognition using Label Relation Graphs Jia Deng 1,2, Nan Ding 2, Yangqing Jia 2, Andrea Frome 2, Kevin Murphy 2, Samy Bengio 2, Yuan.

Slides:



Advertisements
Similar presentations
Mean-Field Theory and Its Applications In Computer Vision1 1.
Advertisements

Semantic Contours from Inverse Detectors Bharath Hariharan et.al. (ICCV-11)
Attributes for Classifier Feedback Amar Parkash and Devi Parikh.
Weakly supervised learning of MRF models for image region labeling Jakob Verbeek LEAR team, INRIA Rhône-Alpes.
Adding Unlabeled Samples to Categories by Learned Attributes Jonghyun Choi Mohammad Rastegari Ali Farhadi Larry S. Davis PPT Modified By Elliot Crowley.
Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.
Constrained Approximate Maximum Entropy Learning (CAMEL) Varun Ganapathi, David Vickrey, John Duchi, Daphne Koller Stanford University TexPoint fonts used.
EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.
A generic model to compose vision modules for holistic scene understanding Adarsh Kowdle *, Congcong Li *, Ashutosh Saxena, and Tsuhan Chen Cornell University,
Learning to Combine Bottom-Up and Top-Down Segmentation Anat Levin and Yair Weiss School of CS&Eng, The Hebrew University of Jerusalem, Israel.
Scene Labeling Using Beam Search Under Mutex Constraints ID: O-2B-6 Anirban Roy and Sinisa Todorovic Oregon State University 1.
Maximum Margin Markov Network Ben Taskar, Carlos Guestrin Daphne Koller 2004.
C ONSTRAINED S EMI -S UPERVISED L EARNING USING A TTRIBUTES AND C OMPARATIVE A TTRIBUTES Abhinav Shrivastava, Saurabh Singh, Abhinav Gupta The Robotics.
Beyond Mindless Labeling: Really Leveraging Humans to Build Intelligent Machines Devi Parikh Virginia Tech.
INTRODUCTION Heesoo Myeong, Ju Yong Chang, and Kyoung Mu Lee Department of EECS, ASRI, Seoul National University, Seoul, Korea Learning.
Mixture of trees model: Face Detection, Pose Estimation and Landmark Localization Presenter: Zhang Li.
Face Detection, Pose Estimation, and Landmark Localization in the Wild
Learning Scalable Discriminative Dictionaries with Sample Relatedness a.k.a. “Infinite Attributes” Jiashi Feng, Stefanie Jegelka, Shuicheng Yan, Trevor.
Object-centric spatial pooling for image classification Olga Russakovsky, Yuanqing Lin, Kai Yu, Li Fei-Fei ECCV 2012.
Unsupervised Transfer Classification Application to Text Categorization Tianbao Yang, Rong Jin, Anil Jain, Yang Zhou, Wei Tong Michigan State University.
IJCAI Wei Zhang, 1 Xiangyang Xue, 2 Jianping Fan, 1 Xiaojing Huang, 1 Bin Wu, 1 Mingjie Liu 1 Fudan University, China; 2 UNCC, USA {weizh,
Discriminative and generative methods for bags of features
A Graphical Model For Simultaneous Partitioning And Labeling Philip Cowans & Martin Szummer AISTATS, Jan 2005 Cambridge.
Relative Attributes Presenter: Shuai Zheng (Kyle) Supervised by Philip H.S. Torr Author: Devi Parikh (TTI-Chicago) and Kristen Grauman (UT-Austin)
Context-Aware Query Classification Huanhuan Cao 1, Derek Hao Hu 2, Dou Shen 3, Daxin Jiang 4, Jian-Tao Sun 4, Enhong Chen 1 and Qiang Yang 2 1 University.
Predictive Automatic Relevance Determination by Expectation Propagation Yuan (Alan) Qi Thomas P. Minka Rosalind W. Picard Zoubin Ghahramani.
Supervised Distance Metric Learning Presented at CMU’s Computer Vision Misc-Read Reading Group May 9, 2007 by Tomasz Malisiewicz.
Semi-Supervised Clustering Jieping Ye Department of Computer Science and Engineering Arizona State University
Distributed Representations of Sentences and Documents
Review Rong Jin. Comparison of Different Classification Models  The goal of all classifiers Predicating class label y for an input x Estimate p(y|x)
Semantic Kernel Forests from Multiple Taxonomies Sung Ju Hwang (University of Texas at Austin), Fei Sha (University of Southern California), and Kristen.
Label Tree in Large-Scale Mun Jonghwan. From small to large scale 2.
Describing People: A Poselet-Based Approach to Attribute Classification Lubomir Bourdev 1,2 Subhransu Maji 1 Jitendra Malik 1 1 EECS U.C. Berkeley 2 Adobe.
Computer vision: models, learning and inference
Kuan-Chuan Peng Tsuhan Chen
Why Categorize in Computer Vision ?. Why Use Categories? People love categories!
Enhancing Human-Machine Communication via Visual Attributes Devi Parikh Virginia Tech.
Semantic Embedding Space for Zero ­ Shot Action Recognition Xun XuTimothy HospedalesShaogang GongAuthors: Computer Vision Group Queen Mary University of.
Xiangnan Kong,Philip S. Yu Multi-Label Feature Selection for Graph Classification Department of Computer Science University of Illinois at Chicago.
Associative Hierarchical CRFs for Object Class Image Segmentation Ľubor Ladický 1 1 Oxford Brookes University 2 Microsoft Research Cambridge Based on the.
Tell Me What You See and I will Show You Where It Is Jia Xu 1 Alexander G. Schwing 2 Raquel Urtasun 2,3 1 University of Wisconsin-Madison 2 University.
A New Method for Automatic Clothing Tagging Utilizing Image-Click-Ads Introduction Conclusion Can We Do Better to Reduce Workload?
Fully Convolutional Networks for Semantic Segmentation
Discussion of Pictorial Structures Pedro Felzenszwalb Daniel Huttenlocher Sicily Workshop September, 2006.
Context-Aware Query Classification Huanhuan Cao, Derek Hao Hu, Dou Shen, Daxin Jiang, Jian-Tao Sun, Enhong Chen, Qiang Yang Microsoft Research Asia SIGIR.
Towards Total Scene Understanding: Classification, Annotation and Segmentation in an Automatic Framework N 工科所 錢雅馨 2011/01/16 Li-Jia Li, Richard.
Augmenting Probabilistic Graphical Models with Ontology Information: Object Classification R. Möller Institute of Information Systems University of Luebeck.
Joint Object-Material Category Segmentation from Audio-Visual Cues
C ONSTRAINED S EMI -S UPERVISED L EARNING USING A TTRIBUTES AND C OMPARATIVE A TTRIBUTES Presenter : Ankit Laddha Most of the slides are borrowed from.
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
Conditional Random Fields & Table Extraction Dongfang Xu School of Information.
Multi-label Classification Yusuke Miyao. N. Ghamrawi, A. McCallum. Collective multi-label classification. CIKM S. Godbole, S. Sarawagi. Discriminative.
Gaussian Conditional Random Field Network for Semantic Segmentation
Parsing Natural Scenes and Natural Language with Recursive Neural Networks INTERNATIONAL CONFERENCE ON MACHINE LEARNING (ICML 2011) RICHARD SOCHER CLIFF.
Recent developments in object detection
Semi-Supervised Clustering
Object Detection based on Segment Masks
Object detection with deformable part-based models
Web-Mining Agents Augmenting Probabilistic Graphical Models with Ontology Information: Object Classification Prof. Dr. Ralf Möller Dr. Özgür L. Özçep Universität.
Announcements Project proposal due tomorrow
Nonparametric Semantic Segmentation
Learning to Combine Bottom-Up and Top-Down Segmentation
[Figure taken from googleblog
RCNN, Fast-RCNN, Faster-RCNN
GANG: Detecting Fraudulent Users in OSNs
Learning Object Context for Dense Captioning
Zeroshot Learning Mun Jonghwan.
Ping LUO*, Fen LIN^, Yuhong XIONG*, Yong ZHAO*, Zhongzhi SHI^
Human-object interaction
ICCV 2019.
Presentation transcript:

Large-Scale Object Recognition using Label Relation Graphs Jia Deng 1,2, Nan Ding 2, Yangqing Jia 2, Andrea Frome 2, Kevin Murphy 2, Samy Bengio 2, Yuan Li 2, Hartmut Neven 2, Hartwig Adam 2 University of Michigan 1, Google 2

Object Classification Assign semantic labels to objects Corgi Puppy Dog Cat ✔ ✔ ✖ ✔

Object Classification Assign semantic labels to objects Probabilities Corgi Puppy Dog Cat

Object Classification Assign semantic labels to objects Feature Extractor FeaturesClassifier Probabilities Corgi Puppy Dog Cat

Object Classification Multiclass classifier: Softmax Corgi Puppy Dog Cat / / / / + Assumes mutual exclusive labels Independent binary classifiers: Logistic Regression Corgi Puppy Dog Cat No assumptions about relations.

Object labels have rich relations Corgi Puppy Dog Cat Exclusion Hierarchical Dog Cat Corg i Puppy Overlap Softmax: all labels are mutually exclusive  Logistic Regression: all labels overlap 

Goal: A new classification model Respects real world label relations Corgi Puppy Dog Cat Corgi Puppy Dog Cat

Visual Model + Knowledge Graph Corgi Puppy Dog Cat Visual Model Knowledge Graph Joint Inference Assumption in this work: Knowledge graph is given and fixed.

Agenda Encoding prior knowledge (HEX graph) Classification model Efficient Exact Inference Experiments Conclusion and Future Work

Agenda Encoding prior knowledge (HEX graph) Classification model Efficient Exact Inference Experiments Conclusion and Future Work

Hierarchy and Exclusion (HEX) Graph Corgi Puppy Dog Cat Exclusion Hierarchical Hierarchical edges (directed) Exclusion edges (undirected)

Examples of HEX graphs Car Bird Dog Cat Male Female Person Child Boy Round Red Shiny Thick Mutually exclusiveAll overlapping Combination Girl

State Space: Legal label configurations DogCatCorgiPuppy … … Corgi Puppy Dog Cat Each edge defines a constraint.

State Space: Legal label configurations Corgi Puppy Dog Cat DogCatCorgiPuppy … … Hierarchy: (dog, corgi) can’t be (0,1) Each edge defines a constraint.

State Space: Legal label configurations Corgi Puppy Dog Cat DogCatCorgiPuppy … … Exclusion: (dog, cat) can’t be (1,1) Hierarchy: (dog, corgi) can’t be (0,1) Each edge defines a constraint.

Agenda Encoding prior knowledge (HEX graph) Classification model Efficient Exact Inference Experiments Conclusion and Future Work

HEX Classification Model Pairwise Conditional Random Field (CRF) Input scores Binary Label vector

HEX Classification Model Pairwise Conditional Random Field (CRF) Binary Label vector Unary: same as logistic regression Input scores

HEX Classification Model Pairwise Conditional Random Field (CRF) Binary Label vector  All illegal configurations have probability zero. Unary: same as logistic regression If violates constraints Otherwise 0 Pairwise: set illegal configuration to zero Input scores

HEX Classification Model Pairwise Conditional Random Field (CRF) Binary Label vector Partition function: Sum over all (legal) configurations Input scores

HEX Classification Model Pairwise Conditional Random Field (CRF) Binary Label vector Probability of a single label: marginalize all other labels. Input scores

Special Case of HEX Model Softmax Car Bird Dog Cat Round Red Shiny Mutually exclusive All overlapping Thick Logistic Regressions

Learning Corgi Puppy Dog Cat DNN Label: Dog Maximize marginal probability of observed labels Back Propagation Dog Corgi Puppy Cat 1 ? ? ?

Agenda Encoding prior knowledge (HEX graph) Classification model Efficient Exact Inference Experiments Conclusion

Naïve Exact Inference is Intractable Inference: – Computing partition function – Perform marginalization HEX-CRF can be densely connected (large treewidth)

Observation 1: Exclusions are good Car Bird Dog Cat Lots of exclusions  Small state space  Efficient inference Realistic graphs have lots of exclusions. Rigorous analysis in paper. Number of legal states is O(n), not O(2 n ).

Observation 2: Equivalent graphs Dog Cat Corgi Puppy Pembroke Welsh Corgi Cardigan Welsh Corgi Dog Cat Corgi Puppy Pembroke Welsh Corgi Cardigan Welsh Corgi

Observation 2: Equivalent graphs Sparse equivalent Small Treewidth Dynamic programming Dog Cat Corgi Puppy Pembroke Welsh Corgi Cardigan Welsh Corgi Dog Cat Corgi Puppy Pembroke Welsh Corgi Cardigan Welsh Corgi Dog Cat Corgi Puppy Pembroke Welsh Corgi Cardigan Welsh Corgi Dense equivalent Prune states Can brute force

A B F B E D C G F B C F HEX Graph Inference A B E D C G F A B E D C G F A B E D C G F A B F B E D C G F B C F 1. Sparsify (offline) 3.Densify (offline) 2.Build Junction Tree (offline) 4.Prune Clique States (offline) 5. Message Passing on legal states (online)

Agenda Encoding prior knowledge (HEX graph) Classification model Efficient Exact Inference Experiments Conclusion and Future Work

Exp 1: Learning with weak labels Many basic category labels Few fine-grained labels Dog Corgi Animal … … Weak labels: No information on subcategories.

Corgi Puppy Dog Cat DNN Label: Dog Dog Corgi Puppy Cat 1 ? ? ? Hypothesis: HEX models can improve fine-grained recognition using basic level labels. Exp 1: Learning with weak labels

ILSVRC 2012: “relabel” or “weaken” a portion of fine-grained leaf labels to basic level labels. Evaluate on fine-grained recognition Exp 1: Learning with weak labels Dog Corgi Animal … Husky Dog Corgi Animal … Husky Relabel Training (“weakened” labels) Test Dog Corgi Animal … Husky Original ILSVRC 2012 (leaf labels)

ILSVRC 2012: “relabel” or “weaken” a portion of fine-grained leaf labels to basic level labels. Evaluate on fine-grained recognition. Consistently outperforms baselines. Exp 1: Learning with weak labels Top 1 accuracy (top 5 accuracy)

Exp 2: Zero-Shot Recognition using Object-Attribute Knowledge Animals with Attribute (AwA) dataset (Lampert et al. 2009) Training: Observe only a subset of animal labels. Given all animal-attribute relations Indirectly learns attributes. Test: predict new classes with no images in training. DAP (Lampert et al.)IAP (Lampert et al.)Ours 40.5%27.8%38.5% polar bear black: no white: yes brown: no stripes: no polar bear black white zebra brown stripes zebra black: yes white: yes brown: no stripes: yes … …

Related Work Multilabel Annotation & Hierarchy [Lampert et al. NIPS’11] [Chen et al. ICCV’11] [Bi & Kwok, NIPS’12] [Bucak et al. CVPR’11] Ours: Unifies hierarchy and exclusion. Transfer learning & Attributes [Rohrbach et al. CVPR’10] [Lampert et al. CVPR’09] [Kuettel et al. ECCV’12] [Akata et al. CVPR’13] Ours: A classification model that allows transferring. Extracting Common Sense Knowledge [Chen et al. ICCV’13] [Zitnick & Parikh CVPR’13] Ours: Assumes knowledge is given. [Hwang et al. CVPR’11] [Kang et al. CVPR’06] [Marszalek & Schmid CVPR’07] [Zweig & Weinshall CVPR’07] [Farhadi et al. CVPR’10] [Lim et al. NIPS’11] [Yu et al. CVPR’13] [Fergus et al. ECCV’10] [Zhu et al. ECCV’14] [Fouhey & Zitnick CVPR’14] Visual Model External Knowledge

Conclusions A unified framework for single object classification – Generalizes standard classification models – Leverages a knowledge graph – Efficient exact inference Future work – Non-absolute relations – Spatial relations between object instances