Attributes for Classifier Feedback Amar Parkash and Devi Parikh.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

Rich feature Hierarchies for Accurate object detection and semantic segmentation Ross Girshick, Jeff Donahue, Trevor Darrell, Jitandra Malik (UC Berkeley)
Thomas Berg and Peter Belhumeur
Learning Shared Body Plans Ian Endres University of Illinois work with Derek Hoiem, Vivek Srikumar and Ming-Wei Chang.
Learning Semantics with Less Supervision
Exploiting Big Data via Attributes (Offline Contd.)
Describing Images Using Attributes. Describing Images Farhadi et.al. CVPR 2009.
Adding Unlabeled Samples to Categories by Learned Attributes Jonghyun Choi Mohammad Rastegari Ali Farhadi Larry S. Davis PPT Modified By Elliot Crowley.
A Unified Framework for Context Assisted Face Clustering
Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.
EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.
Limin Wang, Yu Qiao, and Xiaoou Tang
A generic model to compose vision modules for holistic scene understanding Adarsh Kowdle *, Congcong Li *, Ashutosh Saxena, and Tsuhan Chen Cornell University,
SPONSORED BY SA2014.SIGGRAPH.ORG Annotating RGBD Images of Indoor Scenes Yu-Shiang Wong and Hung-Kuo Chu National Tsing Hua University CGV LAB.
Machine learning continued Image source:
C ONSTRAINED S EMI -S UPERVISED L EARNING USING A TTRIBUTES AND C OMPARATIVE A TTRIBUTES Abhinav Shrivastava, Saurabh Singh, Abhinav Gupta The Robotics.
Capturing Human Insight for Visual Learning Kristen Grauman Department of Computer Science University of Texas at Austin Work with Sudheendra Vijayanarasimhan,
Beyond Mindless Labeling: Really Leveraging Humans to Build Intelligent Machines Devi Parikh Virginia Tech.
Object-centric spatial pooling for image classification Olga Russakovsky, Yuanqing Lin, Kai Yu, Li Fei-Fei ECCV 2012.
Discriminative Segment Annotation in Weakly Labeled Video Kevin Tang, Rahul Sukthankar Appeared in CVPR 2013 (Oral)
EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.
Analyzing Semantic Segmentation Using Hybrid Human-Machine CRFs Roozbeh Mottaghi 1, Sanja Fidler 2, Jian Yao 2, Raquel Urtasun 2, Devi Parikh 3 1 UCLA.
Enhancing Exemplar SVMs using Part Level Transfer Regularization 1.
Relative Attributes Presenter: Shuai Zheng (Kyle) Supervised by Philip H.S. Torr Author: Devi Parikh (TTI-Chicago) and Kristen Grauman (UT-Austin)
MSRC Summer School - 30/06/2009 Cambridge – UK Hybrids of generative and discriminative methods for machine learning.
Generic object detection with deformable part-based models
Describing People: A Poselet-Based Approach to Attribute Classification Lubomir Bourdev 1,2 Subhransu Maji 1 Jitendra Malik 1 1 EECS U.C. Berkeley 2 Adobe.
Unsupervised Learning of Categories from Sets of Partially Matching Image Features Kristen Grauman and Trevor Darrel CVPR 2006 Presented By Sovan Biswas.
Watch, Listen and Learn Sonal Gupta, Joohyun Kim, Kristen Grauman and Raymond Mooney -Pratiksha Shah.
Semisupervised Learning A brief introduction. Semisupervised Learning Introduction Types of semisupervised learning Paper for review References.
“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)
Svetlana Lazebnik, Cordelia Schmid, Jean Ponce
Learning Collections of Parts for Object Recognition and Transfer Learning University of Illinois at Urbana- Champaign.
Enhancing Human-Machine Communication via Visual Attributes Devi Parikh Virginia Tech.
Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.
Sharing Features Between Objects and Their Attributes Sung Ju Hwang 1, Fei Sha 2 and Kristen Grauman 1 1 University of Texas at Austin, 2 University of.
C. Lawrence Zitnick Microsoft Research, Redmond Devi Parikh Virginia Tech Bringing Semantics Into Focus Using Visual.
Methods for classification and image representation
HAITHAM BOU AMMAR MAASTRICHT UNIVERSITY Transfer for Supervised Learning Tasks.
WhittleSearch: Image Search with Relative Attribute Feedback CVPR 2012 Adriana Kovashka Devi Parikh Kristen Grauman University of Texas at Austin Toyota.
Recognition Using Visual Phrases
Iterative similarity based adaptation technique for Cross Domain text classification Under: Prof. Amitabha Mukherjee By: Narendra Roy Roll no: Group:
Context Neelima Chavali ECE /21/2013. Roadmap Introduction Paper1 – Motivation – Problem statement – Approach – Experiments & Results Paper 2 Experiments.
Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.
Describing People: A Poselet-Based Approach to Attribute Classification.
Convolutional Restricted Boltzmann Machines for Feature Learning Mohammad Norouzi Advisor: Dr. Greg Mori Simon Fraser University 27 Nov
Interactively Discovery of Attributes Vocabulary Devi Parikh and Kristen Grauman.
Richer Human-Machine Communication in Attributes-based Visual Recognition Devi Parikh TTIC.
Fine-grained Fine-grained Recognition( 细粒度分类 ) 沈志强.
C ONSTRAINED S EMI -S UPERVISED L EARNING USING A TTRIBUTES AND C OMPARATIVE A TTRIBUTES Presenter : Ankit Laddha Most of the slides are borrowed from.
PANDA: Pose Aligned Networks for Deep Attribute Modeling Ning Zhang 1,2 Manohar Paluri 1 Marć Aurelio Ranzato 1 Trevor Darrell 2 Lumbomir Boudev 1 1 Facebook.
NEIL: Extracting Visual Knowledge from Web Data Xinlei Chen, Abhinav Shrivastava, Abhinav Gupta Carnegie Mellon University CS381V Visual Recognition -
Parsing Natural Scenes and Natural Language with Recursive Neural Networks INTERNATIONAL CONFERENCE ON MACHINE LEARNING (ICML 2011) RICHARD SOCHER CLIFF.
Zhuode Liu 2016/2/13 University of Texas at Austin CS 381V: Visual Recognition Discovering the Spatial Extent of Relative Attributes Xiao and Lee, ICCV.
Compact Bilinear Pooling
Object detection with deformable part-based models
Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek
Data Driven Attributes for Action Detection
Krishna Kumar Singh, Yong Jae Lee University of California, Davis
Transfer Learning in Astronomy: A New Machine Learning Paradigm
Mingxia Liu Relative Attributes Mingxia Liu
Part-Based Room Categorization for Household Service Robots
Introductory Seminar on Research: Fall 2017
Thesis Advisor : Prof C.V. Jawahar
Attributes and Simile Classifiers for Face Verification
CS 1674: Intro to Computer Vision Scene Recognition
Outline Background Motivation Proposed Model Experimental Results
Adarsh Kowdle*, Congcong Li*, Ashutosh Saxena, and Tsuhan Chen
Zeroshot Learning Mun Jonghwan.
Towards an Unequivocal Representation of Actions
Presentation transcript:

Attributes for Classifier Feedback Amar Parkash and Devi Parikh

You can teach a child by examples...

And on, and on, and on…

Is this a giraffe?No. Is this a giraffe?Yes.Is this a giraffe?No.

And on, and on, and on…

Proposed Active Learning Scenario

I think this is a giraffe. What do you think? No, its neck is too short for it to be a giraffe. Ah! These must not be giraffes either then. [Animals with even shorter necks] …… Current belief Focused feedback Knowledge of the world Feedback on one, transferred to many Learner learns better from its mistakes Accelerated discriminative learning with few examples Learner learns better from its mistakes Accelerated discriminative learning with few examples

Communication Need a language that is Machine understandable Human understandable Attributes! Mid-level shareable Visual Semantic

Proposed Active Learning Concepts to teach: C1C1 C2C2 CKCK … Unlabeled pool of images Classifiers: h1h1 h2h2 hKhK … [Label-feedback] [Attributes-based feedback] Attribute predictors: a1a1 a2a2 aMaM … [Predicted Label] Any feature space Any discriminative learning algorithm

Relative Attributes [Parikh and Grauman, ICCV 2011] Openness Unlabeled pool of images Attribute predictors: a1a1 a2a2 aMaM … Image features Parameters

Attributes-based Feedback Unlabeled pool of images No, It is too open to be a forest Attribute predictors: a1a1 a2a2 aMaM … Forest Openness Not Forest

Attributes-based Feedback Unlabeled pool of images No, It is too open to be a forest Attribute predictors: a1a1 a2a2 aMaM … Forest Classifiers: h1h1 h2h2 hKhK … Not Forest

Proposed Active Learning Concepts to teach: C1C1 C2C2 CKCK … Unlabeled pool of images Classifiers: h1h1 h2h2 hKhK … [Label-feedback] [Attributes-based feedback] Attribute predictors: a1a1 a2a2 aMaM … [Predicted Label]

Label-based Feedback Not our contribution Experiment with different scenarios Benefits of attributes-based feedback – Small when label-based is very informative – Large when label-based feedback is weak

Unlabeled pool of images Forest Label-based Feedback Accept: Yes, this is a forest. – Strong: It is not anything else Example: Classification – Weak: It can be other things Example: Annotation

Label-based Feedback Reject: – Strong: No, it is a coast. Example: Classification with few classes – Weak: No, this is not a forest. Example: Large-scale classification Example: Biased binary classification 4 different scenarios in experiments Unlabeled pool of images Forest

Datasets Datasets and relative attribute predictors from [Parikh and Grauman, ICCV 2011]

Datasets Faces: – 8 celebrity categories – 11 attributes (chubby, white, etc.) [Kumar et al., ICCV 2009]

Datasets Scenes: – 8 categories – 6 attributes (open, natural, etc.) [Oliva and Torralba, IJCV 2001]

Settings Feedback from MTurk Features: – Raw image features (gist, color) – Attribute scores Category classifiers: SVM with RBF kernel Results on – 2 datasets x 2 features x 4 label-feedback scenarios – Show 2 here, rest in paper. This image doesnt have enough perspective to be a street scene.

Results Faces, Attribute features, Strong label-feedback # iterations Accuracy

Results Scenes, Image features, Weak label-feedback # iterations Accuracy 1/4 th ! More results in the paper.

Conclusion Attributes for providing classifier feedback Novel learning paradigm with enhanced human-machine communication Discriminative learning + domain knowledge Learning with few examples Connections to semi-supervised learning – Shrivastava, Singh and Gupta: Up Next!

Thank you!

Backup Slides

Negative Feedback Many reasons come together to make a concept – Hard to describe why an image is a concept One reason can break a concept – Easier to describe why an image is not a concept (Arguably) waste to give feedback when right

Discriminative – No domain knowledge is conveyed – Many training images – Discriminative model – Classification in any feature space – State-of-art performance Related Work Zero-shot learning – Convey domain knowledge – Zero training images – Generative model – Classification in attribute space – Performance compromised Proposed paradigm – Convey domain knowledge to transfer – Few training images – Discriminative model – Classification in any feature space – Performance maintained bea r turtlerabbit furry big [Lampert et al., CVPR 2009] C Smiling Age S J H C is younger than H C smiles more than H M M is younger than J M smiles more than J [Parikh and Grauman, ICCV 2011] – – –

Related Work Focused discrimination – Mining of hard negatives [Felzenszwalb 2010] – To understand classifier [Golland 2001] – Here, supervisor provides the discriminative direction by verbalizing semantic knowledge [Golland, NIPS 2001][Felzenszwalb et al., CVPR 2008]

Related Work What we are not doing: Collecting deeper annotations of images Segmentation masks [Russell 2008], Parts [Farhadi 2010], Pose [Bourdev 2009], Attributes [Kumar 2009] We use attributes for broad propagation of category labels to unlabeled images

Related Work What we are not doing: Actively interleaving attribute annotations Object & attributes [Kovashka 2011], Image, boxes & segments [Vijaynarsimhan 2008], Parts & attributes [Wah 2011], etc. Human-in-the-loop at test time [Branson 2010] Our supervisor provides additional information at training time which is leveraged for better category models

Related Work Rationales – Human feature selection in NLP [Raghavan 2005] – Spatial and attribute rationales [Donahue 2011] – Restricted to classification in attribute space – We can operate in any feature space [Donahue and Grauman, ICCV 2011]

Imperfect Attribute Predictors In the end, all images labeled with ground truth Discriminative training can deal with outliers Attributes are pre-trained, and so unlikely to be severely flawed Experiments: used predictors directly from Parikh and Grauman, 2011

Large-scale Classification Categories may require expert knowledge, attributes need not Can show a few exemplars to verify category

Results Faces, Classification, Attribute features (overestimate) # iterations Accuracy

Label-based Feedback ClassificationLarge-scale Classification Annotation Street Outdoor City Grayscale …. Biased Binary Classification Simulate the different scenarios in responses Benefits of attributes-based feedback Small when label-based is very informative Large when label-based feedback is weak Simulate the different scenarios in responses Benefits of attributes-based feedback Small when label-based is very informative Large when label-based feedback is weak

Results Scenes, Annotation, Image features # iterations Accuracy

Results Faces, Biased binary classification, Attribute features # iterations Accuracy More results in the paper.

Traditional Active Learning Concepts to teach: C1C1 C2C2 CKCK … Unlabeled pool of images Classifiers: h1h1 h2h2 hKhK … [Label] Highest Entropy