Towards Open World Recognition Abhijit Bendale, Terrance Boult University of Colorado of Colorado Springs Poster no 85.

Slides:



Advertisements
Similar presentations
Automatic Image Annotation Using Group Sparsity
Advertisements

Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.
Classification using intersection kernel SVMs is efficient Joint work with Subhransu Maji and Alex Berg Jitendra Malik UC Berkeley.
Machine learning continued Image source:
Clustering with k-means and mixture of Gaussian densities Jakob Verbeek December 3, 2010 Course website:
Multi-layer Orthogonal Codebook for Image Classification Presented by Xia Li.
Learning Globally-Consistent Local Distance Functions for Shape-Based Image Retrieval and Classification Computer Vision, ICCV IEEE 11th International.
Gustavo Carneiro The Automatic Design of Feature Spaces for Local Image Descriptors using an Ensemble of Non-linear Feature Extractors.
Lecture 31: Modern object recognition
Mixture of trees model: Face Detection, Pose Estimation and Landmark Localization Presenter: Zhang Li.
Low Complexity Keypoint Recognition and Pose Estimation Vincent Lepetit.
Steerable Part Models Hamed Pirsiavash and Deva Ramanan
Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.
Object-centric spatial pooling for image classification Olga Russakovsky, Yuanqing Lin, Kai Yu, Li Fei-Fei ECCV 2012.
Large-Scale Object Recognition with Weak Supervision
Ghunhui Gu, Joseph J. Lim, Pablo Arbeláez, Jitendra Malik University of California at Berkeley Berkeley, CA
Contour Based Approaches for Visual Object Recognition Jamie Shotton University of Cambridge Joint work with Roberto Cipolla, Andrew Blake.
Discriminative and generative methods for bags of features
Landmark Classification in Large- scale Image Collections Yunpeng Li David J. Crandall Daniel P. Huttenlocher ICCV 2009.
Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Fuzzy Support Vector Machines (FSVMs) Weijia Wang, Huanren Zhang, Vijendra Purohit, Aditi Gupta.
Graz University of Technology, AUSTRIA Institute for Computer Graphics and Vision Fast Visual Object Identification and Categorization Michael Grabner,
Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson
5/30/2006EE 148, Spring Visual Categorization with Bags of Keypoints Gabriella Csurka Christopher R. Dance Lixin Fan Jutta Willamowski Cedric Bray.
Discriminative and generative methods for bags of features
Machine learning & category recognition Cordelia Schmid Jakob Verbeek.
Exercise Session 10 – Image Categorization
Jinhui Tang †, Shuicheng Yan †, Richang Hong †, Guo-Jun Qi ‡, Tat-Seng Chua † † National University of Singapore ‡ University of Illinois at Urbana-Champaign.
Step 3: Classification Learn a decision rule (classifier) assigning bag-of-features representations of images to different classes Decision boundary Zebra.
The EM algorithm, and Fisher vector image representation
Classification 2: discriminative models
General Tensor Discriminant Analysis and Gabor Features for Gait Recognition by D. Tao, X. Li, and J. Maybank, TPAMI 2007 Presented by Iulian Pruteanu.
Marcin Marszałek, Ivan Laptev, Cordelia Schmid Computer Vision and Pattern Recognition, CVPR Actions in Context.
CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 24 – Classifiers 1.
Video Tracking Using Learned Hierarchical Features
Svetlana Lazebnik, Cordelia Schmid, Jean Ponce
Yao, B., and Fei-fei, L. IEEE Transactions on PAMI(2012)
Kernel Methods A B M Shawkat Ali 1 2 Data Mining ¤ DM or KDD (Knowledge Discovery in Databases) Extracting previously unknown, valid, and actionable.
Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.
BING: Binarized Normed Gradients for Objectness Estimation at 300fps
Classifiers Given a feature representation for images, how do we learn a model for distinguishing features from different classes? Zebra Non-zebra Decision.
Semantic Embedding Space for Zero ­ Shot Action Recognition Xun XuTimothy HospedalesShaogang GongAuthors: Computer Vision Group Queen Mary University of.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Deformable Part Models (DPM) Felzenswalb, Girshick, McAllester & Ramanan (2010) Slides drawn from a tutorial By R. Girshick AP 12% 27% 36% 45% 49% 2005.
Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.
Kylie Gorman WEEK 1-2 REVIEW. CONVERTING AN IMAGE FROM RGB TO HSV AND DISPLAY CHANNELS.
Optimal Dimensionality of Metric Space for kNN Classification Wei Zhang, Xiangyang Xue, Zichen Sun Yuefei Guo, and Hong Lu Dept. of Computer Science &
Image Classification over Visual Tree Jianping Fan Dept of Computer Science UNC-Charlotte, NC
Recognition Using Visual Phrases
Classifying Covert Photographs CVPR 2012 POSTER. Outline  Introduction  Combine Image Features and Attributes  Experiment  Conclusion.
Carl Vondrick, Aditya Khosla, Tomasz Malisiewicz, Antonio Torralba Massachusetts Institute of Technology
Combining multiple learners Usman Roshan. Decision tree From Alpaydin, 2010.
1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.
Recent developments in object detection
MIRA, SVM, k-NN Lirong Xia. MIRA, SVM, k-NN Lirong Xia.
The Problem: Classification
Data Driven Attributes for Action Detection
Learning Mid-Level Features For Recognition
Trees, bagging, boosting, and stacking
Correlative Multi-Label Multi-Instance Image Annotation
Nonparametric Semantic Segmentation
Object Localization Goal: detect the location of an object within an image Fully supervised: Training data labeled with object category and ground truth.
Training Techniques for Deep Neural Networks
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
RGB-D Image for Scene Recognition by Jiaqi Guo
Open-Category Classification by Adversarial Sample Generation
Outline Background Motivation Proposed Model Experimental Results
MIRA, SVM, k-NN Lirong Xia. MIRA, SVM, k-NN Lirong Xia.
Presentation transcript:

Towards Open World Recognition Abhijit Bendale, Terrance Boult University of Colorado of Colorado Springs Poster no 85

Multi-Class Classification System 2 W Scheirer, A Rocha, A Sapkota, T Boult “Towards Open Set Recognition” IEEE TPAMI 2013

Out in the Real-World Detect New Category 3 Pool Table Bowling Pin Boxing glove Calculator Chess board ? ? ? ?

Open World Recognition 4 World with Knowns (K) & Unknowns Unknowns (UU) Detect as Unknown NU: Novel Unknowns Label Data LU: Labeled Unknowns Incremental Class Learning K: Known Scale

Open Set Learning Incremental Learning Scalable Learning Ristin+ (CVPR’14) Yeh+ (CVPR’08) Li+ CVPR’07) Mensink+ (PAMI’13) Related Work Jain+ (ECCV’14) Scheirer+ PAMI’13) Scheirer+(PAMI’1 4) Deng+ (NIPS’11) Marszalek+ (ECCV’08) Liu+ (CVPR’13) 5

Open Space in Classification ? ? ? ? ? ? ? ?? ?? ?? Open Set Recognition Empirical risk function Regularization constant Open space risk W Scheirer, A Rocha, A Sapkota, T Boult “Towards Open Set Recognition” IEEE TPAMI Closed Space Open Space

NCM – Metric Learning 7 NCM Classifier with Metric Learning T Mensink, J Verbeek, F Perronin, G Csurka “Distance based Image Classification: Generalizing to New Classes at Near Zero Cost” IEEE TPAMI 2013 M Ristin, M Guillaumin, J Gall, L Van Gool “Incremental Learning of NCM Forests for Large-Scale Image Classification” CVPR 2014

Compact Abating Probability (CAP) Models W Scheirer, L Jain, T Boult “Probability Models for Open Set Recognition” IEEE TPAMI Class Mean

Theorem 1: Open Space Risk for Model Combination 9

Theorem 2: Open Space Risk for Transformed Spaces 10

Opening an Existing Algorithm: Nearest Non-Outlier (NNO) Algorithm 11 W = Linear Transformation (weight matrix from metric learning) Standard gamma function In volume of m-D ball Class mean for class i τ is threshold for open world Probability

Open World Evaluation Training phase Testing phase Parameter Learning Phase Incremental Learning Phase Closed Set Testing Unknown Categories Known Categories 12 Open Set Testing ? ?

Training for Open World 13 NCM - ML NNO Parameter Learning with initial set of categories Estimation of τ for open set learning to balance open space risk Optimize for Known vs Unknown Errors Incrementally add new categories

Experiments 14 Datasets ILSVRC’10: 1.2M training images, 1000 classes ILSVRC’12: 1.2M training images, 1000 classes Datasets ILSVRC’10: 1.2M training images, 1000 classes ILSVRC’12: 1.2M training images, 1000 classes Features Dense SIFT features, Quantized into 1000 Bag of Visual Words Publically available features LBP, HOG, Dense SIFT (for ILSVRC’12) Features Dense SIFT features, Quantized into 1000 Bag of Visual Words Publically available features LBP, HOG, Dense SIFT (for ILSVRC’12) Algorithms Nearest Class Mean - ML Classifier (NCM) [Mensink etal PAMI 2013] Nearest Non-Outlier Algorithm (NNO) [This Paper] 1vSet [Scheirer etal PAMI 2013] Linear SVM [Liblinear, Fan etal JMLR 2008] Algorithms Nearest Class Mean - ML Classifier (NCM) [Mensink etal PAMI 2013] Nearest Non-Outlier Algorithm (NNO) [This Paper] 1vSet [Scheirer etal PAMI 2013] Linear SVM [Liblinear, Fan etal JMLR 2008]

50 Initial Categories Increasing # of unknown categories during testing i.e. increasing openness of problem Incrementally adding categories during training Closed Set testing

Initial Categories Increasing # of unknown categories during testing i.e. increasing openness of problem Incrementally adding categories during training Closed Set testing 500 known unknown categories

Formalized Open World Recognition and showed how to “Open” an existing algorithm. NNO allows construction of scalable systems that can be updated incrementally Conclusion & Future Work 17 See us at Poster no 85 …!!!! Exploring sophisticated novelty detectors, open world detection, “opening” other baseline algorithms etc. Open World Deep Learning methods Happy to Collaborate…!!!