Correlative Multi-Label Multi-Instance Image Annotation

Xiangyang Xue, Wei Zhang, Jie Zhang, Bin Wu, Jianping Fan, and Yao Lu
Fudan University, China & UNC-Charlotte

Overview

The global visual features of the entire image and the local features of its regions are extracted to capture coarse and fine patterns, respectively. The associations between semantic concepts and visual features are mined at both the image level and the region level. Inter-label correlations are captured by a co-occurrence matrix of concept pairs. The cross-level label coherence encodes the consistency between the image-level labels and the region-level labels.

The Model

[Model equation not preserved in the transcript.]

Two (nonlinear) functions map the input global features of the entire image and the local features of each image region to their kernel spaces, respectively. Each image has a label vector, and the r-th region of the image has its own concept label vector. For the l-th concept, the model uses the set of all concepts related to it. The parameter vectors and the bias parameters are learned.

Part I encodes the associations between image-level labels and global visual features.
Part II models the associations between region-level labels and local visual features.
Part III captures the inter-label correlations dependent on the image features.
Part IV measures the coherence between image-level labels and region-level labels.

Learning the model by minimizing the cost function

[Cost function not preserved in the transcript.]

Based on the design of the proposed model, we divide the optimization problem into interrelated sub-problems and then learn the model efficiently.

Experimental Results

We evaluate our method on the MSRC and Corel image datasets against related competitive algorithms: RML [1], RankSVM [2], ML-kNN [3], and TagProp [4].

Results on Corel Dataset

Figure: F scores of our method and the related competitive algorithms for 10 image-level labels on the Corel dataset.
Figure: Region-level labeling results of our method for some exemplary images from the Corel dataset.

Results on MSRC Dataset

Figure: The inter-label correlation matrix, based on the harmonic mean of empirical conditional probabilities, illustrates the interdependency between concepts on the MSRC dataset. The brighter the block, the stronger the correlation between the two labels.
Figure: F scores of our method and the related competitive algorithms for individual image-level labels on the MSRC dataset.
Figure: Region-level labeling results of our method for some exemplary images from MSRC. Top: the ground truth; Bottom: our results.

Conclusions

Both image-level labels and region-level labels can be obtained in a single framework by capturing the feature-label associations, the inter-label correlations, and the cross-level label coherence. A structural max-margin technique is used to formulate the proposed model. By decoupling the annotation task into interdependent subproblems, we learn multiple interrelated classifiers jointly.

References

[1] J. Petterson and T. Caetano. Reverse multi-label learning. In NIPS, 2010.
[2] A. Elisseeff and J. Weston. A kernel method for multi-labeled classification. In NIPS, 2002.
[3] M.-L. Zhang and Z.-H. Zhou. ML-kNN: A lazy learning approach to multi-label learning. Pattern Recognition, 40(7):2038–2048, 2007.
[4] M. Guillaumin, T. Mensink, J. Verbeek, and C. Schmid. TagProp: Discriminative metric learning in nearest neighbor models for image auto-annotation. In ICCV, 2009.
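
As a rough illustration of the inter-label correlation matrix mentioned in the MSRC results, the sketch below computes, for every pair of labels, the harmonic mean of the two empirical conditional probabilities P(i | j) and P(j | i) from a binary image-by-label matrix. The function name `label_correlation`, the input layout, and the handling of labels that never occur are assumptions made for this example; the poster does not give the exact estimator or any normalization the authors applied.

```python
import numpy as np


def label_correlation(Y):
    """Harmonic mean of P(i | j) and P(j | i) for every label pair.

    Y is a binary matrix of shape (n_images, n_labels), where Y[n, i] = 1
    means image n carries label i. This layout is an assumption of the sketch.
    """
    Y = np.asarray(Y, dtype=float)
    co = Y.T @ Y                  # co[i, j]: number of images with both labels
    counts = np.diag(co)          # counts[i]: number of images with label i
    # With P(i | j) = co[i, j] / counts[j] and P(j | i) = co[i, j] / counts[i],
    # the harmonic mean simplifies to 2 * co[i, j] / (counts[i] + counts[j]).
    denom = counts[:, None] + counts[None, :]
    corr = np.zeros_like(co)
    # Pairs where neither label ever occurs are left at 0 (a choice of this sketch).
    np.divide(2.0 * co, denom, out=corr, where=denom > 0)
    return corr


if __name__ == "__main__":
    # Four toy images annotated with three hypothetical labels.
    Y = [[1, 1, 0],
         [1, 0, 0],
         [0, 1, 1],
         [1, 1, 0]]
    print(np.round(label_correlation(Y), 3))
```

The result is symmetric with ones on the diagonal for every label that occurs at least once, so it can be rendered directly as a brightness map like the one described in the figure caption.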