Yu-Feng Li 1, James T. Kwok2, Ivor W. Tsang3 and Zhi-Hua Zhou1

Slides:



Advertisements
Similar presentations
A Support Vector Method for Optimizing Average Precision
Advertisements

Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
On-line learning and Boosting
Tighter and Convex Maximum Margin Clustering Yu-Feng Li (LAMDA, Nanjing University, China) Ivor W. Tsang.
Structured SVM Chen-Tse Tsai and Siddharth Gupta.
Pattern Recognition and Machine Learning
SVM—Support Vector Machines
Support vector machine
Machine learning continued Image source:
CSCI 347 / CS 4206: Data Mining Module 07: Implementations Topic 03: Linear Models.
Ziming Zhang*, Ze-Nian Li, Mark Drew School of Computing Science Simon Fraser University Vancouver, Canada {zza27, li, AdaMKL: A Novel.
Learning on Probabilistic Labels Peng Peng, Raymond Chi-wing Wong, Philip S. Yu CSE, HKUST 1.
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Multiple Instance Learning
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Fuzzy Support Vector Machines (FSVMs) Weijia Wang, Huanren Zhang, Vijendra Purohit, Aditi Gupta.
Support Vector Machines (SVMs) Chapter 5 (Duda et al.)
Multiple-Instance Learning Paper 1: A Framework for Multiple-Instance Learning [Maron and Lozano-Perez, 1998] Paper 2: EM-DD: An Improved Multiple-Instance.
Efficient Convex Relaxation for Transductive Support Vector Machine Zenglin Xu 1, Rong Jin 2, Jianke Zhu 1, Irwin King 1, and Michael R. Lyu 1 4. Experimental.
Support Vector Machines Pattern Recognition Sergios Theodoridis Konstantinos Koutroumbas Second Edition A Tutorial on Support Vector Machines for Pattern.
Support Vector Machines for Multiple- Instance Learning Authors: Andrews, S.; Tsochantaridis, I. & Hofmann, T. (Advances in Neural Information Processing.
Support Vector Machines Based on Burges (1998), Scholkopf (1998), Cristianini and Shawe-Taylor (2000), and Hastie et al. (2001) David Madigan.
1 Integrating User Feedback Log into Relevance Feedback by Coupled SVM for Content-Based Image Retrieval 9-April, 2005 Steven C. H. Hoi *, Michael R. Lyu.
Image Categorization by Learning and Reasoning with Regions Yixin Chen, University of New Orleans James Z. Wang, The Pennsylvania State University Published.
Presentation in IJCNN 2004 Biased Support Vector Machine for Relevance Feedback in Image Retrieval Hoi, Chu-Hong Steven Department of Computer Science.
Dept. of Computer Science & Engineering, CUHK Pseudo Relevance Feedback with Biased Support Vector Machine in Multimedia Retrieval Steven C.H. Hoi 14-Oct,
1 Multiple Kernel Learning Naouel Baili MRL Seminar, Fall 2009.
Wayne State University, 1/31/ Multiple-Instance Learning via Embedded Instance Selection Yixin Chen Department of Computer Science University of.
Online Learning Algorithms
Non Negative Matrix Factorization
10/18/ Support Vector MachinesM.W. Mak Support Vector Machines 1. Introduction to SVMs 2. Linear SVMs 3. Non-linear SVMs References: 1. S.Y. Kung,
Multiple Instance Real Boosting with Aggregation Functions Hossein Hajimirsadeghi and Greg Mori School of Computing Science Simon Fraser University International.
Kernel Methods A B M Shawkat Ali 1 2 Data Mining ¤ DM or KDD (Knowledge Discovery in Databases) Extracting previously unknown, valid, and actionable.
ICML2004, Banff, Alberta, Canada Learning Larger Margin Machine Locally and Globally Kaizhu Huang Haiqin Yang, Irwin King, Michael.
Classifiers Given a feature representation for images, how do we learn a model for distinguishing features from different classes? Zebra Non-zebra Decision.
Extending the Multi- Instance Problem to Model Instance Collaboration Anjali Koppal Advanced Machine Learning December 11, 2007.
Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.
Multiple Instance Learning via Successive Linear Programming Olvi Mangasarian Edward Wild University of Wisconsin-Madison.
Ohad Hageby IDC Support Vector Machines & Kernel Machines IP Seminar 2008 IDC Herzliya.
1  The Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.
Multiple Instance Learning for Sparse Positive Bags Razvan C. Bunescu Machine Learning Group Department of Computer Sciences University of Texas at Austin.
Support vector machine LING 572 Fei Xia Week 8: 2/23/2010 TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A 1.
1  Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.
Presented by: Mingkui Tan, Li Wang, Ivor W. Tsang School of Computer Engineering June 21-24, ICML2010 Haifa, Israel Learning Sparse SVM.
Non-separable SVM's, and non-linear classification using kernels Jakob Verbeek December 16, 2011 Course website:
Combining Models Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya.
Semi-Supervised Learning Using Label Mean
PREDICT 422: Practical Machine Learning
LINEAR CLASSIFIERS The Problem: Consider a two class task with ω1, ω2.
Linli Xu Martha White Dale Schuurmans University of Alberta
Large Margin classifiers
MIRA, SVM, k-NN Lirong Xia. MIRA, SVM, k-NN Lirong Xia.
Glenn Fung, Murat Dundar, Bharat Rao and Jinbo Bi
Support Vector Machines
Support Vector Machines
J. Zhu, A. Ahmed and E.P. Xing Carnegie Mellon University ICML 2009
Data Mining Practical Machine Learning Tools and Techniques
A New Boosting Algorithm Using Input-Dependent Regularizer
Statistical Learning Dong Liu Dept. EEIS, USTC.
Multiple Instance Learning: applications to computer vision
COSC 4335: Other Classification Techniques
Presented by: Chang Jia As for: Pattern Recognition
SVM-based Deep Stacking Networks
Shih-Wei Lin, Kuo-Ching Ying, Shih-Chieh Chen, Zne-Jung Lee
Usman Roshan CS 675 Machine Learning
View Inter-Prediction GAN: Unsupervised Representation Learning for 3D Shapes by Learning Global Shape Memories to Support Local View Predictions 1,2 1.
Learning Incoherent Sparse and Low-Rank Patterns from Multiple Tasks
Primal Sparse Max-Margin Markov Networks
SVMs for Document Ranking
MIRA, SVM, k-NN Lirong Xia. MIRA, SVM, k-NN Lirong Xia.
Presentation transcript:

Yu-Feng Li 1, James T. Kwok2, Ivor W. Tsang3 and Zhi-Hua Zhou1 A Convex Method for Locating Regions of Interest with Multi-Instance Learning Yu-Feng Li 1, James T. Kwok2, Ivor W. Tsang3 and Zhi-Hua Zhou1 1 LAMDA Group, Nanjing University, China 2 Hong Kong University of Science & Technology, Hong Kong 3 Nanyang Technological University, Singapore {liyf,zhouzh}@lamda.nju.edu.cn, jamesk@cse.ust.hk, IvorTsang@ntu.edu.sg

Multi-Instance Learning MIL [Dietterich et al., AIJ97] attempts to learn from a training set consists of bags each containing many instances - A bag is positive if it contains at least one positive instances; otherwise negative. - The labels of training bags are known, however, the labels of instances in the bags are unknown. Multi-instance learning Traditional supervised learning In MIL, identifying positive instances is an important problem understanding the relation between the bag and input patterns. Figure reprinted from [Zhi-Hua Zhou et al., icml09]

Example: ROI In relevant feedback of CBIR, usually user is only interested in some regions, i.e., regions of interest (ROI), in the images. Locating ROIs could be used in other area, i.e., image screening…

The Problem How to develop an efficient and theoretical supported method for locating the ROIs? …This presentation

Outline Introduction Our Methods Experiments Conclusion

Multi-Instance Learning Introduction Multi-Instance Learning Originated from the research on drug activity prediction [Dietterich et al. AIJ97] Drugs are small molecules working by binding to the target area For molecules qualified to make the drug, one of its shapes could tightly bind to the target area A molecule may have many alternative shapes The difficulty: Biochemists know that whether a molecule is qualified or not, but do not know which shape responses for the qualification Figure reprinted from [Dietterich et al., AIJ97]

Multi-Instance Learning (con’t) Introduction Multi-Instance Learning (con’t) Each shape can be represented by a feature vector, i.e., an instance … [a1, a2, …, am]T [b1, b2, …, bm]T [u1, u2, …, um]T one bag one molecule Thus, a molecule is a bag of instances A bag is positive if it contains at least one positive instance; otherwise it is negative The labels of the training bags are known The labels of the instances in the training bags are unknown Figure reprinted from [Zhi-Hua Zhou et al., icml09]

MIL for Image Analysis Various applications Introduction MIL for Image Analysis Various applications Image categorization [Maron & Ratan, ICML’98; Chen & Wang, JMLR04; Chen et al., PAMI06] Image retrieval [Zhang et al., ICDE’02; Zhou et al., AJCAI’05] Face detection [Viola et al., NIPS’05; Zhang & Viola, NIPS’07] Computer-aided medical diagnosis [Fung et al., NIPS’06] … …

Related works for locating ROIs Introduction Related works for locating ROIs Diverse Density and its variants [Maron & Lozano-Pérez, NIPS98] [Maron & Ratan, ICML98] [Zhang & Goldman, NIPS02] [Ray & Craven, icml05] Effective in locating the ROIs Huge time cost, suffers from local minima CKNN-ROI [zhou et al., AJCAI05] A variation of Citation-kNN [wang & zucker,icml00] Efficient, but based on heuristic SVM based methods Very few focus on locating the ROIs, except MI-SVM [Andrew et al., nips03] Effect and efficient, but still suffers from local minima

Multi-Instance SVM [Andrew et al, nips03] Introduction Multi-Instance SVM [Andrew et al, nips03] Main Idea : Large Margin Principle Positive bag Negative bag Key instance, i.e., roi Optimal Hyperplane

A set of training bags: , where and Introduction Problem Setup A set of training bags: , where and The decision function: The goal is to learn f minimizes the structural risk  is strictly monotonically increasing function, l() is the loss function, C is the regularization parameter

consider and square hinge loss Introduction Problem Setup (cont.) consider and square hinge loss So, (1) becomes where This, however, is a non-convex problem because of the max operator and it may get stuck in local solution. In this paper, we propose a convex-relaxation method to solve such an non-convex problem.

Outline Introduction Our Methods Experiments Conclusion

Instance-level Key Instance SVM Our Methods Instance-level Key Instance SVM By introducing the indicator variables d and s(i,j) λ balances the slack variables from positive and negative bags

Bag-level Key Instance SVM Our Methods Bag-level Key Instance SVM Ins-KI-SVM may involves large number of constraints in optimization. Simply represent the negative bag by mean of its instances [Gartner et al., icml02] [Xu & Frank, pakdd04].

More General Formulation Our Methods More General Formulation It involves Ins-KI-SVM and Bag-KI-SVM as special cases when r = N and , , it reduces to Ins-KI-SVM when r = m and , it reduces to Bag-KI-SVM In this paper, we aim at solving above optimization problem instead!

Generate the most informative label Our Methods Convex relaxation – Intuitional View Our main observation efficient hard ? SVM 1 -1 SVM Note that Each label assignment corresponds to a kind of label-kernel in SVM dual. Learn weights for different label assignments reduces to multiple kernel learning which is convex and efficient in general. Our simple strategy 1 -1 SVM … 1 -1 ? Generate the most informative label

Convex relaxation approach Our Methods Convex relaxation approach Consider the dual Consider the minimax relaxation [Li et al., AISTATS’09] Multiple Kernel Learning

Convex relaxation approach (cont.) Our Methods Convex relaxation approach (cont.) Exponential number of base kernels…. Too expensive Cutting plane algorithm Adaptive SimpleMKL How?

Find the most violated d Our Methods Find the most violated d To find the most violated d, we need to solve the following maximization problem Rewritten as It is a concave QP, and could not be solved efficiently… However, the cutting plane method only requires to add a violated constraint at each iteration Approximated by Inf-norm Hence, we propose a simple and efficient method for finding a good approximation of the most violated d Linear Problem, can be solved by sorting

Outline Introduction Our Methods Experiments Conclusion

Experiments Two Kinds of Tasks CBIR image data Benchmark data sets

CBIR image data [zhou et al., AJCAI05] Experiments CBIR image data [zhou et al., AJCAI05] Some statistics Image size: 160 × 160 Feature representation by using SBN [Maron & Ratan ICML98] Evaluation: success rate, i.e., the ratio of the number of successes divided by the total number of relevant images

Experiments CBIR image data (cont.) One-vs-rest, 50 train/ 450 test (10/90 are relevant, respectively), 30 runs

Experiments Some examples

Different number of confident bags Experiments Different number of confident bags Ins-KI-SVM is consistently better than all the other SVM-based methods across all the settings.

Average cpu time at ordinary PC Experiments Speed Average cpu time at ordinary PC Performance per time cost = (1/rank)/time Bag-KI-SVM achieves highest performance per time cost among all svm-type methods.

10 fold Cross-validation Experiments Benchmark datasets 10 fold Cross-validation The performance of KI-SVMs are competitive with all these state-of-the-art methods

Thanks! Conclusion Main contribution: Future work: A convex method for locating ROIs with multi-instance learning Future work: Locating multiple rois or roi groups Thanks!