The Chinese University of Hong Kong Learning Larger Margin Machine Locally and Globally Dept. of Computer Science and Engineering The Chinese University.

Slides:

Advertisements

Similar presentations

Introduction to Support Vector Machines (SVM)

Advertisements

ECG Signal processing (2)

ICONIP 2005 Improve Naïve Bayesian Classifier by Discriminative Training Kaizhu Huang, Zhangbing Zhou, Irwin King, Michael R. Lyu Oct

Tighter and Convex Maximum Margin Clustering Yu-Feng Li (LAMDA, Nanjing University, China) Ivor W. Tsang.

Support Vector Machines Instructor Max Welling ICS273A UCIrvine.

Support Vector Machines

SVM—Support Vector Machines

CSCI 347 / CS 4206: Data Mining Module 07: Implementations Topic 03: Linear Models.

Separating Hyperplanes

Second order cone programming approaches for handing missing and uncertain data P. K. Shivaswamy, C. Bhattacharyya and A. J. Smola Discussion led by Qi.

Fei Xing1, Ping Guo1,2 and Michael R. Lyu2

Support Vector Machines (SVMs) Chapter 5 (Duda et al.)

Learning Maximum Likelihood Bounded Semi-Naïve Bayesian Network Classifier Kaizhu Huang, Irwin King, Michael R. Lyu Multimedia Information Processing Laboratory.

Support Vector Classification (Linearly Separable Case, Primal) The hyperplanethat solves the minimization problem: realizes the maximal margin hyperplane.

Efficient Convex Relaxation for Transductive Support Vector Machine Zenglin Xu 1, Rong Jin 2, Jianke Zhu 1, Irwin King 1, and Michael R. Lyu 1 4. Experimental.

Variations of Minimax Probability Machine Huang, Kaizhu

MURI Meeting July 2002 Gert Lanckriet ( ) L. El Ghaoui, M. Jordan, C. Bhattacharrya, N. Cristianini, P. Bartlett.

Classification with reject option in gene expression data Blaise Hanczar and Edward R Dougherty BIOINFORMATICS Vol. 24 no , pages

1 Integrating User Feedback Log into Relevance Feedback by Coupled SVM for Content-Based Image Retrieval 9-April, 2005 Steven C. H. Hoi *, Michael R. Lyu.

Classification Problem 2-Category Linearly Separable Case A- A+ Malignant Benign.

Support Vector Machine Regression for Volatile Stock Market Prediction Haiqin Yang, Laiwan Chan, and Irwin King Department of Computer Science and Engineering.

Constructing a Large Node Chow-Liu Tree Based on Frequent Itemsets Kaizhu Huang, Irwin King, Michael R. Lyu Multimedia Information Processing Laboratory.

Sample-Separation-Margin Based Minimum Classification Error Training of Pattern Classifiers with Quadratic Discriminant Functions Yongqiang Wang 1,2, Qiang.

Discriminative Naïve Bayesian Classifiers Kaizhu Huang Supervisors: Prof. Irwin King, Prof. Michael R. Lyu Markers: Prof. Lai Wan Chan, Prof. Kin Hong.

Dept. of Computer Science & Engineering, CUHK Pseudo Relevance Feedback with Biased Support Vector Machine in Multimedia Retrieval Steven C.H. Hoi 14-Oct,

Finite mixture model of Bounded Semi- Naïve Bayesian Network Classifiers Kaizhu Huang, Irwin King, Michael R. Lyu Multimedia Information Processing Laboratory.

A Study of the Relationship between SVM and Gabriel Graph ZHANG Wan and Irwin King, Multimedia Information Processing Laboratory, Department of Computer.

Support Vector Machines

Dept. of C.S.E., C.U.H.K. 1 Learning From Data Locally and Globally Kaizhu Huang Supervisors: Prof. Irwin King, Prof. Michael R. Lyu Prof. Michael R. Lyu.

What is Learning All about ?  Get knowledge of by study, experience, or being taught  Become aware by information or from observation  Commit to memory.

SVM (Support Vector Machines) Base on statistical learning theory choose the kernel before the learning process.

Optimization Theory Primal Optimization Problem subject to: Primal Optimal Value:

Review Rong Jin. Comparison of Different Classification Models  The goal of all classifiers Predicating class label y for an input x Estimate p(y|x)

An Introduction to Support Vector Machines Martin Law.

Soft Margin Estimation for Speech Recognition Main Reference: Jinyu Li, " SOFT MARGIN ESTIMATION FOR AUTOMATIC SPEECH RECOGNITION," PhD thesis, Georgia.

Incremental Support Vector Machine Classification Second SIAM International Conference on Data Mining Arlington, Virginia, April 11-13, 2002 Glenn Fung.

Efficient Model Selection for Support Vector Machines

CS 8751 ML & KDDSupport Vector Machines1 Support Vector Machines (SVMs) Learning mechanism based on linear programming Chooses a separating plane based.

Feature Selection in Nonlinear Kernel Classification Olvi Mangasarian Edward Wild University of Wisconsin Madison.

1 SUPPORT VECTOR MACHINES İsmail GÜNEŞ. 2 What is SVM? A new generation learning system. A new generation learning system. Based on recent advances in.

IEEE TRANSSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE

10/18/ Support Vector MachinesM.W. Mak Support Vector Machines 1. Introduction to SVMs 2. Linear SVMs 3. Non-linear SVMs References: 1. S.Y. Kung,

An Introduction to Support Vector Machine (SVM) Presenter : Ahey Date : 2007/07/20 The slides are based on lecture notes of Prof. 林智仁 and Daniel Yeung.

Machine Learning Using Support Vector Machines (Paper Review) Presented to: Prof. Dr. Mohamed Batouche Prepared By: Asma B. Al-Saleh Amani A. Al-Ajlan.

Proximal Support Vector Machine Classifiers KDD 2001 San Francisco August 26-29, 2001 Glenn Fung & Olvi Mangasarian Data Mining Institute University of.

ICML2004, Banff, Alberta, Canada Learning Larger Margin Machine Locally and Globally Kaizhu Huang Haiqin Yang, Irwin King, Michael.

CS Statistical Machine learning Lecture 18 Yuan (Alan) Qi Purdue CS Oct

An Introduction to Support Vector Machines (M. Law)

Using Support Vector Machines to Enhance the Performance of Bayesian Face Recognition IEEE Transaction on Information Forensics and Security Zhifeng Li,

Computational Intelligence: Methods and Applications Lecture 23 Logistic discrimination and support vectors Włodzisław Duch Dept. of Informatics, UMK Google:

Dept. of C.S.E., C.U.H.K. 1 Learning From Data Locally and Globally Kaizhu Huang Supervisors: Prof. Irwin King, Prof. Michael R. Lyu Prof. Michael R. Lyu.

An Introduction to Support Vector Machine (SVM)

Final Exam Review CS479/679 Pattern Recognition Dr. George Bebis 1.

Classification Course web page: vision.cis.udel.edu/~cv May 14, 2003  Lecture 34.

Feature Selction for SVMs J. Weston et al., NIPS 2000 오장민 (2000/01/04) Second reference : Mark A. Holl, Correlation-based Feature Selection for Machine.

Greg GrudicIntro AI1 Support Vector Machine (SVM) Classification Greg Grudic.

Proximal Plane Classification KDD 2001 San Francisco August 26-29, 2001 Glenn Fung & Olvi Mangasarian Second Annual Review June 1, 2001 Data Mining Institute.

A Parallel Mixture of SVMs for Very Large Scale Problems Ronan Collobert Samy Bengio Yoshua Bengio Prepared ： S.Y.C. Neural Information Processing Systems,

Computational Intelligence: Methods and Applications Lecture 24 SVM in the non-linear case Włodzisław Duch Dept. of Informatics, UMK Google: W Duch.

Incremental Reduced Support Vector Machines Yuh-Jye Lee, Hung-Yi Lo and Su-Yun Huang National Taiwan University of Science and Technology and Institute.

SUPPORT VECTOR MACHINES Presented by: Naman Fatehpuria Sumana Venkatesh.

Discriminative Training of Chow-Liu tree Multinet Classifiers

Robust Optimization and Applications in Machine Learning

Pattern Recognition CS479/679 Pattern Recognition Dr. George Bebis

Learning with information of features

Statistical Learning Dong Liu Dept. EEIS, USTC.

Minimax Probability Machine (MPM)

Zhenjiang Lin, Michael R. Lyu and Irwin King

A Classification-based Approach to Question Routing in Community Question Answering Tom Chao Zhou 22, Feb, 2010 Department of Computer.

Linear Discrimination

Presentation transcript:

The Chinese University of Hong Kong Learning Larger Margin Machine Locally and Globally Dept. of Computer Science and Engineering The Chinese University of Hong Kong Shatin, NT. Hong Kong Kaizhu Huang February 9, 2004

The Chinese University of Hong Kong Contributions Theory: A unified model of Support Vector Machine (SVM), Minimax Probability Machine (MPM), and Linear Discriminant Analysis (LDA). Practice: A sequential Conic Programming Problem.

The Chinese University of Hong Kong Outline Background And Motivation Maxi-Min Margin Machine(M 4 ) –Model Definition –Geometrical Interpretation –Solving Methods –Connections With Other Models M 4 : Non-separable case Experimental Results Future Work Conclusion

The Chinese University of Hong Kong Background: Classifier

The Chinese University of Hong Kong Background: SVM A more reasonable decision plane SVM Support Vectors

The Chinese University of Hong Kong Maxi-Min Margin Machine(M 4 )

The Chinese University of Hong Kong M4:Geometrical Interpretation

The Chinese University of Hong Kong M4:Solving Method Basic Technique: Divide and Conquer –If we fix to a specific, the problem changes to check whether this satisfies the following constraints: –If yes, we increase ; otherwise, we decrease it. Second Order Cone Programming Problem!!!

The Chinese University of Hong Kong M4:Solving Method (Continue) Iterate the following two steps to solve M 4 :

The Chinese University of Hong Kong M4:Solving Method (Continue) can it satisfy the constraints? Yes No

The Chinese University of Hong Kong Connection with MPM Span all the data points and add them together + Exactly MPM Optimization Problem!!!

The Chinese University of Hong Kong Connection with MPM Remarks: –The procedure is not reversible: MPM is a special case of M 4 –MPM focuses on building decision boundary GLOBALLY, i.e., it exclusively depends on the means and covariances. However, means and covariances may not be accurately estimated. MPM M 4

The Chinese University of Hong Kong Connection With SVM The magnitude of w can scale up without influencing the optimization SVM with a further assumption:

The Chinese University of Hong Kong Connection With SVM M 4 SVM M 4 SVM

The Chinese University of Hong Kong Connection With SVM SVM assumes

The Chinese University of Hong Kong Links With LDA Perform the similar procedure as in MPM LDA

The Chinese University of Hong Kong Link With LDA

The Chinese University of Hong Kong Non-separable Case

The Chinese University of Hong Kong Experimental Results-Synthetic Toy example

The Chinese University of Hong Kong Experimental Results-Benchmark Datasets

The Chinese University of Hong Kong Future Work Kernelization? –Nonlinear extension of M4 Speed-up algorithms? –Is critical in large-scale applications Generation error bound? –SVM and MPM have both error bounds. Multi-way classification extension?

The Chinese University of Hong Kong Conclusion Propose a unified model of MPM and SVM Propose feasible solving methods based on sequential Second Order Cone Programming.