Download presentation
Presentation is loading. Please wait.
1
Support Vector Machines part 2
21 March 2013 Some slides from F. Bach and Z. Harchaoui
2
Motivation Max-margin classification Classification with kernels
Left image: Right image:
3
Primal and Dual formulations
Maximize where Subject to these constraints:
4
Primal vs Dual Formulations
n (# samples) params. Can efficiently handle very high dimensional data No need for explicit features (kernel trick) Is not very efficient for very large data sets Need to store the support vectors Primal d (data dimension) parameters Efficient when the number of samples is high (millions) – stochastic grad. desc. Easy on memory (store only w and b)
5
SVM Kernel Functions K(a,b)=(a . b +1)d is an example of an SVM Kernel Function Beyond polynomials there are other very high dimensional basis functions that can be made practical by finding the right Kernel Function Radial-Basis-style Kernel Function: Bandwidth: from linear classifier to NN methos Copyright © 2001, 2003, Andrew W. Moore
6
Kernel Tricks Replacing dot product with a kernel function
Not all functions are kernel functions Need to be decomposable K(a,b) = (a) (b) Could K(a,b) = (a-b)3 be a kernel function ? Could K(a,b) = (a-b)4 – (a+b)2 be a kernel function? Copyright © 2001, 2003, Andrew W. Moore
7
SVM for CSE 802 Project Copyright © 2001, 2003, Andrew W. Moore
8
SVM - FAQ Which formulation to use? How to set C?
Which kernel to use? How to determine the kernel parameters? How to use SVM for multi-class problems (project)?
9
Multi-class / multi-label SVM
Multi-class classification: each instance belong to one of the K classes Multi-label classification: each instance might belong to one or more members of the K classes
10
Multi-class / multi-label SVM
For multi-class only!!!
11
OvsO vs OvsR OvR: K classifiers with n instances each
OvO: K(K-1)/2 classifiers with O(2n/K) instances each (average) OvR: Data imbalance problem OvO: Hot to decide the winner
12
Data imbalance problem
13
A small experiment
14
A small experiment
15
SVM for CSE 802 Project
16
SVM for CSE 802 Project I have many feature or kernel parameter (type, bandwidth, degree) options and I am indecisive. I have many feature or kernel parameter (type, bandwidth, degree) options and I want to use them all somehow.
19
Gradient descend
20
Stochastic gradient descend
21
SVM - FAQ Which formulation to use? How to set C?
Which kernel to use? How to determine the kernel parameters? How to use SVM for multi-class problems (project)?
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.