Axel Naumann, DØ University of Nijmegen, The Netherlands June 24, 2002 ACAT02, Moscow 1 Support Vector Regression
Axel Naumann, DØ University of Nijmegen, The Netherlands June 24, 2002 ACAT02, Moscow 2 SVR Drawings and illustrations from Bernhard Schölkopf, and Alex Smola: Learning with Kernels (MIT Press, Cambridge, MA, 2002)
Axel Naumann, DØ University of Nijmegen, The Netherlands June 24, 2002 ACAT02, Moscow 3 SVR - History Based on Learning Theory, consisting of few axioms on learning errors Started in 1960’s, still actively developed SVRs recently outperformed NNs in recognition tests on US Postal Service’s standard set of handwritten characters libSVM by Chih-Chung Chang and Chih-Jen Lin provides fast and simple to use implementation, extended as requests (e.g. from HEP) come in
Axel Naumann, DØ University of Nijmegen, The Netherlands June 24, 2002 ACAT02, Moscow 4 Training sample X, observed results Y Goal: f with y=f(x) Simplicity: Linear case, Formulation of Problem
Axel Naumann, DØ University of Nijmegen, The Netherlands June 24, 2002 ACAT02, Moscow 5 Optimal confidence = maximal margin Minimize quadratic problem with Quadratic problem: Unique solution! Optimizing the Confidence
Axel Naumann, DØ University of Nijmegen, The Netherlands June 24, 2002 ACAT02, Moscow 6 Non-Linearity Introduce mapping to higher dimensional space e.g. Gaussian kernel:
Axel Naumann, DØ University of Nijmegen, The Netherlands June 24, 2002 ACAT02, Moscow 7 Calculation
Axel Naumann, DØ University of Nijmegen, The Netherlands June 24, 2002 ACAT02, Moscow 8 L2 b Tagger Parameters
Axel Naumann, DØ University of Nijmegen, The Netherlands June 24, 2002 ACAT02, Moscow 9 L2 b Tagger Parameters
Axel Naumann, DØ University of Nijmegen, The Netherlands June 24, 2002 ACAT02, Moscow 10 L2 b Tagger Output SVRNN
Axel Naumann, DØ University of Nijmegen, The Netherlands June 24, 2002 ACAT02, Moscow 11 L2 b Tagger Discussion Complex problem increases number of SVs Almost non-separable classes still almost non-separable in high dimensional space High processing time due to large number of SVs NNs show better performance for low-information, low- separability problems
Axel Naumann, DØ University of Nijmegen, The Netherlands June 24, 2002 ACAT02, Moscow 12 Higgs Parameters Higgs SVR analysis by Daniel Whiteson, UC Berkley
Axel Naumann, DØ University of Nijmegen, The Netherlands June 24, 2002 ACAT02, Moscow 13 Higgs Parameters
Axel Naumann, DØ University of Nijmegen, The Netherlands June 24, 2002 ACAT02, Moscow 14 Higgs Output Background Signal
Axel Naumann, DØ University of Nijmegen, The Netherlands June 24, 2002 ACAT02, Moscow 15 Higgs Purity / Efficiency Purity
Axel Naumann, DØ University of Nijmegen, The Netherlands June 24, 2002 ACAT02, Moscow 16 Kernel Width Integrated Significance
Axel Naumann, DØ University of Nijmegen, The Netherlands June 24, 2002 ACAT02, Moscow 17 Summary SVR often superior to NN Not stuck in local minima: unique solution Better performance for many problems Implementation exists, actively supported by the development community Further information: Time for HEP!
Axel Naumann, DØ University of Nijmegen, The Netherlands June 24, 2002 ACAT02, Moscow 18 L2 b Tagger Correlation budcs SVR NN