Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 1 Multivariate Methods in HEP Pushpa Bhat Fermilab.

Slides:



Advertisements
Similar presentations
Experimental Particle Physics PHYS6011 Joel Goldstein, RAL 1.Introduction & Accelerators 2.Particle Interactions and Detectors (2) 3.Collider Experiments.
Advertisements

Freiburg Seminar, Sept Sascha Caron Finding the Higgs or something else ideas to improve the discovery ideas to improve the discovery potential at.
1 Data Analysis II Beate Heinemann UC Berkeley and Lawrence Berkeley National Laboratory Hadron Collider Physics Summer School, Fermilab, August 2008.
Search for Large Extra Dimensions at the Tevatron Bob Olivier, LPNHE Paris XXXVI ème Rencontre de Moriond Mars Search for Large Extra Dimensions.
Tau dilepton channel The data sample used in this analysis comprises high-p T inclusive lepton events that contain an electron with E T >20 GeV or a muon.
ACAT2000 Oct , 2000 Pushpa Bhat1 Advanced Analysis Techniques in HEP Pushpa Bhat Fermilab ACAT2000 Fermilab, IL October 2000 A reasonable man adapts.
Top Thinkshop-2 Nov , 2000 Pushpa Bhat1 Advanced Analysis Algorithms for Top Analysis Pushpa Bhat Fermilab Top Thinkshop 2 Fermilab, IL November.
Summary of Results and Projected Sensitivity The Lonesome Top Quark Aran Garcia-Bellido, University of Washington Single Top Quark Production By observing.
Searching for Single Top Using Decision Trees G. Watts (UW) For the DØ Collaboration 5/13/2005 – APSNW Particles I.
Summary of Results and Projected Precision Rediscovering the Top Quark Marc-André Pleier, Universität Bonn Top Quark Pair Production and Decay According.
Recent Electroweak Results from the Tevatron Weak Interactions and Neutrinos Workshop Delphi, Greece, 6-11 June, 2005 Dhiman Chakraborty Northern Illinois.
Top Turns Ten March 2 nd, Measurement of the Top Quark Mass The Low Bias Template Method using Lepton + jets events Kevin Black, Meenakshi Narain.
Kevin Black Meenakshi Narain Boston University
Top Physics at the Tevatron Mike Arov (Louisiana Tech University) for D0 and CDF Collaborations 1.
The new Silicon detector at RunIIb Tevatron II: the world’s highest energy collider What’s new?  Data will be collected from 5 to 15 fb -1 at  s=1.96.
Daniele Benedetti CMS and University of Perugia Chicago 07/02/2004 High Level Trigger for the ttH channel in fully hadronic decay at LHC with the CMS detector.
Introduction to Single-Top Single-Top Cross Section Measurements at ATLAS Patrick Ryan (Michigan State University) The measurement.
On the Trail of the Higgs Boson Meenakshi Narain.
Bayesian Neural Networks Pushpa Bhat Fermilab Harrison Prosper Florida State University.
Top Mass Measurement at the Tevatron HEP2005 Europhysics Conference Lisboa, Portugal, June 22, 2005 Koji Sato (Univ. of Tsukuba) for CDF and D0 Collaborations.
Single-Top Cross Section Measurements at ATLAS Patrick Ryan (Michigan State University) Introduction to Single-Top The measurement.
July 11, 2001Daniel Whiteson Support Vector Machines: Get more Higgs out of your data Daniel Whiteson UC Berkeley.
Alexander Khanov 25 April 2003 DIS’03, St.Petersburg 1 Recent B Physics results from DØ The B Physics program in D Ø Run II Current analyses – First results.
1 g g s Richard E. Hughes The Ohio State University for The CDF and D0 Collaborations Low Mass SM Higgs Search at the Tevatron hunting....
W properties AT CDF J. E. Garcia INFN Pisa. Outline Corfu Summer Institute Corfu Summer Institute September 10 th 2 1.CDF detector 2.W cross section measurements.
G. Cowan Statistical Methods in Particle Physics1 Statistical Methods in Particle Physics Day 3: Multivariate Methods (II) 清华大学高能物理研究中心 2010 年 4 月 12—16.
Jet Studies at CMS and ATLAS 1 Konstantinos Kousouris Fermilab Moriond QCD and High Energy Interactions Wednesday, 18 March 2009 (on behalf of the CMS.
Matthew Schwartz Harvard University with J. Gallicchio, PRL, 105:022001,2010 (superstructure) with K. Black, J. Gallicchio, J. Huth, M. Kagan and B. Tweedie.
Use of Multivariate Analysis (MVA) Technique in Data Analysis Rakshya Khatiwada 08/08/2007.
W+jets and Z+jets studies at CMS Christopher S. Rogan, California Institute of Technology - HCP Evian-les-Bains Analysis Strategy Analysis Overview:
Gavril Giurgiu, Carnegie Mellon, FCP Nashville B s Mixing at CDF Frontiers in Contemporary Physics Nashville, May Gavril Giurgiu – for CDF.
August 30, 2006 CAT physics meeting Calibration of b-tagging at Tevatron 1. A Secondary Vertex Tagger 2. Primary and secondary vertex reconstruction 3.
Calibration of the CMS Electromagnetic Calorimeter with first LHC data
Experimental aspects of top quark mass measurement Regina Demina University of Rochester 2008 Winter Conference Aspen, CO 01/15/08.
October 19, 2000ACAT 2000, Fermilab, Suman B. Beri Top Quark Mass Measurements Using Neural Networks Suman B. Beri, Rajwant Kaur Panjab University, India.
Measurements of Top Quark Properties at Run II of the Tevatron Erich W.Varnes University of Arizona for the CDF and DØ Collaborations International Workshop.
Higgs Reach Through VBF with ATLAS Bruce Mellado University of Wisconsin-Madison Recontres de Moriond 2004 QCD and High Energy Hadronic Interactions.
Top mass error predictions with variable JES for projected luminosities Joshua Qualls Centre College Mentor: Michael Wang.
F Don Lincoln, Fermilab f Fermilab/Boeing Test Results for HiSTE-VI Don Lincoln Fermi National Accelerator Laboratory.
RECENT RESULTS FROM THE TEVATRON AND LHC Suyong Choi Korea University.
Susan Burke DØ/University of Arizona DPF 2006 Measurement of the top pair production cross section at DØ using dilepton and lepton + track events Susan.
October 2011 David Toback, Texas A&M University Research Topics Seminar1 David Toback Texas A&M University For the CDF Collaboration CIPANP, June 2012.
1 Measurement of the Mass of the Top Quark in Dilepton Channels at DØ Jeff Temple University of Arizona for the DØ collaboration DPF 2006.
Single top quark physics Peter Dong, UCLA on behalf of the CDF and D0 collaborations Les Rencontres de Physique de la Vallee d’Aoste Wednesday, February.
April 7, 2008 DIS UCL1 Tevatron results Heidi Schellman for the D0 and CDF Collaborations.
G. Cowan Lectures on Statistical Data Analysis Lecture 6 page 1 Statistical Data Analysis: Lecture 6 1Probability, Bayes’ theorem 2Random variables and.
Kinematics of Top Decays in the Dilepton and the Lepton + Jets channels: Probing the Top Mass University of Athens - Physics Department Section of Nuclear.
La Thuile, March, 15 th, 2003 f Makoto Tomoto ( FNAL ) Prospects for Higgs Searches at DØ Makoto Tomoto Fermi National Accelerator Laboratory (For the.
1 UCSD Meeting Calibration of High Pt Hadronic W Haifeng Pi 10/16/2007 Outline Introduction High Pt Hadronic W in TTbar and Higgs events Reconstruction.
Backup slides Z 0 Z 0 production Once  s > 2M Z ~ GeV ÞPair production of Z 0 Z 0 via t-channel electron exchange. e+e+ e-e- e Z0Z0 Z0Z0 Other.
SEARCH FOR DIRECT PRODUCTION OF SUPERSYMMETRIC PAIRS OF TOP QUARKS AT √ S = 8 TEV, WITH ONE LEPTON IN THE FINAL STATE. Juan Pablo Gómez Cardona PhD Candidate.
Single Top Quark Production at D0, L. Li (UC Riverside) EPS 2007, July Liang Li University of California, Riverside On Behalf of the DØ Collaboration.
A Precision Measurement of the Mass of the Top Quark Abazov, V. M. et al. (D0 Collaboration). Nature 429, (2004) Presented by: Helen Coyle.
Viktor Veszpremi Purdue University, CDF Collaboration Tev4LHC Workshop, Oct , Fermilab ZH->vvbb results from CDF.
Studies of the Higgs Boson at the Tevatron Koji Sato On Behalf of CDF and D0 Collaborations 25th Rencontres de Blois Chateau Royal de Blois, May 29, 2013.
Search for Standard Model Higgs in ZH  l + l  bb channel at DØ Shaohua Fu Fermilab For the DØ Collaboration DPF 2006, Oct. 29 – Nov. 3 Honolulu, Hawaii.
Suyong Choi (SKKU) SUSY Standard Model Higgs Searches at DØ Suyong Choi SKKU, Korea for DØ Collaboration.
Low Mass Standard Model Higgs Boson Searches at the Tevatron Andrew Mehta Physics at LHC, Split, Croatia, September 29th 2008 On behalf of the CDF and.
Study of Diboson Physics with the ATLAS Detector at LHC Hai-Jun Yang University of Michigan (for the ATLAS Collaboration) APS April Meeting St. Louis,
ICHEP 2002, Amsterdam Marta Calvi - Study of Spectral Moments… 1 Study of Spectral Moments in Semileptonic b Decays with the DELPHI Detector at LEP Marta.
1 Donatella Lucchesi July 22, 2010 Standard Model High Mass Higgs Searches at CDF Donatella Lucchesi For the CDF Collaboration University and INFN of Padova.
First Evidence for Electroweak Single Top Quark Production
Integration and alignment of ATLAS SCT
An Important thing to know.
Experimental Particle PhysicsPHYS6011 Performing an analysis Lecture 5
Top mass measurements at the Tevatron and the standard model fits
Greg Heath University of Bristol
Measurement of the Single Top Production Cross Section at CDF
Susan Burke, University of Arizona
Measurement of b-jet Shapes at CDF
Presentation transcript:

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 1 Multivariate Methods in HEP Pushpa Bhat Fermilab

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 2 Outline Introduction/History Physics Analysis Examples Popular Methods Likelihood Discriminants Neural Networks Bayesian Learning Decision Trees Future Issues and Concerns Summary

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 3 Some History In 1990 most of the HEP community was skeptical towards use of multivariate methods, particularly so in case of neural networks (NN) NN as a black box  Can’t understand weights  Nonlinear mapping; higher order correlations  Though mathematical function can’t explain in terms of physics  Can’t calculate systematic errors reliably  Uni-variate or “cut-based” analysis was the norm Some were pursuing application of neural network methods to HEP around 1990 Peterson, Lonnblad, Denby, Becks, Seixas, Lindsey, etc First AIHENP (Artificial Intelligence in High Energy & Nuclear Physics) workshop was in Organizers included D. Perret-Gallix, K.H. Becks, R. Brun, J.Vermaseren. AIHENP metamorphosed into ACAT ten years later, in 2000 Multivariate methods such as Fisher discriminants were in limited use. In 1990, I began to pursue the use of multivariate methods, especially NN, in top quark searches at Dzero.

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 4 Mid-1990’s LEP experiments had been using NN and likelihood discriminants for particle-ID applications and eventually for signal searches (Steinberger; tau-ID) H1 at HERA successfully implemented and used NN for triggering (Kiesling). Hardware NN was attempted at Fermilab at CDF Fermilab Advanced Analysis Methods Group brought CDF and DØ together for discussion of these methods and applications in physics analyses.

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 5 The Top Quark Post-Evidence, Pre-Discovery ! Fisher Analysis of tt  e  channel One candidate event (S/B)(m t = 180 GeV) = 18 w.r.t. Z  = 10 w.r.t WW NN Analysis tt  e+jets channel tt W+jets tt160Data P. Bhat, DPF94

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 6 Cut Optimization for Top Discovery Feb. ‘95 Signal Background Jan. ’95 (Aspen) cut Mar. ’95 Discovery cut Contours: Possible NN cuts Feb. ‘95 Sig. Eff. S/B (Feb-Mar, 95 -Discovery Conventional cut) S/B reach with 2-v NN analysis for similar efficiency (Jan, 95 –Aspen mtg. Conventional cut) Neural Network Equi-probability Contour cuts from 2-variable analysis compared with conventional cuts used in Jan. ’95 and in Observation paper P. Bhat, H.Prosper, E. Amidi D0 Top Marathon, Feb. ‘95

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 7 Measurement of the Top Quark Mass Discriminant variables m t = ± 5.6(stat.) ± 6.2 (syst.) GeV/c 2 The Discriminants DØ Lepton+jets Fit performed in 2-D: (D LB/NN, m fit ) Run I (1996) result with NN and likelihood Recent (CDF+D0) m t measurement: m t = ± 2.1 Gev/c 2 First significant physics result using multivariate methods

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 8 Higgs, the Holy Grail of HEP Discovery Reach at the Tevatron The challenges are daunting! But using NN provides same reach with a factor of 2 less luminosity w.r.t. conventional analysis Improved bb mass resolution & b-tag efficiency crucial Run II Higgs study hep-ph/ (Oct-2000) P.C.Bhat, R.Gilmartin, H.Prosper, Phys.Rev.D.62 (2000)

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 9 Then, it got easier One of the important steps in getting the NN accepted at the Tevatron experiments was to make the Bayesian connection. Another important message to drive home was “the maximal use of information in the event” for the job at hand Developed a random grid search technique that can be used as baseline for comparison Neural network methods now have become popular due to the ease of use, power and many successful applications Maybe too easy??

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 10 Optimal Event Selection r(x,y) = constant defines an optimal decision boundary r(x,y) = constant defines an optimal decision boundary Feature space S =B = Conventional cuts

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 11 The NN-Bayesian Connection Output of a feed forward neural network can approximate the posterior probability P(s|x 1,x 2 ).

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 12 Limitations of “Conventional NN” The training yields one set of weights or network parameters Need to look for “best” network, but avoid overfitting Heuristic decisions on network architecture Inputs, number of hidden nodes, etc. No direct way to compute uncertainties

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 13 Ensembles of Networks NN 1 NN 2 NN 3 NN M X y1y1 y2y2 y3y3 yMyM Decision by averaging over many networks (a committee of networks) has lower error than that of any individual network.

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 14 Bayesian Learning The result of Bayesian training is a posterior density of the network weights  P(w|training data) Generate a sequence of weights (network parameters) in the network parameter space i.e., a sequence of networks. The optimal network is approximated by averaging over the last K points:

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 15 Bayesian Learning – 2 Advantages Less prone to over-fitting Less need to optimize the size of the network. Can use a large network! Indeed, number of weights can be greater than number of training events! p(t|x)In principle, provides best estimate of p(t|x) Disadvantages Computationally demanding! The dimensionality of the parameter space is, typically, large There could be multiple maxima in the likelihood function p(t|x,w), or, equivalently, multiple minima in the error function E(x,w).

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 16 Example: Single Top Search Training Data 2000 events (1000 tqb-  Wbb-  ) Standard set of 11 variables Network 391(11, 30, 1) Network (391 parameters!) Markov Chain Monte Carlo (MCMC) 500 iterations, but use last 100 iterations 20 MCMC steps per iteration NN-parameters stored after each iteration 10,000 steps ~ 1000 steps / hour (on 1 GHz, Pentium III laptop)

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 17 Signal:tqb; Background:Wbb Distributions Example: Single Top Search

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 18

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 19 Decision Trees Recover events that fail criteria in cut-based analyses Start at first “node” with a fraction of the “training sample” Select best variable and cut with best separation to produce two “branches ” of events, (F)ailed and (P)assed cut Repeat recursively on successive nodes Stop when improvement stops or when too few events are left Terminal node is called a “leaf ” with purity = N s /(N s +N b ) Run remaining events and data through the tree to derive results Boosting DT: Boosting is a recently developed technique that improves any weak classifier (decision tree, neural network, etc) Boosting averages the results of many trees, dilutes the discrete nature of the output, improves the performance DØ single top analysis

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 20 Matrix Element Method Example: Top mass measurement Maximal use of information in each event by calculating event-by- event signal and background probabilities based on the respective matrix element x: reconstructed kinematic variables of final state objects JES: jet energy Scale from Mw constraint Signal and background probabilities from differential cross sections Write combined likelihood for all events Maximize likelihood w.r.t. m top, JES

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 21 Summary Multivariate methods are now used extensively in HEP data analysis Neural networks, because of their ease of use and power, are favorites for particle-ID and signal/background discrimination Bayesian neural networks take us one step closer to optimization Likelihood discriminants and Decision trees are becoming popular because they are easier to “defend” (no “black-box” stigma) Many issues remain to be addressed as we get ready to deploy the multivariate methods for discoveries in HEP

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 22 Nothing tends so much to the advancement of knowledge as the application of a new instrument - Humphrey Davy No amount of experimentation can ever prove me right; a single experiment can prove me wrong. - Albert Einstein

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 23 CDF DØ Booster World’s Highest Energy Laboratory (for now)

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 24 Our Fancy New Toys LHC Ring SPS Ring PS Circumference = 27km Beam Energy = 7.7 TeV Luminosity =1.65x10 34 cm -2 sec -1 Startup date: 2007 p p LHC Magnet LHC Tunnel TI 2 TI 8 The Large Hadron Collider CMS

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 25 LHC Environment 14 TeV Proton colliding beams ParameterValue Bunch-crossing frequency40 MHz Average # of collisions / crossing 20 “interaction rate”~10 9 Average # of charged tracks1000 Radiation fieldsevere CMS ParameterValue Level-1 trigger rate100 kHz Mean time between triggers10  sec Trigger latency 3.2  sec Solenoid field4 T

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 26 CMS Silicon Tracker Challenges

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 27 CMS Si Tracker 5.4 m 2,4 m Inner Barrel & Disks (TIB & TID) Pixels Outer Barrel (TOB)

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 28 Lots of Silicon 214m 2 of silicon sensors 11.4 million silicon strips 66 million pixels!

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 29 Si Tracker Challenges Large and complex system 77.4 million total channels (out of a total of 78.2 M for experiment) Detector monitoring, data organization, data quality monitoring, analysis, visualization, interpretation all daunting! Need to monitor every channel and make sure most of the detector is working at all times (live fraction of the detector and efficiencies bound to decrease with time) Need to verify data integrity and data quality for physics Diagnose and fix problems ASAP Keep calibration and alignment parameters current

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 30 Detector/Data Monitoring Monitor Environmental variables Temperatures, coolant flow rates, interlocks, radiation doses Hardware status Voltages, currents Channel Data Readout states, Errors, missing data/channels, bad ID for channel/module  many kinds to be categorized and tracked and displayed  should be able to find rare problems/errors (with low occurrence rate) that may corrupt data Problems (Rare problems may indicate a developing failure mode or hidden bad behavior)  Correlate problem/noisy channels with history, temperature, currents, etc.

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 31 Data Quality Monitoring Monitor Raw Data Pedestals, noise, adc counts, occupancies, efficiencies Processed high level objects Clusters, tracks, etc. Evaluate thousands of histograms Can’t visually examine all Automatically evaluate histograms by comparing to reference histograms Adaptive, efficient, find evolving patterns over time Quantiles? q-q plots/comparison instead of KS test? A variety of 2D “heat” maps Occupancies, #of bad channels/module, #of errors/module, etc. Typical occupancy ~ 2% in strip tracker 200,000 channels written out 100 times/sec

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 32 Module Assembly Precision Example of a “Heat” map

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 33 Need smart approaches What are the best techniques for data-mining? To organize data for analysis and data visualization complex geometry/addressing makes visualization difficult For finding problematic channels quickly, efficiently  clustering, exploratory data-mining For finding anomalies, corrupt data, patterns of behavior  Feature-finding algorithms, superpose many events, time evolution, spatial and temporal correlations Noise Correlations Via correlation coefficients of defined groups Correlate to history (time variations), environmental variables

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 34 Data Visualization Based on hierarchical/geometrical structure of the tracker Display every channel, attach objects/info to each Sub-structures Layers/rings Modules Readout Chips

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 35 Multivariate Analysis Issues Dimensionality Reduction Choosing Variables optimally without losing information Choosing the right method for the problem Controlling Model Complexity Testing Convergence Validation Given a limited sample what is the best way? Computational Efficiency

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 36 Multivariate Analysis Issues Correctness of modeling How do we make sure the multivariate modeling is correct? The data used for training or building PDEs represent reality. Is it sufficient to check the modeling in the mapped variable? Pair-wise correlations? Higher order correlations? How do we show that the background is modeled well? How do we quantify the correctness of modeling? In conventional analysis, we normally look for variables that are well modeled in order to apply cuts How well is the background modeled in the signal region? Worries about hidden bias Worries about underestimating errors

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 37 Sociological Issues We have been conservative in the use of MV methods for discovery. We have been more aggressive in the use of MV methods for setting limits. But discovery is more important and needs all the power you can muster! This is expected to change at LHC.

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 38 Summary The next generation of experiments will need to adopt advanced data mining and data analysis techniques Conventional/routine tasks such as alignment, detector performance and data quality monitoring and data visualization will be challenging and require new approaches Many issues regarding use of multivariate methods of data analysis for discoveries and measurements need to be addressed to make optimal use of data

Pushpa Bhat, Fermilab ACAT 2007 Apr 23-27, Amsterdam 39 MV: Where can we use them? Almost everywhere since HEP events are multivariate Improve several aspects of analysis Event selection Triggering, Real-time Filters, Data Streaming Event reconstruction Tracking/vertexing, particle ID Signal/Background Discrimination Higgs discovery, SUSY discovery, Single top, … Functional Approximation Jet energy corrections, tag rates, fake rates Parameter estimation Top quark mass, Higgs mass, SUSY model parameters Data Exploration Knowledge Discovery via data-mining Data-driven extraction of information, latent structure analysis