Matthew Schwartz Harvard University March 24, Boost 2011.

Slides:



Advertisements
Similar presentations
S.Towers TerraFerMA TerraFerMA A Suite of Multivariate Analysis tools Sherry Towers SUNY-SB Version 1.0 has been released! useable by anyone with access.
Advertisements

Search for Charged Higgs in the Decay of Top Quarks at D0.
STAR Status of J/  Trigger Simulations for d+Au Running Trigger Board Meeting Dec5, 2002 MC & TU.
1 Reinforcement Learning Introduction & Passive Learning Alan Fern * Based in part on slides by Daniel Weld.
Summary of Results and Projected Sensitivity The Lonesome Top Quark Aran Garcia-Bellido, University of Washington Single Top Quark Production By observing.
Searching for Single Top Using Decision Trees G. Watts (UW) For the DØ Collaboration 5/13/2005 – APSNW Particles I.
8. Hypotheses 8.4 Two more things K. Desch – Statistical methods of data analysis SS10 Inclusion of systematic errors LHR methods needs a prediction (from.
Top Turns Ten March 2 nd, Measurement of the Top Quark Mass The Low Bias Template Method using Lepton + jets events Kevin Black, Meenakshi Narain.
Kevin Black Meenakshi Narain Boston University
Jet Reconstruction Algorithms Eric Vazquez PHENIX- Journal Club.
Optimization of Signal Significance by Bagging Decision Trees Ilya Narsky, Caltech presented by Harrison Prosper.
Update on Tools FTK Meeting 06/06/06 Erik Brubaker U of Chicago.
1 Update on Photons Graham W. Wilson Univ. of Kansas 1.More on  0 kinematic fit potential in hadronic events. 2.Further H-matrix studies (with Eric Benavidez).
Higgs Detection Sensitivity from GGF H  WW Hai-Jun Yang University of Michigan, Ann Arbor ATLAS Higgs Meeting October 3, 2008.
Three kinds of learning
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 6 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
Single-Top Cross Section Measurements at ATLAS Patrick Ryan (Michigan State University) Introduction to Single-Top The measurement.
Crash Course on Machine Learning
Optimizing Higgs Analysis at DØ John Sandy – SULI Intern (Texas Tech University) Mentors: Michael P Cooke and Ryuji Yamada (Fermilab National Accelerator.
ROOT: A Data Mining Tool from CERN Arun Tripathi and Ravi Kumar 2008 CAS Ratemaking Seminar on Ratemaking 17 March 2008 Cambridge, Massachusetts.
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #14.
Vector Boson Scattering At High Mass
Genetic Regulatory Network Inference Russell Schwartz Department of Biological Sciences Carnegie Mellon University.
G. Cowan Lectures on Statistical Data Analysis Lecture 7 page 1 Statistical Data Analysis: Lecture 7 1Probability, Bayes’ theorem 2Random variables and.
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
Harrison B. Prosper Workshop on Top Physics, Grenoble Bayesian Statistics in Analysis Harrison B. Prosper Florida State University Workshop on Top Physics:
Michigan REU Final Presentations, August 10, 2006Matt Jachowski 1 Multivariate Analysis, TMVA, and Artificial Neural Networks Matt Jachowski
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
G. Cowan Statistical Methods in Particle Physics1 Statistical Methods in Particle Physics Day 3: Multivariate Methods (II) 清华大学高能物理研究中心 2010 年 4 月 12—16.
Comparison of Bayesian Neural Networks with TMVA classifiers Richa Sharma, Vipin Bhatnagar Panjab University, Chandigarh India-CMS March, 2009 Meeting,
Matthew Schwartz Harvard University with J. Gallicchio, PRL, 105:022001,2010 (superstructure) with K. Black, J. Gallicchio, J. Huth, M. Kagan and B. Tweedie.
Use of Multivariate Analysis (MVA) Technique in Data Analysis Rakshya Khatiwada 08/08/2007.
Alex Melnitchouk DPF conference - May Search for Fermiophobic Higgs in the h   Channel DPF 2002, Williamsburg Alex Melnitchouk (Brown University)
Improving b-jet energy resolution b-resolution mtg Sept. 26, 2007 P. Grannis Improving b-jet energy resolution and raising the di-b jet tag efficiency.
Measurements of Top Quark Properties at Run II of the Tevatron Erich W.Varnes University of Arizona for the CDF and DØ Collaborations International Workshop.
CSE 517 Natural Language Processing Winter 2015
1 Update on tt-bar signal and background simulation Stan Bentvelsen.
Analysis of H  WW  l l Based on Boosted Decision Trees Hai-Jun Yang University of Michigan (with T.S. Dai, X.F. Li, B. Zhou) ATLAS Higgs Meeting September.
1 Introduction to Statistics − Day 2 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Brief catalogue of probability densities.
LECTURE 07: CLASSIFICATION PT. 3 February 15, 2016 SDS 293 Machine Learning.
October 2011 David Toback, Texas A&M University Research Topics Seminar1 David Toback Texas A&M University For the CDF Collaboration CIPANP, June 2012.
Single top quark physics Peter Dong, UCLA on behalf of the CDF and D0 collaborations Les Rencontres de Physique de la Vallee d’Aoste Wednesday, February.
Ignacio Suárez Andrés Universidad de Oviedo XXXV Bienal de la RSEF 13/07/2015.
Search for the Standard Model Higgs in  and  lepton final states P. Grannis, ICHEP 2012 for the DØ Collaboration Tevatron, pp √s = 1.96 TeV -
David Farhi (Harvard University) Work in progress with Ilya Feige, Marat Freytsis, Matthew Schwartz SCET Workshop, 3/27/2014.
One framework for most common MVA-techniques, available in ROOT Have a common platform/interface for all MVA classification and regression-methods: Have.
Single Top Quark Production at D0, L. Li (UC Riverside) EPS 2007, July Liang Li University of California, Riverside On Behalf of the DØ Collaboration.
Moriond 2001Jets at the TeVatron1 QCD: Approaching True Precision or, Latest Jet Results from the TeVatron Experimental Details SubJets and Event Quantities.
1 Review BDT developments Migrate to version 12, AOD BDT for W , Z  AS group.
M. Wolter Optimization of tau identification in ATLAS using multivariate tools On behalf of the ATLAS Collaboration Marcin Wolter Institute.
Helge VossAdvanced Scientific Computing Workshop ETH Multivariate Methods of data analysis Helge Voss Advanced Scientific Computing Workshop ETH.
1 Donatella Lucchesi July 22, 2010 Standard Model High Mass Higgs Searches at CDF Donatella Lucchesi For the CDF Collaboration University and INFN of Padova.
Identification of isolated photons
Multivariate Methods of
More Precision, Less Work
First Evidence for Electroweak Single Top Quark Production
Top Tagging at CLIC 1.4TeV Using Jet Substructure
Study of Gamma+Jets production
Search for WHlnbb at the Tevatron DPF 2009
Multi-dimensional likelihood
10th China HEPS Particle Physics Meeting
ISTEP 2016 Final Project— Project on
VBF H(->bb)+photon
Computing and Statistical Data Analysis Stat 5: Multivariate Methods
Data/ Monte Carlo comparisons for the EMC
Dilepton Mass. Progress report.
Statistical Methods for Data Analysis Multivariate discriminators with TMVA Luca Lista INFN Napoli.
Measurement of the Single Top Production Cross Section at CDF
What is Artificial Intelligence?
Presentation transcript:

Matthew Schwartz Harvard University March 24, Boost 2011

We can think about and visualize single variables Two variables are harder Nobody who thinks in 11 dimensions is in this room! There things that computers are just better at. Multivariate approach lets you figure out how well you could possibly do Save you the trouble or looking for good variables ( project killer ) EFFICIENCY See if simple variables can do as well ( establishes the goal ) FRAMING Sometimes they are really necessary (e.g. ZH) POWER Let me do the work for you!

Lots of methods (all in the TMVA package for root) Boosted Decision Trees Artificial Neural Networks Fischer Discriminants Rectangular cut optimization Projective Likelihood Estimator H-matrix discriminant Predictive learning/Rule ensemble Support Vector Machines K-nearest neighbor … For particle physics, Boosted Decision Trees work best Useful in many areas of science, such as artificial intelligence Easy to understand Train fast Excellent efficiencies

Boosting : train successive trees on misclassified events by enhancing their importance One decision tree Two decision trees

(Aside for Pekka: Maybe one reason jet masses more correlated at high pT)

What about cross sections? Can cuts purify the samples?

Optimize efficiency using BDT classifier with parton momenta as inputs (6 or 9 inputs) Hard cuts on BDT No Cuts Now you’re talking!

 + Look at the best discriminants, ranked by cuts The rapidity of the photon and the rapidity of the second hardest jet look good But cutting on just   or just  J2 does not help much

Contours of Distribution of

BDT results Single variable BDTs led us to the variable, but with the variable we don’t need BDTs

b+2 jets or trijets look promising

This is my favorite part Now try to find a single variable that works as well…

 For quarks, look at gamma + jet  cut on  For gluons  Look at b+2 jets  look at trijets  Cut on

Can get 50% quarks and 4% gluons Need one “count” variable and one “moment” variable Beyond that, probably not worth it

Sometimes one variable is good enough sometimes many variables are needed but

Kinematic variables MVA + 20% Every little bit helps

7 MC insensitive variables 1.Jet mass with R=0.5 2.Jet mass with R=0.4 3.Filtered Jet mass 4.Mass of hardest subjet, from filtering 5.Mass of second hardest subjet, from filtering 6.Ratio of the pT's of the two hardest subjets 7.Planar Flow This is the ultimate goal of W-tagging Challenge: can fewer variables do as well?

For matrix-element level (fixed # dof) BDT much easier to implement than matrix element method MVA kinematic discriminants can be similar to cutting on one smart variable (e.g. in finding pure samples of Quark and Gluon jets) For showered samples (W-tagging, HZ) Subtle correlations make thinking difficult Some discriminants (like pull) don’t work on their own but work well as the 5 th or 6 th variable added to a BDT Sometimes few variables are enough (e.g. Quark vs Glue) Sometimes many variables are needed (e.g. W-tagging) We desperately need data on 2d correlations!! Most useful with pure sampes (quark, glue, hadronic-W, hadronic-top)

Multivariate analysis quickly tells you how well you could possibly do Saves you the trouble or looking for good variables EFFICIENCY See if simple variables can do as well FRAMING Sometimes MVA is really necessary POWER Need to have correlations in real data If 2D distributions agree with the Monte Carlo, the BDTs can be trusted If they don’t agree, we have a great opportunity to improve the MCs Boosted Decision Trees are here to help you Let me do the work for you!