B-tagging Performance based on Boosted Decision Trees Hai-Jun Yang University of Michigan (with X. Li and B. Zhou) ATLAS B-tagging Meeting February 9,

Slides:



Advertisements
Similar presentations
NA-ATLAS mtg. Experience from the Tevatron – Chlebana FNAL.
Advertisements

First look at  -jet Samples using BDT e-ID Algorithm Hai-Jun Yang University of Michigan (with X. Li and B. Zhou) ATLAS egamma Meeting March 9, 2009.
Effect of b-tagging Scale Factors on M bb invariant mass distribution Ricardo Gonçalo.
1 B-tagging meeting overview Li bo Shandong University.
Implementation of e-ID based on BDT in Athena EgammaRec Hai-Jun Yang University of Michigan, Ann Arbor (with T. Dai, X. Li, A. Wilson, B. Zhou) US-ATLAS.
Searching for Single Top Using Decision Trees G. Watts (UW) For the DØ Collaboration 5/13/2005 – APSNW Particles I.
September 27, 2005FAKT 2005, ViennaSlide 1 B-Tagging and ttH, H → bb Analysis on Fully Simulated Events in the ATLAS Experiment A.H. Wildauer Universität.
Top Turns Ten March 2 nd, Measurement of the Top Quark Mass The Low Bias Template Method using Lepton + jets events Kevin Black, Meenakshi Narain.
Kevin Black Meenakshi Narain Boston University
Ensemble Learning: An Introduction
Optimization of Signal Significance by Bagging Decision Trees Ilya Narsky, Caltech presented by Harrison Prosper.
Introduction to Single-Top Single-Top Cross Section Measurements at ATLAS Patrick Ryan (Michigan State University) The measurement.
Higgs Detection Sensitivity from GGF H  WW Hai-Jun Yang University of Michigan, Ann Arbor ATLAS Higgs Meeting October 3, 2008.
1 Andrea Bangert, ATLAS SCT Meeting, Monte Carlo Studies Of Top Quark Pair Production Andrea Bangert, Max Planck Institute of Physics, CSC T6.
Boosted Decision Trees, a Powerful Event Classifier
Single-Top Cross Section Measurements at ATLAS Patrick Ryan (Michigan State University) Introduction to Single-Top The measurement.
Chris Barnes, Imperial CollegeWIN 2005 B mixing at DØ B mixing at DØ WIN 2005 Delphi, Greece Chris Barnes, Imperial College.
Vector Boson Scattering At High Mass
Tau Jet Identification in Charged Higgs Search Monoranjan Guchait TIFR, Mumbai India-CMS collaboration meeting th March,2009 University of Delhi.
E. Devetak - LCWS t-tbar analysis at SiD Erik Devetak Oxford University LCWS /11/2008 Flavour tagging for ttbar Hadronic ttbar events ID.
1 g g s Richard E. Hughes The Ohio State University for The CDF and D0 Collaborations Low Mass SM Higgs Search at the Tevatron hunting....
Search for Z’  tt at LHC with the ATLAS Detector Hai-Jun Yang University of Michigan, Ann Arbor US-ATLAS Jamboree September 9, 2008.
G. Cowan Statistical Methods in Particle Physics1 Statistical Methods in Particle Physics Day 3: Multivariate Methods (II) 清华大学高能物理研究中心 2010 年 4 月 12—16.
Sung-Won Lee 1 Study of Hadronic W Decays in the Jets + MET Final LHC Kittikul Kovitanggoon * & Sung-Won Lee Texas Tech University Michael Weinberger.
WW and WZ Analysis Based on Boosted Decision Trees Hai-Jun Yang University of Michigan (contributed from Tiesheng Dai, Alan Wilson, Zhengguo Zhao, Bing.
MiniBooNE Event Reconstruction and Particle Identification Hai-Jun Yang University of Michigan, Ann Arbor (for the MiniBooNE Collaboration) DNP06, Nashville,
August 30, 2006 CAT physics meeting Calibration of b-tagging at Tevatron 1. A Secondary Vertex Tagger 2. Primary and secondary vertex reconstruction 3.
HERA-LHC, CERN Oct Preliminary study of Z+b in ATLAS /1 A preliminary study of Z+b production in ATLAS The D0 measurement of  (Z+b)/  (Z+jet)
Training of Boosted DecisionTrees Helge Voss (MPI–K, Heidelberg) MVA Workshop, CERN, July 10, 2009.
Possibility of tan  measurement with in CMS Majid Hashemi CERN, CMS IPM,Tehran,Iran QCD and Hadronic Interactions, March 2005, La Thuile, Italy.
LCFI Package and Flavour 3TeV Tomáš Laštovička Institute of Physics AS CR CLIC WG3 Meeting 9/6/2010.
Measurement of b-tagging Fake rates in Atlas Data M. Saleem * In collaboration with Alexandre Khanov** F. Raztidinova**, P.Skubic * *University of Oklahoma,
25 sep Reconstruction and Identification of Hadronic Decays of Taus using the CMS Detector Michele Pioppi – CERN On behalf.
B-Tagging Algorithms for CMS Physics
DPF2000, 8/9-12/00 p. 1Richard E. Hughes, The Ohio State UniversityHiggs Searches in Run II at CDF Prospects for Higgs Searches at CDF in Run II DPF2000.
Marcel Vreeswijk (NIKHEF) B tagging, performance vertexing Neural Net studies tt event selection mass reconstruction in tt events conclusions B tagging.
Tth study Lepton ID with BDT 2015/02/27 Yuji Sudo Kyushu University 1.
Benchmarking Tracking & Vertexing Andrei Nomerotski (Oxford) SiD Tracking Meeting, 7 December 2007.
Jet Tagging Studies at TeV LC Tomáš Laštovička, University of Oxford Linear Collider Physics/Detector Meeting 14/9/2009 CERN.
Kalanand Mishra BaBar Coll. Meeting February, /8 Development of New Kaon Selectors Kalanand Mishra University of Cincinnati.
By Henry Brown Henry Brown, LHCb, IOP 10/04/13 1.
G. Cowan IDPASC School of Flavour Physics, Valencia, 2-7 May 2013 / Statistical Analysis Tools 1 Statistical Analysis Tools for Particle Physics IDPASC.
Analysis of H  WW  l l Based on Boosted Decision Trees Hai-Jun Yang University of Michigan (with T.S. Dai, X.F. Li, B. Zhou) ATLAS Higgs Meeting September.
B-tagging based on Boosted Decision Trees
Classification and Regression Trees
E. Devetak - IOP 081 Anomalous – from tools to physics Erik Devetak Oxford - RAL IOP 2008 Lancaster‏ Anomalous coupling (Motivation – Theory)
B-Tagging Algorithms at the CMS Experiment Gavril Giurgiu (for the CMS Collaboration) Johns Hopkins University DPF-APS Meeting, August 10, 2011 Brown University,
Jessica Levêque Rencontres de Moriond QCD 2006 Page 1 Measurement of Top Quark Properties at the TeVatron Jessica Levêque University of Arizona on behalf.
Search for H  WW*  l l Based on Boosted Decision Trees Hai-Jun Yang University of Michigan LHC Physics Signature Workshop January 5-11, 2008.
Single Top Quark Production at D0, L. Li (UC Riverside) EPS 2007, July Liang Li University of California, Riverside On Behalf of the DØ Collaboration.
Current Status of e-ID based on BDT Algorithm Hai-Jun Yang University of Michigan (with X. Li, T. Dai, A. Wilson, B. Zhou) BNL Analysis Jamboree March.
Studies of the Higgs Boson at the Tevatron Koji Sato On Behalf of CDF and D0 Collaborations 25th Rencontres de Blois Chateau Royal de Blois, May 29, 2013.
Marcello Barisonzi First results of AOD analysis on t-channel Single Top DAD 30/05/2005 Page 1 AOD on Single Top Marcello Barisonzi.
VHF working meeting, 4 Oct Measurement of associated charm production in W final states at  s=7TeV J. Alcaraz, I. Josa, J. Santaolalla (CIEMAT,
Search for Standard Model Higgs in ZH  l + l  bb channel at DØ Shaohua Fu Fermilab For the DØ Collaboration DPF 2006, Oct. 29 – Nov. 3 Honolulu, Hawaii.
Eric COGNERAS LPC Clermont-Ferrand Prospects for Top pair resonance searches in ATLAS Workshop on Top Physics october 2007, Grenoble.
Study of the electron identification algorithms in TRD Andrey Lebedev 1,3, Semen Lebedev 2,3, Gennady Ososkov 3 1 Frankfurt University, 2 Giessen University.
Study of Diboson Physics with the ATLAS Detector at LHC Hai-Jun Yang University of Michigan (for the ATLAS Collaboration) APS April Meeting St. Louis,
Exploring Artificial Neural Networks to Discover Higgs at LHC Using Neural Networks for B-tagging By Rohan Adur
WW and WZ Analysis Based on Boosted Decision Trees
Top Tagging at CLIC 1.4TeV Using Jet Substructure
B Tagging Efficiency and Mistag Rate Measurement in ATLAS
MiniBooNE Event Reconstruction and Particle Identification
Jet Identification in QCD Samples at DØ
Electron Identification Based on Boosted Decision Trees
Update of Electron Identification Performance Based on BDTs
Implementation of e-ID based on BDT in Athena EgammaRec
b-tagging commissioning strategy for ATLAS
Performance of BDTs for Electron Identification
Northern Illinois University / NICADD
Presentation transcript:

B-tagging Performance based on Boosted Decision Trees Hai-Jun Yang University of Michigan (with X. Li and B. Zhou) ATLAS B-tagging Meeting February 9, 2009

H. Yang - BDT B-tagging2 Motivation To evaluate the performance of Boosted Decision Trees by combining the information from different ATLAS b-tagging algorithms into a single discriminator for jet classification Ref: J. Bastos, ATL-PHYS-PUB To improve the b-tagging performance

H. Yang - BDT B-tagging3 Data Samples Ttbar (DS5200, 387K) based on release v13. Preselection cuts: Et (jet) > 15 GeV, |  |<2.5 Jets Total JetsFor trainingFor testing Total No. of Jets B jets (33.8%) C jets ( 7.3%)  jets ( 4.4%) Light jets(54.5%)

H. Yang - BDT B-tagging4 19 Variables for BDT Training IP2D – jet weight from transverse impact parameters IP3D – jet weight from 3D impact parameters SV1 – jet weigh from secondary vertices SV2 – jet weight from secondary vertices

H. Yang - BDT B-tagging5 Input Variables lhsig – combination of jet weights IP1D, IP2D & SVBU weight – combination of jet weights IP3D and SV1 softm – soft muon based tagger softe – soft electron based tagger

H. Yang - BDT B-tagging6 Input Variables mass – mass of particles which participate in vertex fit efrc – energy ratio between particles in vertex and in jet N2t – No. of 2-track vertices Ntrk(vertex) – number of tracks in vertex

H. Yang - BDT B-tagging7 Input Variables Largest IP2D – largest transverse IP significance of tracks in the jet Largest IP3D – largest longitudinal IP significance of tracks in the jet Largest Pt – largest transverse momentum of tracks in the jet Ntrk(jet) – track multiplicity in jet

H. Yang - BDT B-tagging8 Input Variables Et(jet) – Et of jet Eta(jet) –  of jet Phi(jet) –  of jet

H. Yang - BDT B-tagging9 “A procedure that combines many weak classifiers to form a powerful committee” Boosted Decision Trees H. Yang et.al. NIM A555 (2005)370, NIM A543 (2005)577, NIM A574(2007) 342  Relatively new in HEP – MiniBooNE, BaBar, D0(single top discovery), ATLAS  Advantages: robust, understand ‘powerful’ variables, relatively transparent, … BDT Training Process Split data recursively based on input variables until a stopping criterion is reached (e.g. purity, too few events) Every event ends up in a “signal” or a “background” leaf Misclassified events will be given larger weight in the next decision tree (boosting)

H. Yang - BDT B-tagging10 A set of decision trees can be developed, each re-weighting the events to enhance identification of backgrounds misidentified by earlier trees (“boosting”) For each tree, the data event is assigned +1 if it is identified as signal, - 1 if it is identified as background. The total for all trees is combined into a “score” negativepositiveBackground-likesignal-like BDT discriminator

H. Yang - BDT B-tagging11 B-tagging Weights vs BDT Discriminator

H. Yang - BDT B-tagging12 Discriminating Power of Input Variables RankDescription of Input VariableGini Index(%) 1 IP3D + SV % 2 Ntrk(jet) – number of tracks in jet 6.96% 3 Largest trans. IP significance of tracks in jet 2.41% 4 Largest Pt(trk) – largest Pt of tracks in jet 2.28% 5 Softm – soft muon based tagger 1.24% 6 Efrc – e ratio of particles in vertex & in jet 0.79% 7 Mass – mass of particles used in vertex fit 0.63% 8 Et(jet) – transverse energy of jet 0.54% 9 IP2D – jet weight from transverse IP 0.44% 10 IP3D – jet weight from 3D IP 0.28%

H. Yang - BDT B-tagging13 Results: B-jets vs Light-jets Rejection (BDT) Rejection (IP3D+SV1) 1.42

H. Yang - BDT B-tagging14 B-jet Eff. vs P T

H. Yang - BDT B-tagging15 Light-jet Rejection vs 

H. Yang - BDT B-tagging16 Results: B-jets vs C-jets Rejection (BDT) Rejection (IP3D+SV1) 1.36

H. Yang - BDT B-tagging17 C-jets Rejection vs P T, 

H. Yang - BDT B-tagging18 Results: B-jets vs  -jets Rejection (BDT) Rejection (IP3D+SV1) 6.3

H. Yang - BDT B-tagging19  -jets Rejection vs P T, 

H. Yang - BDT B-tagging20 Summary and Future Plan BDT works better than IP3D+SV1 by combining several existing b-tagging weights. The light jet rejection is improved by 40%-60% for wide b-tagging efficiency range of 30%-80%. For 60% b-tagging efficiency, c jet and  jet rejection are improved by 1.36 and 6.3, respectively. Considering more discriminating variables which may help for b-tagging, eg. signed transverse impact parameter, individual track probability, jet probability / mass / width, number of tracks with certain decay length, 2D/3D decay length. Using v14 MC samples (  s = 10 TeV) to evaluate b- tagging performance.

H. Yang - BDT B-tagging21 Backup Slides for BDT

H. Yang - BDT B-tagging22 Criterion for “ Best ” Tree Split Purity, P, is the fraction of the weight of a node (leaf) due to signal events. Gini Index: Note that Gini index is 0 for all signal or all background. The criterion is to minimize Gini_left_node+ Gini_right_node.

H. Yang - BDT B-tagging23 Criterion for Next Node to Split Pick the node to maximize the change in Gini index. Criterion = Gini parent_node – Gini right_child_node – Gini left_child_node We can use Gini index contribution of tree split variables to sort the importance of input variables. We can also sort the importance of input variables based on how often they are used as tree splitters.

H. Yang - BDT B-tagging24 Signal and Background Leaves Assume an equal weight of signal and background training events. If event weight of signal is larger than ½ of the total weight of a leaf, it is a signal leaf; otherwise it is a background leaf. Signal events on a background leaf or background events on a signal leaf are misclassified events.

H. Yang - BDT B-tagging25 How to Boost Decision Trees ?  For each tree iteration, same set of training events are used but the weights of misclassified events in previous iteration are increased (boosted). Events with higher weights have larger impact on Gini index values and Criterion values. The use of boosted weights for misclassified events makes them possible to be correctly classified in succeeding trees.  Typically, one generates several hundred to thousand trees until the performance is optimal.  The score of a testing event is assigned as follows: If it lands on a signal leaf, it is given a score of 1; otherwise -1. The sum of scores (weighted) from all trees is the final score of the event.

H. Yang - BDT B-tagging26 Two Boosting Algorithms I = 1, if a training event is misclassified; Otherwise, I = 0

H. Yang - BDT B-tagging27 Example AdaBoost: the weight of misclassified events is increased by –error rate=0.1 and  = 0.5,  m = 1.1, exp(1.1) = 3 –error rate=0.4 and  = 0.5,  m = 0.203, exp(0.203) = –Weight of a misclassified event is multiplied by a large factor which depends on the error rate.  boost: the weight of misclassified events is increased by –If  exp(2*0.01) = 1.02 –If  exp(2*0.04) = –It changes event weight a little at a time.  AdaBoost converges faster than  -boost. However, the performance of AdaBoost and  boost are very comparable with sufficient tree iterations.