Implementation of e-ID based on BDT in Athena EgammaRec

Slides:



Advertisements
Similar presentations
Tracey Berry1 Looking into e &  for high energy e/  Dr Tracey Berry Royal Holloway.
Advertisements

J. Nielsen1 Measuring Trigger Efficiency Important component of cross section measurement: it is NOT in general 1.0! Need to measure this from data because.
First look at  -jet Samples using BDT e-ID Algorithm Hai-Jun Yang University of Michigan (with X. Li and B. Zhou) ATLAS egamma Meeting March 9, 2009.
INTRODUCTION TO e/ ɣ IN ATLAS In order to acquire the full physics potential of the LHC, the ATLAS electromagnetic calorimeter must be able to identify.
Implementation of e-ID based on BDT in Athena EgammaRec Hai-Jun Yang University of Michigan, Ann Arbor (with T. Dai, X. Li, A. Wilson, B. Zhou) US-ATLAS.
1 N. Davidson E/p single hadron energy scale check with minimum bias events Jet Note 8 Meeting 15 th May 2007.
 Track-First E-flow Algorithm  Analog vs. Digital Energy Resolution for Neutral Hadrons  Towards Track/Cal hit matching  Photon Finding  Plans E-flow.
Higgs Detection Sensitivity from GGF H  WW Hai-Jun Yang University of Michigan, Ann Arbor ATLAS Higgs Meeting October 3, 2008.
1 N. Davidson E/p minimum bias update with Athena Analysis Meeting 12 th June 2007.
1 N. Davidson E/p minimum bias update with Athena Jet Note 8 Meeting 7 th June 2007.
1 Andrea Bangert, ATLAS SCT Meeting, Monte Carlo Studies Of Top Quark Pair Production Andrea Bangert, Max Planck Institute of Physics, CSC T6.
1 The CMS Heavy Ion Program Michael Murray Kansas.
Progress on B-tagging Efficiency Monica Dunford May 22 nd, 2007.
Analysis Meeting – April 17 '07 Status and plan update for single hadron scale check with minimum bias events N. Davidson.
1 N. Davidson Calibration with low energy single pions Tau Working Group Meeting 23 rd July 2007.
Using Track based missing Et tools to reject fake MET background Zhijun Liang,Song-Ming Wang,Dong liu, Rachid Mazini Academia Sinica 8/28/20151 TWiki page.
Tau Jet Identification in Charged Higgs Search Monoranjan Guchait TIFR, Mumbai India-CMS collaboration meeting th March,2009 University of Delhi.
Lepton efficiency & fake rate Yousuke Kataoka University of Tokyo Content definitions of leptons p2 efficiency and fake rate for SU3 ( ) p3, p4.
B-tagging Performance based on Boosted Decision Trees Hai-Jun Yang University of Michigan (with X. Li and B. Zhou) ATLAS B-tagging Meeting February 9,
A few slides to summarise what Alessandro and I were up to for March 24th video meeting Taking for granted that W+/- are good measurements to make- are.
Possibility of tan  measurement with in CMS Majid Hashemi CERN, CMS IPM,Tehran,Iran QCD and Hadronic Interactions, March 2005, La Thuile, Italy.
Valeria Perez Reale University of Bern On behalf of the ATLAS Physics and Event Selection Architecture Group 1 ATLAS Physics Workshop Athens, May
1 c. mills (Harvard U.) 20 September, 2010 W and Z Physics at ATLAS Corrinne Mills Harvard DOE Site Visit 20 September 2010.
María Cepeda (CIEMAT, Madrid) Valencia, II CPAN days 1.
25 sep Reconstruction and Identification of Hadronic Decays of Taus using the CMS Detector Michele Pioppi – CERN On behalf.
Measurement of the branching ratios for Standard Model Higgs decays into muon pairs and into Z boson pairs at 1.4 TeV CLIC Gordana Milutinovic-Dumbelovic,
1 Silke Duensing DØ Analysis Status NIKHEF Annual Scientific Meeting Analysing first D0 data  Real Data with:  Jets  Missing Et  Electrons 
Tth study Lepton ID with BDT 2015/02/27 Yuji Sudo Kyushu University 1.
CALOR April Algorithms for the DØ Calorimeter Sophie Trincaz-Duvoid LPNHE – PARIS VI for the DØ collaboration  Calorimeter short description.
Software offline tutorial, CERN, Dec 7 th Electrons and photons in ATHENA Frédéric DERUE – LPNHE Paris ATLAS offline software tutorial Detectors.
Study on search of a SM Higgs (120GeV) produced via VBF and decaying in two hadronic taus V.Cavasinni, F.Sarri, I.Vivarelli.
E. Soldatov Tight photon efficiency study using radiative Z decays (update) E.Yu.Soldatov 1, 1 National Research Nuclear University “MEPhI” Outline:
Analysis of H  WW  l l Based on Boosted Decision Trees Hai-Jun Yang University of Michigan (with T.S. Dai, X.F. Li, B. Zhou) ATLAS Higgs Meeting September.
B-tagging based on Boosted Decision Trees
Calice Meeting Argonne Muon identification with the hadron calorimeter Nicola D’Ascenzo.
A search for the ZZ signal in the 3 lepton channel Azeddine Kasmi Robert Kehoe Southern Methodist University Thanks to: H. Ma, M. Aharrouche.
Trigger study on photon slice Yuan Li Feb 27 th, 2009 LPNHE ATLAS group meeting.
C.Clement Eta RegionTraining regionVariable set |  |
Search for H  WW*  l l Based on Boosted Decision Trees Hai-Jun Yang University of Michigan LHC Physics Signature Workshop January 5-11, 2008.
Electron Identification Efficiency from Z→ee Maria Fiascaris University of Oxford In collaboration with Tony Weidberg and Lucia di Ciaccio ATLAS UK SM.
Atautau  lh Carlos Solans TileCal Valencia meeting 20th September 2007.
Current Status of e-ID based on BDT Algorithm Hai-Jun Yang University of Michigan (with X. Li, T. Dai, A. Wilson, B. Zhou) BNL Analysis Jamboree March.
 reconstruction and identification in CMS A.Nikitenko, Imperial College. LHC Days in Split 1.
VBF H- > ττ- > lept had Electron Channel Analysis Jaspreet Sidhu
Photon purity measurement on JF17 Di jet sample using Direct photon working Group ntuple Z.Liang (Academia Sinica,TaiWan) 6/24/20161.
1 Review BDT developments Migrate to version 12, AOD BDT for W , Z  AS group.
XLIX International Winter Meeting on Nuclear Physics January 2011 Bormio, Italy G. Cattani, on behalf of the ATLAS Collaboration Measurement of.
ATLAS results on inclusive top quark pair
Some introduction Cosmics events can produce energetic jets and missing energy. They need to be discriminated from collision events with true MET and jets.
Jet Energy Scale and Calibration Framework
Particle detection and reconstruction at the LHC (IV)
Measurement of the γ,W,Z with ATLAS for the ATLAS Collaboration
Venkat Kaushik, Jae Yu University of Texas at Arlington
Study of Gamma+Jets production
Hyeon Jin Kim March 31, 2006 DOSAR
May 2006 Hadronic Calibration Workshop Jet Session at Munich
NIKHEF / Universiteit van Amsterdam
Electron Identification Based on Boosted Decision Trees
Update of Electron Identification Performance Based on BDTs
Top Quark Production at the Large Hadron Collider
Missing Energy and Tau-Lepton Reconstruction in ATLAS
Plans for checking hadronic energy
Observation and measurement of HW+W-
 discrimination with converted photons
ZZ→llnn Analysis Thomas Barber University of Cambridge
Julie Kirk Rutherford Appleton Laboratory
Performance of BDTs for Electron Identification
SUSY particles searches with R-parity violation at DØ, Tevatron
Semileptonic ttH(Hbb) studies at UCL
Susan Burke, University of Arizona
Presentation transcript:

Implementation of e-ID based on BDT in Athena EgammaRec Hai-Jun Yang University of Michigan, Ann Arbor (with T. Dai, X. Li, A. Wilson, B. Zhou) ATLAS HSG3 Meeting November 19, 2008

Motivation Lepton (e, m, t) Identification is crucial for new physics discoveries at the LHC, such as H ZZ4 leptons, HWW 2 leptons + MET etc. ATLAS default electron-ID (IsEM) has relatively low efficiency (~67%), which has significant impact on ATLAS early discovery potential in HWW, ZZ detection with electron final states. It is important and also feasible to improve e-ID efficiency and to reduce jet fake rate by making full use of available variables using BDT. Electron ID with BDT 2 2

Electron ID Studies with BDT Select electrons in two steps 1) Pre-selection: an EM cluster matching a track 2) Apply electron ID based on pre-selected samples with different e-ID algorithms (IsEM, Likelihood ratio, AdaBoost and EBoost). New BDT e-ID development at U. Michigan (Rel. v12) H. Yang’s talk at US-ATLAS Jamboree on Sept. 10, 2008 http://indico.cern.ch/conferenceDisplay.py?confId=38991 New BDT e-ID (EBoost) based on Rel. v13 H. Yang’s talk at ATLAS performance and physics workshop at CERN on Oct. 2, 2008 http://indico.cern.ch/conferenceDisplay.py?confId=39296 Implementation of EBoost in EgammaRec (Rel. v14) Electron ID with BDT 3

Electrons W en MC Electrons Electrons after Pre-selection Electron ID with BDT 4 4

Electron Pre-selection Efficiency The inefficiency mainly due to track matching W en Electron ID with BDT 5 5

Variables Used for BDT e-ID (EBoost) The same variables for IsEM are used egammaPID::ClusterHadronicLeakage fraction of transverse energy in TileCal 1st sampling egammaPID::ClusterMiddleSampling Ratio of energies in 3*7 & 7*7 window Ratio of energies in 3*3 & 7*7 window Shower width in LAr 2nd sampling Energy in LAr 2nd sampling egammaPID::ClusterFirstSampling Fraction of energy deposited in 1st sampling Delta Emax2 in LAr 1st sampling Emax2-Emin in LAr 1st sampling Total shower width in LAr 1st sampling Shower width in LAr 1st sampling Fside in LAr 1st sampling egammaPID::TrackHitsA0 B-layer hits, Pixel-layer hits, Precision hits Transverse impact parameter egammaPID::TrackTRT Ratio of high threshold and all TRT hits egammaPID::TrackMatchAndEoP Delta eta between Track and egamma Delta phi between Track and egamma E/P – egamma energy and Track momentum ratio Track Eta and EM Eta Electron isolation variables: Number of tracks (DR=0.3) Sum of track momentum (DR=0.3) Ratio of energy in DR=0.2-0.45 and DR=0.45 6 6

BDT e-ID (EBoost) Training (v13) BDT multivariate pattern recognition technique: [ H. Yang et. al., NIM A555 (2005) 370-385 ] BDT e-ID training signal and backgrounds (jet faked e) Wen as electron signal (DS 5104, v13) Di-jet samples (J0-J6), Pt=[8-1120] GeV (DS 5009-5015, v13) BDT e-ID training procedure Event weight training based on background cross sections [ H. Yang et. al., JINST 3 P04004 (2008) ] Apply additional cuts on the training samples to select hardly identified jet faked electron as background for BDT training to make the BDT training more effective. Apply additional event weight to high PT backgrounds to effective reduce the jet fake rate at high PT region. Electron ID with BDT 7 7

Implementation of BDT Trees in EgammaRec Package and Test E-ID based on BDT has been implemented into egammaRec (04-02-98) package (private). We run through the whole reconstruction package based on v14.2.22 to test the BDT e-ID. AOD RDO Digitized raw data Reconstruction with egammaRec Rel. V14.2.22 (*EleAOD_BDT) CBNT (*Ele_BDT)

E-ID Testing Samples (v13) Wenu: DS5104 (Eff_precuts = 89.1%) 46554 electrons with Et>10 GeV, |h|<2.5 41457 electrons after pre-selection cuts JF17: DS5802 (Eff_precuts = 7.7%) 14560093 jets with Et>10 GeV, |h|<2.5 1123231 jets after pre-selection

Comparison of e-ID Algorithms (v13) IsEM (tight) Eff = 65.7% jet fake rate = 6.9E-4 Likelihood Ratio (>6.5) Eff = 78.5% jet fake rate = 3.7E-4 AdaBoost (>6) Eff = 79.8% jet fake rate = 2.8E-4 EBoost (>100) Eff = 84.3% jet fake rate = 1.9E-4

E-ID Testing Samples (v14) Wenu: DS106020 (Eff_precuts = 86.9%) 173930 events, 173822 electrons 130589 electrons with Et>10GeV, |h|<2.5 113500 electrons with pre-selection cuts JF17: DS105802 (Eff_precuts = 8%) 475900 events, 1793636 jets With pre-selection, 143167 jets

E-ID Discriminators (v13 vs v14)

Comparison of e-ID Algorithms (v14) IsEM (tight) Eff = 68.7% jet fake rate = 1.1E-3 Likelihood Ratio (>6.5) Eff = 70.9% jet fake rate = 4.6E-4 AdaBoost (>6) Eff = 73% jet fake rate = 2.9E-4 EBoost (>100) Eff = 80% jet fake rate = 1.9E-4

Overall E-ID Efficiency and Jet Fake Rates (v13 vs. v14) Test MC Precuts IsEM(tight) LH>6.5 AdaBoost > 6 EBoost > 100 Wen (v13) 89.1% 65.7% 78.5% 79.8% 84.3% Wen (v14) 86.9% 68.7% 70.9% 73.0% 80.0% JF17 (v13) 7.7E-2 6.9E-4 3.7E-4 2.8E-4 1.9E-4 JF17 (v14) 8.0E-2 11E-4 4.6E-4 2.9E-4

E-ID Efficiency vs Pt (v14) EBoost IsEM AdaBoost Likelihood

E-ID Efficiency vs h (v14) EBoost AdaBoost IsEM Likelihood

Future Plan We have requested to add EBoost in ATLAS official egammaRec package and make EBoost discriminator variable available for physics analysis. We will provide EBoost trees to ATLAS egammaRec for each major software release Explore new variables and BDT training techniques to further improve the e-ID performance

Backup Slides

List of Variables for BDT Ratio of Et(DR=0.2-0.45) / Et(DR=0.2) Number of tracks in DR=0.3 cone Energy leakage to hadronic calorimeter EM shower shape E237 / E277 Dh between inner track and EM cluster Ratio of high threshold and all TRT hits Number of pixel hits and SCT hits Df between track and EM cluster Emax2 – Emin in LAr 1st sampling Number of B layer hits Number of TRT hits Emax2 in LAr 1st sampling EoverP – ratio of EM energy and track momentum Number of pixel hits Fraction of energy deposited in LAr 1st sampling Et in LAr 2nd sampling h of EM cluster D0 – transverse impact parameter EM shower shape E233 / E277 Shower width in LAr 2nd sampling Fracs1 – ratio of (E7strips-E3strips)/E7strips in LAr 1st sampling Sum of track Pt in DR=0.3 cone Total shower width in LAr 1st sampling Shower width in LAr 1st sampling 19 19

EM Shower shape distributions of discriminating Variables (signal vs EM Shower shape distributions of discriminating Variables (signal vs. background) EM Shower Shape in ECal Energy Leakage in HCal 20 20

ECal and Inner Track Match P E E/P Ratio of EM Cluster Dh of EM Cluster & Track 21 21

Electron Isolation Variables Ntrk around Electron Track ET(DR=0.2-0.45)/ET of EM 22 22

Example: H WW lnln Studies [ H. Yang et.al., ATL-COM-PHYS-2008-023 ] At least one lepton pair (ee, mm, em) with PT > 10 GeV, ||<2.5 Missing ET > 20 GeV, max(PT (l) ,PT(l)) > 25 GeV |Mee – Mz| > 10 GeV, |Mmm – Mz| > 15 GeV to suppress background from Z  ee, mm Used ATLAS electron ID: IsEM & 0x7FF == 0 Electron ID with BDT 23 23 23 23

Comparison of e-ID Algorithms (v14) IsEM (tight) Eff = 70.2% jet fake rate = 1.1E-3 Likelihood Ratio (>6.5) Eff = 73.4% jet fake rate = 4.6E-4 AdaBoost (>6) Eff = 74.2% jet fake rate = 2.9E-4 EBoost (>100) Eff = 81.1% jet fake rate = 1.9E-4

Signal Pre-selection: MC electrons MC True electron from Wen by requiring |he| < 2.5 and ETtrue>10 GeV (Ne) Match MC e/g to EM cluster: DR<0.2 and 0.5 < ETrec / ETtrue< 1.5 (NEM) Match EM cluster with an inner track: eg_trkmatchnt > -1 (NEM/track) Pre-selection Efficiency = NEM/Track / Ne Electron ID with BDT 25 25

Pre-selection of Jet Faked Electrons Count number of jets with |hjet| < 2.5, ETjet >10 GeV (Njet) Loop over all EM clusters; each cluster matches with a jet ETEM > 10 GeV (NEM) Match EM cluster with an inner track: eg_trkmatchnt > -1 (NEM/track) Pre-selection Acceptance = NEM/Track / Njet Electron ID with BDT 26 26

Comparisons of v13 and v14

Comparisons of v13 and v14