Implementation of e-ID based on BDT in Athena EgammaRec Hai-Jun Yang University of Michigan, Ann Arbor (with T. Dai, X. Li, A. Wilson, B. Zhou) US-ATLAS.

Slides:



Advertisements
Similar presentations
Tracey Berry1 Looking into e &  for high energy e/  Dr Tracey Berry Royal Holloway.
Advertisements

J. Nielsen1 Measuring Trigger Efficiency Important component of cross section measurement: it is NOT in general 1.0! Need to measure this from data because.
First look at  -jet Samples using BDT e-ID Algorithm Hai-Jun Yang University of Michigan (with X. Li and B. Zhou) ATLAS egamma Meeting March 9, 2009.
INTRODUCTION TO e/ ɣ IN ATLAS In order to acquire the full physics potential of the LHC, the ATLAS electromagnetic calorimeter must be able to identify.
1 N. Davidson E/p single hadron energy scale check with minimum bias events Jet Note 8 Meeting 15 th May 2007.
 Track-First E-flow Algorithm  Analog vs. Digital Energy Resolution for Neutral Hadrons  Towards Track/Cal hit matching  Photon Finding  Plans E-flow.
Higgs Detection Sensitivity from GGF H  WW Hai-Jun Yang University of Michigan, Ann Arbor ATLAS Higgs Meeting October 3, 2008.
1 Hadronic In-Situ Calibration of the ATLAS Detector N. Davidson The University of Melbourne.
1 Andrea Bangert, ATLAS SCT Meeting, Monte Carlo Studies Of Top Quark Pair Production Andrea Bangert, Max Planck Institute of Physics, CSC T6.
1 The CMS Heavy Ion Program Michael Murray Kansas.
Analysis Meeting – April 17 '07 Status and plan update for single hadron scale check with minimum bias events N. Davidson.
In order to acquire the full physics potential of the LHC, the ATLAS electromagnetic calorimeter must be able to efficiently identify photons and electrons.
1 N. Davidson Calibration with low energy single pions Tau Working Group Meeting 23 rd July 2007.
Track Extrapolation/Shower Reconstruction in a Digital HCAL – ANL Approach Steve Magill ANL 1 st step - Track extrapolation thru Cal – substitute for Cal.
Using Track based missing Et tools to reject fake MET background Zhijun Liang,Song-Ming Wang,Dong liu, Rachid Mazini Academia Sinica 8/28/20151 TWiki page.
Tau Jet Identification in Charged Higgs Search Monoranjan Guchait TIFR, Mumbai India-CMS collaboration meeting th March,2009 University of Delhi.
Lepton efficiency & fake rate Yousuke Kataoka University of Tokyo Content definitions of leptons p2 efficiency and fake rate for SU3 ( ) p3, p4.
B-tagging Performance based on Boosted Decision Trees Hai-Jun Yang University of Michigan (with X. Li and B. Zhou) ATLAS B-tagging Meeting February 9,
A few slides to summarise what Alessandro and I were up to for March 24th video meeting Taking for granted that W+/- are good measurements to make- are.
LHC France 2013, 3 rd April ATLAS results on inclusive top quark pair production cross section in dilepton channel Frédéric Derue, LPNHE Paris Rencontres.
Possibility of tan  measurement with in CMS Majid Hashemi CERN, CMS IPM,Tehran,Iran QCD and Hadronic Interactions, March 2005, La Thuile, Italy.
Valeria Perez Reale University of Bern On behalf of the ATLAS Physics and Event Selection Architecture Group 1 ATLAS Physics Workshop Athens, May
1 c. mills (Harvard U.) 20 September, 2010 W and Z Physics at ATLAS Corrinne Mills Harvard DOE Site Visit 20 September 2010.
A Study of Electron Identification Jim Branson UCSD with collaborators from FNAL, UCSB & UCSD.
María Cepeda (CIEMAT, Madrid) Valencia, II CPAN days 1.
25 sep Reconstruction and Identification of Hadronic Decays of Taus using the CMS Detector Michele Pioppi – CERN On behalf.
Measurement of the branching ratios for Standard Model Higgs decays into muon pairs and into Z boson pairs at 1.4 TeV CLIC Gordana Milutinovic-Dumbelovic,
1 Silke Duensing DØ Analysis Status NIKHEF Annual Scientific Meeting Analysing first D0 data  Real Data with:  Jets  Missing Et  Electrons 
FIMCMS, 26 May, 2008 S. Lehti HIP Charged Higgs Project Preparative Analysis for Background Measurements with Data R.Kinnunen, M. Kortelainen, S. Lehti,
DPF2000, 8/9-12/00 p. 1Richard E. Hughes, The Ohio State UniversityHiggs Searches in Run II at CDF Prospects for Higgs Searches at CDF in Run II DPF2000.
Tth study Lepton ID with BDT 2015/02/27 Yuji Sudo Kyushu University 1.
CALOR April Algorithms for the DØ Calorimeter Sophie Trincaz-Duvoid LPNHE – PARIS VI for the DØ collaboration  Calorimeter short description.
Software offline tutorial, CERN, Dec 7 th Electrons and photons in ATHENA Frédéric DERUE – LPNHE Paris ATLAS offline software tutorial Detectors.
Thibault Guillemin LAPP, Annecy, France W and Z total cross sections measurements ATLAS-LAPP & LAPTH – Japan meeting, 21/01/08.
Study on search of a SM Higgs (120GeV) produced via VBF and decaying in two hadronic taus V.Cavasinni, F.Sarri, I.Vivarelli.
E. Soldatov Tight photon efficiency study using radiative Z decays (update) E.Yu.Soldatov 1, 1 National Research Nuclear University “MEPhI” Outline:
Analysis of H  WW  l l Based on Boosted Decision Trees Hai-Jun Yang University of Michigan (with T.S. Dai, X.F. Li, B. Zhou) ATLAS Higgs Meeting September.
B-tagging based on Boosted Decision Trees
Electron Calibration and Performance ( ) N. Benekos (MPI), R. Nikolaidou (Saclay), S. Paganis (Sheffield) Contributions from: A. Farbin (CERN) +
VBF H- > ττ- > lept had Electron Channel Jaspreet Sidhu ATLAS Toronto meeting
Electron physics object tutorial C. Charlot / LLR Automn08 tutorials, 14 oct
A search for the ZZ signal in the 3 lepton channel Azeddine Kasmi Robert Kehoe Southern Methodist University Thanks to: H. Ma, M. Aharrouche.
Trigger study on photon slice Yuan Li Feb 27 th, 2009 LPNHE ATLAS group meeting.
Régis Lefèvre (LPC Clermont-Ferrand - France)ATLAS Physics Workshop - Lund - September 2001 In situ jet energy calibration General considerations The different.
10 January 2008Neil Collins - University of Birmingham 1 Tau Trigger Performance Neil Collins ATLAS UK Physics Meeting Thursday 10 th January 2008.
Search for the Standard Model Higgs in  and  lepton final states P. Grannis, ICHEP 2012 for the DØ Collaboration Tevatron, pp √s = 1.96 TeV -
C.Clement Eta RegionTraining regionVariable set |  |
Search for H  WW*  l l Based on Boosted Decision Trees Hai-Jun Yang University of Michigan LHC Physics Signature Workshop January 5-11, 2008.
1 UCSD Meeting Calibration of High Pt Hadronic W Haifeng Pi 10/16/2007 Outline Introduction High Pt Hadronic W in TTbar and Higgs events Reconstruction.
Electron Identification Efficiency from Z→ee Maria Fiascaris University of Oxford In collaboration with Tony Weidberg and Lucia di Ciaccio ATLAS UK SM.
Atautau  lh Carlos Solans TileCal Valencia meeting 20th September 2007.
Current Status of e-ID based on BDT Algorithm Hai-Jun Yang University of Michigan (with X. Li, T. Dai, A. Wilson, B. Zhou) BNL Analysis Jamboree March.
 reconstruction and identification in CMS A.Nikitenko, Imperial College. LHC Days in Split 1.
VBF H- > ττ- > lept had Electron Channel Analysis Jaspreet Sidhu
Photon purity measurement on JF17 Di jet sample using Direct photon working Group ntuple Z.Liang (Academia Sinica,TaiWan) 6/24/20161.
1 Review BDT developments Migrate to version 12, AOD BDT for W , Z  AS group.
XLIX International Winter Meeting on Nuclear Physics January 2011 Bormio, Italy G. Cattani, on behalf of the ATLAS Collaboration Measurement of.
ATLAS results on inclusive top quark pair
Venkat Kaushik, Jae Yu University of Texas at Arlington
Study of Gamma+Jets production
NIKHEF / Universiteit van Amsterdam
Electron Identification Based on Boosted Decision Trees
Update of Electron Identification Performance Based on BDTs
Implementation of e-ID based on BDT in Athena EgammaRec
Top Quark Production at the Large Hadron Collider
Missing Energy and Tau-Lepton Reconstruction in ATLAS
Plans for checking hadronic energy
 discrimination with converted photons
Performance of BDTs for Electron Identification
Semileptonic ttH(Hbb) studies at UCL
Presentation transcript:

Implementation of e-ID based on BDT in Athena EgammaRec Hai-Jun Yang University of Michigan, Ann Arbor (with T. Dai, X. Li, A. Wilson, B. Zhou) US-ATLAS Egamma Meeting November 20, 2008

2Electron ID with BDT2 Motivation Lepton (e,  ) Identification is crucial for new physics discoveries at the LHC, such as H  ZZ  4 leptons, H  WW  2 leptons + MET etc. ATLAS default electron-ID (IsEM) has relatively low efficiency (~67%), which has significant impact on ATLAS early discovery potential in H  WW, ZZ detection with electron final states. It is important and also feasible to improve e-ID efficiency and to reduce jet fake rate by making full use of available variables using BDT.

3 Electron ID Studies with BDT Select electrons in two steps 1) Pre-selection: an EM cluster matching a track 2) Apply electron ID based on pre-selected samples with different e-ID algorithms (IsEM, Likelihood ratio, AdaBoost and EBoost). New BDT e-ID development at U. Michigan (Rel. v12) –H. Yang’s talk at US-ATLAS Jamboree on Sept. 10, New BDT e-ID (EBoost) based on Rel. v13 –H. Yang’s talk at ATLAS performance and physics workshop at CERN on Oct. 2, Implementation of EBoost in EgammaRec (Rel. v14) Electron ID with BDT3

4 4 Electrons W  e MC Electrons Electrons after Pre-selection

5Electron ID with BDT5 Electron Pre-selection Efficiency W  e The inefficiency mainly due to track matching

66 Variables Used for BDT e-ID (EBoost) The same variables for IsEM are used  egammaPID::ClusterHadronicLeakage fraction of transverse energy in TileCal 1 st sampling  egammaPID::ClusterMiddleSampling Ratio of energies in 3*7 & 7*7 window Ratio of energies in 3*3 & 7*7 window Shower width in LAr 2 nd sampling Energy in LAr 2 nd sampling  egammaPID::ClusterFirstSampling Fraction of energy deposited in 1 st sampling Delta Emax2 in LAr 1 st sampling Emax2-Emin in LAr 1 st sampling Total shower width in LAr 1 st sampling Shower width in LAr 1 st sampling Fside in LAr 1 st sampling  egammaPID::TrackHitsA0 B-layer hits, Pixel-layer hits, Precision hits Transverse impact parameter  egammaPID::TrackTRT Ratio of high threshold and all TRT hits  egammaPID::TrackMatchAndEoP Delta eta between Track and egamma Delta phi between Track and egamma E/P – egamma energy and Track momentum ratio  Track Eta and EM Eta  Electron isolation variables: Number of tracks (  R=0.3) Sum of track momentum (  R=0.3) Ratio of energy in  R= and  R=0.45

7Electron ID with BDT7 BDT e-ID (EBoost) Training (v13) BDT multivariate pattern recognition technique: –[ H. Yang et. al., NIM A555 (2005) ] BDT e-ID training signal and backgrounds (jet faked e) –W  e as electron signal (DS 5104, v13) –Di-jet samples (J0-J6), Pt=[8-1120] GeV (DS , v13) BDT e-ID training procedure –Event weight training based on background cross sections [ H. Yang et. al., JINST 3 P04004 (2008) ] –Apply additional cuts on the training samples to select hardly identified jet faked electron as background for BDT training to make the BDT training more effective. –Apply additional event weight to high P T backgrounds to effective reduce the jet fake rate at high P T region.

8 Implementation of BDT Trees in EgammaRec Package and Test E-ID based on BDT has been implemented into egammaRec ( ) package (private). We run through the whole reconstruction package based on v to test the BDT e-ID. RDO Digitized raw data Reconstruction with egammaRec Rel. V AOD CBNT (*Ele_BDT) egammaPID::EBoost

9 E-ID Testing Samples (v13) Wenu: DS5104 (Eff_precuts = 89.1%) –46554 electrons with Et>10 GeV, |  |<2.5 –41457 electrons after pre-selection cuts JF17: DS5802 (Eff_precuts = 7.7%) – events, jets – jets after pre-selection

10 Comparison of e-ID Algorithms (v13)  IsEM (tight) Eff = 65.7% jet fake rate = 6.9E-4  Likelihood Ratio (>6.5) Eff = 78.5% jet fake rate = 3.7E-4  AdaBoost (>6) Eff = 79.8% jet fake rate = 2.8E-4  EBoost (>100) Eff = 84.3% jet fake rate = 1.9E-4

11 E-ID Testing Samples (v14) Wenu: DS (Eff_precuts = 86.9%) – events, electrons – electrons with Et>10GeV, |  |<2.5 – electrons with pre-selection cuts JF17: DS (Eff_precuts = 8%) – events, jets –With pre-selection, jets

12 E-ID Discriminators (v13 vs v14)

13 Comparison of e-ID Algorithms (v14)  IsEM (tight) Eff = 68.7% jet fake rate = 1.1E-3  Likelihood Ratio (>6.5) Eff = 70.9% jet fake rate = 4.6E-4  AdaBoost (>6) Eff = 73% jet fake rate = 2.9E-4  EBoost (>100) Eff = 80% jet fake rate = 1.9E-4

14 Overall E-ID Efficiency and Jet Fake Rates (v13 vs. v14) Test MCPrecutsIsEM(tight)LH>6.5AdaBoost > 6 EBoost > 100 W  e (v13) 89.1%65.7%78.5%79.8%84.3% W  e (v14) Eff. change 86.9% -2.2% 68.7% +3% 70.9% -7.6% 73.0% -6.8% 80.0% -4.3% JF17 (v13)7.7E-26.9E-43.7E-42.8E-41.9E-4 JF17 (v14) Relative change 8.0E-2 +4% 11E-4 +59% 4.6E-4 +24% 2.9E % 1.9E-4 0

15 E-ID Efficiency vs Pt (v14) EBoost AdaBoost IsEM Likelihood

16 E-ID Efficiency vs  (v14) EBoostAdaBoost IsEM Likelihood

17 Future Plan We have requested to add EBoost in ATLAS official egammaRec package and make EBoost discriminator variable available for physics analysis. We will provide EBoost trees to ATLAS egammaRec for each major software release Explore new variables and BDT training techniques to further improve the e-ID performance

18 Backup Slides

19 Jet Fake Rate (v14) EBoost AdaBoost IsEM Likelihood

20 List of Variables for BDT 1.Ratio of Et(  R= ) / Et(  R=0.2) 2.Number of tracks in  R=0.3 cone 3.Energy leakage to hadronic calorimeter 4.EM shower shape E237 / E  between inner track and EM cluster 6.Ratio of high threshold and all TRT hits 7.Number of pixel hits and SCT hits  between track and EM cluster 9.Emax2 – Emin in LAr 1 st sampling 10.Number of B layer hits 11.Number of TRT hits 12.Emax2 in LAr 1 st sampling 13.EoverP – ratio of EM energy and track momentum 14.Number of pixel hits 15.Fraction of energy deposited in LAr 1 st sampling 16.Et in LAr 2nd sampling  of EM cluster 18.D0 – transverse impact parameter 19.EM shower shape E233 / E Shower width in LAr 2 nd sampling 21.Fracs1 – ratio of (E7strips-E3strips)/E7strips in LAr 1 st sampling 22.Sum of track Pt in DR=0.3 cone 23.Total shower width in LAr 1 st sampling 24.Shower width in LAr 1 st sampling

21 Energy Leakage in HCal EM Shower Shape in ECal EM Shower shape distributions of discriminating Variables (signal vs. background)

22 ECal and Inner Track Match  of  EM Cluster & Track E/P Ratio of EM Cluster EP

23 Electron Isolation Variables N trk around Electron Track E T (  R= )/E T of EM

24Electron ID with BDT24 Example: H  WW  l l Studies [ H. Yang et.al., ATL-COM-PHYS ] At least one lepton pair (ee, , e  ) with P T > 10 GeV, |  |<2.5 Missing E T > 20 GeV, max(P T (l),P T (l)) > 25 GeV |M ee – M z | > 10 GeV, |M  – M z | > 15 GeV to suppress background from Z  ee,  Used ATLAS electron ID: IsEM & 0x7FF == 0

25 Comparison of e-ID Algorithms (v14)  IsEM (tight) Eff = 70.2% jet fake rate = 1.1E-3  Likelihood Ratio (>6.5) Eff = 73.4% jet fake rate = 4.6E-4  AdaBoost (>6) Eff = 74.2% jet fake rate = 2.9E-4  EBoost (>100) Eff = 81.1% jet fake rate = 1.9E-4

26Electron ID with BDT26 Signal Pre-selection: MC electrons MC True electron from W  e by requiring –  e  10 GeV (N e ) Match MC e/  to EM cluster: –  R<0.2 and 0.5 < E T rec / E T true < 1.5 (N EM ) Match EM cluster with an inner track: –eg_trkmatchnt > -1 (N EM/track ) Pre-selection Efficiency = N EM/Track / N e

27Electron ID with BDT27 Pre-selection of Jet Faked Electrons Count number of jets with –  jet  10 GeV (N jet ) Loop over all EM clusters; each cluster matches with a jet – E T EM > 10 GeV (N EM ) Match EM cluster with an inner track: –eg_trkmatchnt > -1 (N EM/track ) Pre-selection Acceptance = N EM/Track / N jet

28 Comparisons of v13 and v14

29 Comparisons of v13 and v14