Physics Analysis Tools for the CMS experiment at LHC

Slides:



Advertisements
Similar presentations
March 24-28, 2003Computing for High-Energy Physics Configuration Database for BaBar On-line Rainer Bartoldus, Gregory Dubois-Felsmann, Yury Kolomensky,
Advertisements

David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL March 25, 2003 CHEP 2003 Data Analysis Environment and Visualization.
Reconstruction and Analysis on Demand: A Success Story Christopher D. Jones Cornell University, USA.
New muon EF trigger with offline supertools Sergio Grancagnolo INFN Lecce & Salento University.
Usage of the Python Programming Language in the CMS Experiment Rick Wilkinson (Caltech), Benedikt Hegner (CERN) On behalf of CMS Offline & Computing 1.
The Event as an Object-Relational Database: Avoiding the Dependency Nightmare Christopher D. Jones Cornell University, USA.
Automated Tests in NICOS Nightly Control System Alexander Undrus Brookhaven National Laboratory, Upton, NY Software testing is a difficult, time-consuming.
CLEO’s User Centric Data Access System Christopher D. Jones Cornell University.
L3 Filtering: status and plans D  Computing Review Meeting: 9 th May 2002 Terry Wyatt, on behalf of the L3 Algorithms group. For more details of current.
PAT-driven Efficiency Measurements and Calculations A. Singh, Amandeep Singh, S. Beri, Pb. University(Chd.)‏ J. Berryhill, K. Misra FermiLab (U.S)
Data Acquisition Data acquisition (DAQ) basics Connecting Signals Simple DAQ application Computer DAQ Device Terminal Block Cable Sensors.
Energy Flow and Jet Calibration Mark Hodgkinson Artemis Meeting 27 September 2007 Contains work by R.Duxfield,P.Hodgson, M.Hodgkinson,D.Tovey.
Event Data Model in ATLAS Edward Moyse (CERN) On behalf of the ATLAS collaboration CHEP 2004, Interlaken.
David N. Brown Lawrence Berkeley National Lab Representing the BaBar Collaboration The BaBar Mini  BaBar  BaBar’s Data Formats  Design of the Mini 
Event Data History David Adams BNL Atlas Software Week December 2001.
Stuart Wakefield Imperial College London Evolution of BOSS, a tool for job submission and tracking W. Bacchi, G. Codispoti, C. Grandi, INFN Bologna D.
Datasets on the GRID David Adams PPDG All Hands Meeting Catalogs and Datasets session June 11, 2003 BNL.
Introduction Advantages/ disadvantages Code examples Speed Summary Running on the AOD Analysis Platforms 1/11/2007 Andrew Mehta.
19/07/20061 Nectarios Ch. Benekos 1, Rosy Nicolaidou 2, Stathes Paganis 3, Kirill Prokofiev 3 for the collaboration among: 1 Max-Planck-Institut für Physik,
Gaudi Framework Tutorial, April Algorithm Tools: what they are, how to write them, how to use them.
AMB HW LOW LEVEL SIMULATION VS HW OUTPUT G. Volpi, INFN Pisa.
Vertex finding and B-Tagging for the ATLAS Inner Detector A.H. Wildauer Universität Innsbruck CERN ATLAS Computing Group on behalf of the ATLAS collaboration.
CaloTopoCluster Based Energy Flow and the Local Hadron Calibration Mark Hodgkinson June 2009 Hadronic Calibration Workshop.
D. Cavalli, S. Resconi 2 Dec 2008 EtMiss Software updates Jet/EtMiss Meeting D. Cavalli, S. Resconi.
David Adams ATLAS Virtual Data in ATLAS David Adams BNL May 5, 2002 US ATLAS core/grid software meeting.
September 2007CHEP 07 Conference 1 A software framework for Data Quality Monitoring in ATLAS S.Kolos, A.Corso-Radu University of California, Irvine, M.Hauschild.
Artemis School On Calibration and Performance of ATLAS Detectors Jörg Stelzer / David Berge.
David Adams ATLAS DIAL: Distributed Interactive Analysis of Large datasets David Adams BNL August 5, 2002 BNL OMEGA talk.
CBM ECAL simulation status Prokudin Mikhail ITEP.
A New Tool For Measuring Detector Performance in ATLAS ● Arno Straessner – TU Dresden Matthias Schott – CERN on behalf of the ATLAS Collaboration Computing.
05/04/06Predrag Krstonosic - Cambridge True particle flow and performance of recent particle flow algorithms.
Integration of the ATLAS Tag Database with Data Management and Analysis Components Caitriana Nicholson University of Glasgow 3 rd September 2007 CHEP,
Jean-Roch Vlimant, CERN Physics Performance and Dataset Project Physics Data & MC Validation Group McM : The Evolution of PREP. The CMS tool for Monte-Carlo.
Online Monitoring System at KLOE Alessandra Doria INFN - Napoli for the KLOE collaboration CHEP 2000 Padova, 7-11 February 2000 NAPOLI.
Luca Lista Object Oriented Reconstruction Software for the IFR Detector of B A B AR Experiment Luca Lista INFN, Sezione di Napoli for the BaBar Computing.
General requirements for BES III offline & EF selection software Weidong Li.
1 OO Muon Reconstruction in ATLAS Michela Biglietti Univ. of Naples INFN/Naples Atlas offline software MuonSpectrometer reconstruction (Moore) Atlas combined.
The “Comparator” Atlfast vs. Full Reco Automated Comparison Chris Collins-Tooth 19 th February 2006.
28/4/2006Chris Collins-Tooth tth, (h → bb) with EventViews Chris Collins-Tooth, Christian Shaw 03-May-2006.
Electron physics object tutorial C. Charlot / LLR Automn08 tutorials, 14 oct
Muon Persistency Persistent Analysis Objects Muon Persistency Norbert Neumeister µ-PRS meeting February 10, 2004.
Marco Cattaneo, 6-Apr Issues identified in sub-detector OO software reviews Calorimeters:18th February Tracking:24th March Rich:31st March.
Introduction to FCC Software FCC Istanbul 11 March, 2016 Alice Robson (CERN/UNIGE) on behalf of / with thanks to the FCC software group.
ATLAS Physics Analysis Framework James R. Catmore Lancaster University.
David Lange Lawrence Livermore National Laboratory
Analysis Tools interface - configuration Wouter Verkerke Wouter Verkerke, NIKHEF 1.
ATLAS Distributed Computing Tutorial Tags: What, Why, When, Where and How? Mike Kenyon University of Glasgow.
Status of the Higgs to tau tau analysis Carlos Solans Cristobal Cuenca.
Pierre-Antoine Delsart Kurtis L. Geerlings & Joey Huston
Product Training Program
Erik Devetak Oxford University 18/09/2008
CMS High Level Trigger Configuration Management
FCC Software Status Readiness for FCC-ee Physics
ALICE analysis preservation
Tree based validation tool for track reconstruction
slicPandora: slic + pandoraPFANew
OO Muon Reconstruction in ATLAS
Applied Software Implementation & Testing
Vincenzo Innocente CERN/EP/CMC
Linear Collider Simulation Tools
Dtk-tools Benoit Raybaud, Research Software Manager.
Introduction to Data Structure
2 Getting Started.
2 Getting Started.
The ATLAS Computing Model
2 Getting Started.
Linear Collider Simulation Tools
M. Kezunovic (P.I.) S. S. Luo D. Ristanovic Texas A&M University
CMS Software Architecture
Overview Activities from additional UP disciplines are needed to bring a system into being Implementation Testing Deployment Configuration and change management.
Presentation transcript:

Physics Analysis Tools for the CMS experiment at LHC Luca Lista, INFN Napoli Francesco Fabozzi, INFN Napoli Benedikt Hegner, DESY Christopher D. Jones, Cornell

Outline Data Tiers in CMS EDM Analysis Tools Analysis Workflow Luca Lista, CHEP 2007

Main Features of CMS EDM CMS Event Data Model (EDM) is the uniform format for all CMS event data An Event is a container of many “products” of any possible (C++) type Most of the products are collections of objects such as tracks, clusters, particles, … The EDM allows no “C” pointers allowed, and provides custom persistent references Product ID and indices in a collection identify referred objects Persistent and transient data representations are identical (based on ROOT I/O) All EDM data are accessible with ROOT interactively See  Chris Jones’ talk, Event processing session Reflex dictionaries must be provided for all products Luca Lista, CHEP 2007

Data Tiers and Analysis Object Data CMS defines different data tiers containing different levels of details of an event FEVT: full event output, containing (almost…) the complete output of all intermediate reconstruction steps RECO: detailed reconstruction output allowing to apply new calibrations and alignments, and reprocess many of the products AOD: a proper subset of RECO chosen to satisfy the needs of a large fraction of analysis studies Adding or dropping object collections to/from AOD/RECO/FEVT is just a matter of changing a job’s configuration The actual AOD content (and disk size…) is till under definition, it will likely evolve also with data taking Luca Lista, CHEP 2007

Modular Event Products Object collections can be split into different products This allows us to define different levels of details avoiding to store redundant information t Tracks … Kinematics (helix parameters) AOD T TracksExtra … Track extrapolation, references to RecHits RECO TracksHits h h h h h h h h h h h h h h … RecHits Luca Lista, CHEP 2007

Particle Candidates Candidate is a common base class for all high-level physics objects Muons, electrons, photons, jets, missing ET, … inherit from Candidate Can contain references to AOD components, like tracks, clusters, calorimeter towers, … Supports mother(s)daughter(s) navigation in specialized sub-classes Composite particle reconstruction from multi-body decay chains uses specialized Candidates E.g.: Z, HZZee, BsJ/KK, … Event generator tree in AOD is stored using Candidates with mother/daughter references Luca Lista, CHEP 2007

Jet from Heterogeneous Sources CaloTowers Muons Electrons AOD Collections t t t t t t m m m e e e Jet constituents (Candidates) c c c c c c c c c c c c Contain updated kinematics info, so energy corrections can be applied j j j j Jets Multiple Jet collections can have links to the same constituent collection Further energy corrections can be applied Luca Lista, CHEP 2007

Candidates and Associated Data Electron isolation i i i i i Associated collection Standard RECO collection used as “master clone” Electrons e e e e e Electrons clones with reference to master (“shallow” clones) e e e e e e Z Z Z Z candidates Luca Lista, CHEP 2007

Framework modules Reconstruction and analysis code is organized as independent modules steered by the framework A job configuration script defines the modules to be loaded (as plugins), their parameters and their execution order Modules execution sequences are organized into “paths” Each module can get data from the Event and can add new products to the Event Product provenance tracking including module parameters is saved as part of the Event output file Once a product is added to the Event it can’t be changed by another module Modules can act as event filters, stopping the processing path if a condition is not fulfilled E.g.: High Level Trigger paths Luca Lista, CHEP 2007

Available Common Tools Layered approach to common tools: AOD (and RECO…): basic “primitive” objects for analysis Tracks, super-clusters, calo-towers, , e,, jets, MET Mainly data container, no “fancy” C++ structures Generic common tools (for AOD and more) Selectors, filters, lepton isolation, matching tools Particle Candidates Generic class hierarchy to manage particles for analysis Base class for high level objects: , e,, jets, Met, gen-particles, composite decays (Z, J/, Bs, Higgs, …) Particle Candidates common tools Combiners, selectors, filters, overlap removal MC truth matching tools Generic isolation algorithms Constrained fitters (initial integration examples) Event collections Algorithms and modules Luca Lista, CHEP 2007

Generic AOD Framework Modules Uniform interface is enforced throughout AOD classes Everywhere pt(), eta(), phi(), etc. Generic programming is used to write algorithms applicable to different object types A suite of generic selector and filter modules is provided as part of the common Physics Tools More high level algorithms are being written using generic programming Isolation algorithms can run on muons, electrons, tracks, … Luca Lista, CHEP 2007

Generic Object Selectors A selection criteria can generate specialized selectors performing specific actions: Save clones of the selected objects Save references to the selected objects (i.e.: “indices”) Clone the selected objects and all the underlying constituents e.g.: clone selected electrons with clones of tracks and clusters Internal implementation specializations use template traits on the basis of the input and output collection types The simplest object selections can be written as a simple function object (returning a Boolean result) A string-configurable selector functor is provided to parse a configurable string-based cut: string cut = "(pt>10 & abs(eta)<2.5) & normalizedChi2<10" Variable names are mapped to objects methods via Reflex dictionary Luca Lista, CHEP 2007

Generic Selector Examples struct PtMinSelector { PtMinSelector(double ptMin) : ptMin_(ptMin) { } template<typename T> bool operator()(const T& t) const { return t.pt()>=ptMin; } private: double ptMin_; }; typedef SingleObjectSelector< reco::MuonCollection, PtMinSelector> PtMinMuonSelector; typedef SingleObjectSelector< reco::TrackCollection, StringCutObjectSelector<reco::Track> > TrackSelector; typedef SingleObjectSelector< reco::TrackCollection, StringCutObjectSelector<reco::Track>, reco::TrackRefVector> TrackRefSelector; Luca Lista, CHEP 2007

Selector configuration module highPtMuons = PtMinMuonSelector { InputTag src = allMuons double ptMin = 10 } module bestTracks = TrackSelector { InputTag src = allTracks string cut = "pt > 10 & normalizedChi2 < 20" module bestTrackReferences = TrackRefSelector { Luca Lista, CHEP 2007

Common Physics Tools Combinatorial analysis Overlap checking Monte Carlo matching tools Implement navigation to parent to find matching to a composite particle Constrained fitter Examples of integration with external fitting packages exist Covariance matrices (5x5) are fetched from AOD object for vertex fits using tracks Specialized candidate containing error matrices are being developed for the cases where errors are not stored in AOD objects E.g.: jet or photon mass-constrained fits require Ecal and Hcal energy resolutions, retrieved from specialized framework services Luca Lista, CHEP 2007

Example of Combinatorial Search module JPsiCandidates = CandCombiner { string decay = "muonCandidates@+ muonCandidates@-" string cut = "2.8 < mass < 3.4" } module PhiCandidates = CandCombiner { string decay = "trackCandidates@+ trackCandidates@-" string cut = "0.9 < mass < 1.1" module BsCandidates = CandCombiner { string decay = "JPsiCandidates PhiCandidates" string cut = "5.3 < mass < 5.6" Luca Lista, CHEP 2007

Analysis Custom Data Types Analysis Groups can easily define new data types to be added to the Event for analysis The output of a Analysis jobs is fully configurable Needs not always be standard RECO or AOD Analysis “skim” productions run centrally Event pre-selection is performed in central skims New analysis collection can be added to standard AOD (or any other data format) for the events selected by each particular analysis skim Analysis collections can contain either standard or any user-defined type Particle Candidate collections can be added to the Event as analysis output Luca Lista, CHEP 2007

CMS Analysis Work-Flow First pass at Tier0/CAF RAW RECO AOD RECO, AOD shipped at Tier1 Central analysis skims at Tier1 AOD + AOD Analysis algos Analysis skim output shipped at Tier2 Analysis Data Final analysis pre-selection at Tier2 Final samples shipped at Tier3 AOD + Further selection, Reduced output Fewer AOD coll. Analysis Data Analysis Data fast processing and FWLite at Tier3 Luca Lista, CHEP 2007

CMS Analysis Work-Flow First pass at Tier0/CAF Full reprocessing ~ twice a year (?) RAW RECO AOD RECO, AOD shipped at Tier1 Central analysis skims at Tier1 Reprocess central analysis skims every ~3 months (?) AOD + AOD Analysis algos Analysis skim output shipped at Tier2 Analysis Data Final analysis pre-selection at Tier2 Reprocess Tier2 analysis selection every ~2 weeks Final samples shipped at Tier3 AOD + Further selection, Reduced output Fewer AOD coll. Analysis Data Analysis Data fast processing and FWLite at Tier3 Analyze data locally daily with frequent developments Luca Lista, CHEP 2007

Conclusions A flexible event content and a variety of common tools help implement the most commonly required tasks needed for CMS analysis. The organization of data formats and tools is designed to be integrated with CMS analysis workflow running on distributed computing as well as for the final stage of analysis. A realistic exercise of analysis skims using custom data formats containing analysis collections reconstructed with common analysis modules is being put in production Will run in summer and autumn this year. Luca Lista, CHEP 2007

Backup slides

Polymorphism and “Views” Modules can retrieve event products in a type safe way specifying the collection type: Handle<MuonCollection> muons, event.getByLabel(“muons”, muons); Modules can also specify the base class of contained (or referred to) objects via collection “View”: Handle<View<Candidate> > leptons; event.getByLabel(tag, leptons); Both collections of objects and collections of references are supported Product tag, typically part of the configuration Luca Lista, CHEP 2007

Generic Selectors Development The selection criteria definition is decoupled from the technical implementation details of selector module specializations Specific selections are written for alignment and calibration samples by people with no necessary experience with “core” software No explicit definition of cut configuration, reference and clone management is needed in most of the cases The most commonly used framework module are provided as part of the release, need not be explicitly instantiated by users If new modules are needed, most of the users request them centrally rather then instantiating them privately The reuse of common module occurs very naturally Luca Lista, CHEP 2007

Utility Classes vs Modules Many common utilities are provided as framework modules Plugging modules into sequences is easy to do, and module reuse is very simple EDM Provenance mechanism is useful to tack the analysis process A number of tools are also provided as utility class that can be included in “private” modules Framework overhead is reduced Luca Lista, CHEP 2007