Metadata requirements for HEP Paul Millar
Slide 2 12 September 2007 Metadata requirements for HEP Some of the players in this game... WLCG – Umbrella organisation for three grids: Nordugrid, EGEE (mostly Europe) and OSG. EGEE – Provides grid software, deployed mostly in Europe. Atlas/LHCb/CMS/Alice – the experiments GridPP – UK's contribution to HEP Grid. Metadata group – initiated within GridPP, members from HEP experiments.
Slide 3 12 September 2007 Metadata requirements for HEP Overview of LHC/HEP
Slide 4 12 September 2007 Metadata requirements for HEP View of ATLAS
Slide 5 12 September 2007 Metadata requirements for HEP How we get data...
Slide 6 12 September 2007 Metadata requirements for HEP Simulated 3D collision
Slide 7 12 September 2007 Metadata requirements for HEP Work flows Production: –Raw --> ESD (500kB) --> AOD (100kB) + Tag (1kB) –Raw --> Digi --> DST --> RDST -->... Monte Carlo –Physical model --> Raw --> [ ESD -->... ] Analysis –dataset selection (querying dataset metadata) –dataset creation (AOD) –AOD --> DPD -->...
Slide 8 12 September 2007 Metadata requirements for HEP Conditions database Measurements of physical conditions of the detector –In fact, the conditions of the sub-detectors. Measures “physical” conditions: –Magnetic field, temperature, electrical,... Large! Stored in database, replicated to some sites. Is this metadata?
Slide 9 12 September 2007 Metadata requirements for HEP Different metadata Job metadata –production –analysis Sites “datasets” –location of files of a dataset –provenance system –Physics metadata common to dataset.
Slide September 2007 Metadata requirements for HEP Existing solutions Atlas: AMI, DQ2, prod.sys Amga (LHCb logging and bookkeeping) Alice (Alien) CMS dataset selector. Metadata group: –Case study: “Unlucky for some: the 13 core use- cases in HEP” –Review of metadata Schema. –Help in creating a Metadata Query Language