Markus Oldenburg: ALICE metadata. GridPP Metadata Workshop, July 4–7 2006, Oxford University.

Presentation transcript:

Slide 1: ALICE metadata
Markus Oldenburg
GridPP Metadata Workshop, July 4–7 2006, Oxford University

Slide 2: Overview
AliEn and aliensh
File Catalogue
  – structure
  – path/file name definition
Additional Run and File Level Metadata
Event Level Metadata
‘Working’ Example
Summary and Outlook

Slide 3: AliEn and aliensh
AliEn (Alice Environment)
  – distributed computing environment for ALICE
  – core services in Perl
  – provides
      – Database interface (MySQL)
      – File Catalogue
      – Metadata Catalogue
      – other services…
aliensh
  – provides commands to access AliEn GRID computing resources and the AliEn virtual file system
  – bash-like behaviour
  – interactive, single-command, or script execution
      – informative + convenience commands (whoami, less, …)
      – virtual file catalogue + data management commands (cp, rm, find, …)
      – TaskQueue/job management commands (submit, ps, kill, …)

Slide 4: Structure of the File Catalogue
File Catalogue
  – acts as and looks like a ‘File System’
  – doesn’t own the files, just associates logical file names (LFN) with physical locations/physical file names (PFN)
  – MySQL database
      – each virtual directory is represented by one table
      – subdirectories are connected to directories by sub-table entries
      – LFNs (base names) are represented as entries in directory tables
These entries hold the name (LFN) and the PFN.
PFN contains
  – protocol: how to access the data
  – host: where to find the data
  – access port
  – directory entry (= file name)
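The LFN-to-PFN association and the four PFN components can be sketched in a few lines of Python. This is only an illustration: the slides do not give the PFN syntax, so a URL-like form is assumed here, and the host name, port, storage path, directory name and the `parse_pfn`/`resolve` helpers are all hypothetical. A plain dictionary stands in for the MySQL directory tables.

```python
from dataclasses import dataclass
from urllib.parse import urlsplit


@dataclass(frozen=True)
class PFN:
    protocol: str  # how to access the data
    host: str      # where to find the data
    port: int      # access port
    entry: str     # directory entry (= file name) on the storage host


def parse_pfn(url: str) -> PFN:
    """Split a URL-like PFN (assumed syntax) into its four components."""
    parts = urlsplit(url)
    if not parts.scheme or parts.hostname is None or parts.port is None:
        raise ValueError(f"incomplete PFN: {url!r}")
    return PFN(parts.scheme, parts.hostname, parts.port, parts.path)


# Stand-in for the directory tables: one dict per virtual directory,
# keyed by LFN base name. All values are made up for illustration.
catalogue = {
    "/alice/sim/2006/test/1/": {
        "0001.AliESD.root": "root://se.example.org:1094/store/ab/cd/0001.AliESD.root",
    },
}


def resolve(lfn: str) -> PFN:
    """Look up the PFN registered for a full logical file name."""
    directory, _, base = lfn.rpartition("/")
    return parse_pfn(catalogue[directory + "/"][base])


pfn = resolve("/alice/sim/2006/test/1/0001.AliESD.root")
print(pfn.protocol, pfn.host, pfn.port, pfn.entry)
```

The catalogue never touches the file itself; it only stores where the file can be reached, which is what lets it behave like a file system without owning the data.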

Slide 5: Pathname Definitions
for real data
  /alice/data/‹Year›/‹AcceleratorPeriod›/‹RunNumber›/
for simulated data
  /alice/sim/‹Year›/‹ProductionType›/‹RunNumber›/
subdirectories:
  – for raw data: raw/
  – for links to calibration and condition files: reco/‹PassX›/cond/
  – for ESD and corresponding tag files: reco/‹PassX›/ESD
  – for AOD files: reco/‹PassX›/AOD

Slide 6: Filename Definition
for ESD files: ‹xxxx›.AliESD.root
similar for all other files (except for condition files)
a tool will be provided to generate ‘meaningful’ file names if somebody wants to make a local copy
files to be registered in the file catalogue
  – raw data files,
  – AliESD.root files,
  – AliESDfriends.root files, and
  – ESDtags.root files

Slide 7: Run and File Level Metadata
Metadata Catalogue
  – additional tables can be attached to each ‘directory’/table of the MySQL database → metadata
  – directory structure (grouping of ‘similar’ files) allows for
      – reduction of (additional) metadata for a given directory
      – enhancement of search performance
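The idea of attaching metadata tables to a directory table can be illustrated with a small relational sketch. This is not the AliEn schema: SQLite from the Python standard library stands in for MySQL, and every table name, column name and value below is an assumption made for the example.

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.executescript("""
    -- one table per virtual directory: LFN base name -> PFN
    CREATE TABLE dir_esd (lfn TEXT PRIMARY KEY, pfn TEXT NOT NULL);
    -- file level metadata attached to that directory, keyed by the same LFN
    CREATE TABLE dir_esd_file_meta (
        lfn TEXT PRIMARY KEY REFERENCES dir_esd(lfn),
        file_sanity TEXT NOT NULL
    );
    -- run level metadata: a single row for the whole directory
    CREATE TABLE dir_esd_run_meta (collision_system TEXT, run_type TEXT);
""")
con.executemany("INSERT INTO dir_esd VALUES (?, ?)", [
    ("0001.AliESD.root", "root://se.example.org:1094/store/0001"),
    ("0002.AliESD.root", "root://se.example.org:1094/store/0002"),
])
con.executemany("INSERT INTO dir_esd_file_meta VALUES (?, ?)", [
    ("0001.AliESD.root", "available"),
    ("0002.AliESD.root", "not available"),
])
con.execute("INSERT INTO dir_esd_run_meta VALUES ('pp', 'physics')")


def available_files(system: str) -> list[str]:
    """LFNs in this directory that are available and belong to a run of `system`."""
    rows = con.execute(
        """
        SELECT d.lfn FROM dir_esd AS d
        JOIN dir_esd_file_meta AS f ON f.lfn = d.lfn
        CROSS JOIN dir_esd_run_meta AS r
        WHERE r.collision_system = ? AND f.file_sanity = 'available'
        ORDER BY d.lfn
        """,
        (system,),
    )
    return [lfn for (lfn,) in rows]


print(available_files("pp"))
```

Since all files of a run share one directory, the run level values are stored once rather than per file, and a search only has to touch the tables of directories whose path already matches.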

Slide 8: MetaData Overview I
Run
  run comment
  run type
    – physics, laser, pulser, pedestal, simulation
  run start time
  run stop time
  run stop reason
    – Normal, beam loss, detector failure, …
  magnetic field setting
    – FullField, ReversedField, ZeroField, HalfField
  collision system
    – PbPb, pp, pPb, …
  collision energy
  trigger setup name
  detectors present in this run
  # of events in this run
  run sanity
File
  file sanity flag (“online/offline”, “available/not available”)
Event
  event id
  centrality
  multiplicity
    – an array for different detectors?
  luminosity
  magnetic field value
  trigger condition
  detectors with data in this event
  mean pT
  max pT
  # of protons
  # of pions
  # of strange particles
  # of pos. charges
  # of neg. charges
  # of …
  event sanity
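As a rough illustration, part of the run level field list could be expressed as a typed record in Python. The enumerated values are taken from the slide; the field names, the chosen types and the sample values are assumptions, and only a subset of the listed fields is shown.

```python
from dataclasses import dataclass
from datetime import datetime
from enum import Enum


class RunType(Enum):
    PHYSICS = "physics"
    LASER = "laser"
    PULSER = "pulser"
    PEDESTAL = "pedestal"
    SIMULATION = "simulation"


class FieldSetting(Enum):
    FULL = "FullField"
    REVERSED = "ReversedField"
    ZERO = "ZeroField"
    HALF = "HalfField"


@dataclass
class RunMetadata:
    comment: str
    run_type: RunType
    start_time: datetime
    stop_time: datetime
    stop_reason: str        # e.g. "Normal", "beam loss", "detector failure"
    field_setting: FieldSetting
    collision_system: str   # e.g. "PbPb", "pp", "pPb"
    n_events: int

    def duration_s(self) -> float:
        """Run length in seconds, derived from start and stop time."""
        return (self.stop_time - self.start_time).total_seconds()


# Made-up values, for illustration only.
run = RunMetadata(
    comment="illustrative values only",
    run_type=RunType("physics"),
    start_time=datetime(2007, 1, 1, 10, 0, 0),
    stop_time=datetime(2007, 1, 1, 10, 20, 33),
    stop_reason="Normal",
    field_setting=FieldSetting("FullField"),
    collision_system="pp",
    n_events=1000,
)
print(run.run_type, run.duration_s())
```

Using enumerations for fields with a closed value list means a misspelled run type or field setting is rejected when the record is created instead of silently failing to match a later query.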

Slide 9: MetaData Overview II
for produced events
  production tag
  production software library version
for simulation
  generator
  generator version
  generator comments
  generator parameters
  detector geometry
  detector configuration
  simulation comments
[Slide labels: Run, Event, File]
All this is additional information to what is stored in the path name!

Slide 10: Event Level Metadata
raw data is processed right after data taking
some physical quantities will be extracted right away
  – multiplicity
  – vertex position
  – …
each file containing physics events gets an additional file containing this event level metadata ‘attached’ → ESDtags file
  – ROOT file
  – stored in the same directory as the physics data file
content can be extended later (or each user can even create his/her own tag files)

Slide 11: Event Level Metadata Creation/Selection
[Diagram (P. Christakoglou). Box labels: RECONSTRUCTION, POST PROCESS, INDEX BUILDER, BITMAP INDICES, ANALYSIS CODE, PROOF/AliEn. Connection labels, each appearing twice: QUERY, LIST OF EVENTS GROUPED BY GUID.]
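The diagram names bitmap indices and a query that returns a list of events grouped by GUID. A toy Python version of that idea follows; it is not the ALICE index builder. Python integers are used as bitmaps, and the attribute names, trigger names and GUIDs are invented for the example.

```python
from collections import defaultdict


def build_bitmap_index(events, attribute):
    """Map each attribute value to an int whose bit i is set if event i has that value."""
    index = defaultdict(int)
    for position, event in enumerate(events):
        index[event[attribute]] |= 1 << position
    return dict(index)


def select(events, bitmap):
    """Return the ids of events whose bit is set, grouped by file GUID."""
    grouped = defaultdict(list)
    for position, event in enumerate(events):
        if (bitmap >> position) & 1:
            grouped[event["guid"]].append(event["event_id"])
    return dict(grouped)


# Hypothetical tag records, one per event.
events = [
    {"guid": "guid-A", "event_id": 0, "trigger": "MB", "vertex_ok": True},
    {"guid": "guid-A", "event_id": 1, "trigger": "MB", "vertex_ok": False},
    {"guid": "guid-B", "event_id": 0, "trigger": "MUON", "vertex_ok": True},
    {"guid": "guid-B", "event_id": 1, "trigger": "MB", "vertex_ok": True},
]

trigger_index = build_bitmap_index(events, "trigger")
vertex_index = build_bitmap_index(events, "vertex_ok")

# A query combining two conditions is a single bitwise AND of two bitmaps.
hits = trigger_index["MB"] & vertex_index[True]
print(select(events, hits))
```

The index is built once after reconstruction; each later query only combines precomputed bitmaps, and the analysis code opens just the files whose GUID appears in the result.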

Slide 12: Working Example
user wants to analyse
  – AliESDs
  – pp collisions
  – taken on 19. and , before 10:20:33 h
  – …
$ find -x pp /alice/data/2007/LHC07a/*/reco/Pass3/*AliESDs.root Run:collision_system="pp" and Run:stop " " > pp.xml
the events should meet the following additional specifications
  – properly reconstructed vertex
  – vertex z position in between ±1 cm
  – …
Loop over list of events grouped by GUID/file for the file collection specified by ‘pp.xml’.
[Slide labels: Run, Event]
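The two selection stages of this example (run level first, event level second) can be sketched in Python. This does not read ‘pp.xml’ or ESDtags files, whose formats are not given in the slides. Plain dictionaries with invented GUIDs and field names stand in for both, and treating the ±1 cm bounds as exclusive is an assumption.

```python
def select_files(files, collision_system):
    """Run level stage: keep the GUIDs of files whose run metadata matches."""
    return [f["guid"] for f in files if f["collision_system"] == collision_system]


def select_events(tags_by_guid, guids, max_abs_vz_cm=1.0):
    """Event level stage: per selected file, keep events with a properly
    reconstructed vertex and |z| below the cut (exclusive bound assumed)."""
    selected = {}
    for guid in guids:
        good = [
            tag["event_id"]
            for tag in tags_by_guid.get(guid, [])
            if tag["vertex_reconstructed"] and abs(tag["vz_cm"]) < max_abs_vz_cm
        ]
        if good:
            selected[guid] = good
    return selected


# Hypothetical stand-ins for the file collection and the event tags.
files = [
    {"guid": "guid-A", "collision_system": "pp"},
    {"guid": "guid-B", "collision_system": "PbPb"},
    {"guid": "guid-C", "collision_system": "pp"},
]
tags_by_guid = {
    "guid-A": [
        {"event_id": 0, "vertex_reconstructed": True, "vz_cm": 0.3},
        {"event_id": 1, "vertex_reconstructed": True, "vz_cm": -2.5},
    ],
    "guid-B": [{"event_id": 0, "vertex_reconstructed": True, "vz_cm": 0.0}],
    "guid-C": [{"event_id": 0, "vertex_reconstructed": False, "vz_cm": 0.1}],
}

pp_files = select_files(files, "pp")
print(select_events(tags_by_guid, pp_files))
```

The run level stage prunes whole files using catalogue metadata alone, so the event level cuts only run over tags of files that can contain matching events.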

Slide 13: Summary and Outlook
System is fully set up and functional:
  – File Catalogue (with defined directory structure) exists and works
  – run and file level Metadata Catalogue (data fields) is defined and exists
  – event level metadata is defined, index builder is functional
  – all stages were tested and work properly
But…
  – no large-scale tests yet
  – many tables/catalogues not filled yet (at least not automatically)
  – not enough simulation data to effectively stress test the system
Currently
  – large test production running
  – we start adding output files automatically to the file catalogue
  – overall system performance to be seen…