The ZEUS Event Store An object-oriented tag database for physics analysis Adrian Fox-Murphy, DESY CHEP2000, Padova.

Slides:



Advertisements
Similar presentations
Object Persistency & Data Handling Session C - Summary Object Persistency & Data Handling Session C - Summary Dirk Duellmann.
Advertisements

Michigan State University 4/15/ Simulation for Level 2 James T. Linnemann Michigan State University NIU Triggering Workshop October 17, 1997.
CHEP 2012 – New York City 1.  LHC Delivers bunch crossing at 40MHz  LHCb reduces the rate with a two level trigger system: ◦ First Level (L0) – Hardware.
Grid and CDB Janusz Martyniak, Imperial College London MICE CM37 Analysis, Software and Reconstruction.
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL March 25, 2003 CHEP 2003 Data Analysis Environment and Visualization.
Reconstruction and Analysis on Demand: A Success Story Christopher D. Jones Cornell University, USA.
ACAT 2002, Moscow June 24-28thJ. Hernández. DESY-Zeuthen1 Offline Mass Data Processing using Online Computing Resources at HERA-B José Hernández DESY-Zeuthen.
The Event as an Object-Relational Database: Avoiding the Dependency Nightmare Christopher D. Jones Cornell University, USA.
Zeus Monday Meeting, DESY, The ZEUS Event Store (ZES) How to make the most efficient data selection in ZEUS Ulrich Fricke, DESY, Hamburg.
L3 Filtering: status and plans D  Computing Review Meeting: 9 th May 2002 Terry Wyatt, on behalf of the L3 Algorithms group. For more details of current.
2/10/2000 CHEP2000 Padova Italy The BaBar Online Databases George Zioulas SLAC For the BaBar Computing Group.
A Linux PC Farm for Physics Analysis at the ZEUS Experiment Marek Kowal, Krzysztof Wrona, Tobias Haas, Ingo Martens, Rainer Mankel DESY, Notkestrasse 85,
25 February 2000Tim Adye1 Using an Object Oriented Database to Store BaBar's Terabytes Tim Adye Particle Physics Department Rutherford Appleton Laboratory.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
Designing a HEP Experiment Control System, Lessons to be Learned From 10 Years Evolution and Operation of the DELPHI Experiment. André Augustinus 8 February.
Introduzione al Software di CMS N. Amapane. Nicola AmapaneTorino, Aprile Outline CMS Software projects The framework: overview Finding more.
03/27/2003CHEP20031 Remote Operation of a Monte Carlo Production Farm Using Globus Dirk Hufnagel, Teela Pulliam, Thomas Allmendinger, Klaus Honscheid (Ohio.
Introduction to Database Management. 1-2 Outline  Database characteristics  DBMS features  Architectures  Organizational roles.
D0 Farms 1 D0 Run II Farms M. Diesburg, B.Alcorn, J.Bakken, T.Dawson, D.Fagan, J.Fromm, K.Genser, L.Giacchetti, D.Holmgren, T.Jones, T.Levshina, L.Lueking,
Farm Management D. Andreotti 1), A. Crescente 2), A. Dorigo 2), F. Galeazzi 2), M. Marzolla 3), M. Morandin 2), F.
EGEE is a project funded by the European Union under contract IST HEP Use Cases for Grid Computing J. A. Templon Undecided (NIKHEF) Grid Tutorial,
1 GCA Application in STAR GCA Collaboration Grand Challenge Architecture and its Interface to STAR Sasha Vaniachine presenting for the Grand Challenge.
Introduction CMS database workshop 23 rd to 25 th of February 2004 Frank Glege.
Databases in CMS Conditions DB workshop 8 th /9 th December 2003 Frank Glege.
CIS/SUSL1 Fundamentals of DBMS S.V. Priyan Head/Department of Computing & Information Systems.
The PHysics Analysis SERver Project (PHASER) CHEP 2000 Padova, Italy February 7-11, 2000 M. Bowen, G. Landsberg, and R. Partridge* Brown University.
The BaBar Prompt Reconstruction Manager: a Real Life Example of a Constructive Approach to Software Development. Francesco Safai Tehrani Istituto Nazionale.
LHC Physics Analysis and Databases or: “How to discover the Higgs Boson inside a database” Maaike Limper.
Introduction What is detector simulation? A detector simulation program must provide the possibility of describing accurately an experimental setup (both.
A New Tool For Measuring Detector Performance in ATLAS ● Arno Straessner – TU Dresden Matthias Schott – CERN on behalf of the ATLAS Collaboration Computing.
STAR C OMPUTING Plans for Production Use of Grand Challenge Software in STAR Torre Wenaus BNL Grand Challenge Meeting LBNL 10/23/98.
Integration of the ATLAS Tag Database with Data Management and Analysis Components Caitriana Nicholson University of Glasgow 3 rd September 2007 CHEP,
Development of the CMS Databases and Interfaces for CMS Experiment: Current Status and Future Plans D.A Oleinik, A.Sh. Petrosyan, R.N.Semenov, I.A. Filozova,
Claudio Grandi INFN-Bologna CHEP 2000Abstract B 029 Object Oriented simulation of the Level 1 Trigger system of a CMS muon chamber Claudio Grandi INFN-Bologna.
Online Monitoring System at KLOE Alessandra Doria INFN - Napoli for the KLOE collaboration CHEP 2000 Padova, 7-11 February 2000 NAPOLI.
6 march Building the INFN Grid Proposal outline a.ghiselli,l.luminari,m.sgaravatto,c.vistoli INFN Grid meeting, milano.
TAGS in the Analysis Model Jack Cranshaw, Argonne National Lab September 10, 2009.
D0 Farms 1 D0 Run II Farms M. Diesburg, B.Alcorn, J.Bakken, R. Brock,T.Dawson, D.Fagan, J.Fromm, K.Genser, L.Giacchetti, D.Holmgren, T.Jones, T.Levshina,
Muon Persistency Persistent Analysis Objects Muon Persistency Norbert Neumeister µ-PRS meeting February 10, 2004.
STAR Scheduler Gabriele Carcassi STAR Collaboration.
Валидация TRT DCS CONDITIONS SERVICE Евгений Солдатов, НИЯУ МИФИ “Physics&Computing in ATLAS” – 22/09/2010.
L1Calo EM Efficiency Maps Hardeep Bansil University of Birmingham L1Calo Weekly Meeting 07/03/2011.
1/5/09 Jiri Masik1 Event Filter Tracking in the Inner Detector Jiri Masik EFID Trigger related work –Trigger bytestream –Event display.
Barthélémy von Haller CERN PH/AID For the ALICE Collaboration The ALICE data quality monitoring system.
ATLAS Distributed Computing Tutorial Tags: What, Why, When, Where and How? Mike Kenyon University of Glasgow.
The purpose of a CPU is to process data Custom written software is created for a user to meet exact purpose Off the shelf software is developed by a software.
Databases and DBMSs Todd S. Bacastow January 2005.
Fundamentals of Information Systems, Sixth Edition
U.S. ATLAS Grid Production Experience
CMS High Level Trigger Configuration Management
The Object-Oriented Database System Manifesto
Migration of reconstruction and analysis software to C++
The COMPASS event store in 2002
The Client/Server Database Environment
Introduction to Operating System (OS)
TriggerDB copy in TriggerTool
(CMS GEANT4 simulation)
Data, Databases, and DBMSs
COMPASS off-line computing
The AMI Database Project
MANAGING DATA RESOURCES
Data base management system dbms
Ainsley Smith Tel: Ex
ALICE Data Challenges Fons Rademakers Click to add notes.
Using an Object Oriented Database to Store BaBar's Terabytes
Geant4 in HARP V.Ivanchenko For the HARP Collaboration
2 Getting Started.
2 Getting Started.
2 Getting Started.
Use of GEANT4 in CMS The OSCAR Project
Presentation transcript:

The ZEUS Event Store An object-oriented tag database for physics analysis Adrian Fox-Murphy, DESY CHEP2000, Padova

The ZEUS Experiment ZEUS experiment at HERA, Hamburg. Electron-proton collisions at 318GeV. Operational since 1992 In 1996, started to augment existing offline environment with a tag database (ZES) based on Objectivity/DB. First experience with OO software for ZEUS. Tag database operational since 1997. CHEP2000, Padova

ZEUS Offline Environment Raw data rate: 10 Terabyte / year, 1MB/s Event rate: ~10Hz, 100Kb/event Centralised batch processing for physicists’ jobs 3 SGI Challenge XL machines Migrating soon to farm of 30 Linux PCs (M. Kowal’s talk) Raw and reconstructed data stored in ADAMO database User analysis jobs in FORTRAN. CHEP2000, Padova

CHEP2000, Padova

ZEUS Event Store (ZES) Must access required events efficiently Formerly Calculated 128 ‘DST bits’ Made boolean selection of bits Now... Populate a database with (250) physical variables calculated at reconstruction for each event. Kinematics, electrons, muons, hadrons, jets Calorimetry, triggers Select data on the values of the physical quantities. CHEP2000, Padova

CHEP2000, Padova

ZES (contd.) Cons: Use commercial product Objectivity/DB Pros: (v3.8 and v4.02). Pros: exists; meets our current and future requirements; tested in HEP community; OO/C++. Cons: need special interface to existing event database (ADAMO); different language depend on Objectivity (support, survival) CHEP2000, Padova

Database schema Objectivity persistent object hierarchy Federated Database Database (correspond to 200-450MB files) Container (HERA Run) Basic object (Event, ADAMO File) Currently 315 databases. Data in ooVarray variable length arrays gives flexibility for expansion CHEP2000, Padova

Lots of glue... 18k lines of C++ provide Interfaces and tools Interface to main ADAMO database Interface to EAZE analysis job environment Predicate string querying User command line query tool Ntuple generation tool Administration Database population consistency checking, troubleshooting

Selecting Events Card file interface select run range select triggers cut on physical variables using Objectivity/DB predicate strings : ((Ee>5)and(Zvtx>-50)and(Zvtx<50) and(Eminpz>35)and(Eminpz<65)and(Yjb>0.04)) Intuitive for physicist. Precisely tailored event selection (efficient). CHEP2000, Padova

Populating the database Two things required : The physical variables to load. Calculated during offline reconstruction. A pointer to the location of the event in the ADAMO database. Actually use a two-step process first store physical variables temporarily in ntuples (used also for data quality monitoring) then load ZES OO database from ntuples CHEP2000, Padova

Status of the ZES database CHEP2000, Padova

Experience Positive response from users. Adopted as standard. Can greatly reduce the time required to analyse the data. Factor 3-10 improvement typical. On request, the number of physical variables stored for each event was increased from 93 to 250. Problems database population must be done with care. Many things can go wrong. Consistency checking vital. CHEP2000, Padova

Summary ZES is an efficient and flexible tag database Successfully integrated into ZEUS (ADAMO/EAZE). Now holds 100 Million events from 5 years. Variables expanded to 250. Jobs run faster. ZES and Objectivity are flexible and powerful enough to meet current and future requirements. OO/C++ advantages Development, maintenance HEP OO analysis and visualisation tools Future evolution into fully OO software.