The Persistency Patterns of Time Evolving Conditions for ATLAS and LCG António Amorim CFNUL- FCUL - Universidade de Lisboa 2004-09 1 A. António, Dinis.

Presentation transcript:

The Persistency Patterns of Time Evolving Conditions for ATLAS and LCG
António Amorim
CFNUL - FCUL - Universidade de Lisboa
A. António, Dinis Klose, Luis Pedro, Nuno Barros, Tiago Franco
Andrea Valassi, Dirk Düllmann
● The “Extended” Conditions Interface (MySQL)
● The ATLAS Experience
● Dealing with files or references
● Features being investigated
● Conclusions and open questions

The Extended Conditions Interface (MySQL)
● The main ATLAS domains with time management:
– Calibration/Alignment; (Slow) Control; Configuration; Monitoring
● CondDB was initially developed by BaBar using an ODBMS.
● It was redesigned at CERN and later re-implemented in ORACLE.
● We implemented it in MySQL and saw the need to extend it:
– It contained only BLOBs with time intervals, versions and tags.
– Outside of the “ATLAS Rec. Framework” the objects were meaningless.
– The time behavior was not appropriate for online Control and Configuration.
– It did not scale with data that keeps being loaded.

The Extended Conditions Interface
● Schema in DB
● The CondDBtable container: a single transient C++ class for a generalized table, including arrays of any type as cells.
● Open Source RDBMS available on all OS+Comp.
● Improved folders
● Online Folders: not knowing the end validity
● Removed

Generalized Container
● Abstract Interface -> ICondDBTable
● Performance-optimized implementation based on:
– Variable type STL containers
– Numerical data in binary form
– Including vectors as DB entry values
● Used in all DB operations
● Being extended for generic object columns
● Extend to ROOT or POOL container behavior?

Online Folders
● Normal folder
– Different versions for t0
– Many versions for large t
– Can correct for old times
– Versions can be tagged
– Can be our containers or blobs
● Online Folder
– Any update cuts the previous interval
– Single version
– Cannot correct
– Fast and efficient
[Diagrams: version vs. time (t0) for each folder type; labels: Calibration/Alignment, DCS, DAQ, LOG]
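The online-folder behaviour listed above (single version, each update cuts the previous interval, no corrections of old times) can be sketched in a few lines. This is only an illustration, not the CondDBMySQL implementation: the class name `OnlineFolder`, its methods and the half-open interval convention [since, until) are assumptions made for the example.

```python
import math
from bisect import bisect_right


class OnlineFolder:
    """Hypothetical single-version folder: every insert cuts the open interval."""

    def __init__(self):
        self._starts = []  # sorted interval start times
        self._values = []  # payload valid from the matching start time

    def insert(self, since, value):
        # Online data only moves forward in time: old times cannot be corrected.
        if self._starts and since <= self._starts[-1]:
            raise ValueError("online folders cannot correct earlier times")
        # Appending implicitly cuts the previous interval [t1, inf) at `since`.
        self._starts.append(since)
        self._values.append(value)

    def find(self, t):
        """Return (since, until, value) for the interval that contains t."""
        i = bisect_right(self._starts, t) - 1
        if i < 0:
            raise KeyError(t)
        last = i + 1 == len(self._starts)
        until = math.inf if last else self._starts[i + 1]
        return self._starts[i], until, self._values[i]
```

Because the end of validity is never stored, an insert is a plain append and a lookup is a binary search, which is one plausible reading of "fast and efficient" on the slide.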

Asynchronous Folders (ID)
● DCS: one channel varies and the others do not
● Configuration/geometry/parameters
– A small number of parameters is changed often
– The others are kept valid
● Often used for DCS, configuration, detector description
[Diagram: ID vs. time (t0)]
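A minimal sketch of the asynchronous idea, assuming each "id" (for example a DCS channel) keeps its own independent sequence of intervals, so that updating one channel leaves the others valid. The class `AsyncIdFolder`, its methods and the channel names used in the usage note are hypothetical.

```python
from bisect import bisect_right
from collections import defaultdict


class AsyncIdFolder:
    """Hypothetical folder in which every id evolves on its own timeline."""

    def __init__(self):
        self._starts = defaultdict(list)  # id -> sorted start times
        self._values = defaultdict(list)  # id -> values, parallel to _starts

    def insert(self, channel_id, since, value):
        starts = self._starts[channel_id]
        if starts and since <= starts[-1]:
            raise ValueError("updates must move forward in time per id")
        # Only this id's open interval is cut; all other ids are untouched.
        starts.append(since)
        self._values[channel_id].append(value)

    def snapshot(self, t):
        """Return {id: value} for every id that has a value valid at time t."""
        result = {}
        for channel_id, starts in self._starts.items():
            i = bisect_right(starts, t) - 1
            if i >= 0:
                result[channel_id] = self._values[channel_id][i]
        return result
```

With two channels `"hv1"` and `"hv2"` inserted at time 0 and only `"hv1"` updated at time 50, `snapshot(60)` returns the new `"hv1"` value together with the original `"hv2"` value.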

Partitions on time (scaling)
● To scale the DB servers, the DB administrator can partition the folders in time, for example:
– year 2004 in server atl04, DB conditions04
– year 2005 (up to June) in server atl05, DB conditions05-1
– year 2005 (after June) in server atl05, DB conditions05-2
– ...
● This is transparent to the user since the first query is always to the partition master.
● Objects overlapping the partition boundaries are replicated in both partitions.
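The routing role of the partition master can be sketched as a lookup from a time to a (server, database) pair, plus a rule that an object overlapping a boundary is written to every partition it touches. The names `Partition` and `PartitionMaster` are hypothetical, and the exact boundary dates (1 January and 1 July 2005) are assumptions read into "up to June" and "after June".

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Partition:
    since: date  # inclusive start of the time range
    until: date  # exclusive end of the time range
    server: str
    database: str


class PartitionMaster:
    """Hypothetical master table that is always queried first."""

    def __init__(self, partitions):
        self._partitions = sorted(partitions, key=lambda p: p.since)

    def locate(self, t):
        # Reads: exactly one partition holds the data valid at time t.
        for p in self._partitions:
            if p.since <= t < p.until:
                return p
        raise LookupError(f"no partition covers {t}")

    def targets_for(self, since, until):
        # Writes: an object valid over [since, until) is replicated in
        # every partition whose range it overlaps.
        return [p for p in self._partitions if p.since < until and since < p.until]


MASTER = PartitionMaster([
    Partition(date(2004, 1, 1), date(2005, 1, 1), "atl04", "conditions04"),
    Partition(date(2005, 1, 1), date(2005, 7, 1), "atl05", "conditions05-1"),
    Partition(date(2005, 7, 1), date(2006, 1, 1), "atl05", "conditions05-2"),
])
```

Replicating boundary objects on write is what lets a read be served by a single partition, at the cost of some duplicated storage.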

Running for ATLAS
● DCS
– Detector (Slow) Control
– PVSS (SCADA) -> Conditions
● T/DAQ
– CDI (Interface)
– Online Information System, Messages -> Conditions
● Reconstruction Framework (ATHENA)
– Conditions Athena (Conversion Service)
● ATLAS Sub-detector configuration and monitoring
[Diagram labels: PVSS Manager; CDI Interface; DB Container; Conversion Service]

CondDB Browser
● C++ API -> php binding -> Browser
● Integrated with the ATLAS NOVA database system
● Can see the data in the Gen. Container

ATLAS Test-beam operation
● Combined test-beam (ATLAS slice test)
● Online MySQL server -> Offline MySQL server
● ~8.5 GBytes of database data
● 1859 Folders in total
● 43 Generic Containers; 1809 Online folders; 7 Online Asynch (References to NOVA tables)
● More details in the page:
● Running successfully!
● Simulation/Reconstruction: Calibration and Alignment is a refinement
– Not yet tested extensively

Schema - Let the DB do its work
[Diagram labels: Folder Specific; Folder & Partition Specific]

Could it all be files?
● Online:
– Many small objects with irregular access pattern
– Distributed environment: all files should always be accessible to all nodes.
– Event -> Index -> (logical) Calibration Files -> Files
● Offline:
– Data Sets associated with Calibration/Online files in Index File -> (logical) Calibration Files -> Files
– All (small) files should always be accessible
● DB effort shifted -> replica catalog and file server.
● Bringing to the client objects (files) not needed for the job.

Dealing with references
● References to external objects have to be managed very carefully.
● Any reference is a possible break of integrity: “referential integrity” (even using logical file names).
● A reference to an object in a file on my laptop can be lost just because the laptop is stolen!
[Diagram, “Instead of:”: databases -> references -> Data; Client]
[Diagram, “Investigate:”: databases; references; Data copy; Client; Data; Local list; MD5 folder + MD5 index]
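The diagram does not survive in the transcript, so the following is only one possible reading of "Data copy", "Local list" and "MD5": the client keeps local copies of referenced data, indexed by MD5 digest, and checks every fetched object against the digest stored with the reference. The names `md5_hex` and `LocalCopyCache`, and the `fetch` callable standing in for the remote read, are all hypothetical.

```python
import hashlib


def md5_hex(data: bytes) -> str:
    """MD5 digest of a payload, used here only as an integrity fingerprint."""
    return hashlib.md5(data).hexdigest()


class LocalCopyCache:
    """Hypothetical client-side list of data copies, indexed by MD5 digest."""

    def __init__(self, fetch):
        self._fetch = fetch    # callable: logical name -> bytes (assumed remote read)
        self._by_digest = {}   # MD5 hex digest -> verified local copy

    def get(self, name, expected_md5):
        data = self._by_digest.get(expected_md5)
        if data is None:
            data = self._fetch(name)
            # A mismatch means the reference no longer points at the data
            # the database recorded: referential integrity is broken.
            if md5_hex(data) != expected_md5:
                raise ValueError(f"broken reference: {name}")
            self._by_digest[expected_md5] = data
        return data
```

Under these assumptions a lost or altered external file is detected at read time rather than silently used, and repeated reads of the same object are served from the local copy.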

Features Being Investigated
● Abstract interface: implementations in ORACLE and MySQL
– Interface classes associated with specific objects, ex: a FolderManager object is one particular folder
– Possible namespace
– Use exceptions plus a wrapper to access without exceptions
● Keep hierarchical view (folders and folder sets (/../../../..))
● Extended collection view (generalized CondDBTable)
– Column types (simple, var-array, extended objects)
– A POOL or ROOT class as a column type!
– Also column with variable type (from NOVA functionality)
– Line restriction and column projection by the server on query
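To make "line restriction and column projection" concrete, here is a small in-memory sketch of a generalized table with typed columns, including array-valued cells. It does not reproduce the ICondDBTable interface; `GenericTable`, `append` and `select` are hypothetical names, and in the real system the selection would be executed by the database server rather than in the client.

```python
class GenericTable:
    """Hypothetical generalized table: typed columns, array cells allowed."""

    def __init__(self, columns):
        # columns: mapping of column name -> Python type (list for var-arrays)
        self._columns = dict(columns)
        self._rows = []

    def append(self, **cells):
        if set(cells) != set(self._columns):
            raise ValueError("row must provide exactly the declared columns")
        for name, value in cells.items():
            if not isinstance(value, self._columns[name]):
                raise TypeError(f"column {name!r} expects {self._columns[name].__name__}")
        self._rows.append(dict(cells))

    def select(self, columns=None, where=None):
        """Line restriction (`where`) and column projection (`columns`)."""
        names = list(self._columns) if columns is None else list(columns)
        return [
            {name: row[name] for name in names}
            for row in self._rows
            if where is None or where(row)
        ]
```

For instance, with columns `channel` (str), `voltage` (float) and `samples` (list), `select(columns=["channel"], where=lambda r: r["voltage"] > 1500.0)` returns only the channel names of the rows that pass the cut, so the array cells never reach the client.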

Extended Time behaviour
1) Online folder type: no versions, t-interval: cut at insertion time, t2 in [t1, inf[ -> [t1, t2] + [t2, inf[
a) Single object as a function of time
b) Collection with “id”s that evolve differently
2) Offline folder including versions and tags. Usual version-time diagram with: null object suppression + cut at insertion time for t > last start time in folder (v=0)
3) Data mining folder: as in 2) but with internal time intervals per line. Ex: temperatures for 1 month.
4) Bare – time independent nature
[Diagram: version vs. time; label: AVOID]

Investigations on Tagging
1) Tag from head
2) Tag from tag: create a tag with the objects of another tag
3) Tag to tag: hierarchical tags that point to several other tags
4) Create tag or re-tag to the “old” head with insertion time less than or equal to a given time (from BaBar)
5) Use a re-tag time interval that only changes objects that are contained in a user time interval.
[Diagram: version vs. time; label: Insert time]
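Options 1), 2) and 4) can be sketched with a toy versioned folder in which the head at time t is the most recently inserted object covering t, and a tag is a frozen set of objects resolved by the same rule. This is an illustrative model, not the CondDB schema: `VersionedFolder`, its methods and the logical insertion clock are assumptions, and options 3) and 5) are not covered.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CondObject:
    since: float      # start of validity (inclusive)
    until: float      # end of validity (exclusive)
    insert_time: int  # logical insertion time
    payload: str


class VersionedFolder:
    """Hypothetical offline folder with versions and tags."""

    def __init__(self):
        self._objects = []
        self._tags = {}  # tag name -> frozenset of indices into _objects
        self._clock = 0  # logical insertion clock

    def insert(self, since, until, payload):
        self._clock += 1
        self._objects.append(CondObject(since, until, self._clock, payload))
        return self._clock

    def tag_head(self, name, as_of=None):
        # 1) tag from head, or 4) re-tag to the "old" head as it stood
        # at insertion time `as_of`.
        limit = self._clock if as_of is None else as_of
        self._tags[name] = frozenset(
            i for i, obj in enumerate(self._objects) if obj.insert_time <= limit
        )

    def tag_from_tag(self, new_name, source_name):
        # 2) a new tag holding the objects of another tag.
        self._tags[new_name] = self._tags[source_name]

    def find(self, t, tag=None):
        ids = range(len(self._objects)) if tag is None else self._tags[tag]
        covering = [
            self._objects[i] for i in ids
            if self._objects[i].since <= t < self._objects[i].until
        ]
        if not covering:
            raise KeyError(t)
        # Among the visible objects, the latest insertion wins.
        return max(covering, key=lambda obj: obj.insert_time).payload
```

In this model a tag is unaffected by later insertions, which is the property that makes tagged conditions reproducible while the head keeps evolving.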

Redesign the Interface and Investigate a three layer approach and RAL
[Diagram, three layers: Abstract; DBMS independent; Specific]

Conclusions and open questions
● The simple approaches of CondDBMySQL have allowed us to focus on prototyping different time patterns.
● The implementation on MySQL worked OK.
● We tried to keep the package self-contained.
● LCG component integration is being developed.
● The database as:
– the central available repository of schema and data
– or just a shared index facility.
● The Gen. Container allows retrieval of parts of the objects selected on the server by line or column.
● The POOL/ROOT objects are an entry in the G. Container or the G. Table is a particular POOL/ROOT object…