David Adams ATLAS DIAL/ADA JDL and catalogs David Adams BNL December 4, 2003 ATLAS software workshop Production session CERN.

Slides:



Advertisements
Similar presentations
D. Düllmann - IT/DB LCG - POOL Project1 POOL Release Plan for 2003 Dirk Düllmann LCG Application Area Meeting, 5 th March 2003.
Advertisements

David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL March 25, 2003 CHEP 2003 Data Analysis Environment and Visualization.
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL June 23, 2003 GAE workshop Caltech.
Lecture Nine Database Planning, Design, and Administration
Linux Operations and Administration
Database System Development Lifecycle © Pearson Education Limited 1995, 2005.
Overview of the Database Development Process
The ATLAS Production System. The Architecture ATLAS Production Database Eowyn Lexor Lexor-CondorG Oracle SQL queries Dulcinea NorduGrid Panda OSGLCG The.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
5 November 2001F Harris GridPP Edinburgh 1 WP8 status for validating Testbed1 and middleware F Harris(LHCb/Oxford)
K.Harrison CERN, 21st November 2002 GANGA: GAUDI/ATHENA AND GRID ALLIANCE - Background and scope - Project organisation - Technology survey - Design -
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL July 15, 2003 LCG Analysis RTAG CERN.
David Adams ATLAS ATLAS Distributed Analysis David Adams BNL March 18, 2004 ATLAS Software Workshop Grid session.
K. Harrison CERN, 20th April 2004 AJDL interface and LCG submission - Overview of AJDL - Using AJDL from Python - LCG submission.
Marianne BargiottiBK Workshop – CERN - 6/12/ Bookkeeping Meta Data catalogue: present status Marianne Bargiotti CERN.
How to Install and Use the DQ2 User Tools US ATLAS Tier2 workshop at IU June 20, Bloomington, IN Marco Mambelli University of Chicago.
David Adams ATLAS AJDL: Analysis Job Description Language David Adams BNL December 15, 2003 PPDG Collaboration Meeting LBL.
Grid Status - PPDG / Magda / pacman Torre Wenaus BNL U.S. ATLAS Physics and Computing Advisory Panel Review Argonne National Laboratory Oct 30, 2001.
1 st December 2003 JIM for CDF 1 JIM and SAMGrid for CDF Mòrag Burgon-Lyon University of Glasgow.
ATLAS DIAL: Distributed Interactive Analysis of Large Datasets David Adams – BNL September 16, 2005 DOSAR meeting.
David Adams ATLAS DIAL status David Adams BNL July 16, 2003 ATLAS GRID meeting CERN.
David Adams ATLAS ATLAS Distributed Analysis Plans David Adams BNL December 2, 2003 ATLAS software workshop CERN.
Event Data History David Adams BNL Atlas Software Week December 2001.
Datasets on the GRID David Adams PPDG All Hands Meeting Catalogs and Datasets session June 11, 2003 BNL.
David Adams ATLAS ADA, ARDA and PPDG David Adams BNL June 28, 2004 PPDG Collaboration Meeting Williams Bay, Wisconsin.
INFSO-RI Enabling Grids for E-sciencE ATLAS Distributed Analysis A. Zalite / PNPI.
David Adams ATLAS Architecture for ATLAS Distributed Analysis David Adams BNL March 25, 2004 ATLAS Distributed Analysis Meeting.
David Adams ATLAS DIAL status David Adams BNL November 21, 2002 ATLAS software meeting GRID session.
DDM Monitoring David Cameron Pedro Salgado Ricardo Rocha.
Metadata Mòrag Burgon-Lyon University of Glasgow.
David Adams ATLAS ADA: ATLAS Distributed Analysis David Adams BNL June 7, 2004 BNL Technology Meeting.
David Adams ATLAS Virtual Data in ATLAS David Adams BNL May 5, 2002 US ATLAS core/grid software meeting.
David Adams ATLAS ATLAS Distributed Analysis David Adams BNL September 30, 2004 CHEP2004 Track 5: Distributed Computing Systems and Experiences.
D. Adams, D. Liko, K...Harrison, C. L. Tan ATLAS ATLAS Distributed Analysis: Current roadmap David Adams – DIAL/PPDG/BNL Dietrich Liko – ARDA/EGEE/CERN.
David Adams ATLAS DIAL: Distributed Interactive Analysis of Large datasets David Adams BNL August 5, 2002 BNL OMEGA talk.
INFSO-RI Enabling Grids for E-sciencE Ganga 4 – The Ganga Evolution Andrew Maier.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL November 17, 2003 SC2003 Phoenix.
Transformation System report Luisa Arrabito 1, Federico Stagni 2 1) LUPM CNRS/IN2P3, France 2) CERN 5 th DIRAC User Workshop 27 th – 29 th May 2015, Ferrara.
K. Harrison CERN, 3rd March 2004 GANGA CONTRIBUTIONS TO ADA RELEASE IN MAY - Outline of Ganga project - Python support for AJDL - LCG analysis service.
David Adams ATLAS ATLAS distributed data management David Adams BNL February 22, 2005 Database working group ATLAS software workshop.
Korea Workshop May GAE CMS Analysis (Example) Michael Thomas (on behalf of the GAE group)
K. Harrison CERN, 22nd September 2004 GANGA: ADA USER INTERFACE - Ganga release status - Job-Options Editor - Python support for AJDL - Job Builder - Python.
David Adams ATLAS ATLAS Distributed Analysis: Overview David Adams BNL December 8, 2004 Distributed Analysis working group ATLAS software workshop.
David Adams ATLAS ATLAS-ARDA strategy and priorities David Adams BNL October 21, 2004 ARDA Workshop.
ATLAS-specific functionality in Ganga - Requirements for distributed analysis - ATLAS considerations - DIAL submission from Ganga - Graphical interfaces.
ADA Job Builder A Graphical Approach to Job Building ATLAS Software and Computing Workshop May 2005 Chun Lik Tan
David Adams ATLAS Datasets for the Grid and for ATLAS David Adams BNL September 24, 2003 ATLAS Software Workshop Database Session CERN.
Pavel Nevski DDM Workshop BNL, September 27, 2006 JOB DEFINITION as a part of Production.
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
ELSSISuite Services QIZHI ZHANG Argonne National Laboratory on behalf of the TAG developers group ATLAS Software and Computing Week, 4~8 April, 2011.
Status of tests in the LCG 3D database testbed Eva Dafonte Pérez LCG Database Deployment and Persistency Workshop.
K. Harrison CERN, 21st February 2005 GANGA: ADA USER INTERFACE - Ganga release Python client for ADA - ADA job builder - Ganga release Conclusions.
David Adams ATLAS ATLAS Distributed Analysis (ADA) David Adams BNL December 5, 2003 ATLAS software workshop CERN.
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
David Adams ATLAS ATLAS Distributed Analysis and proposal for ATLAS-LHCb system David Adams BNL March 22, 2004 ATLAS-LHCb-GANGA Meeting.
INFSO-RI Enabling Grids for E-sciencE Ganga 4 Technical Overview Jakub T. Moscicki, CERN.
David Adams ATLAS AJDL: Abstract Job Description Language David Adams BNL June 29, 2004 PPDG Collaboration Meeting Williams Bay.
David Adams ATLAS ADA: ATLAS Distributed Analysis David Adams BNL December 15, 2003 PPDG Collaboration Meeting LBL.
Joe Foster 1 Two questions about datasets: –How do you find datasets with the processes, cuts, conditions you need for your analysis? –How do.
Chapter 9 Database Planning, Design, and Administration Transparencies © Pearson Education Limited 1995, 2005.
1 DIRAC Project Status A.Tsaregorodtsev, CPPM-IN2P3-CNRS, Marseille 10 March, DIRAC Developer meeting.
EGEE is a project funded by the European Union under contract IST Report from the PTF Fabrizio Pacini Datamat S.p.a. Milan, IT-CZ JRA1 meeting,
Magda Distributed Data Manager Torre Wenaus BNL October 2001.
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL May 19, 2003 BNL Technology Meeting.
David Adams ATLAS Hybrid Event Store Integration with Athena/StoreGate David Adams BNL March 5, 2002 ATLAS Software Week Event Data Model and Detector.
David Adams Brookhaven National Laboratory September 28, 2006
ADA analysis transformations
Presentation transcript:

David Adams ATLAS DIAL/ADA JDL and catalogs David Adams BNL December 4, 2003 ATLAS software workshop Production session CERN

David Adams ATLAS DIAL/ADA JDL ATLAS SW – Prod sessionDecember 4, Contents DAC mandate Scope Strategy Scenario for first release Plans for the first release GANGA status DIAL status Deliverables for the first release Conclusions

David Adams ATLAS DIAL/ADA JDL ATLAS SW – Prod sessionDecember 4, ADA Strategy Implement DA as a collection of grid services As described in ARDA document Use ARDA components where possible Add missing and ATLAS-specific pieces Provide clients for ATLAS analysis environments Python, ROOT, command line Regular releases Perhaps for each SW week and ATLAS X.0 Provide useful tool Demonstrate functionality Expand functionality with each release

David Adams ATLAS DIAL/ADA JDL ATLAS SW – Prod sessionDecember 4, ADA Strategy (cont) Look to common projects for most of the pieces ARDA, GANGA, DIAL, … Share as much as possible with ATLAS production –Also distributed –Similar interfaces and code for bulk and user-level production ADA (ATLAS distributed analysis) must identify these pieces and tie them together Deployment ADA services must be deployed at relevant sites Provide testing and monitoring of these services Work with facilities to deploy and maintain –Also to develop facility-specific features

David Adams ATLAS DIAL/ADA JDL ATLAS SW – Prod sessionDecember 4, DIAL JDL High-level JDL DIAL envisions a hierarchy of schedulers Interface to these schedulers constitutes a high-level JDL (job definition language) –Job submission, monitoring and gathering of results –See figure Would like to standardize this JDL so schedulers can be shared between projects and experiments –See figure Exchanged objects have XML representations

David Adams ATLAS DIAL/ADA JDL ATLAS SW – Prod sessionDecember 4, User Analysis Job 1 Job 2 ApplicationTask Dataset 1 Scheduler 1. Create or locate 2. select3. Create or select 4. select 5. submit(app,tsk,ds) 6. split Dataset Dataset 2 7. create e.g. ROOT e.g. athena Result 9. fill 10. gather Result 9. fill ResultCode Components of DIAL high-level JDL

David Adams ATLAS DIAL/ADA JDL ATLAS SW – Prod sessionDecember 4, DIAL status: sharing via JDL

David Adams ATLAS DIAL/ADA JDL ATLAS SW – Prod sessionDecember 4, Scheduler Web service interface Scheduler class has similar interface See DIAL JDL page for complete interface & WSDL Partial list (last argument is return) –has_application(XML app, bool stat) –add_task(XML app, XML tsk, bool stat) –submit(XML app, XML tsk, XML dst, XML jobid) –job(XML jobid, XML job) Provide clients with similar interface Command line C++ (imported in ROOT) Python (future from GANGA)

David Adams ATLAS DIAL/ADA JDL ATLAS SW – Prod sessionDecember 4, Application Contents Name Version Corresponds to a software package Same name and version Provides two entry points –Build task –Process a dataset and generate a result Specifies dependencies (other packages) Eventually install with package management system –Now applications must be preinstalled

David Adams ATLAS DIAL/ADA JDL ATLAS SW – Prod sessionDecember 4, Task Contents Collection of named files –Embedded text, PFN or LFN Usage Input to task build –Used only by application

David Adams ATLAS DIAL/ADA JDL ATLAS SW – Prod sessionDecember 4, Dataset Contents Depend on type –See following class diagram for existing types Usage User –select from catalog (DSC) –query for content, # events, … System –locate accessible replica (DRC) –split –determine logical files for staging –extract application view (e.g. event collection) And more…

David Adams ATLAS DIAL/ADA JDL ATLAS SW – Prod sessionDecember 4, Dataset classes Single combined ntuple file (input to application) User selects Used for splitting

David Adams ATLAS DIAL/ADA JDL ATLAS SW – Prod sessionDecember 4, Result Content Depends on type Perhaps should be fixed as list of files –as for task Also need collection of results Usage Communicate results –From application to scheduler –From scheduler to user Provides code to merge with another result –This should probably move to a SW package >Then carry package name in result?

David Adams ATLAS DIAL/ADA JDL ATLAS SW – Prod sessionDecember 4, Job Content Application, task, dataset –provenance Status (running, done, failed, …) Time of start, stop and last update Result –May be empty or partial Usage User –Check status –Access partial results

David Adams ATLAS DIAL/ADA JDL ATLAS SW – Prod sessionDecember 4, Catalogs Two categories Repository –Provides access to XML description indexed by ID –Might be XML DB Selection –Enables user to select with query on catalog contents –Might be relational table Grid service interface User access only through grid service interface Same for access from other services? What granularity? –1 svc, 1 svc/catalog, …

David Adams ATLAS DIAL/ADA JDL ATLAS SW – Prod sessionDecember 4, Catalogs (cont) Application Which applications are available Task Repository –Task XML accessible by ID –Part of provenance system Selection catalog? Application-task consistency?

David Adams ATLAS DIAL/ADA JDL ATLAS SW – Prod sessionDecember 4, Catalogs (cont) Dataset Repository Selection catalog –Very important user interface –Only virtual datasets? Replica catalog –Virtual to non-virtual mapping Single file catalog File Replica catalog Result Repository

David Adams ATLAS DIAL/ADA JDL ATLAS SW – Prod sessionDecember 4, Catalogs (cont) Job Repository –Maybe use selection catalog as repository Selection –Contents >Job ID >Application name and version >Task ID >Dataset ID (and type?) >Result ID (and type?) >Parent job ID >Status –Is this the same catalog that feeds the production supervisor?

David Adams ATLAS DIAL/ADA JDL ATLAS SW – Prod sessionDecember 4, Connection to ATLAS production

David Adams ATLAS DIAL/ADA JDL ATLAS SW – Prod sessionDecember 4, Conclusions