ATLAS Distributed Analysis (ADA). David Adams, BNL. December 5, 2003. ATLAS software workshop, CERN.

Presentation transcript:


David Adams, ATLAS: ADA Plans (ATLAS SW – Summary session, December 5, 2003)

Contents
  DAC mandate
  Scope
  Strategy
  Scenario for first release
  Plan for the first release
  Deliverables for first release
  Conclusions

DAC Mandate
Distributed Analysis Coordinator
  Is responsible for coordinating the development of software tools for distributed analysis and their integration into the ATLAS software environment
  Start with the analysis of existing tools such as GANGA, DIAL, AtCom…
  Provide users with transparent access to metadata of different sorts as well as to event data in all stages of processing
  Participate actively in the definition of LCG projects such as ARDA
  Is a member of relevant LCG committees and working groups

Scope
Analysis (not necessarily distributed)
  Supports the manipulation and extraction of summary data (e.g. histograms) from any type of event data
    – AOD, ESD, …
  Supports user-level production of event data
    – e.g. MC generation, simulation and reconstruction
Distributed analysis
  Extends the extraction and production support to include distributed processing and distributed data
  Natural extension of non-distributed analysis
  Easily invoked from any ATLAS analysis environment
    – including Python, ROOT, command line
    – easily ported to any future environment (e.g. JAS)

Strategy
Implement ADA as a collection of grid services
  As described in ARDA document
  Use ARDA components where possible
  Add missing and ATLAS-specific pieces
Provide clients for ATLAS analysis environments
  Python, ROOT, command line
  Interface similar to that of DIAL
    – See figures
Regular releases
  Perhaps for each SW week and ATLAS X.0
  Expand functionality with each release

[Figure: ADA/DIAL user interface. Box labels: User Analysis, Application, Task, Code, Dataset, Dataset 1, Dataset 2, Scheduler, Job 1, Job 2, Result; example labels: "e.g. ROOT", "e.g. athena". Numbered steps: 1. Create or locate; 2. select; 3. Create or select; 4. select; 5. submit(app,tsk,ds); 6. split; 7. create; 9. fill; 10. gather.]
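
The submit, split, create, fill and gather steps named in the figure can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `Dataset`, `local_app`, `fill_histogram` and `submit` are hypothetical names invented here, not the ADA or DIAL interface, and a histogram is modeled as a plain `Counter`.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Dataset:
    """Hypothetical dataset: just a list of per-event values."""
    events: List[float]

    def split(self, n_parts: int) -> List["Dataset"]:
        """Divide the events into at most n_parts sub-datasets (step 6)."""
        size = max(1, -(-len(self.events) // n_parts))  # ceiling division
        return [Dataset(self.events[i:i + size])
                for i in range(0, len(self.events), size)]


def fill_histogram(events: List[float]) -> Counter:
    """Toy task: count events per unit-wide bin (non-negative values)."""
    return Counter(int(x) for x in events)


def local_app(task: Callable[[List[float]], Counter], dataset: Dataset) -> Counter:
    """Toy application: run the task over one sub-dataset (step 9, fill)."""
    return task(dataset.events)


def submit(app, task, dataset: Dataset, n_jobs: int) -> Counter:
    """Toy scheduler: split the dataset, run one job per part, gather results."""
    sub_datasets = dataset.split(n_jobs)                      # step 6: split
    partial_results = [app(task, ds) for ds in sub_datasets]  # steps 7 and 9
    result = Counter()
    for partial in partial_results:                           # step 10: gather
        result.update(partial)
    return result
```

Because the toy task only counts entries per bin, the gathered result equals what a single job over the whole dataset would produce; a real task would need a merge operation with the same property.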

[Figure: High-level JDL as a bridge; initial ADA.]

Strategy (cont)
Look to common projects for most of the pieces
  ARDA, GANGA, DIAL, …
  Share as much as possible with ATLAS production
    – Also distributed
    – Similar interfaces and code for bulk and user-level production
  ADA must identify these pieces and tie them together
Deployment
  ADA services must be deployed
    – Interactive service at one or two sites with data
  Provide testing and monitoring of these services
  Work with facilities to deploy and maintain
    – Also to develop facility-specific features
    – Looking for 1 or 2 initial sites for interactive service

Scenario for first release
Here is a scenario for user interaction with the first release of ADA
Authenticate
  – Proxy from authentication service
Choose application
  – E.g. PAW to process DC1 ntuples or
  – Athena to process DC2 AOD or
  – Athena reconstruction
Define task
  – Analysis: provide code to define and fill histograms
  – Production: athena job options, maybe code
  – Perhaps select starting point from task catalog
Select input dataset
  – From dataset (metadata) catalog service

Scenario for first release (cont)
Create job configuration
  – Response time, role, optional splitter, …
Locate processing service
Submit job
  – Application, task, dataset, configuration
While job is running
  – Query service for status and partial results
  – Examine partial results (e.g. histograms)
  – Kill job if results are bad
When job is finished
  – Examine complete result
  – Modify task or select new dataset and repeat
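
The submit, query, examine and kill loop in this scenario can be sketched as below. Everything here is an assumption made for illustration: `FakeAnalysisService` is an in-memory stand-in rather than an ADA service, the application and task arguments are opaque labels it ignores, and each status query simply advances the fake job by one chunk to mimic progress.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


@dataclass
class FakeJob:
    chunks: List[List[float]]
    processed: int = 0
    killed: bool = False
    histogram: Dict[int, int] = field(default_factory=dict)


class FakeAnalysisService:
    """In-memory stand-in for a processing service (hypothetical interface)."""

    def __init__(self) -> None:
        self._jobs: Dict[int, FakeJob] = {}

    def submit(self, application: str, task: str, dataset: List[float],
               configuration: Dict[str, int]) -> int:
        # application and task are ignored by this fake; only the data matters.
        size = configuration.get("chunk_size", 2)
        chunks = [dataset[i:i + size] for i in range(0, len(dataset), size)]
        job_id = len(self._jobs) + 1
        self._jobs[job_id] = FakeJob(chunks)
        return job_id

    def status(self, job_id: int) -> Tuple[str, Dict[int, int]]:
        """Return (state, partial histogram); each call processes one chunk."""
        job = self._jobs[job_id]
        if not job.killed and job.processed < len(job.chunks):
            for value in job.chunks[job.processed]:
                bin_index = int(value)
                job.histogram[bin_index] = job.histogram.get(bin_index, 0) + 1
            job.processed += 1
        if job.killed:
            state = "killed"
        elif job.processed == len(job.chunks):
            state = "done"
        else:
            state = "running"
        return state, dict(job.histogram)

    def kill(self, job_id: int) -> None:
        self._jobs[job_id].killed = True


def run_and_monitor(service: FakeAnalysisService, application: str, task: str,
                    dataset: List[float], configuration: Dict[str, int],
                    looks_bad: Callable[[Dict[int, int]], bool]):
    """Submit a job, poll for partial results, and kill it if they look bad."""
    job_id = service.submit(application, task, dataset, configuration)
    while True:
        state, partial = service.status(job_id)
        if state != "running":
            return state, partial
        if looks_bad(partial):
            service.kill(job_id)
            return "killed", partial
```

A caller would pass its own `looks_bad` check, for example one that rejects a histogram with entries in an unexpected bin, and then modify the task or pick a new dataset before resubmitting.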

Plan for first release
Schedule
  Implement and deploy in advance of March 2004 software workshop
  Might slip to May meeting
Building blocks
  Code and developers in GANGA and DIAL
  LCG project following from ARDA
    – Just starting; so don’t wait but
    – Stay closely coupled to that project
  Open to contributions (especially effort) from others

Deliverables for first release
Comments
  Goal is to support the scenario outlined earlier
  Build on current GANGA and DIAL implementations and plans
  Emergence of ARDA project may change plans
  Add more tasks if more ideas and effort are found

Deliverables for first release (cont)
Authentication service
  GSI based
  Support both EDG and US certificates
High-level JDL
  Start from current DIAL interface
  Incorporate ideas from PPDG, ARDA, …
    – If available in time
  This defines the interface (WSDL) for the following analysis and production services
Clients for analysis environments
  Command line, GANGA and ROOT
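
A high-level job description of this kind has to carry the four submission inputs the slides name (application, task, dataset, configuration) between clients and services. The sketch below shows one possible shape for such a record. The field names, the use of JSON, and the example values are all assumptions chosen for illustration; the slides do not specify the JDL's actual schema or wire format.

```python
import json
from dataclasses import asdict, dataclass
from typing import List, Optional


@dataclass
class JobConfiguration:
    """Hypothetical configuration fields, loosely following the slides."""
    response_time_s: float
    role: str
    splitter: Optional[str] = None


@dataclass
class JobRequest:
    """Hypothetical job description: application, task, dataset, configuration."""
    application: str
    task_files: List[str]
    dataset_id: str
    configuration: JobConfiguration

    def to_json(self) -> str:
        # asdict() converts the nested dataclass into a nested dict as well.
        return json.dumps(asdict(self), sort_keys=True)

    @classmethod
    def from_json(cls, text: str) -> "JobRequest":
        raw = json.loads(text)
        raw["configuration"] = JobConfiguration(**raw["configuration"])
        return cls(**raw)
```

Keeping the description as plain data means the same record can be built from any client (command line, GANGA, ROOT) and handed to any service that understands it.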

Deliverables for first release (cont)
Interactive analysis service
  Goal is “interactive” response time
  Initial implementation at one or two sites
  Build on existing DIAL scheduler service
    – Add authentication
    – Deploy as web or grid service
  Application/task/dataset
    – PAW with fortran task to fill histos from combined ntuples
    – Add ROOT with C++ task to fill from ROOT ntuples?
    – Add athena with C++ task to fill from AOD?

Deliverables for first release (cont)
Batch analysis service
  Batch-like response
  Processing distributed over grid
  Start from ATLAS production supervisor/executor
    – See figure
  Support athena tasks
    – Fill histograms from AOD
    – Reconstruction

[Figure: Possible connections to ATLAS production. Labels: Possible ADA scheduler services; Production system; Different flavors for different grids.]

Deliverables for first release (cont)
Catalog services
  Catalog tasks, datasets, results and jobs
  Dataset catalog functionality:
    – Means for users to select an input dataset
    – Means for production to register output dataset
    – Means for system (e.g. DIAL scheduler) to turn dataset specification into accessible physical files
  Host in AMI
    – Add grid service interface
  Try to share with production
  File catalog and replication services
    – Use DMS?
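
The three dataset catalog functions listed above (select, register, resolve to physical files) can be sketched with an in-memory catalog. This is a minimal sketch under assumptions: `DatasetCatalog`, `DatasetRecord`, the metadata keys and the file paths are hypothetical and do not reflect the AMI schema or any file catalog service.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class DatasetRecord:
    """Hypothetical catalog entry: an id, metadata, and logical file names."""
    dataset_id: str
    metadata: Dict[str, str]
    logical_files: List[str]


class DatasetCatalog:
    """In-memory stand-in for a dataset (metadata) catalog service."""

    def __init__(self, replicas: Dict[str, List[str]]) -> None:
        self._records: Dict[str, DatasetRecord] = {}
        # Stand-in for a file catalog: logical file name -> physical locations.
        self._replicas = replicas

    def register(self, record: DatasetRecord) -> None:
        """Production side: register an output dataset."""
        if record.dataset_id in self._records:
            raise ValueError(f"dataset already registered: {record.dataset_id}")
        self._records[record.dataset_id] = record

    def select(self, **criteria: str) -> List[str]:
        """User side: ids of datasets whose metadata match all criteria."""
        return sorted(
            dataset_id
            for dataset_id, record in self._records.items()
            if all(record.metadata.get(k) == v for k, v in criteria.items())
        )

    def resolve(self, dataset_id: str) -> List[str]:
        """System side: turn a dataset id into one physical file per logical file."""
        physical = []
        for lfn in self._records[dataset_id].logical_files:
            locations = self._replicas.get(lfn, [])
            if not locations:
                raise LookupError(f"no replica known for {lfn}")
            physical.append(locations[0])  # naive choice: first listed replica
        return physical
```

Picking the first replica is a placeholder; a real scheduler would presumably choose by site or access cost.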

Conclusions
Distributed analysis is a new project for ATLAS
Philosophy
  Tightly integrate with non-distributed analysis
  Be neutral: use client-server mechanism to support different analysis environments and different processing systems
  Be flexible: capabilities (and hence demands) will change as technology evolves
  Be responsive to evolving user requirements
  Build on existing ideas and projects including GANGA, DIAL, ATLAS production, ARDA, GriPhyN/IVDGL, PPDG, …

Conclusions (cont)
Plan of action
  Define interface (high-level JDL)
  Quickly implement clients
    – command line, GANGA, ROOT
  Quickly implement services
    – authentication and authorization
    – interactive and batch analysis/production
    – catalogs
  Expose to users, learn lessons and re-implement
  Repeat
More information
  Web site coming soon
  Mail to