ARDA Prototypes Andrew Maier CERN. ARDA WorkshopAndrew Maier, CERN2 Overview ARDA in a nutshell –Experiments –Middleware Experiment prototypes (basic.

Slides:



Advertisements
Similar presentations
ATLAS/LHCb GANGA DEVELOPMENT Introduction Requirements Architecture and design Interfacing to the Grid Ganga prototyping A. Soroko (Oxford), K. Harrison.
Advertisements

1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
A tool to enable CMS Distributed Analysis
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
Ganga Developments Karl Harrison (University of Cambridge) 18th GridPP Meeting University of Glasgow, 20th-21st March 2007
EGEE is a project funded by the European Union under contract IST “The ARDA project: Grid analysis prototypes of the LHC experiments” Massimo.
5 November 2001F Harris GridPP Edinburgh 1 WP8 status for validating Testbed1 and middleware F Harris(LHCb/Oxford)
K.Harrison CERN, 21st November 2002 GANGA: GAUDI/ATHENA AND GRID ALLIANCE - Background and scope - Project organisation - Technology survey - Design -
David Adams ATLAS ATLAS Distributed Analysis David Adams BNL March 18, 2004 ATLAS Software Workshop Grid session.
EGEE is a project funded by the European Union under contract IST The ARDA Project Between Grid middleware and LHC experiments Juha Herrala.
K. Harrison CERN, 20th April 2004 AJDL interface and LCG submission - Overview of AJDL - Using AJDL from Python - LCG submission.
David Adams ATLAS ATLAS Distributed Analysis Plans David Adams BNL December 2, 2003 ATLAS software workshop CERN.
EGEE is a project funded by the European Union under contract IST “ARDA status” Dietrich Liko / CERN XXVII HTASC, 10 September
DOSAR Workshop, Sao Paulo, Brazil, September 16-17, 2005 LCG Tier 2 and DOSAR Pat Skubic OU.
David Adams ATLAS ADA, ARDA and PPDG David Adams BNL June 28, 2004 PPDG Collaboration Meeting Williams Bay, Wisconsin.
Enabling Grids for E-sciencE System Analysis Working Group and Experiment Dashboard Julia Andreeva CERN Grid Operations Workshop – June, Stockholm.
ROOT-CORE Team 1 PROOF xrootd Fons Rademakers Maarten Ballantjin Marek Biskup Derek Feichtinger (ARDA) Gerri Ganis Guenter Kickinger Andreas Peters (ARDA)
T3 analysis Facility V. Bucard, F.Furano, A.Maier, R.Santana, R. Santinelli T3 Analysis Facility The LHCb Computing Model divides collaboration affiliated.
David Adams ATLAS DIAL/ADA JDL and catalogs David Adams BNL December 4, 2003 ATLAS software workshop Production session CERN.
ATLAS is a general-purpose particle physics experiment which will study topics including the origin of mass, the processes that allowed an excess of matter.
LCG ARDA status Massimo Lamanna 1 ARDA in a nutshell ARDA is an LCG project whose main activity is to enable LHC analysis on the grid ARDA is coherently.
INFSO-RI Enabling Grids for E-sciencE Experience of using gLite for analysis of ATLAS combined test beam data A. Zalite / PNPI.
David Adams ATLAS ATLAS Distributed Analysis David Adams BNL September 30, 2004 CHEP2004 Track 5: Distributed Computing Systems and Experiences.
A PanDA Backend for the Ganga Analysis Interface J. Elmsheuser 1, D. Liko 2, T. Maeno 3, P. Nilsson 4, D.C. Vanderster 5, T. Wenaus 3, R. Walker 1 1: Ludwig-Maximilians-Universität.
EGEE is a project funded by the European Union under contract IST “Interfacing to the gLite Prototype” Andrew Maier / CERN LCG-SC2, 13 August.
15 December 2015M. Lamanna “The ARDA project”1 The ARDA Project (meeting with the LCG referees) Massimo Lamanna CERN.
INFSO-RI Enabling Grids for E-sciencE Ganga 4 – The Ganga Evolution Andrew Maier.
ARDA Prototypes Julia Andreeva/CERN On behalf of the ARDA team CERN.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
Testing and integrating the WLCG/EGEE middleware in the LHC computing Simone Campana, Alessandro Di Girolamo, Elisa Lanciotti, Nicolò Magini, Patricia.
K. Harrison CERN, 22nd September 2004 GANGA: ADA USER INTERFACE - Ganga release status - Job-Options Editor - Python support for AJDL - Job Builder - Python.
ANALYSIS TOOLS FOR THE LHC EXPERIMENTS Dietrich Liko / CERN IT.
David Adams ATLAS ATLAS-ARDA strategy and priorities David Adams BNL October 21, 2004 ARDA Workshop.
ATLAS-specific functionality in Ganga - Requirements for distributed analysis - ATLAS considerations - DIAL submission from Ganga - Graphical interfaces.
EGEE is a project funded by the European Union under contract IST “ARDA input for the testing coordination meeting” Massimo Lamanna CERN, 7.
Daniele Spiga PerugiaCMS Italia 14 Feb ’07 Napoli1 CRAB status and next evolution Daniele Spiga University & INFN Perugia On behalf of CRAB Team.
INFSO-RI Enabling Grids for E-sciencE Using of GANGA interface for Athena applications A. Zalite / PNPI.
The ATLAS Strategy for Distributed Analysis on several Grid Infrastructures D. Liko, IT/PSS for the ATLAS Distributed Analysis Community.
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
EGEE is a project funded by the European Union under contract IST ARDA Project Status Massimo Lamanna ARDA Project Leader NA4/HEP Cork, 19.
ATLAS Distributed Analysis Dietrich Liko IT/GD. Overview  Some problems trying to analyze Rome data on the grid Basics Metadata Data  Activities AMI.
Enabling Grids for E-sciencE INFSO-RI Enabling Grids for E-sciencE Gavin McCance GDB – 6 June 2007 FTS 2.0 deployment and testing.
Enabling Grids for E-sciencE CMS/ARDA activity within the CMS distributed system Julia Andreeva, CERN On behalf of ARDA group CHEP06.
Distributed Analysis Tutorial Dietrich Liko. Overview  Three grid flavors in ATLAS EGEE OSG Nordugrid  Distributed Analysis Activities GANGA/LCG PANDA/OSG.
K. Harrison CERN, 21st February 2005 GANGA: ADA USER INTERFACE - Ganga release Python client for ADA - ADA job builder - Ganga release Conclusions.
David Adams ATLAS ATLAS Distributed Analysis (ADA) David Adams BNL December 5, 2003 ATLAS software workshop CERN.
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
David Adams ATLAS ATLAS Distributed Analysis and proposal for ATLAS-LHCb system David Adams BNL March 22, 2004 ATLAS-LHCb-GANGA Meeting.
INFSO-RI Enabling Grids for E-sciencE Ganga 4 Technical Overview Jakub T. Moscicki, CERN.
ATLAS Distributed Analysis DISTRIBUTED ANALYSIS JOBS WITH THE ATLAS PRODUCTION SYSTEM S. González D. Liko
EGEE is a project funded by the European Union under contract IST Feedback on the gLite middleware Dietrich Liko / IT - LCG ARDA Workshop,
CMS Production Management Software Julia Andreeva CERN CHEP conference 2004.
David Adams ATLAS ADA: ATLAS Distributed Analysis David Adams BNL December 15, 2003 PPDG Collaboration Meeting LBL.
The ARDA Project Prototypes for User Analysis on the GRID Dietrich Liko/CERN IT.
ARDA Massimo Lamanna / CERN Massimo Lamanna 2 TOC ARDA Workshop Post-workshop activities Milestones (already shown in December)
INFSO-RI Enabling Grids for E-sciencE File Transfer Software and Service SC3 Gavin McCance – JRA1 Data Management Cluster Service.
Breaking the frontiers of the Grid R. Graciani EGI TF 2012.
CMS Experience with the Common Analysis Framework I. Fisk & M. Girone Experience in CMS with the Common Analysis Framework Ian Fisk & Maria Girone 1.
ATLAS Physics Analysis Framework James R. Catmore Lancaster University.
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
Acronyms GAS - Grid Acronym Soup, LCG - LHC Computing Project EGEE - Enabling Grids for E-sciencE.
EGEE is a project funded by the European Union under contract IST “ARDA status” Massimo Lamanna / CERN LCG-SC2, 13 August
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
EGEE is a project funded by the European Union under contract IST “The ARDA project” Massimo Lamanna / CERN on behalf of the LCG-ARDA project.
L’analisi in LHCb Angelo Carbone INFN Bologna
INFN-GRID Workshop Bari, October, 26, 2004
LHCb Computing Model and Data Handling Angelo Carbone 5° workshop italiano sulla fisica p-p ad LHC 31st January 2008.
LCG middleware and LHC experiments ARDA project
The Ganga User Interface for Physics Analysis on Distributed Resources
Presentation transcript:

ARDA Prototypes Andrew Maier CERN

ARDA WorkshopAndrew Maier, CERN2 Overview ARDA in a nutshell –Experiments –Middleware Experiment prototypes (basic components, ARDA contribution, status, plans) –CMS, ATLAS, LHCb and ALICE Conclusions

ARDA WorkshopAndrew Maier, CERN3 ARDA and HEP experiments EGEE middleware (gLite) LHCb Ganga,Dirac, Gaudi, DaVinci… Alice ROOT,AliRoot, Proof… CMS Cobra,Orca, OCTOPUS… Atlas Dial,Ganga, Athena, Don Quijote ARDA Interfacing of middleware for use in the experiment frameworks and applications Early deployment of (a series of) prototypes supporting analysis tasks in the distributed environment Provide early and continuous feedback

ARDA WorkshopAndrew Maier, CERN4 Working model Development of one prototype per experiment –ARDA emphasis is to enable each of the experiment to do its job –A Common Application Layer might emerge in future Provide a forum for discussion –Comparison on results/experience/ideas –Interaction with other projects –… Organizes workshops to interact with the community

ARDA WorkshopAndrew Maier, CERN5 Analysis environment –Multiple users Robustness might be an issue –Concurrent “read” actions Performance should be addressed –Used by all physicists for their analysis Easy access and simplicity Additional requirements

ARDA WorkshopAndrew Maier, CERN6 LCG milestones End-To-End Prototype activity Date Description Dec 2004 E2E prototype for each experiments (4 prototypes), capable of analysis (or advanced production) Dec 2005 E2E prototype for each experiments (4 prototypes), capable of analysis and production

ARDA WorkshopAndrew Maier, CERN7 Middleware Prototype Available for us since May 18 th –In the first month, many problems connected with the stability of the service and procedures –At that point just a few worker nodes available –Most important services are available: file catalog, authentication module, job queue, meta-data catalog, package manager, Grid access service –A second site (Madison) available since the end of June –CASTOR access to the actual data store Number of CPUs will increase –50 as a target for CERN, hardware available Number of sites will increase

ARDA WorkshopAndrew Maier, CERN8 Prototypes overview LHC Experiment Main focus Basic prototype component Experiment analysis application framework Middleware prototype GUI to GridGANGA DaVincigLite Interactive analysis PROOF AliROOT gLite High level service DIALAthenagLite Use of maximum native gLite functionality Not yet fully defined by CMS ORCAgLite

ARDA WorkshopAndrew Maier, CERN9 LHCb Basic component of the prototype defined by the experiment : GANGA - Gaudi/Athena aNd Grid Alliance GANGA GUI JobOptions Algorithms Collective & Resource Grid Services Submitting jobs Monitoring Retrieving results Framework for job creating-submitting- monitoring Can be used with different applications and different submission back-ends Was chosen as a prototype component also by the Atlas experiment GAUDI Program GAUDI – LHCb analysis framework Experiment Book-keeping DB

ARDA WorkshopAndrew Maier, CERN10 ARDA contributions : –GANGA Release management and software process CVS, Savannah,… –GANGA Participating in the development driven by the GANGA team –GANGA-gLite Integrating of GANGA with gLite Enabling job submission through GANGA to gLite Job splitting and merging Retrieving results –GANGA-gLite-DaVinci Enabling real analysis jobs (DaVinci) to run on gLite using GANGA framework Running DaVinci jobs on gLite Installing and managing LHCb software on gLite using gLite package manager LHCb

ARDA WorkshopAndrew Maier, CERN11 Related activities : –GANGA-DIRAC (LHCb production system) Convergence with GANGA/components/experience Submitting jobs to DIRAC using GANGA –GANGA-Condor Enabling submission of jobs through GANGA to Condor and DAGs –LHCb Metadata catalogue performance tests Collaboration with Taiwan colleagues, using their experience LHCb

ARDA WorkshopAndrew Maier, CERN12 LHCb –Current Status GANGA job submission handler for gLite has been developed DaVinci job running on gLite submitted through GANGA Submission of user jobs is working Using the gLite provided job-splitter works on the file level Command line interface (CLI) prototype for GANGA has been developed Can submit jobs using the gLite job-splitter

ARDA WorkshopAndrew Maier, CERN13 LHCb Working CLI example –Submits a job to gLite –Uses the gLite job splitter #!/usr/bin/env python import sys sys.path.append('/afs/cern.ch/sw/ganga/install/rh73_gcc32/cli /Ganga/python') from Ganga.CLI import * from Ganga.CLI.egee_handlers import Glite j = Job(backend="Glite",exe='subjob') j.backend.datafiles=['LF:/egee/user/a/andrew/bin/run', 'LF:/egee/user/a/andrew/bin/davinci.csh'] j.backend.voms_user = 'andrew' j.backend.split_file = 1 j.submit()

ARDA WorkshopAndrew Maier, CERN14 LHCb gLite handler in GANGA

ARDA WorkshopAndrew Maier, CERN15 LHCb Short term plans –Involve people from LHCb physics community (limited number) in testing for getting feed back from the user side –Integrating LHCb software releases with the gLite package manager

ARDA WorkshopAndrew Maier, CERN16 Basic components of the prototype defined by the experiment : ROOT AliROOT PROOF ARDA contributions : Enabling of batch and interactive analysis on gLite Main focus on the interactive analysis using PROOF, emphasising robustness and error recovery gLite–related activities ( all experiments can profit) : - C++ access library for gLite - C library for Posix like IO Alice

ARDA WorkshopAndrew Maier, CERN17 USER SESSION PROOF SLAVES PROOF PROOF MASTER SERVER PROOF SLAVES Site A Site C Site B PROOF Analysis system based on ROOT The ALICE/ARDA will evolve the ALICE analysis system (SuperComputing 2003) Alice

ARDA WorkshopAndrew Maier, CERN18 Forward Proxy Rootd Proofd Grid/Root Authentication Grid Access Control Service TGrid UI/Queue UI Proofd StartupPROOFClient PROOFMaster Slave Registration/ Booking- DB Site PROOF SLAVE SERVERS Site A PROOF SLAVE SERVERS Site BPROOFSteer Master Setup New Elements Grid Service Interfaces Grid File/Metadata Catalogue Client retrieves list of logical file (LFN + MSN) Booking Request with logical file names “Standard” Proof Session Slave ports mirrored on Master host Optional Site Gateway Master Client Grid-Middleware independend PROOF Setup Only outgoing connectivity Alice Improvements Overcoming outbound connectivity requirement of PROOF Middleware independent solution Flexible configuration of slave servers Make SiteGateway module optionall by using network tunnel technology Passing user libraries to slaves from ROOT session

ARDA WorkshopAndrew Maier, CERN19 Current status Batch analysis jobs are running on gLite middleware Interactive analysis is under way Improving robustness and error recovery of PROOF Parallelizing of the startup of PROOF slaves is implemented C++ access library and C IO libraries are developed, will be deployed very soon Short term plans Enable both batch and interactive analysis running on gLite by beginning of November Alice

ARDA WorkshopAndrew Maier, CERN20 ATLAS Basic component of the prototype DIAL- Distributed Analysis of Large datasets Dataset1Dataset2 Dataset Result1 Application CodeResult Task Result2 User analysis framework Scheduler Job1 Job2 Event data, summary data, tuples ROOT,JAS,SEAL Athena, dialpaw, ROOT Collects results Does splitting

ARDA WorkshopAndrew Maier, CERN21 ATLAS SE GRID 3 DQ server RLS SE LCG DQ server RLS SE Nordugrid DQ server RLS SE gLite DQ server FC DQ Client Don Quijote and gLite

ARDA WorkshopAndrew Maier, CERN22 ATLAS ARDA contribution: Integrating DIAL with gLite Enabling Atlas analysis jobs (Athena application) submitted through DIAL to run on gLite Integrate gLite with Atlas data management based on Don Quijote Related activities: Stress testing of the AMI metadata DB

ARDA WorkshopAndrew Maier, CERN23 ATLAS Example: ATLAS TRT data analysis PNPI St Petersburg Number of straw hits per layer Real data processed at gLite Standard RecExTB Data from CASTOR Processing on gLite workernode

ARDA WorkshopAndrew Maier, CERN24 ATLAS Current status : DIAL server has been adapted to CERN environment and installed at CERN First implementation of gLite scheduler for DIAL is developed Still depending on a shared file system for inter-job communication ATHENA jobs submitted through DIAL are run on gLite middleware Integration of gLite with Atlas file management based on Don Quijote is in progress Future plans : Evolve ATLAS prototype to work directly with glite middleware: –Authentication and seamless data access

ARDA WorkshopAndrew Maier, CERN25 The CMS system within ARDA is still under discussion Exploratory/preparatory activity Running CMS analysis jobs (ORCA application) on gLite Populating gLite catalog with CMS data collections, residing at CERN castor Trying gLite package manager to install and handle CMS software Adopting gLite job-splitting service for CMS use- cases CMS

ARDA WorkshopAndrew Maier, CERN26 While CMS decision for ARDA is pending: Started development of the first end-to-end prototype for enabling CMS analysis jobs on gLite Main strategy is to use as much of native middleware functionality as gLite can provide and only in case of very CMS specific tasks develop something on top of existing middleware CMS RefDB PubDB Workflow planner with gLite back-end and command line UI gLite Dataset and owner name defining CMS data collection Points to the corresponding PubDB where POOL catalog for a given data collection is published POOL catalog and a set of COBRA META files Register required info in gLite catalog Creates and submits jobs to gLite, Queries their status Retrieves output

ARDA WorkshopAndrew Maier, CERN27 CMS MonAlisa job monitoring in the CMS analysis prototype –Monitoring progress of splitted analysis jobs Analysis task is splitted into subjobs by gLite MW Each subjob reports its progress (e.g. number of analysed events) to the MonAlisa server. –MonAlisa output can support us to determine whether there is need to re-submit job(s). Submit & Split Master Job subjob MonAlisa monitoring Monitor & Re-submit Merge

ARDA WorkshopAndrew Maier, CERN28 Related activities : Data management task is vital for CMS - Participating in the development of PubDB (Publication DB) distributed data bases for publishing information about available data collection CMS- wide - Participating in the redesign of RefDB (Reference DB), CMS meta data catalog and production book- keeping data base - Development of CMS specific tool for extracting from RefDB META components (POOL catalog and COBRA META files) required for running analysis on a given data collection - RefDB stress testing CMS

ARDA WorkshopAndrew Maier, CERN29 Current status: Prototype development is under way ORCA analysis jobs were successfully run on gLite test-bed Future plans: Have the first version of the prototype ready for testing by limited number of CMS users by the end of October Depending on CMS decision either evolve this prototype according to the users feed back, or integrate it with the tool(s) which CMS would choose for ARDA prototype CMS

ARDA WorkshopAndrew Maier, CERN30 Conclusions ARDA is up and running –Since April 1 st preparing the ground for the experiments prototypes –Definition of the detailed work program –Contributions in the experiment-specific domain –Prototype development started and progressing well Next important steps –(More) real users –Need of more hardware resources –Both important for December 2004 milestone