EGEE is a project funded by the European Union under contract IST-2003-508833 The ARDA Project Between Grid middleware and LHC experiments Juha Herrala.

Slides:



Advertisements
Similar presentations
Distributed Analysis at the LCG Torre Wenaus, BNL/CERN LCG Applications Area Manager Caltech Grid Enabled Analysis.
Advertisements

1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
D. Düllmann - IT/DB LCG - POOL Project1 POOL Release Plan for 2003 Dirk Düllmann LCG Application Area Meeting, 5 th March 2003.
1 Grid services based architectures Growing consensus that Grid services is the right concept for building the computing grids; Recent ARDA work has provoked.
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
LCG Milestones for Deployment, Fabric, & Grid Technology Ian Bird LCG Deployment Area Manager PEB 3-Dec-2002.
EGEE is a project funded by the European Union under contract IST “The ARDA project: Grid analysis prototypes of the LHC experiments” Massimo.
5 November 2001F Harris GridPP Edinburgh 1 WP8 status for validating Testbed1 and middleware F Harris(LHCb/Oxford)
K.Harrison CERN, 21st November 2002 GANGA: GAUDI/ATHENA AND GRID ALLIANCE - Background and scope - Project organisation - Technology survey - Design -
David Adams ATLAS ATLAS Distributed Analysis David Adams BNL March 18, 2004 ATLAS Software Workshop Grid session.
CHEP – Mumbai, February 2006 The LCG Service Challenges Focus on SC3 Re-run; Outlook for 2006 Jamie Shiers, LCG Service Manager.
David Adams ATLAS ATLAS Distributed Analysis Plans David Adams BNL December 2, 2003 ATLAS software workshop CERN.
EGEE is a project funded by the European Union under contract IST “ARDA status” Dietrich Liko / CERN XXVII HTASC, 10 September
DOSAR Workshop, Sao Paulo, Brazil, September 16-17, 2005 LCG Tier 2 and DOSAR Pat Skubic OU.
ATLAS and GridPP GridPP Collaboration Meeting, Edinburgh, 5 th November 2001 RWL Jones, Lancaster University.
K. Harrison CERN, 25th September 2003 GANGA: GAUDI/ATHENA AND GRID ALLIANCE - Project news - Ganga release 1 - Work towards Ganga release 2 - Interaction.
Status of the LHCb MC production system Andrei Tsaregorodtsev, CPPM, Marseille DataGRID France workshop, Marseille, 24 September 2002.
Responsibilities of ROC and CIC in EGEE infrastructure A.Kryukov, SINP MSU, CIC Manager Yu.Lazin, IHEP, ROC Manager
David Adams ATLAS ADA, ARDA and PPDG David Adams BNL June 28, 2004 PPDG Collaboration Meeting Williams Bay, Wisconsin.
Enabling Grids for E-sciencE System Analysis Working Group and Experiment Dashboard Julia Andreeva CERN Grid Operations Workshop – June, Stockholm.
Bob Jones Technical Director CERN - August 2003 EGEE is proposed as a project to be funded by the European Union under contract IST
EGEE is a project funded by the European Union under contract IST Middleware Planning for LCG/EGEE Bob Jones EGEE Technical Director e-Science.
JRA Execution Plan 13 January JRA1 Execution Plan Frédéric Hemmer EGEE Middleware Manager EGEE is proposed as a project funded by the European.
ARDA Prototypes Andrew Maier CERN. ARDA WorkshopAndrew Maier, CERN2 Overview ARDA in a nutshell –Experiments –Middleware Experiment prototypes (basic.
Metadata Mòrag Burgon-Lyon University of Glasgow.
David Adams ATLAS DIAL/ADA JDL and catalogs David Adams BNL December 4, 2003 ATLAS software workshop Production session CERN.
Grid User Interface for ATLAS & LHCb A more recent UK mini production used input data stored on RAL’s tape server, the requirements in JDL and the IC Resource.
LCG ARDA status Massimo Lamanna 1 ARDA in a nutshell ARDA is an LCG project whose main activity is to enable LHC analysis on the grid ARDA is coherently.
INFSO-RI Enabling Grids for E-sciencE Experience of using gLite for analysis of ATLAS combined test beam data A. Zalite / PNPI.
EGEE is a project funded by the European Union under contract IST “Interfacing to the gLite Prototype” Andrew Maier / CERN LCG-SC2, 13 August.
15 December 2015M. Lamanna “The ARDA project”1 The ARDA Project (meeting with the LCG referees) Massimo Lamanna CERN.
LCG LCG Workshop – March 23-24, Middleware Development within the EGEE Project LCG Workshop CERN March 2004 Frédéric Hemmer.
INFSO-RI Enabling Grids for E-sciencE Ganga 4 – The Ganga Evolution Andrew Maier.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
K. Harrison CERN, 3rd March 2004 GANGA CONTRIBUTIONS TO ADA RELEASE IN MAY - Outline of Ganga project - Python support for AJDL - LCG analysis service.
Testing and integrating the WLCG/EGEE middleware in the LHC computing Simone Campana, Alessandro Di Girolamo, Elisa Lanciotti, Nicolò Magini, Patricia.
K. Harrison CERN, 22nd September 2004 GANGA: ADA USER INTERFACE - Ganga release status - Job-Options Editor - Python support for AJDL - Job Builder - Python.
David Foster LCG Project 12-March-02 Fabric Automation The Challenge of LHC Scale Fabrics LHC Computing Grid Workshop David Foster 12 th March 2002.
David Adams ATLAS ATLAS-ARDA strategy and priorities David Adams BNL October 21, 2004 ARDA Workshop.
ATLAS-specific functionality in Ganga - Requirements for distributed analysis - ATLAS considerations - DIAL submission from Ganga - Graphical interfaces.
EGEE is a project funded by the European Union under contract IST Roles & Responsibilities Ian Bird SA1 Manager Cork Meeting, April 2004.
EGEE is a project funded by the European Union under contract IST “ARDA input for the testing coordination meeting” Massimo Lamanna CERN, 7.
INFSO-RI Enabling Grids for E-sciencE Using of GANGA interface for Athena applications A. Zalite / PNPI.
EGEE is a project funded by the European Union under contract IST ARDA Project Status Massimo Lamanna ARDA Project Leader NA4/HEP Cork, 19.
ATLAS Distributed Analysis Dietrich Liko IT/GD. Overview  Some problems trying to analyze Rome data on the grid Basics Metadata Data  Activities AMI.
Enabling Grids for E-sciencE CMS/ARDA activity within the CMS distributed system Julia Andreeva, CERN On behalf of ARDA group CHEP06.
EGEE is a project funded by the European Union under contract IST “The LCG ARDA prototypes” Workshop summary Organised by ARDA and the gLite.
Distributed Analysis Tutorial Dietrich Liko. Overview  Three grid flavors in ATLAS EGEE OSG Nordugrid  Distributed Analysis Activities GANGA/LCG PANDA/OSG.
INFSO-RI Enabling Grids for E-sciencE gLite Test and Certification Effort Nick Thackray CERN.
David Adams ATLAS ATLAS Distributed Analysis (ADA) David Adams BNL December 5, 2003 ATLAS software workshop CERN.
David Adams ATLAS ATLAS Distributed Analysis and proposal for ATLAS-LHCb system David Adams BNL March 22, 2004 ATLAS-LHCb-GANGA Meeting.
LHCC Referees Meeting – 28 June LCG-2 Data Management Planning Ian Bird LHCC Referees Meeting 28 th June 2004.
INFSO-RI Enabling Grids for E-sciencE Ganga 4 Technical Overview Jakub T. Moscicki, CERN.
David Adams ATLAS ADA: ATLAS Distributed Analysis David Adams BNL December 15, 2003 PPDG Collaboration Meeting LBL.
ARDA Massimo Lamanna / CERN Massimo Lamanna 2 TOC ARDA Workshop Post-workshop activities Milestones (already shown in December)
INFSO-RI Enabling Grids for E-sciencE File Transfer Software and Service SC3 Gavin McCance – JRA1 Data Management Cluster Service.
Breaking the frontiers of the Grid R. Graciani EGI TF 2012.
CMS Experience with the Common Analysis Framework I. Fisk & M. Girone Experience in CMS with the Common Analysis Framework Ian Fisk & Maria Girone 1.
EGEE is a project funded by the European Union under contract IST “ARDA status” Massimo Lamanna / CERN LCG-SC2, 13 August
EGEE is a project funded by the European Union under contract IST “The ARDA project: status report” Massimo Lamanna LCG PEB, 1 June
EGEE is a project funded by the European Union under contract IST “The ARDA project” Massimo Lamanna / CERN on behalf of the LCG-ARDA project.
Bob Jones EGEE Technical Director
EGEE Middleware Activities Overview
(on behalf of the POOL team)
Ian Bird GDB Meeting CERN 9 September 2003
Joint JRA1/JRA3/NA4 session
Dirk Düllmann CERN Openlab storage workshop 17th March 2003
The ARDA Project status report
LCG middleware and LHC experiments ARDA project
Presentation transcript:

EGEE is a project funded by the European Union under contract IST The ARDA Project Between Grid middleware and LHC experiments Juha Herrala ARDA Project GridPP 10 th Collaboration Meeting, CERN, 04 June cern.ch/lcg

04 June Contents Introduction to the LCG ARDA Project  History and mandate  Relation to EGEE project  ARDA prototypes  Relation to Regional Centres Towards ARDA prototypes  LHCb, ATLAS, ALICE, CMS Coordination and forum activities  Workshops and meetings Conclusions and Outlook

04 June How ARDA evolved LHC Computing Grid (LCG) project’s Requirements and Technical Assessment Group (RTAG) for distributed analysis presented their ARDA report in November  ARDA = Architectural Roadmap for Distributed Analysis  Defined a set of collaborating Grid services and their interfaces As a result the ARDA project was launched by LCG  ARDA = A Realisation of Distributed Analysis  Purpose is to coordinate different activities in the development of distributed analysis systems of the LHC experiments, which will be based on the new service-oriented Grid middleware and infrastructure. But the generic Grid middleware is developed by the EGEE project  Sometimes ARDA became also a synonym for this “second generation” Grid middleware, which was later (May 2004) renamed to Glite.  Generic = no significant functionality that is of interest for HEP or any other science/community alone.

04 June Our starting point / mandate Recommendations of the ARDA working group New service decomposition  Strong influence of Alien system the Grid system developed by the ALICE experiments and used by a wide scientific community (not only HEP) Role of experience, existing technology…  Web service framework Interfacing to existing middleware to enable their use in the experiment frameworks Early deployment of (a series of) end-to-end prototypes to ensure functionality and coherence  Middleware as a building block  Validation of the design EGEE Middleware LCG ARDA project

04 June EGEE and LCG ARDA LCG strongly linked with middleware developed/deployed in EGEE (continuation of EDG) The core infrastructure of the EGEE Grid operation service will grow out of the LCG service  LCG includes many US and Asian partners  EGEE includes other sciences  Substantial part of infrastructure common to both Parallel production lines  LCG-2 production Grid 2004 data challenges  Pre-production prototype EGEE/Glite MW ARDA playground for the LHC experiments ARDA EGEE WS MW EGEE-2EGEE-1LCG-2LCG-1 VDT/EDG

04 June Support LHC experiments to implement their end-to-end analysis prototypes based on the EGEE/Glite middleware  ARDA will equally support each of the LHC experiments  Close collaboration with data analysis teams, ensuring end-to-end coherence of the prototypes  One prototype per experiment Role of ARDA  Interface with the EGEE middleware  Adapt/verify components of analysis environments of the experiments (robustness/many users, performance/concurrent “read” actions)  A Common Application Layer may emerge in future  Feedback from the experiments to the middleware team Final target beyond the prototype activity: sustainable distributed analysis services for the four experiments deployed at LCG Regional Centres. ARDA prototypes

04 June Regional Centres Regional Centres have valuable practical experience and know how  Understand “deployability” issues, which is a key factor for (EGEE/Glite) middleware success  Data base technologies  Web Services Some Regional Centres will have the responsibility to provide early installation for the middleware  EGEE Middleware test bed  Pilot sites might enlarge the resources available and give fundamental feedback in terms of “deployability” to complement the EGEE SA1 Running ARDA pilot installations  ARDA test bed for analysis prototypes  Experiment data available where the experiment prototype is deployed Stress and performance tests could be ideally located outside CERN  Experiment-specific components (e.g. a Meta Data catalogue) which might be used by the ARDA prototypes  Exploit local know how of the Regional Centres Final ARDA goal: sustainable analysis service for LHC experiments

04 June ARDA project team Massimo Lamanna Birger Koblitz Derek Feichtinger Andreas Peters Dietrich Liko Frederik Orellana Julia Andreeva Juha Herrala Andrew Maier Kuba Moscicki Andrey Demichev Viktor Pose Wei-Long Ueng Tao-Sheng Chen LHCb CMS ATLAS ALICE Russia Taiwan Experiment interfaces Piergiorgio Cerello (ALICE) David Adams (ATLAS) Lucia Silvestris (CMS) Ulrik Egede (LHCb)

04 June Towards ARDA prototypes Existing systems as starting point  Every experiment has different implementations of the standard services  Used mainly in production environments  Now more emphasis on analysis

04 June Prototype activity Provide a fast feedback to the EGEE MW development team  Avoid uncoordinated evolution of the middleware  Coherence between users’ expectations and final product Experiments may benefit from the new MW as soon as possible  Frequent snapshots of the middleware available  Expose the experiments (and the community in charge of the deployment) to the current evolution of the whole system  Experiments’ systems are very complex and still evolving Move forward towards new-generation real systems (analysis!)  A lot of work (experience and useful software) is invested in current data challenges of the experiments, which makes them a concrete starting point  Whenever possible adapt/complete/refactorise the existing components: we do not need yet another system! Attract and involve users  Prototypes with realistic workload and conditions, thus real users from LHC experiments required!

04 June The initial prototype will have a reduced scope of functionality  Currently components are selection for the first prototype Not all use cases/operation modes will be supported  Every experiment has a production system (with multiple backends, like PBS, LCG, G2003, NorduGrid, …).  We focus on end-user analysis on a EGEE MW based infrastructure Informal Use Cases are still being defined, e.g. a generic analysis case:  A physicist selects a data sample (from current Data Challenges)  With an example/template as starting point (s)he prepares a job to scan the data  The job is split in sub-jobs, dispatched to the Grid, some error-recovery is automatically performed if necessary, and finally merged back in a single output  The output (histograms, ntuples) is returned together with simple information on the job-end status Prototype activity

04 June Towards ARDA prototypes LHCb - ARDA

04 June LHCb GANGA as a principal component  Friendly user interface for Grid services The LHCb/GANGA plans match naturally with the ARDA mandate  Goal is to enable physicists (via GANGA) to analyse the data being produced during 2004 for their studies  Have the prototype where the LHCb data will be the key (CERN, RAL, …) At the beginning, the emphasis will be focused on  Usability of GANGA  Validation of the splitting and merging functionality of users jobs The DIRAC system is also an important component  LHCb grid system, used mainly in production so far  Useful target to understand the detailed behaviour of LHCb-specific grid components, like the file catalog. Convergence between DIRAC and GANGA anticipated.

04 June GANGA Gaudi/Athena aNd Grid Alliance Gaudi/Athena: LHCb/ATLAS frameworks  The Athena uses Gaudi as a foundation Single “desktop” for a variety of tasks Help configuring and submitting analysis jobs Keep track of what they have done, hiding completely all technicalities  Resource Broker, LSF, PBS, DIRAC, Condor  Job registry stored locally or in the roaming profile  Automate config/submit/monitor procedures Provide a palette of possible choices and specialized plug-ins (pre-defined application configurations, batch/grid systems, etc.) Friendly user interface (CLI/GUI) is essential  GUI Wizard Interface Help users to explore new capabilities Browse job registry  Scripting/Command Line Interface Automate frequent tasks python shell embedded into the Ganga GUI GAUDI Program GANGA GUI JobOptions Algorithms Collective & Resource Grid Services Histograms Monitoring Results Grid Services GANGA UI BkSvc Bookkeeping Service WorkLoad Manager SE File catalog WLMProSvcMonitor Internal Model Profile Service GAUDI Program Instr. CE

04 June Release management procedure established  Software process and integration Testing, bug fix releases, tagging policies, etc.  Infrastructure Installation, packaging etc.  ARDA team member in charge Integration with job managers/resource brokers  Waiting for the EGEE middleware, we developed an interface to Condor  Use of Condor DAGMAN for splitting/merging and error recovery capability Design and development in next future  Integration with EGEE middleware  Command Line Interface  Evolution of Ganga features ARDA contribution to GANGA

04 June Client TAIWAN CERN Oracle DB Bookkeeping Server CPU Load Network Process time Web & XML- RPC Service performance tests CPU Load Network Process time DB I/O Sensor Network monitor Virtual Users CERN/Taiwan tests on LHCb metadata catalogue Clone Bookkeeping DB in Taiwan Install the WS layer Performance Tests  Database I/O Sensor  Bookkeeping Server performance tests Taiwan/CERN Bookkeeping Server DB XML-RPC Service performance tests CPU Load, Network send/receive sensor, Process time  Client Host performance tests CPU Load, Network send/receive sensor, Process time Feedback to LHCb metadata catalogue developers

04 June Towards ARDA prototypes ATLAS - ARDA

04 June ATLAS ATLAS has a relatively complex strategy for distributed analysis, addressing different areas with specific projects  Fast response (DIAL)  User-driven analysis (GANGA)  Massive production with multiple Grids, etc…  For additional information see the ATLAS Distributed Analysis (ADA) site: The ATLAS system within ARDA has been agreed  Starting point is the DIAL service model for distributed interactive analysis; users will be exposed to different user interface (GANGA) The AMI metadata catalog is a key component in ATLAS prototype  mySQL as a back end  Genuine Web Server implementation  Robustness and performance tests from ARDA In the start up phase, ARDA provided some assistance in developing production tools

04 June SOAP-Proxy Meta-Data (MySQL) User AMI studies in ARDA Atlas Metadata- Catalogue, contains File Metadata:  Simulation/Reconstruction- Version  Does not contain physical filenames Many problems still open:  Large network traffic overhead due to schema independent tables  SOAP Web Services proxy supposed to provide DB access Note that Web Services are “stateless” (not automatic handles to have the concept of session, transaction, etc…): 1 query = 1 (full) response  Large queries might crash server  Shall SOAP front-end proxy re- implement all the database functionality? Good collaboration in place with ATLAS-Grenoble Studied behaviour using many concurrent clients:

04 June Towards ARDA prototypes ALICE - ARDA

04 June Strategy:  The ALICE-ARDA will evolve the analysis system presented at SuperComputing 2003 ‘Grid-enabled PROOF’ Where to improve:  Heavily connected with the middleware services  “Inflexible” configuration  No chance to use PROOF on federated grids like LCG  User libraries distribution Activity on PROOF  Robustness  Error recovery ALICE USER SESSION PROOF SLAVES TcpRouter PROOF PROOF SLAVES PROOF MASTER SERVER PROOF SLAVES TcpRouter Site A Site C Site B

04 June Improved PROOF system Original problem: no support for hierarchical Grid infrastructure, only local cluster mode. The remote proof slaves look like a local proof slave on the master machine Booking service is usable also on local clusters PROOF PROOF SLAVE SERVERS Proxy proofd Proxy rootd Master Grid Services Booking

04 June Towards ARDA prototypes CMS - ARDA

04 June CMS The CMS system within ARDA is still under discussion Provide easy access (and possibly sharing) of data for the CMS users is a key issue in discussions RefDB McRunjob T0 worker nodes GDB castor pool Tapes Export Buffers Transfer agent RLSTMDB Reconstruction instructions Reconstruction jobs Reconstructed data Reconstructed data Checks what has arrived Updates Summaries of successful jobs CMS DC04 production

04 June CMS RefDB Potential starting point for the prototype Bookkeeping engine to plan and steer the production across different phases (simulation, reconstruction, to some degree into the analysis phase) Contains all necessary information except file physical location (RLS) and info related to the transfer management system (TMDB) The actual mechanism to provide these data to analysis users is under discussion Measuring performances underway (similar philosophy as for the LHCb Metadata catalog measurements)

04 June Coordination and forum activities  Workshops and meetings

04 June Coordination and forum activities Forum activities are seen as ‘fundamental’ in the ARDA project definition  ARDA will channel information to the appropriate recipients, especially to analysis-related activities and projects outside the ARDA prototypes  Ensures that new technologies can be exposed to the relevant community ARDA should organise a set of regular meetings  Aim is to discuss results, problems, new/alternative solutions and possibly agree on some coherent program of work. Workshop every three months.  The ARDA project leader organises this activity which will be truly distributed and lead by the active partners ARDA is embedded in EGEE  NA4, Application Identification and Support Special relation with LCG  LCG GAG is a forum for Grid requirements and use cases  Experiments representatives coincide with the EGEE NA4 experiments representatives

04 June Workshops and meetings  1 st ARDA workshop January 2004 at CERN; open Over 150 participants  2 nd ARDA workshop “The first 30 days of EGEE middleware” June at CERN; by invitation Expected 30 participants  EGEE NA4 Meeting mid July NA4/JRA1 (middleware) and NA4/SA1 (Grid operations) sessions Organised by M. Lamanna and F. Harris  3 rd ARDA workshop Currently scheduled for September 2004 close to CHEP; open

04 June Next ARDA workshop “The first 30 days of the EGEE middleware” CERN: of June 2004  Exceptionally by invitation only Monday, June 21  ARDA team / JRA1 team  ATLAS (Metadata database services for HEP experiments) Tuesday, June 22  LHCb (Experience in building web services for grid)  CMS (Data management) Wednesday, June 23  ALICE (Interactivity on the Grid)  Close out Info on the web:

04 June Conclusions

04 June Conclusions and Outlook LCG ARDA has started  Main objective: experiment prototypes for analysis  EGEE/Glite middleware becoming available  Good feedback from the LHC experiments  Good collaboration within EGEE project  Good collaboration with Regional Centres. More help needed. Main focus  Prototyping distributed analysis systems of LHC experiments.  Collaborate with the LHC experiments, the EGEE middleware team and the Regional Centres to set up the end-to-end prototypes. Aggressive schedule  Milestone for the first end-to-end prototypes is already December 2004.