Dan van der Ster for the Ganga Team

Slides:



Advertisements
Similar presentations
User view Ganga classes and functions can be used interactively at a Python prompt, can be referenced in scripts, or can be used indirectly via a Graphical.
Advertisements

Computing Lectures Introduction to Ganga 1 Ganga: Introduction Object Orientated Interactive Job Submission System –Written in python –Based on the concept.
GRID INTEROPERABILITY USING GANGA Soonwook Hwang (KISTI) YoonKee Lee and EunSung Kim (Seoul National Uniersity) KISTI-CCIN2P3 FKPPL Workshop December 1,
Experiment Support Introduction to HammerCloud for The LHCb Experiment Dan van der Ster CERN IT Experiment Support 3 June 2010.
DIRAC API DIRAC Project. Overview  DIRAC API  Why APIs are important?  Why advanced users prefer APIs?  How it is done?  What is local mode what.
Analysis demos from the experiments. Analysis demo session Introduction –General information and overview CMS demo (CRAB) –Georgia Karapostoli (Athens.
GANGA PANDA Dietrich Liko. Motivation Access to OSG resources by GANGA Collaboration with US colleagues Possibly an alternative way of submitting jobs.
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
K.Harrison CERN, 21st November 2002 GANGA: GAUDI/ATHENA AND GRID ALLIANCE - Background and scope - Project organisation - Technology survey - Design -
3 Sept 2001F HARRIS CHEP, Beijing 1 Moving the LHCb Monte Carlo production system to the GRID D.Galli,U.Marconi,V.Vagnoni INFN Bologna N Brook Bristol.
1 DIRAC – LHCb MC production system A.Tsaregorodtsev, CPPM, Marseille For the LHCb Data Management team CHEP, La Jolla 25 March 2003.
Belle MC Production on Grid 2 nd Open Meeting of the SuperKEKB Collaboration Soft/Comp session 17 March, 2009 Hideyuki Nakazawa National Central University.
Automated Grid Monitoring for LHCb Experiment through HammerCloud Bradley Dice Valentina Mancinelli.
Status of the LHCb MC production system Andrei Tsaregorodtsev, CPPM, Marseille DataGRID France workshop, Marseille, 24 September 2002.
Ganga A quick tutorial Asterios Katsifodimos Trainer, University of Cyprus Nicosia, Feb 16, 2009.
Enabling Grids for E-sciencE EGEE-III INFSO-RI Using DIANE for astrophysics applications Ladislav Hluchy, Viet Tran Institute of Informatics Slovak.
Giuseppe Codispoti INFN - Bologna Egee User ForumMarch 2th BOSS: the CMS interface for job summission, monitoring and bookkeeping W. Bacchi, P.
Experiment Support ANALYSIS FUNCTIONAL AND STRESS TESTING Dan van der Ster, CERN IT-ES-DAS for the HC team: Johannes Elmsheuser, Federica Legger, Mario.
Getting started DIRAC Project. Outline  DIRAC information system  Documentation sources  DIRAC users and groups  Registration with DIRAC  Getting.
Karsten Köneke October 22 nd 2007 Ganga User Experience 1/9 Outline: Introduction What are we trying to do? Problems What are the problems? Conclusions.
A PanDA Backend for the Ganga Analysis Interface J. Elmsheuser 1, D. Liko 2, T. Maeno 3, P. Nilsson 4, D.C. Vanderster 5, T. Wenaus 3, R. Walker 1 1: Ludwig-Maximilians-Universität.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Ricardo Rocha CERN (IT/GS) EGEE’08, September 2008, Istanbul, TURKEY Experiment.
INFSO-RI Enabling Grids for E-sciencE Ganga 4 – The Ganga Evolution Andrew Maier.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Ganga Tutorial From: Jakub T. Moscicki (CERN)
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
HammerCloud Functional tests Valentina Mancinelli IT/SDC 28/2/2014.
K. Harrison CERN, 3rd March 2004 GANGA CONTRIBUTIONS TO ADA RELEASE IN MAY - Outline of Ganga project - Python support for AJDL - LCG analysis service.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES Andrea Sciabà Hammercloud and Nagios Dan Van Der Ster Nicolò Magini.
Distributed Data Analysis with GANGA (Tutorial) Alexander Zaytsev Budker Institute of Nuclear Physics (BudkerINP), Novosibirsk On the basis of GANGA EGEE.
SPI NIGHTLIES Alex Hodgkins. SPI nightlies  Build and test various software projects each night  Provide a nightlies summary page that displays all.
2 June 20061/17 Getting started with Ganga K.Harrison University of Cambridge Tutorial on Distributed Analysis with Ganga CERN, 2.
ATLAS-specific functionality in Ganga - Requirements for distributed analysis - ATLAS considerations - DIAL submission from Ganga - Graphical interfaces.
The ATLAS Strategy for Distributed Analysis on several Grid Infrastructures D. Liko, IT/PSS for the ATLAS Distributed Analysis Community.
ATLAS Distributed Analysis Dietrich Liko IT/GD. Overview  Some problems trying to analyze Rome data on the grid Basics Metadata Data  Activities AMI.
Enabling Grids for E-sciencE CMS/ARDA activity within the CMS distributed system Julia Andreeva, CERN On behalf of ARDA group CHEP06.
Distributed Analysis Tutorial Dietrich Liko. Overview  Three grid flavors in ATLAS EGEE OSG Nordugrid  Distributed Analysis Activities GANGA/LCG PANDA/OSG.
David Adams ATLAS ATLAS Distributed Analysis (ADA) David Adams BNL December 5, 2003 ATLAS software workshop CERN.
CERN IT Department CH-1211 Genève 23 Switzerland t Distributed Analysis User Support in ATLAS (and LHCb) Dan van der Ster, CERN IT-GS & ATLAS.
David Adams ATLAS ATLAS Distributed Analysis and proposal for ATLAS-LHCb system David Adams BNL March 22, 2004 ATLAS-LHCb-GANGA Meeting.
INFSO-RI Enabling Grids for E-sciencE Ganga 4 Technical Overview Jakub T. Moscicki, CERN.
A GANGA tutorial Professor Roger W.L. Jones Lancaster University.
David Adams ATLAS ADA: ATLAS Distributed Analysis David Adams BNL December 15, 2003 PPDG Collaboration Meeting LBL.
Analysis vs Storage 4 the LHC Experiments (… ATLAS biased view ) Alessandro Di Girolamo CERN / INFN.
Breaking the frontiers of the Grid R. Graciani EGI TF 2012.
Geant4 GRID production Sangwan Kim, Vu Trong Hieu, AD At KISTI.
Acronyms GAS - Grid Acronym Soup, LCG - LHC Computing Project EGEE - Enabling Grids for E-sciencE.
Seven things you should know about Ganga K. Harrison (University of Cambridge) Distributed Analysis Tutorial ATLAS Software & Computing Workshop, CERN,
Grid User Interface:Ganga Farida Fassi Master de Physique Informatique Rabat, Maroc th, May, 2011.
Scientific Data Processing Portal and Heterogeneous Computing Resources at NRC “Kurchatov Institute” V. Aulov, D. Drizhuk, A. Klimentov, R. Mashinistov,
ATLAS TAG Services Jack Cranshaw with support from Thomas Doherty, Julius Hrivnac, Marcin Nowak.
Daniele Bonacorsi Andrea Sciabà
Distrubuited Analysis using GANGA
L’analisi in LHCb Angelo Carbone INFN Bologna
(on behalf of the POOL team)
Moving the LHCb Monte Carlo production system to the GRID
Key Activities. MND sections
GWE Core Grid Wizard Enterprise (
POW MND section.
BOSS: the CMS interface for job summission, monitoring and bookkeeping
BOSS: the CMS interface for job summission, monitoring and bookkeeping
A full demonstration based on a “real” analysis scenario
LHCb Computing Model and Data Handling Angelo Carbone 5° workshop italiano sulla fisica p-p ad LHC 31st January 2008.
Readiness of ATLAS Computing - A personal view
BOSS: the CMS interface for job summission, monitoring and bookkeeping
Building Grids with Condor
Jakub T. Moscicki (KUBA) CERN
The Ganga User Interface for Physics Analysis on Distributed Resources
Monitoring of the infrastructure from the VO perspective
D. van der Ster, CERN IT-ES J. Elmsheuser, LMU Munich
Presentation transcript:

Ganga User-friendly Grid job submission and management tool for LHC and beyond Dan van der Ster for the Ganga Team CHEP 2009 – Prague, Czech Republic – 23-27 March 2009

Outline Motivation for a modular end-user tool for The Grid. Introduction to Ganga Review of functionality, highlighting new features in Ganga 5 Stable release system and extensive testing framework Who is using Ganga? Ganga for LHCb, ATLAS, and others... Novel usage in the community Dan van der Ster – Ganga: User-friendly Grid job submission and management tool for LHC and beyond

What Users Want “configure once, run anywhere” Users want: Development on the laptop; full analysis on “The Grid™”. To get results quickly, utilizing all of the resources available, wherever they are. A familiar and consistent user interface to all of the resources. Users don’t want: To know the details of the grids or the resources. To learn yet another tool in order to access some resources To have to reconfigure their application to run on different resources. “configure once, run anywhere” Dan van der Ster – Ganga: User-friendly Grid job submission and management tool for LHC and beyond

Introduction to Ganga Ganga is a user-friendly job management tool. Jobs can run locally or on a number of batch systems and grids. Easily monitor the status of jobs running everywhere. To change where the jobs run, change one option and resubmit. Ganga is the main distributed analysis tool for LHCb and ATLAS. Experiment-specific plugins are included. Ganga is an open source community-driven project: Core development is joint between LHCb and ATLAS Modular architecture makes it extendible by anyone Mature and stable, with an organized development process Dan van der Ster – Ganga: User-friendly Grid job submission and management tool for LHC and beyond

Ganga User Interfaces Choose your own interface: CLI, GUI, or Scripting. *** Welcome to Ganga *** Version: Ganga-4-2-8 Documentation and support: http://cern.ch/ganga Type help() or help('index') for online help. In [1]: jobs Out[1]: Statistics: 1 jobs -------------- # id status name subjobs application backend backend.actualCE # 1 completed Executable LCG lcg- compute.hpc.unimelb.edu.au:2119/jobmanage CLIP GUI #!/usr/bin/env ganga #-*-python-*- import time j = Job() j.backend = LCG() j.submit() while not j.status in [‘completed’,’failed’]: print(‘job still running’) time.sleep(30) ./myjob.exec ganga ./myjob.exec In [1]:execfile(“myjob.exec”) GPI & Scripting To start Ganga: > ganga To start the GUI: > ganga --gui Dan van der Ster – Ganga: User-friendly Grid job submission and management tool for LHC and beyond

Submitting a Job with Ganga What is a Ganga Job? Hello world: Job().submit() Hello world on the EGEE grid: Job(backend=LCG()).submit() Listing of the existing jobs: jobs Get help (e.g. on a job): help(jobs) Display the nth job: jobs(n) Copy and resubmit the nth job: jobs(n).copy().submit() Copy and submit to another grid: j=jobs(n).copy() j.backend=DIRAC() j.submit() Kill and remove the nth job: job(n).kill() job(n).remove() Dan van der Ster – Ganga: User-friendly Grid job submission and management tool for LHC and beyond

New in Ganga 5 A new major version (Ganga 5) was released in summer 2008 Improved configuration interface Instance settings on command line : ganga -o[Logging]Ganga=DEBUG or within ganga: config.Logging # list the Logging section config.Logging.Ganga=‘DEBUG’ User settings in $HOME/.gangarc ganga -g generates an empty .gangarc with all options listed. [Logging] Ganga = DEBUG System settings in $GANGA_CONFIG_PATH E.g. system-wide GangaAtlas-v5.ini on the CERN AFS. Job slices and collective operations jobs[0:2] # list the 2 oldest jobs jobs[-3:].submit() # submit the 3 most recent jobs Job selection and collective operations jobs.select(status=‘completed’) # list completed jobs jobs.select(status=‘failed’).resubmit() # retry all failed jobs jobs.select(xrange(1,1000,2)).kill() # kill the odd jobs jobs.select(backend=‘Panda’).select(status=‘failed’).copy().submit() Also better jobs display, type checking of parameter and config options, repository speed and memory improvements Dan van der Ster – Ganga: User-friendly Grid job submission and management tool for LHC and beyond

Remote backend One new feature in 5.0.0 was the Remote backend Remote allows users to submit jobs from the anywhere, even if you don’t have the batch/grid client tools installed: Local ganga packages the input sandbox, connects to a remote ganga instance via ssh The remote instance actually submits and monitors via the real backend (e.g. LCG/Dirac/Panda) Useful for example to work around quota issues etc… Dan van der Ster – Ganga: User-friendly Grid job submission and management tool for LHC and beyond

Testing Framework & Releases One of the strengths of Ganga is the stable release procedure and extensive testing Release Procedure: Ganga developers rotate through 6 week terms as release manager Rel. mgr’s job is quite easy, most of the process is automated. When enough tags have been collected, a prelease is created Testing Framework: Each pre-release is validated with nearly 500 test cases Tests of new features and to prevent regressions Ideally, each bugfix gets a test case. Failures usually due to timeouts Dan van der Ster – Ganga: User-friendly Grid job submission and management tool for LHC and beyond

Who is using Ganga? Garfield Many projects using Ganga + DIANE HARP DIANE: tool to efficiently use resources

Ganga End-Users Over 1000 unique users in the past 6 months: Dip caused by monitoring outage Over 1000 unique users in the past 6 months: Generally 50% ATLAS (blue), 25% LHCb (green), 25% other Monthly ~500 unique ~2000 unique since January 2007 Dan van der Ster – Ganga: User-friendly Grid job submission and management tool for LHC and beyond

Ganga End-Users (2) Which interface is most popular? Peak is from one user running many scripting instances Which interface is most popular? Normally scripting (blue) and command line interface (green) see 50%/50% usage GUI is not used often… good for tutorials and learning, but CLI and scripts are more efficient. Dan van der Ster – Ganga: User-friendly Grid job submission and management tool for LHC and beyond

More than 137 unique domains using Ganga in past 6 months Ganga End-Users (3) More than 137 unique domains using Ganga in past 6 months Dan van der Ster – Ganga: User-friendly Grid job submission and management tool for LHC and beyond

Ganga for LHCb Ganga is the Grid UI for LHCb Main use is for running Gaudi jobs, including: Configuring Gaudi jobs Much easier to work with multiple configurations Specify the datasets Run the jobs locally, on batch systems and on the Grid via Dirac Managing the output data, Ntuples and histogram files. Also support ROOT jobs See Andrew Maier’s talk “User analysis of LHCb data with Ganga” 26-Mar-2009 @ 15:00 Dan van der Ster – Ganga: User-friendly Grid job submission and management tool for LHC and beyond

Ganga and ATLAS See Johannes Elmsheuser’s talk: Ganga is one of two front-ends to the ATLAS grids (OSG, EGEE, NG) Pathena and Ganga derive from a common “ATLAS Grid" library Plugins provided for all the analysis workflows: Athena & AthenaROOTAccess processing AODs, DPDs, ESDs AthenaMC app for private small MC production DQ2 data management plugins Flexible data access modes: local posix I/O, copy and process, FileStager See Johannes Elmsheuser’s talk: “Distributed Analysis in ATLAS using GANGA” 24-Mar-2009 @ 17:50 Dan van der Ster – Ganga: User-friendly Grid job submission and management tool for LHC and beyond

Ganga and ATLAS (2) As ATLAS moves toward the PanDA pilot-based workload management system for analysis, GangaPanda is getting development attention. One novel feature is the “Personal Pilot” application. Ganga sends a job to the Panda system If the site selected for the job does not actively receive pilots (e.g. glexec not available) then Ganga can send pilots directly to the site using the LCG backend. See poster “A PanDA Backend for the Ganga Analysis Interface” Dan van der Ster – Ganga: User-friendly Grid job submission and management tool for LHC and beyond

GangaRobot and HammerCloud Ganga is not only useful for end users: The Python API allows development of Grid applications Two examples come from the ATLAS Distributed Analysis Testing tools. GangaRobot is used to run many short functional tests daily to continuously validate the DA workflows. Results are fed into Ganga so that broken sites can be avoided. HammerCloud is used to run large stress tests: Measuring the behaviour of the storage, networks, databases, etc… under load. See talk “Functional and Large-Scale Testing of the ATLAS Distributed Analysis Facilities with Ganga” 26-Mar-2009 @ 15:20 Dan van der Ster – Ganga: User-friendly Grid job submission and management tool for LHC and beyond

Example from the Wider Community There are novel contributions coming from outside the core Ganga team: KISTI (Korea Institute of Science and Technology Information) and the WISDOM project Continued effort to use grids for Avian flu, Malaria drug searches Some of their resources could already be reached by the existing LCG/gLite backend. But, other resources are Globus sites, managed by GridWay. No GridWay backend, so they developed one. Further, they wanted to hide the backend details from the users: InterGrid backend: selects between LCG and GridWay backends using load information. See Soonwook Hwang et al. “An Approach to Grid Interoperability using Ganga”. EGEE User Forum, Catania, March 2-6 2009 Dan van der Ster – Ganga: User-friendly Grid job submission and management tool for LHC and beyond

Next Steps What is next for Ganga Core development? Developers meeting in January 2009 highlights: Further repository improvements: Testing an XML repository for improved speed and error resilience Unified output management: Remove the distinction between “sandboxes” and “data”, allowing arbitrary treatment of all job I/O Multi-stage or multi-application jobs: Prevent unnecessary storage of intermediate data by sending all stages of a job as one Timekeeping: Incorporate timestamps of status transitions in the Job structure. Dan van der Ster – Ganga: User-friendly Grid job submission and management tool for LHC and beyond

Conclusions Ganga is a user-friendly job management tool for Grid, Batch and Local systems “configure once, run anywhere” A stable development model: Well organized release procedure with extensive testing Plugin architecture allows new functionality to come from non-core developers Not just a UI – provides a Grid API on which many applications are built Strong development support from LHCb and ATLAS, and 25% usage in other VOs For more information visit http://cern.ch/ganga Dan van der Ster – Ganga: User-friendly Grid job submission and management tool for LHC and beyond