Analysis of job submissions through the EGEE Grid Overview The Grid as an environment for large scale job execution is now moving beyond the prototyping.

Slides:



Advertisements
Similar presentations
London Tier2 Status O.van der Aa. Slide 2 LT 2 21/03/2007 London Tier2 Status Current Resource Status 7 GOC Sites using sge, pbs, pbspro –UCL: Central,
Advertisements

Workload Management Status of current activity GridPP 13, Durham, 6 th July 2005.
Your university or experiment logo here What is it? What is it for? The Grid.
Workload management Owen Maroney, Imperial College London (with a little help from David Colling)
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Infrastructure overview Arnold Meijster &
Lime is… …an interactive visual analysis tool connected to Dynamics NAV Graphical representation of NAV data in your browser Real-time visualization of.
The LHC Computing Grid – February 2008 The Worldwide LHC Computing Grid Dr Ian Bird LCG Project Leader 15 th April 2009 Visit of Spanish Royal Academy.
Grid Information Systems. Two grid information problems Two problems  Monitoring  Discovery We can use similar techniques for both.
ScotGrid: a Prototype Tier-2 Centre – Steve Thorn, Edinburgh University SCOTGRID: A PROTOTYPE TIER-2 CENTRE Steve Thorn Authors: A. Earl, P. Clark, S.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Pilot Test-bed Operations and Support Work.
Open Science Grid For CI-Days Internet2: Fall Member Meeting, 2007 John McGee – OSG Engagement Manager Renaissance Computing Institute.
Computing and LHCb Raja Nandakumar. The LHCb experiment  Universe is made of matter  Still not clear why  Andrei Sakharov’s theory of cp-violation.
Nicholas LoulloudesMarch 3 rd, 2009 g-Eclipse Testing and Benchmarking Grid Infrastructures using the g-Eclipse Framework Nicholas Loulloudes On behalf.
A short introduction to GRID Gabriel Amorós IFIC.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Configuring and Maintaining EGEE Production.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
:: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: :: GridKA School 2009 MPI on Grids 1 MPI On Grids September 3 rd, GridKA School 2009.
Open Science Grid For CI-Days Elizabeth City State University Jan-2008 John McGee – OSG Engagement Manager Manager, Cyberinfrastructure.
Real Time Monitor of Grid Job Executions Janusz Martyniak Imperial College London.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Julia Andreeva CERN (IT/GS) CHEP 2009, March 2009, Prague New job monitoring strategy.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Monitoring for the LHC experiments Irina Sidorova (CERN, JINR) on.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
Automated Grid Monitoring for LHCb Experiment through HammerCloud Bradley Dice Valentina Mancinelli.
Instrumentation of the SAM-Grid Gabriele Garzoglio CSC 426 Research Proposal.
Grid infrastructure analysis with a simple flow model Andrey Demichev, Alexander Kryukov, Lev Shamardin, Grigory Shpiz Scobeltsyn Institute of Nuclear.
Costin Grigoras ALICE Offline. In the period of steady LHC operation, The Grid usage is constant and high and, as foreseen, is used for massive RAW and.
GridPP Deployment & Operations GridPP has built a Computing Grid of more than 5,000 CPUs, with equipment based at many of the particle physics centres.
Enabling Grids for E-sciencE EGEE-III INFSO-RI Using DIANE for astrophysics applications Ladislav Hluchy, Viet Tran Institute of Informatics Slovak.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks WMSMonitor: a tool to monitor gLite WMS/LB.
Enabling Grids for E-sciencE System Analysis Working Group and Experiment Dashboard Julia Andreeva CERN Grid Operations Workshop – June, Stockholm.
OSG Area Coordinator’s Report: Workload Management Maxim Potekhin BNL
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Performance Improvements to BDII - Grid Information.
Enabling Grids for E- sciencE EGEE and gLite are registered trademarks EGEE-III INFSO-RI Analysis of Overhead and waiting times.
GridPP Dirac Service The 4 th Dirac User Workshop May 2014 CERN Janusz Martyniak, Imperial College London.
1 LHCb on the Grid Raja Nandakumar (with contributions from Greig Cowan) ‏ GridPP21 3 rd September 2008.
Your university or experiment logo here What is it? What is it for? The Grid.
Website: Answering Continuous Queries Using Views Over Data Streams Alasdair J G Gray Werner.
Recent improvements in HLRmon, an accounting portal suitable for national Grids Enrico Fattibene (speaker), Andrea Cristofori, Luciano Gaido, Paolo Veronesi.
1 A lightweight Monitoring and Accounting system for LHCb DC'04 production V. Garonne R. Graciani Díaz J. J. Saborido Silva M. Sánchez García R. Vizcaya.
Testing and integrating the WLCG/EGEE middleware in the LHC computing Simone Campana, Alessandro Di Girolamo, Elisa Lanciotti, Nicolò Magini, Patricia.
Your university or experiment logo here Performance Monitoring Gidon Moont e-Science, HEP, Imperial College London Talk to JRA1.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks APEL CPU Accounting in the EGEE/WLCG infrastructure.
OSG Area Coordinator’s Report: Workload Management Maxim Potekhin BNL May 8 th, 2008.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Deliverable DSA1.4 Jules Wolfrat ARM-9 –
Global ADC Job Monitoring Laura Sargsyan (YerPhI).
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The LCG interface Stefano BAGNASCO INFN Torino.
Accounting in LCG/EGEE Can We Gauge Grid Usage via RBs? Dave Kant CCLRC, e-Science Centre.
EGEE is a project funded by the European Union under contract INFSO-RI Grid accounting with GridICE Sergio Fantinel, INFN LNL/PD LCG Workshop November.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Practical using WMProxy advanced job submission.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Imperial College SA3 Status David Colling.
CERN - IT Department CH-1211 Genève 23 Switzerland t Grid Reliability Pablo Saiz On behalf of the Dashboard team: J. Andreeva, C. Cirstoiu,
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The Grid Observatory: goals and challenges.
e-ScienceTalk: Supporting Grid and High Performance Computing Reporting across Europe GA No September 2010 – 31 July 2013.
e-ScienceTalk: Supporting Grid and High Performance Computing Reporting across Europe GA No September 2010 – 31 May 2013.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Overview of gLite, the EGEE middleware Mike Mineter Training Outreach Education National.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Job Management Claudio Grandi.
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
TIFR, Mumbai, India, Feb 13-17, GridView - A Grid Monitoring and Visualization Tool Rajesh Kalmady, Digamber Sonvane, Kislay Bhatt, Phool Chand,
INFSO-RI Enabling Grids for E-sciencE Diagnostic Tool Brainstorming Ratnadeep Abrol EGEE JRA4 F2F, DANTE, Cambridge 9 th May 2005.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Enabling Grids for E-sciencE Claudio Cherubino INFN DGAS (Distributed Grid Accounting System)
Pledged and delivered resources to ALICE Grid computing in Germany Kilian Schwarz GSI Darmstadt ALICE Offline Week.
A Statistical Analysis of Job Performance on LCG Grid David Colling, Olivier van der Aa, Mona Aggarwal, Gidon Moont (Imperial College, London)
On behalf of D. Colling, G. Moont, M. Aggarwal
Grid Computing: Running your Jobs around the World
GA No 1 September 2010 – 31 May 2013 Steven Newhouse Director EGI.eu
Nicolas Jacq LPC, IN2P3/CNRS, France
Dagmar Adamova (NPI AS CR Prague/Rez) and Maarten Litmaath (CERN)
The LHCb Computing Data Challenge DC06
Presentation transcript:

Analysis of job submissions through the EGEE Grid Overview The Grid as an environment for large scale job execution is now moving beyond the prototyping phase to real deployments over national and international scales providing real computational cycles to application scientists who are using it for real scientific applications. These real deployments are highlighting characteristics of these Grids which could not have been predicted before major deployment. In order to better understand these characteristics a full analysis of these Grids needs to be performed. In this work we analyse trace logs of over 70 million jobs collected from jobs executed through the Enabling Grids for EsciencE (EGEE) Grid. A large worldwide Grid consisting of over 41,000 CPU’s and 5PB of online disk storage spread over 240 institutions from over 45 countries. Users are members of different Virtual Organizations (VO’s) based on shared projects and/or geographical location. Execution Site SE CE Execution Site User Interface LB WMS Real Time Monitor Performanc e Store BIOMED Job submissions vs time Three main periods of activity – Grand Challenges. Many failed jobs at other time (testing). E RAW vs time During Challenges RAW efficiency is good, when job load is low RAW efficiency is variable – matches testing. Job Hours vs time Hours of successful work are high in comparison to failed hours. Failures happen fast (as desired). General increase in hours used.. E USER vs time User efficiency is good during challenges and inconsistent in-between – due to low job numbers. LHCb Job submissions vs time Production runs mid 2006 and They use pilot jobs which fail quickly if resource is wrong or no waiting job. E RAW vs time Pilot jobs make the RAW efficiency low. Especially when in production runs. Job Hours vs time Though many failed pilot jobs these take very little time so very little time is spent running failed jobs. E USER vs time User efficiency is very good as jobs which run do so quickly and in general there is a high job load from this VO. Performance Metrics We use the following metrics in this work: Job Submissions in a given day. Cumulative job submissions in a given day. Active Users per VO – number of users from a given VO that have been active on a given day. Job Hours – total number of hours consumed by a VO on a given day. E RAW = Total number of successful jobs in a day Total number of jobs in that day EUSER = Time job is running. Time Job is in system Growth of the EGEE Grid The Cumulative job submissions graph shows a greater than linear increase since September VO User Activity VO’s have between 43 and 1600 members though no more than 120 users were active on a given day, In general users will naturally interleave Grid work with other work. The weekly and annual cycles can be clearly seen here. Data Collection Imperial College London (High Energy Physics Group) have been developing tools for monitoring the state of jobs on the Grid. The culmination of this work is the Real Time Monitor (RTM The RTM queries the LB’s within the Grid directly and makes this information available to the end user in a graphical format. We have also been logging this information since September 2005, collecting statistics on over 70 million jobs.. Mona Aggarwal, David Colling, Janusz Martyniak, A. Stephen McGough, Gidon Moont and Olivier van der Aa Imperial College London SE CE SE CE SE CE Execution Site