GridPP: the UK's contribution to the international collaboration building a worldwide Grid, the LHC Computing Grid GridPP – is the system usable? Tony.

Slides:



Advertisements
Similar presentations
D. Britton GridPP Status - ProjectMap 8/Feb/07. D. Britton08/Feb/2007GridPP Status GridPP2 ProjectMap.
Advertisements

User Board - Supporting Other Experiments Stephen Burke, RAL pp Glenn Patrick.
GridPP Deployment Status, User Status and Future Outlook Tony Doyle.
1 User Analysis Workgroup Update  All four experiments gave input by mid December  ALICE by document and links  Very independent.
The LHC Computing Grid – February 2008 The Worldwide LHC Computing Grid Dr Ian Bird LCG Project Leader 15 th April 2009 Visit of Spanish Royal Academy.
15th January, NGS for e-Social Science Stephen Pickles Technical Director, NGS Workshop on Missing e-Infrastructure Manchester, 15 th January, 2007.
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
December 17th 2008RAL PPD Computing Christmas Lectures 11 ATLAS Distributed Computing Stephen Burke RAL.
Resources and Financial Plan Sue Foffano WLCG Resource Manager C-RRB Meeting, 12 th October 2010.
GridPP use- interoper- communic- ability Tony Doyle.
5 November 2001F Harris GridPP Edinburgh 1 WP8 status for validating Testbed1 and middleware F Harris(LHCb/Oxford)
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
Monitoring the Grid at local, national, and Global levels Pete Gronbech GridPP Project Manager ACAT - Brunel Sept 2011.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Monitoring for the LHC experiments Irina Sidorova (CERN, JINR) on.
Nick Brook Current status Future Collaboration Plans Future UK plans.
Rackspace Analyst Event Tim Bell
3 June 2004GridPP10Slide 1 GridPP Dissemination Sarah Pearce Dissemination Officer
The LHC Computing Grid – February 2008 The Worldwide LHC Computing Grid Dr Ian Bird LCG Project Leader 25 th April 2012.
ATLAS and GridPP GridPP Collaboration Meeting, Edinburgh, 5 th November 2001 RWL Jones, Lancaster University.
GridPP3 Project Management GridPP20 Sarah Pearce 11 March 2008.
Tony Doyle - University of Glasgow 1 July 2005Oversight Committee GridPP: Executive Summary Tony Doyle.
The ILC And the Grid Andreas Gellrich DESY LCWS2007 DESY, Hamburg, Germany
GridPP Deployment & Operations GridPP has built a Computing Grid of more than 5,000 CPUs, with equipment based at many of the particle physics centres.
Slide David Britton, University of Glasgow IET, Oct 09 1 Prof. David Britton GridPP Project leader University of Glasgow GridPP Computing for Particle.
Enabling Grids for E-sciencE System Analysis Working Group and Experiment Dashboard Julia Andreeva CERN Grid Operations Workshop – June, Stockholm.
Metadata requirements for HEP Paul Millar. Slide 2 12 September 2007 Metadata requirements for HEP Some of the players in this game... WLCG – Umbrella.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Next steps with EGEE EGEE training community.
Overall Goal of the Project  Develop full functionality of CMS Tier-2 centers  Embed the Tier-2 centers in the LHC-GRID  Provide well documented and.
Dan Tovey, University of Sheffield User Board Overview Dan Tovey University Of Sheffield.
GridPP Building a UK Computing Grid for Particle Physics Professor Steve Lloyd, Queen Mary, University of London Chair of the GridPP Collaboration Board.
GridPP: Running a Production Grid Stephen Burke CLRC/RAL On behalf of the GridPP Deployment & Operations Team UK e-Science All-hands, Nottingham, 21 st.
…building the next IT revolution From Web to Grid…
The LHC Computing Grid – February 2008 The Challenges of LHC Computing Dr Ian Bird LCG Project Leader 6 th October 2009 Telecom 2009 Youth Forum.
Les Les Robertson LCG Project Leader High Energy Physics using a worldwide computing grid Torino December 2005.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
Grid User Interface for ATLAS & LHCb A more recent UK mini production used input data stored on RAL’s tape server, the requirements in JDL and the IC Resource.
ATLAS is a general-purpose particle physics experiment which will study topics including the origin of mass, the processes that allowed an excess of matter.
LHCbComputing Manpower requirements. Disclaimer m In the absence of a manpower planning officer, all FTE figures in the following slides are approximate.
LCG Storage Accounting John Gordon CCLRC – RAL LCG Grid Deployment Board September 2006.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The EGEE User Support Infrastructure Torsten.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Ricardo Rocha CERN (IT/GS) EGEE’08, September 2008, Istanbul, TURKEY Experiment.
EGEE is a project funded by the European Union under contract IST Support in EGEE Ron Trompert SARA NEROC Meeting, 28 October
Documentation (& User Support) Issues Stephen Burke RAL DB, Imperial, 12 th July 2007.
Certification and test activity ROC/CIC Deployment Team EGEE-SA1 Conference, CNAF – Bologna 05 Oct
Testing and integrating the WLCG/EGEE middleware in the LHC computing Simone Campana, Alessandro Di Girolamo, Elisa Lanciotti, Nicolò Magini, Patricia.
LCG GDB LCG User Support 8 February 2005 – n o 1 LCG/EGEE User Support Flavia Donno LCG/INFN-Pisa
Julia Andreeva on behalf of the MND section MND review.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Monitoring of the LHC Computing Activities Key Results from the Services.
LCG WLCG Accounting: Update, Issues, and Plans John Gordon RAL Management Board, 19 December 2006.
MND review. Main directions of work  Development and support of the Experiment Dashboard Applications - Data management monitoring - Job processing monitoring.
INFSO-RI Enabling Grids for E-sciencE The EGEE Project Owen Appleton EGEE Dissemination Officer CERN, Switzerland Danish Grid Forum.
LCG User Level Accounting John Gordon CCLRC-RAL LCG Grid Deployment Board October 2006.
Distributed Physics Analysis Past, Present, and Future Kaushik De University of Texas at Arlington (ATLAS & D0 Collaborations) ICHEP’06, Moscow July 29,
CERN - IT Department CH-1211 Genève 23 Switzerland t Grid Reliability Pablo Saiz On behalf of the Dashboard team: J. Andreeva, C. Cirstoiu,
WLCG: The 1 st year with data & looking to the future WLCG: Ian Bird, CERN WLCG Project Leader WLCG Project LeaderLCG-France; Strasbourg; 30 th May 2011.
Breaking the frontiers of the Grid R. Graciani EGI TF 2012.
LCG Workshop User Support Working Group 2-4 November 2004 – n o 1 Some thoughts on planning and organization of User Support in LCG/EGEE Flavia Donno LCG.
II EGEE conference Den Haag November, ROC-CIC status in Italy
Operation team at Ccin2p3 Suzanne Poulat –
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
LHC Computing at RAL PPD Dave Newbold RAL PPD / University of Bristol The LHC computing challenge PPD and the Grid Computing for physics PPD added value.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Pierre Auger Observatory Jiří Chudoba Institute of Physics and CESNET, Prague.
WLCG – Status and Plans Ian Bird WLCG Project Leader openlab Board of Sponsors CERN, 23 rd April 2010.
EGEE is a project funded by the European Union under contract IST ROC-IT User Support in the EGEE infrastructure Riccardo Brunetti INFN-Torino.
Job Priorities and Resource sharing in CMS A. Sciabà ECGI meeting on job priorities 15 May 2006.
INFN-Grid WS, Bari, 2004/10/15 Andrea Caltroni, INFN-Padova Marco Verlato, INFN-Padova Andrea Ferraro, INFN-CNAF Bologna EGEE User Support Report.
Ian Bird GDB Meeting CERN 9 September 2003
Readiness of ATLAS Computing - A personal view
LHCb Grid Computing LHCb is a particle physics experiment which will study the subtle differences between matter and antimatter. The international collaboration.
Presentation transcript:

GridPP: the UK's contribution to the international collaboration building a worldwide Grid, the LHC Computing Grid GridPP – is the system usable? Tony Doyle

Tony Doyle - University of Glasgow Usable Systems 21 September 2006 Summary GridPP runs a major part of the EGEE/LCG Grid, which supports ~3000 users The Grid is not (yet) as transparent as end-users want it to be The underlying overall failure rate is ~10% User (interface)s, middleware and operational procedures (need to) adapt (see talks by Dave Britton and Stephen Burke for more info. on performance and operations [now]) Procedures to manage the underlying problems such that system is usable are highlighted

Tony Doyle - University of Glasgow Usable Systems 21 September million hours “Active” User requires thousands of CPU hours EGEE CPU hours (1 April 2006 to 31 July 2006 )

Tony Doyle - University of Glasgow Usable Systems 21 September 2006 Virtual Organisations Users are grouped into Virtual Organisations –Users/VO varies from 1 to 806 members (and growing..) Broadly four classes of VO –LHC experiments –EGEE supported –Worldwide (mainly non-LHC particle physics) –Local/regional e.g. UK PhenoGrid Sites can choose which VOs to support, subject to MOU/funding commitments –Most GridPP sites support ~20 VOs –GridPP nominally allocates 1% of resources to EGEE non-HEP VOs –GridPP currently contributes 30% of the EGEE CPU resources

Tony Doyle - University of Glasgow Usable Systems 21 September 2006 User View? Perspective matters This talk is not –a usability survey –unbiased –representative Straw poll –users overcame initial registration hurdles within ~two weeks –users adapt to Grid in (un-)coordinated ways –The Grid was sufficiently flexible for many analysis applications

Tony Doyle - University of Glasgow Usable Systems 21 September 2006 Physics Analysis ESD: Data or Monte Carlo Event Tags Event Selection Analysis Object Data AOD Analysis Object Data AOD Calibration Data Analysis, Skims Raw Data Collaboration -wide Tasks Analysis Groups Individual Physicists Physics Analysis Physics Objects Physics Objects Physics Objects INCREASING DATA FLOWINCREASING DATA FLOW

Tony Doyle - University of Glasgow Usable Systems 21 September 2006 User evolution Number of UK Grid users (exc. Deployment Team) Quarter: 05Q4 06Q206Q3 Value: Many EGEE VOs supported c.f EGEE target Number of active users (> 10 jobs per month) Quarter: 05Q4 06Q1 06Q2 Value: Fraction: 6.2% 11.0% Viewpoint: growing fairly rapidly, but not as active as they could be? depends on the “active” definition

Tony Doyle - University of Glasgow Usable Systems 21 September atlas 763 dzero 577 cms 566 dteam 150 lhcb 131 alice 75 bio 65 dteamsgm 41 esr 31 ilc 27 atlassgm 27 alicesgm 21 cmsprg 18 atlasprg 17 fusn 15 zeus 13 dteamprg 13 cmssgm 11 hone 9 pheno 9 geant 7 babar 6 aliceprg 5 lhcbsgm 5 biosgm 3 babarsgm 2 zeussgm 2 t2k 2 geantsgm 2 cedar 1 phenosgm 1 minossgm 1 lhcbprg 1 ilcsgm 1 honesgm 1 cdf Know your users? UK-enabled VOs

Tony Doyle - University of Glasgow Usable Systems 21 September 2006 User Interface The GUI is relatively low-level (jobs, file collections) Dynamic panels for higher level functions Job details Logical Folders Job Monitoring Log window Job builder Scriptor Screenshot of the Ganga GUI Dockable windows

Tony Doyle - University of Glasgow Usable Systems 21 September 2006 Complex Applications ATLAS GANGA software framework (jointly with LHCb) data challenges producing Monte Carlo data 10 million CPU hours per year CMS Monte Carlo production, data transfer, job submission CMS transfers top a petabyte a month for the last three months LHCb DIRAC software to submit analysis jobs using Grid 2006 analysis job completion efficiency improved to 91%

Tony Doyle - University of Glasgow Usable Systems 21 September 2006 WLCG MoU Particle physicists collaborate, play roles and delegate –e.g. “prg” production group “sgm” software group managers Underpinned by Memoranda of Understanding Current MoU signatories: China France Germany Italy India Japan Netherlands Pakistan Portugal Romania Taiwan UK USA Pending signatures: Australia Belgium Canada Czech Republic Nordic Poland Russia Spain Switzerland Ukraine Negotiation w.r.t. resource and service level

Tony Doyle - University of Glasgow Usable Systems 21 September 2006 Resource allocation Need to assign quotas and priorities to VOs and measure delivery VOMS provides group/role information in the proxy Tools to control quotas and priorities in site services being developed –So far only at whole-VO level –Maui batch scheduler is flexible, easy to map to groups/roles –Sites set the target shares –Can publish VO/group-specific values in GLUE schema, hence the RB can use them for scheduling Accounting tool (APEL) measures CPU use at global level (UK task) –Storage accounting currently being added –GridPP monitors storage across UK –Privacy issues around user-level accounting, being solved by encryption

Tony Doyle - University of Glasgow Usable Systems 21 September 2006 User Support Becoming vital as the number of users grows –But modest effort available in the various projects Global Grid User Support (GGUS) portal at Karlsruhe provides a central ticket interface –Problems are categorised Tickets are classified by an on-duty Ticket Process Manager, and assigned to an appropriate support unit –UK (GridPP) contributes support effort GGUS has a web-service interface to ticketing systems at each ROC –Other support units are local mailing lists –Mostly best-effort support, working hours only Currently ~tens of tickets/week –Manageable, but may not scale much further –Some tickets slip through the net

Tony Doyle - University of Glasgow Usable Systems 21 September 2006 Documentation & Training Need documentation and training for both system managers and users –Mostly expert users up to now, but user community is expanding –Induction of new VOs is a particular problem – no peer support –EGEE is running User Fora for users to share experience Next in Manchester in May ’07 (with OGF) –EGEE has a dedicated training activity run by NeSC/Edinburgh Documentation is often a low priority, little dedicated effort –The rapid pace of change means that material requires constant review Effort on documentation is now increasing –GridPP has appointed a documentation officer GridPP web site, wiki –Installation manual for admins is good There is also a wiki for admins to share experience –Focus is now on user documentation New EGEE web site – coming soon

Tony Doyle - University of Glasgow Usable Systems 21 September 2006 Alternative view? The number of users in the Grid School for the Gifted is ~manageable now The system may be too complex, requiring too much work by the “average user”? Or the (virtual) help desk may not be enough? Or the documentation may be misleading? Or.. Having smart users helps (the current ones are)