GridPP: Executive Summary Tony Doyle. Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 Outline Exec 2 Summary Grid status High level.

Slides:



Advertisements
Similar presentations
S.L.LloydATSE e-Science Visit April 2004Slide 1 GridPP – A UK Computing Grid for Particle Physics GridPP 19 UK Universities, CCLRC (RAL & Daresbury) and.
Advertisements

Slide David Britton, University of Glasgow IET, Oct 09 1 Prof. David Britton GridPP Project leader University of Glasgow GridPP Oversight Committee Meeting.
Deployment metrics and planning (aka Potentially the most boring talk this week) GridPP16 Jeremy Coles 27 th June 2006.
GridPP4 – Revised Plan Implementing the PPAN recommendations.
RAL Tier1: 2001 to 2011 James Thorne GridPP th August 2007.
S.L.LloydGridPP Collaboration Meeting IC Sept 2002Slide 1 Introduction Welcome to the 5 th GridPP Collaboration Meeting Steve Lloyd, Chair of GridPP.
GridPP: Executive Summary Tony Doyle. Tony Doyle - University of Glasgow Oversight Committee 11 October 2007 Exec 2 Summary Grid Status: Geographical.
UK Agency for the support of: High Energy Physics - the nature of matter and mass Particle Astrophysics - laws from natural phenomena Astronomy - the.
GridPP From Prototype to Production David Britton 21/Sep/06 1.Context – Introduction to GridPP 2.Performance of the GridPP/EGEE/wLCG Grid 3.Some Successes.
15 May 2006Collaboration Board GridPP3 Planning Executive Summary Steve Lloyd.
Quarterly report ScotGrid Quarter Fraser Speirs.
Tony Doyle “GridPP2 Proposal”, GridPP7 Collab. Meeting, Oxford, 1 July 2003.
S.L.LloydGridPP CB 29 Oct 2002Slide 1 Agenda 1.Introduction – Steve Lloyd 2.Minutes of Previous Meeting (23 Oct 2001) 3.Matters Arising 4.Project Leader's.
Southgrid Status Pete Gronbech: 27th June 2006 GridPP 16 QMUL.
GridPP Status “How Ready are We for LHC Data-Taking?” Tony Doyle.
London Tier 2 Status Report GridPP 13, Durham, 4 th July 2005 Owen Maroney, David Colling.
UKI-SouthGrid Overview Face-2-Face Meeting Pete Gronbech SouthGrid Technical Coordinator Oxford June 2013.
Slide David Britton, University of Glasgow IET, Oct 09 1 Prof. David Britton GridPP Project leader University of Glasgow GridPP Vendor Day 30 th April.
Southgrid Status Report Pete Gronbech: February 2005 GridPP 12 - Brunel.
LCG Milestones for Deployment, Fabric, & Grid Technology Ian Bird LCG Deployment Area Manager PEB 3-Dec-2002.
CMS Report – GridPP Collaboration Meeting VI Peter Hobson, Brunel University30/1/2003 CMS Status and Plans Progress towards GridPP milestones Workload.
D. Britton Collaboration Board Meeting David Britton 16/Jul/07.
Quarterly report SouthernTier-2 Quarter P.D. Gronbech.
D. Britton GridPP Status - ProjectMap 22/Feb/06. D. Britton22/Feb/2006GridPP Status GridPP2 ProjectMap.
S.L.LloydGridPP CB 19 February 2003Slide 1 Agenda 1.Minutes of Previous Meeting (29 Oct 2002) 2.Matters Arising 3.GridPP2 Planning 4.EGEE 5.Any Other Business.
Computing for ILC experiment Computing Research Center, KEK Hiroyuki Matsunaga.
Tony Doyle GridPP – From Prototype To Production, GridPP10 Meeting, CERN, 2 June 2004.
Tony Doyle - University of Glasgow 12 January 2005Collaboration Board GridPP: Executive Summary Tony Doyle.
GridPP23 – Final Steps to Data David Britton, 8/Sep/09.
12th November 2003LHCb Software Week1 UK Computing Glenn Patrick Rutherford Appleton Laboratory.
Quarterly report ScotGrid Quarter Fraser Speirs.
Oxford Update HEPix Pete Gronbech GridPP Project Manager October 2014.
GridPP: UK Computing for Particle Physics Tony Doyle.
1 st EGEE Conference – April UK and Ireland Partner Dave Kant Deputy ROC Manager.
Organisation Management and Policy Group (MPG): Responsible for setting and policy decisions and resolving any issues concerning fractional usage, acceptable.
GridPP3 project status Sarah Pearce 14 April 2010 GridPP24 RHUL.
GridPP3 Project Management GridPP20 Sarah Pearce 11 March 2008.
Project Management Sarah Pearce 3 September GridPP21.
Tony Doyle - University of Glasgow 1 July 2005Oversight Committee GridPP: Executive Summary Tony Doyle.
Tony Doyle - University of Glasgow 6 September 2005Collaboration Meeting GridPP Overview (emphasis on beyond GridPP) Tony Doyle.
John Gordon CCLRC e-Science Centre LCG Deployment in the UK John Gordon GridPP10.
SouthGrid SouthGrid SouthGrid is a distributed Tier 2 centre, one of four setup in the UK as part of the GridPP project. SouthGrid.
Jeremy Coles UK LCG Operations The Geographical Distribution of GridPP Institutes Production Manager.
GridPP Deployment & Operations GridPP has built a Computing Grid of more than 5,000 CPUs, with equipment based at many of the particle physics centres.
Southgrid Technical Meeting Pete Gronbech: 26 th August 2005 Oxford.
11 March 2008 GridPP20 Collaboration meeting David Britton - University of Glasgow GridPP Status GridPP20 Collaboration Meeting, Dublin David Britton,
GridPP Deployment Status GridPP14 Jeremy Coles 6 th September 2005.
GridPP Building a UK Computing Grid for Particle Physics Professor Steve Lloyd, Queen Mary, University of London Chair of the GridPP Collaboration Board.
Tony Doyle - University of Glasgow 8 July 2005Collaboration Board Meeting GridPP Report Tony Doyle.
GridPP3 project status Sarah Pearce 24 April 2010 GridPP25 Ambleside.
Tony Doyle - University of Glasgow Introduction. Tony Doyle - University of Glasgow 6 November 2006ScotGrid Expression of Interest Universities of Aberdeen,
UK Tier 1 Centre Glenn Patrick LHCb Software Week, 28 April 2006.
Slide David Britton, University of Glasgow IET, Oct 09 1 Prof. David Britton GridPP Project leader University of Glasgow UK-T0 Meeting 21 st Oct 2015 GridPP.
Performance analysis extracts from GridPP OC metrics report For UKI operations meeting 15 th June 2005.
INFSO-RI Enabling Grids for E-sciencE The EGEE Project Owen Appleton EGEE Dissemination Officer CERN, Switzerland Danish Grid Forum.
LCG Accounting Update John Gordon, CCLRC-RAL WLCG Workshop, CERN 24/1/2007 LCG.
Your university or experiment logo here User Board Glenn Patrick GridPP20, 11 March 2008.
WLCG Status Report Ian Bird Austrian Tier 2 Workshop 22 nd June, 2010.
J Jensen/J Gordon RAL Storage Storage at RAL Service Challenge Meeting 27 Jan 2005.
Dominique Boutigny December 12, 2006 CC-IN2P3 a Tier-1 for W-LCG 1 st Chinese – French Workshop on LHC Physics and associated Grid Computing IHEP - Beijing.
Slide § David Britton, University of Glasgow IET, Oct 09 1 Prof. David Britton GridPP Project leader University of Glasgow GridPP delivering The UK Grid.
UK Status and Plans Catalin Condurache – STFC RAL ALICE Tier-1/Tier-2 Workshop University of Torino, February 2015.
18/12/03PPD Christmas Lectures 2003 Grid in the Department A Guide for the Uninvolved PPD Computing Group Christmas Lecture 2003 Chris Brew.
London Tier-2 Quarter Owen Maroney
Collaboration Meeting
Understanding the nature of matter -
Update on Plan for KISTI-GSDC
Tier-1 Status Progress and Difficulties A View from RAL
Collaboration Board Meeting
UK MoUs and Tier-1/A experiment shares
Presentation transcript:

GridPP: Executive Summary Tony Doyle

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 Outline Exec 2 Summary Grid status High level view 2006 Outturn Performance Monitoring Outlook for 2007 Beyond GridPP2 2007

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 Exec 2 Summary 2006 was the second full year for the UK Production Grid More than 5,000 CPUs and more than 1/2 Petabyte of disk storage The UK is the largest CPU provider on the EGEE Grid, with total CPU used of 15 GSI2k-hours in 2006 The GridPP2 project has met 69% of its original targets with 92% of the metrics within specification The initial LCG Grid Service is now starting and will run for the first 6 months of 2007 The aim is to continue to improve reliability and performance ready for startup of the full Grid service on 1st July 2007 The GridPP2 project has been extended by 7 months to April 2008 The outcome of the GridPP3 proposal to PPARC is awaited We anticipate a challenging period from Sept onwards

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 Grid Overview Aim: by 2008 (full year’s data taking) -CPU ~100MSI2k (100,000 CPUs) -Storage ~80PB - Involving >100 institutes worldwide -Build on complex middleware being developed in advanced Grid technology projects, both in Europe (Glite) and in the USA (VDT) 1.Prototype went live in September 2003 in 12 countries 2.Extensively tested by the LHC experiments in September February ,547 CPUs, 4398 TB storage Status in February 2007: 177 sites, 32,412 CPUs, 13,282 TB storage Monitoring via Grid Operations Centre

Tony Doyle - University of Glasgow Oversight Committee 8 February Resources 2006 CPU Usage by Region Via APEL accounting

Tony Doyle - University of Glasgow Oversight Committee 8 February Outturn Definitions: "Promised" is the total that was planned at the Tier-1/A (in the March 2005 planning) and Tier-2s (in the October 2004 Tier-2 MoU) for CPU and storage "Delivered" is the total that was physically installed for use by GridPP, including LCG and SAMGrid at Tier-2 and LCG and BaBar at Tier-1/A "Available" is available for LCG Grid use, i.e. declared via the EGEE mechanisms with storage via an SRM interface "Used" is as accounted for by the Grid Operations Centre

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 Resources Delivered CPU KSI2KStorage TB PromisedDeliveredRatioPromisedDeliveredRatio Brunel % % Imperial % % QMUL % % RHUL % % UCL % % Lancaster % % Liverpool % % Manchester % % Sheffield %3267% Durham %5479% Edinburgh711152% % Glasgow % % Birmingham % % Bristol391231% % Cambridge % % Oxford % % RAL PPD % % London % % NorthGrid % % ScotGrid % % SouthGrid % % Total % % Tier % % Tier-1 and Tier-2 total delivery is impressive and usage is improved Available CPU: 8.5 MSI2k Storage: 1.7 PB Disk: 0.54 PB Delivery of Tier-1 disk Used CPU:15 GSI2k-hours Disk: 0.26 PB Usage of Tier-2 CPU, disk Request: PPARC acceptance of the 2006 outturn

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 Available (KSI2K)Used (KSI2K Hours)Ratio 1Q062Q063Q064Q061Q062Q063Q064Q061Q062Q063Q064Q06 Brunel ,811105,014159,082643, %41.30%62.60%61.20% Imperial ,82883,62782,593557, %18.80%18.60%39.70% QMUL ,335612,564459,4271,259, %23.10%17.40%47.60% RHUL163 25,08521,940176,046147, %6.10%49.30%41.30% UCL121 42,21751,10673,763156, %19.30%27.80%59.10% Lancaster ,463402,774210,432297, %38.60%20.30%28.70% Liverpool ,218455,72740,551164, %35.20%3.10%12.70% Manchester ,8571,042,154248,704370, %66.10%9.90%9.20% Sheffield ,41159,86078,795127, %15.00%19.70%31.80% Durham ,69958,18533,67159, %33.20%19.20%33.70% Edinburgh666614,8294,6373,6414, %35.30%27.70%37.40% Glasgow ,77450,46272,105155, %22.20%70.10%8.90% Birmingham ,47331,79528,29953, %62.00%55.20%105.20% Bristol ,2088,9826, %45.70%57.00%41.80% Cambridge ,2282,4421, %2.70%2.90%2.20% Oxford ,09392,84182,28463, %65.40%58.00%45.10% RAL PPD ,919132,046143,648235, %82.10%20.50%33.60% London ,276874,251950,9112,765, %22.00%24.00%48.30% NorthGrid ,9491,960,515578,482959, %45.40%11.00%14.20% ScotGrid ,302113,284109,417220, %27.20%37.60%11.30% SouthGrid ,981266,118265,655361, %58.80%26.80%36.40% Total Tier ,403,5083,214,1681,904,4654,306, %35.10%18.10%27.90% Tier ,6361,089,9171,393,022992, %80.20%97.60%53.40% LCG CPU Usage

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 (measured by UK Tier-1 for all VOs) ~90% CPU efficiency due to i/o bottlenecks is OK Concern that this is currently ~75% Efficiency Each experiment needs to work to improve their system/deployment practice anticipating e.g. hanging gridftp connections during batch work target

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 (Tier-1 CPUs brought online on Jan 10) Tier-1 CPU fully utilised throughout 2006 (Grid & non-Grid) Added 64 Intel twin dual-core Woodcrests on Jan 10 Busy with Grid jobs within 30 minutes Utilisation

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 (Estimated utilisation based on gstat job slots/usage) UKI mirrors overall EGEE utilisation Average Utilisation for Q306: 66% Compared to target of ~70% CPU utilisation was a major T2 issue, but now improving.. Utilisation

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 CPU by experiment Used at Tier-2 (KSI2K Hours)Used at Tier-1 (KSI2K Hours) 1Q062Q063Q064Q061Q062Q063Q064Q06 ALICE ATLAS CMS LHCb BaBar CDF 1517 D H ZEUS Other LHC Total

Tony Doyle - University of Glasgow Oversight Committee 8 February CPU Usage by experiment UK Resources

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 LCG Disk Usage Available (TB)Used (TB)Ratio 1Q062Q063Q064Q061Q062Q063Q064Q061Q062Q063Q064Q06 Brunel %18.10%91.10% Imperial %69.40%51.70%72.00% QMUL %22.60%18.40%26.40% RHUL %10.60%7.70%27.30% UCL %54.30%32.60%70.00% Lancaster %24.70%56.30%21.30% Liverpool %16.30%50.00% Manchester %5.80%3.10% Sheffield %32.10%12.40%4.50% Durham %68.10%25.40%34.30% Edinburgh %45.10%9.50%19.50% Glasgow %15.00%70.80%12.10% Birmingham %31.80%41.60%72.20% Bristol %12.00%16.00%22.20% Cambridge %0.60%26.30%67.70% Oxford %1.10%0.00%15.60% RAL PPD %9.40%4.20%81.30% London %26.60%24.40%57.00% NorthGrid %12.80%26.40%8.10% ScotGrid %42.80%14.00%16.00% SouthGrid %9.30%13.20%67.20% Total Tier %19.60%22.80%21.50% Tier %93.70%121.40%122.30%

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 (individual rates) Aim: to maintain data transfers at a sustainable level as part of experiment service challenges File Transfers Current goals:goals >250Mb/s inbound-only >250Mb/s outbound-only >200Mb/s inbound and outbound

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 Approval for new (shared) machine room – ETA Summer Space for 300 racks. Procurement –March 06: 52 AMD 270 units, 21 disk servers (168TB data capacity) –FY 06/07: 47 disk servers (282TB disk capacity), 64 twin dual-core Intel Woodcrest 5130 units (550kSI2K) –FY 06/07 upcoming: further 210 TB disk capacity plus high-availability systems (redundant PSUs, hot-swappable paired HDDs) Storage commissioning saga –Ongoing problems with March kit. Firmware updates have now solved problem. (Disks on Areca 1170 in raid 6 experienced multiple dropouts during testing of WD drives) Move to CASTOR –Very support heavy but made available for CSA06 and performing well General - Air-con problems with high-temperatures triggering high pressure cut-outs in refrigerator gas circuits - July security incident - 10Gb CERN line in place. Second 10Gb line scheduled in 07Q1 Tier-1 Resource

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 e.g. Glasgow: UKI-SCOTGRID-GLASGOW 800 kSI2k 100 TB DPM Needed for LHC s t a rt- u p August 28 September 1 October 13 October 23 T2 Resources IC-HEP 440 KSI2K 52 TB dCache Brunel 260 KSI2K 5 TB DPM

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 GridPP Middleware incorporates.. Security Network Monitoring Information Services Grid Data Management Storage Interfaces Workload Management Middleware

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 MSN Outlook The results of the GridPP2+ project extension proposal to PPARC were made known to GridPP in November 2006 The effects on MSN are significant and particularly damaging with the overall effort reduced by more than a third from 13 to 8.3 FTEs WMS testing and contributions to EGEE SA3 will reduce GridPP work on metadata will cease and UK leadership will be lost, but this is known to be an area the experiments are keen to see tackled The reduction in Information and Monitoring effort will severely impact re- engineering work and support for R-GMA and compromises UK obligations in fulfilling the EGEE contract GridPP has recognised the importance of finishing the R-GMA re- engineering, thus meeting the R-GMA deliverables to EGEE and has therefore agreed (in consultation with PPARC) to meet the costs of maintaining the current staffing levels to the end of EGEE-II from within existing allocations The reduction in networking activities is likely to impact GridPP’s ability to optimise its use of the underlying JANET network Staff whose contracts will not be extended beyond the end of August 2007 have been informed

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 e.g. ATLAS Tier-2 Testing Most of the experiments are now well advanced in highly pragmatic deployment issues, particularly in advance of the LHC data at the end of 2007

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 Applications Outlook Products developed by GridPP are in mainstream use, and will form a vital component of the computing system of each LHC experiment for first data-taking and analysis However, almost all explicit funding for the further development and support of such products will terminate in September 2007, since it is now clear that this area will be supported neither via GridPP3 (as planned) nor the Rolling Grants round (as requested) This is a matter of concern both for the UK collaborations and the experiments as a whole Recovery plans are being prepared within each experiment, attempting to use non-specialist RA effort in tension with physics and hardware support, but there will be profound negative consequences for the continuation and maintenance of these projects

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 Dissemination Outlook Dissemination was one of the areas not fully funded in GridPP2+ The Dissemination Officer post was funded at 0.5 FTE (as at present), but the PPRP did not allocate funds to continue the Events Officer position Due to a large number of events and activities planned for the end of 2007, we aim to fund this position for some months out of the current dissemination budget

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 Hardware Outlook Planning for A profiled ramp-up of resources is planned throughout 2007 to meet the UK requirements of the LHC and other experiments The results are available for the Tier-1 and Tier-2sTier-1Tier-2s The Tier-1/A Board reviewed UK input to International MoU negotiations for the LHC experiments as well as providing input to the International Finance Committee for BaBar An impasse was reached in planning for 2007 No new investment in the BaBar Tier A analysis facility hardware is planned For LCG, the 2007 commitment for disk and CPU capacity can be met out of existing hardware already delivered

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 Timeline Proposal WritingProposal Defence Apr MayJunJulAugSepOct 31 st March – PPARC Call 16 th June – GridPP16 at QMUL 6 th September – 1 st PPRP review 1 st November – GridPP17 8 th November PPRP “visiting panel” 30 th November GridPP2+ outcome ~February GridPP3 outcome 13 th July – Bid Submitted CBOCCB Future? ~10 month process to propose/defend/define future programme

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 Scenario Planning – Resource Requirements [TB, kSI2k] GridPP requested a fair share of global requirements, according to experiment requirements Changes in the LHC schedule prompted a(nother) round of resource planning - presented to CRRB on Oct 24 th New UK resource requirements have been derived and incorporated in the scenario planning e.g. Tier-1

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 Input to Scenario Planning – Hardware Costing Empirical extrapolations with extrapolated (large) uncertainties Hardware prices have been re-examined following recent Tier-1 purchase CPU (woodcrest) was cheaper than expected based on extrapolation of previous 4 years of data

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 Scenario Planning An example 70% “minimum viable level” scenario [£m]

Tony Doyle - University of Glasgow Oversight Committee 8 February 2007 Beyond GridPP2 The separation between GridPP2+ and GridPP3 was primarily designed to ensure an early decision could be made on the extension in order to retain key staff Approval for the extension was received in late November but included major cuts in the middleware support area This is problematic in two ways: –EU-CCLRC contractual obligation –crucial 7 month ramp-up period - the worst time to cut back Problems are severely compounded by the outcome of the Rolling Grant round where much of the Applications support work will be lost during this same critical period We currently await the outcome of the GridPP3 bid in order to be able to assess the whole picture We anticipate a highly challenging period from September 2007 onwards