15/07/2010Swiss WLCG Operations Meeting Summary of the last GridKA Cloud Meeting (07 July 2010) Marc Goulette (University of Geneva)

Slides:



Advertisements
Similar presentations
National Grid's Contribution to LHCb IFIN-HH Serban Constantinescu, Ciubancan Mihai, Teodor Ivanoaica.
Advertisements

Status GridKa & ALICE T2 in Germany Kilian Schwarz GSI Darmstadt.
ATLAS Tier-3 in Geneva Szymon Gadomski, Uni GE at CSCS, November 2009 S. Gadomski, ”ATLAS T3 in Geneva", CSCS meeting, Nov 091 the Geneva ATLAS Tier-3.
NorthGrid status Alessandra Forti Gridpp13 Durham, 4 July 2005.
S. Gadomski, "ATLAS computing in Geneva", journee de reflexion, 14 Sept ATLAS computing in Geneva Szymon Gadomski description of the hardware the.
ATLAS computing in Geneva Szymon Gadomski, NDGF meeting, September 2009 S. Gadomski, ”ATLAS computing in Geneva", NDGF, Sept 091 the Geneva ATLAS Tier-3.
LHCC Comprehensive Review – September WLCG Commissioning Schedule Still an ambitious programme ahead Still an ambitious programme ahead Timely testing.
Computing for ILC experiment Computing Research Center, KEK Hiroyuki Matsunaga.
SC4 Workshop Outline (Strong overlap with POW!) 1.Get data rates at all Tier1s up to MoU Values Recent re-run shows the way! (More on next slides…) 2.Re-deploy.
Status of the DESY Grid Centre Volker Guelzow for the Grid Team DESY IT Hamburg, October 25th, 2011.
Integration Program Update Rob Gardner US ATLAS Tier 3 Workshop OSG All LIGO.
CHEP – Mumbai, February 2006 The LCG Service Challenges Focus on SC3 Re-run; Outlook for 2006 Jamie Shiers, LCG Service Manager.
CERN IT Department CH-1211 Genève 23 Switzerland t EIS section review of recent activities Harry Renshall Andrea Sciabà IT-GS group meeting.
LCG Service Challenge Phase 4: Piano di attività e impatto sulla infrastruttura di rete 1 Service Challenge Phase 4: Piano di attività e impatto sulla.
FAX UPDATE 26 TH AUGUST Running issues FAX failover Moving to new AMQ server Informing on endpoint status Monitoring developments Monitoring validation.
John Gordon STFC-RAL Tier1 Status 9 th July, 2008 Grid Deployment Board.
Status of the production and news about Nagios ALICE TF Meeting 22/07/2010.
Grid Lab About the need of 3 Tier storage 5/22/121CHEP 2012, The need of 3 Tier storage Dmitri Ozerov Patrick Fuhrmann CHEP 2012, NYC, May 22, 2012 Grid.
WLCG Service Report ~~~ WLCG Management Board, 1 st September
ATLAS in LHCC report from ATLAS –ATLAS Distributed Computing has been working at large scale Thanks to great efforts from shifters.
CCRC’08 Weekly Update Jamie Shiers ~~~ LCG MB, 1 st April 2008.
Status Report of WLCG Tier-1 candidate for KISTI-GSDC Sang-Un Ahn, for the GSDC Tier-1 Team GSDC Tier-1 Team 12 th CERN-Korea.
Architecture and ATLAS Western Tier 2 Wei Yang ATLAS Western Tier 2 User Forum meeting SLAC April
CSCS Status Peter Kunszt Manager Swiss Grid Initiative CHIPP, 21 April, 2006.
BNL Tier 1 Service Planning & Monitoring Bruce G. Gibbard GDB 5-6 August 2006.
V.Ilyin, V.Gavrilov, O.Kodolova, V.Korenkov, E.Tikhonenko Meeting of Russia-CERN JWG on LHC computing CERN, March 14, 2007 RDMS CMS Computing.
1 LHCb on the Grid Raja Nandakumar (with contributions from Greig Cowan) ‏ GridPP21 3 rd September 2008.
Site Report --- Andrzej Olszewski CYFRONET, Kraków, Poland WLCG GridKa+T2s Workshop.
CERN IT Department CH-1211 Genève 23 Switzerland t Frédéric Hemmer IT Department Head - CERN 23 rd August 2010 Status of LHC Computing from.
INFSO-RI Enabling Grids for E-sciencE Enabling Grids for E-sciencE Pre-GDB Storage Classes summary of discussions Flavia Donno Pre-GDB.
SLACFederated Storage Workshop Summary For pre-GDB (Data Access) Meeting 5/13/14 Andrew Hanushevsky SLAC National Accelerator Laboratory.
BNL Service Challenge 3 Status Report Xin Zhao, Zhenping Liu, Wensheng Deng, Razvan Popescu, Dantong Yu and Bruce Gibbard USATLAS Computing Facility Brookhaven.
Data Management: US Focus Kaushik De, Armen Vartapetian Univ. of Texas at Arlington US ATLAS Facility, SLAC Apr 7, 2014.
EGI-InSPIRE EGI-InSPIRE RI DDM solutions for disk space resource optimization Fernando H. Barreiro Megino (CERN-IT Experiment Support)
LHCbComputing LHCC status report. Operations June 2014 to September m Running jobs by activity o Montecarlo simulation continues as main activity.
Plans for Service Challenge 3 Ian Bird LHCC Referees Meeting 27 th June 2005.
Doug Benjamin Duke University. 2 ESD/AOD, D 1 PD, D 2 PD - POOL based D 3 PD - flat ntuple Contents defined by physics group(s) - made in official production.
The Network & ATLAS Workshop on transatlantic networking panel discussion CERN, June Kors Bos, CERN, Geneva & NIKHEF, Amsterdam ( ATLAS Computing.
WLCG Service Report ~~~ WLCG Management Board, 31 st March 2009.
WLCG Service Report ~~~ WLCG Management Board, 18 th September
GridKa Summer 2010 T. Kress, G.Quast, A. Scheurer Migration of data from old to new dCache instance finished on Nov. 23 rd almost 500'000 files (600.
Eygene Ryabinkin, on behalf of KI and JINR Grid teams Russian Tier-1 status report May 9th 2014, WLCG Overview Board meeting.
ATLAS Distributed Computing perspectives for Run-2 Simone Campana CERN-IT/SDC on behalf of ADC.
Computing Operations Report 29 Jan – 7 June 2015 Stefan Roiser NCB 8 June 2015.
14/03/2007A.Minaenko1 ATLAS computing in Russia A.Minaenko Institute for High Energy Physics, Protvino JWGC meeting 14/03/07.
U.S. ATLAS Facility Planning U.S. ATLAS Tier-2 & Tier-3 Meeting at SLAC 30 November 2007.
SL5 Site Status GDB, September 2009 John Gordon. LCG SL5 Site Status ASGC T1 - will be finished before mid September. Actually the OS migration process.
Enabling Grids for E-sciencE INFSO-RI Enabling Grids for E-sciencE Gavin McCance GDB – 6 June 2007 FTS 2.0 deployment and testing.
CERN IT Department CH-1211 Genève 23 Switzerland t The Tape Service at CERN Vladimír Bahyl IT-FIO-TSI June 2009.
Data transfers and storage Kilian Schwarz GSI. GSI – current storage capacities vobox LCG RB/CE GSI batchfarm: ALICE cluster (67 nodes/480 cores for batch.
BNL dCache Status and Plan CHEP07: September 2-7, 2007 Zhenping (Jane) Liu for the BNL RACF Storage Group.
The Grid Storage System Deployment Working Group 6 th February 2007 Flavia Donno IT/GD, CERN.
WLCG Operations Coordination report Maria Alandes, Andrea Sciabà IT-SDC On behalf of the WLCG Operations Coordination team GDB 9 th April 2014.
1 5/4/05 Fermilab Mass Storage Enstore, dCache and SRM Michael Zalokar Fermilab.
ATLAS Distributed Computing ATLAS session WLCG pre-CHEP Workshop New York May 19-20, 2012 Alexei Klimentov Stephane Jezequel Ikuo Ueda For ATLAS Distributed.
Status of GSDC, KISTI Sang-Un Ahn, for the GSDC Tier-1 Team
LCG Accounting Update John Gordon, CCLRC-RAL 10/1/2007.
ATLAS Computing Model Ghita Rahal CC-IN2P3 Tutorial Atlas CC, Lyon
Pledged and delivered resources to ALICE Grid computing in Germany Kilian Schwarz GSI Darmstadt ALICE Offline Week.
Availability of ALICE Grid resources in Germany Kilian Schwarz GSI Darmstadt ALICE Offline Week.
Evolution of storage and data management
Kilian Schwarz ALICE Computing Meeting GSI, October 7, 2009
Daniele Bonacorsi Andrea Sciabà
WLCG IPv6 deployment strategy
LCG Service Challenge: Planning and Milestones
Update on Plan for KISTI-GSDC
Proposal for obtaining installed capacity
LHCb Software & Computing Status
Luca dell’Agnello INFN-CNAF
The LHCb Computing Data Challenge DC06
Presentation transcript:

15/07/2010Swiss WLCG Operations Meeting Summary of the last GridKA Cloud Meeting (07 July 2010) Marc Goulette (University of Geneva)

15/07/2010Swiss WLCG Operations Meeting Cloud Status (Guenter Duckeck) * New ATLAS contact Andreas Petzold at GridKa since 1st July * Operations running smooth in June, some problems: - GridKa tape reading tests failed, tape library broken - Freiburg extended downtime due to cooling problems - DESY-HH: ATLASSCRATCHDISK size/overload - DESY-ZN: observed ATLAS jobs with excessive memory usage * Amsterdam Jamboree: WLCG meeting on evolution of data and storage element - Trend to more dynamic data distribution (caching) rather than static placement - Several demonstrator projects in the next months - Might change/increase network usage * TAB and HGF-Grid PB meetings: - Discussed network situation in DE cloud - Started first analysis of ATLAS data transfer patterns using log information provided by sites + GridKa dominates + DE T2-T2 traffic low (<10%) + Interpretation difficult as GridKa numbers also include FTS 3rd-party transfers betweeen sites (but this is expected to be small contribution) + Some variations between sites (DESY has relatively large non-DE & CERN transfer fraction) - Discussion of network situation in DE cloud (see for details). Mixed situation wrt. network connectivity in Germany. - J. Schultes provided a script to parse dCache billing logs

15/07/2010Swiss WLCG Operations Meeting Cloud Status (Guenter Duckeck) * ATLAS GridKa F2F operations meeting on June 24 - Agenda and minutes: - Extensive and productive discussion of operation areas, monitoring, testing, documentation - Template for operations wiki (to be filled): - We should extend our cloud monitoring page + Job info (e.g. running, queued, CPU/Wallt for prod and user), + Storage info (e.g. space token usage, IO rates, movers) + Will discuss if/how sites could provide this information * ATLAS DE cloud computing meeting on July 19/20 - Main focus on user analysis experience and support - Plan to have 2 hrs T1/T2 operations meeting before - Preliminary agenda:

15/07/2010Swiss WLCG Operations Meeting TIER1 OPERATIONS (Gen Kawamura, Andreas Petzold): * dCache milestone file space is ready (ongoing this week, 2/3 ready, 1/3 yet to come) * FTS updated to latest release including OS upgrade * CREAM CE: cream-3-fzk available (CREAM 1.6 / SL5) cream-2-fzk had been drained and was updated * OPS tests switched off, nagios probes used instead * Upgrade of VOBOX * LFC: new on SL5 will be installed (no date yet) * dCache access statistics: - dcap access to all space tokens becoming more important - Most accessed files: COND, DBRELEASE, group.phys-top.D2PD (on SCRATCHDISK), DATADISK (see T1 report pdf file) * Tape problems of last month: All problems fixed, tape library back online but still not as reliable as required PRODUCTION OPERATIONS: * In general no problems to report, almost no production in June, "missing pilots"-problems under investigation

15/07/2010Swiss WLCG Operations Meeting DATA MANAGEMENT (Cedric Serfon): * Smooth operation in June * Overall transfer efficiency in the last 30 days: 96% (95% last month) * Volume transfers a bit lower (~1.7M files [June], ~2.5M [May], 136MB/s [June], 300MB/s [May]) * 2 file losses - Wuppertal (~19000 files) due to problem with disk controller - LRZ (~9000 files) backplane burned * It was recently proposed not to export MC (DATA) to T2s that do not have at least 50TB for ATLASMCDISK (ATLASDATADISK) - Until now only a proposition, will probably discussed in software week - Current situation: 3 sites in DE cloud to cross this threshold for at least one of their space tokens: Cyfronet, MPPMU, Innsbruck + CYFRONET: Will get new hardware this year and will be able to increase tokens to 50TB + MPPMU expect new hardware in September, Increased one token to 50TB + Innsbruck will add new hardware * LOCALGROUPDISK usage: - 22 user over 1TB (17 in May), Total space used: 173TB (110TB in May, 89TB in April) - Could run into problems soon with LOCALGROUPDISK filling up - Quota system still under development (probably not available before end of summer) * Discussion of provenance of files at Tier2 sites (obtained from Dashboard/site services): - Most of transfer volumes from GridKa - Exception for CSCS where more that 1/2 of files coming from CERN (caused by group user doing production at CERN

15/07/2010Swiss WLCG Operations Meeting TIER2 REPORT (Jan Erik Sundermann): * Discussion on space token usage (see pdf file) - CYFRONET MCDISK close to be full * Accounting (see pdf file) - Almost no production in June - See increased user activity (mainly via PANDA pilots) SOFTWARE INSTALLATION (Joerg Meyer), see pdf file: * Most sites have the latest releases installed. Some smaller problems under investigation