SA2: Networking Support Status Report

Slides:



Advertisements
Similar presentations
Connect. Communicate. Collaborate I-SHARe Anand Patil, DANTE NML-WG, Open Grid Forum 22, Cambridge (MA), 26 February 2008.
Advertisements

EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Wrap up on perfSONAR-Lite_TSS and Network Troubleshooting Mario Reale GARR.
Connect. Communicate. Collaborate Place your organisation logo in this area End-to-End Coordination Unit Toby Rodwell, Network Engineer, DANTE TNLC, 28.
1 ESnet Network Measurements ESCC Feb Joe Metzger
Linking European and Chinese Research Infrastructures and Communities.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE II - Network Service Level Agreement (SLA) Establishment EGEE’07 Mary Grammatikou.
EGI: SA1 Operations John Gordon EGEE09 Barcelona September 2009.
FP6−2004−Infrastructures−6-SSA IPv6 and Grid Middleware: the EUChinaGRID experience Gabriella Paolini – GARR Valentino.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Steven Newhouse EGEE’s plans for transition.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks General relationships with EGEE JRA1 SA3.
FP6−2004−Infrastructures−6-SSA IPv6 in the EGEE Related Projects: the EUChinaGRID experience Gabriella Paolini – GARR.
Connect. Communicate. Collaborate Implementing Multi-Domain Monitoring Services for European Research Networks Szymon Trocha, PSNC A. Hanemann, L. Kudarimoti,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Xavier Jeannin Activity Manager CNRS EGEE-III.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-EGI Grid Operations Transition Maite.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks IPv6 test methodology Mathieu Goutelle (CNRS.
NORDUnet Nordic Infrastructure for Research & Education Workshop Introduction - Finding the Match Lars Fischer LHCONE Workshop CERN, December 2012.
EGEE-III-INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-III All Activity Meeting Brussels,
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The EGEE User Support Infrastructure Torsten.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-III Network activity overall Xavier.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Etienne Dublé - CNRS/UREC EGEE SA2 Xavier.
EGEE is a project funded by the European Union under contract IST Network Resources Provision Jean-Paul Gautier SA2 manager Cork meeting,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGI Operations Tiziana Ferrari EGEE User.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Xavier Jeannin (CNRS/UREC Paris, FR) 24.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Vassiliki Pouli
Enabling Grids for E-sciencE EGEE-II Meeting EGEE-II SA2 activity Tziouvaras Chrysostomos, MSc NTUA, 14 th March 2006.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Xavier Jeannin Activity Manager CNRS EGEE-III.
Connect communicate collaborate LHCONE Diagnostic & Monitoring Infrastructure Richard Hughes-Jones DANTE Delivery of Advanced Network Technology to Europe.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks ENOC - Status and plans Guillaume Cessieux.
PIC port d’informació científica EGEE – EGI Transition for WLCG in Spain M. Delfino, G. Merino, PIC Spanish Tier-1 WLCG CB 13-Nov-2009.
INFSO-RI Enabling Grids for E-sciencE NRENs & Grids Workshop Relations between EGEE & NRENs Mathieu Goutelle (CNRS UREC) EGEE-SA2.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Xavier Jeannin (CNRS/UREC Paris, FR) 24.
EGEE is a project funded by the European Union under contract IST Roles & Responsibilities Ian Bird SA1 Manager Cork Meeting, April 2004.
SA2 : Network Resource Provision All Activity Meeting – 17 March SA2 Execution Plan for the first year Jean-Paul Gautier SA2 Manager CNRS/UREC.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks A three years thorough review of a project’s.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA2 Networking support for EGEE III Xavier.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA1 & SA2-ENOC Interactions status and plans.
EGEE is a project funded by the European Union under contract IST JRA4 Overview Javier Orellana JRA4 Coordinator EGEE Kick Off Meeting SA2.
All Activities Meeting – 13/14 Jan SA2 Execution Plan for the first year Franck Bonnassieux CNRS/UREC EGEE is proposed as a project funded by.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Xavier Jeannin Activity Manager CNRS EGEE-III.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks LHCOPN Operational model: Roles and functions.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Etienne Dublé - CNRS/UREC EGEE SA2 Mario.
INFSO-RI Enabling Grids for E-sciencE Network Services Development Network Resource Provision 3 rd EGEE Conference, Athens, 20 th.
LHCOPN operational model Guillaume Cessieux (CNRS/FR-CCIN2P3, EGEE SA2) On behalf of the LHCOPN Ops WG GDB CERN – November 12 th, 2008.
Javier Orellana EGEE-JRA4 Coordinator CERN March 2004 EGEE is proposed as a project funded by the European Union under contract IST Network.
Connect. Communicate. Collaborate Place your organisation logo in this area End-to-End Coordination Unit Marian Garcia, Operations Manager, DANTE LHC Meeting,
EGI-InSPIRE EGI-InSPIRE RI Network Troubleshooting and PerfSONAR-Lite_TSS Mario Reale GARR.
TSA1.4 Infrastructure for Grid Management Tiziana Ferrari, EGI.eu EGI-InSPIRE – SA1 Kickoff Meeting1.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operating an Optical Private Network: the.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Xavier Jeannin (CNRS/UREC) All Hands meeting.
LHCOPN operational handbook Documenting processes & procedures Presented by Guillaume Cessieux (CNRS/IN2P3-CC) on behalf of CERN & EGEE-SA2 LHCOPN meeting,
INFSO-RI Enabling Grids for E-sciencE TNC 2005 Networking activities in EGEE Mathieu Goutelle (CNRS UREC, France) EGEE-SA2 activity.
Bob Jones EGEE Technical Director
LHC T0/T1 networking meeting
JRA2: Quality Assurance
Regional Operations Centres Core infrastructure Centres
Status of SA2 network monitoring and troubleshooting tools
EGEE is a project funded by the European Union
JRA1 Middleware Re-engineering Status Report
SA1 Execution Plan Status and Issues
EGEE SA2 / TERENA NRENs & Grids joint workshop
Ian Bird GDB Meeting CERN 9 September 2003
PerfSONAR: Development Status
Networking support (SA2) tasks for EGI
Maite Barroso, SA1 activity leader CERN 27th January 2009
Nordic ROC Organization
LCG Operations Workshop, e-IRG Workshop
Connecting the European Grid Infrastructure to Research Communities
ESnet Network Measurements ESCC Feb Joe Metzger
Network Technology Evolution
Network Technology Evolution
Presentation transcript:

SA2: Networking Support Status Report Xavier Jeannin Activity Manager CNRS EGEE-III First Review, 24-25 June, 2009

SA2 Overview 6 countries and one international entity SA2 Budget Country Total PM planned at M24 Total FTE France 96 4.0 Germany 12 0.5 Greece 18 0.8 Italy Russia 6 0.3 Spain DANTE (GEANT2) 3 0.1 Total PM planned at M24 153   6.4 SA2 Budget Networking Support – Xavier Jeannin - EGEE-III First Review 24-25 June 2009

SA2 Global view SA2 – EGEE-III TSA2.1 Running the ENOC TSA2.4 Management and general project tasks TSA2.2 Support for the ENOC Operational procedures (CNRS) TSA2.3 Overall Networking coordination WLCG Support (CNRS) IPv6 (GARR, CNRS) Operational tools and maintenance (RRC-KI, CNRS) IPv6 (GARR, CNRS) TT exchange standardization (GRNET) Monitoring (DFN) Advanced network services (GRNET) Troubleshooting (DFN) Site networking needs (RedIRIS) TNLC Networking Support – Xavier Jeannin - EGEE-III First Review 24-25 June 2009

EGEE Network Operation Centre A single point of contact between EGEE and the NRENs Sites GGUS Users Support Units NRENs GÉANT2 EGEE Network ENOC Role of the ENOC GÉANT2 NREN A RC 1 Grid site 1 NREN B RC 2 Grid site 2 Operated by DANTE Operated by NOC of NREN A Operated by NOC of NREN B Operated by NOC of RC2 Operated by NOC of RC1 A single point of contact between EGEE and the NRENs where EGEE and the network can exchange operational information A Network support unit in GGUS GGUS = global grid user support ENOC ensuring E2E connectivity for Grid sites Assess the impact on the Grid of network trouble Troubleshoot problems Provide support to users Identify the faulty domain Assess the network connectivity of the Grid sites ENOC ensuring E2E connectivity for Grid sites on the whole path Networking Support – Xavier Jeannin - EGEE-III First Review 24-25 June 2009

Network connectivity assessment Assessment for year 2008 on EGEE certified Grid sites (~ 300) (Tool DownCollector ) Network troubles are not concentrated on few sites More than half of connectivity problems detected are on-sites 80% of off-site network troubles are solved within 30 minutes Only ~ 45/month last more 80% Networking Support – Xavier Jeannin - EGEE-III First Review 24-25 June 2009 5

ENOC metrics 19 NRENS sending their tickets, 11 languages Network Language Kind ACONET German NREN CESNET Czech DFN E2ECU English LHCOPN GARR Italian GEANT2 REGIONAL GRNET Greek HEANET HUNGARNET Hungarian ILAN JANET NORDUNET PIONIER Polish RBNET/RUNNET Russian REDIRIS Spanish RENATER French SURFNET SWITCH TWAREN Chinese Total: 11 Very few Grid user notifications about network problems 19 NRENS sending their tickets, 11 languages Steady stream of 2 500 emails/mth, 800 tickets/mth 75% of European EGEE certified sites covered Usage information processed by the ENOC is more and more used Nb of Hits has been multiplied by 6 since 2008 Data downloaded have increased by 5 since 2008 Networking Support – Xavier Jeannin - EGEE-III First Review 24-25 June 2009

WLCG Support EGEE will be the main user of the LHCOPN SA2 has taken the lead in designing and implementing a pioneering federated operational model for the LHCOPN Distributed not centralized. Tiers are responsible for network operation (https://twiki.cern.ch/twiki/bin/view/LHCOPN/OperationalModel) Networking Support – Xavier Jeannin - EGEE-III First Review 24-25 June 2009

WLCG Support Processes were documented and disseminated Several meetings and training sessions help the dissemination Related tools were released, including a GGUS helpdesk tailored for the LHCOPN Implementation is ongoing and will be ready for LHC start-up Example of layer 2 incident management Networking Support – Xavier Jeannin - EGEE-III First Review 24-25 June 2009

Operational tools and maintenance Trouble matching and correlation for the ENOC Correlate tickets with monitoring data Better assessment of the impact on the Grid of trouble tickets Be able to warn the Grid operation in case of network connectivity outage of EGEE sites Networking Support – Xavier Jeannin - EGEE-III First Review 24-25 June 2009

Operational tools and maintenance First stage of our study The results are experimental and should improve Future work plan includes: Moving from experiment to production Automatic ticket ranking based on matching results Tuning of matching algorithm, possibly through more extensive use of the topology knowledge Networking Support – Xavier Jeannin - EGEE-III First Review 24-25 June 2009

Network monitoring tools Network monitoring tools for efficient troubleshooting PerfSONAR-Lite TroubleShooting Services Based on PerfSONAR-PS Launch test on demand from a Grid site under central server control: Bandwidth measurements DNS lookup Traceroute Port testing Ping Networking Support – Xavier Jeannin - EGEE-III First Review 24-25 June 2009

Network monitoring tools First beta-release is expected in June Beta-testers: CNRS, NorduNET, GARR. First version Autumn 2009 Detection of asymmetric traffic by launching a traceroute test on the remote site Networking Support – Xavier Jeannin - EGEE-III First Review 24-25 June 2009

Sites networking needs Assess network requirements (bandwidth, delay, jitter, etc.) for a site within the Grid, according to the kind of site and VOs supported Empirical approach Deployment of perfSONAR at country scale RedIRIS provides significant additional effort for this task than funded through EGEE First deployment in Europe over several domains (4 domains, 8 sites) of such solution (no appliance box is used) PerfSONAR is deployed into EGEE sites and into networks used. Issue about interoperability between perfSONAR versions perfSONAR MDM (Multi-Domain Monitoring) and perfSONAR PS First deployment end of September Networking Support – Xavier Jeannin - EGEE-III First Review 24-25 June 2009

Sites networking needs EGEE site USC EGEE site CESGA EGEE site IFAE EB-Santander0 IFCA EB-Bilbao0 TIER 1 EB-Santiago0 EGEE site PIC UB Regional Network EB-Iris4 GW-Barcelona0 Anella CESCA GW-Nacional2 GW-Madrid0 CAM EB-Barcelona0 GW-Nacional1 GW-Valencia0 UAM EB-Madrid0 EGEE site CIEMAT EGEE site EB-Iris2 IFIC EGEE site Topology of the network monitored by this task Networking Support – Xavier Jeannin - EGEE-III First Review 24-25 June 2009

Advanced network services Collaboration with AMPS team - Advanced Multi-domain Provisioning System – in order to automate network SLA establishment Development of a web interface to manage the EGEE SLA requests Store and manage the EGEE users’ SLA requests ENOC will act on behalf of the user The user request is stored into the ENOC The ENOC validates it and will then forward it to the AMPS system to make the reservation AutoBAHN (Automated Bandwidth Allocation across Heterogeneous Networks) has also been studied but seems not mature at the moment Networking Support – Xavier Jeannin - EGEE-III First Review 24-25 June 2009

Technical Network Liaison Committee TNLC (Technical Network Liaison Committee): Set up during EGEE in order to ease the technical discussions between EGEE, the NRENs and the GÉANT2 project Participants: EGEE SA2, GÉANT2 (represented by DANTE as coordinator of GÉANT2), some of the NRENs involved in the EGEE activities and CERN 2 meetings Work mainly focused on: Monitoring Design a solution for the Grid infrastructure Improvement of trouble ticket contents Improve the assessment of the impact of problems on the Grid Networking Support – Xavier Jeannin - EGEE-III First Review 24-25 June 2009

Trouble ticket exchange standardization Ticket normalization is very important to improve efficiency of project’s wide network operations (impact assessment) Standardizing interfaces with network providers EGEE initiated a standardization process Dissemination was also made through a submission of a RFC (draft-dzis-nwg-nttdm-00) about the normalization of the trouble tickets “The Network Trouble Ticket Data Model” Internet Draft http://tools.ietf.org/html/draft-dzis-nwg-nttdm-00 GRNET and the CNRS provided the ENOC with a central server translating NREN’s tickets into standard tickets Designed and implemented with open source software Trouble ticket status transition diagram Networking Support – Xavier Jeannin - EGEE-III First Review 24-25 June 2009

IPv6 IPv4 public address exhaustion  Hard to deploy new Grid sites Analysis of the gLite source code Using the IPv6 metric (IPv6 code checker) in ETICS to point out 75 parts of the code where there are indications of possible of non-compliant function calls: 16 invalid (i.e. duplicate, obsolete component, false positive, etc.), 29 fixed, 30 being fixed This analysis effectively helped developers to work on IPv6 Assessment of the evolution obtained on the gLite repository of ETICS IPv6 compliance of external dependencies Networking Support – Xavier Jeannin - EGEE-III First Review 24-25 June 2009 18

Current stand on gLite and IPv6 IPv6 compliance Full IPv6 compliance – for the production version LFC DPM globus-url-copy/gridFTP Full IPv6 compliance – for a prototype version BDII(perl)‏ IPv6 compliance to be tested/verified by SA2 – gLite part of the deployment module claimed to be IPv6 compliant CREAM BDII(python)‏ WMproxy/Job submission blah IPv6 porting currently on-going gfal lcgutils VOMS WMS-server IPv6 porting plan exist FTS Currently no known porting plans PX VObox MON dCache Torque C/S MPIutils Condorutils AMGA Networking Support – Xavier Jeannin - EGEE-III First Review 24-25 June 2009

IPv6 support 1/2 A new IPv6 code checker developed by SA2 IPv6 CARE http://sourceforge.net/projects/ipv6-care It monitors the execution of any program - even if you don’t have the source code - and detects networking function calls and provides the diagnosis Many informative studies https://twiki.cern.ch/twiki/bin/view/EGEE/IPv6FollowUp IPv6 programming method C/C++, Java, Python and Perl / IPv6 testing method gSOAP / Axis / Axis2 / Boost:asio / gridFTP / PythonZSI / PerlSOAPLite Assessment of the IPv6 compliance of gLite components: DPM & LFC Networking Support – Xavier Jeannin - EGEE-III First Review 24-25 June 2009

IPv6 support 2/2 SA2 provides 2 testbeds (Rome/Paris) to check IPv6 compliance Dissemination: meetings, training session, demonstration, video Demonstration of the 2 first dual stack IPv4/IPv6 sites of EGEE at User Forum 09  smooth transition to IPv6 IPv6 next step Integration into EGEE validation process Testing new gLite IPv6 modules Networking Support – Xavier Jeannin - EGEE-III First Review 24-25 June 2009

SA2 summary SA2 activity has completed all tasks and objectives for this first year of EGEE-III ENOC Deployment of PerfSONAR-Lite TroubleShooting Services SA2 is providing an extra effort to design a network monitoring solution with NRENs and DANTE support Improve the impact assessment of trouble ticket by fostering collaboration with NRENs WLCG / LHCOPN: Design of the LHCOPN operational model IPv6 Improvement of gLite / 2 first dual-stack sites / smooth transition to IPv6 Trouble ticket exchange standardization Submission of a RFC, “The Network Trouble Ticket Data Model”, Internet Draft Collaboration with NRENs, TNLC EGEE 09 – TERENA NRENs & Grid joint meeting, Barcelona Sept. 2009 Transition toward EGI-NGI Network activity understaffed within the EGI-NGI structure Networking Support – Xavier Jeannin - EGEE-III First Review 24-25 June 2009