Enabling Grids for E-sciencE EGEE-III INFSO-RI-222667 Status of the EGI O-E-12 Task: Coordination of Network Support for EGI Mario Reale IGI / GARR

Slides:



Advertisements
Similar presentations
Connect. Communicate. Collaborate I-SHARe Anand Patil, DANTE NML-WG, Open Grid Forum 22, Cambridge (MA), 26 February 2008.
Advertisements

EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Wrap up on perfSONAR-Lite_TSS and Network Troubleshooting Mario Reale GARR.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks From ROCs to NGIs The pole1 and pole 2 people.
EGEE-II INFSO-RI Enabling Grids for E-sciencE AP ROC Min-Hong Tsai ASGC SA1 Transition Meeting May 8 th, 2008
Africa & Arabia ROC tutorial Introduction to A&A ROC Mario Reale GARR - Italy ASREN-JUNET Grid School - 24 November 2011 Africa & Arabia ROC Tutorial.
EGI: SA1 Operations John Gordon EGEE09 Barcelona September 2009.
INFSO-RI Enabling Grids for E-sciencE SA1: Cookbook (DSA1.7) Ian Bird CERN 18 January 2006.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Steven Newhouse EGEE’s plans for transition.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks General relationships with EGEE JRA1 SA3.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks ?? Athens, May 5-6th 2009 Community Support.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Report Mario Reale NGI IT / GARR HEPiX f2f meeting.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks David Kelsey RAL/STFC,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Xavier Jeannin Activity Manager CNRS EGEE-III.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team James Casey EGEE’08.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-EGI Grid Operations Transition Maite.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Bob Jones EGEE project director CERN.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks IPv6 test methodology Mathieu Goutelle (CNRS.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA1: Grid Operations Maite Barroso (CERN)
Enabling Grids for E-sciencE EGEE Applications Registry Current status & latest developments Marios Chatziangelou.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks David Fergusson, Emidio Giorgio, Gergely.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Business SSC for EGI SSC Workshop: Preparing.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The EGEE User Support Infrastructure Torsten.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Gergely Sipos Activity Deputy Manager MTA.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-III Network activity overall Xavier.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Etienne Dublé - CNRS/UREC EGEE SA2 Xavier.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Antonio Retico CERN, Geneva 19 Jan 2009 PPS in EGEEIII: Some Points.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Pre-production in EGEEIII Operation principles Antonio Retico EGEE-II / EGEE II SA1.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGI Operations Tiziana Ferrari EGEE User.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Using GStat 2.0 for Information Validation.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Xavier Jeannin (CNRS/UREC Paris, FR) 24.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Vassiliki Pouli
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Robin McConnell NA3 Activity Manager 02.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks ENOC - Status and plans Guillaume Cessieux.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Robin McConnell NA3 Activity Manager 28.
WLCG Laura Perini1 EGI Operation Scenarios Introduction to panel discussion.
PIC port d’informació científica EGEE – EGI Transition for WLCG in Spain M. Delfino, G. Merino, PIC Spanish Tier-1 WLCG CB 13-Nov-2009.
INFSO-RI Enabling Grids for E-sciencE NRENs & Grids Workshop Relations between EGEE & NRENs Mathieu Goutelle (CNRS UREC) EGEE-SA2.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Xavier Jeannin (CNRS/UREC Paris, FR) 24.
EGEE is a project funded by the European Union under contract IST Roles & Responsibilities Ian Bird SA1 Manager Cork Meeting, April 2004.
Enabling Grids for E-sciencE EGEE Applications Registry Current status & latest developments Marios Chatziangelou.
INFSO-RI SA2 ETICS2 first Review Valerio Venturi INFN Bruxelles, 3 April 2009 Infrastructure Support.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team Kickoff Meeting.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA2 Networking support for EGEE III Xavier.
Operations model Maite Barroso, CERN On behalf of EGEE operations WLCG Service Workshop 11/02/2006.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA1 & SA2-ENOC Interactions status and plans.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks NA5: Policy and International Cooperation.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Xavier Jeannin Activity Manager CNRS EGEE-III.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Mario Reale – GARR NetJobs: Network Monitoring Using Grid Jobs.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks What all NGIs need to do: Helpdesk / User.
INFSO-RI Enabling Grids for E-sciencE File Transfer Software and Service SC3 Gavin McCance – JRA1 Data Management Cluster Service.
INFSO-RI Enabling Grids for E-sciencE Network Services Development Network Resource Provision 3 rd EGEE Conference, Athens, 20 th.
EMI INFSO-RI Testbed for project continuous Integration Danilo Dongiovanni (INFN-CNAF) -SA2.6 Task Leader Jozef Cernak(UPJŠ, Kosice, Slovakia)
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Etienne Dublé.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Network Support Workshop Mario Reale / IGI - GARR EGI Network Support.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks NA5: Policy and International Cooperation.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The Dashboard for Operations Cyril L’Orphelin.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Astrophysical Cluster Session Claudio Vuerli,
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Regional tools use cases overview Peter Solagna – EGI.eu On behalf of the.
INFSO-RI Enabling Grids for E-sciencE GOCDB2 Matt Thorpe / Philippa Strange RAL, UK.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GOCDB4 Gilles Mathieu, RAL-STFC, UK An introduction.
EGI-InSPIRE EGI-InSPIRE RI Network Troubleshooting and PerfSONAR-Lite_TSS Mario Reale GARR.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks IT ROC: Vision for EGEE III Tiziana Ferrari.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Network Support task force January 24, 2011 EGI OMB f2f meeting Amsterdam.
TSA1.4 Infrastructure for Grid Management Tiziana Ferrari, EGI.eu EGI-InSPIRE – SA1 Kickoff Meeting1.
Bob Jones EGEE Technical Director
Status of SA2 network monitoring and troubleshooting tools
EGI Network Support task force: Proposal for the identified use cases
Ian Bird GDB Meeting CERN 9 September 2003
Networking support (SA2) tasks for EGI
Mario Reale – IGI / GARR Lyon, Sept 19, 2011
Presentation transcript:

Enabling Grids for E-sciencE EGEE-III INFSO-RI Status of the EGI O-E-12 Task: Coordination of Network Support for EGI Mario Reale IGI / GARR

Enabling Grids for E-sciencE EGEE-III INFSO-RI Contents O-E-12 definitions and goals O-E-12 status –Wrap up of the migration (final phase of EGEE III) –Current task tools –Overview of networking support within individual NGIs  Summary of the EGEE III questionnaire for NGIs for Network Support Next steps and challenges ahead

Enabling Grids for E-sciencE EGEE-III INFSO-RI O-E-12 Definition and Goals O-E-12 is the coordination of the network support for EGI Its goal is providing network support to EGI by –proposing useful synergies and promoting cooperation among EGI.eu, the national NGI efforts and the NRENs community –encouraging the definition and adoption of best practices –proposing common solutions and tools –liaising with the NRENs community and GEANT (DANTE) Provisioned through the EGI-Inspire tasks TSA1.7 (Support Teams) and TSA1.4 (Grid Management Infrastructure) Provided with a manpower of 0.5 FTE within the EGI- Inspire project, and an additional contribution from IGI Fundamental will be the collaboration by NGIs and NRENs

Enabling Grids for E-sciencE EGEE-III INFSO-RI Summary of the original workplan Perform an initial assessment of the adopted model for network support within each NGI Further follow up the developments of pS-Lite_TSS for on demand troubleshooting and grid-specific tests on the network –Support its deployment on the EGI/NGIs infrastructure –Possibly exploiting further monitoring tools Define, jointly with the user community, a subset of the Grid sites belonging to the EGI global infrastructure to be periodically monitored –Excluding a priori a full-mash spanning all sites –Putting in place a workflow for the exchange of information about network faults and scheduled downtimes Organize the structure of a global PERT support for EGI

Enabling Grids for E-sciencE EGEE-III INFSO-RI Current Status of O-E-12: Summary of the EGEE to EGI transition phase Transition from EGEE SA2 to EGI O-E-12 implied close collaboration and discussions, especially among GARR, EGEE ENOC in Lyon (CC IN2P3), CNRS UREC in Paris We identified 2 main tools to keep among the ones provided by ENOC and SA2, plus an additional tool to keep following up for possible future adoption: –PerfSONAR-Lite_TSS  On-demandNetwork monitoring and troubleshooting tool based on perfSONAR –The Downcollector  A central tool to check Grid services registered in the GOC DB on their specific TCP ports –The Grid Job based approach for network monitoring  A system not requiring anly local deployment by sysadmins

Enabling Grids for E-sciencE EGEE-III INFSO-RI DownCollector The DownCollector is a polling tool reporting on the reachability of the services registered in the GOC DB Star-based architecture, Central tool –All tests start from the same initial point It checks services are reachable on the corresponding TCP ports Available at Migrated to It will be accessible through a new portal dedicated to the O-E-12 task, which will be available at the URL –This is NOT YET available. It will be setup in the next days High Availability currently not available –Might be implemented in future if operation will prove usefulness of HA Originally developed by IN2P3 CC-Lyon within EGEE SA2 –In future, endorsed by GARR 6

Enabling Grids for E-sciencE EGEE-III INFSO-RI perfSONAR-lite TroubleShooting Services Started in EGEE-III, entirely designed by SA2 Developments lead by DFN/Erlangen as a SA2 partner Central server orchestrating on demand e2e measurements between light probes hosted by Grid sites EGEE driven improvements of standard perfSONAR framework Authentication & Authorisation mapped from GOCDB’s roles 7 Users 3 - e2e measurement 2 - Request 4 - Result 1 - Request 5 - Result

Enabling Grids for E-sciencE EGEE-III INFSO-RI Networking Support – Xavier Jeannin - EGEE-III First Review June PerfSONAR-Lite_TSS Focus on on-demand troubleshooting: –Launch test on demand from a Grid site under central server control : –Bandwidth measurements, DNS lookup, Traceroute, Port testing, Ping ENOC Local site light PerfSONAR’s probe Central ENOC monitoring server 1 Grid site B ENOC supervisor ROCs members site administrator Grid site A  Authentication Authorization Process –is easy to use for the Grid administrators –can be used quickly by site admin without the need to establish each time a contact the remote site involved in the problem

Enabling Grids for E-sciencE EGEE-III INFSO-RI Networking Support – Xavier Jeannin - EGEE-III First Review June PerfSONAR-Lite_TSS First version was released and installed on 6 sites Installation guide and procedure – –FAQ, tutorial, new features (users, sites, ROC management) –Software authorization schema was adapted to be able to fit with hierarchical EGI/NGI model Difficult to deploy the software during the transition phase toward EGI

Enabling Grids for E-sciencE EGEE-III INFSO-RI perfSONAR-lite TSS 10

Enabling Grids for E-sciencE EGEE-III INFSO-RI perfSONAR-lite TSS: outlook Expected users: Sites, ROCs, ENOC... Status: Tool basically ready, but missing maturation phase –Suffered some staff movements and licensing issues –Not yet fully in production but distributed testbed in place  First production release released at the end of March Future: –Wrap up on current status and initial deployment strategy within the EGI required –O-E-12 will follow up and organize dedicated pre-production deployment campaigns in the next weeks –Future developments to further improve security related to available bandwidth tests and simply AA – May be followed and used outside EGI –DFN and CNRS declared their interest in following up the tool 11

Enabling Grids for E-sciencE EGEE-III INFSO-RI Grid Job based approach to monitoring Within EGEE SA2 a development started to exploit an approach to Network Monitoring for the Grid based on the Grid Jobs –“Monitor the Grid using the Grid” The main advantage of this approach is that Grid site adminitrators don’t have to deploy anything –Only accepting 2 jobs permanently running from a specific VO This approach was conceived especially thinking of the minor and medium-size EGEE sites, with limited resources and attendance/manpower EGEE SA2 produced a prototype deployed on a testbed of 8 sites in France and Italy Main developers are Etienne Double / CNRS UREC and Alfredo Pagano / GARR Structure, example, issues, options will be further described in another presentation by O-E-12

Enabling Grids for E-sciencE EGEE-III INFSO-RI Job-based Network monitoring for Grid www request DB 1 DB 2 Frontend: Apache Tomcat, Ajax, Google Web Toolkit (GWT) Backend: PostgreSQL Implementation languages: Python, bash script Monitoring Urec CNRS Grid network monitoring jobs Monitoring Urec CNRS DB ROC1 Monitoring ROC1 – Server A Monitoring ROC1 – Server B Possible evolutions

Enabling Grids for E-sciencE EGEE-III INFSO-RI Assessment of the current model for network support within the NGIs EGEE SA2 contributed with 3 questions to the Questionnaire for the NGIs (operations): –Do you expect to nominate a network representative who can be the contact point for the collaboration with the Network Support task at EGI level ? –Could you shortly describe what is your current operational model for network related tickets and issues ? –Have you contacted your NREN to participate to the Network Support task ? (if yes, provide details) As predictable, we got a large variety of different answers and amount of provided information

Enabling Grids for E-sciencE EGEE-III INFSO-RI First highlights from the Questionnaire 32 organizations (31 NGIs + CERN) answered As of today: –13 provided the address of a contact person/team for the Network Support task –14 answered they will appoint someone (or will possibly do it) –5 answered they will not, or they haven’t decide yet, or they did not answer In 28 cases the NGI and the NREN are interconnected with already established workflows for network related issues (1 not applicable:CERN) We will further analyse into more detail the outcome and provide a summary document to SA1 and O-E- 12/Network Support contacts

Enabling Grids for E-sciencE EGEE-III INFSO-RI Challenges ahead Get the Network Support task fully supported by all NGIs to –Involve NGIs in a reasonable roadmap towards the achievements of the O-E-12/Network Support goals The real task challenge is the Multi Domain/Cross domain e-2-e related network support –People should discuss, agree and act on common goals  We consider this the first major achievement for the O-E-12 task What shall we focus on ? We proposed something: –Sharing of information on scheduled downtimes and observed faults –3 tools to keep working on, exploiting them on a larger set of sites –Organize a general workflow for observed, percepted performances issue organizing at the EGI level a unique entry point for PERT support, able to properly handle, route/ escalate the issues –Defining – together with the VRC/VOs – a subset of relevant sites for which possibly set up periodic and systematic NM measurements

Enabling Grids for E-sciencE EGEE-III INFSO-RI The fundamental O-E-12 Trade Off There is a general trade off to keep in mind: –Doing essentially nothing  “The Network works…and anyhow If I have a problem, I know myself whom to call.. –Doing too much, trying to provide too much information, which normally means eventually no useful information  Shooting IPERFlike tests everywhere, to the full mash of sites  Providing all tickets related to all NRENs in all possible languages to a unique unfortunate team, in charge of informing everyone that the Institute for Submarine Research of The univeristy of Nowherecity, in the country of TheresEvenMe-Land will possibly have an electric power cut next Thurday, after having been able to translate and understand the original ticket

Enabling Grids for E-sciencE EGEE-III INFSO-RI Challenges ahead / brainstorming We would like to see useful tools agreed upon and adopted by the NGIs We would like to provide useful information, only useful information, only when required, to essentialy everyone in need of it Can we envisage a general tool and the corresponding required level of standardization to be able to provide to “everyone” the binary (1/0) information about the network reachability of a specific Grid site ? –Going beyond the “modelling specific workflows for specific Grid projects”? In other words: would it be able to provide to Grid managers information on possible network problems related to a specific site ?

Enabling Grids for E-sciencE EGEE-III INFSO-RI What has been achieved so far Migration plans successfully completed: –pS-Lite_TSS server, DownCollector, BugZilla already in place Started liasing with GEANT3/DANTE to formalize the EGI-GEANT collaboration –Discussed on GN3 MB on May 5, 2010 –Identified areas for collaboration:  Security  AAI  perfSONAR and interdomain tools Got an initial set of contacts and a very sketchy, draft idea of what the various NGIs are internally doing w.r.t. network support –But this required further work and peer-to-peer communication

Enabling Grids for E-sciencE EGEE-III INFSO-RI Still Missing / Next steps Create a full-fledged portal for network support –Including contacts/ wiki / documents / access to the tools  May be sticking it to the domain netsup.egi.eu ?  For the moment we will start by eginet.garr.it Plan the further development and deployment startegies for –PerfSONAR-Lite_TSS –Grid Job based approach to Network Monitoring for Grids Get new NGIs and new sites involved about them Organize the NRENs and NGIs established communication channels / fora aimed at defining an agreed strategy for the Multi Domain and the concrete tools / steps / workflows the NRENs will provide to EGI/NGIs : – NRENs&NGIs event? –Periodical VideoConferences involving NGIs and NRENs ?

Enabling Grids for E-sciencE EGEE-III INFSO-RI Thank you. Questions ?