Network Monitoring Using Grid Jobs EGEE SA2

Slides:

Advertisements

Similar presentations

Security middleware Andrew McNab University of Manchester.

Advertisements

EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Wrap up on perfSONAR-Lite_TSS and Network Troubleshooting Mario Reale GARR.

Grid and CDB Janusz Martyniak, Imperial College London MICE CM37 Analysis, Software and Reconstruction.

Computer Monitoring System for EE Faculty By Yaroslav Ross And Denis Zakrevsky Supervisor: Viktor Kulikov.

How Clients and Servers Work Together. Objectives Learn about the interaction of clients and servers Explore the features and functions of Web servers.

16: Distributed Systems1 DISTRIBUTED SYSTEM STRUCTURES NETWORK OPERATING SYSTEMS The users are aware of the physical structure of the network. Each site.

EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Etienne Dublé - CNRS/UREC Alfredo Pagano – GARR NetJobs: Network Monitoring.

Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug

IMS 4212: Distributed Databases 1 Dr. Lawrence West, Management Dept., University of Central Florida Distributed Databases Business needs.

1 Chapter Client-Server Interaction. 2 Functionality  Transport layer and layers below  Basic communication  Reliability  Application layer.

Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.

The huge amount of resources available in the Grids, and the necessity to have the most up-to-date experimental software deployed in all the sites within.

Security monitoring boxes Andrew McNab University of Manchester.

1 Client-Server Interaction. 2 Functionality Transport layer and layers below –Basic communication –Reliability Application layer –Abstractions Files.

Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]

6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.

EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Etienne Dublé - CNRS/UREC EGEE SA2 Xavier.

Derek Ross E-Science Department DCache Deployment at Tier1A UK HEP Sysman April 2005.

EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Site Architecture Resource Center Deployment Considerations MIMOS EGEE Tutorial.

EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks MSG - A messaging system for efficient and.

Rutherford Appleton Lab, UK VOBox Considerations from GridPP. GridPP DTeam Meeting. Wed Sep 13 th 2005.

Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.

EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Update on Network Performance Monitoring.

VO Box Issues Summary of concerns expressed following publication of Jeff’s slides Ian Bird GDB, Bologna, 12 Oct 2005 (not necessarily the opinion of)

Testing and integrating the WLCG/EGEE middleware in the LHC computing Simone Campana, Alessandro Di Girolamo, Elisa Lanciotti, Nicolò Magini, Patricia.

EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Xavier Jeannin (CNRS/UREC Paris, FR) 24.

EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The LCG interface Stefano BAGNASCO INFN Torino.

EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Mario Reale – GARR NetJobs: Network Monitoring Using Grid Jobs.

Connect communicate collaborate Performance Metrics & Basic Tools Robert Stoy, DFN EGI TF, Madrid September 2013.

II EGEE conference Den Haag November, ROC-CIC status in Italy

SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,

G. Russo, D. Del Prete, S. Pardi Kick Off Meeting - Isola d'Elba, 2011 May 29th–June 01th A proposal for distributed computing monitoring for SuperB G.

INFSO-RI Enabling Grids for E-sciencE GOCDB2 Matt Thorpe / Philippa Strange RAL, UK.

The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.

EGI-InSPIRE EGI-InSPIRE RI Network Troubleshooting and PerfSONAR-Lite_TSS Mario Reale GARR.

HEPiX IPv6 Working Group David Kelsey (STFC-RAL) GridPP33 Ambleside 22 Aug 2014.

CH-1211 Genève 23 Job efficiencies at CERN Review of job efficiencies at CERN status report James Casey, Daniel Rodrigues, Ulrich Schwickerath.

Voice Performance Measurement and related technologies

Jean-Philippe Baud, IT-GD, CERN November 2007

WLCG IPv6 deployment strategy

Grid Computing: Running your Jobs around the World

Status of SA2 network monitoring and troubleshooting tools

Introduction to Distributed Platforms

High Availability Linux (HA Linux)

Use of Nagios in Central European ROC

Integrating HA Legacy Products into OpenSAF based system

Database System Concepts and Architecture

EGEE SA2 / TERENA NRENs & Grids joint workshop

ALICE FAIR Meeting KVI, 2010 Kilian Schwarz GSI.

Global Banning List and Authorization Service

PROTEAN: A Scalable Architecture for Active Networks

EGEE VO Management.

Server Concepts Dr. Charles W. Kann.

Short update on the latest gLite status

A Messaging Infrastructure for WLCG

Artem Trunov and EKP team EPK – Uni Karlsruhe

Network Requirements Javier Orellana

Mario Reale – IGI / GARR Lyon, Sept 19, 2011

Interoperability & Standards

Cloud Computing.

Module 3 Building a web app.

Pierre Girard ATLAS Visit

Software models - Software Architecture Design Patterns

Grid Coordination by Using the Grid Coordination Protocol

Grid Management Challenge - M. Jouvin

Anant Mudambi, U. Virginia

Site availability Dec. 19 th 2006

Installation/Configuration

Presentation transcript:

Network Monitoring Using Grid Jobs EGEE SA2 Xavier Jeannin, Etienne Dublé - CNRS/UREC Mario Reale, Alfredo Pagano – GARR Wednesday, February 24, 2010 – Technical Network Liaison Comittee

Content Network Monitoring… The idea System architecture Next steps In the context of grids In the context of EGEE The idea System architecture Global view The Server, the Jobs and the Grid User Interface Next steps Discussion

- In the context of grids Network Monitoring… - In the context of grids - In the context of EGEE

Network Monitoring for Grids GRIDs are big users and they will exercise the network The LHC generating ~15 PetaBytes of raw data/year for sure is a big user Grid middleware can benefit from monitoring: Example: Network aware job and data transfer scheduling When a problem occurs, a grid operator / user would like to check quickly if the network is involved in the problem: This is especially important for grids because in such a complex environment the network is one of many layers Take account of: –capability of CEs –size and number of jobs already waiting (queued) at CEs –performance of network link for each CE-SE combination The Grid may use network performance data to improve its decision making Data Intensive and Network Aware (DIANA) Grid Scheduling

Previous and other EGEE efforts e2emonit (pingER, UDPmon, IPERF) NPM (Network Performance Monitor) PCP (Probe Control Protocol) Diagnostic Tool PerfSONAR_Lite-TSS PerfSONAR-MDM

The EGEE context No EGEE recommended general solution for network monitoring A part of the grid is already monitored (LHCOPN, specific national initiatives, …), and there are plans to monitor more links Monitor all Tier-1 <-> Tier-2 links using PerfSONAR? PerfSONAR Lite TSS should be deployed very soon, but it is dedicated to troubleshooting In this project EGEE SA2 is trying to address the needs which are not yet addressed

Characteristics of the tool Our approach had to take into account: High scalability Security Reliability Cost-effectiveness And preferably: A lightweight deployment

“Instead of installing a probe at each site, run a grid job” The idea: “Instead of installing a probe at each site, run a grid job”

pros and cons Added value: Limits: No installation/deployment needed in the sites Monitoring 10 or 300 sites is just a matter of configuration A monitoring system running on a proven architecture (the grid) Possibility to use grid services (ex: AuthN and AuthZ) Limits: Some low-level metrics can’t be implemented in the job Because no control of the Worker Node environment (hardware, software) where the job is running is available Some sites will have to slightly update their middleware configuration The maximum lifetime of jobs should be increased if it is too low (at least for the DN of the certificate that the system uses)

System architecture: Global view

System Architecture the components www request Front-end DB 1 Monitoring server Grid network monitoring jobs DB 2 Monitoring server Monitoring server DB ROC1 Monitoring server @ ROC1 – Server A Monitoring server @ ROC1 – Server B Possible new configuration Frontend: Apache Tomcat, Ajax, Google Web Toolkit (GWT) Monitoring server: Python, bash script Jobs: Python, bash script (portability is a major aspect for jobs) Database: PostgreSQL

Current prototype: 8 Sites

Choice of network paths, scheduling Monitor all possible site-to-site paths will be too much: N x (N-1) and N ~ 300 sites for a whole grid coverage We must restrict the number of these paths To a specific VO, to an experiment, to the most used paths, etc. We have studied this at https://edms.cern.ch/document/1001777 The system is completely configurable about these paths and the scheduling of measurements The admin specifies a list of scheduled tests, giving for each one: The source and the remote site The type of test The frequency of the test Users will be able to request the system to monitor a given path (this request must then be validated by the admin) If you still have many paths, you can start several server instances (to achieve the needed performance)

Example of scheduling Latency test Hop count MTU size TCP RTT Every 10 minutes Hop count Iterative connect() test MTU size Socket (IP_MTU socket option) Achievable Bandwidth TCP throughput transfer via GridFTP transfer between 2 Storage Elements Every 8h In order to avoid too many connections these three measurements are done in the same test

The Server, the Jobs, and the Grid System architecture: The Server, the Jobs, and the Grid

Technical constraints Technical constraints to be dealt with: When running a job, the grid user is mapped to a Linux user of the Worker Node (WN): This means the job is not running as root on the WN Some low level operations are not possible (for example opening an ICMP listening socket is not allowed) Heterogeneity of the WN environments (various OS, 32/64 bits…) Ex: making the job download and run an external tool may be tricky (except if it is written in an OS independent programming language) The system has to deal with the grid mechanism overhead (delays, job lifetime limit…)

Initialization of grid jobs Site paris-urec-ipv6 Site X Ready! UI WMS Central monitoring server program (CMSP) Site A Site B Site C CE CE CE Request: RTT test to site A Request: BW test to site B WN WN WN Job Job Job Job submission Socket connection Probe Request 17

Remarks about this initialization step Chosen design (1 job <-> many probes) is much more efficient than starting a job for each probe Considering delays Considering the handling of middleware failures (most failures occur at job submission, extremely few of them occur once the job is running) TCP connection is initiated by the job No open port needed on the WN  better for security of sites An authentication mechanism is implemented between the job and the server A job cannot last forever (GlueCEPolicyMaxWallClockTime) So actually there are 2 jobs running at each site A ‘main’ one, and A ‘redundant’ one which is waiting and will become ‘main’ when the other one ends

RTT, MTU and hop count test Site paris-urec-ipv6 UI Central monitoring server program (CMSP) Site B Site C CE Request: RTT test to site C WN Job Socket connection Probe Request Probe Result

RTT, MTU and hop count test The ‘RTT’ measure is the time a TCP ‘connect()’ function call takes: Because a connect() call involves a round-trip of packets: SYN -> SYN-ACQ <- ACQ -> Results very similar to the ones of ‘ping’ The MTU is given by the IP_MTU socket option The number of hops is calculated in an iterative way These measures require: To connect to an accessible port (1) on a machine of the remote site To close the connection (no data is sent) Note: This (connect/disconnect) is detected in the application log (1): We use the port of the gatekeeper of the CE since it is known to be accessible (it is used by the grid middleware gLite) Round trip Just sending => no network delay

Request: GridFTP BW test to site C Site paris-urec-ipv6 UI Central monitoring server program (CMSP) Replication of a large grid file Site A Site C SE SE Read the gridFTP log file Request: GridFTP BW test to site C WN Job Socket connection Probe Request Probe Result

GridFTP BW test If the GridFTP log file is not accessible (cf. dCache?) In this case we just do the transfer via globus-url-copy in a verbose mode in order to get the transfer rate. A passive version of this BW test is being developed The job just reads the gridftp log file periodically (the system does not requests additional transfers) This is only possible if the log file is available on the Storage Element (i.e. it is a DPM)

System architecture: User Interface

The user interface

Next steps

Next steps Very near future (i.e. being developped): Server: GridFTP passive BW test Front-end: provide a new form which will allow users to request the system to monitor a new path

Next steps Other possible enhancements (later): Triggering system to alert site and network admins Refresh measurements on-demand (don’t wait several hors for the next bw test...) Add more types of measurements? Consider adding a dedicated box (VObox?) If some of the metrics needed are not available with the job-based approach Ex: low level measurements requiring root privileges The job would interact with this box and transport the results This might be done in a restricted set of major sites Consider interaction with other systems (some probes may be already installed at some sites, we could benefit from them)

Feedback, discussion, requests… Thank You Feedback, discussion, requests… Front-end: http://egeemon.dir.garr.it:8080/NetMonDB/ Wiki: https://twiki.cern.ch/twiki/bin/view/EGEE/GridNetworkMonitoring Contacts: etienne.duble@urec.cnrs.fr alfredo.pagano@garr.it