Network and Grid Monitoring Ludek Matyska CESNET Czech Republic.

Slides:



Advertisements
Similar presentations
The Grid Job Monitoring Service Luděk Matyska et al. CESNET, z.s.p.o. Prague Czech Republic.
Advertisements

© 2012 Open Grid Forum Simplifying Inter-Clouds October 10, 2012 Hyatt Regency Hotel Chicago, Illinois, USA.
Overview of local security issues in Campus Grid environments Bruce Beckles University of Cambridge Computing Service.
Database Architectures and the Web
4.1.5 System Management Background What is in System Management Resource control and scheduling Booting, reconfiguration, defining limits for resource.
Smart Grid - Cyber Security Small Rural Electric George Gamble Black & Veatch
1 On Death, Taxes, & the Convergence of Peer-to-Peer & Grid Computing Adriana Iamnitchi Duke University “Our Constitution is in actual operation; everything.
Information Technology Current Work in System Architecture November 2003 Tom Board Director, NUIT Information Systems Architecture.
Computing and Data Infrastructure for Large-Scale Science Deploying Production Grids: NASA’s IPG and DOE’s Science Grid William E. Johnston
Semester 4 - Chapter 3 – WAN Design Routers within WANs are connection points of a network. Routers determine the most appropriate route or path through.
Sergey Belov, Tatiana Goloskokova, Vladimir Korenkov, Nikolay Kutovskiy, Danila Oleynik, Artem Petrosyan, Roman Semenov, Alexander Uzhinskiy LIT JINR The.
Computer Science Perspective Ludek Matyska Faculty of Informatics, Masaryk University, Brno and also CESNET, Prague.
Word Wide Cache Distributed Caching for the Distributed Enterprise.
1. 2 Purpose of This Presentation ◆ To explain how spacecraft can be virtualized by using a standard modeling method; ◆ To introduce the basic concept.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE II - Network Service Level Agreement (SLA) Establishment EGEE’07 Mary Grammatikou.
A Lightweight Platform for Integration of Resource Limited Devices into Pervasive Grids Stavros Isaiadis and Vladimir Getov University of Westminster
Computer Science and Engineering 1 Service-Oriented Architecture Security 2.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
SAMANVITHA RAMAYANAM 18 TH FEBRUARY 2010 CPE 691 LAYERED APPLICATION.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
February 20, AgentCities - Agents and Grids Prof Mark Baker ACET, University of Reading Tel:
A Novel Approach to Workflow Management in Grid Environments Frank Berretz*, Sascha Skorupa*, Volker Sander*, Adam Belloum** 15/04/2010 * FH Aachen - University.
INFSO-RI Enabling Grids for E-sciencE GRID sites connectivity database design Anthony Teslyuk, RRC KI JRA4, SA2 Meeting 4 th EGEE.
DISTRIBUTED COMPUTING Introduction Dr. Yingwu Zhu.
Middleware for Grid Computing and the relationship to Middleware at large ECE 1770 : Middleware Systems By: Sepehr (Sep) Seyedi Date: Thurs. January 23,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks WMSMonitor: a tool to monitor gLite WMS/LB.
Distributed Computing Systems CSCI 4780/6780. Geographical Scalability Challenges Synchronous communication –Waiting for a reply does not scale well!!
Grid Middleware Tutorial / Grid Technologies IntroSlide 1 /14 Grid Technologies Intro Ivan Degtyarenko ivan.degtyarenko dog csc dot fi CSC – The Finnish.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
SOFTWARE DESIGN AND ARCHITECTURE LECTURE 13. Review Shared Data Software Architectures – Black board Style architecture.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Etienne Dublé - CNRS/UREC EGEE SA2 Xavier.
Glite. Architecture Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks MSG - A messaging system for efficient and.
Panel “Making real large-scale grids for real money-making users: why, how and when?” August 2005 Achim Streit Forschungszentrum Jülich in der.
CoreGRID Workpackage 5 Virtual Institute on Grid Information and Monitoring Services Michał Jankowski, Paweł Wolniewicz, Jiří Denemark, Norbert Meyer,
Measurement in the Internet Measurement in the Internet Paul Barford University of Wisconsin - Madison Spring, 2001.
Internet of Things. IoT Novel paradigm – Rapidly gaining ground in the wireless scenario Basic idea – Pervasive presence around us a variety of things.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Vassiliki Pouli
INFSO-RI Enabling Grids for E-sciencE Charon Extension Layer. Modular environment for Grid jobs and applications management Jan.
DTI Mission – 29 June LCG Security Ian Neilson LCG Security Officer Grid Deployment Group CERN.
Enhancing Security in Enterprise Distributed Real-time and Embedded Systems using Domain-specific Modeling Akshay Dabholkar, Joe Hoffert, Aniruddha Gokale,
INFSO-RI Enabling Grids for E-sciencE Grid Services for Resource Reservation and Allocation Tiziana Ferrari Istituto Nazionale di.
Partnerships in Innovation: Serving a Networked Nation Grid Technologies: Foundations for Preservation Environments Portals for managing user interactions.
INFSO-RI Enabling Grids for E-sciencE NRENs & Grids Workshop Relations between EGEE & NRENs Mathieu Goutelle (CNRS UREC) EGEE-SA2.
Company LOGO Network Management Architecture By Dr. Shadi Masadeh 1.
DIRAC Project A.Tsaregorodtsev (CPPM) on behalf of the LHCb DIRAC team A Community Grid Solution The DIRAC (Distributed Infrastructure with Remote Agent.
DataTAG is a project funded by the European Union International School on Grid Computing, 23 Jul 2003 – n o 1 GridICE The eyes of the grid PART I. Introduction.
Ian Collier, STFC, Romain Wartel, CERN Maintaining Traceability in an Evolving Distributed Computing Environment Introduction Security.
1 Open Science Grid: Project Statement & Vision Transform compute and data intensive science through a cross- domain self-managed national distributed.
ACGT Architecture and Grid Infrastructure Juliusz Pukacki ‏ EGEE Conference Budapest, 4 October 2007.
G. Russo, D. Del Prete, S. Pardi Kick Off Meeting - Isola d'Elba, 2011 May 29th–June 01th A proposal for distributed computing monitoring for SuperB G.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
AMSA TO 4 Advanced Technology for Sensor Clouds 09 May 2012 Anabas Inc. Indiana University.
Enabling Grids for E-sciencE Agreement-based Workload and Resource Management Tiziana Ferrari, Elisabetta Ronchieri Mar 30-31, 2006.
A Seminar On. What is Cloud Computing? Distributed computing on internet Or delivery of computing service over the internet. Eg: Yahoo!, GMail, Hotmail-
Dr. Ir. Yeffry Handoko Putra
Grid and Cloud Computing
TrueSight Operations Management 11.0 Architecture
An assessment framework for Intrusion Prevention System (IPS)
Semester 4 - Chapter 3 – WAN Design
GGF OGSA-WG, Data Use Cases Peter Kunszt Middleware Activity, Data Management Cluster EGEE is a project funded by the European.
Network Requirements Javier Orellana
Physical Architecture Layer Design
Introduction to Cloud Computing
Cloud Management Mechanisms
Database Environment Transparencies
Resource and Service Management on the Grid
Fundamental Concepts and Models
Presentation transcript:

Network and Grid Monitoring Ludek Matyska CESNET Czech Republic

May 11 th, 2005NREN & Grids, Amsterdam GRIDs – Basic Concepts Distributed Systems –Large scale –Spanning administrative domains –Heterogeneous Provide processing and services Very complex jobs –Workflows including data stageing –Interaction of many components

May 11 th, 2005NREN & Grids, Amsterdam Networks for GRIDs Basic underlying infrastructure Provide data transfer capability Usually not directly exposed –Except for data transfer planning

May 11 th, 2005NREN & Grids, Amsterdam Network Information for GRIDs Availability and quality of paths –Network capacity prediction –Monitoring of actual data transfers Fulfillment of SLA (implicit/explicit) Failover –Implicit –Explicit – new search for appropriate path

May 11 th, 2005NREN & Grids, Amsterdam Network Capacity Prediction How/when I can transfer amount X of data from A to B? I have two sets of sites: {A, B, C, …} (data sources) and {X, Y, Z, …} (using data). Find me one node from the first and one node from the second set such that the transfer of data will be the fastest.

May 11 th, 2005NREN & Grids, Amsterdam Network Monitoring Role The Network Capacity prediction needs a network monitoring data –Flow and flow patterns are needed NRENs collects this data to some extent only –And usually do not provide it to third parties

May 11 th, 2005NREN & Grids, Amsterdam Network Weather Service (NWS) Collects information about flows through network Provides interface for predicted link/path capacity Challenges –Completeness of the information Is deployed as a NREN service? –Unpredicted reaction of NWS users Their reaction synchronized if not under control –Prediction is hard/impossible

May 11 th, 2005NREN & Grids, Amsterdam Monitoring of Data Transfers Purpose: –Check the SLA –Provide data for prediction improvement –Look for patterns Identification of a particular flow Authorization to use such service Authorization to access data –At the end of data transfer –In real time Not (yet) commonly available

May 11 th, 2005NREN & Grids, Amsterdam CESNET Systems G3 system –Infrastructure monitoring –SNMP based –High detail (e.g. history of virtual port related data) FTAS (Flow-based Traffic Analysis System) –Aggregate and individual flow information –Both IPv4 and IPv6 supported –Heavily used for security related incidents DoS and DDoS attacks None currently used by the Grid community

May 11 th, 2005NREN & Grids, Amsterdam Grid Information and Monitoring Services Almost every Grid component relies on some external information Information provides every Grid element –Distributed producers Distributed Use Classical separation –Grid Information services –Grid Monitoring services

May 11 th, 2005NREN & Grids, Amsterdam Grid Information Services “Static” information about elements –Number of CPUs –GPS location of nodes –Users and their affiliation –Actual length of a queue –Number of free CPUs Usually does not check itself

May 11 th, 2005NREN & Grids, Amsterdam Grid Monitoring Infrastructure/status monitoring –Nodes –Services May be distributed/duplicated/migrating –Jobs and their workflows Application monitoring –Infrastructure part –Particular application oriented part

May 11 th, 2005NREN & Grids, Amsterdam Grid infrastructure monitoring Structure of service –Centralized Easier to deploy Scalability problems –Distributed/Hierarchical Higher overhead Reliability of monitoring components

May 11 th, 2005NREN & Grids, Amsterdam Grid infrastructure monitoring Nodes Monitor properties of nodes –Configuration –Services associated with nodes Examples: –Job acceptance –Compiler availability Could run complex checks –Checkout a CVS and compile a program

May 11 th, 2005NREN & Grids, Amsterdam Example of Node Monitoring

May 11 th, 2005NREN & Grids, Amsterdam Grid infrastructure monitoring Services General Grid services Service discovery –Where the service run now Redundancy –Check the service or all its copies? Grid monitoring itself a service

May 11 th, 2005NREN & Grids, Amsterdam Grid infrastructure monitoring Jobs Information about job flows through the Grid middleware –Distributed gathering of monitoring data –Must be somewhere completed Complex jobs/workflows EGEE Logging and Bookkeeping (LB) –Collects events triggered by job flow through the middleware –Computes job states on the fly –Provides user access to the job states

May 11 th, 2005NREN & Grids, Amsterdam LB Service Architecture

May 11 th, 2005NREN & Grids, Amsterdam Middleware instrumentation Idea to collect data from running middleware –Similar to the LB service, but more general Large number of sources –Distributed collection and processing SNMP  SGMP?

May 11 th, 2005NREN & Grids, Amsterdam Monitoring information dissemination Many places/services look for the data Streams of data –Monitoring data discovery What is available where Events –Subscription/Notification Cross organizational data flow –Sharing monitoring data

May 11 th, 2005NREN & Grids, Amsterdam Privacy/Security Considerations Potentially sensitive data collected –When/where to aggregate –Access authorization Extended use of mutual authentication of monitoring elements Not always required –Provision of alternate less secure (faster/low overhead) monitoring infrastructure

May 11 th, 2005NREN & Grids, Amsterdam NRENs and GRIDs Role for NRENs –Provide information about the network What is available where –Provide information and monitoring services for Grids In some aspects analogy to PERT/PACE Major challenges –Grids are multi-NRENs –(Much) more “users” –Privacy/Security –Heterogeneity of monitored elements –Portal and service access to the collected data

May 11 th, 2005NREN & Grids, Amsterdam Questions?