May 2005 Iosif Legrand 1 Iosif Legrand California Institute of Technology May 2005 An Agent Based, Dynamic Service System to Monitor, Control and Optimize.

Slides:



Advertisements
Similar presentations
Caltech Proprietary Videoconferencing Security in VRVS 3.0 and Future Videoconferencing Security in VRVS 3.0 and Future Kun Wei California Institute of.
Advertisements

CPSCG: Constructive Platform for Specialized Computing Grid Institute of High Performance Computing Department of Computer Science Tsinghua University.
TCP Monitor and Auto Tuner. Need Analysis Enable monitoring of TCP Connections Enable maximum bandwidth utilization No such utility available in MONALISA.
Chapter 3: Planning a Network Upgrade
Distributed Processing, Client/Server and Clusters
INTRODUCTION TO COMPUTER NETWORKS Zeeshan Abbas. Introduction to Computer Networks INTRODUCTION TO COMPUTER NETWORKS.
1 Chapter 11: Data Centre Administration Objectives Data Centre Structure Data Centre Structure Data Centre Administration Data Centre Administration Data.
May 2005 Iosif Legrand 1 Iosif Legrand California Institute of Technology ICFA WORKSHOP Daegu, May 2005 Daegu, May 2005 An Agent Based, Dynamic Service.
4.1.5 System Management Background What is in System Management Resource control and scheduling Booting, reconfiguration, defining limits for resource.
Netscape Application Server Application Server for Business-Critical Applications Presented By : Khalid Ahmed DS Fall 98.
A Java Architecture for the Internet of Things Noel Poore, Architect Pete St. Pierre, Product Manager Java Platform Group, Internet of Things September.
Notes to the presenter. I would like to thank Jim Waldo, Jon Bostrom, and Dennis Govoni. They helped me put this presentation together for the field.
June 2003 Iosif Legrand MONitoring Agents using a Large Integrated Services Architecture Iosif Legrand California Institute of Technology.
Monitoring and controlling VRVS Reflectors Catalin Cirstoiu 3/7/2003.
Managing Agent Platforms with the Simple Network Management Protocol Brian Remick Thesis Defense June 26, 2015.
October 2003 Iosif Legrand Iosif Legrand California Institute of Technology.
The new The new MONARC Simulation Framework Iosif Legrand  California Institute of Technology.
Systems Architecture, Fourth Edition1 Internet and Distributed Application Services Chapter 13.
© DSRG 2001www.cs.agh.edu.pl Cross Grid Workshop - Kraków Krzysztof Zieliński, Sławomir Zieliński University of Mining and Metallurgy {kz,
L. Granado Cardoso, F. Varela, N. Neufeld, C. Gaspar, C. Haen, CERN, Geneva, Switzerland D. Galli, INFN, Bologna, Italy ICALEPCS, October 2011.
Grid Monitoring By Zoran Obradovic CSE-510 October 2007.
BMC Software confidential. BMC Performance Manager Will Brown.
Cloud Computing for the Enterprise November 18th, This work is licensed under a Creative Commons.
September 2005 Iosif Legrand 1 End User Agents: extending the "intelligence" to the edge in Distributed Service Systems Iosif Legrand California Institute.
KARMA with ProActive Parallel Suite 12/01/2009 Air France, Sophia Antipolis Solutions and Services for Accelerating your Applications.
Users’ Authentication in the VRVS System David Collados California Institute of Technology November 20th, 2003TERENA - Authentication & Authorization.
Chapter 9: Novell NetWare
Multi-media Computers and Computer Networks. Questions ? Media is used for ………………… Multimedia computer is capable of integrating ………………………………….. OCR stands.
A Lightweight Platform for Integration of Resource Limited Devices into Pervasive Grids Stavros Isaiadis and Vladimir Getov University of Westminster
Online Monitoring with MonALISA Dan Protopopescu Glasgow, UK Dan Protopopescu Glasgow, UK.
Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
ACAT 2003 Iosif Legrand Iosif Legrand California Institute of Technology.
Ramiro Voicu December Design Considerations  Act as a true dynamic service and provide the necessary functionally to be used by any other services.
1 Ramiro Voicu, Iosif Legrand, Harvey Newman, Artur Barczyk, Costin Grigoras, Ciprian Dobre, Alexandru Costan, Azher Mughal, Sandor Rozsa Monitoring and.
The Network Performance Advisor J. W. Ferguson NLANR/DAST & NCSA.
Chapter 4 Realtime Widely Distributed Instrumention System.
Monitoring, Accounting and Automated Decision Support for the ALICE Experiment Based on the MonALISA Framework.
February 2006 Iosif Legrand 1 Iosif Legrand California Institute of Technology February 2006 February 2006 An Agent Based, Dynamic Service System to Monitor,
1 VINCI : Virtual Intelligent Networks for Computing Infrastructures An Integrated Network Services System to Control and Optimize Workflows in Distributed.
1 Iosif Legrand, Harvey Newman, Ramiro Voicu, Costin Grigoras, Catalin Cirstoiu, Ciprian Dobre An Agent Based, Dynamic Service System to Monitor, Control.
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
Operating Systems David Goldschmidt, Ph.D. Computer Science The College of Saint Rose CIS 432.
DYNES Storage Infrastructure Artur Barczyk California Institute of Technology LHCOPN Meeting Geneva, October 07, 2010.
LISHEP 2004 Iosif Legrand Iosif Legrand California Institute of Technology DISTRIBUTED SERVICES.
The Replica Location Service The Globus Project™ And The DataGrid Project Copyright (c) 2002 University of Chicago and The University of Southern California.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
A Collaborative Framework for Scientific Data Analysis and Visualization Jaliya Ekanayake, Shrideep Pallickara, and Geoffrey Fox Department of Computer.
VRVS Research Roadmap (Caltech). VRVS Deployment and Usage.
Overview of ALICE monitoring Catalin Cirstoiu, Pablo Saiz, Latchezar Betev 23/03/2007 System Analysis Working Group.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
1 MonALISA Team Iosif Legrand, Harvey Newman, Ramiro Voicu, Costin Grigoras, Ciprian Dobre, Alexandru Costan MonALISA capabilities for the LHCOPN LHCOPN.
Xrootd Monitoring and Control Harsh Arora CERN. Setting Up Service  Monalisa Service  Monalisa Repository  Test Xrootd Server  ApMon Module.
April 2003 Iosif Legrand MONitoring Agents using a Large Integrated Services Architecture Iosif Legrand California Institute of Technology.
PPDG February 2002 Iosif Legrand Monitoring systems requirements, Prototype tools and integration with other services Iosif Legrand California Institute.
Features Of SQL Server 2000: 1. Internet Integration: SQL Server 2000 works with other products to form a stable and secure data store for internet and.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
ECHO A System Monitoring and Management Tool Yitao Duan and Dawey Huang.
October 2006 Iosif Legrand 1 Iosif Legrand California Institute of Technology An Agent Based, Dynamic Service System to Monitor, Control and Optimize Distributed.
Mobile Analyzer A Distributed Computing Platform Juho Karppinen Helsinki Institute of Physics Technology Program May 23th, 2002 Mobile.
ACGT Architecture and Grid Infrastructure Juliusz Pukacki ‏ EGEE Conference Budapest, 4 October 2007.
A System for Monitoring and Management of Computational Grids Warren Smith Computer Sciences Corporation NASA Ames Research Center.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
January 2006 Iosif Legrand 1 Iosif Legrand California Institute of Technology January 2006 An Agent Based, Dynamic Service System to Monitor, Control and.
Netscape Application Server
California Institute of Technology
Open Source distributed document DB for an enterprise
GGF15 – Grids and Network Virtualization
Initial job submission and monitoring efforts with JClarens
AWS Cloud Computing Masaki.
Presentation transcript:

May 2005 Iosif Legrand 1 Iosif Legrand California Institute of Technology May 2005 An Agent Based, Dynamic Service System to Monitor, Control and Optimize Distributed Systems Control and Optimize Distributed Systems

May 2005 Iosif Legrand 2 MonALISA is A Dynamic, Distributed Service Architecture   Real-time monitoring is an essential part of managing distributed systems. The monitoring information gathered is necessary for developing higher level services, and components that provide automated decisions, to help operate and globally optimize the workflow in complex systems.   The MonALISA system is designed as an ensemble of autonomous multi-threaded, self-describing agent-based subsystems which are registered as dynamic services, and are able to collaborate and cooperate in performing a wide range of monitoring tasks and to analyze and process this information in a distributed way to provide optimization decisions in large scale distributed applications.   An agent-based architecture provides the ability to invest the system with increasing degrees of intelligence; to reduce complexity and make global systems manageable in real time

May 2005 Iosif Legrand 3 The MonALISA Architecture Provides:  Reliable Registration and Discovery for Services and Applications.  Monitoring all aspects of complex systems :  System information for computer nodes and clusters  Network information : WAN and LAN  Monitoring the performance of Applications or services  The End User Systems  Can interact with any other services to provide in near real-time customized / filtered information based on monitoring data  Secure, remote administration for services and applications  Agents to supervise applications, to restart or reconfigure them, and to notify other services when certain conditions are detected.  The MonALISA framework can be used to develop higher level decision services, implemented as a distributed network of communicating agents, to perform global optimization tasks.  Powerful Graphical User Interfaces

May 2005 Iosif Legrand 4 Lookup Service MonALISA service & Data Handling MonALISA Service Data Cache Service & DB Configuration Control (SSL) Configuration Control (SSL) Lookup Service Monitor Data Stores WEB Service WSDL SOAP Client (other service) Java Discovery Registration Client (other service) Web client data Postgres DB MySQL MDS Applications User defined loadable Modules to write /sent data Predicates & Agents Communications via the ML Proxy

May 2005 Iosif Legrand 5 Lookup Service Registration / Discovery Admin Access and AAA for Clients MonALISA Service Lookup Service Client (other service) Discovery Registration (signed certificate) MonALISA Service MonALISA Service Services Proxy Multiplexer Services Proxy Multiplexer Client (other service) Admin SSL connection Trust keystore AAA services Client authentication Data Data Filters & Agents Filters & Agents Trust keystore

May 2005 Iosif Legrand 6 MonALISA Discovery System & Services Network of JINI-LUSs Secure & Public MonALISA service Proxies Clients, HL services repositories Fully Distributed Discovery Dynamic - based on a lease Mechanism and REN Distributed Information System. Dynamic load balancing Scalability & Replication Security Global Services or Clients

May 2005 Iosif Legrand 7   Grid3 Grid3 ~40 sites in US and 1 Korea   CMS-US sites CMS-US sites   CMS CMS   CDF   D0 SAR D0 SAR   ABILENE backbone ABILENE   GLORIAD GLORIAD   STAR STAR   ALICE ALICE   VRVS System VRVS System   RoEduNET backbone RoEduNET backbone   INTERNET2 PIPES INTERNET2 PIPES   OSG Communities using MonALISA ABILENE VRVS - - GRID3 ALICE CMS-DC04 It has been used for Demonstrations at:   SC2003   Telecom 2003   WSIS 2003   SC 2004

May 2005 Iosif Legrand 8 Monitoring I2 Network Traffic, Grid03 Farms and Jobs

May 2005 Iosif Legrand 9 Monitoring Network Topology Latency, Routers NETWORKS AS ROUTERS

May 2005 Iosif Legrand 10 Monitoring the Execution of Jobs and the Time Evolution SPLIT JOBS LIFELINES for JOBS Job Job1 Job2 Job3 Job 31 Job 32 Summit a Job DAG

May 2005 Iosif Legrand 11 Monitoring ABILENE backbone Network u Test for a Land Speed Record u ~ 7 Gb/s in a single TCP stream from Geneva to Caltech

May 2005 Iosif Legrand 12 Monitoring Optical Switches Agents to Create on Demand an Optical Path

May 2005 Iosif Legrand 13 Monitoring VRVS Reflectors and Communication Topology

May 2005 Iosif Legrand 14 MonALISA provides automated management and global optimization for the EVO system  Dynamic Discovery of Reflectors  Creates and maintains, in near real-time, the optimal connectivity between reflectors (a dynamic minimum spanning tree) based on periodic network measurements. In case of any network problems the entire connection tree is modified to optimize the overall performance.  Detects and monitor the “End User” configuration, its hardware, the connectivity and its performance.  Dynamically connects the client to the best reflector  Provides secure administration for services using a flexible GUI. It is possible to start / stop / update / reconfigure reflectors It is possible to start / stop / update / reconfigure reflectors  Monitors the entire system and keeps long term history  It is using alarm triggers to notify unexpected events

May 2005 Iosif Legrand 15 Communication in the Distributed Collaborative System Communication in the Distributed Collaborative System vrvs us vrvs eu pub funet star- light sinica kek cor- nell triumf cal- tech usf usp inet 2 vrvs 5 Reflectors are hosts that interconnect users by permanent IP tunnels. The active IP tunnels must be selected so that there is no cycle formed. Tree The selection is made according to the real-time measurements of the network performance. minimum-spanning tree(MST) minimum-spanning tree (MST)

May 2005 Iosif Legrand 16 Creating a Dynamic, Global, Minimum Spanning Tree to optimize the connectivity A weighted connected graph G = (V,E) with n vertices and m edges. The quality of connectivity between any two reflectors is measured every 2s. Building in near real time a minimum- spanning tree T

May 2005 Iosif Legrand 17 LISA- Localhost Information Service Agent End To End Monitoring Tool u u It is very easy to deploy and install by simply using any browser. u u It detects the system architecture, the operating system and selects dynamically the binary parts necessary on each system. u u It can be easily deployed on any system. It is now used on all versions of Windows, Linux, Mac. u u It provides complete system monitoring of the host computer: u u CPU, memory, IO, disk, … u u Hardware detection u u Main components, Audio, Video equipment, u u Drivers installed in the system u u Provides embedded clients for IPERF (or other network monitoring tools, like Web 100 ) u u A user friendly GUI to present all the monitoring information. A lightweight Java Web Start application that provides complete monitoring of the end user systems, the network connectivity and can use the MonALISA framework to optimize client applications

May 2005 Iosif Legrand 18 LISA- Provides an Efficient Integration for Distributed Systems and Applications LISA Lookup Service Lookup Service Discovery Registration Best Service MonALISA Application Service MonALISA Application Service MonALISA Application Service MonALISA Application Service u u It is using external services to identify the real IP of the end system, its network ID and AS u u Discovers MonALISA services and can select, based on service attributes, different applications and their parameters (location, AS, functionality, load … ) è è Based on information such as AS number or location, it determines a list with the best possible services. è è Registers as a listener for other service attributes (eg. number of connected clients). è è Continuously monitors the network connection with several selected services and provides the best one to be used from the client’s perspective. è è Measures network quality, detects faults and informs upper layer services to take appropriate decisions

May 2005 Iosif Legrand 19 LISA is used by the Clients to Dynamically Select the Best Reflector LISA Discover the Best Service LISA Discover the Best Service MonALISA Reflector A MonALISA Reflector A MonALISA Reflector A MonALISA Reflector A MonALISA Reflector A Minimum Spanning Tree Maintained continuously by Dedicated MonALISA agents Monitoring Feedback CLIENT

May 2005 Iosif Legrand 20 LISA Detects the Best Reflector for each Client and MonALISA Agents keep the reflectors connected in a MST

May 2005 Iosif Legrand 21 Global Optimization for the Interaction and Integration between Clients and Services u u LISA clients can discover and select the best services to be used, based on network performance measurements, load of the services and any additional attributes u u This provides a dynamic load balancing in how refectors are allocated and at the same time is optimizing the performance from the client perspective u u LISA clients can report all the collected monitoring information to one or more MonALISA services in a dynamic way. In this way, services are informed about the performance of each client, its load, available local resources and the quality of its connectivity. For multimedia applications the hardware and the drivers used are also very important. u u The real-time feedback from clients is important in operating large, complex systems. Based on this information, services can adjust dynamically to different load patterns.

May 2005 Iosif Legrand 22 SUMMARY MonaLISA is a fully distributed service system with no single point of failure. It provides reliable registration and discovery of services and applications. MonaLISA is a fully distributed service system with no single point of failure. It provides reliable registration and discovery of services and applications. u MonALISA is interfaced with many monitoring tools and is capable to collect information from different applications u It allows to analyze and process information locally, using Filters or Agents that are dynamically deployed to provide customized information to other services or clients or to trigger predefined actions. u Can be used to control and monitor any other applications. Agents can be used to supervise applications, to restart or reconfigure them, and to notify other services when certain conditions are detected. u Provides a secure administration interface which allows to remotely control (start / stop/ reconfigure / upgrade) distributed services or applications. u The Agent system in the MonALISA framework can be used to develop higher level services, implemented as a distributed network of communicating agents, to perform global optimization tasks. It proved to be a stable and reliable distributed service system ~180 Sites running MonALISA ~180 Sites running MonALISA