An Integrated Instrumentation Architecture for NGI Applications Ian Foster, Darcy Quesnel, Steven Tuecke Argonne National Laboratory The University of Chicago

DOE NGI Instrumentation Project
“A Uniform Instrumentation, Event, and Adaptation Framework for Network-Aware Middleware and Advanced Network Applications”
  - With UIUC (Dan Reed, Ruth Aydt)
  - “Produce uniform notification and adaptation mechanisms, with the goal of catalyzing the development of both network-aware middleware and sophisticated network-aware applications”

Motivation
- Environment incorporates multiple sensors
  - Sources of events relating to behavior of resources, middleware, and applications
- Significant advantages to having uniform mechanisms for publishing/discovering sensors and for accessing sensor data
  - E.g., find all sensors for path A->B
  - Including historical data
- Enables end-to-end, top-to-bottom, past-to-present analysis

Examples of Sensors
- Network devices
  - E.g., routers
- End system devices
  - E.g., computers, storage systems
- Grid services
  - E.g., Globus HBM, Network Weather Service
- Libraries
  - E.g., CAVERNsoft, MPI
- Applications

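For Example...
[Diagram: sensors (S) attached at each layer. Hardware (H/W): hosts (H) and routers (R), with sensors such as SNMP and netstat. System services (Sys): GRAM, HBM, NWS, ... Libraries (Libs): MPICH, globus-io, CAVERNsoft, DPSS. Application (App).]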

Three Project Components
1. Mechanisms for creating, publishing, discovering, and accessing sensors
2. Synthesis and analysis techniques for identifying qualitative behavior and trends in sensor data
3. Adaptation techniques that exploit sensor data to adjust middleware and application configurations to improve performance
Argonne focus: (1) and (3); UIUC: (2), (3)

Current Approach
- Use a directory service (LDAP) to register and publish event sources
  - Publish: source, type, contact [online, archive]
  - Discover: “find all event sources of type X”
- Use NetLogger format for data
- Develop sensor manager to handle publish, subscribe, archiving
- Use SQL database as archive
- Initial sensor set based on Globus libraries, applications, NetLogger-accessible devices
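The publish/discover step above can be sketched in a few lines of Python. This is only an illustration: the `Directory` class is a hypothetical in-memory stand-in for the LDAP directory, and the record fields (`source`, `type`, `contact`, `mode`) simply mirror the "source, type, contact [online, archive]" tuple on the slide. The host and contact names are made up.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EventSource:
    source: str   # where the events originate, e.g. a host name
    type: str     # kind of event source, e.g. "netstat"
    contact: str  # how to reach the stream (hypothetical address format)
    mode: str     # "online" or "archive"


class Directory:
    """In-memory stand-in for the LDAP directory (an assumption, not LDAP)."""

    def __init__(self):
        self._entries = []

    def publish(self, entry):
        # Register one event source so that clients can find it later.
        self._entries.append(entry)

    def discover(self, type_):
        # "Find all event sources of type X."
        return [e for e in self._entries if e.type == type_]


directory = Directory()
directory.publish(EventSource("hostA", "netstat", "manager1:9000", "online"))
directory.publish(EventSource("hostA", "cpu", "manager1:9000", "online"))
directory.publish(EventSource("hostB", "netstat", "manager2:9000", "archive"))

netstat_sources = directory.discover("netstat")
```

A real deployment would issue an LDAP search with a filter on the type attribute instead of scanning a list; the sketch only shows the shape of the published record and of the query.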

Initial Instrumentation Architecture
[Diagram: components are Application, Sensor, Sensor Manager, LDAP, and Archive (SQL, Netarchive, MySQL, File). Labeled flows: Events in NetLogger format; Publish (“netstat, host A, time T, contact X”); Discover (“what event sources for route A to B?”); Subscribe.]

Sensor Manager
- We are building a program which:
  - Archives sensor event streams
  - Redirects sensor event streams to clients using a publish/subscribe interface
  - Generates sensor event streams from archive, based on query language
  - Publishes interfaces and index to LDAP
- Relation to other work
  - Superset of Netlogd (simple archiver)
  - Might exploit Netarchiver (MySQL indexing)
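A minimal sketch of the three data-path duties listed above (archive, redirect to subscribers, regenerate a stream from the archive) might look like this in Python. The class and method names are hypothetical, the archive is a plain list rather than a database, and a Python predicate stands in for the query language. Publishing to LDAP is left out.

```python
from collections import defaultdict


class SensorManager:
    """Toy sensor manager: archives events and fans them out to subscribers."""

    def __init__(self):
        self._archive = []                     # every (stream, event) received
        self._subscribers = defaultdict(list)  # stream name -> list of callbacks

    def subscribe(self, stream, callback):
        # Register a client callback for future events on one stream.
        self._subscribers[stream].append(callback)

    def receive(self, stream, event):
        # Archive first, then redirect the event to current subscribers.
        self._archive.append((stream, event))
        for callback in self._subscribers[stream]:
            callback(event)

    def replay(self, stream, predicate=lambda event: True):
        # Regenerate a stream from the archive; the predicate plays the
        # role of the query.
        return [e for s, e in self._archive if s == stream and predicate(e)]


manager = SensorManager()
seen = []

manager.receive("netstat@hostA", {"t": 1, "retrans": 0})  # before subscribing
manager.subscribe("netstat@hostA", seen.append)
manager.receive("netstat@hostA", {"t": 2, "retrans": 3})  # delivered live

history = manager.replay("netstat@hostA", lambda e: e["t"] >= 1)
```

The subscriber sees only the event that arrived after it subscribed, while `replay` returns both events, which is the distinction between the current and the archived stream.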

Archiving Events
- How to archive sensor event streams?
  - SQL: Save each event as a record in an SQL database
    - Advantage: Rich query support
  - Netarchive: Save each event into file. Use SQL database to build index of file contents
    - Advantage: Performance and scale?
- We will explore the use of SQL databases
  - Premise: Most sensors will not produce high volume event streams; hence optimize for simplicity and rich query support
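To make the "one record per event" option concrete, here is a small sketch using Python's built-in `sqlite3` module as a stand-in for the SQL archive. The table layout is an assumption, and the event lines are simplified KEY=VALUE text in the spirit of NetLogger; the field names and the numeric timestamps are illustrative, not the real NetLogger schema.

```python
import sqlite3


def parse_event(line):
    """Split a whitespace-separated KEY=VALUE line into a dict."""
    return dict(field.split("=", 1) for field in line.split())


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE events (ts REAL, host TEXT, event TEXT, value REAL)")

# Hypothetical event lines; real sensors would stream these to the manager.
lines = [
    "DATE=100.0 HOST=hostA NL.EVNT=cpu_load VAL=0.25",
    "DATE=101.0 HOST=hostB NL.EVNT=cpu_load VAL=0.75",
    "DATE=102.0 HOST=hostA NL.EVNT=cpu_load VAL=0.50",
]
for line in lines:
    f = parse_event(line)
    conn.execute(
        "INSERT INTO events VALUES (?, ?, ?, ?)",
        (float(f["DATE"]), f["HOST"], f["NL.EVNT"], float(f["VAL"])),
    )

# The payoff of storing events as rows: aggregation is a one-line query.
rows = conn.execute(
    "SELECT host, AVG(value) FROM events WHERE event = ? "
    "GROUP BY host ORDER BY host",
    ("cpu_load",),
).fetchall()
```

The per-host average is the kind of query that is awkward against flat event files and trivial against a table, which is the trade-off the slide weighs against the file-plus-index approach.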

Applying Info Infrastructure to Instrumentation
[Figure panels: Bandwidth/Latency ANL-NASA Ames; Bandwidth/Latency ANL-Indiana; NCSA Origin Nodes; ANL CPU Load.]

Publishing & Discovering Sensors
- Globus LDAP-based Metacomputing Directory Service (MDS) provides scalable, global infrastructure for publishing and discovering sensor managers
  - Sensors stream events to a sensor manager
  - Sensor manager publishes availability of streams into LDAP
  - Clients discover sensor managers from LDAP, and can subscribe to either current or archived sensor event streams directly from sensor managers

Initial Applications
- Replica creation in “Data Grid” applications
  - Online and historical instrumentation for large data transfers (app, lib, network)
  - Involves DPSS, globus-io
  - Also application-level selection of replicas, based on sensor information
- MPI-based video streaming (Karonis, Papka)

Security
- Grid Security Infrastructure (GSI) will be used throughout, so it is possible to state policies such as:
  - “Manager M accepts only streams from sensors of user U”
  - “Manager N only publishes streams to clients of users A, B, C”
- As a first step, we have augmented the NetLogger C client with GSI
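The two example policies can be expressed as simple membership checks once each peer's identity is known. The sketch below assumes authentication has already happened elsewhere (that is GSI's job and is not modeled here). The `ManagerPolicy` class and its method names are hypothetical.

```python
class ManagerPolicy:
    """Access rules for one sensor manager, keyed on authenticated user names."""

    def __init__(self, accepted_sensor_users=None, allowed_client_users=None):
        # None means "no restriction" in that direction.
        self.accepted_sensor_users = accepted_sensor_users
        self.allowed_client_users = allowed_client_users

    def accepts_stream_from(self, sensor_user):
        return (self.accepted_sensor_users is None
                or sensor_user in self.accepted_sensor_users)

    def publishes_to(self, client_user):
        return (self.allowed_client_users is None
                or client_user in self.allowed_client_users)


# "Manager M accepts only streams from sensors of user U"
manager_m = ManagerPolicy(accepted_sensor_users={"U"})
# "Manager N only publishes streams to clients of users A, B, C"
manager_n = ManagerPolicy(allowed_client_users={"A", "B", "C"})
```

Keeping the inbound (sensor) and outbound (client) rules separate matches the two policy statements, which restrict different directions of the event flow.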

Instrumentation Architecture Showing Actuators
[Diagram: components are Sensor, Actuator, Monitor, Sensor Manager, LDAP, and Archive (SQL, Netarchive, MySQL, File). Labeled flows: Events, Publish, Discover, Subscribe.]

Future Directions
- XML
  - NetLogger is an ASCII-based format
  - If you are using ASCII, why not use XML?
  - XML database could be used for archive
- Events
  - Performance-related events should be just one part of a larger, integrated event system
- Typing
  - NetLogger is weakly typed
  - Various advantages to strongly typed events
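As a rough illustration of the XML and typing points together, the sketch below turns a simplified KEY=VALUE event line into an XML element whose fields carry a declared type, then reads it back into typed Python values. The element and attribute names, the `FIELD_TYPES` schema, and the event fields are all assumptions made for the example; the slides do not define an XML schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical schema: which Python type each field should have.
FIELD_TYPES = {"DATE": float, "HOST": str, "NL.EVNT": str, "VAL": float}


def to_xml(line):
    """Encode a KEY=VALUE event line as an <event> element with typed fields."""
    event = ET.Element("event")
    for field in line.split():
        key, value = field.split("=", 1)
        child = ET.SubElement(
            event, "field", name=key, type=FIELD_TYPES[key].__name__
        )
        child.text = value
    return ET.tostring(event, encoding="unicode")


def from_xml(text):
    """Decode the XML back into a dict, converting each field to its type."""
    converters = {"float": float, "str": str}
    return {
        f.get("name"): converters[f.get("type")](f.text)
        for f in ET.fromstring(text)
    }


xml_text = to_xml("DATE=100.0 HOST=hostA NL.EVNT=cpu_load VAL=0.25")
typed_event = from_xml(xml_text)
```

With the type carried in the document, a consumer gets `0.25` as a number rather than a string, which is one of the advantages strong typing would bring over a free-form ASCII line.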

Future Directions (2): Publish/Subscribe for Sensors
- In first version:
  - NetLogger-based sensors stream events to manager
  - Manager publishes sensor availability to LDAP
  - Clients subscribe to sensor manager for events
- In later version:
  - Sensor can publish existence to LDAP
  - Client can subscribe directly to sensor for events

Network Weather Service (R. Wolski et al., U. Tenn)
- Scalable, fault tolerant system for
  - Real-time performance measurements
  - Predictions of future state
- When installed on N hosts, delivers:
  - Network performance (<= N^2 via netperf)
  - Host cpu-load measurements (N)
- We (USC/ISI crew) are working to integrate this into MDS; hopefully will eventually be consistent with approach described here (to be discussed)
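The entry counts above are easy to check for a small case. The sketch assumes one network measurement per ordered pair of distinct hosts, which gives N(N-1) entries and therefore stays within the N^2 bound; that pairing rule and the host names are assumptions for illustration.

```python
from itertools import permutations

hosts = ["hostA", "hostB", "hostC"]  # hypothetical host names
n = len(hosts)

# One network-performance entry per ordered (source, destination) pair.
network_pairs = list(permutations(hosts, 2))

# One cpu-load entry per host.
cpu_entries = list(hosts)
```

For three hosts this yields six directed pairs and three cpu entries, so the directory grows quadratically with the number of hosts for network data and linearly for host data.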

Structure of NWS data in MDS (old)
[Diagram: directory tree with nodes c=US, o=ISI, o=Globus, nn=the Internet, and the two entry types below.]

Network performance entry: hs=source.isi.edu to destination.anl.gov
  source: hn=source.isi.edu, o=ISI, c=US
  destination: hn=destination.anl.edu, o=ANL, c=US
  serviceProvider: NWS
  throughput:
  throughput_prediction:
  throughput_MSE: 0.95
  latency: 5.3
  latency_throughput: 6.1
  latency_MSE: 0.04

Host entry: hn=source.isi.edu
  current_cpu:
  current_cpu_prediction:
  current_cpu_MSE:
  weighted_cpu
  weighted_cpu_prediction:
  weighted_cpu_MSE:

- N^2 network performance entries for N hosts
- N sets of cpu info for N hosts
...