Italo Epicoco, Ph.D. University of Lecce, Italy iGrid: a Relational Information Service A novel resource & service discovery approach.

Slides:



Advertisements
Similar presentations
Overview of local security issues in Campus Grid environments Bruce Beckles University of Cambridge Computing Service.
Advertisements

High Performance Computing Course Notes Grid Computing.
Condor-G: A Computation Management Agent for Multi-Institutional Grids James Frey, Todd Tannenbaum, Miron Livny, Ian Foster, Steven Tuecke Reporter: Fu-Jiun.
A Computation Management Agent for Multi-Institutional Grids
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Notes to the presenter. I would like to thank Jim Waldo, Jon Bostrom, and Dennis Govoni. They helped me put this presentation together for the field.
Massimo Cafaro GridLab Review GridLab WP10 Information Services Massimo Cafaro CACT/ISUFI University of Lecce, Italy.
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
A Model for Grid User Management Rich Baker Dantong Yu Tomasz Wlodek Brookhaven National Lab.
1-2.1 Grid computing infrastructure software Brief introduction to Globus © 2010 B. Wilkinson/Clayton Ferner. Spring 2010 Grid computing course. Modification.
Lesson 11-Virtual Private Networks. Overview Define Virtual Private Networks (VPNs). Deploy User VPNs. Deploy Site VPNs. Understand standard VPN techniques.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 1: Introduction to Windows Server 2003.
Implementing ISA Server Caching. Caching Overview ISA Server supports caching as a way to improve the speed of retrieving information from the Internet.
Grid Computing, B. Wilkinson, 20046c.1 Globus III - Information Services.
INFN experience with Globus GIS A. Cavalli - F. Semeria INFN Grid Information Services workshop CERN, March 2001.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 1: Introduction to Windows Server 2003.
McGraw-Hill The McGraw-Hill Companies, Inc., 2000 SNMP Simple Network Management Protocol.
Windows Server 2008 Chapter 8 Last Update
WP6: Grid Authorization Service Review meeting in Berlin, March 8 th 2004 Marcin Adamski Michał Chmielewski Sergiusz Fonrobert Jarek Nabrzyski Tomasz Nowocień.
Intranet, Extranet, Firewall. Intranet and Extranet.
A Scalable Application Architecture for composing News Portals on the Internet Serpil TOK, Zeki BAYRAM. Eastern MediterraneanUniversity Famagusta Famagusta.
MCSE Guide to Microsoft Exchange Server 2003 Administration Chapter Four Configuring Outlook and Outlook Web Access.
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED.
CGW 2003 Institute of Computer Science AGH Proposal of Adaptation of Legacy C/C++ Software to Grid Services Bartosz Baliś, Marian Bubak, Michał Węgiel,
Microsoft Active Directory(AD) A presentation by Robert, Jasmine, Val and Scott IMT546 December 11, 2004.
Security in Virtual Laboratory System Jan Meizner Supervisor: dr inż. Marian Bubak Consultancy: dr inż. Maciej Malawski Master of Science Thesis.
Overview of the NorduGrid Information System Balázs Kónya 3 rd NorduGrid Workshop 23 May, 2002, Helsinki.
Robert Fourer, Jun Ma, Kipp Martin Copyright 2006 An Enterprise Computational System Built on the Optimization Services (OS) Framework and Standards Jun.
ANSTO E-Science workshop Romain Quilici University of Sydney CIMA CIMA Instrument Remote Control Instrument Remote Control Integration with GridSphere.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
SEC835 Practical aspects of security implementation Part 1.
A. Cavalli - F. Semeria INFN Experience With Globus GIS 1 A. Cavalli - F. Semeria INFN First INFN Grid Workshop Catania, 9-11 April 2001 INFN Experience.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
Engr. M. Fahad Khan Lecturer Software Engineering Department University Of Engineering & Technology Taxila.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Information System on gLite middleware Vincent.
Application code Registry 1 Alignment of R-GMA with developments in the Open Grid Services Architecture (OGSA) is advancing. The existing Servlets and.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
The Replica Location Service The Globus Project™ And The DataGrid Project Copyright (c) 2002 University of Chicago and The University of Southern California.
PwC New Technologies New Risks. PricewaterhouseCoopers Technology and Security Evolution Mainframe Technology –Single host –Limited Trusted users Security.
E-infrastructure shared between Europe and Latin America FP6−2004−Infrastructures−6-SSA gLite Information System Pedro Rausch IF.
1 Service Creation, Advertisement and Discovery Including caCORE SDK and ISO21090 William Stephens Operations Manager caGrid Knowledge Center February.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America gLite Information System Claudio Cherubino.
GraDS MacroGrid Carl Kesselman USC/Information Sciences Institute.
1 Active Directory Service in Windows 2000 Li Yang SID: November 2000.
Policy-Based Dynamic Negotiation for Grid Services Authorization Ionut Constandache, Daniel Olmedilla, Wolfgang Nejdl Semantic Web Policy Workshop, ISWC’05.
10 May 2001WP6 Testbed Meeting1 WP5 - Mass Storage Management Jean-Philippe Baud PDP/IT/CERN.
Active Directory. Computers in organizations Computers are linked together for communication and sharing of resources There is always a need to administer.
GIIS Implementation and Requirements F. Semeria INFN European Datagrid Conference Amsterdam, 7 March 2001.
April 4, 2002Atlas Testbed Workshop ATLAS Hierarchical MDS Server Patrick McGuigan.
EGEE is a project funded by the European Union under contract IST Experiment Software Installation toolkit on LCG-2
The Globus Toolkit The Globus project was started by Ian Foster and Carl Kesselman from Argonne National Labs and USC respectively. The Globus toolkit.
FESR Trinacria Grid Virtual Laboratory gLite Information System Muoio Annamaria INFN - Catania gLite 3.0 Tutorial Trigrid Catania,
EGI-InSPIRE RI EGI Webinar EGI-InSPIRE RI Porting your application to the EGI Federated Cloud 17 Feb
Aaron Corso COSC Spring What is LAMP?  A ‘solution stack’, or package of an OS and software consisting of:  Linux  Apache  MySQL  PHP.
 Cloud Computing technology basics Platform Evolution Advantages  Microsoft Windows Azure technology basics Windows Azure – A Lap around the platform.
G. Russo, D. Del Prete, S. Pardi Kick Off Meeting - Isola d'Elba, 2011 May 29th–June 01th A proposal for distributed computing monitoring for SuperB G.
Database and Cloud Security
Web Application Vulnerabilities, Detection Mechanisms, and Defenses
StratusLab Final Periodic Review
StratusLab Final Periodic Review
Platform as a Service.
Objectives Differentiate between the different editions of Windows Server 2003 Explain Windows Server 2003 network models and server roles Identify concepts.
The Application Lifecycle
The Globus Toolkit™: Information Services
Architecture Competency Group
Chapter 2: Operating-System Structures
Chapter 2: Operating-System Structures
Presentation transcript:

Italo Epicoco, Ph.D. University of Lecce, Italy iGrid: a Relational Information Service A novel resource & service discovery approach

Outline of the talk Requirements & features Technologies Architecture Security Decentralized control & fault tolerance Information providers Relational schema Web service methods Performances iGrid Public Release iGrid

GridLab Project The GridLab Project was born on the basis of two important – Co-development of grid infrastructure and grid-aware applications – Dynamic grid computing The main goal of the GridLab project is to develop innovative grid computing technologies, which can be immediately and easily adopted and exploited by applications from many different research and engineering fields.

Requirements & features

iGrid Requirements Performance Scalability Security Uniformity Expressiveness Extensibility Dynamic data Flexible access Deployability Decentralized control

iGrid Features Web Service interface built using the gSOAP Toolkit Support for Globus Toolkit GSI security mechanism – Through our GSI plug-in for gSOAP Toolkit Distributed architecture Fault tolerance Based on the PostgreSQL relational DBMS Support for TLS binding to DBMS Support for heterogeneous DBMS – Through our GRelC library Easy to extend with new information providers Support for GAS authorization service (WP6) Support for MERCURY logging service (WP11) Platforms: linux, Mac OS X, tru64, irix Extreme Performance

Technologies

iGrid Technologies gSOAP Toolkit Globus Toolkit GSI GSI plug-in for gSOAP PostgreSQL GRelC TLS libxml2

Architecture

iGrid Architecture Hierarchical, distributed architecture based on a server that can act as – iServe: collects information related to a specific computing resource – iStore: gathers the information related to registered iServes iGrid implements a push policy for data exchange – Information is extracted from resources synchronously and stored in a relational DBMS – Each user and /or service supplied information (asynchronously) is immediately stored on local DBMS and sent to iStores Information is always available and updated – On client request information is immediately retrieved from DBMS – No need for iStore to pull information from iServes

iGrid Architecture

iServe It extracts system information from local resource It stores user and/or service supplied information in the local DBMS Periodically, it collects and sends new information to remote iStores In case of iStores failure, iServe temporarily removes them from its iStore list Periodically, the iStore list is updated by adding previously removed iStore servers. The local database is dumped and sent to newly added iStores

iStore Acts as an iServe for its local resource It gathers information coming from registered and authorized iServes An iStore can be registered to other iStores, thus creating hierarchies

iGrid DBMS Maintenance Each kind of information is tagged with – creation date: it represents the UTC (GMT) time when the information has been acquired – time to live: it indicates how long information can be considered reliable iGrid removes expired information from DBMS – on each lookup, a data cleanup is performed. Stale information is never returned to clients – at startup the entire DBMS is cleaned up, removing stale information

Information dissemination The frequency of system information forwarding is based on the information itself, but we also allow defining a per information specific policy Currently, system information forwarding is based on the rate of change of the information itself. As an example, information that does not change frequently or change slowly (e.g. the amount of RAM installed) does not require a narrow update interval. Interestingly, this is true even for the opposite extreme, i.e., for information changing rapidly (e.g., CPU load), since it is extremely unlikely that continuous forwarding of this kind of information can be valuable for users, due to information becoming quickly inaccurate Finally, information whose rate of change is moderate is forwarded using narrow update intervals

Security

iGrid Security (I) Access to iGrid through a GSI enabled web service – using our GSI plug-in for gSOAP which provides: – code based on GSS APIs – mutual authentication – authorization – delegation of credentials – connection caching – support for both IPv4 and IPv6 – support for development of both clients & servers – debugging framework – extensive error reporting

iGrid Security (II) GSI protects connections – from users/services to iServes – from iServes to iStores – from iStores to hierarchical iStores TLS protects connections – from an iServe or iStore to its PostgreSQL back-end This provides mutual authentication, confidentiality and anti-tampering And prevents snooping, spoofing, tampering, hijacking and capture replay network attacks

iGrid Security (III) Authorization can take advantage of – support for administrator defined ACL – support for GridLab GAS – fully customizable authorization mechanism through a GSI callback The iGrid daemon does not need to run as a privileged user We sanitize the environment on iGrid startup All user input is validated Special attention is devoted to the prevention of SQL injection attacks – an SQL SELECT query statement is escaped as needed – the query string is validated by an SQL parser which accepts only valid SELECT statements and rejects everything else

Decentralized control & fault tolerance

iGrid Decentralized Control Decentralized control in iGrid is achieved by means of distributed iServe and iStore services Both creation and maintenance of system/user/service information are delegated to the sites where resources are actually located, and administrators fully control how their site local policies are enforced – For instance, they decide – about users/services that must be trusted for both data lookup and registration – if an iServe must register to one or more iStores – if an iStore must accept incoming data from remote iServes – the frequency of system information dissemination from iServes to iStores on a per information provider basis – etc

iGrid Fault Tolerance In case of failure of an iStore, iServes temporarily remove the faulty iStore from their registration list. Periodically, the iStore list is updated by adding previously removed iStores when iStores are available again. In this case, the local database is dumped and immediately sent to newly added iStores Moreover, because of iGrid distributed and hierarchical architecture it is possible to implement a primary/backup approach by mirroring an iStore configuring all of the iServes to publish their information simultaneously on two or more different iStores

Information providers

iGrid Information Providers (I) System information – Cpu – Memory – Network – Filesystems – Users – Certification Authorities – Resource Managers

iGrid Information Providers (II) User/Service supplied information – Services – Web services – Software – Firewalls – Virtual Organizations

Relational schema

iGrid Relational schema (I)

iGrid Relational schema (II)

iGrid Relational schema (III)

iGrid Relational schema (IV)

Web service methods

iGrid Web Service Methods SQL SELECT query – exact – range – approximate – etc Lookup – predefined set of search queries Register Unregister Update

Performances

iGrid Performances The performances of iGrid are extremely good. In what follows, we report the performance obtained for information retrieval on two testbeds There were no high speed, dedicated networks connecting the machines belonging to the testbeds, and Globus MDS2 servers were accessed without GSI authentication, while iGrid servers were accessed using GSI, and the PostgreSQL back-end using TLS The results were averaged over 50 runs

First Testbed Results, in seconds, are related to a testbed which comprises an iServe and a GRIS server hosted on one machine in Lecce, Italy, and an iStore and GIIS server hosted on another machine in Brno, Czech Republic. A query to both iStore and MDS2 GIIS was issued requesting all of the available information using clients in Lecce, Italy Search for information related to all of the resources Cache expiredCache not expired Globus MDS2 GIIS777 iStore0.6

Second Testbed Results, in seconds, are related to iStore/GIIS servers running on a testbed which comprises four machines in Italy (three in Lecce and one in Bari, a city 200 Km far from Lecce) and one located in Poznan (Poland). The clients were in Poland, and the servers in Lecce. A total of four queries were issued The first query in this set requested all of the information of each registered resource The second one requested all of the information related to the machine in Poland The third one requested only the CPU information related to each registered resource The last query was a complex query involving many filtering operations to retrieve information about attributes related to CPU, memory, network and filesystems.

Second Testbed iStore / GIIS information related to all resources information related to resource in Poland information related to CPU of all resources complex query GIIS cache expired25,8823,5222,4218,59 GISS cache not expired2,531,640,460,67 iStore3,061,010,920,67

Second testbed iServe / GRIS All of the available information CPU information GRIScache expired23,883,36 GRIS cache not expired1,050,39 iServe0,950,81

iGrid Public Release

Work in progress We are now finalizing version 2.0 Public Release We are investigating a peer-to-peer approach Additional information dissemination protocols New information providers Additional improvements

Q & A