MDS-2.1 and Futures Karl Czajkowski Information Sciences Institute University of Southern California.

Slides:



Advertisements
Similar presentations
Giggle: A Framework for Constructing Scalable Replica Location Services Ann Chervenak, Ewa Deelman, Ian Foster, Leanne Guy, Wolfgang Hoschekk, Adriana.
Advertisements

The Replica Location Service In wide area computing systems, it is often desirable to create copies (replicas) of data objects. Replication can be used.
Data Management Expert Panel - WP2. WP2 Overview.
The Anatomy of the Grid: An Integrated View of Grid Architecture Carl Kesselman USC/Information Sciences Institute Ian Foster, Steve Tuecke Argonne National.
Indications in green = Live content Indications in white = Edit in master Indications in blue = Locked elements Indications in black = Optional elements.
High Performance Computing Course Notes Grid Computing.
Connect. Communicate. Collaborate Click to edit Master title style MODULE 1: perfSONAR TECHNICAL OVERVIEW.
Italo Epicoco, Ph.D. University of Lecce, Italy iGrid: a Relational Information Service A novel resource & service discovery approach.
Condor-G: A Computation Management Agent for Multi-Institutional Grids James Frey, Todd Tannenbaum, Miron Livny, Ian Foster, Steven Tuecke Reporter: Fu-Jiun.
A Computation Management Agent for Multi-Institutional Grids
Grid Information Service (Meta-Directory Service 2) Globus Toolkit™ Developer Tutorial The Globus Project™ Argonne National Laboratory USC Information.
USING THE GLOBUS TOOLKIT This summary by: Asad Samar / CALTECH/CMS Ben Segal / CERN-IT FULL INFO AT:
 Introduction Originally developed by Open Software Foundation (OSF), which is now called The Open Group ( Provides a set of tools and.
Extensible Scalable Monitoring for Clusters of Computers Eric Anderson U.C. Berkeley Summer 1997 NOW Retreat.
Massimo Cafaro GridLab Review GridLab WP10 Information Services Massimo Cafaro CACT/ISUFI University of Lecce, Italy.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Grid Computing, B. Wilkinson, 20046c.1 Globus III - Information Services.
Windows 2000 Remote Access. Remote Access Overview With Windows 2000 remote access, remote access clients connect to remote access servers and are transparently.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Understanding Active Directory
Grid Monitoring By Zoran Obradovic CSE-510 October 2007.
Grid Toolkits Globus, Condor, BOINC, Xgrid Young Suk Moon.
Grid Information Systems. Two grid information problems Two problems  Monitoring  Discovery We can use similar techniques for both.
CGW 2003 Institute of Computer Science AGH Proposal of Adaptation of Legacy C/C++ Software to Grid Services Bartosz Baliś, Marian Bubak, Michał Węgiel,
The EU DataGrid – Information and Monitoring Services The European DataGrid Project Team
GRID Centralized management of the Globus grid-mapfile Carlo Rocca INFN, Catania.
MDS-2.1 and Futures Karl Czajkowski Information Sciences Institute University of Southern California.
OSG Public Storage and iRODS
Overview of the NorduGrid Information System Balázs Kónya 3 rd NorduGrid Workshop 23 May, 2002, Helsinki.
Grid Discovery and Monitoring Systems Laura Pearlman USC/Information Sciences Institute With materials from Ben Clifford and others from the Globus Project.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
A. Cavalli - F. Semeria INFN Experience With Globus GIS 1 A. Cavalli - F. Semeria INFN First INFN Grid Workshop Catania, 9-11 April 2001 INFN Experience.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Information System on gLite middleware Vincent.
The Anatomy of the Grid: An Integrated View of Grid Architecture Ian Foster, Steve Tuecke Argonne National Laboratory The University of Chicago Carl Kesselman.
The VEGA Approach to Grid Security Grid System Software Group, ICT, CAS Security In VEGA GOS v2 Li ZHA
1 4/23/2007 Introduction to Grid computing Sunil Avutu Graduate Student Dept.of Computer Science.
Shannon Hastings Multiscale Computing Laboratory Department of Biomedical Informatics.
Introduce Grid Service Authoring Toolkit Shannon Hastings, Scott Oster, Stephen Langella, David Ervin Ohio State University Software Research Institute.
Introduction to Grid Computing Ed Seidel Max Planck Institute for Gravitational Physics
The Replica Location Service The Globus Project™ And The DataGrid Project Copyright (c) 2002 University of Chicago and The University of Southern California.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Globus Toolkit Massimo Sgaravatto INFN Padova. Massimo Sgaravatto Introduction Grid Services: LHC regional centres need distributed computing Analyze.
Scientific Data Grid & China-VO Kai Nan Computer Network Information Center Chinese Academy of Sciences November 27, 2003.
Globus – Part II Sathish Vadhiyar. Globus Information Service.
E-infrastructure shared between Europe and Latin America FP6−2004−Infrastructures−6-SSA gLite Information System Pedro Rausch IF.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
INFSO-RI Enabling Grids for E-sciencE Ganga 4 – The Ganga Evolution Andrew Maier.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America gLite Information System Claudio Cherubino.
GraDS MacroGrid Carl Kesselman USC/Information Sciences Institute.
1 AHM, 2–4 Sept 2003 e-Science Centre GRID Authorization Framework for CCLRC Data Portal Ananta Manandhar.
Globus: A Report. Introduction What is Globus? Need for Globus. Goal of Globus Approach used by Globus: –Develop High level tools and basic technologies.
The EU DataGrid – Information and Monitoring Services The European DataGrid Project Team
Grid Information Services The Globus Project Argonne National Laboratory USC Information Sciences Instutute
April 4, 2002Atlas Testbed Workshop ATLAS Hierarchical MDS Server Patrick McGuigan.
Distributed File Systems Questions answered in this lecture: Why are distributed file systems useful? What is difficult about distributed file systems?
The Globus Toolkit The Globus project was started by Ian Foster and Carl Kesselman from Argonne National Labs and USC respectively. The Globus toolkit.
FESR Trinacria Grid Virtual Laboratory gLite Information System Muoio Annamaria INFN - Catania gLite 3.0 Tutorial Trigrid Catania,
Active Directory Domain Services (AD DS). Identity and Access (IDA) – An IDA infrastructure should: Store information about users, groups, computers and.
ACGT Architecture and Grid Infrastructure Juliusz Pukacki ‏ EGEE Conference Budapest, 4 October 2007.
A System for Monitoring and Management of Computational Grids Warren Smith Computer Sciences Corporation NASA Ames Research Center.
E-science grid facility for Europe and Latin America Updates on Information System Annamaria Muoio - INFN Tutorials for trainers 01/07/2008.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
Globus —— Toolkits for Grid Computing
Software Design and Architecture
The Globus Toolkit™: Information Services
Introduction to Apache
JINI ICS 243F- Distributed Systems Middleware, Spring 2001
Presentation transcript:

MDS-2.1 and Futures Karl Czajkowski Information Sciences Institute University of Southern California

Talk Outline l Introduction –Problem, history, etc. l MDS-2 Architecture –Protocols –Features, services l MDS-2.1 Software –Technology map l Information Model –Additional background

Resource Discovery/Monitoring l Distributed users and resources l Variable resource status l Variable grouping and connectivity R R R R R R ? ? R R R R RR R RR ? ? R R R R R dispersed users VO-AVO-B network

Basic Grid Acquisition Phases l Resource Discovery –What resources are relevant? –Bootstraps planner state l Resource Status Inquiry –How do resources compare (now)? –Refines planner knowledge l Resource Control –Did I acquire the resources? –Not part of information service!

MDS History l MDS-1 (classic) –Centralized database –Globus and earlier –Did not scale l MDS-2 –Distributed services –MDS 2.0 in Globus –New MDS 2.1 development in beta release

Base Features l Virtual Organizations (VOs) –Group together resources and users –Support community-specific discovery –Specialized views l Scalability –Many resources –Many VOs –Graceful degradation of service

Virtual Organizations l Collaborating individuals and institutions –Shared goals –Enable sharing of resources –Non-locality of participants l Dynamic in nature –VOs come and go –Resources join and leave VOs –Resources change status and fail l Community-wide goals

Scalability l Large numbers –Many resources –Many users l Independence –Resources shouldnt affect one another –VOs shouldnt affect one another l Graceful degradation of service –As much function as possible –Tolerate partitions, prune failures

New MDS-2.1 Features l Security Mechanisms –GSI mutual-authentication –Fine-grained access control by GSI name l Performance Enhancements –Better query speeds –Less stale information l New Information Model (schema) –Better representation of computers –Cleaner namespace management

MDS-2 Service Architecture l Dynamic Registration via Reg. Protocol (GRRP) l Resource Inquiry via Info. Protocol (GRIP) –Co-located with resource on network l Resource Discovery (via GRIP or other) –Using GRIP allows resource/directory hierarchy discovery (GRIP?) lookup (GRIP) registration (GRRP) VO-specific Aggregate Directories standard Resource Description services RRRR AA?

Distributed Services l Service scales with Grid growth l Loose consistency model tolerates failures l Interoperability by GRIP/GRRP protocols D D D D

Soft-state Registration l Periodic notification –Service/resource is available –Expected-frequency metadata l Automatic directory construction –Add new resources to directory –Invite resources to join new directory l Self-cleaning –Reduce occurrence of dead references

MDS-2 Implementation l Grid Information Service (GRIS) –Provides resource description –Modular content gateway l Grid Index Information Service (GIIS) –Provides aggregate directory –Hierarchical groups of resources l Lightweight Dir. Access Protocol (LDAP) –Standard with many client implementations –Used for GRIP (and GRRP currently)

MDS-2.1 Development Activities l Incorporates external advances –New OpenLDAP 2.0.x code-base –Cyrus-SASL/GSI security integration –Leveraging new Globus packaging model l Improving internal components –Better query servicing –New configuration/policy support –Invitation (reverse registration) >Not in current beta...

MDS-2.1 External Software Stack l OpenLDAP 2.0.x (.14) –Implements LDAPv3 protocol –Client and server components l Cyrus-SASL –Generic security –We provide loadable SASL/GSS plugin l Globus GSI (repackaged) –Provides GSS-API interface to PKI –Shared library used by our SASL plugin

MDS 2.1 Security l PKI authentication l Static authorization –Class, attribute, object name rules l Self authorization –Semi-dynamic rule –Requires owner attribute on objects l Dynamic authorization –Directory-based group lists (or future CAS) –Per-object access rule attributes –LDAP dynamic authorization (beta?)

MDS-2.1 Internal Software l Wrappers/tools –Simplify typical idioms l Feature-specific GRIS providers –Resource-specific information probes –Modular interface to GRIS l LDAP server backend modules –GRIS provider dispatch/caching –GIIS implementation l SASL/GSS plugin

MDS-2.1 GRIS Providers grid-info-cpu reports CPU/load info grid-info-fs reports filesystem info grid-info-mem reports RAM/VM info grid-info-net reports NIC/net info grid-info-os reports OS info grid-info-platform reports arch. info grid-info-merged merges all host info* l Extensible for other sources, e.g. GRAM

2.1 GRIS Provider Times OSRH 6.2RH7.1IrixSolaris Platform Os Cpu Mem Fs Net Merge total OLD10.01N/A

GRIS Dispatch Logic l For each provider: 1.Could search intersect provider? No, then skip. 2.Is provider cache stale? Yes, then refill. 3.Apply search filter to cache data. l Combine all intersecting providers results

GRIS Response Issues l MDS 2.0 and 2.1 are lazy –Probes are not issued unless queried l Some system probes are slow –Best probe may take several seconds l How to avoid stale data? –Clients set time-out per query –GRIS/GIIS define time-out per source –Fresh data found before timeout is returned –Cache fill continues after client time-out

Hierarchical GIIS l Maintain set of remote services –Track incoming live registrations –GRIS or GIIS registrants l Cached proxy results (now), or –Same cache logic as GRIS –Refill cache with chaining queries l LDAPv3 referral results (planned) –Do not maintain any local info cache –Redirect clients to active registrants

Extensible GIIS Framework l Modular registration actions –Re-use registration protocol decoding –Specialize directory update –e.g. prefetch indexable data l Modular query actions –Re-use query protocol decoding –Specialize query handler algorithm –e.g. utilize precomputed indices l Not in current beta…

MDS 2.1 Information Model l Structural information –Resource hierarchy maps to objects –Named positions in LDAP DIT l Merged information –Some parents join child data –Simplifies common query patterns l Auxiliary information –Uniform representation of leaf/parent data –Uses LDAP auxiliary objectclasses

GRIS Host Objects OS

GRIS Object Hierarchy Mds-Host-name=hostname Mds-Software-Deployment=operating system Mds-Device-Group-name=processors Mds-Device-name=cpu 0 Mds-Device-Group-name=memory Mds-Device-name=physical memory Mds-Device-name=virtual memory Mds-Device-Group-name=filesystems Mds-Device-name=/scratch1 Mds-Device-name=/scratch2 Mds-Device-Group-name=networks Mds-Device-name=eth0

GRIS Structural Class Hierarchy Mds Attr: Mds-validfrom (like createtime) Attr: Mds-validto (accuracy metadata) Attr: Mds-keepto (discard metadata) MdsHost MdsDevice MdsDeviceGroup MdsSoftwareDeployment l Every MDS object: name, time metadata

GRIS Auxiliary Class Examples MdsCpu Attr: Mds-Cpu-vendor Attr: Mds-Cpu-Model Attr: Mds-Cpu- speedMHz MdsCpuCache Attr: Mds-Cpu-Cache-1kB MdsCpuSmp Attr: Mds-Cpu-Smp-size MdsCpuTotal Attr: Mds-Cpu-Total-count l Once per CPU l Once per SMP l Once per MPP

GRIS Auxiliary Class Examples MdsCpuFree Attr: Mds-Cpu-Free-1min Attr: Mds-Cpu-Free-5min Attr: Mds-Cpu-Free-15min MdsCpuTotalFree Attr: Mds-Cpu-Total-Free-1min Attr: Mds-Cpu -Total-Free-5min Attr: Mds-Cpu -Total-Free-15min l once per SMP l once per MPP

More Information l Questions? l HPDC-10 Paper (August 2001) –Grid Information Services for Distributed Resource Sharing l Globus Toolkit 2.0 beta –Early access to test code