Using Globus to Locate Services Case Study 1: A Distributed Information Service for TeraGrid John-Paul Navarro, Lee Liming.

Slides:



Advertisements
Similar presentations
TeraGrid Deployment Test of Grid Software JP Navarro TeraGrid Software Integration University of Chicago OGF 21 October 19, 2007.
Advertisements

5/30/2012. Provides a method for finding services/data on the Exchange Network – discover data. Supports User Friendly Tools Can automatically collect.
Building an Operational Enterprise Architecture and Service Oriented Architecture Best Practices Presented by: Ajay Budhraja Copyright 2006 Ajay Budhraja,
Connect. Communicate. Collaborate Click to edit Master title style MODULE 1: perfSONAR TECHNICAL OVERVIEW.
MTA SZTAKI Hungarian Academy of Sciences Grid Computing Course Porto, January Introduction to Grid portals Gergely Sipos
Seminar Grid Computing ‘05 Hui Li Sep 19, Overview Brief Introduction Presentations Projects Remarks.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Data Grids: Globus vs SRB. Maturity SRB  Older code base  Widely accepted across multiple communities  Core components are tightly integrated Globus.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Grid Computing, B. Wilkinson, 20046c.1 Globus III - Information Services.
1 CS 502: Computing Methods for Digital Libraries Lecture 4 Identifiers and Reference Links.
ORACLE APPLICATION SERVER BY PHANINDER SURAPANENI CIS 764.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
Globus 4 Guy Warner NeSC Training.
Kate Keahey Argonne National Laboratory University of Chicago Globus Toolkit® 4: from common Grid protocols to virtualization.
January 2013 CDMI: An Introduction. Big Data Complexity Volume Speed “Big Data” refers to datasets whose size is beyond the ability of typical tools to.
Grid Monitoring By Zoran Obradovic CSE-510 October 2007.
TeraGrid’s Integrated Information Service “IIS” Grid Computing Environments 2009 Lee Liming, JP Navarro, Eric Blau, Jason Brechin, Charlie Catlett, Maytal.
TeraGrid Gateway User Concept – Supporting Users V. E. Lynch, M. L. Chen, J. W. Cobb, J. A. Kohl, S. D. Miller, S. S. Vazhkudai Oak Ridge National Laboratory.
GIG Software Integration: Area Overview TeraGrid Annual Project Review April, 2008.
TeraGrid Information Services December 1, 2006 JP Navarro GIG Software Integration.
GIG Software Integration Project Plan, PY4-PY5 Lee Liming Mary McIlvain John-Paul Navarro.
Introduction to Grid Computing Ann Chervenak and Ewa Deelman USC Information Sciences Institute.
TeraGrid Information Services John-Paul “JP” Navarro TeraGrid Grid Infrastructure Group “GIG” Area Co-Director for Software Integration and Information.
TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007.
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
A Lightweight Platform for Integration of Resource Limited Devices into Pervasive Grids Stavros Isaiadis and Vladimir Getov University of Westminster
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
From Creation to Dissemination A Case Study in the Library of Congress’s use Open Source Software DLF Spring Forum Corey Keith
CTSS 4 Strategy and Status. General Character of CTSSv4 To meet project milestones, CTSS changes must accelerate in the coming years. Process –Process.
GCMD/IDN STATUS AND PLANS Stephen Wharton CWIC Meeting February19, 2015.
SeLeNe - Architecture George Samaras Kyriakos Karenos Larnaca – April 2003 THE UNIVERSITY OF CYPRUS.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
ESP workshop, Sept 2003 the Earth System Grid data portal presented by Luca Cinquini (NCAR/SCD/VETS) Acknowledgments: ESG.
Some Grid Experiences Laura Pearlman USC Information Sciences Institute ICTP Advanced Training Workshop on Scientific Instruments on the Grid *Most of.
Application code Registry 1 Alignment of R-GMA with developments in the Open Grid Services Architecture (OGSA) is advancing. The existing Servlets and.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
1 Advanced Software Architecture Muhammad Bilal Bashir PhD Scholar (Computer Science) Mohammad Ali Jinnah University.
Shannon Hastings Multiscale Computing Laboratory Department of Biomedical Informatics.
TeraGrid CTSS Plans and Status Dane Skow for Lee Liming and JP Navarro OSG Consortium Meeting 22 August, 2006.
The Fedora Project April 28-29, 2003 CNI, Washington DC Thornton Staples University of Virginia Sandy Payette Cornell Information Science NOTE: CSG
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Known Territories l There are several ways that Globus software has been used that are well-trod paths u General-purpose Grid infrastructures u Domain-specific.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Information Services Andrew Brown Jon Ludwig Elvis Montero grid:seminar1:lectures:seminar-grid-1-information-services.ppt.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
GBIF Data Access and Database Interoperability 2003 Work Programme Overview Donald Hobern, GBIF Programme Officer for Data Access and Database Interoperability.
Evolving Interfaces to Impacting Technology: The Mobile TeraGrid User Portal Rion Dooley, Stephen Mock, Maytal Dahan, Praveen Nuthulapati, Patrick Hurley.
Registries, ebXML and Web Services in short. Registry A mechanism for allowing users to announce, or discover, the availability and state of a resource:
1 NSF/TeraGrid Science Advisory Board Meeting July 19-20, San Diego, CA Brief TeraGrid Overview and Expectations of Science Advisory Board John Towns TeraGrid.
TeraGrid Gateway User Concept – Supporting Users V. E. Lynch, M. L. Chen, J. W. Cobb, J. A. Kohl, S. D. Miller, S. S. Vazhkudai Oak Ridge National Laboratory.
2005 GRIDS Community Workshop1 Learning From Cyberinfrastructure Initiatives Grid Research Integration Development & Support
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
TeraGrid Capability Discovery John-Paul “JP” Navarro TeraGrid Area Co-Director for Software Integration University of Chicago/Argonne National Laboratory.
Integrated Information Services “IIS” JP Navarro, U. of Chicago/ANL OGF 30 October 28, 2010.
Introduction: AstroGrid increases scientific research possibilities by enabling access to distributed astronomical data and information resources. AstroGrid.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
TeraGrid Software Integration: Area Overview (detailed in 2007 Annual Report Section 3) Lee Liming, JP Navarro TeraGrid Annual Project Review April, 2008.
NSF TeraGrid Review January 10, 2006
TeraGrid Information Services
NSF TeraGrid Review January 10, 2006
TeraGrid Information Services: Building on Globus MDS4
TeraGrid Information Services Developer Introduction
Information Services Discussion TeraGrid ‘08
Open Source distributed document DB for an enterprise
The Fedora Project March 19, 2003 ISTEC Symposium, Brazil
The Anatomy and The Physiology of the Grid
The Fedora Project April 28-29, 2003 CNI, Washington DC
Presentation transcript:

Using Globus to Locate Services Case Study 1: A Distributed Information Service for TeraGrid John-Paul Navarro, Lee Liming

OSGCC 2008Globus Primer: An Introduction to Globus Software2 UCSD UT UC/ANL NCSA PSC ORNL PU IU A National Science Foundation Investment in Cyberinfrastructure $100M 3-year construction ( ) $150M 5-year operation & enhancement ( ) NSF’s TeraGrid * l TeraGrid DEEP: Integrating NSF’s most powerful computers (60+ TF) u 2+ PB Online Data Storage u National data visualization facilities u World’s most powerful network (national footprint) l TeraGrid WIDE Science Gateways: Engaging Scientific Communities u 90+ Community Data Collections u Growing set of community partnerships spanning the science community. u Leveraging NSF ITR, NIH, DOE and other science community projects. u Engaging peer Grid projects such as Open Science Grid in the U.S. as peer Grids in Europe and Asia-Pacific. l Base TeraGrid Cyberinfrastructure: Persistent, Reliable, National u Coordinated distributed computing and information environment u Coherent User Outreach, Training, and Support u Common, open infrastructure services * Slide courtesy of Ray Bair, Argonne National Laboratory

OSGCC 2008Globus Primer: An Introduction to Globus Software3 l Provide a mechanism that allows resource providers, users, and partners the ability to publish and discover information about available capabilities u What are the TG compute resources? u What capabilities does resource X provide? u Where are the login services? u Where can I get data collection Y? u Who has a queue prediction service? u Who has a weather forecasting service? l Provide a mechanism that is suitable for TeraGrid’s open community u Publishers register information (as opposed to turning it over to a central database) u Central index (like Google) enables aggregation, discovery u Multiple access interfaces (WS/SOAP, WS/ReST, browser) The Challenge

OSGCC 2008Globus Primer: An Introduction to Globus Software4 TG Information Services… …IS NOT……IS… A central database (Data Warehouse)A central index/aggregation (Google) A new user interface A way user interfaces access information A single implementation/toolIncludes several tools A single software interface Accessed using several useful interfaces A specific set of dataPhased growing set of data Changed data ownershipOwnership maintained as appropriate Way to manage scientific informationWay to manage Grid meta-data A data management system (database) An information publishing system …is a coordinated way to publish, index, and access public [Tera]Grid information using software interfaces.

OSGCC 2008Globus Primer: An Introduction to Globus Software5 Issues - Technical l Information is stored in many legacy systems u Databases (several types, restricted access) u Static & dynamic web browser interfaces l Schema are many and diverse u Impractical to design a relational database that supports all of these data types and relations l Many kinds of clients (browsers, SOAP, ReST) l High availability is critical u The service will be depended on both by TG operations (testing, documentation, planning) and by many TG users and partners, so it must be available all the time and very stable u Goal is 99.5% availability

OSGCC 2008Globus Primer: An Introduction to Globus Software6 Issues - Social l TG is a community of independent service providers u Independence is prized u Ownership of information (and its quality control) is important u Participation in other grids is typical l Publishers have low threshold for tech hassles u Publishing mechanism must be simple l The solution must add to (not replace) existing interfaces

OSGCC 2008Globus Primer: An Introduction to Globus Software7 MDS4 Overview l Components u Index Service – aggregates information and provides a query interface u Trigger Service – aggregates information and takes actions when conditions are met u WebMDS - subsets and transforms XML based on XPath queries, XSLT transforms and style sheets u Information provider APIs – integration with legacy systems u APIs and command-line clients for developers l Implemented as Web services l Uses WSRF (lifecycle, resource properties, etc.) l Included in the Globus Toolkit 4.0

OSGCC 2008Globus Primer: An Introduction to Globus Software8 The MDS4 Hourglass Schemas Information Users : Schedulers, Portals, Warning Systems, etc. Other sources of information Services (GRAM, RFT, RLS) Queueing Systems (PBS, Torque, etc.) WS standard interfaces for subscription, registration, notification

OSGCC 2008Globus Primer: An Introduction to Globus Software9 Clients TeraGrid’s IS Architecture Cache WS/REST HTTP GET WS/SOAP WS MDS4 Tomcat WebMDS Apache 2.0 TeraGrid Central Services TeraGrid Repositories Partners WS/SOAP WS MDS4 Resource Provider Services

OSGCC 2008Globus Primer: An Introduction to Globus Software10 Central vs. Distributed Services l Publisher Content u Publisher-owned and maintained information u Data probably originates somewhere in the local system l Publisher Code u An MDS4 index service u Or: Any Web service that has WS- ResourceProperties l Central Content u Aggregated publisher content l Central Code u Redundant servers u Information caching (persistence) u MDS4 index services (WS/SOAP) u WebMDS/Tomcat, Apache 2.0, … (WS/REST) u Content published in: HTML, XHTML/XML, XML, Atom, RSS, …

OSGCC 2008Globus Primer: An Introduction to Globus Software11 Registration l Publisher registers available content u Local service maintains a registration with the central indices u Registration expires automatically, so refresh is needed periodically u Publishers retain ownership and operation of their own information service (can be registered with other grids!) l Index services pull content u Registrations are subject to access control u Uses registration data to contact service and get latest content u Caches content locally, subject to purge policy u Cache allows for service outages, etc.

OSGCC 2008Globus Primer: An Introduction to Globus Software12 High-Availability Design … info.dyn.teragrid.org info.teragrid.org TeraGrid Dynamic DNS Dynamically direct clients to one or more servers Set by Information Services administrators Changes propagate globally fast (TTL = 15 minutes) Clients Dynamically Changes Doesn’t Change RP/partner services TG wide servers (Patrick Dorn & NCSA NetEng)

OSGCC 2008Globus Primer: An Introduction to Globus Software13 Information Services Users User DocumentationUser Portal Database? Gateways Peer Grids User Applications info.teragrid.org Others

OSGCC 2008Globus Primer: An Introduction to Globus Software14 Queue Contents in User Portal

OSGCC 2008Globus Primer: An Introduction to Globus Software15 Where are GridFTP services?

Where Can I Login? OSGCC 2008Globus Primer: An Introduction to Globus Software16 To login to cobalt at NCSA, ssh to grid-co.ncsa.teragrid.org! Don’t use this one (yet).

OSGCC 2008Globus Primer: An Introduction to Globus Software17 Results - TeraGrid l Considerable excitement from information owners… u A way to raise awareness for their information & capabilities u Doesn’t require them to replace legacy systems or turn information over to someone else l …and information consumers u Simple, consistent access mechanisms for lots of information types u A mediating agency for independent service operators l Integration to date: u Compute service descriptions and queue status u Software & service availability registry u Central documentation u Verification & validation testing service

MDS4 has… l WS/WSRF interface l WS/REST interface (browser-accessible) l XSLT/Xpath support l Registration, polling, subscription, notification capabilities l Index & trigger service l GLUE CE providers l Plug-in API for custom info providers MDS4 doesn’t have… l Your own custom info providers l Schema validation l Many clients (unless you count browsers!) l XSLT style examples l High-availability deployment OSGCC 2008Globus Primer: An Introduction to Globus Software18

Other Uses of MDS4 l Directory of service deployments u E.g., caGrid service registry l Monitoring/alert service u Trigger service notifies when an expected service registration isn’t there anymore l Monitoring/recording service u Subscriber periodically records value of a registered resource property (e.g., free space, services registered, system load) OSGCC 2008Globus Primer: An Introduction to Globus Software19