TeraGrid Information Services John-Paul “JP” Navarro TeraGrid Grid Infrastructure Group “GIG” Area Co-Director for Software Integration and Information.

Slides:



Advertisements
Similar presentations
TeraGrid Community Software Areas (CSA) JP (John-Paul) Navarro TeraGrid Grid Infrastructure Group Software Integration University of Chicago and Argonne.
Advertisements

TeraGrid Deployment Test of Grid Software JP Navarro TeraGrid Software Integration University of Chicago OGF 21 October 19, 2007.
18 April 2002 e-Science Architectural Roadmap Open Meeting 1 Support for the UK e-Science Roadmap David Boyd UK Grid Support Centre CLRC e-Science Centre.
MTA SZTAKI Hungarian Academy of Sciences Grid Computing Course Porto, January Introduction to Grid portals Gergely Sipos
Seminar Grid Computing ‘05 Hui Li Sep 19, Overview Brief Introduction Presentations Projects Remarks.
ARCHIMÈDE Presented by Guy Teasdale Directeur, Services soutien et développement Bibliothèque de l’Université Laval CARL Workshop on Institutional Repositories.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Globus Toolkit 4 hands-on Gergely Sipos, Gábor Kecskeméti MTA SZTAKI
1-2.1 Grid computing infrastructure software Brief introduction to Globus © 2010 B. Wilkinson/Clayton Ferner. Spring 2010 Grid computing course. Modification.
Data Grids: Globus vs SRB. Maturity SRB  Older code base  Widely accepted across multiple communities  Core components are tightly integrated Globus.
Using Globus to Locate Services Case Study 1: A Distributed Information Service for TeraGrid John-Paul Navarro, Lee Liming.
4b.1 Grid Computing Software Components of Globus 4.0 ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson, slides 4b.
Grid Computing, B. Wilkinson, 20046c.1 Globus III - Information Services.
Globus Computing Infrustructure Software Globus Toolkit 11-2.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
1 Globus Developments Malcolm Atkinson for OMII SC 18 th January 2005.
Globus 4 Guy Warner NeSC Training.
Kate Keahey Argonne National Laboratory University of Chicago Globus Toolkit® 4: from common Grid protocols to virtualization.
TeraGrid’s Integrated Information Service “IIS” Grid Computing Environments 2009 Lee Liming, JP Navarro, Eric Blau, Jason Brechin, Charlie Catlett, Maytal.
GIG Software Integration: Area Overview TeraGrid Annual Project Review April, 2008.
Grid Information Systems. Two grid information problems Two problems  Monitoring  Discovery We can use similar techniques for both.
TeraGrid Information Services December 1, 2006 JP Navarro GIG Software Integration.
GIG Software Integration Project Plan, PY4-PY5 Lee Liming Mary McIlvain John-Paul Navarro.
Cancer Bioinformatics Grid (caBIG) CANS 2006 Chicago, Illinois Shannon Hastings Department of Biomedical Informatics Ohio State University.
Holding slide prior to starting show. A Grid-based Problem Solving Environment for GECEM Maria Lin and David Walker Cardiff University Yu Chen and Jason.
TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007.
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
OPEN GRID SERVICES ARCHITECTURE AND GLOBUS TOOLKIT 4
From Creation to Dissemination A Case Study in the Library of Congress’s use Open Source Software DLF Spring Forum Corey Keith
CTSS 4 Strategy and Status. General Character of CTSSv4 To meet project milestones, CTSS changes must accelerate in the coming years. Process –Process.
ANSTO E-Science workshop Romain Quilici University of Sydney CIMA CIMA Instrument Remote Control Instrument Remote Control Integration with GridSphere.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
Virtual Data Grid Architecture Ewa Deelman, Ian Foster, Carl Kesselman, Miron Livny.
TeraGrid CTSS Plans and Status Dane Skow for Lee Liming and JP Navarro OSG Consortium Meeting 22 August, 2006.
09/02 ID099-1 September 9, 2002Grid Technology Panel Patrick Dreher Technical Panel Discussion: Progress in Developing a Web Services Data Analysis Grid.
WALSAIP Portal Automated Composition of Signal Processing Operators Mariana Mendoza Botero.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Information Services Andrew Brown Jon Ludwig Elvis Montero grid:seminar1:lectures:seminar-grid-1-information-services.ppt.
GBIF Data Access and Database Interoperability 2003 Work Programme Overview Donald Hobern, GBIF Programme Officer for Data Access and Database Interoperability.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
1 Service Creation, Advertisement and Discovery Including caCORE SDK and ISO21090 William Stephens Operations Manager caGrid Knowledge Center February.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Biomedical and Bioscience Gateway to National Cyberinfrastructure John McGee Renaissance Computing Institute
Data Manipulation with Globus Toolkit Ivan Ivanovski TU München,
Rendering Syndicated Library Content in an Institutional Portal: Integrating MyLibrary into uPortal John Fereira: Cornell University Eric Lease Morgan:
Managing deployment and activation of Web Applications in a distributed e-Infrastructure EGI Technical Forum September 2011 Lyon
CTSS Version 4 User Support Documentation Mike Dwyer, Kerry Hagan, Diana Diehl.
GT3 Index Services Lecture for Cluster and Grid Computing, CSCE 490/590 Fall 2004, University of Arkansas, Dr. Amy Apon.
TeraGrid’s Common User Environment: Status, Challenges, Future Annual Project Review April, 2008.
Software Integration Highlights CY2008 Lee Liming, JP Navarro GIG Area Directors for Software Integration University of Chicago, Argonne National Laboratory.
DataGrid is a project funded by the European Commission EDG Conference, Heidelberg, Sep 26 – Oct under contract IST OGSI and GT3 Initial.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
TeraGrid Capability Discovery John-Paul “JP” Navarro TeraGrid Area Co-Director for Software Integration University of Chicago/Argonne National Laboratory.
Integrated Information Services “IIS” JP Navarro, U. of Chicago/ANL OGF 30 October 28, 2010.
Amazon Web Services. Amazon Web Services (AWS) - robust, scalable and affordable infrastructure for cloud computing. This session is about:
TeraGrid’s Process for Meeting User Needs. Jay Boisseau, Texas Advanced Computing Center Dennis Gannon, Indiana University Ralph Roskies, University of.
Amy Krause EPCC OGSA-DAI An Overview OGSA-DAI on OMII 2.0 OMII The Open Middleware Infrastructure Institute NeSC,
Portlet Development Konrad Rokicki (SAIC) Manav Kher (SemanticBits) Joshua Phillips (SemanticBits) Arch/VCDE F2F November 28, 2008.
TeraGrid Software Integration: Area Overview (detailed in 2007 Annual Report Section 3) Lee Liming, JP Navarro TeraGrid Annual Project Review April, 2008.
NSF TeraGrid Review January 10, 2006
TeraGrid Information Services
NSF TeraGrid Review January 10, 2006
TeraGrid Information Services: Building on Globus MDS4
TeraGrid Information Services Developer Introduction
Information Services Discussion TeraGrid ‘08
Viet Tran Institute of Informatics Slovakia
Presentation transcript:

TeraGrid Information Services John-Paul “JP” Navarro TeraGrid Grid Infrastructure Group “GIG” Area Co-Director for Software Integration and Information Services University of Chicago, Argonne National Laboratory GCE07 November 12, 2007

GCE 072 Grids include interconnected hardware components, coordinated software and (grid) services, and institutions and groups that operate them. To effectively use grids, users need access to information about the hardware, software, grid services,and the institutions and groups operating them. The TeraGrid's Information Services vision is to: 1)define a coordinated way for TeraGrid participants to publish about the services they offer, 2)define a way for the TeraGrid to aggregate and index the information from all TeraGrid participants, and 3)to publish this information to the public in a form that can easily be used by other software, users, and TeraGrid service providers themselves. This talk will introduce the TeraGrid's Information Service strategy, the high-level architecture, current and future content, and the methods available to users, applications, and gateways to access TeraGrid Information Services content. Abstract

November 12, 2007GCE 073 Collection of Information Grid Services Service providers publish local information TeraGrid wide aggregating/indexing for discovery Primarily focused on public information Primarily accessible thru software interfaces Using standards based interfaces Reliable, scalable, and fast Initially focused on TeraGrid information Able to include partner/community information TeraGrid Information Services Elements

November 12, 2007GCE 074 TG Information Services IS [NOT] IS NOTIS A central database (Data Warehouse)A central index/aggregation (Google) A new user interfaceA way software (user interfaces) can access information A single implementation/toolEvolving set of software tools A single software interfaceSeveral useful interfaces (small set) A specific set of informationPhased growing collection of information Changed data ownershipOwnership maintained as appropriate Way to manage scientific informationWay to manage Grid meta-data A data management system (database)An information publishing system A coordinated way to index and publish public [Tera]Grid information thru software interfaces.

November 12, 2007GCE 075 Clients High-Level Components Cache WS/REST HTTP GET WS/SOAP WS MDS4 Tomcat WebMDS Apache 2.0 TeraGrid Wide Information Services WS/SOAP WS MDS4 Service Provider Information Services TeraGrid Wide Respositories TeraGrid Wide Respositories

November 12, 2007GCE 076 Services Provider Information Services Content: Locally owned and maintained information Originates anywhere the service provider wishes Services: 1 general purpose MDS service 2 scheduling MDS services: authenticated and public (merging) TeraGrid Wide Information Services Content: Aggregate/index service provider information Additional central information (TGCDB, GIG operated services, …) Cached (service providers services can be down) Authenticated registrations Services: Several redundant servers (99.5% plus availability) Information caching (persistence) Several MDS4 services (WS/SOAP) WebMDS/Tomcat, Apache 2.0, … services (WS/REST) Content published in: HTML, XHTML/XML, XML, Atom, RSS, … Service Provider vs TG Wide Services

November 12, 2007GCE 077 WS/* (Tomcat 5.0, Apache 2.0) Benefits Very common web services platform Supports several web service interfaces (including simple) Supports multiple styles like REST, Web 2.0 Can be highly scalable Content Many formats: HTML, XHTML/XML, XML, RSS/Atom, … WebMDS (Globus 4.0.5/VDT 1.7.1) Benefits Live MDS4 content access XPath support XSLT transforms Content Many formats: HTML, XHTML/XML, XML, RSS/Atom WS/SOAP (Globus 4.0.5/VDT MDS4) Benefits Indexing, Trigger Registration, Publish, Subscribe Security/Authorization Robust WSRF interface Content XML Tools

November 12, 2007GCE 078 High-Availability Design … info.dyn.teragrid.org info.teragrid.org TeraGrid Dynamic DNS Information Services administrators select servers Changes propagate globally with a 15 minute TTL Clients Dynamically Changes Doesn’t Change Service Provider Information Services TG wide information services

November 12, 2007GCE 079 Information Services Users User Documentation User Portal Gateways Peer Grids User Applications info.teragrid.org Inca Testing Harness

November 12, 2007GCE 0710 What’s in Production? Services –TeraGrid Resource Provider Information Services –TeraGrid Wide Aggregating/Indexing Information Services Content (since when) –Scheduling information for User Portal (Spring) Scheduler load, Queue contents (restricted) –CTSS 4 capabilities kits (August) Which capability kits are available on each resource What software is available in each kit on each resource What services are available from each kit on each resource –TeraGrid Central Database (tgcdb) keys and descriptions (October)

Queue Contents in User Portal

November 12, 2007GCE 0712 CTSS 4 Capability Kits For each capability kit on each resource –Current support level, and target support level Development, Testing, Production –Support organization and contact –Inca status URL –Multiple version of a kit with different support levels

November 12, 2007GCE 0713 CTSS 4 Capability Kit Software For each kit software component on each resource –Name, version, how to access it –Multiple versions of a single component

November 12, 2007GCE 0714 CTSS 4 Capability Kit Services For each kit service on each resource –Name, type, version, and Endpoint (contact location) –GSI OpenSSH, GridFTP, SRB servers, PreWS & WS GRAM, MDS4 –Multiple services of the same type

CTSS Capability Kit Availability

Where are the GridFTP services?

November 12, 2007GCE 0717 What’s in Development? Expanded content –Local HPC Software –Extended GridFTP service information –(Meta)Scheduling support information Core Extension –Information Services Metadata (registration w/o aggregation) Information Access –tginfo, universal command line query tool –WS/REST, Web 2.0 style information access –Multiple formats: CSV TEXT, RSS/Atom, XML, … –GLUE 2.0 Community publishing –Community supported capabilities –Community information services registration

November 12, 2007GCE 0718 Accessing TeraGrid IS from software Learn what information is available – Choose your access method and client software –WS/SOAP: GT4 Java core, or client toolkit –WS/REST: Any tool that can issue HTTP GET Code TG Information Services queries –Using GT4 access (XPATH) –Using HTTP GET (Optional) Resource Selection –List of TG ResourceIDs

November 12, 2007GCE 0719 Gateways Publish or just register to TeraGrid Wide Information Services Data collections Data collections register to TeraGrid Wide Information Services Data collections access method, service Endpoint, paths Community software areas Which resources have each CSA What software is available in each CSA, how to access it Service Provider Planned and unplanned outage information Policies Peer grids/interoperability Resources, services available on peer grids (OSG, EGEE, …) ……. [Not so] Farfetched Possibilities

November 12, 2007GCE 0720 Find out more: (links to content and documentation) Request content: mailto: or Discuss Information Services content, requirements, and design: list View current Information Services content User Portal (scheduler load & queue contents): User Documentation (CTSS 4 kits, software, services): Information Service Main Page: More Information