Information Services Discussion TeraGrid ‘08

Slides:



Advertisements
Similar presentations
TeraGrid Community Software Areas (CSA) JP (John-Paul) Navarro TeraGrid Grid Infrastructure Group Software Integration University of Chicago and Argonne.
Advertisements

TeraGrid Deployment Test of Grid Software JP Navarro TeraGrid Software Integration University of Chicago OGF 21 October 19, 2007.
The Biosafety Clearing-House of the Cartagena Protocol on Biosafety Tutorial – BCH Resources.
The Cactus Portal A Case Study in Grid Portal Development Michael Paul Russell Dept of Computer Science The University of Chicago
Massimo Cafaro GridLab Review GridLab WP10 Information Services Massimo Cafaro CACT/ISUFI University of Lecce, Italy.
Data Grids: Globus vs SRB. Maturity SRB  Older code base  Widely accepted across multiple communities  Core components are tightly integrated Globus.
Using Globus to Locate Services Case Study 1: A Distributed Information Service for TeraGrid John-Paul Navarro, Lee Liming.
Nikolay Tomitov Technical Trainer SoftAcad.bg.  What are Amazon Web services (AWS) ?  What’s cool when developing with AWS ?  Architecture of AWS 
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
IBM User Technology March 2004 | Dynamic Navigation in DITA © 2004 IBM Corporation Dynamic Navigation in DITA Erik Hennum and Robert Anderson.
Kate Keahey Argonne National Laboratory University of Chicago Globus Toolkit® 4: from common Grid protocols to virtualization.
TeraGrid’s Integrated Information Service “IIS” Grid Computing Environments 2009 Lee Liming, JP Navarro, Eric Blau, Jason Brechin, Charlie Catlett, Maytal.
GIG Software Integration: Area Overview TeraGrid Annual Project Review April, 2008.
TeraGrid Information Services December 1, 2006 JP Navarro GIG Software Integration.
GIG Software Integration Project Plan, PY4-PY5 Lee Liming Mary McIlvain John-Paul Navarro.
TeraGrid Information Services John-Paul “JP” Navarro TeraGrid Grid Infrastructure Group “GIG” Area Co-Director for Software Integration and Information.
TeraGrid Information Services JP Navarro, Lee Liming University of Chicago TeraGrid Architecture Meeting September 20, 2007.
CTSS 4 Strategy and Status. General Character of CTSSv4 To meet project milestones, CTSS changes must accelerate in the coming years. Process –Process.
GCMD/IDN STATUS AND PLANS Stephen Wharton CWIC Meeting February19, 2015.
HTML. Principle of Programming  Interface with PC 2 English Japanese Chinese Machine Code Compiler / Interpreter C++ Perl Assembler Machine Code.
TeraGrid CTSS Plans and Status Dane Skow for Lee Liming and JP Navarro OSG Consortium Meeting 22 August, 2006.
1/22/08 RTR Project Presentation to TPTF RTR Project Michael Daskalantonakis & Brian Cook.
Schedule Introduction to Web & Database Integration Tools and Resources HTML and Styles Forms and Client-Side Scripts DB Engines Forms Processing and Server-Side.
GBIF Data Access and Database Interoperability 2003 Work Programme Overview Donald Hobern, GBIF Programme Officer for Data Access and Database Interoperability.
AxKit A member of the Apache XML project Ryan Maslyn Kyle Bechtel.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
1 Data Architecture Strawman - Grimshaw Important points Everything is a service (object) >All have a name (EPR) and an interface (type) One or more base.
Rendering Syndicated Library Content in an Institutional Portal: Integrating MyLibrary into uPortal John Fereira: Cornell University Eric Lease Morgan:
EGEE is a project funded by the European Union under contract IST Introduction to Web Services 3 – 4 June
CTSS Version 4 User Support Documentation Mike Dwyer, Kerry Hagan, Diana Diehl.
TeraGrid’s Common User Environment: Status, Challenges, Future Annual Project Review April, 2008.
Software Integration Highlights CY2008 Lee Liming, JP Navarro GIG Area Directors for Software Integration University of Chicago, Argonne National Laboratory.
TeraGrid Capability Discovery John-Paul “JP” Navarro TeraGrid Area Co-Director for Software Integration University of Chicago/Argonne National Laboratory.
Integrated Information Services “IIS” JP Navarro, U. of Chicago/ANL OGF 30 October 28, 2010.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
TeraGrid Software Integration: Area Overview (detailed in 2007 Annual Report Section 3) Lee Liming, JP Navarro TeraGrid Annual Project Review April, 2008.
Service Oriented Architecture (SOA) Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
TeraGrid User Portal and Online Presence David Hart, SDSC Area Director, User-Facing Projects and Core Services TeraGrid Annual Review April 6, 2009.
SmartCode Brad Argue INLS /19/2001.
BUILD SECURE PRODUCTS AND SERVICES
NSF TeraGrid Review January 10, 2006
TeraGrid Information Services
Distributed Control and Measurement via the Internet
Chapter 8 Environments, Alternatives, and Decisions.
GPIR GridPort Information Repository
WWW and HTTP King Fahd University of Petroleum & Minerals
NSF TeraGrid Review January 10, 2006
Web Development Web Servers.
TeraGrid Information Services: Building on Globus MDS4
z/Ware 2.0 Technical Overview
TeraGrid Information Services Developer Introduction
Open Source distributed document DB for an enterprise
Globus —— Toolkits for Grid Computing
Unit – 5 JAVA Web Services
The Internet.
Some Basics of Globus Web Services
TeraGrid’s GLUE 2 Implementation
CASE STUDY -HTML,URLs,HTTP
Processes The most important processes used in Web-based systems and their internal organization.
Introduction to JSP Liu Haibin 12/09/2018.
CAPT One-year Review Content Access Policy and Technology Committee
Prepared for Md. Zakir Hossain Lecturer, CSE, DUET Prepared by Miton Chandra Datta
iSERVOGrid Architecture Working Group Brisbane Australia June
Patrick Dreher Research Scientist & Associate Director
The Re3gistry software and the INSPIRE Registry
Section 14.1 Section 14.2 Identify the technical needs of a Web server
The Globus Toolkit™: Information Services
Cloud Web Filtering Platform
敦群數位科技有限公司(vanGene Digital Inc.) 游家德(Jade Yu.)
Information System (BDII)
Presentation transcript:

Information Services Discussion TeraGrid ‘08 NSF TeraGrid Review January 10, 2006 Information Services Discussion TeraGrid ‘08 John-Paul (JP) Navarro TeraGrid Grid Infrastructure Group (GIG) Area Co-Director for Software Integration University of Chicago, Argonne National Laboratory June 2008 Charlie Catlett (cec@uchicago.edu)

Vision The TeraGrid's Information Services vision is to: NSF TeraGrid Review Vision January 10, 2006 The TeraGrid's Information Services vision is to: define a coordinated way for TeraGrid participants to publish what they offer users, define a way for the TeraGrid to aggregate and index the information from all TeraGrid participants, and to publish this information to the public in a form that can easily be used by other software, users, and TeraGrid service providers themselves. Our motivating vision major improvements to how TeraGrid Service Providers communicate information about their service offerings to the User Community June 2008 TeraGrid '08 Charlie Catlett (cec@uchicago.edu)

High-Level Components NSF TeraGrid Review High-Level Components January 10, 2006 TeraGrid Wide Information Services Apache 2.0 WS/REST HTTP GET Clients Cache Tomcat WebMDS TeraGrid Wide Information WS/SOAP Clients WS MDS4 Service Provider Information Services WS/SOAP Service Provider Information WS MDS4 Clients June 2008 TeraGrid '08 Charlie Catlett (cec@uchicago.edu)

High-Availability Design NSF TeraGrid Review High-Availability Design January 10, 2006 TeraGrid Wide Information Services Clients info.teragrid.org Service Provider Information Services info.dyn.teragrid.org TeraGrid Dynamic DNS This is both a high-availability and high-throughput design Server failover propagates globally in 15 minutes … Static paths Dynamic paths June 2008 TeraGrid '08 Charlie Catlett (cec@uchicago.edu)

Information Services Design Goals NSF TeraGrid Review Information Services Design Goals January 10, 2006 Applies Grid concepts to information publishing Publishing is the responsibility of the information owner Publishing is done using standard (content) schemas Publishing thru standard interfaces regardless of content and where the data comes from Publishing services should be available globally (subject to authentication/authorization) Information owners publish to EVERYONE, not just the TeraGrid Publishing is a grid service Applies Grid concepts to aggregating information Publishing aggregated information is done exactly like original information publishing Aggregation uses standard information services interfaces to retrieve information This is how a collaboration, such as the TeraGrid, aggregates participant information Applies Grid concepts to querying information Querying can use standard interfaces regardless on content Our motivating vision major improvements to how TeraGrid Service Providers communicate information about their service offerings to the User Community June 2008 TeraGrid '08 Charlie Catlett (cec@uchicago.edu)

Service Provider vs TG Wide Services NSF TeraGrid Review Service Provider vs TG Wide Services January 10, 2006 Services Provider Information Services Content: Locally owned and maintained information Originates anywhere the service provider wishes Services: 1 general purpose MDS service 1 scheduling MDS service TeraGrid Wide Information Services Aggregate/index service provider information Additional central information (TGCDB, GIG operated services, …) Cached (service providers services can be down) Authenticated registrations Several redundant servers (99.5% plus availability) Information caching (persistence) Several MDS4 services (WS/SOAP) WebMDS/Tomcat, Apache 2.0, … services (WS/REST) Content published in: HTML, XML, CVS, … June 2008 TeraGrid '08 Charlie Catlett (cec@uchicago.edu)

Tooling WS/* (Tomcat 5.0, Apache 2.0) WebMDS (Globus 4.0.x/VDT 1.7.1) NSF TeraGrid Review Tooling January 10, 2006 WS/* (Tomcat 5.0, Apache 2.0) Benefits Very common web services platform Supports several web service interfaces (including simple) Supports multiple styles like REST, Web 2.0 Can be highly scalable Content Many formats: HTML, XHTML/XML, XML, RSS/Atom, … WebMDS (Globus 4.0.x/VDT 1.7.1) Live MDS4 content access XPath support XSLT transforms Many formats: HTML, XHTML/XML, XML, RSS/Atom WS/SOAP (Globus 4.0.x/VDT 1.x.y MDS4) Indexing, Trigger Registration, Publish, Subscribe Security/Authorization Robust WSRF interface XML June 2008 TeraGrid '08 Charlie Catlett (cec@uchicago.edu)

Current Content CTSS v4 Capability Kits NSF TeraGrid Review Current Content January 10, 2006 CTSS v4 Capability Kits Services Software Site & Resource Cross-Reference Information Service identifiers TGCDB identifiers and descriptions Science Gateways Descriptive Information Local HPC Software Prototype Coordinated software and services -> CTSS Local (uncoordinated) HPC software June 2008 TeraGrid '08 Charlie Catlett (cec@uchicago.edu)

CTSS 4 Capability Kits For each capability kit on each resource NSF TeraGrid Review CTSS 4 Capability Kits January 10, 2006 For each capability kit on each resource Current support level, and target support level Development, Testing, Production Support organization and contact Inca status URL Multiple version of a kit with different support levels June 2008 TeraGrid '08 Charlie Catlett (cec@uchicago.edu)

Site & Resource Cross Reference NSF TeraGrid Review Site & Resource Cross Reference January 10, 2006 June 2008 TeraGrid '08 Charlie Catlett (cec@uchicago.edu)

Science Gateways NSF TeraGrid Review January 10, 2006 June 2008 TeraGrid '08 Charlie Catlett (cec@uchicago.edu)

Content in development NSF TeraGrid Review Content in development January 10, 2006 HPC Local Software information Extended GridFTP information (Bryon Gill and SI) TGCP configuration information Core 2.0 Resource Description Repository "RDR" (Ed Hanna) Co/Meta-scheduling information (Warren Smith) SPRUCE On-Demand information (Suman Nadella) Science Gateway software and services (Jason Reilly, John McGee) Coordinated software and services -> CTSS Local (uncoordinated) HPC software June 2008 TeraGrid '08 Charlie Catlett (cec@uchicago.edu)

Soft versus Hard State Aggregation layer Leaf layer NSF TeraGrid Review Soft versus Hard State January 10, 2006 Aggregation layer MDS default is Soft state TeraGrid customized Hard state Leaf layer Determined by the provider design Coordinated software and services -> CTSS Local (uncoordinated) HPC software June 2008 TeraGrid '08 Charlie Catlett (cec@uchicago.edu)

Open Discussion Content delivery formats NSF TeraGrid Review Open Discussion January 10, 2006 Content delivery formats XML CSV JSON Perl text tginfo universal command line tool ….. Coordinated software and services -> CTSS Local (uncoordinated) HPC software June 2008 TeraGrid '08 Charlie Catlett (cec@uchicago.edu)

Publishing New Content NSF TeraGrid Review Publishing New Content January 10, 2006 Service Provider Information Services TeraGrid Wide Information Services Requirements gathering Identify content Information ownership Information (system) sources Aggregation/refresh/caching Access requirements Content integration Is (some) content in information services How is the content indexed/mapped with other content Development Choose existing schema and/or develop new schema Use existing or develop information providers Use existing or develop aggregation/refresh/caching Use existing or develop access views/applications This is both a high-availability and high-throughput design June 2008 TeraGrid '08 Charlie Catlett (cec@uchicago.edu)

New Content Access Views NSF TeraGrid Review New Content Access Views January 10, 2006 Service Provider Information Services TeraGrid Wide Information Services Requirements gathering Identify content Query protocols Query aggregation scope Query reliability Query frequency/performance For users and/or software Development Choose existing access protocols and views Develop new access views Develop new access protocols This is both a high-availability and high-throughput design June 2008 TeraGrid '08 Charlie Catlett (cec@uchicago.edu)

New Content Aggregation/Storage NSF TeraGrid Review New Content Aggregation/Storage January 10, 2006 Service Provider Information Services TeraGrid Wide Information Services Requirements gathering Persistence Versioning Etc Development Extend existing aggregation/storage methods Develop new aggregation/storage methods This is both a high-availability and high-throughput design June 2008 TeraGrid '08 Charlie Catlett (cec@uchicago.edu)

More Information Find out more: Request content: NSF TeraGrid Review More Information January 10, 2006 Find out more: http://info.teragrid.org/ (links to content and documentation) Request content: mailto: help@teragrid.org or navarro@mcs.anl.gov Discuss Information Services content, requirements, and design: E-mail list tg-cat@teragrid.org View current Information Services content User Portal (scheduler load & queue contents): https://portal.teragrid.org:443/gridsphere/gridsphere?cid=resources User Documentation (CTSS 4 kits, software, services): http://www.teragrid.org/userinfo/software/ctss.php Information Service Main Page: http://info.teragrid.org/ June 2008 TeraGrid '08 Charlie Catlett (cec@uchicago.edu)