TeraGrid Information Services
JP Navarro, Lee Liming, University of Chicago
TeraGrid Architecture Meeting, September 20, 2007



Slide 2 (TeraGrid Architecture Meeting, September 2007): Grid in what sense?

People Interactions
– Identify available resources and request allocation
  – User Documentation, User Portal, POPS
– Learn how to use resources and their status
  – User Documentation, Knowledge Base
  – User Portal
  – Conferences and other events
– Ask for help
  – Telephone

(User) Software Interactions
– Standard service interfaces (login, move data, run jobs, WAN file-systems, manage data, etc.)
– Coordinated software (Grid clients, development tools, science workflow, etc.)
– Coordinated Unix interaction (standard variables, SoftEnv)

Slide 3: TeraGrid Information Services Vision

Provide an Information Services Infrastructure:
– Applying grid concepts to information publishing
– Having RP/GIG/partner operated information services
– Centrally indexed and/or aggregated for discovery
– Primarily focused on public information
– Primarily accessible through software interfaces
– Using standards-based interfaces
– Reliable, scalable, and fast
– Publishing TeraGrid information
– And partner/community information

Slide 4: TG Information Services IS [NOT]

IS NOT                                 | IS
A central database (Data Warehouse)    | A central index/aggregation (Google)
A new user interface                   | A way user interfaces access information
A single implementation/tool           | Includes several tools
A single software interface            | Accessed using several useful interfaces
A specific set of data                 | Phased growing set of data
Changed data ownership                 | Ownership maintained as appropriate
Way to manage scientific information   | Way to manage Grid meta-data
A data management system (database)    | An information publishing system

A coordinated way to index and publish public [Tera]Grid information using software interfaces.

Slide 5: High-Level Components

[Architecture diagram. Labels: Clients; WS/REST (HTTP GET); WS/SOAP; TeraGrid Wide Services (Cache, Apache 2.0, Tomcat, WebMDS, WS MDS4); TeraGrid Repositories; Partners; Resource Provider Services (WS/SOAP, WS MDS4)]

Slide 6: TeraGrid Wide vs RP Services

RP Information Services
– Content:
  – RP owned and maintained information
  – Data can originate in local systems
– Infrastructure:
  – 2 scheduling MDS services: authenticated and public (merging)
  – 1 general purpose MDS service

TeraGrid Wide Information Services
– Content:
  – Aggregate/index RP Information Services content
  – Additional central information (TGCDB, GIG operated services, …)
– Infrastructure:
  – Several redundant servers
  – Information caching (persistence)
  – Several MDS4 services (WS/SOAP)
  – WebMDS/Tomcat, Apache 2.0, … (WS/REST)
  – Content published in: HTML, XHTML/XML, XML, Atom, RSS, …
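The aggregation and caching bullets above can be illustrated with a minimal Python sketch. It assumes, hypothetically, that each RP service is represented by a fetch function and that the central service falls back to the last cached copy when an RP cannot be reached. The slide only states that content is aggregated and cached; the fallback policy and all names here are illustrative, not the TeraGrid implementation.

```python
def aggregate(providers, cache):
    """Build a central index from per-RP fetch functions.

    providers: dict mapping an RP name to a callable that returns a list
               of records and may raise if the RP is unreachable.
    cache:     dict mapping an RP name to the last successfully fetched
               records; it is updated in place.
    Both structures are hypothetical stand-ins for real services.
    """
    index = {}
    for name, fetch in providers.items():
        try:
            records = fetch()
            cache[name] = records           # remember the last good copy
        except Exception:
            records = cache.get(name, [])   # assumed policy: serve cached data
        index[name] = records
    return index
```

With this shape, an RP outage degrades the central index to stale data for that RP instead of dropping it, which is one plausible reading of "information caching (persistence)".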

Slide 7: Tools

WS/* (Tomcat 5.0, Apache 2.0)
– Benefits:
  – Very common web services platform
  – Supports several web service interfaces (including simple)
  – Supports multiple styles like REST, Web 2.0
  – Can be highly scalable
– Content:
  – Many formats: HTML, XHTML/XML, XML, Atom, RSS, …

WS/SOAP (Globus 4.0.5/VDT MDS4)
– Benefits:
  – Indexing, Trigger Registration, Publish, Subscribe
  – Security/Authorization
  – Robust WSRF interface
– Content:
  – XML

WebMDS
– Benefits:
  – XPath support
  – XSLT transforms
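To show what XPath-style selection over published XML might look like from a client, here is a small Python sketch using the standard library. The sample document, its element names, and its attribute names are invented for illustration and are not the real TeraGrid or MDS4 schema; the hosts are placeholders.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document; element and attribute names are
# illustrative only and do not reflect the actual published schema.
SAMPLE = """
<Resources>
  <Resource id="rp-a.example.org">
    <Service type="gridftp" version="4.0.5" endpoint="gsiftp://rp-a.example.org:2811"/>
    <Service type="gram" version="4.0.5" endpoint="https://rp-a.example.org:8443/wsrf/services"/>
  </Resource>
  <Resource id="rp-b.example.org">
    <Service type="gridftp" version="4.0.5" endpoint="gsiftp://rp-b.example.org:2811"/>
  </Resource>
</Resources>
"""

def service_endpoints(xml_text, service_type):
    """Return the endpoints of all services of the given type, in document order."""
    root = ET.fromstring(xml_text.strip())
    # ElementTree implements a limited XPath subset that includes
    # attribute predicates such as [@type='gridftp'].
    path = f".//Service[@type='{service_type}']"
    return [element.get("endpoint") for element in root.findall(path)]
```

Calling `service_endpoints(SAMPLE, "gridftp")` returns the two `gsiftp://` endpoints from the sample. A real client would fetch the XML from an information service rather than use an embedded string.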

Slide 8: High-Availability Design

TeraGrid Dynamic DNS
– Dynamically direct clients to one or more servers
– Set by Information Services administrators
– Changes propagate globally fast (TTL = 15 minutes)

[Diagram labels: Clients; info.teragrid.org, …; info.dyn.teragrid.org; "Dynamically Changes"; "Doesn't Change"; TG wide servers; RP/partner services; (Patrick Dorn & NCSA NetEng)]
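Since dynamic DNS may direct a client to more than one server, a client could look up every address behind the service name and try them in turn. The Python sketch below shows that idea under stated assumptions: the helper names are hypothetical, the reachability probe is left to the caller, and nothing here is taken from actual TeraGrid client code.

```python
import socket

def resolve_all(hostname, port=80, resolver=socket.getaddrinfo):
    """Return the unique IP addresses for hostname, in resolver order.

    The resolver is injectable so the function can be exercised without
    network access; by default it uses the system resolver.
    """
    addresses = []
    for _family, _type, _proto, _canon, sockaddr in resolver(
        hostname, port, proto=socket.IPPROTO_TCP
    ):
        address = sockaddr[0]
        if address not in addresses:
            addresses.append(address)
    return addresses

def first_reachable(addresses, probe):
    """Return the first address for which probe(address) is true, else None."""
    for address in addresses:
        if probe(address):
            return address
    return None

# Example (performs a real DNS lookup, so it is left commented out):
# servers = resolve_all("info.teragrid.org")
```

Because the slide gives a 15 minute TTL, a long-running client following this pattern would presumably want to re-resolve the name periodically instead of holding on to one address indefinitely.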

Slide 9: Information Services Users

[Diagram: users of info.teragrid.org]
– User Documentation (Michael, Diana)
– User Portal (Maytal & team)
– Inca (Kate & team)
– Gateways
– Peer Grids
– User Applications
– Database?

Slide 10: Completed Milestones

Infrastructure information services are in production
– RP Information Services
– TeraGrid Wide Information Services

Scheduling information published to User Portal (since Spring)
– Scheduler load
– Queue contents

CTSS 4 capabilities published publicly (since August)
– Accessed by: User Documentation, Inca system
– Content:
  – Which capability kits are available on each resource
  – What software is available in each kit on each resource
  – What services are available from each kit on each resource

Slide 11: Queue Contents in User Portal

Slide 12: Resources and Available Kits

Slide 13: Where are the GridFTP services?

Slide 14: CTSS 4 Kit Capabilities

For each kit on each resource
– Current support level, and target support level
  – Development/prototype, Testing/pre-production, Production
– Support organization
– Inca status URL
– Multiple versions of a kit with different support levels

For each kit software component on each resource
– Name, version, how to access it
– Multiple versions of a single component

For each kit service on each resource
– Name, type, version, and Endpoint (contact location)
– GSI OpenSSH, GridFTP, SRB servers, GRAM, MDS4
– Multiple services of the same type

The coordinated way TeraGrid publishes available CTSS capabilities.
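The attributes listed on this slide map naturally onto a small data model. The Python sketch below is one hypothetical way to represent them and to answer a question such as which resources offer a production GridFTP service. The class names, field names, and the lowercase service type strings are assumptions made for illustration; the published schema is not shown in the slides.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class Software:
    name: str
    version: str
    access: str          # how to access it, e.g. a SoftEnv key (assumed)

@dataclass
class Service:
    name: str
    type: str            # e.g. "gridftp", "gram" (assumed spelling)
    version: str
    endpoint: str        # contact location

@dataclass
class KitDeployment:
    """One version of one kit on one resource (hypothetical model)."""
    kit: str
    resource: str
    support_level: str           # Development, Testing, or Production
    target_support_level: str
    support_organization: str
    inca_status_url: str
    software: List[Software] = field(default_factory=list)
    services: List[Service] = field(default_factory=list)

def resources_offering(deployments, service_type, support_level="Production"):
    """Return the sorted resource names with a matching service at the given level."""
    return sorted({
        d.resource
        for d in deployments
        if d.support_level == support_level
        and any(s.type == service_type for s in d.services)
    })
```

Modelling each kit deployment separately allows several versions of the same kit, at different support levels, to coexist on one resource, as the slide requires.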

Slide 15: Open Forward Process - Requirements

Process stages: Requirements Analysis | Design & Develop | Deploy & Support

What
– Use cases (who needs it)?
– What type of information (attributes)?
– Who owns and provides the data (ownership, source)?
– Is any of it already available through information services (gap analysis)?

Who
– Everyone: users, staff, partners

Where
– Discussion:
– Analysis results: wiki documents currently linked here:

Slide 16: Open Forward Process - Develop

Process stages: Requirements Analysis | Design & Develop | Deploy & Support

What
– Register a provider and/or publish data?
– Design/adopt a schema?
– Develop publishing adapters, services.
– Choose and/or develop query interfaces.

Who
– Several groups: information users, owner, consumers, SI facilitated

Where
– Discussion:

Slide 17: Open Forward Process - Deploy

Process stages: Requirements Analysis | Design & Develop | Deploy & Support

What
– Deploy adapters or publishing services.
– Deploy query interfaces.
– Update information services documentation.

Who
– Several groups: information providers, RPs, SI

Where
– Discussion: gateways, docgroup,

Slide 18: Current Activity Areas

Deploy & Support
– Upgrade/expand/merge scheduling information services
  – Part of WS GRAM upgrade
  – Doesn’t address Scheduling WG requirements yet
– Implement 99.5% availability for TeraGrid wide information services
– Improve information services documentation

Design & Develop
– Publish TGCDB resource and organization information
– Prototyping REST/Web 2.0 interfaces:
  – Publish HTML (staff/developer views), XHTML/XML, Atom, RSS, etc.
– Prototype universal command line client

Requirements Analysis
– HPC (non-CTSS) software
– Resource hardware specifications
– Scheduling information (BQP, Scheduling WG)
– Data movement information (Data WG)
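To put the 99.5% availability target in concrete terms, the short Python sketch below converts an availability fraction into a downtime budget. The measurement periods (a 365 day year and a 30 day month) are assumptions; the slide does not say over what window availability is measured.

```python
def allowed_downtime_hours(availability, period_hours):
    """Return the downtime budget, in hours, for an availability fraction."""
    return (1.0 - availability) * period_hours

# 0.5% of a 365-day year (8760 hours) is about 43.8 hours.
per_year = allowed_downtime_hours(0.995, 365 * 24)

# 0.5% of a 30-day month (720 hours) is about 3.6 hours.
per_month = allowed_downtime_hours(0.995, 30 * 24)
```

A budget of a few hours per month leaves little room for unplanned outages on a single server, which is consistent with the redundant servers and dynamic DNS described in the earlier slides.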

Slide 19: [Not so] Farfetched Possibilities

Gateways
– User Services gateway database information
– Gateway-published capabilities, software, and services

Data collections
– User Services data collections database information
– Data collections access method, service Endpoint, paths

Community software areas
– Which resources have each CSA
– What software is available in each CSA, how to access it

Community accounts
– Which resources each account is active on

Outages
– Planned and unplanned outage information (gateways could use)

Resource Provider Policies

Peer grids/interoperability
– Resources, services available on peer grids (OSG, EGEE, …)

…….

Slide 20: More Information

Discuss Information Services content, requirements, and design:
– list

View current Information Services content
– User Portal (scheduler load & queue contents):
– User Documentation (CTSS 4 kits, software, services):
– Staff and developer WebMDS views

Useful URLs:
– Will link to other areas: Information Services activities, plan, etc.