NextGRID & OGSA Data Architectures: Example Scenarios Stephen Davey, NeSC, UK ISSGC06 Summer School, Ischia, Italy 12 th July 2006.

Slides:



Advertisements
Similar presentations
LEAD Portal: a TeraGrid Gateway and Application Service Architecture Marcus Christie and Suresh Marru Indiana University LEAD Project (
Advertisements

© 2007Open Grid Forum GGF19, 1'st February 2007 OGSA Data Architecture Services Dave Berry & Allen Luniewski.
© 2007 Open Grid Forum Data Management Challenge - The View from OGF OGF22 – February 28, 2008 Cambridge, MA, USA Erwin Laure David E. Martin Data Area.
© 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.
18 Copyright © 2005, Oracle. All rights reserved. Distributing Modular Applications: Introduction to Web Services.
Designing Services for Grid-based Knowledge Discovery A. Congiusta, A. Pugliese, Domenico Talia, P. Trunfio DEIS University of Calabria ITALY
Abstraction Layers Why do we need them? –Protection against change Where in the hourglass do we put them? –Computer Scientist perspective Expose low-level.
TSpaces Services Suite: Automating the Development and Management of Web Services Presenter: Kevin McCurley IBM Almaden Research Center Contact: Marcus.
This product includes material developed by the Globus Project ( Introduction to Grid Services and GT3.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Distributed components
Grid Architecture: Representing NextGRID David Snelling Fujitsu Labs Europe.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation Mike Smorul, Joseph JaJa, Yang Wang, and Fritz McCall.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Data Grid Web Services Chip Watson Jie Chen, Ying Chen, Bryan Hess, Walt Akers.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
Processing of structured documents Spring 2003, Part 6 Helena Ahonen-Myka.
Chapter 7: The Object-Oriented Approach to Requirements
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
GRID job tracking and monitoring Dmitry Rogozin Laboratory of Particle Physics, JINR 07/08/ /09/2006.
Interoperability Scenario Producing summary versions of compound multimedia historical documents.
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
FESR Consorzio COMETA Grid Introduction and gLite Overview Corso di formazione sul Calcolo Parallelo ad Alte Prestazioni (edizione.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
Grid-enabling OGC Web Services Andrew Woolf, Arif Shaon STFC e-Science Centre Rutherford Appleton Lab.
Web Services based e-Commerce System Sandy Liu Jodrey School of Computer Science Acadia University July, 2002.
© 2008 Open Grid Forum Independent Software Vendor (ISV) Remote Computing Primer Steven Newhouse.
Application code Registry 1 Alignment of R-GMA with developments in the Open Grid Services Architecture (OGSA) is advancing. The existing Servlets and.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
Distributed Information Retrieval Using a Multi-Agent System and The Role of Logic Programming.
17 March 2008 © 2008 The University of Edinburgh, European Microsoft Innovation Center and University of Southampton IT Innovation Centre 1 NextGRID Security.
Author - Title- Date - n° 1 Partner Logo EU DataGrid, Work Package 5 The Storage Element.
Web: Minimal Metadata for Data Services Through DIALOGUE Neil Chue Hong AHM2007.
Lesson Overview 3.1 Components of the DBMS 3.1 Components of the DBMS 3.2 Components of The Database Application 3.2 Components of The Database Application.
An information and monitoring system for static and dynamic information about grid resources, applications, networks … RDBMS Servlet aware of API during.
Grid Services I - Concepts
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
Cole David Ronnie Julio. Introduction Globus is A community of users and developers who collaborate on the use and development of open source software,
Jini Architecture Introduction System Overview An Example.
DGC Paris WP2 Summary of Discussions and Plans Peter Z. Kunszt And the WP2 team.
Web Services An Introduction Copyright © Curt Hill.
David Adams ATLAS ATLAS distributed data management David Adams BNL February 22, 2005 Database working group ATLAS software workshop.
Intro to Web Services Dr. John P. Abraham UTPA. What are Web Services? Applications execute across multiple computers on a network.  The machine on which.
An approach to Web services Management in OGSA environment By Shobhana Kirtane.
Rights Management for Shared Collections Storage Resource Broker Reagan W. Moore
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
OGSA-DAI.
Enabling Grids for E-sciencE Agreement-based Workload and Resource Management Tiziana Ferrari, Elisabetta Ronchieri Mar 30-31, 2006.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
IPDA Registry Definitions Project Dan Crichton Pedro Osuna Alain Sarkissian.
ESRIN, 15 July 2009 Slide 1 Web Service Security support in the SSE Toolbox HMA-T Phase 2 FP 14 December 2009 S. Gianfranceschi, Intecs.
Leading the pervasive adoption of grid computing for research and industry © 2006 Global Grid Forum The information contained herein is subject to change.
Maintaining and Searching Metadata Mario Antonioletti, Shannon Hastings, Peter Kunszt, Stephen Langella, Simon Laws, Susan Malaika, Gavin McCance, Alex.
Java Web Services Orca Knowledge Center – Web Service key concepts.
Autonomic Resource Virtualization in Cloud-like Environments A
OGSA Data Architecture WG Data Transfer Discussion
OGSA Data Architecture Scenarios
GGF OGSA-WG, Data Use Cases Peter Kunszt Middleware Activity, Data Management Cluster EGEE is a project funded by the European.
The OGSA Data Architecture
OGSA Data Architecture Scenarios
Documenting ONAP components (functional)
Service Oriented Architecture (SOA)
Resource and Service Management on the Grid
RELATIONAL GRID MONITORING ARCHITECHTURE
Introduction to OGF Standards
OGSA Data Architecture
Presentation transcript:

NextGRID & OGSA Data Architectures: Example Scenarios Stephen Davey, NeSC, UK ISSGC06 Summer School, Ischia, Italy 12 th July 2006

2 Contributors & Acknowledgments This presentation is based on work by  Stephen Davey et al., “OGSA Data Scenarios” wg/docman.root.working_drafts/doc wg/docman.root.working_drafts/doc13605  Allen Luniewski, Dave Berry et al., “OGSA Data Architecture” wg/docman.root.working_drafts/doc wg/docman.root.working_drafts/doc12659 With additional thanks to  NextGRID Architecture WP1, OGSA Data Working Group.

3 Introduction - Aim & Scope These slides cover the following:  Example Data Scenarios Data Storage Data Replication Data Staging Data Pipelining  Data Components & Architectural Context NextGRID Data Architecture OGSA Data Architecture

4 Data Scenarios Purpose of the Scenarios  Example scenarios of a generic nature to accompany the OGSA Data Architecture document.  Not a use case document generating requirements for the OGSA Data Architecture.  Instead provides illustrations of how the components and interfaces described in the OGSA Data Architecture document can be put together in a selection of typical data scenarios.

5 Scenarios done so far … Data Storage – store file data in a Grid Data Service and retrieve it later. Data Replication – maintain a replica of data at a different location (for availability or performance). Data Staging – the movement of data in preparation for the performing of operations on or with this data. Data Pipelining – connect the output from one service to the input of another. To be covered next week: Data Integration – bringing the data that you require together from disparate sources. [See OGSA-DAI sessions 26, 27]. Personal Data Service – the organising of an individual’s data to allow them access to it from many different locations. [See sessions 32, 33; my Grid etc.]. Data Discovery – discover data; register data/metadata. [See Ontologies & Semantic grids sessions 32, 33].

6 Data Storage Scenario Use Case 1: Writing a file into storage 1. The customer requests file storage space on the Data Storage Service to which the file can be written. 2. The customer requests a file name (SURL) from the Data Storage Service for the given space to write a file. The Data Storage Service returns a valid SURL. 3. Using the file name, the client requests a file URL (reference) with some specific parameters (protocol, security tokens, etc) with which the file can be actually written. The Data Storage Service returns a valid Transfer URL (TURL). The TURL may also be an Access URL (i.e. for POSIX access as opposed to transfer). 4. The customer makes use of the service that supports the requested protocol to actually write the file into the given space on storage using the TURL. This may be through: a) The Data Storage Service directly, b) or the Data Access Service, c) or the Data Transfer Service. 5. The customer notifies the storage at the end of the operation that the write is complete. Data Storage Service acknowledges completion.

7 Data Storage – Writing a file Storage Devices Customer Data Storage Service Access Service Transfer Service 1. Request file space. 4a. Write file. File Space 4a. Write file. 4b. Write file. 4c. Write file. 4b. Write file. 4c. Write file. 2. Get file name (SURL). 3. Get Transfer URL (TURL) or Access URL. 5. Notify of completion.

8 Data Storage Scenario 2 Use Case 2: Make data available online. The customer has the file names for a set of files in a given space and requires that these files should be available online. 1. The files are made available online by the Data Storage Service. 2. The data are read through an appropriate interface, such as the Transfer Service. 3. The online attribute of the files may expire and they can be retired to nearline storage.

9 Data Storage – Make online Storage Devices Customer Data Storage Service Transfer Service 1. Make files online. 2. Read files. Nearline Storage Online Storage 1. Make online. 3. Retire to nearline.

10 Data Replication Scenario 1. A data resource is registered with a replicating data service (details such as creation time, access control, etc. would also be included) and replication service enters the data resource into a replica catalogue. 2. The replication service uses a data transfer service to move copies of this data to different locations and tracks which data is kept where. 3. Clients access the catalogue to find the data resource, or to return a list of resources that satisfy certain Quality of Service (QoS) requirements. 4. Clients then access the stores either directly or indirectly. 5. Changes to the data are notified to the replication service. 6. Updates then occur between the data services to synchronize the replicas.

11 Data Replication – 1 Customer 1 Data Transfer Service Replication Service Data Storage 1 Data Storage 2 Data Service 2 Data Service 1 1b. Publish 2. Transfer copies 6. Update 4. Access data 5. Notify 2. Transfer copies 2. Transfer copies Registry Service 3. Find data 1a. Register data Customer 2

12 Data Replication – 2 1. Register Customer 1 Data Transfer Service Data Storage 1 Data Storage 2 Data Service 2 Data Service 1 2. Transfer copies 6. Update 3. Find data 4. Access data 5. Notify 2. Transfer copies 2. Transfer copies Repli- cation Service Data Service Replica Catalogue Service Customer 2

13 Data Staging Scenario 1. Customer 1 submits a parameter space exploration job to the Parameter Space Exploration Service. 2. An optimized copy (bulk load) of the boundary conditions data is made from the Parameter Space Exploration Service to the Simulation Service, utilising a Data Service to assist in the extraction and transfer of the data. This step would actually have 3 parts: a) Firstly, storage space needs to be reserved through the Simulation Service with the corresponding EPR for the storage being returned to the Parameter Space Exploration Service. b) Secondly, the Parameter Space Exploration Service queries the Boundary Conditions database for the relevant data. c) Finally the Data Service bulk loads the boundary condition data to the Simulation Service. 3. The Simulation Service sets up the results database.

14 Data Staging Scenario (cont.) 4. From the parameter set the simulation jobs are generated and sent to the Simulation Service. Each of the jobs will take parameters from the parameter set database and then read the boundary condition data from the local copy of the boundary conditions database. 5. Results from the Simulation Service are stored in the results database. 6. On completion of all the generated jobs the Simulation Service’s local copy of the boundary conditions database is deleted. 7. Queries (or jobs) are used to get derivatives from the results database. 8. The simulation service returns the derived data to the consumer. 9. On completion of all queries the simulation service deletes the results set database.

15 Data Staging Parameter Space Exploration Service Boundary Conditions Simulation Service Parameter Set Boundary Conditions (copy) Results Set 1. Submit job. 6. Delete boundary condition data. 7. Query results set. 3. Set up Results DB. 2c. Bulk load boundary condition data. 4. Generated jobs from parameter set. 8. Return derived data. Customer 1 2b. Query relevant boundary conditions. 2a. Get EPR for storage & CPUs. Data Service 1 Data Service 2 5. Store results. 9. Delete Results DB.

16 Data Pipelining Scenario 1. Customer 1 (Designer) submits a rendering job to the Rendering Service. 2. Completed animation is stored to a common storage device. 3. Rendering Service transfers the completed animations (data) to the Visualization Service using the Data Transfer Service. 4. The Visualization Service displays the animations to the customers (Designer & Reviewer) in an agreed format.

17 Data Pipelining Completed Animations Visualisation Service Customer 2 1. Submit job.2. Store results. 3. Transfer results. 4. Return results. Customer 1 Data Transfer Service 3. Transfer results. Rendering Service Data Service

18 Summary of Data Components Capabilities that can be provided by the data architecture include:  Data transfer infrastructure for transferring data between services and/or resources.  Data access methods of accessing data, whether that data is stored locally or remotely.  Data location management staging, caching and replicating data resources.  Data federation integrating multiple data resources so that they can be accessed as if they were a single resource. Data description The types of data (both simple and compound) under consideration and how those types are specified. Policies quality of service (QoS), protocols and coherency conditions

19 Basic structure of a data architecture Access Sink/ Source Description Access Sink/ Source Description Transfer Registries Lookup Client APIs (non-OGSA) / Other services Interface Service Key: Data Management Other Data Services Storage Management Storage An API or service calling an interface Transfer of data between resources. Managed Storage Stored Data Resources Other Data Resources Transfer Protocols Resource A service using a resource. From: “The Open Grid Services Architecture, Version 1.6”.

20 Architectural Context NextGRID data architecture  Within framework provided by OGSA WSRF Base Profile (and built on Web Services) provides the default messaging layers and service specification languages management of distributed resources addressing notification of events Naming Registries and resource discovery Security & Trust Policies and agreements

21 NextGRID Interactions Registry Functional SLA Management Trust and Security Naming and Addressing Orchestration Register / Update Query Resolve Generate / Verify Administer policy Monitor/ Control Get tokens Negotiate SLA Invoke Get token assertions Register / Update / Query Get token assertions Get token assertions Get token assertions Schemas

22 Questions? Data Scenarios  Data Storage  Data Replication  Data Staging  Data Pipelining Data Architecture & Context