The OGSA Data Architecture

Slides:



Advertisements
Similar presentations
© 2007Open Grid Forum OGF22, 25th February 2008 OGSA Data Architecture Mario Antonioletti.
Advertisements

© 2007Open Grid Forum GGF19, 1'st February 2007 OGSA Data Architecture Services Dave Berry & Allen Luniewski.
© 2007 Open Grid Forum Data Management Challenge - The View from OGF OGF22 – February 28, 2008 Cambridge, MA, USA Erwin Laure David E. Martin Data Area.
© 2006 Open Grid Forum GGF18, 13th September 2006 OGSA Data Architecture Scenarios Dave Berry & Stephen Davey.
Abstraction Layers Why do we need them? –Protection against change Where in the hourglass do we put them? –Computer Scientist perspective Expose low-level.
GT 4 Security Goals & Plans Sam Meder
Meta Data Larry, Stirling md on data access – data types, domain meta-data discovery Scott, Ohio State – caBIG md driven architecture semantic md Alexander.
Data Management Expert Panel. RLS Globus-EDG Replica Location Service u Joint Design in the form of the Giggle architecture u Reference Implementation.
User Data Convergence CT4 specifications Jean-Jacques Trottin
Internet Technologies (Grid Computing (OGSA, WSRF) )
Grid Architecture: Representing NextGRID David Snelling Fujitsu Labs Europe.
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
NextGRID & OGSA Data Architectures: Example Scenarios Stephen Davey, NeSC, UK ISSGC06 Summer School, Ischia, Italy 12 th July 2006.
Brief Overview of Major Enhancements to PAWN. Producer – Archive Workflow Network (PAWN) Distributed and secure ingestion of digital objects into the.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation Mike Smorul, Joseph JaJa, Yang Wang, and Fritz McCall.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Data Grid Web Services Chip Watson Jie Chen, Ying Chen, Bryan Hess, Walt Akers.
Grid-enabling OGC Web Services Andrew Woolf, Arif Shaon STFC e-Science Centre Rutherford Appleton Lab.
DAIS Grid1 Database Access and Integration Services on the Grid * * Authors: N. Paton, M. Atkinson, V.
Application code Registry 1 Alignment of R-GMA with developments in the Open Grid Services Architecture (OGSA) is advancing. The existing Servlets and.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
Web: Minimal Metadata for Data Services Through DIALOGUE Neil Chue Hong AHM2007.
17 March 2008Standards for Interoperable Grids 1 Data Management Standards for Interoperable Grids: Experience from NextGRID and OMII-Europe Clive Davenhall.
Web Services Presented By : Noam Ben Haim. Agenda Introduction What is a web service Basic Architecture Extended Architecture WS Stacks.
Health eDecisions Use Case 2: CDS Guidance Service Strawman of Core Concepts Use Case 2 1.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
Data Services (sample). OGSA cross WG discussion template2 Outline Requirements Key concepts/functionality Architecture/Model (if any) Services/portTypes.
Create Content Capture Content Review Content Edit Content Version Content Version Content Translate Content Translate Content Format Content Transform.
RUS: Resource Usage Service Steven Newhouse James Magowan
Authorization GGF-6 Grid Authorization Concepts Proposed work item of Authorization WG Chicago, IL - Oct 15 th 2002 Leon Gommans Advanced Internet.
© 2004 IBM Corporation ICSOC2004 Panel Discussion: Grid Systems: What is needed from web service standards? Jeffrey Frey IBM.
1 OGSA-DAI: Service Grids Neil P Chue Hong. 2 Motivation  Access to data is a necessity on the Grid  The ability to integrate different data resources.
Preservation Data Services Persistent Archive Research Group Reagan W. Moore October 1, 2003.
WP2: Data Management Gavin McCance University of Glasgow.
Enabling Grids for E-sciencE Agreement-based Workload and Resource Management Tiziana Ferrari, Elisabetta Ronchieri Mar 30-31, 2006.
NAREGI PSE with ACS S.Kawata 1, H.Usami 2, M.Yamada 3, Y.Miyahara 3, Y.Hayase 4 1 Utsunomiya University 2 National Institute of Informatics 3 FUJITSU Limited.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Leading the pervasive adoption of grid computing for research and industry © 2006 Global Grid Forum The information contained herein is subject to change.
Maintaining and Searching Metadata Mario Antonioletti, Shannon Hastings, Peter Kunszt, Stephen Langella, Simon Laws, Susan Malaika, Gavin McCance, Alex.
OGSA Data Architecture WG Data Transfer Session Allen Luniewski, IBM Dave Berry, NESC.
Update to the Community GGF16 - Athens
OGSA Information Abdeslem Djaoui OGSA Information, OGSA-WG #11
Models for Resources and Management
OGSA Information Management Services status
Grid File System Working Group
WS-Agreement Overview for OGSA
OGSA Data Architecture WG Data Transfer Discussion
OGSA Data Architecture Scenarios
GGF OGSA-WG, Data Use Cases Peter Kunszt Middleware Activity, Data Management Cluster EGEE is a project funded by the European.
Middleware independent Information Service
Bringing Grid & Web Services Together
UK e-Science OGSA-DAI November 2002 Malcolm Atkinson
Norman Paton University of Manchester
Network Requirements Javier Orellana
Grid Metadata Management
OGSA Data Architecture Scenarios
Database Access and Integration Services Working Group
Status and Future Steps
OGSA and Security Services GGF12 , September 20th, 2004 Hiro Kishimoto
Core Grid Functions: A Minimal Architecture for Grids
LOD reference architecture
OGF19 – Chapel Hill, NC, USA 30 January 2007
Introduction to Web Services
WebDAV Design Overview
Service Oriented Architecture (SOA)
Introduction to OGF Standards
OGSA Data Architecture
Grid Systems: What do we need from web service standards?
Presentation transcript:

The OGSA Data Architecture Dave Berry Allen Luniewski OGSA F2F 19th January 2006

Current Scope Files and databases (& storage) Not streams, sessions, provenance, … Services and interfaces Storage, Access, Transfer, Metadata catalogues Replication, Caching, Federation Cross-cutting themes Security, Policies, … Scenarios Including Grid file system, Data staging, Database federation, Replication, … Part of the bigger OGSA picture E.g. Naming, Workflow, Transactions, Scheduling, …

Some of the WG members Dave Berry, NeSC - Author: Overview, Cache Allen Luniewski, IBM USA - Author: Replication, Federation Stephen Davey, NeSC - Author: Services document Mark Morgan, U. Virginia - Author: Access (ByteIO) Mario Antonioletti, EPCC - Author: Access (WS-DAI) Peter Kunszt, ex-CERN - Author: Storage Management Simon Laws, IBM UK - Author: Data Description Ann Chervenak, ISI - Author: Replication Susan Malaika, IBM USA Fred Maciel, Hitachi Neil Chue Hong, EPCC Chris Jordan, SDSC

Architecture document Overview Architectural Context Requirements on OGSA Security Data Description Data Transfer Data Access Storage Resource Management Cache Services Data Replication Data Federation Catalogues Appendices: Specifications referenced Glossary

Ch 3: Architectural Context Naming (WS-A, OGSA-Naming, RNS?) Management GAP! – Need info model for data services WS-Management, WSDM – how do we use these? Security (see Chapter 4) State, Lifetime & Notification OGSA Base Profile Resource Discovery Fuzzy boundary with Information Services design team Policies and agreements Site & VO management Reservation, Scheduling & Provisioning At some point we will need to integrate with EMS Transactions (WS-Coordination, WS-CAF) GAP! - Sessions

Things to name… activities metadata roles caches namespaces schemas catalogues naming schemes schema mappings content identifiers networks security contexts data bytes people security tokens data formats policies service level agreements data streams queries service types database tables query result row sets services databases references storage (space) file directories registries times file locations replicas transactions files repositories transformations identities resolvers transport protocols languages resource locations user defined entities locales resources vocabularies

Ch 4: Security Discusses requirements beyond simple AAA Legal requirements for security and privacy GAP! – Security policies GAP! – Attaching security policies to data in motion GAP? – Geographical location of requester and resource GAP? – Reason for access GAP? – Authorisation of sequences of access requests GAP? – Authorisation based on previous requests WGs: AuthZ, OGSA

Ch 5: Data Description Format description Resource description Static formats (e.g. XML) Dynamic formats (e.g. an XML schema) GAP! - URIs to name data formats Resource description “Data resource” = source/sink of data Set of properties May be managed by a service May be stored in a catalogue GAP? – Set of generic properties for data resources QoS policies Service description WSDL List of data resources? Management & control information? GAP! – Information model WGs: DFDL? CIM?

Ch 6: GAP! - Data Transfer Sources & sinks that are data-type agnostic Must allow low-level optimisations, e.g. at the storage level Must allow different protocols URIs to name protocols Separation of access protocols and transfer protocols? Basic level Single point -> single point, bytes only Policies: performance, scheduling, robustness, … Higher levels (for future work) Broadcast Encryption Format translation WGs: DMIS (BOF)?

Ch 7: Data Access Selecting and consuming data WS-DAI (WS-DAIR, WS-DAIX, WS-DAIRDF) ByteIO (RandomByteIO, StreamableByteIO) Direct access (Request/Response) Indirect access (Request -> new EPR) WS-DAI only GAP! - Third-party delivery Integration with transfer GAP! - URIs for query languages and access mechanisms WS-Enumeration? WGs: DAIS, ByteIO

Ch 8: Storage Resource Management Storage Properties Space types (file space, raw, streaming) Data retention (volatile, permanent, durable) Quotas Interaction of storage and transfer Policies: performance, availability, resilience, … Use access & transfer interfaces to use the storage allocated GAP! – interfaces for the different space types WGs: GSM, SNIA?, IETF?

Ch 9: Cache Services Option 1: Managed by client Remote resource doesn’t know about it Option 2: Composed with resource Transparent to client GAP! – Management Interface Policies: coherence WGs: None

Ch 10: Data Replication Creating replicas Discovering replicas Validation of registered replicas Consistency GAP! – consistency policies Managing replicas Supports files and databases GAP! – more detail about these interfaces WGs: OREP?

Ch 11: Data Federation General description Creation of federations Hard to standardise detailed capability Creation of federations Expansion / Contraction Add / Remove input sources Add / Remove access mechanisms GAP! - Federation policies Security considerations GAP! – properties that describe a source’s access capabilities WGs: None

Ch 12: “Metadata” Catalogues Publish, Update, Classify, Augment, Delete Find, Subscribe XML format, Xpath/Xquery Consistency management GAP! - Consistency policies Duplicating Information Services DT work? WGs: ?

Scenarios document Introduction Data Replication Data Transfer Data Integration Data Staging Personal Data Profile Data Discovery Data Storage Data Federation

GAP! - Grid file system scenario Where to fit into the architecture? Probably a combination of human-readable names & data transfer Left-field possibility: a query language for selecting files from a file system GAP! – File description metadata Also useful for file replication WGs: GFS?

General issues Language for information documents? Description vs. MUST/SHOULD Present vs. future tense Format for interface descriptions Policy languages, ontologies Working groups needed Should OGSA-D address any gaps ourselves? Reaffirm policy that recommendation documents are written by dedicated WGs

Next steps Currently GGF16, Athens Meanwhile… Revising both documents for consistency Gap analysis GGF16, Athens Present documents to wider community Discussions with GFS Aim for submission soon after Meanwhile… DAIS, ByteIO already submitted