OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton.

Slides:



Advertisements
Similar presentations
Enabling and Supporting Provenance in e-Science Applications Luc Moreau University of Southampton
Advertisements

IETF Trade Working Group January 2000 XML Messaging Overview January 2000.
Provenance: concepts, architecture and envisioned tools Professor Luc Moreau University of Southampton
UK e-Science All Hands Meeting 2005 Paul Groth, Simon Miles, Luc Moreau.
Architecture Tutorial Summary and Conclusions. Architecture Tutorial The Provenance Architecture.
Principles of Personalisation of Service Discovery Electronics and Computer Science, University of Southampton myGrid UK e-Science Project Juri Papay,
Condor-G: A Computation Management Agent for Multi-Institutional Grids James Frey, Todd Tannenbaum, Miron Livny, Ian Foster, Steven Tuecke Reporter: Fu-Jiun.
Introduction and Overview “the grid” – a proposed distributed computing infrastructure for advanced science and engineering. Purpose: grid concept is motivated.
Distributed Service Architectures Yitao Duan 03/19/2002.
Business Process Orchestration
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
Workshop on Cyber Infrastructure in Combustion Science April 19-20, 2006 Subrata Bhattacharjee and Christopher Paolini Mechanical.
Provenance Challenges and Technologies for Grids Luc Moreau University of Southampton
Globus Ian Foster and Carl Kesselman Argonne National Laboratory and University of Southern California
GRID COMPUTING & GRID SCHEDULERS - Neeraj Shah. Definition A ‘Grid’ is a collection of different machines where in all of them contribute any combination.
Implementation and Evaluation of a Protocol for Recording Process Documentation in the Presence of Failures Zheng Chen and Luc Moreau
UK e-Science All Hands Meeting 2005 Paul Groth, Simon Miles, Luc Moreau.
E-Science Meeting March Trusted Coordination in Dynamic Virtual Organisations Santosh Shrivastava School of Computing Science Newcastle University,
The GRIMOIRES Service Registry Weijian Fang and Luc Moreau School of Electronics and Computer Science University of Southampton.
London e-Science Centre Imperial College London Making the Grid Pay Economic Services - Pricing and Payment William Lee.
Software Engineering General architecture. Architectural components:  Program organisation overview Major building blocks in a system Definition of each.
Usage of `provenance’: A Tower of Babel Luc Moreau.
Provenance Aware Service Oriented Architecture (1 year on) Professor Luc Moreau University of Southampton
Miguel Branco CERN/University of Southampton Enabling provenance on large-scale e-Science applications.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
Towards a Universal Client for Grid Monitoring Systems Towards a Universal Client for Grid Monitoring Systems Design and Implementation of the Ovid Browser.
Progress SOA Reference Model Explained Mike Ormerod Applied Architect 9/8/2008.
XML Web Services Architecture Siddharth Ruchandani CS 6362 – SW Architecture & Design Summer /11/05.
1 OSG Accounting Service Requirements Matteo Melani SLAC for the OSG Accounting Activity.
An Ontological Framework for Web Service Processes By Claus Pahl and Ronan Barrett.
Security Issues in a SOA- based Provenance System Victor Tan, Paul Groth, Simon Miles, Sheng Jiang, Steve Munroe, Sofia Tsasakou and Luc Moreau PASOA/EU.
Jamie Hall (ILL). SciencePAD Persistent Identifiers Workshop PANData Software Catalogue January 30th 2013 Jamie Hall Developer IT Services, Institut Laue-Langevin.
Grid Computing & Semantic Web. Grid Computing Proposed with the idea of electric power grid; Aims at integrating large-scale (global scale) computing.
GRID ARCHITECTURE Chintan O.Patel. CS 551 Fall 2002 Workshop 1 Software Architectures 2 What is Grid ? "...a flexible, secure, coordinated resource- sharing.
Grid Services I - Concepts
The Globus Toolkit®: The Open Source Solution for Grid Computing
Folie 1 Analysis of SM-Exchange Protocol using SM&C MAL DLR/GSOC Author: S.Gully.
Replica Consistency in a Data Grid1 IX International Workshop on Advanced Computing and Analysis Techniques in Physics Research December 1-5, 2003 High.
Introduction to Semantic Web Service Architecture ► The vision of the Semantic Web ► Ontologies as the basic building block ► Semantic Web Service Architecture.
July 27, 2005High Performance Distributed Computing 05 Recording and Using Provenance in a Protein Compressibility Experiment Paul Groth, Simon Miles,
Shuman Guo CSc 8320 Advanced Operating Systems
Scalable Grid system– VDHA_Grid: an e-Science Grid with virtual and dynamic hierarchical architecture Huang Lican College of Computer.
Formalising a protocol for recording provenance in Grids Paul Groth – University of Southampton.
Recording the Context of Action for Process Documentation Ian Wootten Cardiff University, UK
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
ICOM TC Charter TC’s Scope –Specify the normative standards for collaboration objects, along with their attributes, relationships, constraints, and behavior,
Recording Actor Provenance in Scientific Workflows Ian Wootten, Shrija Rajbhandari, Omer Rana Cardiff University, UK.
A Mediated Approach towards Web Service Choreography Michael Stollberg, Dumitru Roman, Juan Miguel Gomez DERI – Digital Enterprise Research Institute
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
GESA: Grid Economic Services Architecture Steven Newhouse Jon MacLaren
Rights Management for Shared Collections Storage Resource Broker Reagan W. Moore
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Provenance in Distr. Organ Transplant Management EU PROVENANCE project: an open provenance architecture for distributed.
F2F April 7, Enumeration & Connection Management presented by Chris Pane.
Open Science Grid OSG Accounting System Matteo Melani SLAC 9/28/05 Joint OSG and EGEE Operations Workshop.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Software Architecture Patterns (3) Service Oriented & Web Oriented Architecture source: microsoft.
Fault Tolerance in Distributed Systems Gökay Burak AKKUŞ Cmpe516 – Fault Tolerant Computing.
Trusted CoordinationTAPAS Workshop, 25-26/09/031 Building Blocks for Trusted Coordination Nick Cook University of Newcastle.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Recording and Reasoning Over Data Provenance in Web and Grid Services Martin Szomszor and Luc Moreau University of Southampton.
Provenance: Problem, Architectural issues, Towards Trust
Globus —— Toolkits for Grid Computing
Grid Metadata Management
University of Technology
Grid Computing B.Ramamurthy 9/22/2018 B.Ramamurthy.
Replication Middleware for Cloud Based Storage Service
1st International Conference on Semantics, Knowledge and Grid
Implementing Consistency -- Paxos
ICOM TC Charter TC’s Scope Out of TC’s Scope Call for Participation
Presentation transcript:

OPODIS'04 A protocol for recording provenance in service-oriented Grids Paul Groth, Michael Luck, Luc Moreau University of Southampton

OPODIS'04 Or…How to show your work. In a Grid

OPODIS'04 Contents 1. What is Provenance and why you should care 2. The Grid and Provenance 3. An Architectural Vision 4. PReP 6. What’s next 7. Conclusion

OPODIS'04 A Definition Main Entry: prov·e·nance Pronunciation: 'präv-n&n(t)s, 'prä-v&-"nän(t)s Function: noun Etymology: French, from provenir to come forth, originate, from Latin provenire, from pro- forth + venire to come Date: : ORIGIN, SOURCE 2 : the history of ownership of a valued object or work of art or literature The provenance of something is the documentation of process that led to that something i.e. showing your work

OPODIS'04 The importance of provenance Process is IMPORTANT Art Wine Drug Discovery Financial Auditing Aerospace …

OPODIS'04 The Grid The Grid problem is defined as coordinated resource sharing and problem solving in dynamic, multi- institutional virtual organisations [Foster, Kesselman, Tuecke].

OPODIS'04 … and the Provenance Problem Given a set of services in an open grid environment that decide to form a virtual organisation with the aim to produce a given result; How can we determine the process that generated the result, especially after the virtual organisation has been disbanded?

OPODIS'04 Provenance Problem cont. Provenance recording should be part of the infrastructure Currently, the Web Services protocol stack and the Open Grid Services Architecture do not provide any support for recording provenance. Methods are generally adhoc and do not interoperate.

OPODIS'04 More Definitions Provenance The provenance of some data is the documentation of process that led to the data. Interaction Provenance For some data, interaction provenance is the documentation of interactions between actors that led to the data. Actor Provenance For some data, actor provenance is documentation that can only be provided by a particular actor pertaining to the process that led to the data.

OPODIS'04 An Architecture

OPODIS'04 Motivating Requirements Verifiability Accountability Reproducibility Repeat Reproduce Preservation Scalability Generality Customisability

OPODIS'04 An Architecture with Provenance Support

OPODIS'04 PReP- Provenance Recording Protocol client service invocation result record invocation and result record invocation and result negotiate Why record 2 views? Provenance Store Provenance Store Provenance Store

OPODIS'04 Multiple Provenance Stores clientservice invocation result Provenance Store invocation and result record invocation and result record clientservice invocation result Provenance Store invocation and result record invocation and result record clientservice invocation result Provenance Store invocation and result record invocation and result record Provenance stores may be shared or different

OPODIS'04 Linking Stores clientservice invocation result Provenance Store invocation and result record invocation and result record clientservice invocation result Provenance Store invocation and result record invocation and result record clientservice invocation result Provenance Store invocation and result record invocation and result record Link

OPODIS'04 PReP in detail Model PReP using asynchronous message passing. Maps well to any implementation Helpful for scalability Four Phase Protocol Negotiation Invocation Provenance Recording Termination

OPODIS'04 PReP’s messages Propose Reply Invoke Result Record Negotiation Record Invocation Record Result Submission Finished Additional Provenance Record Negotiation Ack Record Invocation Ack Record Result Ack Submission Finished Ack Additional Provenance Ack

OPODIS'04 PReP’s messages Propose Reply Invoke Result Record Negotiation Record Invocation Record Result Submission Finished Additional Provenance Record Negotiation Ack Record Invocation Ack Record Result Ack Submission Finished Ack Additional Provenance Ack Used for connecting provenance stores and for recording provenance about actors.

OPODIS'04 Provenance Store – An abstract state machine Formalise the protocol by formalising the individual entities in the protocol Know exactly how the Provenance Service responds to receipt of messages Use to show a liveness property Something good will eventually happen

OPODIS'04 Client and Service State transition diagram Cannot formalise internals, only the response to PReP Show Termination Property

OPODIS'04

What’s next PReP Version 2 Greater Flexibility In configuration and recorded info Security non-repudiation mutual authentication Fault Tolerant Specification What if you have a lot of data? Look at scalability

OPODIS'04 Conclusion Provenance is important Provenance recording must be part of the infrastructure. Standards. PReP provides is a first step. An implementation (PReServ) is available.

OPODIS'04 Where you can download PReServ The End