Download presentation
Presentation is loading. Please wait.
Published byMiranda Farrington Modified over 9 years ago
1
UK e-Science All Hands Meeting 2005 Paul Groth, Simon Miles, Luc Moreau
2
UK e-Science All Hands Meeting 2005 Outline Process Documentation for Provenance Power of the P-Structure P-assertion Recording Protocol PReServ’s Functionality Performance Pitch
3
UK e-Science All Hands Meeting 2005 Provenance The Provenance Question –Lots of definitions… –Boil it down to a question. –What is the process that led to a particular result? How do we answer this question? –Search through documentation.
4
UK e-Science All Hands Meeting 2005 Documentation Process Documentation –encompasses all other documentation SOA based model of process Actors communicate via message passing Actors make ASSERTIONS to document process. Termed p-assertions. Three types of p-assertions
5
UK e-Science All Hands Meeting 2005 M1 M2 M3 M4 Actor 1 Actor 2 I received M1, M4 I sent M2, M3 I received M3 I sent M4 From these p-assertions, we can derive that M3 was sent by Actor 1 and received by Actor 2 (and likewise for M4) If actors are black boxes, these assertions are not very useful because we do not know dependencies between messages Interaction P-assertions
6
UK e-Science All Hands Meeting 2005 M1 M2 M3 M4 Actor 1 Actor 2 M2 is in reply to M1 M3 is caused by M1 M2 is caused by M4 M4 is in reply to M3 These assertions help identify order of messages, but not how data were computed Relationship P-assertions
7
UK e-Science All Hands Meeting 2005 f M1 M2 M3 M4 Actor 1 Actor 2 f1 f2 M3 = f1(M1) M2 = f2(M1,M4)M4 = f(M3) These assertions help identify how data is computed, but provide no information about non-functional characteristics of the computation (time, resources used, etc) Relationship P-assertions
8
UK e-Science All Hands Meeting 2005 M1 M2 M3 M4 Actor 1 Actor 2 I used 386 cluster Request sat in queue for 6min I used sparc processor I used algorithm x version x.y.z Actor State P-assertions
9
UK e-Science All Hands Meeting 2005 P-Structure
10
UK e-Science All Hands Meeting 2005 P-Structure View
11
UK e-Science All Hands Meeting 2005 Benefits Domain independent queries That are provenance specific P-structure is a shared logical organisation of p-assertions Does not prescribe how p-assertions are exactly stored in an implementation.
12
UK e-Science All Hands Meeting 2005 PReP Introduces the role of a Provenance Store –A separate conceptual entity that is responsible for maintaining process documentation PReP specifies how an actor can communicate with the Provenance Store to record p-assertions. PReP has a number of nice properties. – Statelessness – Idempotence – Termination
13
UK e-Science All Hands Meeting 2005 An Implementation What is PReServ? –A Web Services implementation of a Provenance Store –Implements PReP for recording XQuery for querying –Provides libraries and wrappers for making applications provenance aware.
14
UK e-Science All Hands Meeting 2005 Axis Handler Axis Handler Provenance Store Backend Store Interface Database Store In-Memory Store … Backend Stores PS Client Side Library PS Client Side Library Web Service WS Client Query Actor WS PS Client Side Library WS Calls Java Calls PReServ Implementation Diagram
15
UK e-Science All Hands Meeting 2005 Implementation cont. Backend Store Interface Java Object DatabaseMemory… Store Plug InQuery Plug In … Dispatcher SOAP Msg
16
UK e-Science All Hands Meeting 2005 Tuning PReServ can be tuned to fit application characteristics. When does your application need to query p-assertions? Cache and then process P-assertion Processed P-Assertions Wait X milliseconds
17
UK e-Science All Hands Meeting 2005 Requirements Apache Tomcat 5.0 Apache Ant 1.6.2 Java 1.5 (1.4 supported with some help) Pure Java, tested on –Windows –Mac OS X –Debian Linux
18
UK e-Science All Hands Meeting 2005 Evaluation Deployment Protein Compressibility Experiment –HPDC’05 Workflow runs under VMWare –deployment consistency –ease of development Workflow is executed on one machine PReServ runs on another machine
19
UK e-Science All Hands Meeting 2005 Recording Performance
20
UK e-Science All Hands Meeting 2005 Query Performance
21
UK e-Science All Hands Meeting 2005 Applications
22
UK e-Science All Hands Meeting 2005 What’s new New version of PReP –Beyond request-response message exchange patterns P-structure compatible XQuery interface Tuning Refactored code Performance tests
23
UK e-Science All Hands Meeting 2005 Conclusion The p-structure allows for domain independent, provenance specific queries using XQuery. Both recording and query times are linear with respect to experiment size. PReServ has a extensible architecture allowing for further functionality to be easily added.
24
UK e-Science All Hands Meeting 2005 Download! Try it out! Download PReServ 0.2: –The AHM release –Released under Open Source MIT License www.pasoa.org –Click software Contact us, we will try to help you make your application provenance-aware.
25
UK e-Science All Hands Meeting 2005 Configuration Redhat Linux 9.1 on VMWare on Windows XP Pentium P4 2.8 GHZ 1.5 GB RAM PReServ on another machine –Database backend Berkley JDB 100 Mb local ethernet
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.