Presentation is loading. Please wait.

Presentation is loading. Please wait.

University of California, Davis Daniel Zinn 1 University of California, Davis Daniel Zinn 1 Daniel Zinn Bertram Ludäscher University of California at Davis.

Similar presentations


Presentation on theme: "University of California, Davis Daniel Zinn 1 University of California, Davis Daniel Zinn 1 Daniel Zinn Bertram Ludäscher University of California at Davis."— Presentation transcript:

1 University of California, Davis Daniel Zinn 1 University of California, Davis Daniel Zinn 1 Daniel Zinn Bertram Ludäscher University of California at Davis Presented at IPAW 2010 Abstract Provenance Graphs: Anticipating and Exploiting Schema-Level Data Provenance Abstract Provenance Graphs: Anticipating and Exploiting Schema-Level Data Provenance

2 University of California, Davis Daniel Zinn 2 Motivation Phylogenetic Tree of Primates

3 University of California, Davis Daniel Zinn 3 Scientific Workflows Actors Channels Ports Tokens int, string, record{..}, array[..],.. SciWF = Executable specification of Scientific Method DrawTree AA-Sequences Clustal Aligned AA-Sequences Quicktree Newick Tree

4 University of California, Davis Daniel Zinn 4 Virtual Data Assembly Lines (COMAD in Kepler) Data is organized as XML-like tree structures Encapsulate actor within a configurable shell XML Three configuration parameters: Scope σ – to select scope of actor invocation Input assembler γ – to create inputs for wrapped component Write expression ω – to write results back into data stream

5 University of California, Davis Daniel Zinn 5 Workflow execution and Provenance

6 University of California, Davis Daniel Zinn 6 Ex1: Phylogenetics Workflow Configurations WorkflowGraph APG

7 University of California, Davis Daniel Zinn 7 Ex2: BUG! Idle Actor – No Input Configurations WorkflowGraph APG No input

8 University of California, Davis Daniel Zinn 8 Ex3: Bug! Wrong input Configurations WorkflowGraph APG

9 University of California, Davis Daniel Zinn 9 Bigger Picture

10 University of California, Davis Daniel Zinn 10 Types and such

11 University of California, Davis Daniel Zinn 11 In the paper…

12 University of California, Davis Daniel Zinn 12 APGs for the Three Examples Ex1: Desired result Ex2: Bug - No input Ex3: Bug - Too much input

13 University of California, Davis Daniel Zinn 13 Time-Collapsed Flowgraph Only Show Collection-Structure at the End Collapse Provenance

14 University of California, Davis Daniel Zinn 14 Structure-Collapsed Flowgraph Collapse the Collection edges

15 University of California, Davis Daniel Zinn 15 Summary Abstract Provenance Graphs … summarize potential Provenance graphs via graph homomorphisms … are constructed via static analysis of the workflow without running the WF … explain workflow’s data-flow … make it easier to spot certain configuration bugs

16 University of California, Davis Daniel Zinn 16 University of California, Davis Daniel Zinn 16


Download ppt "University of California, Davis Daniel Zinn 1 University of California, Davis Daniel Zinn 1 Daniel Zinn Bertram Ludäscher University of California at Davis."

Similar presentations


Ads by Google