Start smart finish wise The Kiel Marine Science Provenance- Aware Data Management Approach Peer Brauer 1, Andreas Czerniak 2, Wilhelm Hasselbring 1 1 Software Engineering Group Kiel University 2 Kiel Data Management Team, Geomar Köln,
2 Motivation
3
4 OCN Start SmartPubFlow
5 Agenda ‣ The „Start smart“ Approach ‣ The PubFlow Framework ‣ The Evaluation Scenario
6 Section 1 The „Start Smart“ Approach
7 Basic idea Before the Cruise: Describe the experiments as workflows In the field: Use predefined forms and electronic pens to write your measurements down In the lab: Extract the information of the forms automatically
8 Before the cruise
9 OCN
10 During the cruise
11 During the cruise
12 After the cruise
13 „Start Smart“ OCN
14 Section 2 The PubFlow Framework
15 What is PubFlow about? ‣ Creating a scientific workflow environment for data publication ‣ Introducing role based working models to the domain of data management ‣ Increasing the degree of automation in data management
16 Features Provenance Awareness ‣ Automatically capturing of provenance data ‣ Integrated W3C Prov-O compliant provenance archive ‣ Workflow based provenance browser
17 Features Graphical Workflow Editor ‣ Supports graphical DSLs ‣ Data managers can easily define own workflows ‣ Workflows can be transformed to selected target execution environment
18 Section 3 The Evaluation Szenario
19 Photo: NOAA
20 Evaluation Scenario
21 Evaluation Scenario
22 Evaluation Scenario OCN
23 Evaluation Scenario OCN
24 Evaluation Scenario Map to Pangaea Load from DB Open Tasks? Edit Dataset To Pangaea
25 Evaluation Scenario Photo: NOAA OCN
Evaluation Scenario
27 Conclusion „Start smart finish wise“ ‣ Our approach collects and preserves provenance information ‣ Introduces role based working models to the domain of data management ‣ Allows (semi-) automatic research data publication