Scufl2 – because a workflow is more than its definition

Slides:



Advertisements
Similar presentations
1 RDF Tools Brian Kelly UKOLN University of Bath Bath, BA2 7AY UKOLN is funded by the British Library Research and Innovation Centre,
Advertisements

IMPACT is supported by the European Community under the FP7 ICT Work Programme. The project is coordinated by the National Library of the Netherlands.
RDF formats for Linked Data by Mabi Harandi. RDF is not a format, it is a model for data So: It will provide supports for different formats like :  Turtle.
E-Science Data Information and Knowledge Transformation The BinX Language.
Enabling Academic Research: Office Add-ins Alex Wade Director – Scholarly Communication Microsoft External Research.
Dr. Alexandra I. Cristea RDF.
28/1/2001 Seminar in Databases in the Internet Environment Introduction to J ava S erver P ages technology by Naomi Chen.
Migrating to EPiServer CMS 5 Johan Björnfot -
4/20/2017.
JSP Standard Tag Library
Semantic Computing Research Group University of Helsinki 1 SemCocoon – A Semantic Cocoon OntoView – Portal System.
Taverna and my Grid Basic overview and Introduction Tom Oinn
14/11/11 Taverna Roadmap Shoaib Sufi myGrid Project Manager.
Copyright © Orbeon, Inc. All rights reserved. Erik Bruchez Applications of XML Pipelines XML Prague, June 16 th, 2007.
A presentation by Robin Upton ( ) ‏ latest version at Attribution – NonCommercial - ShareAlike FF 404.
Copyright © IBM Corp., All rights reserved. This presentation is licensed under Creative Commons Att. Nc Nd 2.5 license. OpenDocument Format.
Taverna and my Grid Open Workflow for Life Sciences Tom Oinn
MyExperiment Research Objects: Beyond Workflows and Packs Stian Soiland-Reyes myGrid, University of Manchester BOSC 2013, ISMB, Berlin, This.
1 Chapter 10: XML What is XML What is XML Basic Components of XML Basic Components of XML XPath XPath XQuery XQuery.
Wf4Ever: Annotating research objects Stian Soiland-Reyes, Sean Bechhofer myGrid, University of Manchester Open Annotation Rollout, Manchester,
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik materials by: Katy Wolstencroft University of Manchester.
Managing and communicating uncertainty in geospatial web service workflows Richard Jones, Dan Cornford, Lucy Bastin, Matthew Williams Computer Science,
Taverna Platform What is done to run a workflow?.
Web: Minimal Metadata for Data Services Through DIALOGUE Neil Chue Hong AHM2007.
Technology behind using Taverna in caGrid caGrid user meeting Stian Soiland-Reyes, myGrid University of Manchester, UK
Semantic Technologies and Application to Climate Data M. Benno Blumenthal IRI/Columbia University CDW /04-01.
Stian Soiland-Reyes myGrid, School of Computer Science University of Manchester, UK UKOLN DevSci: Workflow Tools Bath,
SEE-GRID-SCI The SEE-GRID-SCI initiative is co-funded by the European Commission under the FP7 Research Infrastructures contract no.
CS 157B: Database Management Systems II February 11 Class Meeting Department of Computer Science San Jose State University Spring 2013 Instructor: Ron.
A Practical Approach to Metadata Management Mark Jessop Prof. Jim Austin University of York.
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
© 2006 Altova GmbH. All Rights Reserved. Altova ® Product Line Overview.
Application Report: An extensible policy editing API for privacy and identity management policies Giles Hogben jrc. It European Commission.
IRS Tax Map Electronic Research Tool David Brown Internal Revenue Service Media and Publications Division David Brown Internal Revenue Service Media and.
SHIWA Desktop Cardiff University, Budapest, 3 rd July 2012.
Practical RDF Chapter 3. The Basic Elements Within the RDF/XML Syntax Shelley Powers, O’Reilly SNU IDB Lab. Hyewon Kim.
Jackson, Web Technologies: A Computer Science Perspective, © 2007 Prentice-Hall, Inc. All rights reserved Chapter 7 Representing Web Data:
The Virtual Observatory and Ecological Informatics System (VOEIS): Using RESTful architecture and an extensible data model to provide a unique data management.
Lecture Transforming Data: Using Apache Xalan to apply XSLT transformations Marc Dumontier Blueprint Initiative Samuel Lunenfeld Research Institute.
Semantic Web In Depth Resource Description Framework Dr Nicholas Gibbins –
Infrastructure and Workflow for the Formal Evaluation of Semantic Search Technologies Stuart N. Wrigley 1, Raúl García-Castro 2 and Cassia Trojahn 3 1.
1 RIC 2009 Symbolic Nuclear Analysis Package - SNAP version 1.0: Features and Applications Chester Gingrich RES/DSA/CDB 3/12/09.
Improving ODF applications by sharing ODF tests Svante Schubert Software Engineer Sun Microsystems Inc.
Data and tools on the Web have been exposed in a RESTful manner. Taverna provides a custom processor for accessing such services.
Research Objects Preserving scientific data and methods Stian Soiland-Reyes, Khalid Belhajjame School of Computer Science, Univ of Manchester myGrid NIHBI.
Eclipse Vorto Alexander Edelmann.
OCF Data Model Michael J Koster.
VNF Package CSAR Format Tal Halfon, Amdocs Andrei Kojukhov, PhD, Amdocs Aug 3, 2017.
ODF API - ODFDOM Svante Schubert Software Engineer
Unit 4 Representing Web Data: XML
VNF Package CSAR for ONAP Release 2 – adding Telco grade features Andrei Kojukhov, PhD, Amdocs Oct 6, 2017.
Resource Description Framework
What can provenance do for me?
Alan Williams, Donal Fellows, Finn Bacall,
WEB API.
Chapter 7 Representing Web Data: XML
TOSCA-Metadata (directory )
A DDI3.2 Style for Data and Metadata Extracted from SAS
Chapter 9 Web Services: JAX-RPC, WSDL, XML Schema, and SOAP
Automation with Gwen Introduction.
Henning Schulzrinne Dept. of Computer Science Columbia University
TOSCA-Metadata (directory )
CSSSPEC6 SOFTWARE DEVELOPMENT WITH QUALITY ASSURANCE
Automated Bulk Signing Solution
Taverna workflow management system
敦群數位科技有限公司(vanGene Digital Inc.) 游家德(Jade Yu.)
REST Services Data and tools on the Web have been exposed in both WSDL and REST. Taverna provides a custom processor for accessing REST services Peter.
Consuming Web Services with 2E Generated Objects
VNF Package CSAR Format Tal Halfon, Amdocs Andrei Kojukhov, PhD, Amdocs Aug 3, 2017.
WCF Data Services and Silverlight
Presentation transcript:

Scufl2 – because a workflow is more than its definition http://www.taverna.org.uk/ Scufl2 – because a workflow is more than its definition Project site: http://www.taverna.org.uk/ Source code: https://github.com/mygrid/scufl2 http://taverna.googlecode.com/ License: GNU Lesser General Public License (LGPL) 2.1 Stian Soiland-Reyes, Alan R Williams, Stuart Owen, David Withers and Carole Goble School of Computer Science, University of Manchester, UK {stian.soiland-reyes, alan.r.williams, stuart.owen, david.withers, carole.a.goble}@manchester.ac.uk Taverna’s future workflow and data format Goal: Simplify third-party reading, writing, annotating and extending Scufl2: specification, schema, ontology, Java API and conversion tool Alpha “Structured” zip-file (can be unpacked to be exposed on the web) Self-documenting media type (OpenOffice ODF, ePub OCF, Adobe UCF) – for tools like file(1) and mime magic Root file: Primary document of the bundle (ePub OCF/Adobe UCF). Alternative representations of same workflow bundle allowed (Turtle, JSON, HTML, etc) the-workflow-bundle.scufl2 application/vnd.taverna.scufl2.workflow-bundle mimetype Manifest listing all resources in bundle with media types <manifest:manifest xmlns:manifest="urn:oasis:names:tc:opendocument:xmlns:manifest:1.0">  <manifest:file-entry manifest:media-type="application/vnd.taverna.scufl2.workflow-bundle" manifest:full-path="/"/>  <manifest:file-entry manifest:media-type="application/rdf+xml" manifest:full-path="workflowBundle.rdf"/> <!– Standard resources -->  <manifest:file-entry manifest:media-type="application/rdf+xml" manifest:full-path="workflow/HelloWorld.rdf"/>  <manifest:file-entry manifest:media-type="application/rdf+xml" manifest:full-path="profile/tavernaWorkbench.rdf"/>  <manifest:file-entry manifest:media-type="application/rdf+xml" manifest:full-path="profile/tavernaServer.rdf"/>  <manifest:file-entry manifest:media-type="application/rdf+xml" manifest:full-path="annotation/user_annotations.rdf"/> <!– Any additional resources -->  <manifest:file-entry manifest:media-type="application/rdf+xml" manifest:full-path="annotation/myExperiment-wf-765.rdf"/>   <manifest:file-entry manifest:media-type="image/png" manifest:full-path="Thumbnails/thumbnail.png"/>  <manifest:file-entry manifest:media-type="image/svg+xml" manifest:full-path="diagram/workflow/HelloWorld.svg"/> </manifest:manifest> META-INF/manifest.xml META-INF/container.xml     <rootfiles> <container version="1.0“ xmlns="urn:oasis:names:tc:opendocument:xmlns:container">         <!-- <rootfile full-path="workflowBundle.ttl"         <rootfile full-path="workflowBundle.rdf" media-type="application/rdf+xml" />     </rootfiles>          media-type="text/turtle" /> Alternative repr. -->  </container> Unique identifier which can be used as prefix for all relative references in bundle workflowBundle.rdf Suggests main workflow and main profile, but executor could (e.g. by parameter) run a different workflow or profile <rdf:RDF xmlns="http://ns.taverna.org.uk/2010/scufl2#" xmlns:rdf=".." xmlns:rdfs=".." xmlns:xsi=".." xsi:schemaLocation="http://ns.taverna.org.uk/2010/scufl2# http://ns.taverna.org.uk/2010/scufl2/scufl2.xsd .." xsi:type="WorkflowBundleDocument" xml:base="./">           <sameBaseAs rdf:resource= "http://ns.taverna.org.uk/2010/workflowBundle/28f7c..a0ef731/" />                   <name>HelloWorld</name>       <WorkflowBundle rdf:about="">           <workflow>           <mainWorkflow rdf:resource="workflow/HelloWorld/" />             <Workflow rdf:about="workflow/HelloWorld/">               <rdfs:seeAlso rdf:resource="workflow/HelloWorld.rdf" />           <mainProfile rdf:resource="profile/tavernaWorkbench/" />           </workflow>            </Workflow> <!-- plus each nested <Workflow> -->            <Profile rdf:about="profile/tavernaWorkbench/">           <profile>           </profile>             </Profile> <!– plus optional alternative <Profile>-->              <rdfs:seeAlso rdf:resource="profile/tavernaWorkbench.rdf" />       </WorkflowBundle>           <rdfs:seeAlso rdf:resource="annotation/myExperiment-wf-765.rdf" />           <rdfs:seeAlso rdf:resource="annotation/user_annotations.rdf" /> </rdf:RDF> Definition of workflow structure. Nested workflows are separate resources. Additional (non-executable) annotations and metadata workflow/HelloWorld.rdf <rdf:RDF xmlns="http://ns.taverna.org.uk/2010/scufl2#"       xsi:schemaLocation="http://ns.taverna.org.uk/2010/scufl2# http://ns.taverna.org.uk/2010/scufl2/scufl2.xsd ..       xmlns:rdf=".." xmlns:owl=".." xmlns:rdfs=".." xmlns:xsi=".."   xsi:type="WorkflowDocument" xml:base="HelloWorld/"> <workflowIdentifier <name>HelloWorld</name>   <Workflow rdf:about="">   <InputWorkflowPort rdf:about="in/yourName"> <inputWorkflowPort>   rdf:resource="http://ns.taverna.org.uk/2010/workflow/006...c84e2ca/" /> <name>yourName</name> </inputWorkflowPort>   </InputWorkflowPort> <portDepth>0</portDepth> <Processor rdf:about="processor/Hello/"> <processor> <outputWorkflowPort><!-- .. --></outputWorkflowPort> <name>Hello</name>   <OutputProcessorPort rdf:about="processor/Hello/out/greeting"> <outputProcessorPort> <inputProcessorPort><!-- .. --></inputProcessorPort> <name>greeting</name> </OutputProcessorPort> <granularPortDepth>0</granularPortDepth>               <dispatchStack> </outputProcessorPort>                  <rdf:type rdf:resource="http://ns.taverna.org.uk/2010/scufl2/taverna#defaultDispatchStack" />                <DispatchStack rdf:about="processor/wait4me/dispatchStack/">                 </DispatchStack>                <IterationStrategyStack rdf:about="processor/Hello/iterationstrategy/">               <iterationStrategyStack>               </dispatchStack>                 <iterationStrategies rdf:parseType="Collection">                 </IterationStrategyStack>                   </iterationStrategies>                      <CrossProduct> <!-- .. --></CrossProduct>               </iterationStrategyStack>         <datalink>         </processor> <!-- plus other <processor>s -->             </Processor>          <DataLink rdf:about="datalink?from=processor/Hello/out/greeting&to=out/results&mergePosition=0">               <sendTo rdf:resource="out/results" />            <receiveFrom rdf:resource="processor/Hello/out/greeting" />               <mergePosition>0</mergePosition>         <control>         </datalink>  <!-- plus more <datalink>s -->                 </DataLink>          <Blocking rdf:about="control?block=processor/Hello/&untilFinished=processor/wait4me/">            <block                  <untilFinished rdf:resource="processor/wait4me/" />            <block rdf:resource="processor/Hello/" />     </Workflow>         </control>           </Blocking>   </rdf:RDF> Profile gives implementation bindings, alternative profiles can customize execution of workflow steps for different environments (e.g. desktop, server, cloud) profile/tavernaWorkbench.rdf Every workflow part has a URI, allowing deep annotations Relative references can also be made absolute using the sameBaseAs prefix, RDF clients resolving such URIs at http://ns.taverna.org.uk/2010/ workflowBundle/ can be redirected to a generated RDF resource stating what’s obvious from the URI. <rdf:RDF xmlns="http://ns.taverna.org.uk/2010/scufl2#"  xmlns:beanshell="http://ns.taverna.org.uk/2010/activity/beanshell#" xmlns:dc=".."  xmlns:owl=".." xmlns:rdf=".." xmlns:rdfs=".." xmlns:xsi=".."       xsi:schemaLocation="http://ns.taverna.org.uk/2010/scufl2# http://ns.taverna.org.uk/2010/scufl2/scufl2.xsd .."       xsi:type="ProfileDocument" xml:base="tavernaWorkbench/">             <processorBinding rdf:resource="processorbinding/Hello/" />             <name>tavernaWorkbench</name>       <Profile rdf:about="">       </Profile>             <activateConfiguration rdf:resource="configuration/Hello/" />             <name>HelloScript</name>             <rdf:type rdf:resource="http://ns.taverna.org.uk/2010/activity/beanshell" />       <Activity rdf:about="activity/HelloScript/">                   <InputActivityPort rdf:about="activity/HelloScript/in/personName">             <inputActivityPort>                   </InputActivityPort>                         <portDepth>0</portDepth>                         <name>personName</name>       </Activity>             <outputActivityPort><!-- .. --></outputActivityPort>             </inputActivityPort>             <bindActivity rdf:resource="activity/HelloScript/" />                   <name>Hello</name>                  <ProcessorBinding rdf:about="processorbinding/Hello/">             <activityPosition>10</activityPosition>             <bindProcessor rdf:resource="../../workflow/HelloWorld/processor/Hello/" />          <bindInputActivityPort rdf:resource="activity/HelloScript/in/personName" />                   <InputPortBinding rdf:about="processorbinding/Hello/in/name">             <inputPortBinding>          <bindInputProcessorPort rdf:resource="../../workflow/HelloWorld/processor/Hello/in/name" />                   </InputPortBinding>       </ProcessorBinding>             <outputPortBinding><!-- --> </outputPortBinding>                        </inputPortBinding>             <rdf:type rdf:resource="http://ns.taverna.org.uk/2010/activity/beanshell#Config" />       <Configuration rdf:about="configuration/Hello/">             <beanshell:script>hello = "Hello, " + personName;             <configure rdf:resource="activity/HelloScript/" />             <name>Hello</name> </rdf:RDF>       </Configuration> JOptionPane.showMessageDialog(null, hello);</beanshell:script>   Hybrid of RDF/XML and XML schema - If the xsi:type is given, documents can be parsed and generated as regular XML, using xpath, etc. The schema ensures the document can still be parsed as RDF/XML as well. Pure RDF writers can ommit the xsi:type “Cool URI”-style relative references allow parsers to ‘cheat’ and pick out processor “Hello” and port “greeting” - but only if starts with the keyword paths out/ in/ or processor/ Annotations Resources Provenance Inputs Outputs Binaries The bundle (as a ZIP file) can be extended to include arbitrary resources. These could be referenced internally by workflow activities, it could be PDFs, spreadsheets, etc. Wf4Ever: A Research Object (RO) captures enough data and provenance about results to make data and methods reproducible, verifiable, shareable, reusable and repeatable. The Scufl2 workflow run bundle forms such an RO for Taverna workflow results – with the goal of becoming an executable paper. Embedding provenance, input and output data of a workflow run can provide example use and expected outputs. Changing the root-file to point to the provenance makes the bundle a workflow run bundle, where the workflow definition is secondary. Similarly a data bundle represents primarily the data, but by including the workflow and the provenance also embed information on how it was generated. http://www.wf4ever-project.org/ http://www.mygrid.org.uk/ Funded by: EPSRC EP/G026238/1; European Commission’s 7th FWP FP7-ICT-2007-6 270192;  FP7-ICT-2007-6 270137