These exercises highlight the services that do not perform biological functions, but are vital for running life science workflows.

Slides:



Advertisements
Similar presentations
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik University of Manchester.
Advertisements

European Life Sciences Infrastructure for Biological Information Rafael C Jimenez ELIXIR CTO EMBL-EBI workshop networks and pathways.
Fission Yeast Computing Workshop -1- Exercise 5: Looking for overreprsented GO terms in a gene set using Onto-Express GO annotations can be used to obtain.
BioMoby and Taverna Tutorial. Downloading Taverna ► Taverna can be obtained from:
Records and Information Management IT - Enterprise Content Management SPIDR II Global Features Reference Guide April 2013.
This tutorial is a “suggested workflow” to help with sending documents to Referring Providers via Starpanel. The Provider Communication Wizard is an application.
Web Design HTML, Frontpage, DreamWeaver μέρος β ΠΡΥ019 - Πληροφορική Δρ.Βάσος Βασιλείου.
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik University of Manchester materials by Dr Katy Wolstencroft and Dr Aleksandra.
Tom Oinn,  Download Taverna from  Windows or linux If you are using either.
For Version 6.0 and later Lattice3D Reporter Tutorial For Version 6.0 and later LATTICE TECHNOLOGY, INC.
An Introduction to Designing, Executing and Sharing Workflows with Taverna Nowgen, Next Gen Workshop 17/01/2012.
For Version 3.1 and later Lattice3D Reporter Tutorial For Version 3.1 and later LATTICE TECHNOLOGY, INC.
An Introduction to Designing and Executing Workflows with Taverna Katy Wolstencroft University of Manchester.
Introduction to Taverna, an environment For designing and executing workflows Franck Tanoh University of Manchester.
Galaxy for Bioinformatics Analysis An Introduction TCD Bioinformatics Support Team Fiona Roche, PhD Date: 31/08/15.
Oracle Data Integrator Workflow Management: The Packages.
Taverna Workflows myExperiment Paul Fisher University of Manchester
Execute Workflow. Home page To execute a workflow navigate to My Workflows Page.
SADI and Taverna 2 Tutorial David Withers. Preamble The Taverna 2 platform is constantly changing; while the look and feel of the workbench may change,
For Version 4.0 and later Lattice3D Reporter Tutorial For Version 4.0 and later LATTICE TECHNOLOGY, INC.
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik materials by: Katy Wolstencroft University of Manchester.
1 NORMA Lab. 5 Duplicating Object Type and Predicate Shapes Finding Displayed Shapes Using the Diagram Spy Using Multiple Windows Using the Context Window.
BioMoby and Taverna 2 Tutorial Mark Wilkinson, Edward Kawas, David Withers.
Kepler Deana Pennington LTER Network Office. Download Kepler Kepler website: website:
An Introduction to Designing, Executing and Sharing Workflows with Taverna Katy Wolstencroft myGrid University of Manchester IMPACT/Taverna Hackathon 2011.
Performing statistical analyses using the Rshell processor Original material by Peter Li, University of Birmingham, UK Adapted by Norman.
Introduction to Taverna Online and Interaction service Aleksandra Pawlik University of Manchester.
Kepler Exercise Deana Pennington University of New Mexico December 10, 2004.
Copyright OpenHelix. No use or reproduction without express written consent1.
Prepared by the Academic Faculty Members of IT. Tables Creating Tables. Merging Cells. Splitting Cells. Sorting Tables. Performing Calculations.
XP New Perspectives on Macromedia Dreamweaver MX 2004 Tutorial 5 1 Adding Shared Site Elements.
An Introduction to Designing, Executing and Sharing Workflows with Taverna Anja Le Blanc Stian Soiland-Reyes Alan Willams University of Manchester.
Designing, Executing and Sharing Workflows with Taverna 2.2 Katy Wolstencroft myGrid University of Manchester.
An Introduction to Taverna Workflows Dr K Wolstencroft University of Manchester.
Exploring Taverna engine Aleksandra Pawlik materials by Katy Wolstencroft University of Manchester.
Data Exchange and Sharing using Taverna Workflows and myExperiment Katy Wolstencroft myGrid University of Manchester.
An Introduction to Taverna Workflows Paul Fisher, University of Manchester
Advanced Taverna Aleksandra Pawlik University of Manchester materials by Katy Wolstencroft, Aleksandra Pawlik, Alan Williams
Getting data out of XML These exercises provide an overview of how to use the native Taverna XPath services to get data out of XML.
An Introduction to Running, Reusing and Sharing Workflows with Taverna – part 2 Aleksandra Pawlik materials by Katy Wolstencroft University of Manchester.
Taverna allows you to automatically iterate through large data sets. This section introduces you to some of the more advanced configuration options for.
Exploring Taverna 2 Katy Wolstencroft myGrid University of Manchester.
Aleksandra Pawlik University of Manchester. Something that can be put into a workflow Well described - what the component does Behaves “well” - conforms.
Aleksandra Pawlik Alan Williams University of Manchester.
An Introduction to Designing, Executing and Sharing Workflows with Taverna BioVel Workshop 2011.
An Introduction to Designing and Executing Workflows with Taverna Part 2 – Importing and exporting data Norman Morrison University of Manchester Credits:
Data and tools on the Web have been exposed in a RESTful manner. Taverna provides a custom processor for accessing such services.
Designing, Executing and Sharing Workflows with Taverna 2.4 Different Service Types Katy Wolstencroft Helen Hulme myGrid University of Manchester.
Scan, Import, and Automatically file documents to Box Introduction
Product Training Program
VAB™ for INFINITY Tutorial
Designing and Sharing Taverna Workflows: Exploring Taverna 2.1 Beta
Performing statistical analyses using the Rshell processor
LMEvents SharePoint Portal How-to Guide
How to automatise the grid production - using model builder in ArcGIS
An Introduction to Designing and Executing Workflows with Taverna
Holdings Management Overview
Mail Merge And Macros in MS WORD
Groups.
CCA Skill Certification
Other Features – Filter Options
NORMA Lab. 5 Duplicating Object Type and Predicate Shapes
Taverna Tutorial exercise 2: REST services from BioCatalogue
An Introduction to Designing, Executing and Sharing Workflows with Taverna and myExperiment Katy Wolstencroft University of Manchester.
Shim (Helper) Services and Beanshell Services
Aleksandra Pawlik materials by Katy Wolstencroft
Xpath service Getting data out of XML Aleksandra Pawlik materials by Katy Wolstencroft University of Manchester 1.
REST Services Data and tools on the Web have been exposed in both WSDL and REST. Taverna provides a custom processor for accessing REST services Peter.
An Introduction to Designing and Executing Workflows with Taverna
3.3 – Invoke Workflow File Exercise 3.2: Extract as Workflow
Presentation transcript:

These exercises highlight the services that do not perform biological functions, but are vital for running life science workflows

 A shim is a service that doesn’t perform an experimental function, but acts as a connector, or glue, when 2 experimental services have incompatible outputs and inputs  A shim can be any type of service – WSDL, soaplab etc. Many are simple Beanshell scripts  We have already used many shims in these exercises  (for the origin of the word)

 Find the workflow ‘BiomartandEmbossDisease’ on myExperiment (workflow ID: 2213)  Run the workflow and work out what it does  Work out which services are shims  What do the shims do?

 One of the most common uses of shims is to transform your input data into a format the analysis services will understand  In this exercise, we will find a set of protein sequences from Uniprot and perform a multiple sequence alignment with them  We will use shims to process a list of Uniprot identifiers and to combine retrieved services into a list for multiple alignment

 In the services panel, find ‘get Protein FASTA’ and import it into a new workflow  Connect an input and output and run the workflow with the input ‘Q9RZ30’ (a Uniprot ID)  Look at the results – you should see a single protein sequence  Go back to the design window and right-click on the workflow input and select ‘edit workflow input port’  Change the port depth from single to ‘List of depth 1’  Rerun the workflow. This time, enter ‘Q9RZ30’ and ‘Q5WJS9’ as a list

 The results now contain 2 protein sequences. This is ok for one or 2, but it is very inefficient for a long list of protein identifiers  Go back to the design window and change the input port back to a single value (you can use Taverna’s undo button to do this if you like).  Find the service ‘Split string into string list by regular expression’ and import it into the workflow  Delete the link between the input and ‘get Protein Fasta’ and reconnect the workflow so that the input flows to ‘split_string (string)’ and then ‘get Protein Fasta’

 Right-click on the regex input to ‘split_string...’ and enter the constant value ‘\n’  This is the regular expression that will split an input string every time there is a new line  Run the workflow again, but this time, download and use the ‘Shims Protein Data File’ on myExperiment as input:  How many sequences do you get this time?  Now we will align those sequences using the ‘EMBL-EBI ClustalW2_SOAP’ workflow from myExperiment (ID:1768)  Upload it and see what input it requires

 Clustalw requires a single file containing all the sequences. Currently, we have a list. Therefore, we need another shim  Go back to your ‘Get Protein FASTA’ workflow and find and import the service ‘Merge string list to string’  Add this service after ‘Get Protein FASTA’ and run the workflow again  This is another shim. It changes the format of the service output  Now import the ‘EMBL-EBI ClustalW2_SOAP’ workflow as a nested workflow and run the whole thing. This time you will get a protein sequence alignment. (Note: if you can’t remember how to import workflows, refer to the intro tutorial from yesterday)

Many shims are actually Beanshell scripts. Beanshell scripts allow you to add simple data transformation steps into your workflow in an easy way. The next few exercises will give you a brief introduction to writing Beanshells and give you some examples of when they are commonly used.

 Create a new workflow by selecting ‘file’ and ‘New Workflow’  Add a new Beanshell from the “service template” section of the service panel. A configure window will pop-up  Create 2 input ports named: myName and mySurname after selecting the ‘Ports’ tab  Cretate 1 output port named: myFullname

 Select the script tab and Paste the following script myFullname = myName +"\t" + mySurname  Create 2 workflow inputs and 1 workflow output and connect them to the configured beanshell service.  Run the workflow  You should get your full name printed in the output. This is a very simple example of using helper services to format results from your workflow

 Load the workflow ‘Disease Genes and GO’ from myExperiment (workflow ID 944)  This workflow produces a list of all genes associated with OMIM diseases on Chrom Y and finds their GO descriptions and IDs  Currently, this workflow produces different outputs for Gene Ids, gene ontology descriptions and identifiers.  Write a Beanshell to combine the gene IDs, gene ontology terms and descriptions into one output file HINT: when you run the workflow, you need to think about iteration and how your different IDs and terms combine

 Service that produce and consume XML naturally require XML documents as input. These are called Complex Type services  Producing XML by hand can be difficult  Taverna’s XML splitters help you construct or navigate the XML required or produced  XML splitters can either assemble XML elements or separate elements out of the XML so that users only have to add strings as inputs or get strings as outputs, and Taverna populates the XML or extracts from the XML behind the scenes.

 Load the service ‘GetWeather’ into the workbench. Right-click and select add xml input splitter  Expand the ports – now you will see the XML splitter box has divided the one input from GetWeather into 2 inputs – CityName and CountryName  Add two string constants – France and Paris  Add an XML splitter for the output  Save and run the workflow