Taverna allows you to automatically iterate through large data sets. This section introduces you to some of the more advanced configuration options for.

Slides:



Advertisements
Similar presentations
My EBSCOhost Tutorial Tutorial support.ebsco.com.
Advertisements

Grid Wizard Enterprise Basic Tutorial Using Web Control Panel.
Welcome to the Online Employment Applicant Tutorial Click here for next screen.
Welcome to WebCRD.
Lecture 10 Sharing Resources. Basics of File Sharing The core component of any server is its ability to share files. In fact, the Server service in all.
MY NCBI (module 4.5).
Manage Transfer Students. C3-TF Manage Transfer Students by School Student Registrar Description: –This function allows the School Student Registrar to;
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik University of Manchester.
Configuring Windows Vista Security Lesson 8. Skills Matrix Technology SkillObjective DomainObjective # Setting Up Users Configure and troubleshoot parental.
Creating a UAA VPN Connection For Your Computer To Facilitate Polycom PVX – For Windows XP Last Modified On 10/25/2010 University of Alaska Anchorage,
WebCT: CE-6 Group Manager. Working with Groups: In WebCT Ce-6 you can: –create custom groups. –create multiple groups. –create groups with sign-up sheets.
BioMoby and Taverna Tutorial. Downloading Taverna ► Taverna can be obtained from:
1 Chapter Overview Introduction to Windows XP Professional Printing Setting Up Network Printers Connecting to Network Printers Configuring Network Printers.
ELecta Live Update What’s new in Version 4.8 What’s New in V. 4.8 February
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik University of Manchester materials by Dr Katy Wolstencroft and Dr Aleksandra.
by Chris Brown under Prof. Susan Rodger Duke University June 2012
® IBM Software Group © 2009 IBM Corporation Rational Publishing Engine RQM Multi Level Report Tutorial David Rennie, IBM Rational Services A/NZ
SchoolDude ArbiterGame Integration. FSDirect – Locations FSDirect’s Locations are the same as ArbiterGame’s Sites You can add a new location to your list.
An Introduction to Designing, Executing and Sharing Workflows with Taverna Nowgen, Next Gen Workshop 17/01/2012.
Moodle (Course Management Systems). Assignments 1 Assignments are a refreshingly simple method for collecting student work. They are a simple and flexible.
Copyright © 2007, Oracle. All rights reserved. Managing Concurrent Requests.
An Introduction to Designing and Executing Workflows with Taverna Katy Wolstencroft University of Manchester.
Support.ebsco.com My EBSCOhost Tutorial Tutorial.
Introduction to Taverna, an environment For designing and executing workflows Franck Tanoh University of Manchester.
Windows Tutorial Common Objects ACOS: 1, 4. Using the Taskbar 1. Using the taskbar, you can switch between open programs and between open documents within.
SADI and Taverna 2 Tutorial David Withers. Preamble The Taverna 2 platform is constantly changing; while the look and feel of the workbench may change,
GroupWise 6.5 Junk Mail Handling July 28, Configuring Junk Mail Handling Junk Mail Handling enables you to have actions taken automatically on any.
CSC350: Learning Management Systems COMSATS Institute of Information Technology (Virtual Campus)
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik materials by: Katy Wolstencroft University of Manchester.
BioMoby and Taverna 2 Tutorial Mark Wilkinson, Edward Kawas, David Withers.
Sequence Diagrams And Collaboration Diagrams HungNM.
11/25/2015Slide 1 Scripts are short programs that repeat sequences of SPSS commands. SPSS includes a computer language called Sax Basic for the creation.
An Introduction to Designing, Executing and Sharing Workflows with Taverna Katy Wolstencroft myGrid University of Manchester IMPACT/Taverna Hackathon 2011.
Using the AccuGlobe Software with the IndianaMap Using the AccuGlobe Software.
Basic & Advanced Reporting in TIMSNT ** Part Three **
Introduction to Taverna Online and Interaction service Aleksandra Pawlik University of Manchester.
Envision Tutorial Horner APG, LLC July 18, Introduction The Cscape Remote Viewer allows remote interaction with the user interface on Horner OCS.
Contract Invoice Guide
TEAM FOUNDATION VERSION CONTROL AN OVERVIEW AND WALKTHROUGH By: Michael Mallar.
IllinoisJobLink.com Training Video Creating a Resume Copyright © 2015, America’s Job Link Alliance–Technical Support (AJLA–TS) All rights reserved. This.
Access Queries and Forms. Adding a New Field  To insert a field after you have saved your table, open Access, and open the table  It is easier to add.
Sorting Contacts with Categories Lunch and Learn.
Splunk Enterprise Instructor: Summer Partain 3 Day Course.
Active-HDL Server Farm Course 11. All materials updated on: September 30, 2004 Outline 1.Introduction 2.Advantages 3.Requirements 4.Installation 5.Architecture.
An Introduction to Designing, Executing and Sharing Workflows with Taverna Anja Le Blanc Stian Soiland-Reyes Alan Willams University of Manchester.
Designing, Executing and Sharing Workflows with Taverna 2.2 Katy Wolstencroft myGrid University of Manchester.
Exploring Taverna engine Aleksandra Pawlik materials by Katy Wolstencroft University of Manchester.
© 2010 Cisco and/or its affiliates. All rights reserved. Cisco Confidential 1 © 2010 Cisco and/or its affiliates. All rights reserved. Cisco Confidential.
Advanced Taverna Aleksandra Pawlik University of Manchester materials by Katy Wolstencroft, Aleksandra Pawlik, Alan Williams
Getting data out of XML These exercises provide an overview of how to use the native Taverna XPath services to get data out of XML.
An Introduction to Running, Reusing and Sharing Workflows with Taverna – part 2 Aleksandra Pawlik materials by Katy Wolstencroft University of Manchester.
Exploring Taverna 2 Katy Wolstencroft myGrid University of Manchester.
Aleksandra Pawlik University of Manchester. Something that can be put into a workflow Well described - what the component does Behaves “well” - conforms.
Aleksandra Pawlik Alan Williams University of Manchester.
An Introduction to Designing, Executing and Sharing Workflows with Taverna BioVel Workshop 2011.
These exercises highlight the services that do not perform biological functions, but are vital for running life science workflows.
Designing, Executing and Sharing Workflows with Taverna 2.4 Different Service Types Katy Wolstencroft Helen Hulme myGrid University of Manchester.
Submitting an invoice with the Tungsten Portal
Designing and Sharing Taverna Workflows: Exploring Taverna 2.1 Beta
How to develop a presentation portfolio in Foliotek
An Introduction to Designing and Executing Workflows with Taverna
Taverna Tutorial exercise 2: REST services from BioCatalogue
An Introduction to Designing, Executing and Sharing Workflows with Taverna and myExperiment Katy Wolstencroft University of Manchester.
Shim (Helper) Services and Beanshell Services
Aleksandra Pawlik materials by Katy Wolstencroft
Xpath service Getting data out of XML Aleksandra Pawlik materials by Katy Wolstencroft University of Manchester 1.
An Introduction to Designing and Executing Workflows with Taverna
Presentation transcript:

Taverna allows you to automatically iterate through large data sets. This section introduces you to some of the more advanced configuration options for the Taverna engine, such as, setting iteration strategies and adding loops to your workflows

As you have already seen, Taverna can automatically iterate over sets of data. When 2 sets of iterated data are combined, however, Taverna needs extra information about how they should be combined. You can have: A cross product – combining every item from list 1 with every item from list 2 - all against all A dot product – only combining item 1 from list 1 with item 1 from list 2, and so on – line against line

Find and load the workflow ‘Demonstration of configurable iteration’ from myExperiment  Read the workflow metadata to find out what the workflow does (by looking at the ‘Details’)  Select the ‘ColourAnimals’ service and select the ‘Details’ in the workflow explorer and ‘configure list handling’  Click on ‘dot product’ in the pop-up window. This allows you to switch to cross product

 Run the workflow twice – once with ‘dot product’ and once with ‘cross product’.  Save the first results so you can compare them – what is the difference? What does it mean to specify dot or cross product? NOTE: The iteration strategies will become very important later in the course. Setting cross product instead of dot when you have 2000 data items can cause large and unnecessary increases in computation

 From myExperiment, find the workflow ‘InterproScan without Looping’ by Katy Wolstencroft  InterproScan analyses a given protein sequence (or set of sequences) for functional motifs and domains  This workflow is asynchronous. This means that when you submit data to the ‘runInterproScan’ service, it will return a jobID and place your job in a queue (this is very useful if your job will take a long time!)  The ‘Status’ nested workflow will query your job ID to find out if it is complete

The default behaviour in a workflow is to call each service only once for each item of data – so what if your job has not finished when ‘Status’ workflow asks?  Run the workflow, using the default protein sequence and your own address (the EBI requires an academic address for it to run)  Almost every time, the workflow will fail because the results have not been returned before the workflow reaches the ‘get_results’ service

This is where looping is useful. Taverna can keep running the ‘status’ service until it reports that the job is done.  Select the ‘Status’ nested workflow and right-click. Select ‘configure running’ from the drop-down list. (you could also just click on ‘details’ in the workflow explorer).  Select ‘advanced’ and click on ‘add looping’  Use the drop-down boxes in the looping window to set ‘get_status_output_status’ ‘is_not_equal_to’ RUNNING

 Save the workflow and run it again  This time, the workflow will run until the ‘Status’ nested workflow reports that it is either DONE, or it has an ERROR.  You will see results for ‘TextResults’, but you will still get an error for ‘Graphical_results’. This is because there is one more configuration to change – we also need ‘Control Links’

 A control link specifies that there is a dependency of one service on another even though there is no data flowing between them.  A control link is a line with a white circle at the end that connects two services (see the link between the ‘Status’ nested workflow and ‘get_Result_input’)

 We will add control links to the other output type  Right-click on getResult_graphical_input and select ‘Run after’ from the drop down menu.  Set it to ‘Run after’ -> ‘Status’  Save and run the workflow  Now you will see each result returned

 Web services can sometimes fail due to network connectivity  If you are iterating over lots of data items, you can guard against these temporary interruptions by adding retries to your workflow  Upload the ‘Retry-Example’ workflow from the myExperiment Next Generation Sequencing Tutorial group. This workflow is designed to fail sometimes.  Run the workflow as it is and count the number of failed iterations

 Now, select the ‘sometimes_fails’ service and select the ‘details’ tab in the workflow explorer panel  Click on ‘advanced’ and ‘configure’ for retries  In the pop-up box, change it so that it retries each service iteration 2 times  Run the workflow again – how many failures do you get this time?  Change the workflow to retry 5 times – does it work every time now?

 If Taverna is iterating over lots of independent input data, you can improve the efficiency of the workflow by running those iterated jobs in parallel  Run the Retry workflow again and time how long it takes  Go back to the design window, right-click on the ‘sometimes_fails’ service, and select ‘configure running’  This time select ‘Parallel jobs’ and change the maximum number to 20  Run the workflow again  Does it run faster?

 Setting parallel jobs makes your workflows run faster, but you should be careful if you are using remote services. Sometimes they have policies for the number of concurrent jobs individuals should run (e.g. The EBI ask that you do not submit more than 25 at once).  If you exceed this number, your service invocations may be blocked by the provider. In extreme cases, the provider may block your whole institution!