BioMoby and Taverna Tutorial. Downloading Taverna ► Taverna can be obtained from:

Slides:

Advertisements

Similar presentations

Support.ebsco.com EBSCOhost Digital Archives Viewer Tutorial.

Advertisements

Support.ebsco.com The EBSCOhost Result List Tutorial.

EBSCO Discovery Service

Citavi – Adding References – Articles from EBSCOhost Databases

For suggestions about how Talk Factory can be incorporated into lessons on investigating evaporation please see the associated lesson plans on the Talk.

Compose Workflow. Home page To compose a workflow navigate to the “Workflow Editor” page.

Computer Basics Hit List of Items to Talk About ● What and when to use left, right, middle, double and triple click? What and when to use left, right,

An End-User Perspective On Using NatQuery Extraction From two Files T

An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik University of Manchester.

Chapter 18 - Data sources and datasets 1 Outline How to create a data source How to use a data source How to use Query Builder to build a simple query.

NetAcumen ActiveX Download Instructions

Talk Factory Primary Generic: instructions for use Supporting children‘s whole class discussions in primary school Copyright 2011 Open University.

Guide to Oracle10G1 Introduction To Forms Builder Chapter 5.

Access Tutorial 8 Sharing, Integrating, and Analyzing Data

The basics of the Online Portal

Quick Start Guide: Filters Advanced Learn about: 1.What filters are and their functionality 2.How to create a filter using Samples, Equipment & Labels.

Microsoft Office 2007 Access 2007 Chapter 9 Administering a Database System.

An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik University of Manchester materials by Dr Katy Wolstencroft and Dr Aleksandra.

Tom Oinn,  Download Taverna from  Windows or linux If you are using either.

XP New Perspectives on Introducing Microsoft Office XP Tutorial 1 1 Introducing Microsoft Office XP Tutorial 1.

Installing the SAFARIODBC.EXE For use with Excel May 3, 2002.

® IBM Software Group © 2009 IBM Corporation Rational Publishing Engine RQM Multi Level Report Tutorial David Rennie, IBM Rational Services A/NZ

An Introduction to Designing, Executing and Sharing Workflows with Taverna Nowgen, Next Gen Workshop 17/01/2012.

Creating a Web Site to Gather Data and Conduct Research.

An Introduction to Designing and Executing Workflows with Taverna Katy Wolstencroft University of Manchester.

McGraw-Hill/Irwin The Interactive Computing Series © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Microsoft Excel 2002 Lesson 1 Introduction.

Chapter 17 Creating a Database.

1 NORMA Lab. 7 Generating Reports More Display Options File: NORMA_Lab6.ppt. Author: T. Halpin. Last updated: 2009 June 9.

An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik materials by: Katy Wolstencroft University of Manchester.

Publishing Your Web Pages Ann Emmanuel SIUE Web Administrator

BioMoby and Taverna 2 Tutorial Mark Wilkinson, Edward Kawas, David Withers.

EndNote. What is EndNote? EndNote is referencing software that enables you to create a database of references from your readings.

National Center for Supercomputing Applications University of Illinois at Urbana-Champaign Ergo User Tutorial - Part 3 NCSA, UIUC.

1 Chapter 20 – Data sources and datasets Outline How to create a data source How to use a data source How to use Query Builder to build a simple query.

An Introduction to Designing, Executing and Sharing Workflows with Taverna Katy Wolstencroft myGrid University of Manchester IMPACT/Taverna Hackathon 2011.

The material contained in this document is proprietary to Triniti Corporation (Triniti). This material may not be disclosed, duplicated or otherwise revealed,

Intermacs Form Download Excel Tutorial Pivot Tables, Graphic Tools, Macros By: Devin Koehl.

XP New Perspectives on Microsoft Office FrontPage 2003 Tutorial 7 1 Microsoft Office FrontPage 2003 Tutorial 8 – Integrating a Database with a FrontPage.

National Center for Supercomputing Applications University of Illinois at Urbana-Champaign Ergo User Tutorial - Part 3 NCSA, UIUC.

INTRODUCTION TO ACCESS. OBJECTIVES  Define the terms field, record, table, relational database, primary key, and foreign key  Create a blank database.

Intermacs Form Download Excel Tutorial Pivot Tables, Graphic Tools, Macros By: Devin Koehl.

Access Module Implementing a Database with Microsoft Access A Great Module on Your CD.

ACCESS PROJECT TWO. PROJECT ONE In project one you: -Created Tables -Created Forms -Created Reports In this project you will learn about queries. Databases.

Designing, Executing and Sharing Workflows with Taverna 2.2 Katy Wolstencroft myGrid University of Manchester.

Advanced Taverna Aleksandra Pawlik University of Manchester materials by Katy Wolstencroft, Aleksandra Pawlik, Alan Williams

Taverna allows you to automatically iterate through large data sets. This section introduces you to some of the more advanced configuration options for.

Exploring Taverna 2 Katy Wolstencroft myGrid University of Manchester.

An Introduction to Designing and Executing Workflows with Taverna Part 2 – Importing and exporting data Norman Morrison University of Manchester Credits:

Data and tools on the Web have been exposed in a RESTful manner. Taverna provides a custom processor for accessing such services.

Product Training Program

The Business Source Databases Advanced Searching

Using the Advanced Search Guided Style Find Fields on

An Introduction to Designing and Executing Workflows with Taverna

T_C_N_L_G_ E D I D I E O Y O H I E B J I R E A A W.

String several geoprocessing processes

Using the Advanced Search Guided Style Find Fields on

Physician Tips and Tricks

Access Tutorial 8 Sharing, Integrating, and Analyzing Data

NORMA Lab. 5 Duplicating Object Type and Predicate Shapes

RPM: Basic plan data entry process A step-by-step guide for Plan Leads

EBSCOhost Digital Archives Viewer

Rational Publishing Engine RQM Multi Level Report Tutorial

REST Services Data and tools on the Web have been exposed in both WSDL and REST. Taverna provides a custom processor for accessing REST services Peter.

Tutorial 8 Sharing, Integrating, and Analyzing Data

An Introduction to Designing and Executing Workflows with Taverna

Presentation transcript:

BioMoby and Taverna Tutorial

Downloading Taverna ► Taverna can be obtained from:

► Once at the site, click on Download

► Download the version appropriate for your operating system.

Running ► For a more comprehensive guide on Running and using Taverna, please refer to

► Assuming that you have downloaded and unzipped Taverna, you can run it by double-clicking on runme.bat (windows) or executing runme.sh (Unix/Linux/OS X) ► You may need to: $ chmod u+r runme.sh $ chmod u+r runme.sh

► Taverna’s splash screen

► Once Taverna has loaded, you will see 3 windows:  Advanced Model Explorer  Workflow Diagram  Available Services

► The Advanced model explorer is Taverna’s primary editor and allows you to load, save and edit any property of a workflow.

► Workflow diagram contains a read only graphical representation of your workflow.

► The Available services window lists all of the services available to a workflow designer.

► Under the node …’ Moby services and Moby data types are represented. ► The Object ontology is available as children of MOBY Objects node ► Services are sorted by service provider authority

► If you wish to use registries other than the default one, you can add a new Moby ‘Scavenger’ by choosing to ‘Add new Biomoby scavenger…’

► Enter the registry’s location and click okay.

Creating Workflows

► We will start by adding the Object ontology node Object to our workflow.

► The Advanced model explorer now shows that we have a processor called Object  Object has 3 input ports: id, namespace and article name  Object has 1 output port: mobyData ► The Workflow diagram illustrates our processor

► We can discover services that consume our data type, context click on ‘Object’ and choose ‘Moby Object Details’

► A window will pop up that tells you what services Object feeds into and is produced by

► Expanding the Feeds into node results in a list of service provider authorities ► Expanding an authority, for example, bioinfo.icapture.ubc.ca, reveals a list of services

► We will choose to add the service called ‘MOBYSHoundGetGenBankFasta’ to our workflow.

► A look at the state of our current workflow.

► And graphically. ► The service consumes Object, with article name identifier, and produces FASTA, with article name fasta.

► To discover more services, context click on the service that outputs the data type that you would like to discover consuming services for and choose Moby Service Details.

► The resultant window displays the services inputs and outputs. ► There are also tool tips that show up when your mouse hovers over any particular input or output that tells you what namespaces the data type is valid in

► Context clicking on an output reveals a menu with 3 options.  A brief search for services that consume our datatype  A semantic search for services that consume our datatype  Adding a parser to the workflow that understands our datatype

► The result of choosing to add a parser for FASTA to our workflow. ► The parser allows us to extract:  The namespace and id from FASTA  The namespace and id from the child String  The textual content from the child String

► The result of choosing to conduct a brief search for services that consume FASTA

► We will add the service getDragonBlastText to our workflow by choosing ‘Add service -…’ from the context menu

► The current state of our workflow shown graphically.

► A more complex view of our workflow

► Finding services that consume NCBI_BLAST_Text starts by viewing the details of the service ‘getDragonBlastText’

► Conduct a brief search

► Add the service ‘parseBlastText’ to our workflow

► Our current workflow

► Workflow inputs are added by context clicking on Workflow inputs in the Advanced model explorer and choosing ‘Create New Input…’

► The result from adding 2 inputs:  Id  namespace

► The workflow input id will be connected to Object’s input port ‘id’

► Workflow after connecting the workflow input ‘id’

► The workflow input namespace will connect to Object’s input port ‘namespace’

► Workflow after connection the workflow inputs.

► Workflow outputs are added by context clicking on Workflow outputs in the Advanced model explorer and choosing ‘Create New Output…’

► The result from adding 2 workflow outputs:  moby_blast_ids  fasta_out

► The output moby_blast_ids will be connected to parseBlastText’s output port Object(Collection –’hit_ids’)

► The output fasta_out will be connected to Parse_Moby_Data_FASTA’s output port fasta_’content’

► To run the workflow, click on ‘Tools and Workflow Invocation’ ► Choose ‘Run workflow’

► A prompt to add values to our 2 workflow inputs

► To add a value to the input ‘id’ click on id from the left pane and choose ‘New Input’

► Enter as the id

► Choose namespace from the left and click on ‘New Input’

► Enter NCBI_gi as the value for namespace ► Once you are done, click on ‘Run Workflow’

► Our workflow in action

► Once the workflow is complete, we can examine the results of our workflow.

► A detailed report is available outlining what happened when and in what order.

► We can examine the intermediate inputs and output, as well as visualize our workflow.

► If we choose the Graph tab, our workflow is illustrated.

► Intermediate inputs allow us to examine what a service has accepted as input

► Similarly, Intermediate outputs allows us to examine the output from any particular service.

► Without the parser, FASTA is represented as a Moby message, fully enclosed in its wrapper. ► Non-moby services do not expect this kind of message

► Non-moby services expect the just the sequence and using the Parse_Moby_Data processor, we can extract just that

► Moby services can interact with the other services in Taverna. ► Let’s add a Soaplab service.

► We will choose a nucleic_restriction soaplab service called ‘restrict’

► Choose the restrict service and add it to the workflow.

► We will connect the output port fasta_’content’ from the service Parse_Moby_Dat a_FASTA to the input port ‘sequence_direct _data’ from the service restrict

► The result of our actions so far. ► We will need to add another workflow output to capture the output of restrict.

► Create an output called restrict_out

► Connect the output port ‘outfile’ from the service restrict to the workflow output restrict_out

► Once the connections have been made, run the workflow again using the same inputs.

► The workflow on the left has some extra services added to it.  FASTA2HighestGenericSequenceObject from the authority bioinfo.icapture.ubc.ca  runRepeatMasker from the authority genome.imim.es  A Moby parser for the output DNASequence from runRepeatMasker.  A workflow output Masked_Sequence ► Add them to your workflow

► The service runRepeatMasker is configurable, i.e. it consumes Secondary parameters. ► To edit these parameters, context click on the service and choose ‘Configure Moby Service’

► The name of the parameter is on the left and the value is on the right. ► Clicking on the Value will bring up a drop down menu, an input text field, or any other appropriate field depending on the parameter.

► The parameter species contains an enumerated list of possibilities. ► Select human. ► When you have made your selection, you may close the window.

► Let’s run the workflow

► We will run our workflow with a list  Click on id in the left pane and then click on New Input twice

► Enter and as the ids ► Enter NCBI_gi as the value for namespace ► Our workflow will now run using each id with the single namespace

► Notice how the workflow is running with iterations. This is happening because the Enacter is performing a cross- product on the input

► You can still view intermediate inputs and outputs. ► Using the queryIDs, you can track each invocation of a moby service through the whole workflow

► Imagine now that you want to run the workflow using a FASTA sequence that you input yourself (without the gi identifier) ► To do this, context click on getDragonBlastText and choose Moby Service Details  Expand the Inputs node and context click on FASTA(‘sequence’)  Choose Add Datatype – FASTA(‘sequence’) to the workflow ► A FASTA datatype will be added to the workflow and the appropriate links created

► Notice the datatype FASTA on the left of the workflow  Since the datatype FASTA hasa String, a String was also added to our workflow and the appropriate connection was made ► We will now have to add another workflow input and connect it to the String component of FASTA.

► A workflow input ‘sequence’ was added to the workflow and a connection was made from the workflow input to the input port ‘value’ of String. ► We also removed the link between MOBYSHoundGetGenBankFasta and getDragonBlastText by context clicking on the link in the Advanced model explorers’ Data links and choosing to remove the link ► Now when we choose to run our workflow, we will also have the chance to enter a FASTA sequence

► Go ahead an enter any FASTA sequence as the input to the workflow input ‘sequence’ ► Run the workflow

► Any results can be saved by simply choosing to Save to disk  You will be prompted to enter a directory to save the results.  Each workflow output will be saved in a folder with the same name as a workflow output and the contents of the folder will be the results ► You can also choose Excel, which produces an Excel worksheet with columns representing the workflow outputs and with rows that represent the actual data.