The Taverna Software Suite Prof Carole Goble FREng FBCS CITP The University of Manchester, UK

The Taverna Suite of Tools
Diagram labels: Client User Interfaces; Workflow Repository; Service Catalogue; Third Party Tools; Web Portals / Gateways; Activity and Service Plug-in Manager; Workflow Provenance; Workflow Server; Secure Service Access (OAuth 1 & 2, username/password, certificates); Workflow Engine; Virtual Machine; Programming APIs; Command Line; Player; Workflow Components; Workbench; Taverna Lite; Interaction Server

VPH-Share Project, Models of Human Physiology
Eagle Genomics & NHS, Next Generation Sequencing based Patient Diagnostics
Astronomy & HelioPhysics
Library Doc Preservation
Systems Biology of Micro-Organisms
OpenTox Project, Chemistry Development Kit, Drug Toxicity
BioDiversity, Invasive Species Modelling
Metagenomics

5820 members, 304 groups, 2415 workflows, 604 files and 229 packs (research objects)

biovel.myexperiment.org

Taverna Engine
Execution
Scufl2 language
Functional dataflow, simple control flows, implicit iteration
Linking services and tools
Data movement, monitoring, staging, reference
“In Workflow Programming”: Beanshell scripting
Provenance collection: W3C PROV(+) format
Plug-in Framework
–Infrastructures: Grid, HPC, Web Services (SOAP, REST)
–Domain: CDK, BioMart, VOTable, SADI
–Common Tools: Excel Spreadsheets, Google Refine, R
OAuth security plug-in
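The "implicit iteration" named on the engine slide can be illustrated outside Taverna. The Python sketch below is not Taverna code: `depth` and `implicit_iterate` are hypothetical helpers, and it covers only the single-input case, assuming a service declares the list depth it expects and that any deeper input is mapped element by element.

```python
from typing import Any, Callable


def depth(value: Any) -> int:
    """Nesting depth of a value: 0 for a single item, 1 for a flat list, and so on."""
    if isinstance(value, list):
        return 1 + max((depth(item) for item in value), default=0)
    return 0


def implicit_iterate(service: Callable[[Any], Any], value: Any, expected_depth: int = 0) -> Any:
    """Call `service` directly when the input is no deeper than the service expects;
    otherwise iterate over the list and keep the list structure in the result."""
    if depth(value) <= expected_depth:
        return service(value)
    return [implicit_iterate(service, item, expected_depth) for item in value]


# A service written for one string is applied to every string in a nested list.
upper = implicit_iterate(str.upper, ["acgt", ["ttga", "ccat"]])

# A service that expects a flat list (depth 1) is applied once per inner list.
lengths = implicit_iterate(len, [["a", "b"], ["c"]], expected_depth=1)
```

The workflow author wires a list into a single-value service and the loop is supplied by the engine. Combining several list inputs (for example by pairing or crossing them) is not covered by this sketch.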

Taverna Pro-Workbench
Desktop application GUI
Intermediate results views
Gateway to BioCatalogue and myExperiment
Plug-in Framework

Components
Workflow blocks made of a workflow
Well described
Well behaved
Well looked after
Agreed fail
Agreed formats in and out
Agreed provenance
Deposited in myExperiment
Grouped into families

Desktop Client
Data Centric Interface
BIFI (Beautiful Interfaces for Inputs)
Taverna Workbench Plug-in, GUI definition language

Data services
Vanilla Taverna
–Domain data type neutral
AstroTaverna plug-in
–IVOA data services
–VOTables
PyWPS plug-in
–Exposes OGC-compliant Web Processing Services that can handle large data

Taverna Server
Multiple clients, Multi-user
SOAP and REST API
Diagram labels: Server Host; Taverna Server “Client” (shown twice); Taverna Server Front End; TavServ Back End Service
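As a rough illustration of how a client might drive the REST API mentioned on this slide, the Python sketch below builds (but does not send) two HTTP requests: one to submit a workflow and one to start the resulting run. The slides do not specify the API, so the `/runs` resource, the `/status` sub-resource, the `application/vnd.taverna.t2flow+xml` media type and the `Operating` state are assumptions to check against the documentation of the deployed Taverna Server. The host URL and helper names are hypothetical.

```python
import urllib.request

# Assumed media type for a Taverna 2 workflow document; verify for your server version.
T2FLOW_TYPE = "application/vnd.taverna.t2flow+xml"


def build_submit_request(base_url: str, workflow_xml: bytes) -> urllib.request.Request:
    """Build a POST that would create a new run from a workflow document.
    Assumes runs are created under `<base_url>/runs`."""
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/runs",
        data=workflow_xml,
        method="POST",
        headers={"Content-Type": T2FLOW_TYPE},
    )


def build_start_request(run_url: str) -> urllib.request.Request:
    """Build a PUT that would ask the server to start a run.
    Assumes the run exposes a plain-text `status` resource accepting `Operating`."""
    return urllib.request.Request(
        url=run_url.rstrip("/") + "/status",
        data=b"Operating",
        method="PUT",
        headers={"Content-Type": "text/plain"},
    )


# Hypothetical server address; nothing is sent over the network here.
submit = build_submit_request("http://localhost:8080/taverna-server/rest", b"<workflow/>")
start = build_start_request(submit.full_url + "/example-run-id")
```

Passing either request to `urllib.request.urlopen` would send it. A real client would also need the authentication that the multi-user server requires, which this sketch leaves out.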

Taverna Server Family
Taverna Server
–Multiple clients, Multi-user
–SOAP and REST API
Taverna Server Amazon Machine Image
–Bundled R server, Atom feed server
–Multiple instances in Amazon Cloud and as required, for multiple users/uses and different security scenarios
Taverna Virtual Machine
Taverna Command Line
Bundled Servers

Calling Grid/Cloud Services
Expose services/tools as WSDL/REST services
–HELIO: Fixed host name
–VPH-Share: Services running on dynamically started instances
–SZTAKI Desktop Grid – BOINC/Debian Package
Specific service/extension to Taverna
–UNICORE plugin: Ask grid what services are available, include services in a workflow, invoke services on the grid (see talk by Shahbaz Memon)
Library to control job submission to grid
–PBS plugin: beanshells in a workflow include invocations of jobs
–KnowARC plugin: Advanced Resource Connector to submit jobs to NorduGrid

Cloud Analytics for Life Sciences
All user interaction via web interface
User data stored in the Cloud
Data for all tools and Web Services stored in the Cloud
Unified access to different workflow engines with our common REST API
Tools and Web Services for each workflow are installed together for easy replication
Diagram labels: Web interface; Input SNPs; Results; Storage (S3); Ensembl (mySQL); Cache (S3); Taverna Server; Workflow engine orchestrator; e-Hive; other; Taverna; Common API; Application specific tools and Web Services; WS; Tool

Tavoop: Taverna & Hadoop
Compiles Taverna Workflow to collection of Hadoop jobs
Designed for handling very large amounts of data
–Overhead to using Hadoop, but wins if enough data
–Data ingest (expensive step) must have already been done
Supports Taverna Platform Execution interface
Parallelisable service types
http://wiki.opf-labs.org/display/SP/PPL
Diagram labels: Hadoop Cluster; Taverna Execution Interface; Tavoop Compiler; Portal (Taverna Player); GUI Application (Workbench)

Interacting with a workflow
Many workflows need user interaction
A workflow on a server does not need to be “press a button and wait”
–VPH-Share opens a VNC connection to the spawned instance.
Taverna Interaction Service
–Users interact with a workflow (wherever it is running) in a web browser.
–Interaction Service Plug-in in workbench

URLs and Frames

Taverna Tool Spectrum
Diagram labels: Technical; Computational Scientist; Domain Scientist; Workbench; Components; Lite; Player; Command Line; Domain-Specific Website / Tool / Portal; Workflow Visibility; Concept Knowledge; Taverna; Domain; High; Low

Taverna Client Family
Java library / Ruby gem
Run a Taverna workflow in another workflow system, e.g. Galaxy tools
Command line
Simple Taverna “player”
–Fixed workflow
Upload & run workflows and choose data
–Universitat Pompeu Fabra’s “Soaplab MajorDomo”
–Taverna Lite

Tavaxy Taverna-Galaxy

Taverna-Lite
Generic Web-based Client
Hide complexity
Access to datasets
Upload and interact with workflows
Build Portal
Homepage
User-Sessions
Workflow Management
Run Management
Server Credentials
Uses Components for simpler assembly and workflow edits

Web apps to create and run workflows
Service Chaining Editor
Pete Walker et al, Plymouth Marine Laboratory
For chaining OGC Web Processing Service geospatial Web services

Web apps to create and run workflows
Online Taverna
Dr Vadim Surpin and Vitaly Sharanutsa, Institute for Information Transmission Problems of Russian Academy of Sciences (IITP RAS)
An online, in-browser application for assembling and running Taverna Workflows over an HPC platform
Software Sustainability Institute Booth, Dr Vadim Surpin

Upload workflow by URL Online Taverna

Taverna 3 Beta July 2013

Summary
Taverna Suite for interactive and batch workflows
Flexible Plug-ins and Flexibly Plugged-in
Themed Taverna
Establishing Taverna Foundation
We welcome collaboration/contribution

Learn more…. myGrid – Taverna – myExperiment – BioCatalogue – Wf4ever – SCAPE – Software Sustainability Institute – BioVeL –