Taverna and SoapLab Elda Rossi – CINECA (Italy)

Slides:



Advertisements
Similar presentations
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik University of Manchester.
Advertisements

Lesson 6 Software and Hardware Interaction
ViaLogy Lien Chung Jim Breaux, Ph.D. SoCalBSI 2004 “ Improvements to Microarray Analytical Methods and Development of Differential Expression Toolkit ”
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 8 Introduction to Printers in a Windows Server 2008 Network.
Cambodia-India Entrepreneurship Development Centre - : :.... :-:-
Installing Tomcat on Windows  You may find the Tomcat install shield has some problems recognizing JSDK 1.4 beta installations.  You.
1. QRP: QRP is reports file which is being called from application when ever clients choose to view reports. 2. What Is QRPBANK? QRPBANK is a data file.
Microsoft Application Virtualization (App-V) ver 4.5 overview Nguyen An Que Microsoft Vietnam Tech blog: blogs.technet.com/quenguyen.
INTRODUCTION TO WEB DATABASE PROGRAMMING
Ext Environment Copyright © 2005 Liferay, LLC All Rights Reserved. No material may be reproduced electronically or in print without written permission.
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik University of Manchester materials by Dr Katy Wolstencroft and Dr Aleksandra.
Servlets Environment Setup. Agenda:  Setting up Java Development Kit  Setting up Web Server: Tomcat  Setting up CLASSPATH.
1 HTML and CGI Scripting CSC8304 – Computing Environments for Bioinformatics - Lecture 10.
Tutorial 11 Installing, Updating, and Configuring Software
CS441 CURRENT TOPICS IN PROGRAMMING LANGUAGES LECTURE 5_1 George Koutsogiannakis/ Summer
Standard Grade Computing System Software & Operating Systems.
Guidelines for Homework 6. Getting Started Homework 6 requires that you complete Homework 5. –All of HW5 must run on the GridFarm. –HW6 may run elsewhere.
Internet of Things with Intel Edison Compiling and running Pierre Collet Intel Software.
9 Chapter Nine Compiled Web Server Programs. 9 Chapter Objectives Learn about Common Gateway Interface (CGI) Create CGI programs that generate dynamic.
The gLite API – PART I Giuseppe LA ROCCA INFN Catania ACGRID-II School 2-14 November 2009 Kuala Lumpur - Malaysia.
An Introduction to Designing and Executing Workflows with Taverna Katy Wolstencroft University of Manchester.
ASP.NET.. ASP.NET Environment ASP.NET is Microsoft's programming framework that enables the development of Web applications and services. It is an easy.
Plug-in Development Environment. Session Outline Tools Installation Configuration New Project Basic Debugging Remote Debugging.
Python From the book “Think Python”
COP 3330 Notes 1/12. Today's topics Downloading Java and Eclipse Hello World Basic control structures Basic I/O Strings.
Contents 1.Introduction, architecture 2.Live demonstration 3.Extensibility.
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik materials by: Katy Wolstencroft University of Manchester.
9/2/ CS171 -Math & Computer Science Department at Emory University.
WS-Freefluo-VLAM Tutorial Spiros Koulouzis. Outline Get the WS-Freefluo-VLAM (GUI) Start the WS-Freefluo-VLAM (GUI) Generate modules from WSDL Compose.
Topic Java EE installation (Eclipse, glassfish, etc.) Eclipse configuration for EE Creating a Java Web Dynamic Project Creating your first servlet.
NEES Cyberinfrastructure Center at the San Diego Supercomputer Center, UCSD George E. Brown, Jr. Network for Earthquake Engineering Simulation Analyzing.
An Introduction to Designing, Executing and Sharing Workflows with Taverna Katy Wolstencroft myGrid University of Manchester IMPACT/Taverna Hackathon 2011.
APST Internals Sathish Vadhiyar. apstd daemon should be started on the local resource Opens a port to listen for apst client requests Runs on the host.
Parsing BLAST output. Output of a local BLAST search “less” program Full path to the BLAST output file.
Performing statistical analyses using the Rshell processor Original material by Peter Li, University of Birmingham, UK Adapted by Norman.
Introduction to Taverna Online and Interaction service Aleksandra Pawlik University of Manchester.
Chapter 1 Computers, Compilers, & Unix. Overview u Computer hardware u Unix u Computer Languages u Compilers.
PAYware Transact Terminal Interface Manager
1 G52IWS: Example Web-services Chris Greenhalgh. 2 Contents Software requirements AXIS web service run-time components Getting started with Jetty & AXIS.
Agenda Using FTP What is FTP? How to Use the FTP Program How to transfer files Using FTP.
Soaplab SOAP-based Analysis Web Services Martin Senger
Install CB 1.8 on Ubuntu. Steps Followed Install Ubuntu (Ubuntu LTS) on Virtual machine – (VMware Workstation) (
Technical lssues for the Knowledge Engineering Competition Stefan Edelkamp Jeremy Frank.
1 CSE 451 Section 2: Processes, the shell, and system calls.
Downloading the MAXENT Software
Wouter Verkerke, NIKHEF 1 Using ‘stoomboot’ for NIKHEF-ATLAS batch computing What is ‘stoomboot’ – Hardware –16 machines, each 2x quad-core Pentium = 128.
Chapter 16 Advanced Bourne Shell Programming. Copyright © 2005 Pearson Addison-Wesley. All rights reserved. Objectives To discuss numeric data processing.
Alan Williams University of Manchester.  Allows you to call a command line script as part of a workflow  Simplest case is calling a single tool  Can.
Designing, Executing and Sharing Workflows with Taverna 2.2 Katy Wolstencroft myGrid University of Manchester.
Upgrade on Windows 7. DownloadSoftware Download Software from link provided in Webliography: e/
Exploring Taverna 2 Katy Wolstencroft myGrid University of Manchester.
CACI Proprietary Information | Date 1 Upgrading to webMethods Product Suite Name: Semarria Rosemond Title: Systems Analyst, Lead Date: December 8,
1 Chapter 1 INTRODUCTION TO WEB. 2 Objectives In this chapter, you will: Become familiar with the architecture of the World Wide Web Learn about communication.
Advanced Computing Facility Introduction
Managing User Desktops with Group Policy
Taverna and SoapLab Elda Rossi – CINECA (Italy)
Connect:Direct for UNIX v4.2.x Silent Installation
Performing statistical analyses using the Rshell processor
Data Virtualization Tutorial: Introduction to SQL Script
Web Interface for Formatter
IW2D migration to HTCondor
An Introduction to Designing and Executing Workflows with Taverna
The Linux Operating System
Introduction to Programming the WWW I
Perl A simple test.
PHP Introduction.
Introduction to javadoc
Introduction to javadoc
Getting Started With Solr
Presentation transcript:

Taverna and SoapLab Elda Rossi – CINECA (Italy)

What is CINECA Cineca is a consortium of Italian Universities and CNR Funded in 1969, now under the control of Research and University Ministry

Resources The most important national infrastructure in Italy for the computational support to scientific research Mission: promoting the use of the most advanced computing systems to support public and private scientific and technological research

R & Bioconductor Bioconductor is an open source and open development software project for the analysis and comprehension of genomic data. It is based on R, a language and environment for statistical computing and graphics.

R & Bioconductor BioConductor is a collection of “packages” Two main types: 1. provides basic infrastructure support. 2. Provides innovative methodology We chose a function in the affy package (type 2. )

The affy package Package: affy Description: The package contains functions for exploratory oligonucleotide array analysis. The dependance to tkWidgets only concerns few convenience functions. 'affy' is fully functional without it. Version: Author: Rafael A. Irizarry, Laurent Gautier, Benjamin Milo Bolstad, and Crispin Miller with contributions from … Maintainer: Rafael A. Irizarry Dependencies: R (>= 1.9.0), Biobase (>= ), reposTools Suggests: tkWidgets (>= 1.2.2), affydata SystemRequirements: None License: LGPL version 2 or newer URL: None available Function: Expresso. From raw probe intensities to expression values

The expresso function Expression measures The most common operation is certainly to convert probe level data to expression values. 1. reading in probe level data 2. background correction 4 methods 3. Normalization 7 methods 4. probe specific background correction, e.g. subtracting MM 3 methods 5. summarizing the probe set values into one expression measure and, in some cases, a standard error for this summary 5 methods

How to run expresso $ R > library(affy) > data<-ReadAffy() > data.mas<-expresso(data,bgcorrect.method="mas", pmcorrect.method="mas", normalize.method="constant", summary.method="medianpolish") > write.exprs(data.mas,file=“Data.out") Data.CEL Data.out $ R CMD BATCH script library(affy) data<-ReadAffy() data.mas<-expresso(data,bgcorrect.method="mas", pmcorrect.method="mas", normalize.method="constant", summary.method="medianpolish") write.exprs(data.mas,file=“Data.out") Report script

The files [CEL] Version=3 [HEADER] Cols=126 Rows=126 TotalX=126 TotalY=126 Baseline=Not normalized DatHeader=ctrl150:CLS=1167 … [INTENSITY] NumberCells=15876 CellHeader=X Y MEAN Sample001.cel Sample002.cel Sample003.cel _at _at _at _at _at _at _at _at _at _at _at _at _at _at _at _at _at CEL file OUT file

Setting up SoapLab A linux based server was chosen Tomcat was installed Java was upgraded Axis was installed SoapLab was installed Vega.cineca.it Tomcat Java 1.4 Axis 1.1 SoapLab precompiled for Suse Linux Up to here: No Problems !!!

Defining the Application 1. Write the application wrapper 2. Write the ACD file for the application 3. Convert ACD to XML 4. Start up the SoapLab server 5. Deploy the new service

1. Write the application wrapper #!/usr/bin/perl use Getopt::Long; # command arguments (with default) GetOptions("bgcorrect=s"=>\$bgcorrect, "normalize=s"=>\$normalize); $bgcorrect="mas" if $bgcorrect eq ""; $normalize="constant" if $normalize eq ""; # location of R executable $rexe="/biotools/R/R-2.1.0/bin/R"; # data directory $datadir=“/biotools/services/data"; # R code to run analysis open(AFFY,">$datadir/affy"); print AFFY <<EOF ; library(affy) data<-ReadAffy() data.mas<-expresso(data, bgcorrect.method="$bgcorrect", pmcorrect.method="mas", normalize.method="$normalize", summary.method="medianpolish") write.exprs(data.mas,file="data.txt") EOF close(AFFY); # now run program system "cd $datadir; $rexe CMD BATCH affy"; # print output open(OUT,"$datadir/data.txt"); while ( ) {print $_;} close(OUT); /biotools/services/affy-expresso.pl

2. Write the ACD file appl: bioconductor [ documentation: "affy/expresso function of BioConductor" version: "1.0" groups: "Microarrays" nonemboss: "Y" executable: affy-expresso.pl ] string: bgcorrect [ additional: "Y" parameter: "Y" default: "mas" ] string: normalize [ additional:"Y" parameter: "Y" default: "constant" ] outfile: output [ additional: "Y" default:“stdout" /biotools/soapbin/analysis-interfaces/metadata/affy.acd The path is defined in the shell Input1: Background correction Input1: Normalization method Output: standard output

3, 4, 5: Final steps 3. Convert ACD to XML 4. Start up the SoapLab server 5. Deploy the new service /biotools/soapbin/analysis-interfaces/generator/acd2xml From:../metadata/affy.acd To:../metadata/microarrays/affy-al.xml /biotools/soapbin/analysis-interfaces/run-AppLab-serverHow to shut down the server? /biotools/soapbin/analysis-interfaces/ws/deploy-web-services

Using the service from Taverna From the Available service window select Add new SoapLab scavenger and enter our server address

Using the service … (2) The new processor appears in the microarrays folder you can find the affy service After connecting input & output ports, the service can be launched

Problems encountered Documentation is not so clear and complete How can we transfer (large) files from the personal WS to the server machine We need a permanent and private data area for storing data We would like to monitor the service while it is running (asynchronous services?) How can we return data in addition to stdOut and stdErr …..

A possible (future) workflow Upload one or more CEL files on the server Analyse the data and get expression levels Verify the output data download the output data and clear the personal space WS-plot WS-expresso OK ? NO YES WS-upload