Building and Running caGrid Workflows in Taverna

1 Computation Institute, University of Chicago and Argonne National Laboratory, Chicago, IL, USA
2 Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL, USA
3 School of Computer Science, University of Manchester, Manchester, UK

OVERVIEW

To help users from biological and biomedical domains create and execute their workflows efficiently, the caGrid Workflow team, together with the ICR working group, selected the Taverna workbench and built a tool suite that orchestrates caGrid Data and Analytical services for ICR workflows. The tool suite aims to provide an easy-to-use workflow authoring and submission tool that integrates caGrid services as well as third-party services in scientific workflows. We also helped the caGrid community build several workflows of real scientific value, and we are committed to supporting caBIG users across workspaces in creating and executing their domain-based workflows.

Web Resources: Taverna; caGrid Plug-in download; caBIG; caGrid Workflow Quick Start Guide

End-to-End Solution for caGrid Workflow

Search the caGrid Index Service for registered caGrid services matching various search criteria: service name, inputs, outputs, research center, class names, concept codes, etc.

Application: Lymphoma Prediction Workflow *,[1]

Scientific value
Use gene-expression patterns associated with diffuse large B-cell lymphoma (DLBCL) and follicular lymphoma (FL) to predict the lymphoma type of an unknown sample. Use the GenePattern services SVM and KNN to build the tumor classification model and predict the tumor types of unknown examples.

Major steps
Extract Microarray. Query the training data and the unknown sample from experiments stored in caArray.
Preprocess Microarray. Preprocess (normalize) the microarray data for later processing.
Predict Lymphoma Type. Predict the lymphoma type using the SVM and KNN services.
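The three major steps can be pictured as a small local pipeline. The sketch below is only an illustration under stated assumptions: it does not call caArray, GenePattern, or Taverna, the expression values are made-up toy data, every function name is hypothetical, and a plain k-nearest-neighbor vote stands in for the SVM and KNN services used in the real workflow.

```python
import math
from collections import Counter

# Hypothetical toy data standing in for a caArray query result:
# each entry is (expression values for three genes, known lymphoma type).
TRAINING = [
    ([8.0, 2.0, 5.0], "DLBCL"),
    ([7.5, 2.5, 5.5], "DLBCL"),
    ([9.0, 1.5, 4.5], "DLBCL"),
    ([2.0, 8.0, 5.0], "FL"),
    ([2.5, 7.5, 4.0], "FL"),
    ([1.5, 9.0, 6.0], "FL"),
]
UNKNOWN = [8.2, 2.2, 5.1]


def extract_microarray():
    """Step 1: return the training samples and the unknown sample."""
    return TRAINING, UNKNOWN


def preprocess_microarray(training, unknown):
    """Step 2: z-score each gene using statistics from the training set."""
    profiles = [values for values, _ in training]
    n_genes = len(unknown)
    means = [sum(p[g] for p in profiles) / len(profiles) for g in range(n_genes)]
    stds = []
    for g in range(n_genes):
        var = sum((p[g] - means[g]) ** 2 for p in profiles) / len(profiles)
        stds.append(math.sqrt(var) or 1.0)  # fall back to 1.0 for constant genes

    def scale(values):
        return [(v - m) / s for v, m, s in zip(values, means, stds)]

    scaled_training = [(scale(values), label) for values, label in training]
    return scaled_training, scale(unknown)


def predict_lymphoma_type(training, unknown, k=3):
    """Step 3: majority vote among the k nearest training samples."""
    by_distance = sorted(training, key=lambda item: math.dist(item[0], unknown))
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]


def run_workflow(k=3):
    """Chain the three steps in the order the workflow lists them."""
    training, unknown = extract_microarray()
    training, unknown = preprocess_microarray(training, unknown)
    return predict_lymphoma_type(training, unknown, k)


if __name__ == "__main__":
    print(run_workflow())
```

Each function consumes only the output of the previous one, which mirrors how a workflow tool wires the output ports of one service to the input ports of the next. In the actual workflow each step is a remote service invocation, not a local function call.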
Extension
Generalized the lymphoma prediction workflow into a cancer type prediction workflow and applied it to Experiment 236 in the caArray database.[2]

[1] MA Shipp, et al. Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning. Nature Medicine, 2002(8)
[2] S. Ramaswamy, et al. Multiclass cancer diagnosis using tumor gene expression signatures. PNAS, vol. 98, p ,
*Acknowledgement: Juli Klemm, Xiaopeng Bian, Rashmi Srinivasa (NCI); Jared Nedzel (MIT)

Log onto a given Grid and configure the service's security properties with a caGrid credential.

Lymphoma prediction workflow
1. Extract Microarray
2. Preprocess Microarray
3. Predict Lymphoma Type

Semantic search
WSRF Support: Invoke stateful Grid services
caGrid Security Support

Available caGrid Workflows
caDSR data query
Protein sequence query
Microarray clustering
Lymphoma prediction
Cancer classification

caGrid workflows at myExperiment: workflows/search?query=cabig
"Facebook" for caGrid workflows

Result of the lymphoma prediction workflow
Result of the cancer type prediction over caArray Experiment 236