EnVisioning Data Integration SME forum 2009, Vienna Henning Hermjakob Henning Hermjakob

Slides:



Advertisements
Similar presentations
웹 서비스 개요.
Advertisements

IRRA DSpace April 2006 Claire Knowles University of Edinburgh.
A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
DIGIDOC A web based tool to Manage Documents. System Overview DigiDoc is a web-based customizable, integrated solution for Business Process Management.
MIT Lincoln Laboratory A Service-Oriented Approach to Application Development Robert Darneille & Gary Schorer WPI MQP Presentations ICS Group 10 October.
Using Taverna to access SOAP-based web services Per Larsson CBR
VAMDC Registry Portal Proof of Concept. Registry VAMDC Registry is available at – ex.jsp
Integration of Protein Family, Function, Structure Rich Links to >90 Databases Value-Added Reports for UniProtKB Proteins iProClass Protein Knowledgebase.
European Life Sciences Infrastructure for Biological Information Rafael C Jimenez ELIXIR CTO EMBL-EBI workshop networks and pathways.
EBI Proteomics Services Team – Standards, Data, and Tools for Proteomics Henning Hermjakob European Bioinformatics Institute SME forum 2009 Vienna.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Introduction to Web services MSc on Bioinformatics for Health Sciences May 2006 Arnaud Kerhornou Iván Párraga García INB.
A Flexible Workbench for Document Analysis and Text Mining NLDB’2004, Salford, June Gulla, Brasethvik and Kaada A Flexible Workbench for Document.
NextGRID & OGSA Data Architectures: Example Scenarios Stephen Davey, NeSC, UK ISSGC06 Summer School, Ischia, Italy 12 th July 2006.
August 29, 2002InforMax Confidential1 Vector PathBlazer Product Overview.
EBI is an Outstation of the European Molecular Biology Laboratory. Web Services Programmatic access to Life Sciences resources. Rodrigo Lopez.
1 BrainWave Biosolutions Limited Accelerating Life Science Research through Technology.
Web Services Interoperability Through Standardisation The EMBRACE Technology Recommendation.
DEMO CSE fall. What is GeneMANIA GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional.
Emerging Technologies Committee eForms and XML at NYS Dept of Taxation and Finance Jim Lieb, Director – Common Services NYS Department of Taxation and.
SeaDataNet Ontology Use Case Roy Lowry British Oceanographic Data Centre Coastal Atlas Interoperability Workshop, Corvallis, July (+ Lessons.
SOA, BPM, BPEL, jBPM.
Cytoscape A powerful bioinformatic tool Mathieu Michaud
Updating and Improving the INTAMAP web service Madhu Rani 2012 Intern 1.
Review of Ondex Bernice Rogowitz G2P Visualization and Visual Analytics Team March 18, 2010.
SSC2: Web Services. Web Services Web Services offer interoperability using the web Web Services provide information on the operations they can perform.
Sept 13-15, 2004IHE Interoperability Workshop 1 Integrating the Healthcare Enterprise Overview of IHE IT Infrastructure Patient Synchronized Applications.
The Functional Genomics Experiment Model (FuGE) Andy Jones School of Computer Science and Faculty of Life Sciences, University of Manchester.
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
1 CLASS – Simple NOAA Archive Access Portal SNAAP Eric Kihn and Rob Prentice NOAA/NGDC ESIP Meeting January 7 th, 2009 Simple NOAA Archive Access Portal.
NERC DataGrid Vocabulary Server Access Vocabulary Workshop, RAL, February 25, 2009.
SEMESTER PROJECT PRESENTATION CS 6030 – Bioinformatics Instructor Dr.Elise de Doncker Chandana Guduru Jason Eric Johnson.
GCMD/IDN STATUS AND PLANS Stephen Wharton CWIC Meeting February19, 2015.
1 Technologies for distributed systems Andrew Jones School of Computer Science Cardiff University.
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
Web Services Kanda Runapongsa Dept. of Computer Engineering Khon Kaen University.
Supporting High- Performance Data Processing on Flat-Files Xuan Zhang Gagan Agrawal Ohio State University.
Taverna Workflow. A suite of tools for bioinformatics Fully featured, extensible and scalable scientific workflow management system – Workbench, server,
1 © ATHENA Consortium 2006 ATHENA M30 Intermediate Audit October 2006 Brussels, Belgium.
XML Registries Source: Java TM API for XML Registries Specification.
Web Services interoperability and standards. Infrastructure Challenge ● Applied bioinformatics need various computer resources ● The amount and size of.
Copyright OpenHelix. No use or reproduction without express written consent1.
Nadir Saghar, Tony Pan, Ashish Sharma REST for Data Services.
XML Web Services Architecture Siddharth Ruchandani CS 6362 – SW Architecture & Design Summer /11/05.
Taverna Workflows for Systems Biology Katy Wolstencroft School of Computer Science University of Manchester.
BIological NetwOrk Manager Cytoscape plugin Andrei Zinovyev Institut Curie/INSERM/Ecole de Mines, UMR 900 “Computational Systems Biology of Cancer”
The european ITM Task Force data structure F. Imbeaux.
1 Geospatial and Business Intelligence Jean-Sébastien Turcotte Executive VP San Francisco - April 2007 Streamlining web mapping applications.
The Functional Genomics Experiment Object Model (FuGE) Andrew Jones, School of Computer Science, University of Manchester MGED Society.
Implementing computational analysis through Web services Arnaud Kerhornou CRG/INB Barcelona - BioMed Workshop IRB November 2007.
Moby Web Services Iván Párraga García MSc on Bioinformatics for Health Sciences May 2006.
Generic Database. What should a genome database do? Search Browse Collect Download results Multiple format Genome Browser Information Genomic Proteomic.
A collaborative tool for sequence annotation. Contact:
A PPARC funded project Common Execution Architecture Paul Harrison IVOA Interoperability Meeting Cambridge MA May 2004.
EMBL-EBI Structural Proteomics Automatic Target Selection Gordon Whamond.
A Resource Discovery Service for the Library of Texas Requirements, Architecture, and Interoperability Testing William E. Moen, Ph.D. Principal Investigator.
The Protein Identifier Cross-Reference (PICR) service.
Intro to Web Services Dr. John P. Abraham UTPA. What are Web Services? Applications execute across multiple computers on a network.  The machine on which.
1 CLASS – Simple NOAA Archive Access Portal SNAAP Eric Kihn and Rob Prentice NGDC CLASS Developers Meeting July 14th, 2008 Simple NOAA Archive Access Portal.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
ECHO Technical Interchange Meeting 2013 Timothy Goff 1 Raytheon EED Program | ECHO Technical Interchange 2013.
RDF based on Integration of Pathway Database and Gene Ontology SNU OOPSLA LAB DongHyuk Im.
VIEWS b.ppt-1 Managing Intelligent Decision Support Networks in Biosurveillance PHIN 2008, Session G1, August 27, 2008 Mohammad Hashemian, MS, Zaruhi.
By Jeremy Burdette & Daniel Gottlieb. It is an architecture It is not a technology May not fit all businesses “Service” doesn’t mean Web Service It is.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Designing, Executing and Sharing Workflows with Taverna 2.4 Different Service Types Katy Wolstencroft Helen Hulme myGrid University of Manchester.
Take a REST from manual searching
EMBL-EBI, programmatically - take a REST from manual searching: Sequence analysis tools Web Production Team Anna Foix Joon Lee.
Unit – 5 JAVA Web Services
LOD reference architecture
Presentation transcript:

EnVisioning Data Integration SME forum 2009, Vienna Henning Hermjakob Henning Hermjakob

Henning Hermjakob Enfin Experiment Model EnCore

Henning Hermjakob Use cases 1.Target user group: Bioinformaticians, programmatic access 2.Simple 1.Set of “interesting” Affymetrix ids, 2.Get the relevant UniProt accession numbers 3.Get the surrounding interaction networks from IntAct 3.A bit more 1.Set of differentially expressed proteins in Pride Find experiments with “similar” set of regulated genes Get Reactome pathways Expand protein set by IntAct, then get Reactome pathways 4.Even more: EnVision

Red edges: Bouwmeester et al, 2005 Green edges: Rual et al, 2005 Violet edges: Stelzl et al, 2005

Henning Hermjakob Infrastructure Shallow integration easy addition of resources independent resources minimal centralisation easier to maintain very flexible Common Service Interface established standards well defined schema

Henning Hermjakob Diverse web service world External service External service SOAP XML REST CSV plain text PERL API JAVA API SOAP XML REST CSV plain text PERL API JAVA API database access analysis tools database access analysis tools Multiple manual connections with possibly multiple technologies Multiple result files which have to be combined manually Difficult to keep audit trail Much work to reproduce Multiple manual connections with possibly multiple technologies Multiple result files which have to be combined manually Difficult to keep audit trail Much work to reproduce ? ? ? ? ? ? ? ?

Henning Hermjakob EnCORE Enfin XML EnCORE service Enfin XML EnCORE service Enfin XML EnCORE service Enfin XML EnCORE service EnVISION User interface & representation heterogeneous external world standardised EnCORE world External service External service Single entry point One technology No manual combination of results Audit trial build in Visualisation build in Easy to reproduce Single entry point One technology No manual combination of results Audit trial build in Visualisation build in Easy to reproduce ! !

Henning Hermjakob ENFIN XML enXml – the EnCORE data exchange format XML schema standard interface to services simple and easy to understand structure generic to allow various data types stores service results and keeps an audit trail minimal restrictions for data representation high degree of freedom modelling user data need for modelling guidelines to ensure service interoperability

Henning Hermjakob enXml document graph EnsMartIntAct start toUniProt ppiExpand s2 s12 s26 s28 s27s29 Molecules Sets Experiments 1993_s_at BRCA1 BRAP Q5ST83 H2AFX

Henning Hermjakob enXml document graph EnsMartIntAct start toUniProt ppiExpand s2 s12 s26 s28 s27s29 Molecules Sets Experiments 1993_s_at BRCA1 BRAP Q5ST83 H2AFX Source relation

Henning Hermjakob Existing EnCore web services AffyMetrixprobe set ID to protein ID mapping ArrayExpressmicro array data BioModelssearch for biological models CellMINTprotein localization information g:GOStprotein grouping, functional profiling IntActprotein interactions KEGG pathwaypathway search PICRProtein Identifier Cross Reference PRIDEprotein identification Reactomepathway search UniProtprotein information retrieval Utility generation of ENFIN XML from protein IDs

Henning Hermjakob EnCORE Enfin XML EnCORE service Enfin XML EnCORE service Enfin XML EnCORE service Enfin XML EnCORE service EnVISION User interface & representation heterogeneous external world standardised EnCORE world External service External service Single entry point One technology No manual combination of results Audit trial build in Visualisation build in Easy to reproduce Single entry point One technology No manual combination of results Audit trial build in Visualisation build in Easy to reproduce ! !

Henning Hermjakob Synchronous communication - doService performs service with standard parameters - doServiceAdv performs service with custom parameters - doServiceTest only echoes the input clientclient serviceservice ENFIN XML ENFIN XML ENFIN XML ENFIN XML call service

Henning Hermjakob Domaination protein domain prediction tool analysis tool, not only data retrieval service possible long run times sync communication inadequate initiator for async communication model

Henning Hermjakob Asynchronous web services - doServiceAsync submits service with standard parameters & returns job ticket - getStatus reports the status of the job with specified ticket - retrieveResult returns the result of job with specified ticket serviceservice ENFIN XML ENFIN XML ENFIN XML ENFIN XML ticket number status clientclient submit loop retrieve if status OK

Henning Hermjakob EnCore use Primarily designed as framework for bioinformaticians Write your own client to access one or multiple services (example clients available in different programming languages) Very flexible access, can be tailored to your specific needs Full control over the client and its functionality Create your own services to extend the functionality of EnCORE Semi-automatic WSDL wrapper generation for services Workflow control with Taverna (Prototype)

Henning Hermjakob EnVision: Application of EnCore in a semi-fixed data flow Easier to demonstrate functionality than by showing a bunch of WSDLs Production application for the analysis of (proteomics) datasets Source for biologist feedback EnVision(1): Technically oriented demonstrator, access to XML configuration files, XSLT output generation EnVision2: “Friendly” end user application Beta version EnVision

Henning Hermjakob

Protein Identifier Space Translation PICR translates between ca. 20 protein identifier spaces Based on sequence identity Shows all known sequence-identifier associations, both historic and current Based on UniParc archive of 18 million public protein sequences Interactive use and computational access (web service, REST) Côté RG, et al.: The Protein Identifier Cross-Referencing (PICR) service: reconciling protein identifiers across multiple source databases. BMC Bioinformatics Oct 18;8:401.

Protein Identifier Space Translation

Ontology Lookup Service

Côté RG, Jones P, Martens L, Apweiler R, Hermjakob H.: The Ontology Lookup Service: more data and better tools for controlled vocabulary queries. Nucleic Acids Res May 8.

Tying databases together: DAS DAS Registry DAS Proxy DAS Servers DAS Infrastructure User

- Lightweight integration

Henning Hermjakob Acknowledgements EU FP6 LSHG-CT Pascal Kahlem

Henning Hermjakob ?

Henning Hermjakob

Henning Hermjakob

Henning Hermjakob Examples of data modelled in enXml Enfin IntAct service: find interaction partners enfin-intact ID2 ID56 Enfin Reactome service: find pathways from protein list enfin-reactome ID8 ID13 ID14 true

Henning Hermjakob EnCORE Enfin XML EnCORE service Enfin XML EnCORE service Enfin XML EnCORE service Enfin XML EnCORE service EnVISION User interface & representation heterogeneous external world standardised EnCORE world External service External service Single entry point One technology No manual combination of results Audit trial build in Visualisation build in Easy to reproduce Single entry point One technology No manual combination of results Audit trial build in Visualisation build in Easy to reproduce ! !

Henning Hermjakob EnVision