EBI is an Outstation of the European Molecular Biology Laboratory. Bird's Eye View of... Molecular Interaction Standards: PSI-MI XML, PSI-MI Tool support.

Presentation transcript:

Bird's Eye View of... Molecular Interaction Standards
- PSI-MI XML
- PSI-MI Tool support (APIs, Validator)
- ChEBI
APO-SYS workshop, 20 – 21 January 2009, Berlin

2 A gentle introduction to the PROTEOMICS STANDARDS INITIATIVE

3
Engineering 1850
- Nuts and bolts fit perfectly together, but only if they originate from the same factory
- Standardisation proposal in 1864 by William Sellers
- It took until after WWII for it to be generally accepted, though …
Proteomics 2003
- Proteomics data are perfectly compatible, but only if they are from the same lab / database / software
- “Publish and vanish” by data producers
- Collecting all publicly available data requires huge effort
- Urgent need for standardisation

4 PSI-MI XML format
- Community standard for Molecular Interactions
- XML schema and detailed controlled vocabularies
- Jointly developed by major data providers: BIND, CellZome, DIP, GSK, HPRD, Hybrigenics, IntAct, MINT, MIPS, Serono, U. Bielefeld, U. Bordeaux, U. Cambridge, and others
- Version 1.0 published in February 2004: The HUPO PSI Molecular Interaction Format - A community standard for the representation of protein interaction data. Henning Hermjakob et al., Nature Biotechnology 2004, 22
- Version 2.5 published in October 2007: Broadening the Horizon – Level 2.5 of the HUPO-PSI Format for Molecular Interactions. Samuel Kerrien et al., BioMed Central

5 PSI-MI XML benefits
- Collecting and combining data from different sources has become easier
- Standardized annotation through PSI-MI ontologies
- Tools from different organizations can be chained, e.g. analysis of IntAct data in Cytoscape
- Home page

6 An overview of the PSI-MI CONTROLLED VOCABULARIES

7 Ontology Lookup Service
- Makes OBO controlled vocabularies available
- Web site allows searching the vocabularies and browsing their hierarchy

8 Ontology Lookup Service
- Each term has a definition as well as a literature reference

9 An overview of the PSI-MI XML 2.5 DATA MODEL

10 PSI-MI 2.5 Standards

11 Bird’s eye view of PSI-MI XML 2.5
- Top level structure unchanged compared to PSI-MI 1.0
- Use of Id/Ref on main objects
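The Id/Ref idea mentioned on this slide can be sketched in a few lines of Python. The XML fragment is hand-written and heavily simplified: it drops the namespace and most mandatory elements, and the element names follow the PSI-MI XML 2.5 schema only as far as this sketch needs them. The helper name `resolve_experiments` is hypothetical and is not part of any PSI-MI library.

```python
import xml.etree.ElementTree as ET

# Hand-written, simplified fragment (assumption): no namespace, most
# mandatory elements omitted, ids and labels invented for illustration.
SAMPLE = """
<entry>
  <experimentList>
    <experimentDescription id="1"><names><shortLabel>exp-a</shortLabel></names></experimentDescription>
    <experimentDescription id="2"><names><shortLabel>exp-b</shortLabel></names></experimentDescription>
  </experimentList>
  <interactionList>
    <interaction id="10">
      <experimentList><experimentRef>2</experimentRef></experimentList>
    </interaction>
  </interactionList>
</entry>
"""


def resolve_experiments(entry):
    """Map each interaction id to the short labels of the experiments it references."""
    # Index the experiments declared once at entry level by their id attribute.
    by_id = {
        exp.get("id"): exp.findtext("names/shortLabel")
        for exp in entry.findall("experimentList/experimentDescription")
    }
    resolved = {}
    for interaction in entry.findall("interactionList/interaction"):
        refs = [ref.text for ref in interaction.findall("experimentList/experimentRef")]
        # Each reference is looked up in the index instead of repeating the experiment.
        resolved[interaction.get("id")] = [by_id[ref] for ref in refs]
    return resolved


print(resolve_experiments(ET.fromstring(SAMPLE)))
```

Declaring an object once and pointing to it by id keeps files compact when many interactions share the same experiment or interactor.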

12 Main objects - Experiment
- Controlled by ontologies
- Literature references
- Confidence measures

13 Main objects - Interactor
- Generic interactor
- Reference to a public database

14 Main objects - Interaction
- Controlled by ontology
- Copyright
- Experiment
- Kinetics parameters
- Confidence value

15 Basics – Controlled Vocabularies
Why?
- Ensure data consistency
- Provide a reliable means for searching & filtering data
How?
- By providing a reference to an ontology term
- Using Xref!
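A minimal Python sketch of what "a reference to an ontology term using Xref" can look like in practice. The fragment is simplified (no namespace, only the `db` and `id` attributes of the reference), and `cv_term` is a hypothetical helper name. MI:0018 is the PSI-MI accession for "two hybrid".

```python
import xml.etree.ElementTree as ET

# Simplified fragment (assumption): real files carry a namespace and
# further attributes on the reference.
SAMPLE = """
<interactionDetectionMethod>
  <names><shortLabel>two hybrid</shortLabel></names>
  <xref><primaryRef db="psi-mi" id="MI:0018"/></xref>
</interactionDetectionMethod>
"""


def cv_term(element):
    """Return (database, accession) of the ontology term an element points to."""
    ref = element.find("xref/primaryRef")
    if ref is None:
        raise ValueError("element carries no xref to an ontology term")
    return ref.get("db"), ref.get("id")


print(cv_term(ET.fromstring(SAMPLE)))
```

Searching and filtering on the accession rather than on the free-text label is what makes results consistent across data providers.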

16 Main objects - Participant
- Interactor
- Biological role, e.g. enzyme target
- Experimental role, e.g. bait, prey
- Delivery method, expression level…
- Interactor used experimentally
- Building of Complex

17 An overview of the PSI-MI TAB DATA MODEL

18 PSIMITAB Standard Columns
Standard columns (15):
- ID(s) interactor A & B
- Alt. ID(s) interactor A & B
- Alias(es) interactor A & B
- Interaction detection method(s)
- Publication 1st author(s)
- Publication Identifier(s)
- Taxid interactor A & B
- Interaction type(s)
- Source database(s)
- Interaction identifier(s)
- Confidence value(s)
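The column list above gives 15 columns because every "A & B" entry counts as two. A short Python sketch of reading one such line follows. It assumes the usual MITAB conventions: tab-separated columns, `|` between multiple values, and `-` for an empty column. The dictionary keys, the helper name `parse_mitab_line` and all identifiers in the sample line are made up for illustration.

```python
# Short names for the 15 standard columns, in file order (names are illustrative).
MITAB25_COLUMNS = [
    "id_a", "id_b", "alt_id_a", "alt_id_b", "alias_a", "alias_b",
    "detection_method", "first_author", "publication_id",
    "taxid_a", "taxid_b", "interaction_type", "source_db",
    "interaction_id", "confidence",
]

# Invented example line; accessions and identifiers are placeholders.
SAMPLE_LINE = "\t".join([
    "uniprotkb:P12345", "uniprotkb:Q67890",
    "-", "-", "-", "-",
    'psi-mi:"MI:0018"(two hybrid)',
    "Doe et al. (2009)",
    "pubmed:0000000",
    "taxid:9606(human)", "taxid:9606(human)",
    'psi-mi:"MI:0915"(physical association)',
    'psi-mi:"MI:0469"(intact)',
    "intact:EBI-0000000|mint:MINT-0000000",
    "-",
])


def parse_mitab_line(line):
    """Split one MITAB line into a dict of column name -> list of values."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) < len(MITAB25_COLUMNS):
        raise ValueError(f"expected at least 15 columns, got {len(fields)}")
    record = {}
    for name, raw in zip(MITAB25_COLUMNS, fields):
        # Naive split: a '|' inside a quoted value would need a real tokenizer.
        record[name] = [] if raw == "-" else raw.split("|")
    return record


record = parse_mitab_line(SAMPLE_LINE)
print(record["id_a"], record["interaction_id"])
```

Columns beyond the fifteenth, such as the IntAct extensions, are ignored by this sketch.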

19 A quick look into INTACT EXTENDED MITAB

20 PSIMITAB Extended Columns
Standard columns (15):
- ID(s) interactor A & B
- Alt. ID(s) interactor A & B
- Alias(es) interactor A & B
- Interaction detection method(s)
- Publication 1st author(s)
- Publication Identifier(s)
- Taxid interactor A & B
- Interaction type(s)
- Source database(s)
- Interaction identifier(s)
- Confidence value(s)
+ IntAct specific columns (+11):
- Experimental role(s) of interactors
- Biological role(s) of interactors
- Properties (CrossReference) of interactors
- Type(s) of interactors
- HostOrganism(s)
- Expansion method(s)
- Dataset name(s)

21 A hands on introduction to PSI-MI XML 2.5 JAVA API

22 PSI-MI XML Java API
- Uses Java 5
- Provides a binding between XML and a Java object model
- Tools to read/write XML from/to file
- Reading can be done in 2 fashions:
  1. Load a whole file into an EntrySet
     - Large files can only be loaded if you have enough memory
     - Easy to update content and write back to file
  2. Index the XML data and access it through an IndexedEntry
     - Memory efficient with large files
     - Allows browsing through interactions, experiments…
     - Trickier to write updated content (yet feasible)
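The trade-off between the two reading modes can be illustrated without the Java API itself. The sketch below uses Python's standard `xml.etree.ElementTree` and swaps in plain streaming (`iterparse`) for the indexed access the slide describes, so it is an analogy rather than a port. The namespace string and the tiny sample document are assumptions. `load_all` and `stream_ids` are hypothetical names.

```python
import io
import xml.etree.ElementTree as ET

NS = "{net:sf:psidev:mi}"  # assumed PSI-MI XML namespace

# Tiny invented document; real entries contain far more structure.
SAMPLE = b"""<entrySet xmlns="net:sf:psidev:mi" level="2" version="5">
  <entry>
    <interactionList>
      <interaction id="1"/>
      <interaction id="2"/>
      <interaction id="3"/>
    </interactionList>
  </entry>
</entrySet>"""


def load_all(source):
    """Whole-file mode: build the complete tree, then collect every interaction id."""
    root = ET.parse(source).getroot()
    return [el.get("id") for el in root.iter(NS + "interaction")]


def stream_ids(source):
    """Streaming mode: yield interaction ids one at a time while parsing."""
    for _event, el in ET.iterparse(source, events=("end",)):
        if el.tag == NS + "interaction":
            yield el.get("id")
            # Drop the element's content once it has been handled; the empty
            # element itself stays attached to its parent.
            el.clear()


print(load_all(io.BytesIO(SAMPLE)))
print(list(stream_ids(io.BytesIO(SAMPLE))))
```

The whole-file version leaves a complete tree in memory that is easy to modify and write back. The streaming version never holds the full content at once, which is why updating and rewriting is the harder case.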

23 A hands on introduction to PSI-MI TAB 2.5 JAVA API

24 PSI-MI TAB Java API
- Uses Java 5
- Provides a binding between TAB and a Java object model
- Tools to read/write TAB from/to file
- You can read in 2 fashions:
  1. Load a whole file into a Collection
     - Large files can only be loaded if you have enough memory
  2. Load interactions one at a time using an Iterator
     - Memory efficient with large files
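The Collection-versus-Iterator choice maps directly onto a list versus a generator in Python. The sketch below is not the Java API: `read_all` and `iter_rows` are hypothetical names, and skipping lines that start with `#` is an assumption about how header lines are marked.

```python
import io


def _is_data(line):
    """True for non-empty lines that are not '#' header lines (assumed convention)."""
    return bool(line.strip()) and not line.startswith("#")


def read_all(handle):
    """Collection style: every row is held in memory at once."""
    return [line.rstrip("\n").split("\t") for line in handle if _is_data(line)]


def iter_rows(handle):
    """Iterator style: rows are produced one at a time as the caller asks for them."""
    for line in handle:
        if _is_data(line):
            yield line.rstrip("\n").split("\t")


DATA = "#ID A\tID B\nA\tB\nC\tD\n"

print(read_all(io.StringIO(DATA)))
for row in iter_rows(io.StringIO(DATA)):
    print(row)
```

With the iterator form, memory use depends on the size of one row and not on the size of the file.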

25 Summary
- PSI-MI XML is the de facto standard for molecular interactions
- The Java API makes it easy to handle
- We have code samples & exercises for both APIs! Let me know if you want access to them …
- PSI-MI Home page
- API Download
- Data: ftp://ftp.ebi.ac.uk/pub/databases/intact/current/psi25

26 Quick introduction to R packages for PSI-MI

27 Rintact & RpsiXML
- Initiative from Wolfgang Huber’s group at the EBI
- Allows reading PSI-MI XML data into R data structures
- Enables data analysis using existing packages such as RBGL, ppiStats, apComplex, …
- Currently supports: IntAct, MINT, HPRD, DIP, BioGRID, MIPS/CORUM, MatrixDB, MPACT
- API Download
- Documentation

28 Quick introduction to PSI SEMANTIC VALIDATOR

29 The Framework (in the context of PSI)
The PSI validator framework automatically checks that experimental data reported using a specific XML format and various CVs are compliant with the overall MIAPE recommendations.
The semantic validator checks:
- the XML syntax
- that the appropriate CV terms are used in specific locations of a document
- misc. consistency checks

30 Components of the Validator
Diagram labels: Ontology Manager, Ontology Mapping Rule, Object Rule, Semantic Validator, Messages, Data Model, Config, OBO, OLS, Data File

31 The Ontology Manager
- Declaration of ontologies or Controlled Vocabularies: location, format, retrieval method (local file or via web services)

32 Ontology Lookup Service
- Currently 61 ontologies available
- Web Service for easy access

33 CV Mapping Rules
An explicit specification of which CV terms may/should/must be used in a given location.
- crucial to bind a data model to a set of CVs
- necessary to enforce MIAPE guidelines
- allows CVs to be developed independently from a schema (necessary to comply with CV guidelines)
- this mapping is specified in an XML file
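To make the may/should/must idea concrete, here is a small Python sketch. It does not reproduce the validator's XML mapping format: `CvRule`, `check`, the location string and the way requirement levels translate into messages are all assumptions chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CvRule:
    location: str        # hypothetical name for a slot in the data model
    level: str           # "MAY", "SHOULD" or "MUST"
    allowed: frozenset   # CV accessions accepted at that location


def check(rule, found_terms):
    """Return (severity, message) pairs for the terms found at rule.location."""
    if any(term in rule.allowed for term in found_terms):
        return []  # at least one permitted term is present
    wanted = sorted(rule.allowed)
    if rule.level == "MUST":
        return [("ERROR", f"{rule.location}: a term from {wanted} is required")]
    if rule.level == "SHOULD":
        return [("WARNING", f"{rule.location}: a term from {wanted} is recommended")]
    return []  # MAY: an absent term is not reported


rule = CvRule("interaction/detectionMethod", "MUST", frozenset({"MI:0018"}))
print(check(rule, ["MI:0018"]))
print(check(rule, []))
```

Because the rules live outside the schema, the same data model can be checked against a stricter or looser set of rules without touching the schema.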

34 CV Mapping Rules – example with MzML
Diagram labels: Exchange Format, Referenced ontologies and CVs, Resulting mapping file

35 CV Mapping Rules – final thoughts
- A data model is not bound to a single mapping
- The PSI MI and MS workgroups provide a mapping corresponding to their respective minimum reporting guidelines (MIAPE)
- A mapping can be customized by any end user of a standard to be more or less granular

36 The Object Rules
List of consistency checks tailored to a specific data type. Examples:
- taxid is an existing entry at NCBI
- PubMed ID is an existing publication
- protein and DNA sequences are defined using the appropriate alphabet
- CV dependency rules
Note: These rules are to be programmed in Java
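One of the examples above, the sequence alphabet check, is simple enough to sketch. The slide says the validator's own rules are written in Java. The version below is in Python only as an illustration, with a hypothetical function name and deliberately small alphabets (four bases plus N, and the 20 standard amino acids).

```python
# Deliberately minimal alphabets (assumption): no ambiguity codes beyond N.
DNA_ALPHABET = set("ACGTN")
PROTEIN_ALPHABET = set("ACDEFGHIKLMNPQRSTVWY")


def check_sequence(sequence, molecule_type):
    """Return one message per character that is not in the expected alphabet."""
    alphabet = {"dna": DNA_ALPHABET, "protein": PROTEIN_ALPHABET}[molecule_type]
    unexpected = sorted(set(sequence.upper()) - alphabet)
    return [
        f"unexpected character {char!r} in {molecule_type} sequence"
        for char in unexpected
    ]


print(check_sequence("ACGT", "dna"))
print(check_sequence("MKVLAX", "protein"))
```

An empty list means the rule passed. Returning messages instead of raising an error lets a validator collect the results of many rules in a single report.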

37 Fancy Building Your Own?
We are currently finalizing a tutorial to guide users in writing a validator based on their own data model. It provides:
- Additional explanation of the Validator’s modules
- Examples of configuration files
- A working prototype based on a made-up data model
- Source code to get you started quickly

IntAct team
Rolf Apweiler, Henning Hermjakob, Sandra Orchard, Jyoti Khadake, Luisa Montecchi, Dave Thorneycroft, Cathy Derow, Prem Achuthan, Bruno Aranda, Samuel Kerrien
IntAct is funded by the European Commission under FELICS, contract number (RII3)

PSI participants (direct contributors to the validator)
Luisa Montecchi-Palazzi, Florian Reisinger, Lennart Martens, Andy Jones, Mathias Oesterheld, Bruno Aranda, Prem Achuthan, Henning Hermjakob
Other PSI participants
Juan A Vizcaino, Chris Taylor, Eric Deutsch, Pierre Alain Binz, Susanna Sansone, Frank Gibson, Zsuzsanna Bencsath, Daniel Schober, Trish Wetzel, Pete Souda

40 Questions?