Session outline 1.Standards and the problem of data integration Example: PSICQUIC and the PSICQUIC game 2.Introduction to ontologies. Exploring the Gene.

Presentation transcript:

Session outline
1. Standards and the problem of data integration
   Example: PSICQUIC and the PSICQUIC game
2. Introduction to ontologies
   Exploring the Gene Ontology
   (Coffee break)
3. Introduction to protein-protein interactions (PPIs)
   IntAct: the molecular interactions database at the EBI
   (Lunch break)
4. Introduction to pathways
   Reactome: a database of human biological pathways
   (Coffee break)
5. Network representation and analysis: strategies and limitations
   Network generation and analysis through Cytoscape and PSICQUIC

Introduction to protein-protein interactions (PPIs)

EMBL-EBI
A couple of definitions…
Protein-protein interactions (PPIs): physical and selective contacts that happen between pairs of proteins, in certain molecular regions and in a defined biological context.
Interactome: the totality of PPIs that happen in a cell, in an organism or in a specific biological context.
Protein-protein interaction network: graphical representation of a group of PPIs in which proteins are represented as nodes and interactions as edges.
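
The network definition above (proteins as nodes, interactions as edges) can be sketched in a few lines of Python. This is only an illustration: the protein names and the helper `build_network` are made up for the example and do not come from the slides.

```python
from collections import defaultdict


def build_network(interactions):
    """Return an adjacency map: protein (node) -> set of partners (edges)."""
    network = defaultdict(set)
    for a, b in interactions:
        network[a].add(b)
        network[b].add(a)  # a physical interaction has no direction
    return dict(network)


# Hypothetical binary interactions, used only for illustration.
example_ppis = [("P1", "P2"), ("P1", "P3"), ("P2", "P3"), ("P3", "P4")]

network = build_network(example_ppis)

# Degree = number of distinct interaction partners of each protein.
degrees = {protein: len(partners) for protein, partners in network.items()}
print(degrees)
```

Storing each edge in both directions keeps partner look-ups simple, at the cost of holding every interaction twice.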

Why protein-protein interactions?
1. To predict a protein's biological function: "guilt by association". Proteins with similar functions should cluster together.
2. To improve the characterization of protein complexes and pathways: interaction networks work as a draft map that brings detail to biological processes and pathways.
Gene level (DNA, RNA) versus protein level:
1 protein = 1 function: WRONG!
1 protein = n functions = n networks!

Guilt by association
Annotations of the interaction partners shown on the slide:
- Histone methyltransferase: role in early development and hematopoiesis
- Proto-oncogene: transcription factor
- Cyclin-dependent kinase: role in transcription regulation
- Mediator of RNA polymerase transcription activity: role in transcription regulation
- Transcriptional modulator: transcription regulation dependent on kinases
- Receptor-activated transcription modulator: transcription modulation, signal transduction, activated by kinases; role in inhibition of wound healing
- Putative oncoprotein: transcription activation
- Putative oncoprotein: involved in transcription regulation
- Cyclin: regulation of cyclin kinase, transcription regulation and cell cycle control
Inferred function of the query protein: something to do with transcription and cell cycle control.
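
The reasoning on this slide (read the function of an uncharacterized protein from the annotations of its partners) can be sketched as a simple vote over neighbour annotations. This is a minimal illustration, not the method used by any particular tool; the protein names, the annotation terms and the function `predict_terms` are all assumptions made for the example.

```python
from collections import Counter


def predict_terms(protein, network, annotations, top=2):
    """Rank annotation terms by how many partners of `protein` carry them."""
    counts = Counter()
    for neighbour in network.get(protein, ()):
        counts.update(annotations.get(neighbour, ()))
    return [term for term, _ in counts.most_common(top)]


# Hypothetical network: X is uncharacterized, A, B and C are its partners.
network = {"X": {"A", "B", "C"}}

# Hypothetical annotations of the partners.
annotations = {
    "A": {"transcription regulation", "cell cycle control"},
    "B": {"transcription regulation"},
    "C": {"transcription regulation", "cell cycle control", "signal transduction"},
}

predicted = predict_terms("X", network, annotations)
print(predicted)
```

A plain vote ignores how reliable each interaction is; weighting partners by a confidence score would be a natural refinement.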

Characterization of protein complexes and pathways
Glaab et al., BMC Bioinformatics, PMID:
Hook, B. and Schagat, T. [Internet] Available from: bhub/functional-proteomics-techniques-to-isolate-and-characterize-the-human-proteasome/

Protein-protein interaction detection methods
- Yeast two-hybrid (Y2H): high-throughput
- X-ray diffraction studies: low-throughput
- Tandem affinity purification + mass spectrometry (TAP-MS)
No single method can accurately reproduce a true binary interaction observed under physiological conditions: every interaction detected experimentally is fundamentally artefactual.

Protein-protein interaction detection methods
Braun et al., Nat Methods, 2008, PMID:

Types of protein-protein interactions
- Binary interactions: two participants
- Co-localization: two or more participants closely located
- N-ary interactions (associations): complex purification
- Functional / direct interactions: e.g. enzymatic interactions

Binary interactions

Colocalization

N-ary interactions (association)
Schleiff et al., Nat Rev Mol Cell Biol, PMID:

Direct / functional interactions
- Usually in vitro assays
- Participant IDs are known in advance (predetermined)
- Examples: enzymatic assays, SPR, crystallography, methods using purified protein…
Images from Wikipedia:

Representing PPIs: interaction domains
Overlap in sequence ranges:

Representing PPIs: the problem with complexes
Some experimental methods generate complex data, e.g. tandem affinity purification (TAP).
There are two algorithms to transform this information into binary data:
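
The figure naming the two algorithms did not survive in the transcript. The two expansion models commonly used for this purpose are the spoke model (the bait is paired with each prey) and the matrix model (every co-purified protein is paired with every other), and the sketch below assumes these are the ones meant. The function names and the example proteins are hypothetical.

```python
from itertools import combinations


def spoke_expansion(bait, preys):
    """Spoke model: pair the bait with each co-purified prey."""
    return [(bait, prey) for prey in preys]


def matrix_expansion(bait, preys):
    """Matrix model: pair every purified protein with every other one."""
    return list(combinations([bait, *preys], 2))


# Hypothetical TAP result: one bait pulled down together with three preys.
bait, preys = "BAIT", ["PREY1", "PREY2", "PREY3"]

spoke_pairs = spoke_expansion(bait, preys)
matrix_pairs = matrix_expansion(bait, preys)

print(len(spoke_pairs), "pairs from spoke expansion")
print(len(matrix_pairs), "pairs from matrix expansion")
```

For n purified proteins the spoke model yields n - 1 pairs and the matrix model n(n - 1)/2, so the matrix model introduces many more inferred pairs. Neither set is an observed binary interaction, which is why recording the expansion method alongside the data matters.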

Interaction databases: types
De Las Rivas & Fontanillo, PLoS Computational Biology, PMID:

Databases: curation levels
Deep curation versus shallow curation

Databases: curation levels
Shallow curation
- BioGRID: active curation, limited number of model organisms
- HPRD: active curation, human focus, predicted interactions
- MPIDB: curation stopped, data relocated to IntAct, interactions in microorganisms
- InnateDB: active curation, interactions related to innate immunity
Deep curation
- IntAct: active curation, wide species coverage, all types of molecules
- MINT: active curation, wide species coverage, PPIs only
- DIP: active curation, wide species coverage, PPIs only
- MPACT: curation currently stopped, limited species coverage, PPIs only
- MatrixDB: active curation, extracellular matrix molecules only
- BIND: curation stopped in 2006/7, wide species coverage, all types of molecules; information getting outdated
- I2D: active curation, PPIs involved in cancer

Primary databases: coverage
Human PPI coverage in the main public primary databases (Dec 2009)
De Las Rivas & Fontanillo, PLoS Computational Biology, PMID:

A standard for PPI representation: the IMEx consortium
Orchard et al., Nature Methods, PMID:

IntAct: The molecular interactions database at the EBI

IntAct goals & achievements
1. Publicly available repository of molecular interactions (mainly PPIs): >430K binary interactions taken from >12,000 publications (October 2013).
2. Data is standards-compliant and available via our website, for download at our FTP site (ftp://ftp.ebi.ac.uk/pub/databases/intact) or via PSICQUIC.
3. Provide open-access versions of the software to allow installation of local IntAct nodes.

IntAct: data storage schema
[A] Publication level (entry)
  [B] Experiment level: Experiment 1, Experiment 2, …
    [C] Interaction level: Interaction 1, Interaction 2, Interaction 3, Interaction 4, …
      [D] Participant level: Participant 1, Participant 2, …
        [E] Feature level: features of each participant
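
The five nested levels of the schema can be sketched as plain Python dataclasses. This is an illustration of the hierarchy only, not IntAct's actual data model: every class and field name below is an assumption, the identifiers are placeholders, and the split of the four interactions between the two experiments is a guess at what the slide's diagram shows.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Feature:                      # [E] feature level
    description: str


@dataclass
class Participant:                  # [D] participant level
    identifier: str
    features: List[Feature] = field(default_factory=list)


@dataclass
class Interaction:                  # [C] interaction level
    participants: List[Participant] = field(default_factory=list)


@dataclass
class Experiment:                   # [B] experiment level
    detection_method: str
    interactions: List[Interaction] = field(default_factory=list)


@dataclass
class Publication:                  # [A] publication level (entry)
    publication_id: str
    experiments: List[Experiment] = field(default_factory=list)


def count_interactions(publication):
    """Total number of interactions across all experiments of an entry."""
    return sum(len(exp.interactions) for exp in publication.experiments)


def pair(a, b):
    """Build a two-participant interaction; the first carries one feature."""
    return Interaction([Participant(a, [Feature("binding region")]), Participant(b)])


# Placeholder entry: two experiments with two interactions each.
entry = Publication("placeholder-id", [
    Experiment("two hybrid", [pair("p1", "p2"), pair("p1", "p3")]),
    Experiment("pull down", [pair("p2", "p3"), pair("p3", "p4")]),
])

print(count_interactions(entry))
```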

UniProt Knowledge Base
Interactions can be mapped to the canonical sequence, to splice variants or to post-processed chains.

IntAct: PSI-MI ontology

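“Lifecycle of an Interaction”: the IntAct curation pipeline
[Diagram; labels recovered from the slide: publication (full text); curator: annotate (participants p1 and p2, interaction I, experiment exp), supported by CVs and the curation manual; super curator: check, then accept or reject; sanity checks (nightly) and report; release to the public web site, the FTP site and IMEx (MatrixDB, MINT, DIP).]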

IntAct: the role of the curator
[Diagram; labels: curation (published molecular interactions data); direct submissions (large datasets from high-throughput projects).]

Cross-references
- Families and domains: InterPro
- Small molecules: ChEBI
- Function: Gene Ontology
- Genome sequences: Ensembl
- Protein sequences: UniProtKB
- Others: structures, organism, tissue...
Data sources: curation of published molecular interactions data; direct submission of large datasets from high-throughput projects.

IntAct as a common curation platform
[Diagram; labels: common curation platform; specific data dissemination platforms; other DBs. Specific curation focus/expertise: general curation, large scale; general curation, domain interactions; UniProt entry related; extracellular matrix; model organisms; immune system; commercial curation; cellular mechanics; regulatory interactions; host-pathogen interactions; cardiovascular proteins.]

IntAct: home page

IntAct webpage-based search

IntAct webpage-based search
Details of interaction; choice of UniProtKB or Dasty view

IntAct: changing the layout

IntAct: download formats

PSI-MI TAB (MITAB) columns
MITAB 2.5 standard columns (15):
- ID(s) interactor A & B
- Alt. ID(s) interactor A & B
- Alias(es) interactor A & B
- Interaction detection method(s)
- Publication 1st author(s)
- Publication identifier(s)
- Taxid interactor A & B
- Interaction type(s)
- Source database(s)
- Interaction identifier(s)
- Confidence value(s)
MITAB 2.7 specific columns (+27):
- Expansion method(s)
- Biological role(s) of interactors
- Experimental role(s) of interactors
- Type(s) of interactors
- Properties (CrossReference) of interactors / interaction
- Annotation(s) of interactors / interaction
- HostOrganism(s)
- Parameters of interaction
- Creation and update dates
- Checksum(s) of interactors / interaction
- Negative
- Feature(s) of interactors
- Stoichiometry(s) of interactors
- Participant(s) identification method(s)
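
A minimal sketch of how one row with the 15 standard MITAB 2.5 columns could be read in Python. It relies on the format's conventions that columns are tab-separated, multiple values within a column are separated by `|`, and an empty column is written as `-`. The short column names and the example row (including every identifier in it) are made up for illustration, and the simple `split("|")` does not handle a `|` inside a quoted value.

```python
# Short names for the 15 standard columns, in file order (names are our own).
MITAB25_COLUMNS = [
    "id_a", "id_b", "alt_id_a", "alt_id_b", "alias_a", "alias_b",
    "detection_method", "first_author", "publication_id",
    "taxid_a", "taxid_b", "interaction_type", "source_db",
    "interaction_id", "confidence",
]


def parse_mitab25(line):
    """Parse one MITAB row into a dict of column name -> list of values."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) < len(MITAB25_COLUMNS):
        raise ValueError(f"expected at least 15 columns, got {len(fields)}")
    record = {}
    # zip() stops after 15 names, so extra MITAB 2.6/2.7 columns are ignored.
    for name, value in zip(MITAB25_COLUMNS, fields):
        record[name] = [] if value == "-" else value.split("|")
    return record


# Made-up row: the accessions and IDs are placeholders, not real entries.
example_row = "\t".join([
    "uniprotkb:P00001", "uniprotkb:P00002", "-", "-", "-", "-",
    'psi-mi:"MI:0018"(two hybrid)', "Doe et al. (2000)", "pubmed:00000000",
    "taxid:9606(human)", "taxid:9606(human)",
    'psi-mi:"MI:0915"(physical association)', 'psi-mi:"MI:0469"(IntAct)',
    "intact:EBI-0000000", "-",
])

record = parse_mitab25(example_row)
print(record["id_a"], record["id_b"], record["detection_method"])
```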

Interaction detail in IntAct

Detailed participant information: Dasty view

IntAct: filtering results

IntAct: visualizing results as a network

IntAct: using lists

IntAct: browse menu

IntAct: advanced search...

IntAct: MIQL syntax search

Searching and visualizing

Using lists

Checking details

Advanced search

Advanced search: using MIQL

Ontology search

Link to other PSICQUIC services

More about IntAct: “on-line” EBI courses

The PSICQUIC client: A unified gate to interactomics data

Unified query client: PSICQUIC
[Diagram; labels: PSICQUIC query (MIQL input); PSICQUIC Registry; PSICQUIC Service A, PSICQUIC Service B, PSICQUIC Service C; interactions (PSI-MI output).]
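
The query flow on this slide (a MIQL query goes in, interactions in a PSI-MI format come out) can be sketched by building the REST URL for a single PSICQUIC service. The base URL below is an assumption about where the IntAct service lives and should be checked against the PSICQUIC Registry; the function name is hypothetical. The sketch only constructs the URL and sends nothing over the network.

```python
from urllib.parse import quote

# Assumed REST base URL of the IntAct PSICQUIC service (verify in the registry).
INTACT_PSICQUIC = (
    "http://www.ebi.ac.uk/Tools/webservices/psicquic/intact/"
    "webservices/current/search"
)


def build_query_url(base_url, miql, output_format="tab25", first=0, max_results=50):
    """Build a PSICQUIC REST URL for a MIQL query, with paging parameters."""
    encoded = quote(miql, safe="")  # escape spaces, colons and quotes
    return (
        f"{base_url}/query/{encoded}"
        f"?format={output_format}&firstResult={first}&maxResults={max_results}"
    )


# Free-text query and a fielded MIQL query.
simple_url = build_query_url(INTACT_PSICQUIC, "brca2")
fielded_url = build_query_url(
    INTACT_PSICQUIC, 'species:human AND detmethod:"two hybrid"'
)

print(simple_url)
print(fielded_url)
```

Because every PSICQUIC service accepts the same query syntax, the same function could be pointed at the base URL of any other service listed in the registry.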

Unified query client interface: PSICQUIC view

Acknowledgements
Group leader: Henning Hermjakob
Developing team: Rafael Jiménez, Marine Dumousseau, Noemí del Toro
Curation team: Sandra Orchard (coordinator), Margaret Duesbury, Birgit Meldal