Protein Interactions Michel Dumontier, Ph.D. Carleton University

Slides:



Advertisements
Similar presentations
Annotation of Gene Function …and how thats useful to you.
Advertisements

BioPAX The Birth of A Data Exchange Language for Biological Pathways Joanne Luciano BioPAX Core Group 7 th International Annual Bio-Ontologies.
Molecular Biomedical Informatics Machine Learning and Bioinformatics Machine Learning & Bioinformatics 1.
Recombinant DNA Technology
Sandra Orchard EMBL-EBI Molecular Interactions
Microarray Data Analysis Day 2
CellDesigner Tutorial Laurence Calzone, Andrei Zinovyev UMR U900 INSERM/Institut Curie/Ecole des Mines de Paris Wednesday, April 30th.
Darwinian Genomics Csaba Pal Biological Research Center Szeged, Hungary.
Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
Biological networks: Types and sources Protein-protein interactions, Protein complexes, and network properties.
5 EBI is an Outstation of the European Molecular Biology Laboratory. Master title Molecular Interactions – the IntAct Database Sandra Orchard EMBL-EBI.
Computational analysis of protein-protein interactions for bench biologists 2-8 September, Berlin Protein Interaction Databases Francesca Diella.
Gene Ontology John Pinney
COG and GO tutorial.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
Introduction to biological networks. protein-gene interactions protein-protein interactions PROTEOME GENOME Citrate Cycle METABOLISM Bio-chemical reactions.
Internet tools for genomic analysis: part 2
Biological networks: Types and origin
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
BIND: the Biomolecular Interaction Network Database Gary D. Bader, Doron Betel and Christopher W. V. Houge Seminar in Bioinformatics Elinor Heller.
Modeling Functional Genomics Datasets CVM Lessons 4&5 10 July 2007Bindu Nanduri.
Session outline 1.Standards and the problem of data integration Example: PSICQUIC and the PSICQUIC game 2.Introduction to ontologies. Exploring the Gene.
Protein-protein interactions Chapter 12. Stable complex Transient Interaction Transient Signaling Complex Rap1A – cRaf1 Interface 1310 Å 2 Stable complex:
Mapping Protein-Protein Interactions MEDG 505 (Genome Analysis) 13 January 2005 Morin: -Overview -IP-MS -Data integration Student presentations: -Y2H interactions.
Ch10. Intermolecular Interactions and Biological Pathways
Overview  Introduction  Biological network data  Text mining  Gene Ontology  Expression data basics  Expression, text mining, and GO  Modules and.
July 2015 CSHL Data analysis: GO tools and YeastMine, use-case examples.
Interactions and more interactions
How Ontologies Add Value BioPAX: Biological Pathway Data Exchange Ontology Joanne Luciano BioPAX Workgroup (biopax.org)biopax.org BioPathways Consortium.
1 Bio-Trac 40 (Protein Bioinformatics) October 8, 2009 Zhang-Zhi Hu, M.D. Associate Professor Department of Oncology Department of Biochemistry and Molecular.
Reconstruction of Transcriptional Regulatory Networks
Semantic Web for Life Sciences Workshop Session VII: Semantic Aggregation, Integration, and Inference Moderator: Joanne Luciano October, Cambridge,
Copyright OpenHelix. No use or reproduction without express written consent1.
CZ5211 Topics in Computational Biology Lecture 6: Biological Pathways I: Molecular Interactions Prof. Chen Yu Zong Tel:
Advanced Bioinformatics
Proteome and interactome Bioinformatics.
Organizing information in the post-genomic era The rise of bioinformatics.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Introduction to the GO: a user’s guide Iowa State Workshop 11 June 2009.
Systems Biology ___ Toward System-level Understanding of Biological Systems Hou-Haifeng.
Introduction to IntAct Pablo Porras Millán, IntAct
1 Gene function annotation. 2 Outline  Functional annotation  Controlled vocabularies  Functional annotation at TAIR  Resources and tools at TAIR.
Other biological databases and ontologies. Biological systems Taxonomic data Literature Protein folding and 3D structure Small molecules Pathways and.
TAP(Tandem Affinity Purification) Billy Baader Genetics 677.
Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function.
The Mammalian Protein – Protein Interaction Database and Its Viewing System That Is Linked to the Main FANTOM2 Viewer Genome Research (2003) Speaker: 蔡欣吟.
A curated database of biological pathways.
Introduction to the GO: a user’s guide NCSU GO Workshop 29 October 2009.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
BioPAX The Birth of A Data Exchange Language for Biological Pathways Joanne Luciano BioPAX Group 7 th International Annual Bio-Ontologies.
1 Protein-Protein Interactions High-throughput strategy –Prediction from sequence In silico analysis –Protein A from species A: domain 1 and 2 –Protein.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
High throughput biology data management and data intensive computing drivers George Michaels.
Lab Interactions and Ontologies LAB CBW Bioinformatics Workshop February 23 th 2006, Toronto Christopher Hogue Blueprint Initiative.
` Comparison of Gene Ontology Term Annotations Between E.coli K12 Databases REDDYSAILAJA MARPURI WESTERN KENTUCKY UNIVERSITY.
Overview of two hybrid system for identifying and characterizing protein–protein interactions. Shown are the basic components and operation of the two.
Networks and Interactions
Functional organization of the yeast proteome by systematic analysis of protein complexes Presented by Nathalie Kirshman and Xinyi Ma.
Interactions and Ontologies
Networks and Pathways I
Overview of two hybrid system for identifying and characterizing protein–protein interactions. Shown are the basic components and operation of the two.
Interrogation of cross talk between proteins and gene regulatory networks in breast cancer Chambers, Teressa Lee Hiren Karathia Sridhar Hannenhalli.
Yeast Two-Hybrid Screening: The Principles
Department of Genetics • Stanford University School of Medicine
Functional Annotation of the Horse Genome
What is an Ontology An ontology is a set of terms, relationships and definitions that capture the knowledge of a certain domain. (common ontology ≠ common.
Protein Complex Discovery
Chemical Dimerizers and Three-Hybrid Systems
Protein Complex Discovery
Presentation transcript:

Protein Interactions Michel Dumontier, Ph.D. Carleton University June 17, 2005 Protein Interactions Michel Dumontier, Ph.D. Carleton University michel@bioinfocg.com Lecture 4.1 (c) 2005 CGDN

Outline Protein interactions Discovery Storage Experimental Lecture 4.1

Molecular Interactions B A Between two molecular objects DNA, RNA, gene, protein, molecular complex, small molecule, photon Binding Sites Under some Experimental Condition With a particular Cellular Location Possibly having a Chemical Action Lecture 4.1

Interaction Discovery Michel Dumontier Interaction Discovery June 17, 2005 Databases Fully electronic Easily computer readable Literature Increasingly electronic Human readable Biologist’s brains Richest data source Limited bandwidth access Experiments Basis for models Lecture 4.1 (c) 2005 CGDN

Yeast Two Hybrid Assay The two-hybrid system is a molecular genetic tool which facilitates the study of protein-protein interactions. If two proteins interact, then a reporter gene is transcriptionally activated. e.g. gal1-lacZ - the beta-galactosidase gene A colour reaction can be seen on specific media. You can use this to Study the interaction between two proteins which you expect to interact Find proteins (prey) which interact with a protein you have already (bait). Lecture 4.1

Two-hybrid assay 1. 3. 2. 4. B A SNF4 SNF1 GAL4-DBD Michel Dumontier Two-hybrid assay June 17, 2005 SNF4 1. B SNF1 A 3. 2. GAL4-DBD Transcription activation domain UASG 4. Fields S. Song O. Nature. 1989 Jul 20;340(6230):245-6. PMID: 2547163 GAL1 Allows growth on galactose Lecture 4.1 (c) 2005 CGDN

Some Two-hybrid caveats Michel Dumontier Some Two-hybrid caveats June 17, 2005 1. A 3. 2. 4. Does the DNA Binding Domain fusion have activity by itself? Lecture 4.1 (c) 2005 CGDN

Some Two-hybrid caveats Michel Dumontier Some Two-hybrid caveats June 17, 2005 1. C B A 3. 2. 4. Is the ‘interaction’ mediated by some other protein? Lecture 4.1 (c) 2005 CGDN

Some Two-hybrid questions Michel Dumontier Some Two-hybrid questions June 17, 2005 1. B A 3. 2. Are the proteins expresssed? Are they over-expressed? Are they in-frame? Are the interacting domains defined? Was the observation reproducible? Was the strength of interaction significant? Was another method used to back-up the conclusion? Are the two proteins from the same compartment? 4. Lecture 4.1 (c) 2005 CGDN

Affinity purification Michel Dumontier Affinity purification June 17, 2005 A this molecule will bind the ‘tag’. tag modification (e.g. HA/GST/His) Protein of interest Lecture 4.1 (c) 2005 CGDN

Affinity purification Michel Dumontier Affinity purification June 17, 2005 the cell A Lecture 4.1 (c) 2005 CGDN

Affinity purification Michel Dumontier Affinity purification June 17, 2005 lots of other untagged proteins the cell A B naturally binding protein Lecture 4.1 (c) 2005 CGDN

Affinity purification Michel Dumontier Affinity purification June 17, 2005 Ruptured membranes A B cell extract Lecture 4.1 (c) 2005 CGDN

Affinity purification Michel Dumontier Affinity purification June 17, 2005 A B untagged proteins go through fastest (flow-through) Lecture 4.1 (c) 2005 CGDN

Affinity purification Michel Dumontier Affinity purification June 17, 2005 A B tagged complexes are slower and come out later (eluate) Lecture 4.1 (c) 2005 CGDN

Some affinity purification questions Michel Dumontier June 17, 2005 Some affinity purification questions Is the bait protein expressed and in frame? Is the bait protein observed? Is the bait protein over-expressed? Are the interacting domains defined? Was the observation reproducible? Was the interactor found in the background? Was the strength of interaction significant? Was the interaction saturable? Was the interactor stoichiometric with the bait protein? Was another method used to back-up the conclusion? Was tandem-affinity purification (TAP) used? Was the interaction shown using an extract or a purified protein? Is the inverse interaction observable? Are the two proteins from the same compartment? Are the two proteins known to be involved in the same process? Is the interactor likely to be physiologically significant? A B Lecture 4.1 (c) 2005 CGDN

Some affinity purification caveats Michel Dumontier June 17, 2005 Some affinity purification caveats First and most importantly, this is only a representation of the observation. You can only tell what proteins are in the eluate; you can’t tell how they are connected to one another. If there is only one other protein present (B), then its likely that A and B are directly interacting. But, what if I told you that two other proteins (B and C) were present along with A…. A B A C B Lecture 4.1 (c) 2005 CGDN

Complexes with unknown topology Michel Dumontier June 17, 2005 Complexes with unknown topology A A A B C B C B C Which of these models is correct? The complex described by this experimental result is said to have an Unknown Topology. Lecture 4.1 (c) 2005 CGDN

Complexes with unknown stoichiometry Michel Dumontier June 17, 2005 Complexes with unknown stoichiometry A A B C Here’s another possibility? The complex described by this experimental result is also said to have Unknown Stoichiometry. Lecture 4.1 (c) 2005 CGDN

High-throughput Mass Spectrometric Protein Complex Identification (HMS-PCI) Michel Dumontier June 17, 2005 Mike Tyers, SLRI Ste12 Ho et al. Nature. 2002 Jan 10;415(6868):180-3 Lecture 4.1 (c) 2005 CGDN

Michel Dumontier June 17, 2005 Lecture 4.1 (c) 2005 CGDN

Synthetic Genetic Interactions Michel Dumontier Synthetic Genetic Interactions June 17, 2005 Synthetic genetic interactions (lethal, slow growth) Mate two mutants without phenotypes to get a daughter cell with a phenotype Synthetic lethal (SL), slow growth robotic mating using the yeast deletion library Genetic interactions provide functional data on protein interactions or redundant genes About 23% of known SLs (1295 - YPD+MIPS) are known protein interactions in yeast Tong et al. Science. 2001 Dec 14;294(5550):2364-8 Lecture 4.1 (c) 2005 CGDN

Working overtime Charlie Boone’s Robots Michel Dumontier Working overtime Charlie Boone’s Robots June 17, 2005 Lecture 4.1 (c) 2005 CGDN

Synthetic Genetic Interactions in Yeast Cell Polarity Cell Wall Maintenance Cell Structure Mitosis Chromosome Structure DNA Synthesis DNA Repair Unknown Others Michel Dumontier Synthetic Genetic Interactions in Yeast June 17, 2005 Lecture 4.1 Tong, Boone (c) 2005 CGDN

SGA Synthetic Genetic Interaction Network 2004 Michel Dumontier June 17, 2005 ~1000 Genes ~4000 Interactions 132 SGA Screens Lecture 4.1 Tong, Boone, Science, Feb 2004 (c) 2005 CGDN

A measure of confidence? Michel Dumontier A measure of confidence? June 17, 2005 How do you know if the interaction really exists? Each method has its advantages and disadvantages. Be aware of systematic errors (i.e. tag effects) Be aware of contaminating proteins. Each method observes interactions from a slightly different experimental condition. Support from many different sources is certainly better than just one. Lecture 4.1 (c) 2005 CGDN

Outline Molecular interactions Discovery Storage Data Mining Databases File Formats Data Mining Lecture 4.1

Interaction/Pathway Databases Michel Dumontier Interaction/Pathway Databases June 17, 2005 Arguably the most accessible data source, but... Varied formats, representation, coverage Pathway data extremely difficult to combine and use Pathway Resource List (http://cbio.mskcc.org/prl/) Lecture 4.1 (c) 2005 CGDN

http://bind.ca A free, open-source database for archiving and exchanging molecular assembly information. BIND is managed by the Blueprint Initiative at Mount Sinai Hospital in Toronto. The database contains Interactions/Reactions Molecular complexes Pathways BIND has an extensive data model, GNU software tools and is based on the NCBI toolkit; extended recently to XML/Java The ~175000 BIND records are curated and validated. Bader GD, Betel D, Hogue CW. (2003) BIND: the Biomolecular Interaction Network Database. Nucleic Acids Res. 31(1):248-50 PMID: 12519993 Lecture 4.1

BIND Interaction Types Lecture 4.1

Interaction Experimental Evidence in BIND Remaining 1% Lecture 4.1

Lecture 4.1

Lecture 4.1

55 Identifier Searches Supported! Lecture 4.1

Lecture 4.1

GI Pair - CSV Export Lecture 4.1

BIND Record Header BIND record identifier Description & Division Publications that support or dispute interaction Export Options Network Visualization Lecture 4.1

BIND Record View Lecture 4.1

BIND Record View The Interacting Molecules (A and B) Main identifier: GI Organism Cross-references and aliases Gene Ontology terms Proteoglyphs Graphical representations of domain and protein structure. Ontoglyphs Graphical representations of molecule function, localization and binding Lecture 4.1

Gene Ontology Functional protein annotation Michel Dumontier Gene Ontology June 17, 2005 Functional protein annotation http://www.geneontology.org Controlled vocabulary for protein function and localization Molecular function e.g. DNA helicase Biological process e.g. mitosis Cellular Component e.g. nucleus Thousands of terms… Lecture 4.1 (c) 2005 CGDN

Lecture 4.1

Lecture 4.1

Ontoglyph Summary View Michel Dumontier Ontoglyph Summary View June 17, 2005 Lecture 4.1 (c) 2005 CGDN

Ontoglyph Filtering Lecture 4.1

Lecture 4.1

Lecture 4.1

Other Interaction Databases DIP http://dip.doe-mbi.ucla.edu MINT http://mint.bio.uniroma2.it/mint MIPS http://mips.gsf.de/proj/yeast/tables/interaction/ IntAct – EBI’s interaction database http://www.ebi.ac.uk/intact/ Human Protein Interaction Database http://www.hpid.org/ TRANSFAC – transcription factors http://www.gene-regulation.com/ Lecture 4.1

Information Exchange Software Database User With Data Michel Dumontier Information Exchange June 17, 2005 Database Software User With Data Exchange Format >100 DBs and tools Tower of Babel Lecture 4.1 (c) 2005 CGDN

Data Exchange File Formats BIND http://bind.ca Peer reviewed but closed process (Spec v3.1) ASN.1 or XML DTD/Schema PSI-MI http://psidev.sourceforge.net Peer reviewed, HUPO community standard Widely adopted BioPax http://www.biopax.org Community schema (Sloan Kettering, BioPathways Consortium) XML Schema, OWL, Protégé and GKB SBML Widely adopted for representing models of biochemical reaction networks Lecture 4.1

BIND ASN.1 (text) XML Flat File Lecture 4.1

PSI level 2 Lecture 4.1

PSI Record Format Lecture 4.1

Michel Dumontier BioPAX June 17, 2005 http://www.biopax.org Represent: Metabolic pathways Signaling pathways Protein-protein, molecular interactions Gene regulatory pathways Genetic interactions Accommodate representations used in existing databases such as BioCyc, BIND, WIT, aMAZE, KEGG, Reactome, etc. Community effort (open meetings) Lecture 4.1 (c) 2005 CGDN

Conclusion Many experimental techniques to generate interaction data Interaction databases like BIND are a great resource for building up interaction networks into pathways Common standards for file formats imperative for making use of all this data! Lecture 4.1