Interactions and more interactions

Slides:



Advertisements
Similar presentations
Molecular Biomedical Informatics Machine Learning and Bioinformatics Machine Learning & Bioinformatics 1.
Advertisements

Proteomics technologies and protein-protein interaction Lars Kiemer Center for Biological Sequence Analysis The Technical University of Denmark Advanced.
Biological networks Bing Zhang Department of Biomedical Informatics Vanderbilt University
PPI network construction and false positive detection Jin Chen CSE Fall 1.
Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
Biological networks: Types and sources Protein-protein interactions, Protein complexes, and network properties.
Biological networks: Types and sources Protein-protein interactions, Protein complexes, and network properties.
Research Methodology of Biotechnology: Protein-Protein Interactions Yao-Te Huang Aug 16, 2011.
UC Davis, May 18 th 2006 Introduction to Biological Networks Eivind Almaas Microbial Systems Division.
A Real-life Application of Barabasi’s Scale-Free Power-Law Presentation for ENGS 112 Doug Madory Wed, 1 JUN 05 Fri, 27 MAY 05.
A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae Article by Peter Uetz, et.al. Presented by Kerstin Obando.
Separation of Scales. Interpretation of Networks Most publications do not consider this.
Biological networks Bing Zhang Department of Biomedical Informatics Vanderbilt University
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
Protein-protein interactions
Sedgewick & Wayne (2004); Chazelle (2005) Sedgewick & Wayne (2004); Chazelle (2005)
Biological Gene and Protein Networks
Biological networks: Types and origin Protein-protein interactions, complexes, and network properties Thomas Skøt Jensen Center for Biological Sequence.
Protein domains vs. structure domains - an example.
Introduction to biological networks. protein-gene interactions protein-protein interactions PROTEOME GENOME Citrate Cycle METABOLISM Bio-chemical reactions.
BACKGROUND E. coli is a free living, gram negative bacterium which colonizes the lower gut of animals. Since it is a model organism, a lot of experimental.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
Protein-protein Interactions Hsueh-Fen Juan 2003, Mar 31 NTNU.
Biological networks Construction and Analysis. Recap Gene regulatory networks –Transcription Factors: special proteins that function as “keys” to the.
Biological networks: Types and origin
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Marcotte EM, Pellegrini M, Ng HL, Rice DW, Yeates TO, Eisenberg D. (1999). Detecting protein function and protein-protein interactions from genome sequences.
Protein-Protein Interaction Screens. Bacterial Two-Hybrid System selectable marker RNA polymerase DNA binding protein bait target sequence target.
Affinity chromatography/mass spec Bait protein GST Page 252.
Protein Classification A comparison of function inference techniques.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Protein Interaction Networks Aalt-Jan van Dijk Applied Bioinformatics, PRI, Wageningen UR & Mathematical and Statistical Methods, Biometris, Wageningen.
Protein-protein interactions Chapter 12. Stable complex Transient Interaction Transient Signaling Complex Rap1A – cRaf1 Interface 1310 Å 2 Stable complex:
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Network Analysis and Application Yao Fu
MN-B-WP II (BInf 2) Bioinformatische Datenbanken Kay Hofmann – Protein Evolution Group Woche 4: Interaktionsdatenbanken.
Protein analysis and proteomics (Part 2 of 2). Many of the images in this powerpoint presentation are from Bioinformatics and Functional Genomics by Jonathan.
Biological Pathways & Networks
(D) Crosslinking Interacting proteins can be identified by crosslinking. A labeled crosslinker is added to protein X in vitro and the cell lysate is added.
Protein-protein interactions Courtesy of Sarah Teichmann & Jose B. Pereira-Leal MRC Laboratory of Molecular Biology, Cambridge, UK EMBL-EBI.
Finish up array applications Move on to proteomics Protein microarrays.
Reconstruction of Transcriptional Regulatory Networks
Proteome and interactome Bioinformatics.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
Anis Karimpour-Fard ‡, Ryan T. Gill †,
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
Introduction to biological molecular networks
Russell Group, Protein Evolution _________ ____ Rob Russell Cell Networks University of Heidelberg Interactions and Modules: the how and why of molecular.
Proteomics, the next step What does each protein do? Where is each protein located? What does each protein interact with, if anything? What role does it.
Biol 729 – Proteome Bioinformatics Dr M. J. Fisher - Protein: Protein Interactions.
1 Protein-Protein Interactions High-throughput strategy –Prediction from sequence In silico analysis –Protein A from species A: domain 1 and 2 –Protein.
Integrated Genomic and Proteomic Analyses of a Systematically Perturbed Metabolic Network Science, Vol 292, Issue 5518, , 4 May 2001.
How many interactions are there? ~6,200 genes ~6,200 proteins x 2-10 interactions/protein ~12, ,000 interactions Yeast.
3-D Structural Analysis of Protein Interaction Networks Gives New Insight Into Protein Function, Network Topology and Evolution CSB Seminar Philip M. Kim,
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Robustness, clustering & evolutionary conservation Stefan Wuchty Center of Network Research Department of Physics University of Notre Dame title.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Network Analysis Goal: to turn a list of genes/proteins/metabolites into a network to capture insights about the biological system 1.Types of high-throughput.
PROTEIN INTERACTION NETWORK – INFERENCE TOOL DIVYA RAO CANDIDATE FOR MASTER OF SCIENCE IN BIOINFORMATICS ADVISOR: Dr. FILIPPO MENCZER CAPSTONE PROJECT.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey
Ito et al. Genome-Scale In Vivo Protein Interaction Testing
Bioinformatics 3 V6 – Biological Networks are Scale- free, aren't they? Fri, Nov 2, 2012.
Biological networks CS 5263 Bioinformatics.
Large Scale Data Integration
Tanya Vavouri, Jennifer I. Semple, Rosa Garcia-Verdugo, Ben Lehner 
Deep Learning in Bioinformatics
Presentation transcript:

Interactions and more interactions Rob Russell Cell Networks University of Heidelberg

Aloy & Russell Nature Rev Mol Cell Biol 2006

But instead of a cell dominated by randomly colliding individual protein molecules, we now know that nearly every major process in a cell is carried out by assemblies of 10 or more protein molecules Bruce Alberts, Cell 1998

X Y X Y Yeast two-hybrid system Fields & Song, Nature, 340, 245, 1989 UASG GAL1-lacZ a Native GAL4 UASG GAL1-lacZ X Y b Individual hybrids with GAL4 domains GAL4 DNA- binding domain GAL4 activating region UASG GAL1-lacZ X GAL4 DNA- binding domain Y c Interaction between hybrids reconstitutes GAL4 activity Applied to whole Yeast genome Uetz et al, Nature, 403, 623, 2000. Ito et al, PNAS, 98, 4569, 2001.

Interaction discovery I The two-hybrid system Binary interactions: Bait Prey FUS3 DIG2 DIG2 FUS3 LSM2 PAT1 CKS1 CLB1 NPL4 UFD1 NPL4 CDC48 NPL4 FUC1 NPL4 SUA7 . . x1000s Uetz et al, Nature, 2000. (Yeast) Ito et al, PNAS, 2001. (Yeast) Rain et al, Nature, 2002. (H.pylori) Giot et al, Science, 2003 (D. melanogaster) Li et al, Science, 2004 (C. elegans)

The system works, but how? UASG GAL1-lacZ X GAL4 DNA- binding domain Y c Interaction between hybrids reconstitutes GAL4 activity CDC28 CKS Cyclin A Gal-4 (N) Gal-4 (C) (hypothetical) S12 L22 Native GAL4

Two datasets in Yeast See: Ito et al, PNAS, 2001 (comparing to Uetz et al, Nature, 2000)

Interaction discovery II Affinity purification (e.g. TAP/MS) x1000s Complexes: Bait Co-purification partners FUS3 DIG2 DIG1 DIG3 DIG2 FUS3 DIG2 NPL4 UFD1 CDC48 FUC1… (Etc.) Gavin et al, Nature, 2002. (Yeast) Ho et al, Nature, 180, 2002. (Yeast)

Trying to define binary interactions from purification data Reality Purifications only report a collection of proteins and don’t provide any information about precisely who interacts with whom. There are thus two models for representing binary interactions from complexes, neither of which are real. Purification Spoke Matrix Hakes et al, Comp Funct Genomics, 2006

Different worlds Comparing interactions to known 3D structures shows that original yeast two-hybrid datasets contain more transient interactions, compared to affinity purification datasets that contain more stable complexes (e.g. of 25 Uetz et al interactions with structures, 23 are transient, 2 are dedicated or stable) Aloy & Russell, Trends Biochem Sci, 2003

Error rates in interaction discovery False negatives: interactions known to occur that are missed by a screen - To asses this one needs a reference set of positives (i.e. known interactions) among a set of proteins being screened. The fraction of these missed is the false-negative rate. Relatively simple - normally one has a set of previously determined interactions or “gold standard” False positives: interactions reported by a screen that are incorrect - To assess this one needs a set of interactions that are known not to occur that are seen in a screen. Very difficult to obtain – how can you know that two proteins definitely do not interact? - tricks include taking pairs of proteins presumed to never see each other (i.e. different cellular compartment, etc.) Von Mering et al, Nature, 2002

Error rates in interaction discovery: the old view Von Mering et al, Nature, 2002

Error rates in interaction discovery: the new view Yu et al, Nature, 2002

Sociological bias affects the perceived performance Interactions determined on a protein by protein basis are focused around what the investigator wants to study, and thus biased towards particular areas of biology that are hot. High-throughput techniques are used precisely to find new interactions. Thus using the previously determined networks as a “gold standard” is likely to be unfair. Braun et al, Nature Methods, 2008

Interaction data: predictions I Proteins in the same bacterial operon are typically functionally associated, and often physically interacting. Groups of proteins entirely absent in one or more organisms among a closely related set are often functionally/physically associated Proteins that are separate in some organisms and fused in others are likely interacting physically. Aloy & Russell, Nature Rev Mol Cell Biol 2006

Interaction data: predictions II Pairs of proteins homologous to pairs of proteins seen to interact in known 3D structures can interact in the same way. Pairs of proteins containing a pair of domains often seen in interacting proteins can be used to infer interactions in proteins where interactions have not been observed. The presence of a linear motif can indicate interactions with proteins known to bind this motif.. Aloy & Russell, Nature Rev Mol Cell Biol 2006

Interaction databases Resources are very different in appearance and content Efforts are underway to make a unified search/view, but not complete Thus one needs currently to look at several sites to check if an interaction is known Some are content (e.g. IntAct, MINT) others are processed and augmented (e.g. STRING) with additional predicted/inferred interactions

Interaction networks Sos-1 Grb-2 RGS-4 RGS-3 Ga/q Node Edge

Biological interaction networks Nodes: Proteins Genes Chemicals Effects(?) Edges: Physical interaction (e.g. yeast two-hybrid) Co-expression (e.g. microarrays) Same operon Regulation of gene expression (protein to gene) Catalysis (e.g. metabolic networks) Node Node Edge

Interaction networks Jeong et al, Nature, 2001. Biological networks tend to be scale free: most nodes (e.g. proteins) are connected to only a few others with a handful of “hubs” making many more interactions. They are also “small-world” in that any pair of nodes tends to be connected via a relatively small number of intermediate nodes.

“Hubs” in networks Hubs are more likely to be lethal when deleted Jeong et al, Nature, 2001 Hubs are more likely to be disordered. Haynes et al, PLoS Comp Biol, 2006

p53 – the promiscuous transcription factor

Russell & Gibson, FEBS Lett. 2008 Linear motifs in p53 15:DNA-PK,RSK2,ATM P 18:CK1s P NES P 20:CHK2 9:Unknown Tetramerization domain (323-356) USP7 P S 386 MDM2 CYCLIN DNA binding domain (95-289) 33:GSK-3s,CDK7,CDKs P P 55:MAPKs P P P 215:AuroraA P 37:DNA-PK/ATM P 371,376,378:CDK7 P 46:HIPK2 P P 315:AuroraA,CDKs IUPred disorder prediction 392:CDK2s,CDK7,EIF2AK2 Russell & Gibson, FEBS Lett. 2008