Copyright  2004 limsoon wong Assessing Reliability of Protein- Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research.

Slides:



Advertisements
Similar presentations
Molecular Systems Biology 3; Article number 140; doi: /msb
Advertisements

Network biology Wang Jie Shanghai Institutes of Biological Sciences.
MitoInteractome : Mitochondrial Protein Interactome Database Rohit Reja Korean Bioinformation Center, Daejeon, Korea.
Table 2 shows that the set TFsf-TGblbs of predicted regulatory links has better results than the other two sets, based on having a significantly higher.
PPI network construction and false positive detection Jin Chen CSE Fall 1.
CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
Darwinian Genomics Csaba Pal Biological Research Center Szeged, Hungary.
Predicting domain-domain interactions using a parsimony approach Katia Guimaraes, Ph.D. NCBI / NLM / NIH.
A hub-attachment based method to detect functional modules from confidence-scored protein interactions and expression profiles Authors: Chia-Hao Chin 1,4,
CISC667, F05, Lec26, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Genetic networks and gene expression data.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
Author: Jason Weston et., al PANS Presented by Tie Wang Protein Ranking: From Local to global structure in protein similarity network.
Integrated analysis of regulatory and metabolic networks reveals novel regulatory mechanisms in Saccharomyces cerevisiae Speaker: Zhu YANG 6 th step, 2006.
Copyright (c) 2004 by Limsoon Wong Assessing Reliability of Protein-Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research.
Simulation and Application on learning gene causal relationships Xin Zhang.
Predicting protein functions from redundancies in large-scale protein interaction networks Speaker: Chun-hui CAI
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
DEMO CSE fall. What is GeneMANIA GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Exploiting indirect neighbors and topological weight to predict protein function from protein– protein interactions Hon Nian Chua, Wing-Kin Sung and Limsoon.
Exciting Bioinformatics Adventures Limsoon Wong Institute for Infocomm Research.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Do not reproduce without permission 1 Gerstein.info/talks (c) (c) Mark Gerstein, 2002, Yale, bioinfo.mbb.yale.edu Permissions Statement This Presentation.
Network Analysis and Application Yao Fu
Interactions and more interactions
The importance of enzymes and their occurrences: from the perspective of a network W.C. Liu 1, W.H. Lin 1, S.T. Yang 1, F. Jordan 2 and A.J. Davis 3, M.J.
Improving PPI Networks with Correlated Gene Expression Data Jesse Walsh.
CSCE555 Bioinformatics Lecture 16 Identifying Differentially Expressed Genes from microarray data Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Reverse engineering gene regulatory networks Dirk Husmeier Adriano Werhli Marco Grzegorczyk.
Reconstructing gene networks Analysing the properties of gene networks Gene Networks Using gene expression data to reconstruct gene networks.
HUMAN-MOUSE CONSERVED COEXPRESSION NETWORKS PREDICT CANDIDATE DISEASE GENES Ala U., Piro R., Grassi E., Damasco C., Silengo L., Brunner H., Provero P.
Inferring strengths of protein-protein interactions from experimental data using linear programming Morihiro Hayashida, Nobuhisa Ueda, Tatsuya Akutsu Bioinformatics.
Knowledge Discovery from Biological and Clinical Data: BASIC BACKGROUND.
Proteome and interactome Bioinformatics.
Unraveling condition specific gene transcriptional regulatory networks in Saccharomyces cerevisiae Speaker: Chunhui Cai.
Discovering the Correlation Between Evolutionary Genomics and Protein-Protein Interaction Rezaul Kabir and Brett Thompson
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
Xiaowei Ying, Xintao Wu Univ. of North Carolina at Charlotte PAKDD-09 April 28, Bangkok, Thailand On Link Privacy in Randomizing Social Networks.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
Impact of microRNAs on Organization of Protein Interactions and Formation of Protein Complexes Limsoon Wong 7 April 2011 (Thanks: Wilson Goh, Guimei Liu,
Microarrays.
Journal Club Meeting Sept 13, 2010 Tejaswini Narayanan.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
DNAmRNAProtein Small molecules Environment Regulatory RNA How a cell is wired The dynamics of such interactions emerge as cellular processes and functions.
How many interactions are there? ~6,200 genes ~6,200 proteins x 2-10 interactions/protein ~12, ,000 interactions Yeast.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Motif Search and RNA Structure Prediction Lesson 9.
Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.
Create and assess protein networks through molecular characteristics of individual proteins Yanay Ofran et al. ISMB ’06 Presenter: Danhua Guo 12/07/2006.
Robustness, clustering & evolutionary conservation Stefan Wuchty Center of Network Research Department of Physics University of Notre Dame title.
RNA Structure Prediction
Bioinformatics 3 V 4 – Weak Indicators and Communities Fri, Oct 26, 2012.
PROTEIN INTERACTION NETWORK – INFERENCE TOOL DIVYA RAO CANDIDATE FOR MASTER OF SCIENCE IN BIOINFORMATICS ADVISOR: Dr. FILIPPO MENCZER CAPSTONE PROJECT.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
Detecting Protein Function and Protein-Protein Interactions from Genome Sequences TuyetLinh Nguyen.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Show & Tell Limsoon Wong Kent Ridge Digital Labs Singapore Role of Bioinformatics in the Genomic Era.
7. Performance Measurement
Bioinformatics 3 V6 – Biological Networks are Scale- free, aren't they? Fri, Nov 2, 2012.
Sofus A. Macskassy Fetch Technologies
Genomic Data Integration
Biological networks CS 5263 Bioinformatics.
Predict Failures with Developer Networks and Social Network Analysis
Modelling Structure and Function in Complex Networks
Walking the Interactome for Prioritization of Candidate Disease Genes
SEG5010 Presentation Zhou Lanjun.
COS 461: Computer Networks
Making Sense of Transcription Networks
Label propagation algorithm
Presentation transcript:

Copyright  2004 limsoon wong Assessing Reliability of Protein- Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research

Copyright  2004 limsoon wong How Reliable are Experimental Protein- Protein Interaction Data?

Copyright  2004 limsoon wong Some Protein Interaction Data Sets Sprinzak et al., JMB, 327: , 2003 GY2H: genome-scale Y2H 2M, 3M, 4M: intersection of 2, 3, 4 methods Large disagreement betw methods

Copyright  2004 limsoon wong Quantitative Estimate of Interacting Pairs Sprinzak et al, JMB, 327: , 2003 Ditto wrt co-cellular-role Expected proportion of co-localized pairs among non true interacting pairs Expected proportion of co-localized pairs among true interacting pairs

Copyright  2004 limsoon wong Reliability of Protein Interaction Data Sprinzak et al, JMB, 327: , 2003 % of TP based on co-localization % of TP based on shared cellular role (I = 1) % of TP based on shared cellular role (I =.95) TP = ~50%

Copyright  2004 limsoon wong Objective Some high-throughput protein interaction expts have as much as 50% false positives  Can we find a way to rank candidate interaction pairs according to their reliability?  How do we do this? –Would knowing their neighbours help? –Would knowing their local topology help? –Would knowing their global topology help?

Copyright  2004 limsoon wong Would knowing their neighbours help? The story of interaction generality

Copyright  2004 limsoon wong An Observation a b It seems that configuration a is less likely than b in protein interaction networks Can we exploit this?

Copyright  2004 limsoon wong Interaction Generality Saito et al., NAR, 30: , 2002 ig(YDR412W  GLC7) = 1 + # of yellow nodes The number of proteins that “interact” with just X or Y, and nobody else

Copyright  2004 limsoon wong Assessing Reliability Using Interaction Generality a b Recall configuration a is less likely than b in protein interaction networks  The smaller the “ig” value of a candidate interaction pair is, the more likely that interaction is

Copyright  2004 limsoon wong Evaluation wrt Intersection of Ito et al. & Uetz et al. Interacting pairs common to Ito et al. & Uetz et al. are more reliable These also have smaller “ig”  “ig” seems to work There are 229 pairs in Ito having ig = 1. Of these, 66 (or 34%) are also reported by Uetz

Copyright  2004 limsoon wong Evaluation wrt Co-localization Interaction pairs having common cellular localization are more likely These also have lower “ig”  “ig” seems to work ~60% of pairs in in Ito having ig=1 are known to have common localization

Copyright  2004 limsoon wong Evaluation wrt Co-cellular-role Interaction pairs having common cellular role are more likely These also have lower “ig”  “ig” seems to work A: before restrict to pairs with “ig = 1” B: after restrict to pairs with “ig = 1” reduced x-talk

Copyright  2004 limsoon wong Would knowing their local topology help? The story of interaction generality 2

Copyright  2004 limsoon wong Existence of Network Motifs Milo et al., Science, 298: , 2002 A network motif is just a local topological configuration of the network “Detected” in gene regulation networks, WWW links, etc.

Copyright  2004 limsoon wong 5 Possible Network Motifs Classify a protein C that directly interacts with the pair A  B according to these 5 topological configurations 11 22 33 44 55

Copyright  2004 limsoon wong A New Interaction Generality Saito et al., Bioinformatics, 19: , 2002

Copyright  2004 limsoon wong Evaluation wrt Reproducible Interactions “ig 2 ” correlates to “reproducible” interactions  “ig 2 ” seems to work ~90% of pairs in intersection of Ito & Uetz have ig 2 < 0. ~40% of pairs not in intersection of Ito & Uetz have ig 2 > 0

Copyright  2004 limsoon wong Evaluation wrt Common Cellular Role, etc. “ig 2 ” correlates well to common cellular roles, localization, & expression “ig 2 ” seems to work better than “ig” ~95% of pairs having ig2 = –6 have common cellular roles

Copyright  2004 limsoon wong Would knowing their global topology help? The story of interaction pathway reliability

Copyright  2004 limsoon wong Some “Reasonable” Speculations A true interacting pair is often connected by at least one alternative path (reason: a biological function is performed by a highly interconnected network of interactions) The shorter the alternative path, the more likely the interaction (reason: evolution of life is through “add-on” interactions of other or newer folds onto existing ones)

Copyright  2004 limsoon wong Interaction Pathway Reliability So existence of a strong short alternative path connecting an interacting pair indicates that the interaction is “reliable”

Copyright  2004 limsoon wong Evaluation wrt Reproducible Interactions “ipr” correlates well to “reproducible” interactions  “ipr” seems to work The number of pairs not in the intersection of Ito & Uetz is not changed much wrt the ipr value of the pairs The number of pairs in the intersection of Ito & Uetz increases wrt the ipr value of the pairs

Copyright  2004 limsoon wong Evaluation wrt Common Cellular Role, etc “ipr” correlates well to common cellular roles, localization, & expression “ipr” seems to work better than “ig 2 ” At the ipr threshold that eliminated 80% of pairs, ~85% of the of the remaining pairs have common cellular roles

Copyright  2004 limsoon wong Stability in Topology of Protein Networks Maslov & Sneppen, Science, 296: , 2002 According to Maslov & Sneppen –Links betw high-connected proteins are suppressed –Links betw high- & low-connected proteins are favoured As this decreases cross talks and increases robustness Part of the network of physical interactions reported by Ito et al., PNAS, 2001

Copyright  2004 limsoon wong Evaluation wrt “Many-few” Interactions Number of “Many-few” interactions increases when more “reliable” IPR threshold is used to filter interactions Consistent with the Maslov-Sneppen prediction

Copyright  2004 limsoon wong Evaluation wrt “Cross-Talkers” A MIPS functional cat: | 02 | ENERGY | | glycolysis and gluconeogenesis | | glycolysis methylglyoxal bypass | | regulation of glycolysis & gluconeogenesis The first 2 digits is top cat Subsequent digits add more granularity to the cat  Compare high- & low- IPR pairs that are not co- localised to determine number of pairs that fall into same cat. If more high- IPR pairs are in same cat, then IPR works For top cat –148/257 high-IPR pairs are in same cat –65/260 low-IPR pairs are in same cat For fine-granularity cat –135/257 high-IPR pairs are in same cat. 37/260 low-IPR pairs are in same cat  IPR works  IPR pairs that are not co- localized are real cross- talkers!

Copyright  2004 limsoon wong Conclusions There are latent local & global “motifs” that indicate the likelihood of protein interactions These “motifs” can be exploited in computation elimination of false positives from high- throughput Y2H expts Acknowledgements Jin Chen, Mong Li Lee, Wynne Hsu (NUS SOC) See-Kiong Ng (I 2 R) Prasanna Kolatkar, Jer-Ming Chia (GIS)

Copyright  2004 limsoon wong References J. Chen et al, “Systematic assessment of high-throughput protein interaction data using alternative path approach”, submitted R. Saito et al, “Interaction generality, a measurement to assess the reliability of a protein-protein interaction”, NAR 30: , 2002 R. Saito et al, “Construction of reliable protein-protein interaction networks with a new interaction generality measure”, Bioinformatics 19: , 2002 S. Maslov & K. Sneppen, “Specificity and stability in topology of protein networks”, Science 296: , 2002 E. Sprinzak et al, “How reliable are experimental protein- protein interaction data?”, JMB 327: , 2003