Protein interaction Computational (inferred) Experimental (observed)

Slides:



Advertisements
Similar presentations
Molecular Biomedical Informatics Machine Learning and Bioinformatics Machine Learning & Bioinformatics 1.
Advertisements

Basic Gene Expression Data Analysis--Clustering
Using phylogenetic profiles to predict protein function and localization As discussed by Catherine Grasso.
Integrating Cross-Platform Microarray Data by Second-order Analysis: Functional Annotation and Network Reconstruction Ming-Chih Kao, PhD University of.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
PPI network construction and false positive detection Jin Chen CSE Fall 1.
. Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman.
Protein Targeting by Functional Linkage of Non-Homologous Proteins with examples from M. tuberculosis Genome-wide functional linkage map Structural Genomics.
The STRING database Michael Kuhn EMBL Heidelberg.
Research Methodology of Biotechnology: Protein-Protein Interactions Yao-Te Huang Aug 16, 2011.
Gene expression analysis summary Where are we now?
Protein-protein interactions
Functional genomics and inferring regulatory pathways with gene expression data.
Protein domains vs. structure domains - an example.
Protein-protein interactions Ia. A combined algorithm for genome-wide prediction of protein function. Edward M. Marcotte, Matteo Pellegrini, Michael J.
Functional annotation and network reconstruction through cross-platform integration of microarray data X. J. Zhou et al
1 Protein-Protein Interaction Networks MSC Seminar in Computational Biology
Evidence for dynamically organized modularity in the yeast protein- protein interaction network Han, et al
Biological networks Construction and Analysis. Recap Gene regulatory networks –Transcription Factors: special proteins that function as “keys” to the.
Microarray analysis 2 Golan Yona. 2) Analysis of co-expression Search for similarly expressed genes experiment1 experiment2 experiment3 ……….. Gene i:
Marcotte EM, Pellegrini M, Ng HL, Rice DW, Yeates TO, Eisenberg D. (1999). Detecting protein function and protein-protein interactions from genome sequences.
Protein-Protein Interaction Screens. Bacterial Two-Hybrid System selectable marker RNA polymerase DNA binding protein bait target sequence target.
Epistasis Analysis Using Microarrays Chris Workman.
Comparative Expression Moran Yassour +=. Goal Build a multi-species gene-coexpression network Find functions of unknown genes Discover how the genes.
Protein Interactions and Disease Audry Kang 7/15/2013.
Interaction Networks in Biology: Interface between Physics and Biology, Shekhar C. Mande, August 24, 2009 Interaction Networks in Biology: Interface between.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
A highly abbreviated introduction to proteomics
Protein-protein interactions Chapter 12. Stable complex Transient Interaction Transient Signaling Complex Rap1A – cRaf1 Interface 1310 Å 2 Stable complex:
Protein protein interactions
Functional Linkages between Proteins. Introduction Piles of Information Flakes of Knowledge AGCATCCGACTAGCATCAGCTAGCAGCAGA CTCACGATGTGACTGCATGCGTCATTATCTA.
Protein analysis and proteomics (Part 2 of 2). Many of the images in this powerpoint presentation are from Bioinformatics and Functional Genomics by Jonathan.
Genetic network inference: from co-expression clustering to reverse engineering Patrik D’haeseleer,Shoudan Liang and Roland Somogyi.
Biological Pathways & Networks
蛋白质相互作用的生物信息学 高友鹤 中国医学科学院 基础医学研究所. 蛋白质相互作用的生物信息学 1. 实验数据 2. 蛋白质相互作用数据库 3. 高通量实验数据的验证 4. 蛋白质相互作用网络 5. 计算预测蛋白质相互作用.
Interactions and more interactions
Protein-protein interactions Courtesy of Sarah Teichmann & Jose B. Pereira-Leal MRC Laboratory of Molecular Biology, Cambridge, UK EMBL-EBI.
Functional Associations of Protein in Entire Genomes Sequences Bioinformatics Center of Shanghai Institutes for Biological Sciences Bingding.
Improving PPI Networks with Correlated Gene Expression Data Jesse Walsh.
Bioinformatics Brad Windle Ph# Web Site:
Proteome and interactome Bioinformatics.
Genome-wide Functional Linkage Maps Methods for inferring functional linkages: Complexes, Pathways Rosetta stone Phylogenetic profiles Gene neighbors Operon.
Lecture 9. Functional Genomics at the Protein Level: Proteomics.
Anis Karimpour-Fard ‡, Ryan T. Gill †,
Overview  Introduction  Biological network data  Text mining  Gene Ontology  Expression data basics  Expression, text mining, and GO  Modules and.
The Mammalian Protein – Protein Interaction Database and Its Viewing System That Is Linked to the Main FANTOM2 Viewer Genome Research (2003) Speaker: 蔡欣吟.
CSCE555 Bioinformatics Lecture 18 Network Biology: Comparison of Networks Across Species Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu.
An Overview of Clustering Methods Michael D. Kane, Ph.D.
PPI team Progress Report PPI team, IDB Lab. Sangwon Yoo, Hoyoung Jeong, Taewhi Lee Mar 2006.
Central dogma: the story of life RNA DNA Protein.
I. Prolinks: a database of protein functional linkage derived from coevolution II. STRING: known and predicted protein-protein associations, integrated.
Functional and Evolutionary Attributes through Analysis of Metabolism Sophia Tsoka European Bioinformatics Institute Cambridge UK.
Nothing in (computational) biology makes sense except in the light of evolution after Theodosius Dobzhansky (1970) Comparative genomics, genome context.
Microarray analysis Quantitation of Gene Expression Expression Data to Networks BIO520 BioinformaticsJim Lund Reading: Ch 16.
Proteomics, the next step What does each protein do? Where is each protein located? What does each protein interact with, if anything? What role does it.
Large-scale Prediction of Yeast Gene Function Introduction to Bio-Informatics Winter Roi Adadi Naama Kraus
Integrated Genomic and Proteomic Analyses of a Systematically Perturbed Metabolic Network Science, Vol 292, Issue 5518, , 4 May 2001.
How many interactions are there? ~6,200 genes ~6,200 proteins x 2-10 interactions/protein ~12, ,000 interactions Yeast.
Shortest Path Analysis and 2nd-Order Analysis Ming-Chih Kao U of M Medical School
Protein interactions: main methods for detection (all organisms) Two-hybrid8,446 (Co-)Immunoprecipitation567 Interaction adhesion assay225 In vitro binding138.
1 Genomics Advances in 1990 ’ s Gene –Expressed sequence tag (EST) –Sequence database Information –Public accessible –Browser-based, user-friendly bioinformatics.
1 Computational functional genomics Lital Haham Sivan Pearl.
Yeast Cell-Cycle Regulation Network inference Wang Lin.
Protein-Protein Interactions. A Protein may interact with: –Other proteins –Nucleic Acids –Small molecules Protein Interactions.
FLiPS Functional Linkage Prediction Service.
CISC 841 Bioinformatics (Spring 2006) Inference of Biological Networks
Volume 20, Issue 5, Pages (November 2014)
Principle of Epistasis Analysis
Bioinformatics, Vol.17 Suppl.1 (ISMB 2001) Weekly Lab. Seminar
Volume 20, Issue 5, Pages (November 2014)
Presentation transcript:

Protein interaction Computational (inferred) Experimental (observed)

Inference of Protein function

Guilt by association Five methods for inferring functional interactions: Complexes, Pathways Rosetta stone Phylogenetic profiles Expression profiles Gene neighbors Operon method

Five Known Rosetta Stone Links: Marcotte et al., Science, 285, pp (1999)

Pathways Detected by the Domain Fusion (Rosetta stone) Method AroHYDIBAroK PurF AroFAroE AroL AroGPur2 AroA AroB AroBPurTPur3 B AroDPurL AroEPur7 Pur5 Pur2 PurU AroKPurE Pur3 PurE C AroL Pur5 PurK AroAPurK PurTPur7 AroCGuaA GuaBPurB PurB A D GuaAPurHPurA Marcotte et al., Science, 285, pp (1999)

PHYLOGENETIC PROFILE METHOD

YC083W homology to thioredoxin MRPL2 MRPL6 MRPL7 MRPL10 known MRPL16 ribosomalpredicted to YGR021WMRPL23 proteins target member ofMRPS9 mitochondria highlyMRPS28 conservedMRF1 peptide chain release factor protein protein familyYJR113C homology to ribosomal protein S7 synthesis of unknownMSY1 ttrosyl-tRNA synthetase functionYGL068W probable ribosomal protein L12 MGE1 heat shock protein/chaperone YDR116C homology to bacterial ribosomal L1 protein YHR189W homology to peptidyl-tRNA hydrolase SIS1/XDJ homology to DraJ heat shock protein PDR13/SSE1/LHS1 homolgoy to Hsp70 RIB2 DRAP deaminase YDL036C homology to Rib2/pseudouridine synthase MIS1 C1-THF synthase ADE3 C1-THF synthase TPI1 tiose phosphate isomerase YGL236C homology to conserved gidA family, unknown function YOL060C homology to hypothetical C. elegans protein M02F4.4 Phylogenetic Profile: e.g. for a yeast protein

Microarray Co-expression Analysis Conclusion: P2 and P7 are functionally linked P3 and P6 are functionally linked Expression Profiles Rich med Starve High T P P P P P P P Profile Clusters P P P P P P P

Inferring Functional Linkages from the Gene Neighbor Method genome 1... genome 2genome 3 genome 4 A A A A B B B B C C C C A B C A statistically significant correlation is observed between the positions of proteins A and B across multiple genomes. A functional relationship is inferred between proteins A and B, but not between the other pairs of proteins:

gene A gene B gene C OPERON method of inferring functional linkages in the genome of Mycobacterium tuberculosis

Examples of Inferred Protein Networks

Features of Inferred Networks Linkages are between non-homologs The function of each protein is defined in its cellular context by its linkages Each linkage is assigned a probability Need to separate and visualize each module of functional proteins

Prions Can spontaneously change shape (10 -5 ) Can induce shape change in other proteins Contagious, inheritable Sup35 –Translation termination factor –Prion: reads through stop codons

Sup 35 Interactions

A Network of Proteins Related to Flagella fliA

Flagella-Related Transcription Factors

fliA Flagella-Specific ATP Synthase

fliA Chemotaxis-Related Proteins

Quantitative Assessment of Inferred Protein Linkages

Assessing Functional Linkages by method of Keyword Recovery X A Y Z Is the linkage inferred between A and Y valid ? Compare the keyword annotations common to A and Y, with the number expected at random = “signal-to-noise”

Assessing Inference of Linkages by Recovery of Keywords YEAST GENOME (Marcotte et al. Nature, 1999) Inferrence Method # functional Signal links to noise Individual Methods: Experiment (2-hybrid) Phylogenetic Profiles20,749 5 Rosetta Stone50,459 3 Correlated mRNA expression 26,013 2 Combined methods: Links by ≥ 2 methods 1,271 8

Observed Functional Linkages

Database of Interacting Proteins Experimentally detected interactions from the scientific literature

The DIP Database DOE-MBI LSBMM, UCLA

  Paul D. Boyer Nature (1999) XRay structure Stock et al. Science (1999) Two-hybrid assay Moritani C et al. Biochim Biophys (1996) ATP Synthase ATP Synthase in DIP HOW PROTEIN INTERACTIONS ARE REPRESENTED IN DIP

Number of proteins per network Number of clusters DIP contains 22,000 interactions from 2,200 articles, representing hundreds of networks

b Assessing DIP reliability with mRNA expression data (ERP)

Coexpression Measure Euclidean distance Protein A Protein B expression level log(e/e ref )

Extracting the Expression Profile (EPR) Index d2d2 p x 100 Model: non Y2H all vs all 40% of Y2H is highly reliable Overall: 50% of 8000 yeast DIP interactions are reliable = 4000 Interacting Experimental (Y2H) Random fitted  =0.4

Many proteins interacting in a Y2H may never meet each other in the cell!

Expression Profile Data Assesses Interaction Data (EPR Index; Deane et al. Molec & Cell Proteomics, 2002) DOE-MBI LSBMM, UCLA Uetz, P. et al.; Nature 403: (2000) Ito, T. et al.; PNAS 97: (2000) Ito, T. et al.; PNAS 98: (2001) Ito, T. et al.; PNAS 97: (2000) Ito, T. et al.; PNAS 98: (2001) Ho, Y. et al.; Nature 415: (2002) Gavin, A.C. et al.; Nature 415: (2002)  7.8+/-3.7%  19.5+/-3.7%  48.7+/-6.9%  89.5+/-6.6%

Measured vs Inferred Functional Interactions

EPR Assessment of Inferred Interactions vs. Y2Hybrid & MS MethodLinkagesEPR index Rosetta S +Phylo P 24, Yeast 2 hybrid ~4, Mass spec ~4, DIP yeast overall 6,1440.3

Some Conclusions A X Y Z B V C A protein’s function is defined by the cellular context of its linkages Many functional linkages are revealed from genomic data Validity of functional linkages can be assessed computationally by keyword recovery or experimentally by expression data Functional complexes can be discovered from genomic data