Download presentation
Presentation is loading. Please wait.
1
Protein interaction Computational (inferred) Experimental (observed)
2
Inference of Protein function
3
Guilt by association Five methods for inferring functional interactions: Complexes, Pathways Rosetta stone Phylogenetic profiles Expression profiles Gene neighbors Operon method
5
Five Known Rosetta Stone Links: Marcotte et al., Science, 285, pp. 751-753 (1999)
6
Pathways Detected by the Domain Fusion (Rosetta stone) Method AroHYDIBAroK PurF AroFAroE AroL AroGPur2 AroA AroB AroBPurTPur3 B AroDPurL AroEPur7 Pur5 Pur2 PurU AroKPurE Pur3 PurE C AroL Pur5 PurK AroAPurK PurTPur7 AroCGuaA GuaBPurB PurB A D GuaAPurHPurA Marcotte et al., Science, 285, pp. 751-753 (1999)
7
PHYLOGENETIC PROFILE METHOD
8
YC083W homology to thioredoxin MRPL2 MRPL6 MRPL7 MRPL10 known MRPL16 ribosomalpredicted to YGR021WMRPL23 proteins target member ofMRPS9 mitochondria highlyMRPS28 conservedMRF1 peptide chain release factor protein protein familyYJR113C homology to ribosomal protein S7 synthesis of unknownMSY1 ttrosyl-tRNA synthetase functionYGL068W probable ribosomal protein L12 MGE1 heat shock protein/chaperone YDR116C homology to bacterial ribosomal L1 protein YHR189W homology to peptidyl-tRNA hydrolase SIS1/XDJ homology to DraJ heat shock protein PDR13/SSE1/LHS1 homolgoy to Hsp70 RIB2 DRAP deaminase YDL036C homology to Rib2/pseudouridine synthase MIS1 C1-THF synthase ADE3 C1-THF synthase TPI1 tiose phosphate isomerase YGL236C homology to conserved gidA family, unknown function YOL060C homology to hypothetical C. elegans protein M02F4.4 Phylogenetic Profile: e.g. for a yeast protein
9
Microarray Co-expression Analysis Conclusion: P2 and P7 are functionally linked P3 and P6 are functionally linked Expression Profiles Rich med Starve High T P1 1 0 1 P2 1 1 0 P3 0 1 1 P4 1 0 0 P5 1 1 1 P6 0 1 1 P7 1 1 0 Profile Clusters P4 1 0 0 P1 1 0 1 P5 1 1 1 P2 1 1 0 P7 1 1 0 P3 0 1 1 P6 0 1 1
10
Inferring Functional Linkages from the Gene Neighbor Method genome 1... genome 2genome 3 genome 4 A A A A B B B B C C C C A B C A statistically significant correlation is observed between the positions of proteins A and B across multiple genomes. A functional relationship is inferred between proteins A and B, but not between the other pairs of proteins:
11
gene A gene B gene C OPERON method of inferring functional linkages in the genome of Mycobacterium tuberculosis
12
Examples of Inferred Protein Networks
13
Features of Inferred Networks Linkages are between non-homologs The function of each protein is defined in its cellular context by its linkages Each linkage is assigned a probability Need to separate and visualize each module of functional proteins
14
Prions Can spontaneously change shape (10 -5 ) Can induce shape change in other proteins Contagious, inheritable Sup35 –Translation termination factor –Prion: reads through stop codons
16
Sup 35 Interactions
17
A Network of Proteins Related to Flagella fliA
18
Flagella-Related Transcription Factors
19
fliA Flagella-Specific ATP Synthase
20
fliA Chemotaxis-Related Proteins
21
Quantitative Assessment of Inferred Protein Linkages
22
Assessing Functional Linkages by method of Keyword Recovery X A Y Z Is the linkage inferred between A and Y valid ? Compare the keyword annotations common to A and Y, with the number expected at random = “signal-to-noise”
23
Assessing Inference of Linkages by Recovery of Keywords YEAST GENOME (Marcotte et al. Nature, 1999) Inferrence Method # functional Signal links to noise Individual Methods: Experiment (2-hybrid) 500 8 Phylogenetic Profiles20,749 5 Rosetta Stone50,459 3 Correlated mRNA expression 26,013 2 Combined methods: Links by ≥ 2 methods 1,271 8
24
Observed Functional Linkages
25
Database of Interacting Proteins www.dip.doe-mbi.ucla.edu Experimentally detected interactions from the scientific literature
26
The DIP Database DOE-MBI LSBMM, UCLA
27
Paul D. Boyer Nature (1999) XRay structure Stock et al. Science (1999) Two-hybrid assay Moritani C et al. Biochim Biophys (1996) ATP Synthase ATP Synthase in DIP HOW PROTEIN INTERACTIONS ARE REPRESENTED IN DIP
28
0 50 100 150 200 123456789101113166872138 Number of proteins per network Number of clusters DIP contains 22,000 interactions from 2,200 articles, representing hundreds of networks
29
b Assessing DIP reliability with mRNA expression data (ERP)
30
Coexpression Measure Euclidean distance Protein A Protein B expression level log(e/e ref )
31
Extracting the Expression Profile (EPR) Index 0 4 8 12 16 20 0102030 d2d2 p x 100 Model: non Y2H all vs all 40% of Y2H is highly reliable Overall: 50% of 8000 yeast DIP interactions are reliable = 4000 Interacting Experimental (Y2H) Random fitted =0.4
32
Many proteins interacting in a Y2H may never meet each other in the cell!
33
Expression Profile Data Assesses Interaction Data (EPR Index; Deane et al. Molec & Cell Proteomics, 2002) DOE-MBI LSBMM, UCLA Uetz, P. et al.; Nature 403: 623-627 (2000) Ito, T. et al.; PNAS 97:1143-47 (2000) Ito, T. et al.; PNAS 98:4569-74 (2001) Ito, T. et al.; PNAS 97:1143-47 (2000) Ito, T. et al.; PNAS 98:4569-74 (2001) Ho, Y. et al.; Nature 415:180-183 (2002) Gavin, A.C. et al.; Nature 415: 141-147 (2002) 7.8+/-3.7% 19.5+/-3.7% 48.7+/-6.9% 89.5+/-6.6%
35
Measured vs Inferred Functional Interactions
36
EPR Assessment of Inferred Interactions vs. Y2Hybrid & MS MethodLinkagesEPR index Rosetta S +Phylo P 24,2000.4-0.6 Yeast 2 hybrid ~4,0000.2-0.5 Mass spec ~4,0000.1-0.9 DIP yeast overall 6,1440.3
37
Some Conclusions A X Y Z B V C A protein’s function is defined by the cellular context of its linkages Many functional linkages are revealed from genomic data Validity of functional linkages can be assessed computationally by keyword recovery or experimentally by expression data Functional complexes can be discovered from genomic data
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.