Marcotte EM, Pellegrini M, Ng HL, Rice DW, Yeates TO, Eisenberg D. (1999). Detecting protein function and protein-protein interactions from genome sequences.

Slides:



Advertisements
Similar presentations
Using phylogenetic profiles to predict protein function and localization As discussed by Catherine Grasso.
Advertisements

Protein Targeting by Functional Linkage of Non-Homologous Proteins with examples from M. tuberculosis Genome-wide functional linkage map Structural Genomics.
Cell signaling: responding to the outside world Cells interact with their environment by interpreting extracellular signals via proteins that span their.
Research Methodology of Biotechnology: Protein-Protein Interactions Yao-Te Huang Aug 16, 2011.
A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae Article by Peter Uetz, et.al. Presented by Kerstin Obando.
Protein-protein interactions
Biological Gene and Protein Networks
Protein domains vs. structure domains - an example.
An Exploratory Method to Reconstruct Pathways Cory Tobin.
Protein-protein interactions Ia. A combined algorithm for genome-wide prediction of protein function. Edward M. Marcotte, Matteo Pellegrini, Michael J.
Introduction to biological networks. protein-gene interactions protein-protein interactions PROTEOME GENOME Citrate Cycle METABOLISM Bio-chemical reactions.
Identifying functional residues of proteins from sequence info Using MSA (multiple sequence alignment) - search for remote homologs using HMMs or profiles.
Protein Modules An Introduction to Bioinformatics.
Protein-protein Interactions Hsueh-Fen Juan 2003, Mar 31 NTNU.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Protein-Protein Interaction Screens. Bacterial Two-Hybrid System selectable marker RNA polymerase DNA binding protein bait target sequence target.
Affinity chromatography/mass spec Bait protein GST Page 252.
Protein Classification A comparison of function inference techniques.
Protein-protein Interactions June 18, Why PPI?  Protein-protein interactions determine outcome of most cellular processes  Proteins which are.
Interaction Networks in Biology: Interface between Physics and Biology, Shekhar C. Mande, August 24, 2009 Interaction Networks in Biology: Interface between.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Protein-protein interactions Chapter 12. Stable complex Transient Interaction Transient Signaling Complex Rap1A – cRaf1 Interface 1310 Å 2 Stable complex:
Protein protein interactions
Protein Bioinformatics Course
Functional Linkages between Proteins. Introduction Piles of Information Flakes of Knowledge AGCATCCGACTAGCATCAGCTAGCAGCAGA CTCACGATGTGACTGCATGCGTCATTATCTA.
9/14/20151 EECS 730 Introduction to Bioinformatics Introduction to Proteomics Luke Huan Electrical Engineering and Computer Science
Research Methodology of Biotechnology: Protein-Protein Interactions
Protein analysis and proteomics (Part 2 of 2). Many of the images in this powerpoint presentation are from Bioinformatics and Functional Genomics by Jonathan.
B. Signal Transduction Pathway (cell signaling)
Interactions and more interactions
Transient Protein-Protein Interactions (TPPI)
Protein-protein interactions Courtesy of Sarah Teichmann & Jose B. Pereira-Leal MRC Laboratory of Molecular Biology, Cambridge, UK EMBL-EBI.
Functional Associations of Protein in Entire Genomes Sequences Bioinformatics Center of Shanghai Institutes for Biological Sciences Bingding.
Finish up array applications Move on to proteomics Protein microarrays.
Advanced Bioinformatics
Bioinformatics: Theory and Practice – Striking a Balance (a plea for teaching, as well as doing, Bioinformatics) Practice (Molecular Biology) Theory: Central.
Proteome and interactome Bioinformatics.
Computational prediction of protein-protein interactions Rong Liu
CISC667, F05, Lec9, Liao CISC 667 Intro to Bioinformatics (Fall 2005) Sequence Database search Heuristic algorithms –FASTA –BLAST –PSI-BLAST.
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
Functional Genomics Functional genomic datasets Biological networks Integrating genomic datasets BIO520 BioinformaticsJim Lund.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Lecture 9. Functional Genomics at the Protein Level: Proteomics.
Anis Karimpour-Fard ‡, Ryan T. Gill †,
Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function.
Reverse Interactomics
An overview of Bioinformatics. Cell and Central Dogma.
I. Prolinks: a database of protein functional linkage derived from coevolution II. STRING: known and predicted protein-protein associations, integrated.
Functional and Evolutionary Attributes through Analysis of Metabolism Sophia Tsoka European Bioinformatics Institute Cambridge UK.
Introduction to biological molecular networks
Pairwise Sequence Alignment Part 2. Outline Summary Local and Global alignments FASTA and BLAST algorithms Evaluating significance of alignments Alignment.
Russell Group, Protein Evolution _________ ____ Rob Russell Cell Networks University of Heidelberg Interactions and Modules: the how and why of molecular.
Biol 729 – Proteome Bioinformatics Dr M. J. Fisher - Protein: Protein Interactions.
Announcements: Note that there will be presentations and associated paper summaries for both Thursday and Tuesday classes. The Exam II mean is 81.6 and.
1 Protein-Protein Interactions High-throughput strategy –Prediction from sequence In silico analysis –Protein A from species A: domain 1 and 2 –Protein.
How many interactions are there? ~6,200 genes ~6,200 proteins x 2-10 interactions/protein ~12, ,000 interactions Yeast.
Predicting Protein Function Annotation using Protein- Protein Interaction Networks By Tamar Eldad Advisor: Dr. Yanay Ofran Computational Biology.
Protein interactions: main methods for detection (all organisms) Two-hybrid8,446 (Co-)Immunoprecipitation567 Interaction adhesion assay225 In vitro binding138.
The two-hybrid system – why?
1 Computational functional genomics Lital Haham Sivan Pearl.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
PROTEIN INTERACTION NETWORK – INFERENCE TOOL DIVYA RAO CANDIDATE FOR MASTER OF SCIENCE IN BIOINFORMATICS ADVISOR: Dr. FILIPPO MENCZER CAPSTONE PROJECT.
Detecting Protein Function and Protein-Protein Interactions from Genome Sequences TuyetLinh Nguyen.
Protein-Protein Interactions. A Protein may interact with: –Other proteins –Nucleic Acids –Small molecules Protein Interactions.
Protein-protein Interactions
Protein association networks with STRING
Protein Interaction Networks
Approaches to signal transduction:
Bioinformatics, Vol.17 Suppl.1 (ISMB 2001) Weekly Lab. Seminar
Basic Local Alignment Search Tool
Presentation transcript:

Marcotte EM, Pellegrini M, Ng HL, Rice DW, Yeates TO, Eisenberg D. (1999). Detecting protein function and protein-protein interactions from genome sequences. Science 285, Enright AJ, Iliopoulos I, Kyrpides NC, Ouzounis CA. (1999). Protein interaction maps for complete genomes based on gene fusion events. Nature 402, compare briefly with yeast-2-hybrid system (y2h) Protein-protein interactions

Rosetta stone sequences protein A is homologous to subsequence from protein C protein B is homologous to subsequence from protein C subsequences from A and B are NOT homologous to each other

Proteins A and B form a multisubunit complex which is fused into a single protein in sequence C - thermodynamics- pieces don’t need to find each other in cell - efficiency- cell needs to produce much less of each as a result metabolic channeling- mentioned in Nature paper’s last paragraph - enzymes in related biochemical pathway may form functional complexes - substrates could then pass from one enzyme to another directly, instead of diffusing into the cytosol at large - not clear if there is direct evidence showing metabolic channeling anywhere- (tryptophan synthase?) Rationale

Method 1: use domain subsequences defined by Pro-Dom - all pairs of subsequence matches considered and searched for - two proteins which have one from each pair matched against Method 2: sequence comparison - two non-overlapping local sequence alignments to a third protein both use a minimal threshold for id’ing statistical significant scores Marcotte, et al (July 1999)

Trying to test accuracy of independent predictions… Method 1: shared keywords in SWISS-PROT annotations - golly gee… it’s better than random… - 68% vs 15% in E. coli - 32% vs 15% in S. cerevisiae (yeast) Method 2: Database of interacting Proteins - 6.4% of applicable sequences are also in the database - Rosetta Stone is not comprehensive (more on this later) Method 3: phylogenetic profiles (see last week’s papers) - wow it’s better than random too… - 5%, eight times as many as random Marcotte, et al (July 1999) (2)

use BLASTP to compare query genome against itself - formation of a binary matrix (1’s or 0’s in each entry) - symmetrification with Smith-Waterman (local alignments BLASTP to compare query vs reference genome - get a second binary matrix - all pairs of query proteins similar to a reference protein Z-score to test for significance of alignments - set an arbitrary Z-score cutoff to determine coverage/accuracy Enright, et al (NOV 1999)

y2h => yeast-2-hybrid detection of protein-protein interactions - Fields, S. and Song, O. Nature 1989, if you’re curious. transcription factors composed of two separable domains - DNA-binding and transcriptional-complex recruitment And now for something completely different…

is only high-throughput if you are not a postdoc. - lots of transformations and assays - fortunately you only have to transform once. has major problems with false negatives - integral membrane proteins don’t work (don’t fold properly) - post-translational modifications - require nuclear localization - misfolding or steric hindrances - transcription factors (?) also has significant false positives also- not sure why… - Uetz et al (2000) had 20% of interactions screen twice… other validation methods - genetic techniques - biochemical (coIP, affinity chromatography, mass spec, etc) High throughput y2h…

kinase cascades are ubiquitous in signaling pathways -MKKK => MKK => MAPK kinase cascades are ubiquitous in signaling pathways - interesting when looking at given Rosetta Stone examples - will kinase cascades be detected? SH2 and SH3 domains (Marcotte, et al) - SH2 bind phosphorylated tyrosine residues, SH3 bind proline- rich sequences - both are common motifs but have sequence-specific affinity Kinase cascades/signaling pathways are sometimes Y2H targets A brief discussion on signal transduction