Bioinformatics, Vol.17 Suppl.1 (ISMB 2001) Weekly Lab. Seminar

Slides:



Advertisements
Similar presentations
Molecular Biomedical Informatics Machine Learning and Bioinformatics Machine Learning & Bioinformatics 1.
Advertisements

Recombinant DNA Technology
Using phylogenetic profiles to predict protein function and localization As discussed by Catherine Grasso.
Table 2 shows that the set TFsf-TGblbs of predicted regulatory links has better results than the other two sets, based on having a significantly higher.
Basics of Comparative Genomics Dr G. P. S. Raghava.
Research Methodology of Biotechnology: Protein-Protein Interactions Yao-Te Huang Aug 16, 2011.
A Real-life Application of Barabasi’s Scale-Free Power-Law Presentation for ENGS 112 Doug Madory Wed, 1 JUN 05 Fri, 27 MAY 05.
Structural bioinformatics
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
Introduction to biological networks. protein-gene interactions protein-protein interactions PROTEOME GENOME Citrate Cycle METABOLISM Bio-chemical reactions.
Comparative ab initio prediction of gene structures using pair HMMs
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
ANALYSIS OF GENETIC NETWORKS USING ATTRIBUTED GRAPH MATCHING.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Marcotte EM, Pellegrini M, Ng HL, Rice DW, Yeates TO, Eisenberg D. (1999). Detecting protein function and protein-protein interactions from genome sequences.
Comparative Expression Moran Yassour +=. Goal Build a multi-species gene-coexpression network Find functions of unknown genes Discover how the genes.
Inferring subnetworks from perturbed expression profiles Dana Pe’er, Aviv Regev, Gal Elidan and Nir Friedman Bioinformatics, Vol.17 Suppl
Protein Classification A comparison of function inference techniques.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Comparative Genomics of the Eukaryotes
Protein-protein interactions Chapter 12. Stable complex Transient Interaction Transient Signaling Complex Rap1A – cRaf1 Interface 1310 Å 2 Stable complex:
BIONFORMATIC ALGORITHMS Ryan Tinsley Brandon Lile May 9th, 2014.
Bioinformatics and it’s methods Prepared by: Petro Rogutskyi
Functional Linkages between Proteins. Introduction Piles of Information Flakes of Knowledge AGCATCCGACTAGCATCAGCTAGCAGCAGA CTCACGATGTGACTGCATGCGTCATTATCTA.
Genetic network inference: from co-expression clustering to reverse engineering Patrik D’haeseleer,Shoudan Liang and Roland Somogyi.
Functional Associations of Protein in Entire Genomes Sequences Bioinformatics Center of Shanghai Institutes for Biological Sciences Bingding.
BASys: A Web Server for Automated Bacterial Genome Annotation Gary Van Domselaar †, Paul Stothard, Savita Shrivastava, Joseph A. Cruz, AnChi Guo, Xiaoli.
Creating Metabolic Network Models using Text Mining and Expert Knowledge J.A. Dickerson, D. Berleant, Z. Cox, W. Qi, and E. Wurtele Iowa State University.
Reconstruction of Transcriptional Regulatory Networks
HUMAN-MOUSE CONSERVED COEXPRESSION NETWORKS PREDICT CANDIDATE DISEASE GENES Ala U., Piro R., Grassi E., Damasco C., Silengo L., Brunner H., Provero P.
Unraveling condition specific gene transcriptional regulatory networks in Saccharomyces cerevisiae Speaker: Chunhui Cai.
Discovering the Correlation Between Evolutionary Genomics and Protein-Protein Interaction Rezaul Kabir and Brett Thompson
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
PIRSF Classification System PIRSF: Evolutionary relationships of proteins from super- to sub-families Homeomorphic Family: Homologous proteins sharing.
Anis Karimpour-Fard ‡, Ryan T. Gill †,
Starting Monday M Oct 29 –Back to BLAST and Orthology (readings posted) will focus on the BLAST algorithm, different types and applications of BLAST; in.
Decoding the Network Footprint of Diseases With increasing availability of data, there is significant activity directed towards correlating genomic, proteomic,
PPI team Progress Report PPI team, IDB Lab. Sangwon Yoo, Hoyoung Jeong, Taewhi Lee Mar 2006.
By: Amira Djebbari and John Quackenbush BMC Systems Biology 2008, 2: 57 Presented by: Garron Wright April 20, 2009 CSCE 582.
I. Prolinks: a database of protein functional linkage derived from coevolution II. STRING: known and predicted protein-protein associations, integrated.
Introduction to biological molecular networks
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Large-scale Prediction of Yeast Gene Function Introduction to Bio-Informatics Winter Roi Adadi Naama Kraus
Discovering functional interaction patterns in Protein-Protein Interactions Networks   Authors: Mehmet E Turnalp Tolga Can Presented By: Sandeep Kumar.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
1 Computational functional genomics Lital Haham Sivan Pearl.
Bioinformatics Research Overview Li Liao Develop new algorithms and (statistical) learning methods > Capable of incorporating domain knowledge > Effective,
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Network Analysis Goal: to turn a list of genes/proteins/metabolites into a network to capture insights about the biological system 1.Types of high-throughput.
Clustering [Idea only, Chapter 10.1, 10.2, 10.4].
Mining Coherent Dense Subgraphs across Multiple Biological Networks Vahid Mirjalili CSE 891.
Biological networks CS 5263 Bioinformatics.
Basics of Comparative Genomics
FLiPS Functional Linkage Prediction Service.
Selection of Oligonucleotide Probes for Protein Coding Sequences
Prediction of Regulatory Elements for Non-Model Organisms Rachita Sharma, Patricia.
Predicting Active Site Residue Annotations in the Pfam Database
Mutual exclusivity analysis identifies oncogenic
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Introduction to Bioinformatics II
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Volume 138, Issue 4, Pages (August 2009)
Summarized by Ji-Yeon Lee & Soo-Yong Shin
SEG5010 Presentation Zhou Lanjun.
Anastasia Baryshnikova  Cell Systems 
Basics of Comparative Genomics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Restructuring Sparse High Dimensional Data for Effective Retrieval
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

Bioinformatics, Vol.17 Suppl.1 (ISMB 2001) Weekly Lab. Seminar Summarized by Jae-Hong Eom

© 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/ Overview ‘interaction-domain pair profile’ method A technique to predict protein-protein interaction maps across organisms. Uses a high-quality protein interaction map with interaction domain information as input to predict an interaction map in another organism. Combine sequence similarly searches with clustering. Apply to this approach to the prediction of an interaction map of Escherichia coli. Results are compared with predictions of naïve method. (based only on full-length protein sequence similarity). Improved sensitivity and identification of false-positives. © 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/

© 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/ Introduction Recent emergence of high-throughout techniques to systemically identify physical interactions between proteins  opened new prospects. Protein interaction maps can provide detailed functional insights On characterized as well as not yet uncharacterized proteins. With an information base for the identification of biological complexes and metabolic or signal transduction pathways. © 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/

© 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/ Introduction (2) On the experimental front: High-throughput techniques derived from the yeast two-hybrid have been used to build protein interaction maps fro several organisms. E.g. Saccharomyces cerevisiae, Caenorhabditis elegans etc. On the computational front: Protein linkage maps have also been predicted ab initio using algorithms based on a sequence data from completely sequenced genomes. E.g. ‘Rosetta stone’ / ‘gene fusion’ method, ‘phylogenetic profile’ method, ‘gene neighbor’ method, the mRNA expression level correlation method. © 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/

© 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/ Introduction (3) In this paper, present a computational approach Aimed at predicting the protein interaction map of a target organism from a large-scale “reference” interaction map (of a source organism). – that includes interaction domain information. ‘Interacting Domain Profile Pairs’ (IDPP) approach Based on a combination of interaction data and sequence data, and uses a combination of homology searches and clustering. Apply this method to the inference of A protein interaction map of a model organism Escherichia coli from a Helicobacter pylori reference interaction map. Compared with ‘naïve method’ Prediction method based only on full-length protein sequence similarities. © 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/

© 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/ Introduction (4) IDs di dj In Pi Pj © 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Algorithm #1 – Naïve method The correspondence is meant To link a protein in the source organism with any number of proteins sharing the same interaction capabilities in the target organism. © 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Algorithm #1 – Naïve method (2) Construction of a correspondence between PS and PT . The library of target sequences is screened against the full-length sequences of proteins connected in MS.. A protein xT of PT is termed homologous to a protein xS of PS if, There is a significant sequence similarity between their sequences. Prediction of interactions Target interaction map MT is completed by linking the proteins in PT. Interaction is predicted between two different target proteins xT and yT if there are two different proteins xS and yS, respectively homologous to xT and yT, and interacting in MS. © 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Algorithm #1 – Naïve method (3) Two major a priori weakness Does not take account the range of interactions. Does not fully exploit the network structure of MS .  these two properties of the input data MS can be exploited to increase the sensitivity of the prediction algorithm. © 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Algorithm #2 – IDPP method The aim of IDPP algorithm Predict a protein interaction map on a target proteome PT from a source map MS . Fully exploit the properties of the available instance of MS. The availability of domain information for each interaction. The fact that from a given ID domain d of a protein x, the protein interaction map will typically provide several instances of domain interaction with d. Use additional step, for Transformation source map into abstract interaction map (MDS). Correspondence in built between this abstract interaction map and the target proteome. Interactions are inferred along this correspondence. © 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Naïve method vs. IDPP method © 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Algorithm #2 – IDPP method (2) First step, cluster domain of different proteins that interact with a common region. Fro each protein xS in PS , examine the IDs of all proteins interacting with xS . Theses Ids can be clustered into interacting clusters (IC) Where an IC of protein xS is defined as a set of MS IDs interacting with common region of xS . © 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Algorithm #2 – IDPP method (3) Interacting cluster (IC) can be viewed as a clique (A sub-graph where each vertex is linked to all others.) All the IDs are pair-connected by links – I link. Mean ‘interact with the same part of the x protein’. Clustering of homologous IDs Within each IC, regroup domains by sequence similarities. ID sequences are pairwise compared. Alignment above certain threshold are considered significant. If domains d1 and d2 show a significant similarity,  a sequence similarity link (S-link) is generated between d1 and d2 . © 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Algorithm #2 – IDPP method (4) Domains are clustered on the basis of S-links. Clustering Non-transitive Non-exclusive Resulting clusters are cliques both in terms of S-links and I-links. This called n-SIC (Similarity & Interacting Cliques) n : the number of IDs in the cluster. Construction of MDS edges (interaction between domain clusters) ID Profile Pairs (IDPP): Interaction between SICs. Analyze all possible pairs of SIC. © 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/

© 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/ Threshold T = 75% Distinguished types of IDPP ‘1:1 ’ ‘1:n ’ ‘n:m ’ (n > 0) © 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Algorithm #2 – IDPP method (5) © 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/

Algorithm #2 – IDPP method (6) Searching for similarities between domain profiles in target proteome PT . For each n-SIC, a library containing the target protein sequence is screened (using as a probe a single ID seq. if n=1, else a ID profile) Significant hits define homologies between target protein domains and source ID profiles. The correspondence between vertices of MDS and PT is Defined by associating to each n-SIS the set of PT proteins similar to its profiles. Inference from MDS to MT : prediction of interaction from the IDPP collection. Final step. The property “x interacts with y” is transported along the correspondence. Inference step is similar to the one described as ‘naïve’ method. © 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/

© 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/ Results © 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/

© 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/

© 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/

© 2001, SNU Biointelligence Lab, http://bi.snu.ac.kr/