Anis Karimpour-Fard ‡, Ryan T. Gill †,

Investigation of factors affecting prediction of protein-protein interaction networks by phylogenetic profiling
Anis Karimpour-Fard ‡, Ryan T. Gill †, and Lawrence Hunter ‡
‡ University of Colorado School of Medicine
† Department of Chemical and Biological Engineering, University of Colorado, Boulder
Dec 1, 2007

The meaning of protein function
Eisenberg, D. et al. Nature 2000
Biochemical view: the function of protein A is its action on a substrate (S) to form a product (P).
Post-genomic view: the function of A is the context of its interactions with other proteins in the cell.
The problem: more than 500 microbial genomes are fully sequenced, and a high percentage of their genes have unknown function. For example:
– E. coli K12: 15%
– P. aeruginosa: 45%

Predicting protein function
Homology-based methods (give only a partial understanding of a protein's role)
– Simple sequence similarity searches (BLAST)
– Profile searches (PSI-BLAST)
– Databases of conserved domains (Pfam, SMART)
Prediction from genomic context
– Phylogenetic profile
– Gene cluster
– Gene neighbor
– Rosetta Stone
Prediction from high-throughput experimental data
– Microarray gene expression data
– Protein-protein interaction screens
– ...

Phylogenetic Profile
Pellegrini et al. PNAS 96, 4285 (1999); Marcotte et al. PNAS 97, (2000)
1- Select a set of genomes as the reference set.
2- Create the phylogenetic profile matrix for the target organism: run a one-against-all BLAST search to identify homologs of every target gene in diverse reference organisms. A BLAST E-value threshold decides whether each gene counts as present or absent in each reference genome.
Questions investigated:
– Does the selection of the reference genomes influence the prediction? If so, how?
– How does the E-value threshold affect the predicted protein-protein interactions?
Stages shown in the diagram: reference selection; BLAST E-value threshold (present or absent); measure profile similarities.
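Step 2 above can be sketched as follows. This is a minimal illustration, not the authors' pipeline: it assumes the BLAST results have already been reduced to one best-hit E-value per (target protein, reference genome) pair, with `math.inf` standing for "no hit". The function name `build_profile_matrix`, the toy E-values, and both thresholds are hypothetical.

```python
import math

def build_profile_matrix(best_evalues, threshold):
    """Turn best-hit E-values into a binary presence/absence matrix.

    best_evalues: list of rows, one per target protein; each row holds the
    best BLAST E-value against each reference genome (math.inf = no hit).
    A protein is called 'present' (1) when its E-value is <= threshold.
    """
    return [[1 if e <= threshold else 0 for e in row] for row in best_evalues]

# Toy data (hypothetical): 3 target proteins x 3 reference genomes.
best_evalues = [
    [1e-50, 1e-3, math.inf],
    [1e-40, 2e-4, math.inf],
    [math.inf, 1e-12, 1e-8],
]

strict = build_profile_matrix(best_evalues, threshold=1e-10)
loose = build_profile_matrix(best_evalues, threshold=1e-2)
print(strict)
print(loose)
```

Running the same data through two thresholds shows why the E-value cutoff matters: weak hits flip from absent to present, which changes the profiles and therefore which proteins later look similar.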

3- Measure profile similarities
Protein X: …
Protein Y: …
… matching bits out of …
4- Generate the protein-protein interaction network: two nodes are connected if the two proteins have similar profiles.
5- Create clusters from the set of protein-protein interactions.
6- Visualize the network.
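Steps 3 to 5 can be sketched in a few lines. This is an illustrative sketch under stated assumptions: similarity is the count of matching bits, the cutoff `min_matches` is a hypothetical parameter, and clusters are taken to be connected components of the network (the slides do not say which clustering method was used). The profiles are toy data.

```python
from itertools import combinations

def matching_bits(profile_a, profile_b):
    """Count positions where two binary profiles agree (shared 1s and shared 0s)."""
    return sum(1 for a, b in zip(profile_a, profile_b) if a == b)

def build_network(profiles, min_matches):
    """Connect two proteins when their profiles agree in at least min_matches positions."""
    edges = set()
    for x, y in combinations(sorted(profiles), 2):
        if matching_bits(profiles[x], profiles[y]) >= min_matches:
            edges.add((x, y))
    return edges

def connected_components(nodes, edges):
    """Group nodes into clusters; here a cluster is a connected component (assumption)."""
    neighbors = {n: set() for n in nodes}
    for x, y in edges:
        neighbors[x].add(y)
        neighbors[y].add(x)
    seen, clusters = set(), []
    for start in sorted(nodes):
        if start in seen:
            continue
        stack, component = [start], set()
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(neighbors[node] - component)
        seen |= component
        clusters.append(sorted(component))
    return clusters

# Toy profiles (hypothetical) over six reference genomes.
profiles = {
    "X": [1, 0, 1, 1, 0, 1],
    "Y": [1, 0, 1, 1, 0, 0],
    "Z": [0, 1, 0, 0, 1, 0],
    "W": [0, 1, 0, 0, 1, 1],
}
edges = build_network(profiles, min_matches=5)
clusters = connected_components(profiles, edges)
print(sorted(edges), clusters)
```

Counting matching bits rewards shared absences as much as shared presences, which is one reason the choice of reference genomes can change the resulting network.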

Measure profile similarities
Two nodes (Protein X, Protein Y) are connected if the two proteins have similar profiles.
Mutual information
MI(X, Y) = H(X) + H(Y) - H(X, Y)
H(Y) = -∑ p(i) ln p(i), where p(i), i = 0, 1, is the fraction of genomes in which protein Y is in state i
Pearson correlation coefficient
Inverse homology
Calculate the homology between two genomes: H_{i,j} is the ratio of the number of homologs of each reference organism j to the number of proteins in the target genome i. P_{ij} = 1/H_{i,j}, otherwise P_{ij} = 0.
Karimpour-Fard et al. BMC Genomics. 2007;8(1):393
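The two fully specified measures above, mutual information and the Pearson correlation coefficient, can be computed directly from a pair of binary profiles. The sketch below follows the formulas as stated, using natural logarithms. The function names and toy profiles are hypothetical, and raising an error for constant profiles in `pearson` is a design choice made here, not something the slides specify.

```python
import math

def entropy(counts, total):
    """H = -sum p ln p over the observed states; empty states contribute 0."""
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)

def mutual_information(x, y):
    """MI(X, Y) = H(X) + H(Y) - H(X, Y) for two equal-length binary profiles."""
    n = len(x)
    hx = entropy([x.count(0), x.count(1)], n)
    hy = entropy([y.count(0), y.count(1)], n)
    joint = {}
    for pair in zip(x, y):
        joint[pair] = joint.get(pair, 0) + 1
    hxy = entropy(joint.values(), n)
    return hx + hy - hxy

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        raise ValueError("correlation is undefined for a constant profile")
    return cov / math.sqrt(vx * vy)

# Toy profiles (hypothetical).
identical = ([1, 0, 1, 0], [1, 0, 1, 0])
independent = ([1, 1, 0, 0], [1, 0, 1, 0])
mi_same = mutual_information(*identical)
mi_indep = mutual_information(*independent)
r_same = pearson(*identical)
r_indep = pearson(*independent)
print(mi_same, mi_indep, r_same, r_indep)
```

Two identical balanced profiles give MI = ln 2 and a correlation of 1, while the statistically independent pair gives values near 0 for both measures.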

c) Comparison of different combinations of reference genomes and E-value thresholds using COG
PPV = TP/(TP+FP)
– TP = number of predicted pairs in the same functional category
– FP = number of predicted pairs that were classified but were not in the same functional category
Reference genome sets compared: Random sets, All, Low GC, Aerobic
Karimpour-Fard et al. BMC Genomics. 2007;8(1):393
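The PPV definition above translates into a short function. This is a sketch under simplifying assumptions: each classified protein carries exactly one functional category, pairs with an unclassified member are skipped (they count as neither TP nor FP), and the protein names and category letters are made up for illustration.

```python
def positive_predictive_value(predicted_pairs, category_of):
    """PPV = TP / (TP + FP) over predicted pairs whose members are both classified.

    category_of maps a protein name to its functional category; proteins
    missing from the mapping are treated as unclassified and their pairs skipped.
    Returns None when no predicted pair is fully classified.
    """
    tp = fp = 0
    for a, b in predicted_pairs:
        if a not in category_of or b not in category_of:
            continue  # unclassified: counts as neither TP nor FP
        if category_of[a] == category_of[b]:
            tp += 1
        else:
            fp += 1
    if tp + fp == 0:
        return None
    return tp / (tp + fp)

# Toy example (hypothetical proteins and categories).
category_of = {"p1": "E", "p2": "E", "p3": "C", "p4": "C"}
predicted_pairs = [("p1", "p2"), ("p3", "p4"), ("p1", "p3"), ("p1", "p5")]
ppv = positive_predictive_value(predicted_pairs, category_of)
print(ppv)
```

Here two pairs share a category, one does not, and the pair involving the unclassified `p5` is ignored, giving a PPV of 2/3.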

Co-evolution can be used to assign function to unstudied genes
Hypothetical proteins YcgB, YeaH, and YeaG are co-conserved across different species.
Comparison of sub-graphs across species (CS-CCC) suggested that a previously unstudied S. typhimurium gene, ycgB, is functionally related to yeaH.
Experimental data support the hypothesis that both genes are important for antimicrobial peptide resistance.
Edge color code:
– E. coli K12 (green)
– E. coli O157 (blue)
– Shigella flexneri (black)
– S. typhimurium LT2 (purple)
– P. aeruginosa (mustard)
Karimpour-Fard et al. Genome Biology :R185