Quantitative analysis of domain interactomes Jason Lee Capstone presentation Sp `07.

Slides:



Advertisements
Similar presentations
Molecular Biomedical Informatics Machine Learning and Bioinformatics Machine Learning & Bioinformatics 1.
Advertisements

Development of on-line database & tool for protein interface analysis Suk-hoon Jung.
Journal Club Jenny Gu October 24, Introduction Defining the subset of Superfamilies in LUCA Examine adaptability and expansion of particular superfamilies.
Redefining Nodes and Edges: Relating 3D Structures to Yeast Protein Networks Provides Insights into their Evolution Yeast Genetics Meeting Philip.
MitoInteractome : Mitochondrial Protein Interactome Database Rohit Reja Korean Bioinformation Center, Daejeon, Korea.
Molecular Basis for Relationship between Genotype and Phenotype DNA RNA protein genotype function organism phenotype DNA sequence amino acid sequence transcription.
CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
Collaborators  Donald R. Frohlich University of St. Thomas University of St. Thomas  Jae-Ho Kim Rogers State University Rogers State University  Gary.
Phylogenetic reconstruction
Predicting domain-domain interactions using a parsimony approach Katia Guimaraes, Ph.D. NCBI / NLM / NIH.
Cells Cells have the same basic composition, and the same kinds of organelles, but not all living things are the same Cells are ___________________.
A Real-life Application of Barabasi’s Scale-Free Power-Law Presentation for ENGS 112 Doug Madory Wed, 1 JUN 05 Fri, 27 MAY 05.
Proteome Network Evolution by Gene Duplication S. Cenk Şahinalp Simon Fraser University.
Adaptive evolution of bacterial metabolic networks by horizontal gene transfer Chao Wang Dec 14, 2005.
1 Protein-Protein Interaction Networks MSC Seminar in Computational Biology
WORKSHOP ON ONTOLOGIES OF CELLULAR NETWORKS
Evidence for dynamically organized modularity in the yeast protein- protein interaction network Han, et al
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Comparative Expression Moran Yassour +=. Goal Build a multi-species gene-coexpression network Find functions of unknown genes Discover how the genes.
Protein Classification A comparison of function inference techniques.
Manipulating the Genome: DNA Cloning and Analysis 20.1 – 20.3 Lesson 4.8.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
The Science of Life Biology unifies much of natural science
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Unit 1: The Language of Science  communicate and apply scientific information extracted from various sources (3.B)  evaluate models according to their.
歐亞書局 PRINCIPLES OF BIOCHEMISTRY Chapter 9 DNA-Based Information Technologies.
Frontiers of Genetics Chapter 13.
PutidaNET :Interactome database service and network analysis of Pseudomonas putida KT2440 (P. putida KT2440) Korean BioInformation Center (KOBIC) Seong-Jin,
Role of Rubisco in Photosynthesis Anu Murphy Dept. of Molecular and Integrative Physiology, University of Illinois at Urbana-Champaign.
Everyone is a Biologist ! Chapter 1 What is Life?
Everyone is a Biologist ! Today: Four Questions What are the Characteristics of Life? How diverse is life? How do we study the natural world? Who are.
Genome Organization and Evolution. Assignment For 2/24/04 Read: Lesk, Chapter 2 Exercises 2.1, 2.5, 2.7, p 110 Problem 2.2, p 112 Weblems 2.4, 2.7, pp.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Gene Regulation. Regulation in Prokaryotes Gene Expression = gene to protein processing that functions within cells. Regulation = We are talking about.
Studying Life Vodcast 1.3 Unit 1: Introduction to Biology.
Discovering the Correlation Between Evolutionary Genomics and Protein-Protein Interaction Rezaul Kabir and Brett Thompson
Molecular Basis for Relationship between Genotype and Phenotype DNA RNA protein genotype function organism phenotype DNA sequence amino acid sequence transcription.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
Complementarity of network and sequence information in homologous proteins March, Department of Computing, Imperial College London, London, UK 2.
Protein and RNA Families
Classification Section 18.2 & Phylogeny: Evolutionary relationships among organisms Biologists group organisms into categories that represent lines.
Chapter 15 Classification.
CSCE555 Bioinformatics Lecture 18 Network Biology: Comparison of Networks Across Species Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu.
Reverse Interactomics
DNAmRNAProtein Small molecules Environment Regulatory RNA How a cell is wired The dynamics of such interactions emerge as cellular processes and functions.
3-D Structural Analysis of Protein Interaction Networks Gives New Insight Into Protein Function, Network Topology and Evolution CSB Seminar Philip M. Kim,
Discovering functional interaction patterns in Protein-Protein Interactions Networks   Authors: Mehmet E Turnalp Tolga Can Presented By: Sandeep Kumar.
Ubiquitination Sites Prediction Dah Mee Ko Advisor: Dr.Predrag Radivojac School of Informatics Indiana University May 22, 2009.
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
PROTEIN INTERACTION NETWORK – INFERENCE TOOL DIVYA RAO CANDIDATE FOR MASTER OF SCIENCE IN BIOINFORMATICS ADVISOR: Dr. FILIPPO MENCZER CAPSTONE PROJECT.
Protein. Protein and Roles 1: biological process unknown 1.1 Structural categories 1.2 organism categories 1.3 cellular component o unlocalized.
DNA TranscriptionTranslation The Central Dogma TraitRNA Protein Molecular Genetics - From DNA to Trait RNA processing.
Eukaryotic genes are interrupted by large introns. In eukaryotes, repeated sequences characterize great amounts of noncoding DNA. Bacteria have compact.
General Microbiology (Micr300)
MCB 7200: Molecular Biology
CSCI2950-C Lecture 12 Networks
Interrogation of cross talk between proteins and gene regulatory networks in breast cancer Chambers, Teressa Lee Hiren Karathia Sridhar Hannenhalli.
Protein Interaction Networks
Genomes and Their Evolution
Sourav Roy School of Informatics Indiana University
Relationship between Genotype and Phenotype
What is an Ontology An ontology is a set of terms, relationships and definitions that capture the knowledge of a certain domain. (common ontology ≠ common.
Volume 63, Issue 4, Pages (August 2016)
Relationship between Genotype and Phenotype
Volume 39, Issue 2, Pages (October 2016)
Unit Genomic sequencing
Relationship between Genotype and Phenotype
Volume 30, Issue 3, Pages (May 2008)
Presentation transcript:

Quantitative analysis of domain interactomes Jason Lee Capstone presentation Sp `07

Protein domain Domain architecture of proteins A protein with three domains Protein PKC Each domain carries out certain function Modular nature confers protein a capability to compose domains to effect desired functions

Domain interaction Ex) Pkinase domain Different interfaces mediating domain- domain interaction –distinct ways of interaction Possible units of interaction (a) pdb_1ung: cell division kinase 5 and CDK5 activator

Domain interaction (cont’d) (b) pdb_1buh: CDK2 and CKSHS1 (c) pdb_1b6c: TGF-beta receptor R4 and FK506 binding protein

Purpose and pertinence of current study Characterize domain interactions Characterize protein interactions that are mediated by domain interaction Use domain interaction information to predict protein interactions Gain evolutionary perspective

Data and methods Databases: ipfam, BIND BIND: a compilation of known protein interactions Ipfam: known domain interactions obtained from structural information Five species were examined: human, mouse, fruit fly, yeast and E. coli protein interactions among proteins Take intersection between ipfam and BIND: compile protein interactions that involve known ipfam pair HumanMouseFruit flyYeastE. ColiTotal proteins interaction

Example Protein TGFBR1 interacts with 132 other proteins according to BIND Domains activin_recp and pkinase comprise TGFBR1 Each BIND interaction is checked to see if it involves any of ipfam DDI pairs 44 protein interactions are found to have ipfam pair TGFBR1+GI – Pkinase+PBD TGFBR1+GI Pkinase+FKBP_C TGFBR1+GI Activin_recp+TGF_beta …

Obtained domain interactions 1884 domain-domain interactions among 1587 domains HumanMouseFruit flyYeastE. Coli Domains Interaction

Low coverage of DDI over PPI 1650 PPI in human involve known domain pairs, while 9900 did not 14.29% of total human protein interactions From 5 species, 4604 PPI’s involve at least one domain pair, while did not have any 9.39% of total interactions HumanMouseFruit flyYeastE. coliTotal Ipfam intrn Non-ipfam intrn % Total intrn

Possible explanations of low coverage of DDI High FP rate in PPI data Incomplete coverage of ipfam DDI data Many PPI’s are not mediated by DDI Possible expedience of protein interactions –Domain interaction may be too restricting to answer all physiological and molecular demands from organisms

Domain interaction graph (H. sapiens) Entire domain interactome

Protein interaction graph (H. sapiens) Many subgraphs, only the largest subgraph is shown

Comparison of node degree distribution Both show power-law distribution

Comparison of graph topologies Both domain and protein interaction graphs show scale- free property Domains on average interacts with half the number of partners a protein interacts with PPIDDI Subgraphs Avg. node degree Avg. node degree (excl. single partner nodes) Largest subgraph5422 (68.50)71 (15.14) Nodes

Phylogenetic tree of five species Human a Mouse E. coli Fruit fly Yeast Mammal Multi-cellularEukaryotes Prokaryote single-cell

Measuring commonality of domain composition and interactomes between species Inner product of domains and domain pairs between two species S and T IP_domain =|Common_domains| / sqrt (|Domains_S| * |Domains_T|) IP_pair = |Common_domain_pairs| / sqrt (|Domains_pairs_S| *|Domains_pairs_T|)

Evolutionary consideration Common domains Common domain pairs HumanMouseFruit flyYeastE. coli Human Mouse Fruit fly Yeast E. Coli HumanMouseFruit flyYeastE. coli Human Mouse Fruit fly Yeast E. Coli Common domains and domain pairs reflect evolutionary relationship

Ontological characterization Use GO controlled vocabulary and compare physiological reflection of domain compositions of species Correlation between physiology and domain composition Differential domains – domains that are present exclusively in one lineage or species and not in the other Multi-cellularsingle-cell Response to stimulus 103 Cell communication105 Regulation of cellular process 84 Signal transducer113 Enzyme regulator92 transport1729

Ontological characterization (cont’d) Categories of other differential domains unique to multicellular species –Cell adhesion (2) –Regulation of biological processes (2) –Cell differentiation (1) –Cell death (1) –Cell homeostasis (1) –Coagulation (1) Domains involved in multi-cellularity are conspicuous

Domain node degree and DomainDegreeInstances (copy number) Occur. in intrn. (interaction frequency) Associativity RAS Pkinase RNA_pol_RPB1_ Ubiquitin RNA_pol_RPB2_ Trypsin AAA RNA_pol_L GTP_EFTU SNARE Ten domains with largest degrees

Correlation among node degree, copy number, etc. (all five species) Correlation between DDI node degree and interaction frequency: Correlation between DDI node degree and number of instances: When RNA polymerase domains are excluded –Degree and interaction frequency: –Degree and number of instances: Associativity: number of domains a domain appears together in peptide sequences –Ex) domain pkinase associates with 45 domains Node degree and associativity: Having a large number of domain partners does not mean a domain mediates many protein interactions nor it is associated with many other domains

Interaction propensity Between a pair of domains Only hetero-domain pairs are considered due to possible crystallization artifacts of homo-domain pairs Interaction propensity = |pair_occurrences| / ( |domain_0| * |domain_1| ) Domain0Domain1Pairs| Domain_0 || Domain_1|i-prop (%) Cyclin_NPkinase AnkPkinase PHRas Cyclin_CPkinase FGFIG ANKTIG ANKRHD SH2STAT_bind Sufficient selectivity can be encoded at the molecular level onto domain interaction Protein interactions mediated by domain interactions are very specific

Discussion A domain on average has a smaller number of interaction partners than proteins Only small number of protein interactions are mediated by domain interactions Domain composition and domain interactomes reflect evolutionary relationship between species Correlation among domain node degree, domain copy number, occurrences in interaction and number of associated domains were all very low Domain interaction is a scaffold and specificity is tuned up by atomic and residue level coding

Acknowledgement Prof. Sun Kim Prof. Haixu Tang Prof. Predrag Radivojac Prof. Mehmet Dalkilic Dr. John Colburne Prof. Marty Siegel Linda Hostetter