A Method for Protein Functional Flow Configuration and Validation
Woo-Hyuk Jang 1, Suk-Hoon Jung 1, Dong-Soo Han 1


1 School of Engineering, Information and Communications University, 119, Munjiro, Yuseong-gu, Daejeon, Korea

ABSTRACT

Background

In early studies of protein-protein interaction (PPI) prediction, many techniques relied on only a few protein features (e.g., domain frequency in the interacting protein pair) and therefore suffered from low prediction accuracy. Recent research increasingly considers the physicochemical properties of proteins and produces high-resolution results by integrating experimental data. Given this trend, a complete PPI network for each organism is likely to be configured in the near future.

Signal transduction is the process by which a cell changes in response to an external stimulus, and it plays an important role in most fundamental cellular processes. Most signal transduction is initiated by an extracellular signal; ligand-receptor binding then triggers a cascade of intracellular activities such as protein phosphorylation and dephosphorylation, PPI, and protein-small molecule interaction. Because a signal transduction pathway is a cascade of protein activities, identifying the participants in a specific signal transduction and their relationships within the whole PPI network is essential work.

However, even when the PPI network is completely configured, extracting a signal transduction pathway is a complicated problem, because the PPI network is only a set of co-expressed proteins or gene products, and its links denote simple physical binding rather than in-depth knowledge of the biological process. Consequently, most signal transduction pathways have so far been discovered manually. To overcome this problem, we suggest a protein functional flow model that can serve as a constraint for extracting meaningful sub-paths from the whole PPI network.
The suggested model is a directed network based on the relations among protein functions in signal transduction pathways.

With PPI databases growing explosively, the computational prediction and configuration of PPI networks has become a major stream in bioinformatics. Recent research increasingly considers the physicochemical properties of proteins and produces high-resolution results by integrating experimental data. Given this trend, a complete PPI network for each organism is likely to be configured in the near future. However, applying the PPI network directly to real problems is complicated, because the PPI network is only a set of co-expressed proteins or gene products, and its links denote simple physical binding rather than in-depth knowledge of the biological process. In this paper, we suggest a protein functional flow model, a directed network based on the relations among protein functions in signal transduction pathways. Each vertex of the model is a molecular function annotated by Gene Ontology (GO), and the relations among the vertices are the edges. It is therefore easy to trace the transition of a specific function, and the model can serve as a constraint for extracting a meaningful sub-path from the whole PPI network. To evaluate the model, 11 functional flow models of Homo sapiens were built from KEGG, and Cronbach's alpha values were measured (alpha = 0.67). Among 1023 functional flows, 765 showed alpha values of 0.6 or higher.

Motivation & Related Work

▣ A conventional PPI network cannot be applied directly to real problems such as signal transduction pathway prediction or metabolic pathway prediction, because it lacks in-depth biological knowledge annotations.
▣ Researchers' focus is moving from inspecting the possibility of a single protein-protein interaction to extracting meaningful sub-networks from the whole protein interaction network.

Figure 1.
To meet the needs of applied areas, researchers' focus is moving from single protein pairs to functionally related sub-networks. In this situation, it is essential to develop a reference model or rules for navigating the paths.

▣ Protein pairs show specific functional patterns on the PPI network.
▣ Interacting genes were correlated with GO annotations (~12% of interacting genes had exactly the same annotations; 27% had very similar annotations) [1].
▣ Functional patterns were found in the PPI network and compared with random patterns with respect to MIPS and KEGG, respectively [2].
▣ Functional template modeling by abstraction of enzyme functions [3].

Figure 2. Gene Ontology (GO) annotations are hierarchically related to each other, so a function can be replaced by its parent function. With this strategy, they ultimately derive the most general functional template, the Pathway Functionality Template (PFT).

Functional Flow

▣ Concept 1: Meaningful protein interaction paths, such as signal transduction pathways, contain functional flow patterns.
▣ Concept 2: If a certain protein pair shows a functional pattern, the relation type of this pair (e.g., activation) is likely to be found in other protein pairs that have the same functional pattern.
▣ Concept 3: Usually, multiple molecular functions are annotated for one protein, but only one of them is selected by the partner protein.

Figure 3. Some proteins and their relationships in the Hedgehog signaling pathway.
  Proteins: RAB23, SMO (SMOH), SHH (HPE3, HLP3), PTCH2, LRP2.
  Relation types: inhibition, dissociation, activation, binding/association.
  Accessions: Q68DJ6, Q9ULC3; A4D1K5, Q99835; Q13635, Q9Y6C5; Q14623, Q43323, … (4); P98164.
  GO molecular functions: protein binding, receptor activity, transmembrane receptor activity, cholesterol binding, patched binding, laminin-1 binding; one entry is marked Unknown.

▣ Figure 3 shows some proteins and their relationships in the Hedgehog signal transduction pathway.
Each protein has general molecular functions, and these functions are linked by relations such as inhibition or dissociation.
▣ Based on the "binding/association" relation between LRP2 and SHH, we consider that the protein binding (GO: ), cholesterol binding (GO: ), patched binding (GO: ), and laminin-1 binding (GO: ) functions have a "binding/association" relation. Proteins whose function is unknown were removed manually, and the number of redundant occurrences of a functional flow was used as its weight score. In this way, a total of 1023 functional flows were extracted from 11 H. sapiens signal transduction pathways in the KEGG database.

[1] Tong, et al., "Global mapping of the yeast genetic interaction network", Science.
[2] Mehmet E Turanalp and Tolga Can, "Discovering functional interaction patterns in protein-protein interaction networks", BMC Bioinformatics.
[3] Ali Cakmak and Gultekin Ozsoyoglu (CS, USA), "Mining biological networks for unknown pathways", Bioinformatics, 2007, 23:20.

▣ This research was financially supported by the Ministry of Education, Science and Technology (MEST) and the Korea Industrial Technology Foundation (KOTEF) through the Human Resource Training Project for Regional Innovation.

Validation

▣ Top 10 functional flows, including and excluding the general function protein binding (GO: )

  Including protein binding
  Function 1   Function 2   Sub type                       Count
  GO:                       phosphorylation                35
  GO:                       activation                     29
  GO:                       compound                       16
  GO:          GO:          compound                       13
  GO:                       inhibition                     12
  GO:                       binding/association            11
  GO:          GO:          compound                       10
  GO:          GO:          phosphorylation                9
  GO:          GO:          phosphorylation                9
  GO:          GO:          phosphorylation                8

  Excluding protein binding
  Function 1   Function 2   Sub type                       Count
  GO:          GO:          inhibition/dephosphorylation   5
  GO:          GO:          phosphorylation                5
  GO:          GO:          phosphorylation                5
  GO:          GO:          activation                     4
  GO:          GO:          activation                     4
  GO:          GO:          compound                       4
  GO:          GO:          compound                       4
  GO:          GO:          compound                       4
  GO:          GO:          compound                       4
  GO:          GO:          compound                       4

▣ The internal integrity of the functional flows was measured with Cronbach's alpha. Cronbach's alpha checks the consistency, or similarity, of individual questionnaire items when a single concept is probed by many different items.
In its standard form, Cronbach's alpha is

$$\alpha = \frac{N}{N-1}\left(1 - \frac{\sum_{i=1}^{N} \sigma_{Y_i}^2}{\sigma_X^2}\right)$$

where the variables correspond to the following:
  N = the total number of functional flows extracted from a specific signal transduction pathway
  \sigma_{Y_i}^2 = the variance of a specific functional flow across the 11 signal transduction pathways
  \sigma_X^2 = the variance of all functional flows in a specific signal transduction pathway

▣ Note that when the relation type of a functional flow conflicts with its type in another signal transduction pathway, we decrease its appearance value, which ranges from 11 down to zero. The average alpha over all functional flows was 0.67, and 765 functional flows had alpha values of 0.6 or higher.

Intelligent Service Integration Laboratory,
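
The extraction and validation steps described in the Functional Flow and Validation sections can be sketched roughly as follows. This is a minimal Python sketch under stated assumptions, not the authors' implementation: the function names `extract_functional_flows`, `sample_variance`, and `cronbach_alpha`, the toy annotations for LRP2 and SHH, the direction of the LRP2/SHH relation, and the small score matrix are all hypothetical and chosen only for illustration. The first function expands each protein-level relation into function-level flows and counts repeats as a weight; the last computes Cronbach's alpha in its standard form with sample variances. How the poster arranges appearance values into a score matrix is not specified, so the matrix here is purely illustrative.

```python
from collections import Counter
from itertools import product


def extract_functional_flows(relations, annotations):
    """Expand protein-level relations into weighted function-level flows.

    relations:   iterable of (source_protein, target_protein, relation_type)
    annotations: dict mapping a protein name to its GO molecular function names
    Returns a Counter keyed by (function_1, function_2, relation_type); the
    count of repeated flows plays the role of the weight score.
    """
    flows = Counter()
    for source, target, relation in relations:
        # Proteins without known functions contribute no flows (assumption:
        # this stands in for the manual removal described in the text).
        source_functions = annotations.get(source, ())
        target_functions = annotations.get(target, ())
        for f1, f2 in product(source_functions, target_functions):
            flows[(f1, f2, relation)] += 1
    return flows


def sample_variance(values):
    """Unbiased sample variance (divides by n - 1)."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / (len(values) - 1)


def cronbach_alpha(scores):
    """Standard Cronbach's alpha; rows are observations, columns are items."""
    n_items = len(scores[0])
    item_variances = [sample_variance(column) for column in zip(*scores)]
    total_variance = sample_variance([sum(row) for row in scores])
    return n_items / (n_items - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical toy input: the annotation lists are assumed for illustration.
annotations = {
    "LRP2": ["protein binding"],
    "SHH": ["cholesterol binding", "patched binding", "laminin-1 binding"],
}
relations = [("LRP2", "SHH", "binding/association")]
flows = extract_functional_flows(relations, annotations)

# Hypothetical score matrix: three observations of three perfectly
# consistent items, used only to exercise the formula.
scores = [[1, 2, 3], [2, 3, 4], [3, 4, 5]]
alpha = cronbach_alpha(scores)
```

In this toy input, one protein-level relation yields three function-level flows, each with weight 1, and the perfectly consistent score matrix gives an alpha of 1 up to floating-point rounding. Real input would need the GO identifiers and relation subtypes taken from the pathway data itself.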