Presentation is loading. Please wait.

Presentation is loading. Please wait.

A Method for Protein Functional Flow Configuration and Validation Woo-Hyuk Jang 1 Suk-Hoon Jung 1 Dong-Soo Han 1

Similar presentations


Presentation on theme: "A Method for Protein Functional Flow Configuration and Validation Woo-Hyuk Jang 1 Suk-Hoon Jung 1 Dong-Soo Han 1"— Presentation transcript:

1 A Method for Protein Functional Flow Configuration and Validation Woo-Hyuk Jang 1 Suk-Hoon Jung 1 Dong-Soo Han 1 toraim@icu.ac.kr jsh@icu.ac.krdshan@icu.ac.kr 1 School of Engineering, Information and Communications University, 119, Munjiro,Yuseong-gu, Daejeon, 305-714, Korea ABSTRACT Background At the early studies of PPI prediction, many prediction techniques were developed based mainly on a few features of a protein (i.e., domain frequency in the interaction protein pair), so they suffered from low prediction accuracy problem. However, recent researches gradually consider physicochemical properties of proteins and support high resolution results with integration of experimental results. With regard to current research trend, it is very close future to complete a PPI network configuration of each organism. The signal transduction is a process which describes a cell change by external stimulus, and it plays an important reference roll of most fundamental cellular processes. Most of signal transduction are initiated by extra cellular signal, and cascade intracellular activities by ligand-receptor binding are followed such as protein phosphorylation and de-phosphorylation, PPI, and protein-small molecules’ interaction. Given the fact that signal transduction pathway is protein’s cascading activities, identifying participants and their relationship for a specific signal transduction from whole PPI network is an essential work. However, even though the PPI network is completely configured, extracting signal transduction pathway is complicated problem because PPI network is only a set of co-expressive proteins or gene products, and its network link means simple physical binding rather than in-depth knowledge of biological process. Thus, most of the target signal transductions have been manually discovered so far. To overcome the problem, we suggest a protein functional flow model which can be a constraint to extract meaningful sub paths from whole PPI network. Suggested model is a directed network based on a protein functions’ relation of signaling transduction pathway. With explosively growing PPI databases, the computational approach for a prediction and configuration of PPI network has been a big stream in the bioinformatics area. Recent researches gradually consider physicochemical properties of proteins and support high resolution results with integration of experimental results. With regard to current research trend, it is very close future to complete a PPI network configuration of each organism. However, direct applying the PPI network to real field is a complicated problem because PPI network is only a set of co-expressive proteins or gene products, and its network link means simple physical binding rather than in-depth knowledge of biological process. In this paper, we suggest a protein functional flow model which is a directed network based on a protein functions’ relation of signaling transduction pathway. The vertex of the suggested model is a molecular function annotated by gene ontology, and the relations among the vertexes are considered as edges. Thus, it is easy to trace a specific function’s transition, and it can be a constraint to extract a meaningful sub-path from whole PPI network. To evaluate the model, 11 functional flow models of Homo sapiens were built from KEGG, and Chronbach’s alpha values were measured (alpha=0.67). Among 1023 functional flows, 765 functional flows showed 0.6 or higher alpha values Motivation & Related Work ▣ Conventional PPI network can not be directly applied to real fields such as signaling transduction pathway prediction or metabolic pathway prediction, because of a lack of in- depth biological knowledge annotations. ▣ Researchers’ focuses are moving from a single protein-protein interaction possibility inspection to a extracting meaningful sub networks against whole protein interaction network. Figure 1. To fulfill applied area’s needs, researchers’ focuses are moving from single protein pairs to functionally related sub-networks. In this situation, it is essential to develop a reference model or rules to navigate the paths. ▣ Protein pairs show a specific functional patterns on the PPI network. ▣ Correlated interacting genes with GO annotations (~12% of interacting genes had exactly same annotations; 27% had very similar annotations)[1]. ▣ Found functional patterns from PPI network, and compared them to random patterns respect to MIPS and KEGG respectively[2]. ▣ Functional template modeling by abstraction of enzyme functions[3]. Figure 2. Gene Ontology (GO) annotations have hierarchical relationship each other, so one function can be replaced to its parent function. With this strategy, they finally make the most general functional template, the Pathway Functionality Template (PFT). Functional Flow ▣ Concept 1: There are functional flow patterns in the meaningful protein interaction path such as signaling transduction pathway. ▣ Concept 2: If a certain protein pair show a functional pattern, a relation type (i.e. activation) of this pair is similarly detected to other protein pairs which have same functional patterns. ▣ Concept 3: Usually, multiple molecular functions are annotated for one protein, but only one function of them would be selected by other protein. RAB23 SMO (SMOH) SHH (HPE3, HLP3) PTCH2 LRP2 inhibition dissociatio n activation binding/associatio n Q68DJ6, Q9ULC3 A4D1K5, Q99835 Q13635, Q9Y6C5 Q14623, Q43323, … (4) P98164 GO:0005515, GO:0004872 GO:0005515, GO:0004872, GO:0004888 GO:0015485, GO:0005113, GO:0043237 GO:0005515, Unknown dissociatio n GO:0005515, protein binding GO:0004872, receptor activity GO:0004888, transmembrane receptor activity GO:0015485, cholesterol binding GO:0005113, patched binding GO:0043237, laminin-1 binding Figure 3. Some proteins and their relationship in Hedgehog Signaling Pathway. ▣ Figure 3 shows some proteins and their relationship of Hedgehog signaling transduction pathway. Each protein has general molecular functions and the functions have relations such as inhibition or dissociation. [1] Tong, et al., “Global mapping of the yeast genetic interaction network”, Science, 2004. [2] Mehmet E Turanalp and Tolga Can, “Discovering functional interaction patterns in protein-protein interaction networks”, BMC Bioinformatics, 2008. [3] Ali Cakmak and Gultekin Ozsoyoglu(CS, USA), “Mining biological networks for unknown pathways”, BIOINFORMATICS, 2007, 23:20. ▣ This research was financially supported by the Ministry of Education, Science Technology (MEST) and Korea Industrial Technology Foundation (KOTEF) through the Human Recourse Training Project For Regional Innovation. ▣ Based on the relation “binding/association” between “LRP2” and “SHH”, we consider that protein binding(GO:0005515), cholesterol binding(GO:0015485), patched binding(GO:0005113) and laminin-1 binding(GO:0043237) functions have “binding/association” relation. Proteins whose function is unknown were manually removed, and redundant count of functional flows was utilized as a weight score. Similarly, total 1023 functional flow were extracted from 11 H. sapiens signaling transduction pathways of KEGG database. Validation ▣ Top 10 Function Flows including & excluding general function Protein Binding (GO:0005515) Function 1Function 2Sub TypeCount GO:0005515 phophorylation35 GO:0005515 activation29 GO:0005515 compound16 GO:0004435GO:0005515compound13 GO:0005515 inhibition12 GO:0005515 Binding/association11 GO:0004722GO:0005515compound10 GO:0005515GO:0004674phosphorylation9 GO:0005515GO:0003700phosphorylation9 GO:0005515GO:0004672phosphorylation8 Function 1Function 2Sub TypeCount GO:0004722GO:0004708Inhibition/dephosphorylation5 GO:0000287GO:0004708phosphorylation5 GO:0004707GO:0003700phosphorylation5 GO:0046332GO:0043565activation4 GO:0046332GO:0003700activation4 GO:0004722GO:0005545compound4 GO:0004722GO:0005545compound4 GO:0004722GO:0005158compound4 GO:0004722GO:0016303compound4 GO:0004722GO:0019903compound4 ▣ Internal integrity of functional flow was measured via Chronbach’s alpha value. Chronbach’s alpha value checks integrity or similarity of each questionnaire when single concept is asked by many different questionnaires. The variables of Chronbach’s alpha values correspond to followings. N = total count of functional flows which extracted from a specific signaling transduction pathway = a variance of a specific functional flows out of 11 signaling transduction pathways. = a variance of all functional flows in a specific signaling transduction pathway. ▣ Note that the type of a certain functional flow has conflict in other signaling transduction pathway, we decrease a appearance values from 11 to zero. The average alpha of overall functional flows was 0.67, and 765 functional flows had 0.6 or higher alpha values. Intelligent Service Integration Laboratory, http://isilab.icu.ac.kr


Download ppt "A Method for Protein Functional Flow Configuration and Validation Woo-Hyuk Jang 1 Suk-Hoon Jung 1 Dong-Soo Han 1"

Similar presentations


Ads by Google