Exploiting indirect neighbors and topological weight to predict protein function from protein– protein interactions Hon Nian Chua, Wing-Kin Sung and Limsoon.

Slides:

Advertisements

Similar presentations

Optical networks: Basics of WDM

Advertisements

Multi-Document Person Name Resolution Michael Ben Fleischman (MIT), Eduard Hovy (USC) From Proceedings of ACL-42 Reference Resolution workshop 2004.

+ Multi-label Classification using Adaptive Neighborhoods Tanwistha Saha, Huzefa Rangwala and Carlotta Domeniconi Department of Computer Science George.

Linked data: P redicting missing properties Klemen Simonic, Jan Rupnik, Primoz Skraba {klemen.simonic, jan.rupnik,

CSE 5243 (AU 14) Graph Basics and a Gentle Introduction to PageRank 1.

Optimization Problems in Optical Networks. Wavelength Division Multiplexing (WDM) Directed: Symmetric: Optic Fiber.

VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.

Rumor Routing in Sensor Networks David Braginsky and Deborah Estrin LECS – UCLA Modified and Presented by Sugata Hazarika.

1 Modularity and Community Structure in Networks* Final project *Based on a paper by M.E.J Newman in PNAS 2006.

Structural Inference of Hierarchies in Networks BY Yu Shuzhi 27, Mar 2014.

Networks. Graphs (undirected, unweighted) has a set of vertices V has a set of undirected, unweighted edges E graph G = (V, E), where.

Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.

Correlated Mutations and Co-evolution May 1 st, 2002.

Comparison of Networks Across Species CS374 Presentation October 26, 2006 Chuan Sheng Foo.

Network Statistics Gesine Reinert. Yeast protein interactions.

More routing protocols Alec Woo June 18 th, 2002.

1 University of Freiburg Computer Networks and Telematics Prof. Christian Schindelhauer Mobile Ad Hoc Networks Theory of Interferences, Trade-Offs between.

Semantic text features from small world graphs Jure Leskovec, IJS + CMU John Shawe-Taylor, Southampton.

Modularity in Biological networks.  Hypothesis: Biological function are carried by discrete functional modules.  Hartwell, L.-H., Hopfield, J. J., Leibler,

Copyright  2004 limsoon wong Assessing Reliability of Protein- Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research.

Computing Trust in Social Networks

Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.

Taming the Underlying Challenges of Reliable Multihop Routing in Sensor Networks.

Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.

Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.

On Distinguishing between Internet Power Law B Bu and Towsley Infocom 2002 Presented by.

Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.

Database k-Nearest Neighbors in Uncertain Graphs Lin Yincheng VLDB10.

HCC class lecture 22 comments John Canny 4/13/05.

Mining Graphs with Constrains on Symmetry and Diameter Natalia Vanetik Deutsche Telecom Laboratories at Ben-Gurion University IWGD10 workshop July 14th,

Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.

Network Measures Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Measures Klout.

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5: Global Routing © KLMH Lienig 1 FLUTE: Fast Lookup Table Based RSMT Algorithm.

The Relative Vertex-to-Vertex Clustering Value 1 A New Criterion for the Fast Detection of Functional Modules in Protein Interaction Networks Zina Mohamed.

Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.

Link Recommendation In P2P Social Networks Yusuf Aytaş, Hakan Ferhatosmanoğlu, Özgür Ulusoy Bilkent University, Ankara, Turkey.

Biological Networks Lectures 6-7 : February 02, 2010 Graph Algorithms Review Global Network Properties Local Network Properties 1.

Outlier Detection Using k-Nearest Neighbour Graph Ville Hautamäki, Ismo Kärkkäinen and Pasi Fränti Department of Computer Science University of Joensuu,

Molecular evidence for endosymbiosis Perform blastp to investigate sequence similarity among domains of life Found yeast nuclear genes exhibit more sequence.

ZORRO : A masking program for incorporating Alignment Accuracy in Phylogenetic Inference Sourav Chatterji Martin Wu.

Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.

Science: Graph theory and networks Dr Andy Evans.

The Link Prediction Problem for Social Networks David Libel-Nowell, MIT John Klienberg, Cornell Saswat Mishra sxm

Using Graph Theory to Study Neural Networks (Watrous, Tandon, Conner, Pieters & Ekstrom, 2012)

Soft Computing Lecture 14 Clustering and model ART.

Medstar: a prototype for biomedical social network Xiaoli Li Institute for Infocomm Research A*Star, Singapore.

Sharon Bruckner, Bastian Kayser, Tim Conrad Freie Uni. Berlin Finding Modules in Networks with Non-modular Regions.

IMPROVED RECONSTRUCTION OF IN SILICO GENE REGULATORY NETWORKS BY INTEGRATING KNOCKOUT AND PERTURBATION DATA Yip, K. Y., Alexander, R. P., Yan, K. K., &

. Finding Motifs in Promoter Regions Libi Hertzberg Or Zuk.

Preserving Privacy and Social Influence Isabelle Stanton.

Design of a Robust Search Algorithm for P2P Networks

Speaker : Yu-Hui Chen Authors : Dinuka A. Soysa, Denis Guangyin Chen, Oscar C. Au, and Amine Bermak From : 2013 IEEE Symposium on Computational Intelligence.

Create and assess protein networks through molecular characteristics of individual proteins Yanay Ofran et al. ISMB ’06 Presenter: Danhua Guo 12/07/2006.

Progress Report ekker. Problem Definition In cases such as object recognition, we can not include all possible objects for training. So transfer learning.

Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey

Mining Coherent Dense Subgraphs across Multiple Biological Networks Vahid Mirjalili CSE 891.

Network-based Prediction of Protein Function by Roded Sharan, Igor Ulitsky and Ron Shamir Molecular Systems Biology2007.

Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama

Random Walks on Graphs.

Biological networks CS 5263 Bioinformatics.

Community detection in graphs

Network Science: A Short Introduction i3 Workshop

Section 8.6 of Newman’s book: Clustering Coefficients

Greedy Algorithms / Dijkstra’s Algorithm Yin Tat Lee

Department of Computer Science University of York

N-Gram Model Formulas Word sequences Chain rule of probability

Clustering Coefficients

SEG5010 Presentation Zhou Lanjun.

CS224w: Social and Information Network Analysis

CSE 373: Data Structures and Algorithms

Presentation transcript:

Exploiting indirect neighbors and topological weight to predict protein function from protein– protein interactions Hon Nian Chua, Wing-Kin Sung and Limsoon Wong

Motivation Predicting the protein function from Protein- protein interaction data. Previous studies  considers level 1 neighbors Can level-2 neighbors play an significant role in this prediction?

Summarizing the output of the study level-2 neighbors does show functional association. A significant no. Proteins were observed to be having associations with level-2 neighbors but not with level-1 neighbors. A predicting algorithm: 1) weight Level 1 & 2 neighbors based on functional similarity. 2) each function was also allotted a score based on its weighted frequency in neighbors

Conventional approaches using only direct interactions i.e level-1 neighbors Consider a radius in the interaction neighborhood network Calculate a functional distance and use clustering to make some functional classes.

Protein-Protein interactions as an undirected graph G=(V,E) (u, v) as two protein nodes And edge e between them as interaction U and v being, K-level neighbors– concept of path with k-edges between u and v. Set of neighbors-- Sk

Indirect Functional Association

Significance out of 4162 annotated proteins, only 1999 or 48% share some function with level-1 neighbors.

Sets of neighborhood pairs

Simple neighbor counting Discuss– M and N M- total predicted N-total functions known

The Algorithm 1) Functional similarity Weight Previous approaches use CD-distance between proteins u and v given by

A simple example

When a fraction ‘x’ of protein’s ‘u’s neighbors is common to protein ‘v’s neighbors then x is proportional to the probability that u’s functions are shared with v through common neighbors. (and vice versa for y protion of v ‘s neighbor common with neighbor of u)

2) integrating reliability of experimental sources: The prediction results can be improved by taking differences in reliability of sources into account. So between u and v, the reliability of the interaction is estimated as: i  source no. Euv  set of sources with interaction u, v n  no. Of times in which interaction btween u and v was observed

So, integrated equation becomes

Transitive functional Association If u is similar to w and w is similar to v then there can be a similarity between u and v given by:

Functional Similarity Weighted Averaging the likelihood of protein p having function x: STR(u,v)  Transitive FS weight r_int  fraction of all the proteins who share this considered function Sigma(p,x) = 1 if p has function x else =0 Pi_x  frequency of function x in proteins

Results 1) ORIGINAL NEIGHBOR COUNTING 2) Neighbor counting with FS-weight 3) scheme in (2)+ level-2 neighbors are considered.

Comparison with other schemes

Improvements? Threshold at level-2..