Evolutionary Clues Embedded In Network Structure —— EPJB,85,106(2012) Zhu Guimei NGS Graduate School for Integrative Science & Engineering, Centre for.

Slides:



Advertisements
Similar presentations
Introduction to Network Theory: Modern Concepts, Algorithms
Advertisements

Analysis and Modeling of Social Networks Foudalis Ilias.
报告人: 林 苑 指导老师:章忠志 副教授 复旦大学  Introduction about random walks  Concepts  Applications  Our works  Fixed-trap problem  Multi-trap problem.
The multi-layered organization of information in living systems
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Information Networks Generative processes for Power Laws and Scale-Free networks Lecture 4.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
Hierarchy in networks Peter Náther, Mária Markošová, Boris Rudolf Vyjde : Physica A, dec
1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.
Topology Generation Suat Mercan. 2 Outline Motivation Topology Characterization Levels of Topology Modeling Techniques Types of Topology Generators.
Complex Networks Third Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Emergence of Scaling in Random Networks Barabasi & Albert Science, 1999 Routing map of the internet
Scale-free networks Péter Kómár Statistical physics seminar 07/10/2008.
A Real-life Application of Barabasi’s Scale-Free Power-Law Presentation for ENGS 112 Doug Madory Wed, 1 JUN 05 Fri, 27 MAY 05.
Network Morphospace Andrea Avena-Koenigsberger, Joaquin Goni Ricard Sole, Olaf Sporns Tung Hoang Spring 2015.
Network Statistics Gesine Reinert. Yeast protein interactions.
Regulatory networks 10/29/07. Definition of a module Module here has broader meanings than before. A functional module is a discrete entity whose function.
Modularity in Biological networks.  Hypothesis: Biological function are carried by discrete functional modules.  Hartwell, L.-H., Hopfield, J. J., Leibler,
Global topological properties of biological networks.
Topologically biased random walks with application for community finding Vinko Zlatić Dep. Of Physics, “Sapienza”, Roma, Italia Theoretical Physics Division,
CSE 222 Systems Programming Graph Theory Basics Dr. Jim Holten.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Optimization Based Modeling of Social Network Yong-Yeol Ahn, Hawoong Jeong.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
ANALYZING PROTEIN NETWORK ROBUSTNESS USING GRAPH SPECTRUM Jingchun Chen The Ohio State University, Columbus, Ohio Institute.
Community detection algorithms: a comparative analysis Santo Fortunato.
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
Lecture7 Topic1: Graph spectral analysis/Graph spectral clustering and its application to metabolic networks Topic 2: Different centrality measures of.
Using Graph Theory to Study Neural Networks (Watrous, Tandon, Conner, Pieters & Ekstrom, 2012)
A Graph-based Friend Recommendation System Using Genetic Algorithm
Part 1: Biological Networks 1.Protein-protein interaction networks 2.Regulatory networks 3.Expression networks 4.Metabolic networks 5.… more biological.
Self-Similarity of Complex Networks Maksim Kitsak Advisor: H. Eugene Stanley Collaborators: Shlomo Havlin Gerald Paul Zhenhua Wu Yiping Chen Guanliang.
Listen to the noise: Bridge dynamics and topology of complex networks Jie Ren ( 任 捷 ) NUS Graduate School for Integrative Sciences & Engineering National.
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
10 December, 2008 CIMCA2008 (Vienna) 1 Statistical Inferences by Gaussian Markov Random Fields on Complex Networks Kazuyuki Tanaka, Takafumi Usui, Muneki.
Algorithms for Biological Networks Prof. Tijana Milenković Computer Science and Engineering University of Notre Dame Fall 2010.
Networks Igor Segota Statistical physics presentation.
Class 9: Barabasi-Albert Model-Part I
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Lecture 3 1.Different centrality measures of nodes 2.Hierarchical Clustering 3.Line graphs.
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
LECTURE 2 1.Complex Network Models 2.Properties of Protein-Protein Interaction Networks.
Introduction to biological molecular networks
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Complex brain networks: graph theoretical analysis of structural and functional systems.
Bioinformatics Center Institute for Chemical Research Kyoto University
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Transport in weighted networks: optimal path and superhighways Collaborators: Z. Wu, Y. Chen, E. Lopez, S. Carmi, L.A. Braunstein, S. Buldyrev, H. E. Stanley.
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Netlogo demo. Complexity and Networks Melanie Mitchell Portland State University and Santa Fe Institute.
Fractal Networks: Structures, Modeling, and Dynamics 章 忠 志 复旦大学计算机科学技术学院 Homepage:
Response network emerging from simple perturbation Seung-Woo Son Complex System and Statistical Physics Lab., Dept. Physics, KAIST, Daejeon , Korea.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
Graph clustering to detect network modules
Network (graph) Models
Structures of Networks
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
Biological networks CS 5263 Bioinformatics.
Network biology : protein – protein interactions
Random walks on complex networks
Modelling Structure and Function in Complex Networks
Anastasia Baryshnikova  Cell Systems 
Presentation transcript:

Evolutionary Clues Embedded In Network Structure —— EPJB,85,106(2012) Zhu Guimei NGS Graduate School for Integrative Science & Engineering, Centre for Computational Sciences & Engineering, National University of Singapore 1

2 Introductions Localizations on complex networks Evolutionary ages Conclustions Outline

3 Objective and Scopes  Detecting structural patterns at different scales :multi-scale structure  Finding an intresting network evolution mechanisms based on multi- scale structure networks.

4 Complex Networks Protein networks Internet networks scientific collaborations networks A B C Real-World Networks Communication networks: telephone, internet, www… Transportation networks: airports, highways, rail, electric power … Biological networks: genetic,protein-protein interaction, metabolic… Social networks: friendship networks, collaboration networks…

? Function Dynamics Structure Mass Energy Signal Information… Structure, Functions, Dynamics Degree Motif Modularity … Dynamic Process at different structure scale So Structure measures is the cornerstone for understanding the relations between structure, dynamic, function 5

Microscopic Macroscopic How to measure: Multi-Scale Structure ? what is a Meso (midterm) pattern? Degree Motif clustering coefficient … Modules ? Dynamics on Different Structure Scales: 6

7 Define the Mesoscopic pattern  In Physics: Mesoscopic has been well defined Materials that have a relatively intermediate length scale in condensed matter physics:  BUT in Complex networks: not yet well defined size between molecules microns We detect different structures patterns through localization method.

We map networks to large clusters (nodes as atoms; edges as bonds) Huckel Model Consider an undirected complex network with N identical nodes, topological structure can be described by an adjacency matrix (or Laplace matrix ). For an electron moving in such a molecule, the tight-binding Hamiltonian is : Detect structure through localization: how? 8  Adjacent matrix (if nodes i and j are connected, is 1, otherwise 0. Diagonal member all are 0)  Laplace matrix:

Dynamics on networks: Diffusive process Transport processes on networks: from micro- to macro- scales Structures of networks: Motif module Micro-scalemacro-scale Different Eigenvalues represented different structure patterns Emergence of different scale structures on complex Networks Laplace Matrix Diffusive process ….. 2.Tight-binding Hamiltonian Huckle Model 9

How to describe Localizations on complex networks? 10 The localization properties of electrons in the clusters can be used as measures of the structural properties of the networks. detect different structure patterns from the spectra of complex networks. The eigenvalues of L can be ranked as, They correspond to the eigenfunctions from high to low energies.

11  Eigenvectors associated with small eigenvalues, usually have large wavelengths, and so they are sensitive to perturbation on a large size of nodes in networks.  Eigenvectors associated with large eigenvalues, have small wavelengths, are most sensitive to localized perturbations that are applied to a small set of nodes in the network.  Hence, the eigenvalues from to can detect the structural patterns from macro- to micro-scales. Different Eigenvalues represented different structure Scale patterns, how? In Eigen space: (for complex networks)  each eigenstate represents a specific wave function,  they are sensitive to the structural patterns matching in size with its wavelength.

Eigenvalues sensitive to structural patterns matching in size with its wave length (a): The eigenstates on a perfect regular network are periodic waves with the wavelengths from to 2. (b): we construct a local deformation in the segment from the 40th to the 60th node by adding edges. the eigenstates with large values localize mainly in this region (local peak) 12

13 Methods to detect Multi-scale structures:  Standardized The components of the eigenvectors:  The components of every eigenvector of L:  For each scale structure, the components of the nodes involved in it are distinguishably large compared with others. Hence, the -based results are robustness.  The nodes with large values of standardized components ( ) are regarded as the nodes involved in the corresponding scale structure.  Then a threshold can be used to identify the nodes involved in the scale structure, respectively. : is value of the largest component.

14 The Santa Fe Cooperation network (part) :41 ~47 (blue), :1 ~ 6 (magenta), :48 ~ 53 (violet)... We consider a part of the largest component of the Santa Fe Institute collaboration network, N=76 largest eigenvalues can detect the three hubs 40, 7 and 67 (red color). :involves a group of nodes numbered 17 ~ 25 (green nodes), : nodes 26 ~29 and 34 also (cyan) With the decrease of eigenvalue, clusters in much larger scale can be identified (not shown).

15  Three scale-free networks : With edge density w = 2, 4, 8, (a–c) average evolutionary ages, (d–f) average degree (on a logarithmic scale), (g–i) size of eigenvector versus the eigenvalue index i. Eigenvectors associated with large eigenvalues generally have small sizes, but their ages are “older” in the network. Eigenmodes and Average Evolutionary Age: BA Scale Free network

16  Eigenvalue compared with Degree: to describe the Average Evolutionary Age Eigenmodes and Average Evolutionary Age: BA Scale Free network

17 Eigenmodes and Average Evolutionary Age: Scale-free networks generated by other mechanism  Scale-free networks generated by duplication/divergence-based mechanism from PPI network of the Baker’s Yeast,  (d) Average age versus degree. Because of large fluctuation, the degree cannot give age-related information, but the eigenvalues can.

18 Yeast 11k network:  original: 5400 yeast proteins : interaction.  focused on interactions with high and medium confidence among 2617 proteins.  But finally, we only consider the part of the largest component of 2235 proteins from the 2617 proteins. Y11k: PPI network: Evolutionary Age Protein-protein interaction networks: Isotemporal Classification of Proteins  First, classified all yeast proteins into four isotemporal categories: prokaryotes, eukarya, fungi, yeast only (the yeast without annotation).  Based on the university tree of life, we assign evolutionary age 4,3,2,1 from ancient to modern for each group of prokaryotes-4, eukarya-3, fungi-3, and yeast only-1, respectively. (1). C. von Mering et al., Nature 417, 399 (2002).

19  For the largest connected component of the PPI network of the baker’s yeast with 2235 nodes,  (d) Average age versus degree. We see that degree does not reveal age-related information. Eigenmodes and Average Evolutionary Age: PPI Network

20 Summary The localization properties of the eigenvectors from high to low energies can detect patterns from micro- to macro- scales. Interestingly, the patterns contains significant clues of evolutionary ages.

21  (1) G.M. Zhu, H.J. Yang, R.Yang, J. Ren, B. Li, and Y.-C. Lai, European Physical Journal B, 85, 106 (2012).  (2). G.M. Zhu, H.J. Yang, C. Yin, B. Li, Localizations on Complex Networks, Phys. Rev. E 77, (2008).  (3). H.J. Yang, C. Yin, G.M. Zhu, B. Li, Phys. Rev. E 77, (R) (2008) References

22

23

Complex Networks: Nontrivial Properties A: random; small-world; scale-free (power law degree distribution); B: motif, modularity, hierarchy, C: fractal properties, and so on. ….. Santo Fortunato, Physics Reports 486 (2010) ER random networks, N=100, link connect ion probability p=0.02 SW networks, link rewiring probability r=0.1 BA scale free network, N=100, average degree w=2 cauliflower are fractal in nature.. self similarity Hierarchical networks A B C 24

25 Complex Networks: Basic Concepts Structure Description Hierarchical Description: Module Function

Graph theory Degree Clustering coefficient Shortest path Small-world Scale-free Bioinformatics Motif What is more? Social Nets Community Hierarchy clustering micro Node/edge-based average macro global (Newman) Dynamics: Micro To Macro Dynamics process is the bridge between structure and functions Structure Multi-Scale Measures R. Albert and A. -L. Barabasi, Rev. Mod. Phys. 47(2002); M. Newman, SIAM Review 45, (2003); C. Song, et. al., Nature 433,6392(2005); Nature Physics 2,275(2006). 26

27 What is a Mesoscopic pattern ?  In Physics: Mesoscopic has been well defined Materials that have a relatively intermediate length scale in condensed matter physics:  In complex networks: not yet well defined size between a quantity of atoms such as moleculesmaterials measuring microns Could regard it as community in complex networks (but there are also other formations like trees or stars structures) We define it as intermediate length scale structures based on structure induced localization.

28 What is community on complex networks? groups of vertices : characterized by having more internal than external connections between them.  Share common properties  and/or play similar roles within the graph. Community(clusters, modules) Fortunato, S., and C. Castellano, 2009,(Springer, Berlin, Germany), volume 1, eprint arXiv: Santo Fortunato, Physics Reports 486 (2010) Community detect methods  Graph partitioning,  hierarchical clustering  Partitioning clustering  Spectral clustering It is a hot topic but even the definition of a community is a controversial issue. people are still improving the methods to detect the true communities in real world.

29 Y11k: PPI network: multi-scale analysis

30 How to Detect community? Several methods Traditional Methods Graph partitioning: dividing the vertices in g groups of predefined size Hierarchical clustering : definition of a similarity measure between vertices Partitional clustering : separate the points in k clusters such to maximize or minimize a given cost function based on distances between points. Spectral Clustering : eigenvectors of matrix Adjacent or Laplace. Modularity-based methodsModularity optimization Modifications of modularity Limits of Modularity Spectral algorithms: Use the eigenvalue and eigenvectors Divisive algorithms The algorithm of Girvan and Newman : according to the values of measures of edge centrality, estimating the importance of edges according to some property or process running on the graph Ahn, Y. Y., J. P. Bagrow, et al. (2010). "Link communities reveal multiscale complexity in networks." Nature 466(7307): 761-U711.

Small world, scale free, whole –cell networks 31 WSSW model: we construct first a regular circular lattice with each node connecting with its d right-handed nearest neighbors. For each edge we rewire it with probability to another randomly selected node. Self- and double edges are forbidden. The BASF : preferential growth mechanism. Starting from several connected nodes as a seed, at each growth step a new node is added and w edges are established between this node and the existing network. The probability for an existing node to be connected with the new node is proportional to its degree. Self- and double edges are forbidden. For the resulting networks, the number of edges per node obeys a power law. Whole-cell networks: consider cellular functions such as intermediate metabolism and bioenergetics, information pathways, electron transport, and transmembrane transport. The directed edges are replaced simply with nondirected edges. We consider only cellular networks with sizes larger than 500.

Statistical properties of the spectra (localized) (extended) where s is the NNLS and the characteristic distribution width. In order to obtain the value of, we use the accumulated function: some trivial calculations lead to: Fig. Value of Brody parameter versus network parameters pr and w. (a) WSSW and (b) BASF networks. From this formula, we can determine the values of and. The PDF of the Nearest Neighbor Level Spacing(NNLS) distribution obeys the Brody distribution: 32

Wavelets Transform 33 Wavelets are mathematical functions that cut up data into different frequency components, and then study each component with a resolution matched to its scale. They have advantages over traditional Fourier methods in analyzing physical situations where the signal contains discontinuities and sharp spikes. Wavelets were developed independently in the fields of mathematics, quantum physics, electrical engineering, and seismic geology. Interchanges between these fields during the last ten years have led to many new wavelet applications such as image compression, turbulence, human vision, radar, and earthquake prediction.

WT can detect the fractal properties based on the ascending-order-ranked series. As a standard procedure, we first find the WT maximal values : The fractal dimension (statistical subsets properties) can be obtained through the Legendre transform : Fig5. The branched multifractal behavior for the whole cell network of M. jannaschii is presented as a typical example. Fractals properties on networks: Wavelet transform (WT) We assume the probability values has been sorted in ascending order: where a is the given scale. The partition function should scale in the limit of small scales as Local Hurst exponent h: denotes local subsets Positive q, reflects the scaling of large fluctuations Negative q, reflects the scaling of small fluctuations 34

35 Structure, Functions, Dynamics Structural measures: (cornerstone for understanding the relations) Degree Clustering coefficient Shortest path Dynamics: (can be regarded as the transport progresses of ) Mass Energy Singal Informations And so on Dynamic diffusive Process at different structure scale Functions ? L. K. Gallos, C. Song, S. Havlin, Proc. Natl. Acad. Sci. U.S.A. 104, 7746 (2007). H. Yang, C. Yin, G. Zhu, and B. Li, Phys. Rev. E 77,045101(R) (2008) Zhu, G.M., Yang H., Yin C., Li B., Physical Review E, (6)

DNA sequence Proteins Functions Protein-protein interaction networks Functions of Proteins realized by Protein-protein interactions Y1 Y2 Y3 1.Signal transduction: interactions between signaling molecules 2. Protein complex * One carries another, e.g, from cytoplasm to nucleuscytoplasmnucleus * One modify another * complex formation often serves to activate or inhibit one or more of the associated proteins Protein-protein interactions Protein-protein interaction networks 36

Metabolic networks (life processes) metabolism of an organism, the basic chemical system that generates essential components (1) such as amino acids, sugars and lipids, (2) and the energy required to synthesize them (3) and to use them in creating proteins and cellular structures. This system of connected chemical reactions is a metabolic network. 37