Presentation is loading. Please wait.

Presentation is loading. Please wait.

Evolutionary Clues Embedded In Network Structure —— EPJB,85,106(2012) Zhu Guimei NGS Graduate School for Integrative Science & Engineering, Centre for.

Similar presentations


Presentation on theme: "Evolutionary Clues Embedded In Network Structure —— EPJB,85,106(2012) Zhu Guimei NGS Graduate School for Integrative Science & Engineering, Centre for."— Presentation transcript:

1 Evolutionary Clues Embedded In Network Structure —— EPJB,85,106(2012) Zhu Guimei NGS Graduate School for Integrative Science & Engineering, Centre for Computational Sciences & Engineering, National University of Singapore 1

2 2 Introductions Localizations on complex networks Evolutionary ages Conclustions Outline

3 3 Objective and Scopes  Detecting structural patterns at different scales :multi-scale structure  Finding an intresting network evolution mechanisms based on multi- scale structure networks.

4 4 Complex Networks Protein networks Internet networks scientific collaborations networks A B C Real-World Networks Communication networks: telephone, internet, www… Transportation networks: airports, highways, rail, electric power … Biological networks: genetic,protein-protein interaction, metabolic… Social networks: friendship networks, collaboration networks…

5 ? Function Dynamics Structure Mass Energy Signal Information… Structure, Functions, Dynamics Degree Motif Modularity … Dynamic Process at different structure scale So Structure measures is the cornerstone for understanding the relations between structure, dynamic, function 5

6 Microscopic Macroscopic How to measure: Multi-Scale Structure ? what is a Meso (midterm) pattern? Degree Motif clustering coefficient … Modules ? Dynamics on Different Structure Scales: 6

7 7 Define the Mesoscopic pattern  In Physics: Mesoscopic has been well defined Materials that have a relatively intermediate length scale in condensed matter physics:  BUT in Complex networks: not yet well defined size between molecules microns We detect different structures patterns through localization method.

8 We map networks to large clusters (nodes as atoms; edges as bonds) Huckel Model Consider an undirected complex network with N identical nodes, topological structure can be described by an adjacency matrix (or Laplace matrix ). For an electron moving in such a molecule, the tight-binding Hamiltonian is : Detect structure through localization: how? 8  Adjacent matrix (if nodes i and j are connected, is 1, otherwise 0. Diagonal member all are 0)  Laplace matrix:

9 Dynamics on networks: Diffusive process Transport processes on networks: from micro- to macro- scales Structures of networks: Motif module Micro-scalemacro-scale Different Eigenvalues represented different structure patterns Emergence of different scale structures on complex Networks Laplace Matrix Diffusive process ….. 2.Tight-binding Hamiltonian Huckle Model 9

10 How to describe Localizations on complex networks? 10 The localization properties of electrons in the clusters can be used as measures of the structural properties of the networks. detect different structure patterns from the spectra of complex networks. The eigenvalues of L can be ranked as, They correspond to the eigenfunctions from high to low energies.

11 11  Eigenvectors associated with small eigenvalues, usually have large wavelengths, and so they are sensitive to perturbation on a large size of nodes in networks.  Eigenvectors associated with large eigenvalues, have small wavelengths, are most sensitive to localized perturbations that are applied to a small set of nodes in the network.  Hence, the eigenvalues from to can detect the structural patterns from macro- to micro-scales. Different Eigenvalues represented different structure Scale patterns, how? In Eigen space: (for complex networks)  each eigenstate represents a specific wave function,  they are sensitive to the structural patterns matching in size with its wavelength.

12 Eigenvalues sensitive to structural patterns matching in size with its wave length (a): The eigenstates on a perfect regular network are periodic waves with the wavelengths from to 2. (b): we construct a local deformation in the segment from the 40th to the 60th node by adding edges. the eigenstates with large values localize mainly in this region (local peak) 12

13 13 Methods to detect Multi-scale structures:  Standardized The components of the eigenvectors:  The components of every eigenvector of L:  For each scale structure, the components of the nodes involved in it are distinguishably large compared with others. Hence, the -based results are robustness.  The nodes with large values of standardized components ( ) are regarded as the nodes involved in the corresponding scale structure.  Then a threshold can be used to identify the nodes involved in the scale structure, respectively. : is value of the largest component.

14 14 The Santa Fe Cooperation network (part) :41 ~47 (blue), :1 ~ 6 (magenta), :48 ~ 53 (violet)... We consider a part of the largest component of the Santa Fe Institute collaboration network, N=76 largest eigenvalues can detect the three hubs 40, 7 and 67 (red color). :involves a group of nodes numbered 17 ~ 25 (green nodes), : nodes 26 ~29 and 34 also (cyan) With the decrease of eigenvalue, clusters in much larger scale can be identified (not shown).

15 15  Three scale-free networks : With edge density w = 2, 4, 8, (a–c) average evolutionary ages, (d–f) average degree (on a logarithmic scale), (g–i) size of eigenvector versus the eigenvalue index i. Eigenvectors associated with large eigenvalues generally have small sizes, but their ages are “older” in the network. Eigenmodes and Average Evolutionary Age: BA Scale Free network

16 16  Eigenvalue compared with Degree: to describe the Average Evolutionary Age Eigenmodes and Average Evolutionary Age: BA Scale Free network

17 17 Eigenmodes and Average Evolutionary Age: Scale-free networks generated by other mechanism  Scale-free networks generated by duplication/divergence-based mechanism from PPI network of the Baker’s Yeast,  (d) Average age versus degree. Because of large fluctuation, the degree cannot give age-related information, but the eigenvalues can.

18 18 Yeast 11k network:  original: 5400 yeast proteins : 80 000 interaction.  focused on 11 855 interactions with high and medium confidence among 2617 proteins.  But finally, we only consider the part of the largest component of 2235 proteins from the 2617 proteins. Y11k: PPI network: Evolutionary Age Protein-protein interaction networks: Isotemporal Classification of Proteins  First, classified all yeast proteins into four isotemporal categories: prokaryotes, eukarya, fungi, yeast only (the yeast without annotation).  Based on the university tree of life, we assign evolutionary age 4,3,2,1 from ancient to modern for each group of prokaryotes-4, eukarya-3, fungi-3, and yeast only-1, respectively. (1). C. von Mering et al., Nature 417, 399 (2002).

19 19  For the largest connected component of the PPI network of the baker’s yeast with 2235 nodes,  (d) Average age versus degree. We see that degree does not reveal age-related information. Eigenmodes and Average Evolutionary Age: PPI Network

20 20 Summary The localization properties of the eigenvectors from high to low energies can detect patterns from micro- to macro- scales. Interestingly, the patterns contains significant clues of evolutionary ages.

21 21  (1) G.M. Zhu, H.J. Yang, R.Yang, J. Ren, B. Li, and Y.-C. Lai, European Physical Journal B, 85, 106 (2012).  (2). G.M. Zhu, H.J. Yang, C. Yin, B. Li, Localizations on Complex Networks, Phys. Rev. E 77, 066113 (2008).  (3). H.J. Yang, C. Yin, G.M. Zhu, B. Li, Phys. Rev. E 77, 045101(R) (2008) References

22 22

23 23

24 Complex Networks: Nontrivial Properties A: random; small-world; scale-free (power law degree distribution); B: motif, modularity, hierarchy, C: fractal properties, and so on. ….. Santo Fortunato, Physics Reports 486 (2010) 75174 ER random networks, N=100, link connect ion probability p=0.02 SW networks, link rewiring probability r=0.1 BA scale free network, N=100, average degree w=2 cauliflower are fractal in nature.. self similarity Hierarchical networks A B C 24

25 25 Complex Networks: Basic Concepts Structure Description Hierarchical Description: Module Function

26 Graph theory Degree Clustering coefficient Shortest path Small-world Scale-free Bioinformatics Motif What is more? Social Nets Community Hierarchy clustering micro Node/edge-based average macro global (Newman) Dynamics: Micro To Macro Dynamics process is the bridge between structure and functions Structure Multi-Scale Measures R. Albert and A. -L. Barabasi, Rev. Mod. Phys. 47(2002); M. Newman, SIAM Review 45, 167-256 (2003); C. Song, et. al., Nature 433,6392(2005); Nature Physics 2,275(2006). 26

27 27 What is a Mesoscopic pattern ?  In Physics: Mesoscopic has been well defined Materials that have a relatively intermediate length scale in condensed matter physics:  In complex networks: not yet well defined size between a quantity of atoms such as moleculesmaterials measuring microns Could regard it as community in complex networks (but there are also other formations like trees or stars structures) We define it as intermediate length scale structures based on structure induced localization.

28 28 What is community on complex networks? groups of vertices : characterized by having more internal than external connections between them.  Share common properties  and/or play similar roles within the graph. Community(clusters, modules) Fortunato, S., and C. Castellano, 2009,(Springer, Berlin, Germany), volume 1, eprint arXiv:0712.2716. Santo Fortunato, Physics Reports 486 (2010) 75174 Community detect methods  Graph partitioning,  hierarchical clustering  Partitioning clustering  Spectral clustering It is a hot topic but even the definition of a community is a controversial issue. people are still improving the methods to detect the true communities in real world.

29 29 Y11k: PPI network: multi-scale analysis

30 30 How to Detect community? Several methods Traditional Methods Graph partitioning: dividing the vertices in g groups of predefined size Hierarchical clustering : definition of a similarity measure between vertices Partitional clustering : separate the points in k clusters such to maximize or minimize a given cost function based on distances between points. Spectral Clustering : eigenvectors of matrix Adjacent or Laplace. Modularity-based methodsModularity optimization Modifications of modularity Limits of Modularity Spectral algorithms: Use the eigenvalue and eigenvectors Divisive algorithms The algorithm of Girvan and Newman : according to the values of measures of edge centrality, estimating the importance of edges according to some property or process running on the graph Ahn, Y. Y., J. P. Bagrow, et al. (2010). "Link communities reveal multiscale complexity in networks." Nature 466(7307): 761-U711.

31 Small world, scale free, whole –cell networks 31 WSSW model: we construct first a regular circular lattice with each node connecting with its d right-handed nearest neighbors. For each edge we rewire it with probability to another randomly selected node. Self- and double edges are forbidden. The BASF : preferential growth mechanism. Starting from several connected nodes as a seed, at each growth step a new node is added and w edges are established between this node and the existing network. The probability for an existing node to be connected with the new node is proportional to its degree. Self- and double edges are forbidden. For the resulting networks, the number of edges per node obeys a power law. Whole-cell networks: consider cellular functions such as intermediate metabolism and bioenergetics, information pathways, electron transport, and transmembrane transport. The directed edges are replaced simply with nondirected edges. We consider only cellular networks with sizes larger than 500.

32 Statistical properties of the spectra (localized) (extended) where s is the NNLS and the characteristic distribution width. In order to obtain the value of, we use the accumulated function: some trivial calculations lead to: Fig. Value of Brody parameter versus network parameters pr and w. (a) WSSW and (b) BASF networks. From this formula, we can determine the values of and. The PDF of the Nearest Neighbor Level Spacing(NNLS) distribution obeys the Brody distribution: 32

33 Wavelets Transform 33 Wavelets are mathematical functions that cut up data into different frequency components, and then study each component with a resolution matched to its scale. They have advantages over traditional Fourier methods in analyzing physical situations where the signal contains discontinuities and sharp spikes. Wavelets were developed independently in the fields of mathematics, quantum physics, electrical engineering, and seismic geology. Interchanges between these fields during the last ten years have led to many new wavelet applications such as image compression, turbulence, human vision, radar, and earthquake prediction. http://www.amara.com/IEEEwave/IEEEwavelet.html

34 WT can detect the fractal properties based on the ascending-order-ranked series. As a standard procedure, we first find the WT maximal values : The fractal dimension (statistical subsets properties) can be obtained through the Legendre transform : Fig5. The branched multifractal behavior for the whole cell network of M. jannaschii is presented as a typical example. Fractals properties on networks: Wavelet transform (WT) We assume the probability values has been sorted in ascending order: where a is the given scale. The partition function should scale in the limit of small scales as Local Hurst exponent h: denotes local subsets Positive q, reflects the scaling of large fluctuations Negative q, reflects the scaling of small fluctuations 34

35 35 Structure, Functions, Dynamics Structural measures: (cornerstone for understanding the relations) Degree Clustering coefficient Shortest path Dynamics: (can be regarded as the transport progresses of ) Mass Energy Singal Informations And so on Dynamic diffusive Process at different structure scale Functions ? L. K. Gallos, C. Song, S. Havlin, Proc. Natl. Acad. Sci. U.S.A. 104, 7746 (2007). H. Yang, C. Yin, G. Zhu, and B. Li, Phys. Rev. E 77,045101(R) (2008) Zhu, G.M., Yang H., Yin C., Li B., Physical Review E, 2008. 77(6)

36 DNA sequence Proteins Functions Protein-protein interaction networks Functions of Proteins realized by Protein-protein interactions Y1 Y2 Y3 1.Signal transduction: interactions between signaling molecules 2. Protein complex * One carries another, e.g, from cytoplasm to nucleuscytoplasmnucleus * One modify another * complex formation often serves to activate or inhibit one or more of the associated proteins Protein-protein interactions Protein-protein interaction networks 36

37 Metabolic networks (life processes) metabolism of an organism, the basic chemical system that generates essential components (1) such as amino acids, sugars and lipids, (2) and the energy required to synthesize them (3) and to use them in creating proteins and cellular structures. This system of connected chemical reactions is a metabolic network. 37


Download ppt "Evolutionary Clues Embedded In Network Structure —— EPJB,85,106(2012) Zhu Guimei NGS Graduate School for Integrative Science & Engineering, Centre for."

Similar presentations


Ads by Google