Groups of vertices and Core-periphery structure

Slides:



Advertisements
Similar presentations
Analysis and Modeling of Social Networks Foudalis Ilias.
Advertisements

Community Detection Laks V.S. Lakshmanan (based on Girvan & Newman. Finding and evaluating community structure in networks. Physical Review E 69,
Online Social Networks and Media. Graph partitioning The general problem – Input: a graph G=(V,E) edge (u,v) denotes similarity between u and v weighted.
Information Networks Small World Networks Lecture 5.
Advanced Topics in Data Mining Special focus: Social Networks.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
Mining and Searching Massive Graphs (Networks)
Peer-to-Peer and Grid Computing Exercise Session 3 (TUD Student Use Only) ‏
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
CS8803-NS Network Science Fall 2013
Models of Influence in Online Social Networks
The Erdös-Rényi models
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Traceroute-like exploration of unknown networks: a statistical analysis A. Barrat, LPT, Université Paris-Sud, France I. Alvarez-Hamelin (LPT, France) L.
Author: M.E.J. Newman Presenter: Guoliang Liu Date:5/4/2012.
A Clustering Algorithm based on Graph Connectivity Balakrishna Thiagarajan Computer Science and Engineering State University of New York at Buffalo.
7.1 and 7.2: Spanning Trees. A network is a graph that is connected –The network must be a sub-graph of the original graph (its edges must come from the.
MEDUSA – New Model of Internet Topology Using k-shell Decomposition Shai Carmi Shlomo Havlin Bloomington 05/24/2005.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
A Graph-based Friend Recommendation System Using Genetic Algorithm
Jure Leskovec Computer Science Department Cornell University / Stanford University Joint work with: Jon Kleinberg (Cornell), Christos.
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
Networks Igor Segota Statistical physics presentation.
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Informatics tools in network science
Network Theory: Community Detection Dr. Henry Hexmoor Department of Computer Science Southern Illinois University Carbondale.
Class 2: Graph Theory IST402.
Network Partition –Finding modules of the network. Graph Clustering –Partition graphs according to the connectivity. –Nodes within a cluster is highly.
Response network emerging from simple perturbation Seung-Woo Son Complex System and Statistical Physics Lab., Dept. Physics, KAIST, Daejeon , Korea.
Lecture II Introduction to complex networks Santo Fortunato.
Graph clustering to detect network modules
Cohesive Subgraph Computation over Large Graphs
Finding Dense and Connected Subgraphs in Dual Networks
Groups of vertices and Core-periphery structure
Gyan Ranjan University of Minnesota, MN
Topics In Social Computing (67810)
Applications of graph theory in complex systems research
Department of Computer and IT Engineering University of Kurdistan
Empirical analysis of Chinese airport network as a complex weighted network Methodology Section Presented by Di Li.
Greedy Algorithm for Community Detection
June 2017 High Density Clusters.
How Do “Real” Networks Look?
Section 8.6: Clustering Coefficients
Node Similarity Ralucca Gera,
Generative models preserving community structure
Community detection in graphs
Overview of Communities in networks
Centralities (2) Ralucca Gera,
Network Science: A Short Introduction i3 Workshop
Section 8.6 of Newman’s book: Clustering Coefficients
How Do “Real” Networks Look?
Section 7.12: Similarity By: Ralucca Gera, NPS.
How Do “Real” Networks Look?
The Importance of Communities for Learning to Influence
Models of Network Formation
Section 8.2: Shortest path and small world effect
Centralities (4) Ralucca Gera,
Models of Network Formation
Why Social Graphs Are Different Communities Finding Triangles
Statistical properties of network community structure
Models of Network Formation
How Do “Real” Networks Look?
Robustness or Network Resilience
Models of Network Formation
Clustering Coefficients
(Social) Networks Analysis II
Practical Applications Using igraph in R Roger Stanton
Affiliation Network Models of Clusters in Networks
Advanced Topics in Data Mining Special focus: Social Networks
Constructing a m-connected k-Dominating Set in Unit Disc Graphs
Presentation transcript:

Groups of vertices and Core-periphery structure By: Ralucca Gera, Applied math department, Naval Postgraduate School Monterey, CA, USA

Mostly observed real networks have: Why? Mostly observed real networks have: Heavy tail (powerlaw most probably, exponential) High clustering (high number of triangles especially in social networks, lower count otherwise) Small average path (usually small diameter) Communities/periphery/hierarchy Homophily and assortative mixing (similar nodes tend to be adjacent) Where does the structure come from? How do we model it?

Macro and Meso Scale properties Macro Scale properties (using all the interactions): Small world (small average path, high clustering) Powerlaw degree distr. (generally pref. attachment) Meso Scale properties applying to groups (using k-clique, k-core, k-dense): Community structure Core-periphery structure Micro Scale properties applying to small units: Edge properties (such as who it connects, being a bridge) Node properties (such as degree, cut-vertex)

Some local and global metrics pertaining to structure of networks Structure they capture Local Statistics Global statistics Direct influence General feel for the distribution of the edges Vertex degree, in and out degree Degree distribution Closeness, distance between nodes Geodesic (path) Distance (numerical value) Diameter, radius, average path length Connectedness of the network How critical are vertices to the connectedness of the graph? How much damage can a network take before disconnecting? Existence of a bridge Existence of a cut vertex Cut sets Tight node/edge neighborhoods, important nodes as a group Clique, plex, core, community, k-dense (for edges) Community detection

Clusters and matrices 1 In a very clustered graph, the adjacency matrix can be put in a block form (identifying communities). Source: Guido Caldarelli, Communities and Clustering in Some social Networks, NetSci 2007 New York, May 20th 2007

Definitions Newman’s book uses k-component, k-cliques, k-plexes and k-cores to refer to a set of vertices with some properties. In graph theory (and research papers) we use a clique to be the set of vertices and edges, so a clique is actually a graph (or subgraph) Either way it works, the graph captures more information, and I will refer them as graphs (induced by the nodes in the sets).

Groups and subgroups identifications Some common approaches to subgroup identification and analysis: K-cliques K-cores (k-shell) K-denseness Components (and k-components) Community detection They are used to explore how large networks can be built up out of small and tight groups.

Components and k-components Section 7.8.2 Components and k-components

Components Recall that a graph is k-connected/k-component if it can be disconnected by removal of k vertices, and no k-1 vertices can disconnect it. Component is a maximal size connected subgraph A k-component (k-connected component) is a connected maximal subgraph that can be disconnected (or we’re left with a 𝐾 1 ) by removal of k vertices, and no k-1 vertices can disconnect it. Alternatively: A k-component is a connected maximal subgraph such that there are k-vertex-independent paths between any two vertices

The k-component tells how robust a graph or subgraph is. In class exercise The k-component tells how robust a graph or subgraph is. Identify a subgraph that is either a: 1-connected 2-connected 3-connected 4-connected

Section 7.8.1 K-Clique and k-core

k-cliques A clique of size 𝑘: a subset of 𝑘 nodes, with every node adjacent to every other member of the subset (all 𝑘−1 one of them) We usually search for the maximum clique Hard to find (decision problem for the clique number is NP-Complete) Why is it hard to use this concept on real networks? Because one might not infer/know all the edges of the true network, so clique may exist but it may not be captured in the data to be analyzed A relaxed version of a clique might be just as useful in large networks.

Relaxed versions of a k-clique are k-dense and k-core In class exercise A clique of size 𝑘: a subset of 𝑘 nodes, with every node connected to every other member of the subset. Identify a: 1-clique 2-clique 3-clique 4-clique Relaxed versions of a k-clique are k-dense and k-core

Idea: enough edges are present to make a group strong. References: k-core 𝐴 𝑘-core: maximal subset of nodes, 𝑆, with deg 𝐺[𝑆] 𝑣 ≥𝑘 , where 𝐺[𝑆] is the subgraph induced by S. Idea: enough edges are present to make a group strong. References: [1] Bollobas, B. Graph Theory and Combinatorics: Proceedings of the Cambridge Combinatorial Conference in honor of P. Erdos, 35 (Academic, New York, 1984). [2] Seidman, S. B. Network structure and minimum degree. Social Networks 5, 269–287 (1983). [3] Carmi, S., Havlin, S, Kirkpatrick, S., Shavitt, Y. & Shir, E. A model of Internet topology using k-shell decomposition. Proc. Natl. Acad. Sci. USA 104, 11150-11154 (2007). [4] Angeles-Serrano, M. & Bogu˜n´a, M. Clustering in complex networks. II. Percolation properties. Phys. Rev. E 74, 056116 (2006).

Finding the core: eliminate lower-order k-cores until to find A 𝑘-core of size n: maximal subset of nodes 𝑘 with deg 𝐺[𝑆] 𝑣 ≥𝑘 , where 𝐺[𝑆] is the subgraph induced by 𝑆. Finding the core: eliminate lower-order k-cores until to find The highest k before the network vanishes. http://iopscience.iop.org/article/10.1088/1367-2630/14/8/083030

In class exercise The 𝑘-core of size n: maximal subset of nodes 𝑘 with deg 𝐺[𝑆] 𝑣 ≥𝑘 , where 𝐺[𝑆] is the subgraph induced by 𝑆. Identify the: 1-core 2-core 3-core 4-core

k-dense A k- dense sub-graph is a group of vertices, in which each pair of vertices {i, j} has at least 𝑘-2 common neighbors. Idea: pairwise friends (𝑘 –dense looks at edges rather than vertices in making them part of the 𝑘 group)

k-dense A k- dense sub-graph is a group of vertices, in which each pair of vertices has at least 𝑘-2 common neighbors. k - clique ⊂ k - dense ⊂ k – core

In class exercise A k- dense sub-graph is a group of vertices, in which each pair of vertices {i, j} has at least k-2 common neighbors. Identify a: 2-dense 3-dense 4-dense 5-dense

Other extensions https://academic.oup.com/comnet/article/doi/10.1093/comnet/cnt016/2392115/Structure-and-dynamics-of-core-periphery-networks

Using them globally

k-cliques, k-cores and k-dense clique of size 𝑘: maximal subset of nodes, with every node adjacent to every other member of the subset 𝑘 -core: maximal subset of nodes, with every node adjacent to at least 𝑘 others in the subset A 𝑘 - dense sub-graph is a group of vertices, in which each pair of vertices {i, j} has at least 𝑘−2 common neighbors.

Communities vs. core/dense/clique K-core/dense/clique: look inside the group of nodes Communities look both at internal and external ties (high internal and low external ties) Core-periphery decomposition also looking at internal and ext. to the core (doesn’t have to be a clique)

K-core (k-shell) decomposition The decomposition identifies the shells for different k-values. Generally (but not well defined): the core of the network (the 𝑘-core for the largest 𝑘) and the outer periphery (last layer: 1-core taking away the 2-core). There are modifications where several top values of 𝑘 make the core. http://3.bp.blogspot.com/-TIjz3nstWD0/ToGwUGivEjI/AAAAAAAAsWw/etkwklnPNw4/s1600/k-cores.png

K-core and degree (1) Reference: Alvarez-Hamelin, J. I., Dall´asta, L., Barrat, A. & Vespignani, A. Large scale networks fingerprinting and visualization using the k-core decomposition. Advances in Neural Information Processing Systems 18, 41–51 (2006) http://papers.nips.cc/paper/2789-large-scale-networks-fingerprinting-and-visualization-using-the-k-core-decomposition.pdf

k-core and degree (2) The degree is highly (and nonlinearly) correlated with the position of the node in the k-shell http://iopscience.iop.org/article/10.1088/1367-2630/14/8/083030

Core-periphery structure

Core-periphery decomposition The core-periphery decomposition captures the notion that many networks decompose into: a densely connected core, and a sparsely connected periphery [6], [12]. The core-periphery structure is a pervasive and crucial characteristic of large networks [13], [14], [15]. If overlapping communities are considered: the network core forms as a result of many overlapping communities http://ilpubs.stanford.edu:8090/1103/2/paper-IEEE-full.pdf

Core-periphery adjacency matrix dark blue = 1 (adjacent) white = 0 (nonadjacent)

Deciding on core-periphery How to decide if a network has core-periphery structure? Not well defined either, but generally the density of the 𝑘-core must be high: Checked by the high correlation, 𝜌 ,: 𝜌= 𝑖,𝑗 𝑎 𝑖𝑗 𝛿 𝑖𝑗 , where 𝑎 𝑖𝑗 is the (i,j) adjacency matrix entry, and 𝛿 𝑖𝑗 = 1, 𝑖𝑓 𝑛𝑜𝑑𝑒 𝑖 𝑜𝑟 𝑗 𝑖𝑠 𝑖𝑛 𝑡ℎ𝑒 𝑐𝑜𝑟𝑒 0, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒 http://www.sciencedirect.com/science/article/pii/S0378873399000192

Extensions of core-periphery?! Limitation: There are just two classes of nodes: core and periphery. Is a three-class partition consisting of core, semiperiphery, and periphery more realistic? Or even partitioning with more classes? The problem becomes more difficult as the number of classes is increased, and good justification is needed. http://www.sciencedirect.com/science/article/pii/S0378873399000192

Possible structures dark shade = 0 (nonadjacent) light shade = 1 (adjacent) From Aaron Clauset and Mason Porter

Core and communities The network core was traditionally viewed as a single giant community (lacking internal communities references [7], [8], [9], [10]). Yang and Leskovec (2014, reference [11]) showed that dense cores form as a result of many overlapping communities. Moreover, foodweb, social, and web networks exhibit a single dominant core, while protein-protein interaction and product co-purchasing networks contain many local cores formed around the central core http://ilpubs.stanford.edu:8090/1103/2/paper-IEEE-full.pdf

References M. E. Newman, Analysis of weighted networks Physical Review E, vol. 70, no. 5, 2004. Borgatti, Stephen P., and Martin G. Everett. "Models of core/periphery structures“ Social networks 21.4 (2000): 375-395. Csermely, Peter, et al. "Structure and dynamics of core/periphery networks.“ Journal of Complex Networks 1.2 (2013): 93-123. Kitsak, Maksim, et al. "Identification of influential spreaders in complex networks." Nature Physics 6.11 (2010): 888-893 S. B. Seidman, Network structure and minimum degree, Social networks, vol. 5, no. 3, pp. 269287, 1983 Borgatti, Stephen P., and Martin G. Everett. "Models of core/periphery structures." Social networks 21.4 (2000): 375-395. J. Leskovec, K. J. Lang, A. Dasgupta, and M. W. Mahoney, “Community structure in large networks: Natural cluster sizes and the absence of large well-defined clusters,” Internet Mathematics, vol. 6, no. 1, pp. 29–123, 2009. A. Clauset, M. Newman, and C. Moore, “Finding community structure in very large networks,” Physical Review E, vol. 70, p. 066111, 2004. M. Coscia, G. Rossetti, F. Giannotti, and D. Pedreschi, “Demon: a local-first discovery method for overlapping communities,” in Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2012, pp. 615–623. J. Leskovec, K. Lang, and M. Mahoney, “Empirical comparison of algorithms for network community detection,” in Proceedings of the International Conference on World Wide Web (WWW), 2010 Jaewon Yang and Jure Leskovec. “Overlapping Communities Explain Core-Periphery Organization of Networks” Proceedings of the IEEE (2014) P. Holme, “Core-periphery organization of complex networks,” Physical Review E, vol. 72, p. 046111, 2005. F. D. Rossa, F. Dercole, and C. Piccardi, “Profiling coreperiphery network structure by random walkers,” Scientific Reports, vol. 3, 2013. M. P. Rombach, M. A. Porter, J. H. Fowler, and P. J. Mucha, “Core-periphery structure in networks,” SIAM Journal of Applied Mathematics, vol. 74, no. 1, pp. 167–190, 2014