Peer-to-Peer and Social Networks

Slides:



Advertisements
Similar presentations
CSE 211 Discrete Mathematics
Advertisements

Lecture 7. Network Flows We consider a network with directed edges. Every edge has a capacity. If there is an edge from i to j, there is an edge from.
Social network partition Presenter: Xiaofei Cao Partick Berg.
Graph Partitioning Problems Lecture 18: March 14 s1 s3 s4 s2 T1 T4 T2 T3 s1 s4 s2 s3 t3 t1 t2 t4 A region R1 R2 C1 C2.
Network Matrix and Graph. Network Size Network size – a number of actors (nodes) in a network, usually denoted as k or n Size is critical for the structure.
Lecture 5 Graph Theory. Graphs Graphs are the most useful model with computer science such as logical design, formal languages, communication network,
Approximation Algoirthms: Graph Partitioning Problems Lecture 17: March 16 s1 s3 s4 s2 T1 T4 T2 T3.
1 EE5900 Advanced Embedded System For Smart Infrastructure Static Scheduling.
Community Detection Laks V.S. Lakshmanan (based on Girvan & Newman. Finding and evaluating community structure in networks. Physical Review E 69,
Peer-to-Peer and Social Networks Centrality measures.
Graph Partitioning Dr. Frank McCown Intro to Web Science Harding University This work is licensed under Creative Commons Attribution-NonCommercial 3.0Attribution-NonCommercial.
Information Networks Graph Clustering Lecture 14.
V4 Matrix algorithms and graph partitioning
Complexity 15-1 Complexity Andrei Bulatov Hierarchy Theorem.
HCS Clustering Algorithm
Maximum Flows Lecture 4: Jan 19. Network transmission Given a directed graph G A source node s A sink node t Goal: To send as much information from s.
Lecture 8. Why do we need residual networks? Residual networks allow one to reverse flows if necessary. If we have taken a bad path then residual networks.
Time-Constrained Flooding A.Mehta and E. Wagner. Time-Constrained Flooding: Problem Definition ●Devise an algorithm that provides a subgraph containing.
MAX FLOW CS302, Spring 2013 David Kauchak. Admin.
Programming for Geographical Information Analysis: Advanced Skills Online mini-lecture: Introduction to Networks Dr Andy Evans.
CS774. Markov Random Field : Theory and Application Lecture 13 Kyomin Jung KAIST Oct
Boundary Recognition in Sensor Networks by Topology Methods Yue Wang, Jie Gao Dept. of Computer Science Stony Brook University Stony Brook, NY Joseph S.B.
Clustering Spatial Data Using Random Walk David Harel and Yehuda Koren KDD 2001.
Researchers: Preet Bola Mike Earnest Kevin Varela-O’Hara Han Zou Advisor: Walter Rusin Data Storage Networks.
Network Flow How to solve maximal flow and minimal cut problems.
1 EE5900 Advanced Embedded System For Smart Infrastructure Static Scheduling.
Outline Standard 2-way minimum graph cut problem. Applications to problems in computer vision Classical algorithms from the theory literature A new algorithm.
How to Analyse Social Network? Social networks can be represented by complex networks.
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
Informatics tools in network science
Network Theory: Community Detection Dr. Henry Hexmoor Department of Computer Science Southern Illinois University Carbondale.
Example Apply hierarchical clustering with d min to below data where c=3. Nearest neighbor clustering d min d max will form elongated clusters!
Network Partition –Finding modules of the network. Graph Clustering –Partition graphs according to the connectivity. –Nodes within a cluster is highly.
Social Networks Strong and Weak Ties
Topics In Social Computing (67810) Module 1 (Structure) Centrality Measures, Graph Clustering Random Walks on Graphs.
1 New metrics for characterizing the significance of nodes in wireless networks via path-based neighborhood analysis Leandros A. Maglaras 1 Dimitrios Katsaros.
Graph clustering to detect network modules
Social Media Analytics
School of Computing Clemson University Fall, 2012
What Are Routers? Routers are an intermediate system at the network layer that is used to connect networks together based on a common network layer protocol.
Network Simplex Animations
Department of Computer and IT Engineering University of Kurdistan
Graph theory Definitions Trees, cycles, directed graphs.
Routing: Distance Vector Algorithm
Chapter 5. Optimal Matchings
Community detection in graphs
Shortest Path Problems
Multi-Core Parallel Routing
Network Science: A Short Introduction i3 Workshop
Finding modules on graphs
Michael L. Nelson CS 495/595 Old Dominion University
Instructor: Shengyu Zhang
Centralities (4) Ralucca Gera,
Graph Partitioning Problems
Section 14.4 Trees.
Lecture 31: Graph-Based Image Segmentation
Shortest Path Problems
Algorithms for Budget-Constrained Survivable Topology Design
Piyush Kumar (Lecture 6: MaxFlow MinCut Applications)
Algorithms (2IL15) – Lecture 7
EE5900 Advanced Embedded System For Smart Infrastructure
Min Global Cut Animation
Social Network Analysis - Lecture 2 - *
GRAPHS Lecture 17 CS2110 Spring 2018.
Warm Up – Tuesday Find the critical times for each vertex.
Winter 2019 Lecture 11 Minimum Spanning Trees (Part II)
Section 13.4 Trees.
Hierarchical Clustering
Concepts of Computation
Autumn 2019 Lecture 11 Minimum Spanning Trees (Part II)
Presentation transcript:

Peer-to-Peer and Social Networks Centrality measures

Centrality measures Centrality is related to the potential importance of a node. Some nodes have greater “influence” over others compared to the rest, or are more easily accessible to other, or act as a go-between in most node-to-node communications. These are represented by various centrality measures. Some important ones are Degree centrality Closeness centrality Betweenness centrality

Degree centrality The more neighbors a given node has, the greater is its influence. In human society, a person with a large number of friends is believed to be in a favorable position compared to persons with fewer. This leads to the idea of degree centrality, which refers to the degree of a given node in the graph representing a social network.

Closeness centrality Nodes that are able to reach other nodes via shorter paths, or who are “more reachable” by other nodes via shorter paths, are in more favored positions. This structural advantage can be translated into power, and it leads to the notion of closeness centrality.

Closeness centrality normalized 6 4/6 4 1 6 4/6 5 4/5 5 4/5 Let denote the set of nodes, and represent the distance between and . A yardstick of the closeness of a node from the other nodes is . This number is normalized in various ways: one is . However, a problem occurs when the graph is partitioned, so an alternative is to consider the inverse of it i.e. as shown above.

Betweenness centrality For each pair of nodes in a social network, consider one of the shortest paths –- all nodes in this path are intermediaries. The node that belongs to the shortest paths between the maximum number of such communications, is a special node – it is a potential deal maker and is in a special position since most other nodes have to channel their communications through it. Such a node has a high betweenness centrality.

Computing betweenness centrality From each source node u to each destination node v, push 1 unit of flow via shortest paths. If there are multiple shortest paths, then the flow will be evenly split. The amount of flow handled by a node or an edge is a measure of its betweenness centrality.

Computing betweenness centrality ½ ½ ½ ½ ½ ½ Betweenness centrality of C with respect to (A, G) If each node sends 1 unit of flow to every other node (excluding C), then C will handle only 2. (1 + + ) = 4 units of flow. Why? ½ ½

Random Walk Betweenness Centrality Proposed by Newman, the random walk betweenness is centrality is based on random walks, counting how often a given node is traversed by a random walk between two other nodes. This is in contrast with the deterministic approach in The previous slide that is based on shortest paths. Unlike the shortest path based centrality, random walk based centrality does not require the knowledge of the topology of the network.

Community detection A co-authorship network of physicists and mathematicians (Courtesy: Easley & Kleinberg)

What is a community? Informally a community is a “tightly-knit region” of the network. How to identify such a region, and how to separate them from a different tightly-knit region? It depends on how we quantify the notion of tightly-knit. Visual check of the graph is not enough, and can sometimes be misleading. Also, such regions can be nested.

What is a community? Formally, given a network with n nodes and m edges, a community detection algorithm finds subgroups of nodes C1, C2, …, Cp (communities) that satisfy the following two constraints: Ci ∩ Cj = ∅ for i ≠ j, and ∪iCi = set of all nodes in the network. Overlapping communities are not impossible, but are generally not addressed by the community detection algorithms.

(Courtesy: Easley & Kleinberg) What is a community? Removal of a bridge separates the graph into disjoint components bridge An example of a nested structure of the communities (Courtesy: Easley & Kleinberg)

(Courtesy: Easley & Kleinberg) What is a community? A-B is not a bridge, but a local bridge A, B have no common friends (Courtesy: Easley & Kleinberg)

(Courtesy: Easley & Kleinberg) What is a community? Is there a bridge here? A dispute caused the club to split into two clubs. How can you identify them? (Courtesy: Easley & Kleinberg)

Community detection Girvan-Newman Method Remove the edges of highest betweenness first. Repeat the same step with the remainder graph. Continue this until the graph breaks down into individual nodes. As the graph breaks down into pieces, the tightly knit community structure is exposed.

Girvan Newman method: An example Betweenness(7-8)= 7x7 = 49 Betweenness(1-3) = 1X12=12 Betweenness(3-7)=betweenness(6-7)=betweenness(8-9) = betweenness(8-12)= 3X11=33

Girvan Newman method: An example Betweenness(1-3) = 1X5=5 Betweenness(3-7)=betweenness(6-7)=betweenness(8-9) = betweenness(8-12)= 3X4=12

Girvan Newman method: An example Betweenness of every edge = 1

Girvan Newman method: An example

Another example 5X5=25

Another example 5X6=30 5X6=30

Another example

Revisiting Karate Club Network A set of minimum total strength whose removal will disconnect the rival leaders. Zachary followed this to identify the split.

Min Cut Problem

Revisiting Karate Club Network Zachary initially assigned weight to each edge and then partitioned the graph by computing the min-cut. Successive application of this produces a dendogram A Dendogram

Revisiting Karate Club Network Girvan-Newman’s community detection algorithm produced almost the exact solution, except for one node (node 9) However, it is not computationally efficient.