Network Measures Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Measures Klout.

Slides:



Advertisements
Similar presentations
Lower Bounds for Additive Spanners, Emulators, and More David P. Woodruff MIT and Tsinghua University To appear in FOCS, 2006.
Advertisements

CSE 211 Discrete Mathematics
Dr. Henry Hexmoor Department of Computer Science Southern Illinois University Carbondale Network Theory: Computational Phenomena and Processes Social Network.
Network Matrix and Graph. Network Size Network size – a number of actors (nodes) in a network, usually denoted as k or n Size is critical for the structure.
Midwestern State University Department of Computer Science Dr. Ranette Halverson CMPS 2433 – CHAPTER 4 GRAPHS 1.
Analysis and Modeling of Social Networks Foudalis Ilias.
Relationship Mining Network Analysis Week 5 Video 5.
V4 Matrix algorithms and graph partitioning
Absorbing Random walks Coverage
Networks. Graphs (undirected, unweighted) has a set of vertices V has a set of undirected, unweighted edges E graph G = (V, E), where.
DATA MINING LECTURE 12 Link Analysis Ranking Random walks.
Lecture 9 Measures and Metrics. Structural Metrics Degree distribution Average path length Centrality Degree, Eigenvector, Katz, Pagerank, Closeness,
Algorithmic and Economic Aspects of Networks Nicole Immorlica.
Applied Discrete Mathematics Week 12: Trees
Centrality and Prestige HCC Spring 2005 Wednesday, April 13, 2005 Aliseya Wright.
Centrality Measures These measure a nodes importance or prominence in the network. The more central a node is in a network the more significant it is to.
CSE 321 Discrete Structures Winter 2008 Lecture 25 Graph Theory.
Expanders Eliyahu Kiperwasser. What is it? Expanders are graphs with no small cuts. The later gives several unique traits to such graph, such as: – High.
Tirgul 13. Unweighted Graphs Wishful Thinking – you decide to go to work on your sun-tan in ‘ Hatzuk ’ beach in Tel-Aviv. Therefore, you take your swimming.
Using Social Networks to Analyze Sexual Relations by Amanda Dargie.
Applied Discrete Mathematics Week 12: Trees
CS8803-NS Network Science Fall 2013
Graphs, relations and matrices
Graph Essentials Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Graph Essentials Networks A network is a graph. – Elements of the network.
Applied Discrete Mathematics Week 10: Equivalence Relations
Social Media Mining Graph Essentials.
Graphs Chapter 10.
Graph Theoretic Concepts. What is a graph? A set of vertices (or nodes) linked by edges Mathematically, we often write G = (V,E)  V: set of vertices,
DATA MINING LECTURE 13 Absorbing Random walks Coverage.
Lectures 6 & 7 Centrality Measures Lectures 6 & 7 Centrality Measures February 2, 2009 Monojit Choudhury
© by Kenneth H. Rosen, Discrete Mathematics & its Applications, Sixth Edition, Mc Graw-Hill, 2007 Chapter 9 (Part 2): Graphs  Graph Terminology (9.2)
Lecture 5: Mathematics of Networks (Cont) CS 790g: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
School of Information Sciences University of Pittsburgh TELCOM2125: Network Science and Analysis Konstantinos Pelechrinis Spring 2013 Figures are taken.
Special Topics in Educational Data Mining HUDK5199 Spring 2013 March 25, 2012.
Lecture 13: Network centrality Slides are modified from Lada Adamic.
Data Structures & Algorithms Graphs
COSC 2007 Data Structures II Chapter 14 Graphs I.
Most of contents are provided by the website Graph Essentials TJTSD66: Advanced Topics in Social Media.
Slides are modified from Lada Adamic
Most of contents are provided by the website Network Models TJTSD66: Advanced Topics in Social Media (Social.
TELCOM2125: Network Science and Analysis
Class 2: Graph Theory IST402. Can one walk across the seven bridges and never cross the same bridge twice? Network Science: Graph Theory THE BRIDGES OF.
Informatics tools in network science
Chapter 9: Graphs.
Graphs Definition: a graph is an abstract representation of a set of objects where some pairs of the objects are connected by links. The interconnected.
Graphs. Graph Definitions A graph G is denoted by G = (V, E) where  V is the set of vertices or nodes of the graph  E is the set of edges or arcs connecting.
Importance Measures on Nodes Lecture 2 Srinivasan Parthasarathy 1.
Social Networks Strong and Weak Ties
School of Information Sciences University of Pittsburgh TELCOM2125: Network Science and Analysis Konstantinos Pelechrinis Spring 2016 Figures are taken.
GRAPH AND LINK MINING 1. Graphs - Basics 2 Undirected Graphs Undirected Graph: The edges are undirected pairs – they can be traversed in any direction.
Lecture 20. Graphs and network models 1. Recap Binary search tree is a special binary tree which is designed to make the search of elements or keys in.
Topics In Social Computing (67810) Module 1 (Structure) Centrality Measures, Graph Clustering Random Walks on Graphs.
Chapter Chapter Summary Graphs and Graph Models Graph Terminology and Special Types of Graphs Representing Graphs and Graph Isomorphism Connectivity.
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
Social Networks Analysis
Department of Computer and IT Engineering University of Kurdistan
Lecture 9 Measures and Metrics.
Link-Based Ranking Seminar Social Media Mining University UC3M
Network analysis.
Node Similarity Ralucca Gera,
Community detection in graphs
Lecture 10 Measures and Metrics.
Degree and Eigenvector Centrality
Network Science: A Short Introduction i3 Workshop
Section 7.12: Similarity By: Ralucca Gera, NPS.
Centrality in Social Networks
Graphs Chapter 11 Objectives Upon completion you will be able to:
Graph Operations And Representation
Graph and Link Mining.
Practical Applications Using igraph in R Roger Stanton
Presentation transcript:

Network Measures Social Media Mining

2 Measures and Metrics 2 Social Media Mining Network Measures Klout

3 Social Media Mining Measures and Metrics 3 Social Media Mining Network Measures Why Do We Need Measures? Who are the central figures (influential individuals) in the network? What interaction patterns are common in friends? Who are the like-minded users and how can we find these similar individuals? To answer these and similar questions, one first needs to define measures for quantifying centrality, level of interactions, and similarity, among others.

4 Social Media Mining Measures and Metrics 4 Social Media Mining Network Measures Centrality defines how important a node is within a network. Centrality

5 Social Media Mining Measures and Metrics 5 Social Media Mining Network Measures Degree Centrality The degree centrality measure ranks nodes with more connections higher in terms of centrality d i is the degree (number of adjacent edges) for vertex v i In this graph degree centrality for vertex v 1 is d 1 = 8 and for all others is d j = 1, j  1

6 Social Media Mining Measures and Metrics 6 Social Media Mining Network Measures Degree Centrality in Directed Graphs In directed graphs, we can either use the in- degree, the out-degree, or the combination as the degree centrality value: d out i is the number of outgoing links for vertex v i

7 Social Media Mining Measures and Metrics 7 Social Media Mining Network Measures Normalized Degree Centrality Normalized by the maximum possible degree Normalized by the maximum degree Normalized by the degree sum

8 Social Media Mining Measures and Metrics 8 Social Media Mining Network Measures Eigenvector Centrality Having more friends does not by itself guarantee that someone is more important, but having more important friends provides a stronger signal Eigenvector centrality tries to generalize degree centrality by incorporating the importance of the neighbors (undirected) For directed graphs, we can use incoming or outgoing edges C e (v i ): the eigenvector centrality of node v i : some fixed constant

9 Social Media Mining Measures and Metrics 9 Social Media Mining Network Measures Eigenvector Centrality, cont. Let T  This means that C e is an eigenvector of adjacency matrix A and is the corresponding eigenvalue Which eigenvalue-eigenvector pair should we choose?

10 Social Media Mining Measures and Metrics 10 Social Media Mining Network Measures Eigenvector Centrality, cont.

11 Social Media Mining Measures and Metrics 11 Social Media Mining Network Measures Eigenvector Centrality: Example = (2.68, -1.74, -1.27, 0.33, 0.00) Eigenvalues Vector max = 2.68

12 Social Media Mining Measures and Metrics 12 Social Media Mining Network Measures Katz Centrality A major problem with eigenvector centrality arises when it deals with directed graphs Centrality only passes over outgoing edges and in special cases such as when a node is in a weakly connected component centrality becomes zero even though the node can have many edge connected to it To resolve this problem we add bias term  to the centrality values for all nodes

13 Social Media Mining Measures and Metrics 13 Social Media Mining Network Measures Katz Centrality, cont. Bias termControlling term Rewriting equation in a vector form vector of all 1’s Katz centrality:

14 Social Media Mining Measures and Metrics 14 Social Media Mining Network Measures Katz Centrality, cont.

15 Social Media Mining Measures and Metrics 15 Social Media Mining Network Measures Katz Centrality Example

16 Social Media Mining Measures and Metrics 16 Social Media Mining Network Measures PageRank Problem with Katz Centrality: in directed graphs, once a node becomes an authority (high centrality), it passes all its centrality along all of its out-links This is less desirable since not everyone known by a well-known person is well-known To mitigate this problem we can divide the value of passed centrality by the number of outgoing links, i.e., out-degree of that node such that each connected neighbor gets a fraction of the source node’s centrality

17 Social Media Mining Measures and Metrics 17 Social Media Mining Network Measures PageRank, cont. What if the degree is zero?

18 Social Media Mining Measures and Metrics 18 Social Media Mining Network Measures PageRank Example

19 Social Media Mining Measures and Metrics 19 Social Media Mining Network Measures Betweenness Centrality Another way of looking at centrality is by considering how important nodes are in connecting other nodes the number of shortest paths from s to t that pass through v i the number of shortest paths from vertex s to t – a.k.a. information pathways

20 Social Media Mining Measures and Metrics 20 Social Media Mining Network Measures Normalizing Betweenness Centrality In the best case, node v i is on all shortest paths from s to t, hence, Therefore, the maximum value is 2 Betweenness centrality:

21 Social Media Mining Measures and Metrics 21 Social Media Mining Network Measures Betweenness Centrality Example

22 Social Media Mining Measures and Metrics 22 Social Media Mining Network Measures Closeness Centrality The intuition is that influential and central nodes can quickly reach other nodes These nodes should have a smaller average shortest path length to other nodes Closeness centrality:

23 Social Media Mining Measures and Metrics 23 Social Media Mining Network Measures Compute Closeness Centrality

24 Social Media Mining Measures and Metrics 24 Social Media Mining Network Measures Group Centrality All centrality measures defined so far measure centrality for a single node. These measures can be generalized for a group of nodes. A simple approach is to replace all nodes in a group with a super node – The group structure is disregarded. Let S denote the set of nodes in the group and V- S the set of outsiders

25 Social Media Mining Measures and Metrics 25 Social Media Mining Network Measures Group Centrality Group Degree Centrality – We can normalize it by dividing it by |V-S| Group Betweeness Centrality We can normalize it by dividing it by 2

26 Social Media Mining Measures and Metrics 26 Social Media Mining Network Measures Group Centrality Group Closeness Centrality – It is the average distance from non-members to the group One can also utilize the maximum distance or the average distance

27 Social Media Mining Measures and Metrics 27 Social Media Mining Network Measures Group Centrality Example Consider S={v2,v3} Group degree centrality=3 Group betweenness centrality = 3 Group closeness centrality = 1

28 Social Media Mining Measures and Metrics 28 Social Media Mining Network Measures Transitivity and Reciprocity

29 Social Media Mining Measures and Metrics 29 Social Media Mining Network Measures Transitivity Mathematic representation: – For a transitive relation R: In a social network: – Transitivity is when a friend of my friend is my friend – Transitivity in a social network leads to a denser graph, which in turn is closer to a complete graph – We can determine how close graphs are to the complete graph by measuring transitivity

30 Social Media Mining Measures and Metrics 30 Social Media Mining Network Measures [Global] Clustering Coefficient Clustering coefficient analyzes transitivity in an undirected graph – We measure it by counting paths of length two and check whether the third edge exists When counting triangles, since every triangle has 6 closed paths of length 2: In undirected networks:

31 Social Media Mining Measures and Metrics 31 Social Media Mining Network Measures [Global] Clustering Coefficient: Example

32 Social Media Mining Measures and Metrics 32 Social Media Mining Network Measures Local Clustering Coefficient Local clustering coefficient measures transitivity at the node level Commonly employed for undirected graphs, it computes how strongly neighbors of a node v (nodes adjacent to v) are themselves connected In an undirected graph, the denominator can be rewritten as:

33 Social Media Mining Measures and Metrics 33 Social Media Mining Network Measures Local Clustering Coefficient: Example Thin lines depict connections to neighbors Dashed lines are the missing connections among neighbors Solid lines indicate connected neighbors – When none of neighbors are connected C=0 – When all neighbors are connected C=1

34 Social Media Mining Measures and Metrics 34 Social Media Mining Network Measures Reciprocity If you become my friend, I’ll be yours Reciprocity is a more simplified version of transitivity as it considers closed loops of length 2 If node v is connected to node u, u by connecting to v, exhibits reciprocity

35 Social Media Mining Measures and Metrics 35 Social Media Mining Network Measures Reciprocity: Example Reciprocal nodes: v 1, v 2

36 Social Media Mining Measures and Metrics 36 Social Media Mining Network Measures Measuring stability based on an observed network Balance and Status

37 Social Media Mining Measures and Metrics 37 Social Media Mining Network Measures Social Balance Theory Social balance theory discusses consistency in friend/foe relationships among individuals. Informally, social balance theory says friend/foe relationships are consistent when In the network – Positive edges demonstrate friendships (w ij =1) – Negative edges demonstrate being enemies (w ij =-1) Triangle of nodes i, j, and k, is balanced, if and only if – w ij denotes the value of the edge between nodes i and j

38 Social Media Mining Measures and Metrics 38 Social Media Mining Network Measures Social Balance Theory: Possible Combinations For any cycle if the multiplication of edge values become positive, then the cycle is socially balanced

39 Social Media Mining Measures and Metrics 39 Social Media Mining Network Measures Social Status Theory Status defines how prestigious an individual is ranked within a society Social status theory measures how consistent individuals are in assigning status to their neighbors

40 Social Media Mining Measures and Metrics 40 Social Media Mining Network Measures Social Status Theory: Example A directed ‘+’ edge from node X to node Y shows that Y has a higher status than X and a ‘-’ one shows vice versa Unstable configurationStable configuration

41 Social Media Mining Measures and Metrics 41 Social Media Mining Network Measures How similar are two nodes in a network? Similarity

42 Social Media Mining Measures and Metrics 42 Social Media Mining Network Measures Structural Equivalence In structural equivalence, we look at the neighborhood shared by two nodes; the size of this neighborhood defines how similar two nodes are. For instance, two brothers have in common sisters, mother, father, grandparents, etc. This shows that they are similar, whereas two random male or female individuals do not have much in common and are not similar.

43 Social Media Mining Measures and Metrics 43 Social Media Mining Network Measures Structural Equivalence: Definitions Vertex similarity In general, the definition of neighborhood N(v) excludes the node itself v. – Nodes that are connected and do not share a neighbor will be assigned zero similarity – This can be rectified by assuming nodes to be included in their neighborhoods Jaccard Similarity: Cosine Similarity:

44 Social Media Mining Measures and Metrics 44 Social Media Mining Network Measures Similarity: Example

45 Social Media Mining Measures and Metrics 45 Social Media Mining Network Measures Normalized Similarity Comparing the calculated similarity value with its expected value where vertices pick their neighbors at random For vertices i and j with degrees d i and d j this expectation is d i d j /n

46 Social Media Mining Measures and Metrics 46 Social Media Mining Network Measures Normalized Similarity, cont.

47 Social Media Mining Measures and Metrics 47 Social Media Mining Network Measures Normalized Similarity, cont. Covariance between A i and A j Covariance can be normalized by the multiplication of Variances: Pearson correlation coefficient:   [-1,1]

48 Social Media Mining Measures and Metrics 48 Social Media Mining Network Measures Regular Equivalence In regular equivalence, we do not look at neighborhoods shared between individuals, but how neighborhoods themselves are similar For instance, athletes are similar not because they know each other in person, but since they know similar individuals, such as coaches, trainers, other players, etc.

49 Social Media Mining Measures and Metrics 49 Social Media Mining Network Measures v i, v j are similar when their neighbors v k and v l are similar The equation (left figure) is hard to solve since it is self referential so we relax our definition using the right figure Regular Equivalence

50 Social Media Mining Measures and Metrics 50 Social Media Mining Network Measures Regular Equivalence v i, v j are similar when v j is similar to v i ’s neighbors v k In vector format A vertex is highly similar to itself, we guarantee this by adding an identity matrix to the equation

51 Social Media Mining Measures and Metrics 51 Social Media Mining Network Measures Regular Equivalence v i, v j are similar when v j is similar to v i ’s neighbors v k In vector format A vertex is highly similar to itself, we guarantee this by adding an identity matrix to the equation

52 Social Media Mining Measures and Metrics 52 Social Media Mining Network Measures Regular Equivalence: Example Any row/column of this matrix shows the similarity to other vertices We can see that vertex 1 is most similar (other than itself) to vertices 2 and 3 Nodes 2 and 3 have the highest similarity The largest eigenvalue of A is 2.43 Set α = 0.4 < 1/2.43