Lectures 6 & 7 Centrality Measures Lectures 6 & 7 Centrality Measures February 2, 2009 Monojit Choudhury

Slides:

Advertisements

Similar presentations

Dr. Henry Hexmoor Department of Computer Science Southern Illinois University Carbondale Network Theory: Computational Phenomena and Processes Social Network.

Advertisements

Network Matrix and Graph. Network Size Network size – a number of actors (nodes) in a network, usually denoted as k or n Size is critical for the structure.

CSE 5243 (AU 14) Graph Basics and a Gentle Introduction to PageRank 1.

Introduction to Network Theory: Modern Concepts, Algorithms

Graphs, Node importance, Link Analysis Ranking, Random walks

Link Analysis: PageRank

Experiments with MATLAB Experiments with MATLAB Google PageRank Roger Jang ( 張智星 ) CSIE Dept, National Taiwan University, Taiwan

DATA MINING LECTURE 12 Link Analysis Ranking Random walks.

Mining and Searching Massive Graphs (Networks)

1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 3 March 23, 2005

Social Networks 101 P ROF. J ASON H ARTLINE AND P ROF. N ICOLE I MMORLICA.

Algorithmic and Economic Aspects of Networks Nicole Immorlica.

Introduction to PageRank Algorithm and Programming Assignment 1 CSC4170 Web Intelligence and Social Computing Tutorial 4 Tutor: Tom Chao Zhou

Pádraig Cunningham University College Dublin Matrix Tutorial Transition Matrices Graphs Random Walks.

Multimedia Databases SVD II. Optimality of SVD Def: The Frobenius norm of a n x m matrix M is (reminder) The rank of a matrix M is the number of independent.

Introduction to Information Retrieval Introduction to Information Retrieval Hinrich Schütze and Christina Lioma Lecture 21: Link Analysis.

Zdravko Markov and Daniel T. Larose, Data Mining the Web: Uncovering Patterns in Web Content, Structure, and Usage, Wiley, Slides for Chapter 1:

Introduction to Graphs

1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 3 April 2, 2006

Multimedia Databases SVD II. SVD - Detailed outline Motivation Definition - properties Interpretation Complexity Case studies SVD properties More case.

Centrality Measures These measure a nodes importance or prominence in the network. The more central a node is in a network the more significant it is to.

Link Analysis, PageRank and Search Engines on the Web

An introduction to iterative projection methods Eigenvalue problems Luiza Bondar the 23 rd of November th Seminar.

CSE 321 Discrete Structures Winter 2008 Lecture 25 Graph Theory.

Link Analysis. 2 HITS - Kleinberg’s Algorithm HITS – Hypertext Induced Topic Selection For each vertex v Є V in a subgraph of interest: A site is very.

Applied Discrete Mathematics Week 12: Trees

HCC class lecture 22 comments John Canny 4/13/05.

Network Measures Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Measures Klout.

Motivation When searching for information on the WWW, user perform a query to a search engine. The engine return, as the query’s result, a list of Web.

The effect of New Links on Google Pagerank By Hui Xie Apr, 07.

Stochastic Approach for Link Structure Analysis (SALSA) Presented by Adam Simkins.

Social Media Mining Graph Essentials.

Piyush Kumar (Lecture 2: PageRank) Welcome to COT5405.

Google’s Billion Dollar Eigenvector Gerald Kruse, PhD. John ‘54 and Irene ‘58 Dale Professor of MA, CS and I T Interim Assistant Provost Juniata.

Information Networks Introduction to networks Lecture 1.

Roshnika Fernando P AGE R ANK. W HY P AGE R ANK ?  The internet is a global system of networks linking to smaller networks.  This system keeps growing,

GRAPHS CSE, POSTECH. Chapter 16 covers the following topics Graph terminology: vertex, edge, adjacent, incident, degree, cycle, path, connected component,

Principles of Social Network Analysis. Definition of Social Networks “A social network is a set of actors that may have relationships with one another”

1 Random Walks on Graphs: An Overview Purnamrita Sarkar, CMU Shortened and modified by Longin Jan Latecki.

COM1721: Freshman Honors Seminar A Random Walk Through Computing Lecture 2: Structure of the Web October 1, 2002.

Murtaza Abbas Asad Ali. NETWORKOLOGY THE SCIENCE OF NETWORKS.

Lecture 5: Mathematics of Networks (Cont) CS 790g: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.

Data Structures & Algorithms Graphs

Complex Networks: Models Lecture 2 Slides by Panayiotis TsaparasPanayiotis Tsaparas.

How works M. Ram Murty, FRSC Queen’s Research Chair Queen’s University or How linear algebra powers the search engine.

Ranking Link-based Ranking (2° generation) Reading 21.

Models and Algorithms for Complex Networks Introduction and Background Lecture 1.

CS 590 Term Project Epidemic model on Facebook

1 HEINZ NIXDORF INSTITUTE University of Paderborn Algorithms and Complexity Christian Schindelhauer Search Algorithms Winter Semester 2004/ Dec.

Link Analysis Algorithms Page Rank Slides from Stanford CS345, slightly modified.

Ljiljana Rajačić. Page Rank Web as a directed graph  Nodes: Web pages  Edges: Hyperlinks 2 / 25 Ljiljana Rajačić.

Class 2: Graph Theory IST402.

+ GRAPH Algorithm Dikompilasi dari banyak sumber.

“Important” Vertices and the PageRank Algorithm Networked Life NETS 112 Fall 2014 Prof. Michael Kearns.

Importance Measures on Nodes Lecture 2 Srinivasan Parthasarathy 1.

CS 540 Database Management Systems Web Data Management some slides are due to Kevin Chang 1.

GRAPH AND LINK MINING 1. Graphs - Basics 2 Undirected Graphs Undirected Graph: The edges are undirected pairs – they can be traversed in any direction.

Topics In Social Computing (67810) Module 1 (Structure) Centrality Measures, Graph Clustering Random Walks on Graphs.

Density of States for Graph Analysis

Search Engines and Link Analysis on the Web

Link-Based Ranking Seminar Social Media Mining University UC3M

Network analysis.

Eigenvalues of a Graph Scott Grayson.

Centralities (2) Ralucca Gera,

Degree and Eigenvector Centrality

Centrality in Social Networks

Prof. Paolo Ferragina, Algoritmi per "Information Retrieval"

PageRank algorithm based on Eigenvectors

Prof. Paolo Ferragina, Algoritmi per "Information Retrieval"

Adjacency Matrices and PageRank

Presentation transcript:

Lectures 6 & 7 Centrality Measures Lectures 6 & 7 Centrality Measures February 2, 2009 Monojit Choudhury

A brief Intro to Myself Yourself The course The classes ◦ Please ask questions ◦ Don’t disturb otherwise ◦ Please go back and read

I shall assume that you know Basic graph theory ◦ Adjacency matrix representation ◦ Degree, in-degree, out-degree ◦ Connected component, shortest paths Basic linear algebra ◦ Symmetric matrix, transpose ◦ Vectors, multiplication of vectors with vectors and matrices, orthogonality ◦ Eigenvectors and Eigenvalues

Lecture 5 Centrality Measures Lecture 5 Centrality Measures February 2, 2009 Monojit Choudhury

Question 1: Information percolation In this friendship network of 8 persons, suppose that someone comes to know about an interesting news. Who are most likely to receive this news fast?

Question 2: Searching the Web In this hyperlinked network of webpages, which pages are most likely to contain authoritative information ?

Question 3: Spreading of STDs In this hypothetical sexual interaction network, who are most likely to be affected by STDs such as AIDS?

A common answer to all the questions Nodes which are most “CENTRAL” to the network Centrality of a node measures its ◦ Power, Prestige, Prominence & imPortance ◦ The 4 “P”s

Degree Centrality How many friends do you have? Measure of centralization of the network ◦ Star network – most centralized ◦ Line graph – least centralized Thus, the variance of degree centrality is the measure of (de)centralization of a network

How much is this network centralized?

When is centralization good/bad? Fault tolerance ◦ Centralized: bad ◦ Decentralized: good However, for random attacks ◦ Centralized: good What happens in a scale-free network?

Closeness Centrality Reciprocal of the sum of shortest paths to all the nodes Compute closeness centrality for nodes 3 and

Closeness Centrality What does variance of closeness centrality indicate? What would this variance be for ◦ A Clique ◦ A Tree ◦ A Ring

Spreading of STDs Who should be removed from this network to make this community less susceptible to spreading of STDs?

Betweenness Centrality Joydeep Subrata Rich (in what?) Joydeep has the opportunity to play a information broker – but Subrata doesn’t

Mathematical Definition s t v Can be extended to edges

Which networks have Nodes with very small betweenness centrality Node(s) with very high betweenness centrality What is the betweenness centrality of the nodes in a complete bipartite network?

Question 2: Searching the Web In this hyperlinked network of webpages, which pages are most popular?

The basic idea I am popular if my friends are popular p 6 = p 2 + p 5 + p 7 + p 8

Computing Popularity

Oops! Popularity grows unboundedly!!

A better approach 1/8 4/8 2/8 3/8 1/8 4/8 3/8 4/22 2/22 3/22 1/22 4/22 3/22

Computing popularity 4/22 2/22 3/22 1/22 4/22 3/22 13/22 6/22 10/22 4/22 9/22 10/22 13/68 6/68 10/68 4/68 9/68 10/68

Computing popularity 13/68 6/68 10/68 4/68 9/68 10/68 39/68 15/68 33/68 9/68 29/68 33/68 39/206 15/206 33/206 9/206 29/206 33/206

Is it converging? 39/206 15/206 33/206 9/206 29/206 33/206 11/82/226/6815/ /84/229/6829/ /83/2210/6833/ /84/2213/6839/

Observations The popularity values eventually converge Nodes which are isomorphic have the same popularity What happens when we start from a different initialization? Does it converge for every graph? What happens for a disconnected graph?

An alternative view to popularity Random surfer model: ◦ The surfer lands up on a random page ◦ With probability w it stays in the same page, but with probability ( 1-w ) it visits any other random link from the page

What’s the probability that the surfer is at node i ? p 6 = wp 6 + (1-w) [p 2 /4+ p 5 + p 7 /3 + p 8 ]

What’s the probability that the surfer is at node i ? p i = wp i + (1-w)  j a ji p j /d j

Therefore, popularity is Eigenvector Centrality Introduced by Bonacich (1972) A slightly different variant is used as “PageRank” p i = (1-w)+ w  j a ji p j /d j

Does all networks have = 1 Yes! Actually, all stochastic matrices (aka Markov Matrices) have the largest Eigenvalue 1 = 1 Perron-Frobenius Theorem ◦ If A is a positive matrix, so is its largest Eigenvalue 1 > all other | i |. Every component of the corresponding Eigenvector is also positive.