Identity and search in social networks Presented by Pooja Deodhar Duncan J. Watts, Peter Sheridan Dodds and M. E. J. Newman.

Slides:



Advertisements
Similar presentations
Peer-to-Peer and Social Networks Power law graphs Small world graphs.
Advertisements

The Small World Phenomenon: An Algorithmic Perspective Speaker: Bradford Greening, Jr. Rutgers University – Camden.
1 Analyzing Kleinberg’s Small-world Model Chip Martel and Van Nguyen Computer Science Department; University of California at Davis.
Small-world networks.
Milgram-Routing in Social Networks
1 Greedy Forwarding in Dynamic Scale-Free Networks Embedded in Hyperbolic Metric Spaces Dmitri Krioukov CAIDA/UCSD Joint work with F. Papadopoulos, M.
It’s a Small World by Jamie Luo. Introduction Small World Networks and their place in Network Theory An application of a 1D small world network to model.
Online Social Networks and Media Navigation in a small world.
UNIVERSITY OF JYVÄSKYLÄ Building NeuroSearch – Intelligent Evolutionary Search Algorithm For Peer-to-Peer Environment Master’s Thesis by Joni Töyrylä
Generated Waypoint Efficiency: The efficiency considered here is defined as follows: As can be seen from the graph, for the obstruction radius values (200,
Information Networks Small World Networks Lecture 5.
Advanced Topics in Data Mining Special focus: Social Networks.
Important clustering methods used in microarray data analysis Steve Horvath Human Genetics and Biostatistics UCLA.
Structural Inference of Hierarchies in Networks BY Yu Shuzhi 27, Mar 2014.
Lecture 7 CS 728 Searchable Networks. Errata: Differences between Copying and Preferential Attachment In generative model: let p k be fraction of nodes.
School of Information University of Michigan SI 614 Search in structured networks Lecture 15.
Company LOGO 1 Identity and Search in Social Networks D.J.Watts, P.S. Dodds, M.E.J. Newman Maryam Fazel-Zarandi.
Farnoush Banaei-Kashani and Cyrus Shahabi Criticality-based Analysis and Design of Unstructured P2P Networks as “ Complex Systems ” Mohammad Al-Rifai.
Common approach 1. Define space: assign random ID (160-bit) to each node and key 2. Define a metric topology in this space,  that is, the space of keys.
CPSC 335 Computer Science University of Calgary Canada.
SASH Spatial Approximation Sample Hierarchy
An Analysis of the Optimum Node Density for Ad hoc Mobile Networks Elizabeth M. Royer, P. Michael Melliar-Smith and Louise E. Moser Presented by Aki Happonen.
Data Mining Cluster Analysis: Advanced Concepts and Algorithms Lecture Notes for Chapter 9 Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach,
Building Low-Diameter P2P Networks Eli Upfal Department of Computer Science Brown University Joint work with Gopal Pandurangan and Prabhakar Raghavan.
Visual Querying By Color Perceptive Regions Alberto del Bimbo, M. Mugnaini, P. Pala, and F. Turco University of Florence, Italy Pattern Recognition, 1998.
Advanced Topics in Data Mining Special focus: Social Networks.
Complex Networks Structure and Dynamics Ying-Cheng Lai Department of Mathematics and Statistics Department of Electrical Engineering Arizona State University.
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
A Measurement-driven Analysis of Information Propagation in the Flickr Social Network WWW09 报告人: 徐波.
Lecture 18: Small World Networks CS 790g: Complex Networks
Friends and Locations Recommendation with the use of LBSN
Author: M.E.J. Newman Presenter: Guoliang Liu Date:5/4/2012.
Jure Leskovec Computer Science Department Cornell University / Stanford University Joint work with: Eric Horvitz, Michael Mahoney,
School of Information Sciences University of Pittsburgh TELCOM2125: Network Science and Analysis Konstantinos Pelechrinis Spring 2013 Figures are taken.
Small-world networks. What is it? Everyone talks about the small world phenomenon, but truly what is it? There are three landmark papers: Stanley Milgram.
Clustering Spatial Data Using Random Walk David Harel and Yehuda Koren KDD 2001.
A Scalable Content-Addressable Network (CAN) Seminar “Peer-to-peer Information Systems” Speaker Vladimir Eske Advisor Dr. Ralf Schenkel November 2003.
Friends and Locations Recommendation with the use of LBSN By EKUNDAYO OLUFEMI ADEOLA
Online Social Networks and Media
3. SMALL WORLDS The Watts-Strogatz model. Watts-Strogatz, Nature 1998 Small world: the average shortest path length in a real network is small Six degrees.
Navigation in small worlds Social Networks: Models and Applications Seminar Toronto, Fall 2007 (based on a presentation by Stratis Ioannidis)
The new protocol of freenet Taken from Ian Clarke and Oskar Sandberg (The Freenet Project)
Pastry Antony Rowstron and Peter Druschel Presented By David Deschenes.
Lecture 17: Search in structured networks CS 765 Complex Networks Slides are modified from Lada Adamic.
Radial Basis Function ANN, an alternative to back propagation, uses clustering of examples in the training set.
Database Management Systems, R. Ramakrishnan 1 Algorithms for clustering large datasets in arbitrary metric spaces.
An Effective Method to Improve the Resistance to Frangibility in Scale-free Networks Kaihua Xu HuaZhong Normal University.
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
Performance Evaluation Lecture 1: Complex Networks Giovanni Neglia INRIA – EPI Maestro 10 December 2012.
Siddhartha Gunda Sorabh Hamirwasia.  Generating small world network model.  Optimal network property for decentralized search.  Variation in epidemic.
CLUSTERING HIGH-DIMENSIONAL DATA Elsayed Hemayed Data Mining Course.
Percolation Percolation is a purely geometric problem which exhibits a phase transition consider a 2 dimensional lattice where the sites are occupied with.
Models and Algorithms for Complex Networks
Navigation in Networks, Revisited Networked Life MKSE 112 Fall 2012 Prof. Michael Kearns.
Topics In Social Computing (67810) Module 1 Introduction & The Structure of Social Networks.
Graph clustering to detect network modules
Connectivity and the Small World
Expectations of Random Variables, Functions of Random Variables
Lecture 1: Complex Networks
Search in structured networks
Identity and Search in Social Networks
Peer-to-Peer and Social Networks
Community detection in graphs
Milgram’s experiment really demonstrated two striking facts about large social networks: first, that short paths are there in abundance;
Network Science: A Short Introduction i3 Workshop
The Watts-Strogatz model
Network Science: A Short Introduction i3 Workshop
Navigation and Propagation in Networks
Advanced Topics in Data Mining Special focus: Social Networks
Presentation transcript:

Identity and search in social networks Presented by Pooja Deodhar Duncan J. Watts, Peter Sheridan Dodds and M. E. J. Newman

Presentation Outline Introduction Contentions – Social Networks Algorithm explanation Our model and Milgram’s findings Further Extensions Applications 2

Introduction Social Networks are “Searchable” Our model offers explanation of searchability in terms of recognizable personal identities Personal identities - sets of characteristics in different social dimensions Class of searchable networks and method for searching them applicable to many real world problems 3

Introduction Small World Network ◦ Network in which most nodes are not neighbors of each other but most nodes can be reached from every other node by a number of hops 4

Introduction Milgram’s Experiment ◦ Short paths exist between individuals in large social network ◦ Ordinary people can find these short paths ◦ People rarely have more than local knowledge about the network 5 Source

Introduction Searchability ◦ Property of being able to find a target quickly Shown to exist in networks ◦ With certain fraction of hubs (highly connected nodes which once reached can distribute messages to all parts of the network) ◦ Built upon underlying geometric lattice 6

Introduction Limited hubs in social networks Social Networks are more like a peer-to- peer network Need for a hierarchical model Some measure of distance between individuals Can be based on targets identity, friends identity, friend’s popularity 7

Contentions – Social Networks Individual identities – sets of characteristics attributed to them by virtue of association, participation in social groups Groups – Collection of individuals with well-defined set of social characteristics 8

Contentions – Social Networks Breaking down of world into set of layers Top layer – whole population Lower layers – specific division into groups 9

Contentions – Social Networks Similarity x ij – between individuals i, j x ij – Height of the lowest common ancestor level between i and j Individuals in same group are at distance of one from each other 10

Contentions – Social Networks Combined social distance y ij = min h x ij In the above figure H = 2 In 1 st heirarchy, y ij = 1 and y jk = 1 in 2 nd But y ik = 4 > y ij + y jk = 2 11

Contentions – Social Networks Probability of acquaintance between i and j decreases with decreasing similarity of groups to which they belong Link distance x for individual i has probability p(x) = ce - α x Measure of homophily – tendency of like to associate with like 12

Contentions – Social Networks Individuals hierarchically partition the social world in more than one way. ◦ h = 1, …, H hierarchies Node’s identity is the vector ◦ is position of node i in hierarchy h. Social distance 13

Contentions – Social Networks At each step the holder i of the message passes it to one of its friends who is closest to the target t in terms of social distance Individuals know the identity vectors of: ◦ themselves ◦ their friends, ◦ the target Two kinds of partial information – social distance and network paths 14

Algorithm Explanation Principal objective – determine conditions for average path length L of a message chain is small Define q as probability of an arbitrary message chain reaching a target. Searchable network - Any network for which q ≥ r for a desired r. 15

Searchability Searchable networks occupy a broad region of parameter space which are sociologically plausible Searchability is generic property of social networks 16

Algorithm Explanation In terms of chain length L, q = (1 - p) L ≥ r L = length of message chain P = message failure probability From above, L can be obtained by the approximate inequality, L <= ln r / ln (1 - p) 17

Our model and Milgram’s findings All searchable networks have α > 0, H > 1 Individuals are essentially homophilous but judge similarity along more than one social dimension Best performance is achieved for H = 2 or 3 Thus, use of 2 or 3 dimensions used by individuals in small world experiments when forwarding a message 18

Searchable Networks 19 Solid boundary – N=102,400 Dot-dash – N= Dash – N=409,600 p = 0.25, b = 2, g = 100, r = 0.25 at least

Our model and Milgram’s findings Increasing number of independent dimensions from H = 1 yields dramatic reduction in delivery time for α > 0 This improvement lost as H is increased further Thus, network ties become less correlated as H increases For large H, network becomes a random graph, search algorithm becomes random walk 20

Searchable Networks 21 Probability of message completion when for α = 0 (squares) and for α = 2 (circles) for N = 102,400 Horizontal line – pos of the threshold Open symbols indicate network is searchable – q <= r

Our model and Milgram’s data n(L) – no. of completed chains of length L taken from original small world expt. (shown by bar graphs) Taken for example of our model for N = 10^8 individuals and for 42 completed chains shown by filled circles 22

Our model and Milgram’s findings Comparison of distribution of chain lengths in our model with that of Travers and Milgram Avg. chain length for Milgrams expt = 6.5 Avg. chain length for our model =

Summary Simple greedy algorithm. Represents properties present in real social networks: ◦ Considers local clustering. ◦ Reflects the notion of locality. High-level structure + random links. 24

Further Extensions Should we consider other parameters such as friend’s popularity information in addition to homophily? ◦ Allow variation in node degrees? Assume correlation between hierarchies? Are all hierarchies equally important? 25

Applications Broad class of decentralized problems ◦ Peer to peer networking Any data structure in which data elements can be judged along more than one dimension Designing of databases ◦ Eg. Music files – same genre/same year 26