Jure Leskovec, CMU Eric Horwitz, Microsoft Research.

Slides:



Advertisements
Similar presentations
Routing in Poisson small-world networks A. J. Ganesh Microsoft Research, Cambridge Joint work with Moez Draief.
Advertisements

Mobile Communication Networks Vahid Mirjalili Department of Mechanical Engineering Department of Biochemistry & Molecular Biology.
Chapter 52 Earth’s Fluctuating Populations
By Lee Betancourt Director of Communications and Public Relations Jane Myers Public Relations, Communications and Social Media Coordinator Social Media.
Stelios Lelis UAegean, FME: Special Lecture Social Media & Social Networks (SM&SN)
Λ14 Διαδικτυακά Κοινωνικά Δίκτυα και Μέσα Strong and Weak Ties Chapter 3, from D. Easley and J. Kleinberg book.
The Connectivity and Fault-Tolerance of the Internet Topology
Online Social Networks and Media Navigation in a small world.
P2P Topologies Centralized Ring Hierarchical Decentralized Hybrid.
Jure Leskovec Joint work with Eric Horvitz, Microsoft Research.
Advanced Topics in Data Mining Special focus: Social Networks.
Identity and search in social networks Presented by Pooja Deodhar Duncan J. Watts, Peter Sheridan Dodds and M. E. J. Newman.
Complex network of the brain I Small world vs. scale-free networks Jaeseung Jeong, Ph.D. Department of Bio and Brain Engineering, KAIST.
1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.
Emergence of Scaling in Random Networks Barabasi & Albert Science, 1999 Routing map of the internet
Directional triadic closure and edge deletion mechanism induce asymmetry in directed edge properties.
Network Models Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Models Why should I use network models? In may 2011, Facebook.
Small Worlds Presented by Geetha Akula For the Faculty of Department of Computer Science, CALSTATE LA. On 8 th June 07.
UNDERSTANDING VISIBLE AND LATENT INTERACTIONS IN ONLINE SOCIAL NETWORK Presented by: Nisha Ranga Under guidance of : Prof. Augustin Chaintreau.
CS 728 Lecture 4 It’s a Small World on the Web. Small World Networks It is a ‘small world’ after all –Billions of people on Earth, yet every pair separated.
Sampling from Large Graphs. Motivation Our purpose is to analyze and model social networks –An online social network graph is composed of millions of.
Marko Grobelnik, Dunja Mladenic JSI Parts of the presentation taken from the tutorial “Structure and function of real-world graphs and networks” by Jure.
Graphs and Topology Yao Zhao. Background of Graph A graph is a pair G =(V,E) –Undirected graph and directed graph –Weighted graph and unweighted graph.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
A Measurement-driven Analysis of Information Propagation in the Flickr Social Network WWW09 报告人: 徐波.
Mining Large Graphs Part 3: Case studies Jure Leskovec and Christos Faloutsos Machine Learning Department Joint work with: Lada Adamic, Deepay Chakrabarti,
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
 What is Europe’s largest city?  Paris, France  What is its population?  9.6 million.
Planetary-Scale Views on a Large Instant-Messaging Network 报告人:徐波.
Jure Leskovec Joint work with Eric Horvitz, Microsoft Research.
Jure Leskovec Computer Science Department Cornell University / Stanford University Joint work with: Eric Horvitz, Michael Mahoney,
Section 8 – Ec1818 Jeremy Barofsky March 31 st and April 1 st, 2010.
Data Analysis in YouTube. Introduction Social network + a video sharing media – Potential environment to propagate an influence. Friendship network and.
WALKING IN FACEBOOK: A CASE STUDY OF UNBIASED SAMPLING OF OSNS junction.
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
Network Characterization via Random Walks B. Ribeiro, D. Towsley UMass-Amherst.
Internet Economics כלכלת האינטרנט Class 10 – it’s a small world 1.
Jure Leskovec Computer Science Department Cornell University / Stanford University Joint work with: Jon Kleinberg (Cornell), Christos.
Microsoft Instant Messenger Communication Network How does the world communicate? Jure Leskovec Machine Learning Department
Online Social Networks and Media
Selfishness, Altruism and Message Spreading in Mobile Social Networks September 2012 In-Seok Kang
3. SMALL WORLDS The Watts-Strogatz model. Watts-Strogatz, Nature 1998 Small world: the average shortest path length in a real network is small Six degrees.
CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University
Online Social Networks and Media
Complex Network Theory – An Introduction Niloy Ganguly.
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Complex Network Theory – An Introduction Niloy Ganguly.
Most of contents are provided by the website Network Models TJTSD66: Advanced Topics in Social Media (Social.
What Is A Network? (and why do we care?). An Introduction to Network Theory | Kyle Findlay | SAMRA 2010 | 2 “A collection of objects (nodes) connected.
Does the world have a carrying capacity for the human population? If so, what is it? How long until we reach it? Or have we already overshot it? What will.
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
Internet Economics כלכלת האינטרנט Class 9 – social networks (based on chapter 3 from Easely & Kleinberg’s books) 1.
Offense: Planetary-Scale Views on a Large Instant Messaging Network J. Leskovec, et al.
Class 4: It’s a Small World After All Network Science: Small World February 2012 Dr. Baruch Barzel.
Netlogo demo. Complexity and Networks Melanie Mitchell Portland State University and Santa Fe Institute.
Topics In Social Computing (67810) Module 1 Introduction & The Structure of Social Networks.
Social Networks Some content from Ding-Zhu Du, Lada Adamic, and Eytan Adar.
Lecture 23: Structure of Networks
Connectivity and the Small World
The Role of Optimal Distinctiveness and Homophily in Online Dating
Lecture 23: Structure of Networks
The Watts-Strogatz model
Network Science: A Short Introduction i3 Workshop
Lecture 23: Structure of Networks
CS224w: Social and Information Network Analysis
Advanced Topics in Data Mining Special focus: Social Networks
Planetary-Scale Views on a Large Instant-Messaging Network
Presentation transcript:

Jure Leskovec, CMU Eric Horwitz, Microsoft Research

 Contact (buddy) list  Messaging window 2

3 Buddy Conversation

 Observe social and communication phenomena at planetary scale  Practically the largest social network Research questions:  How does communication change with user demographics (age, sex, language, country)?  How does geography affect communication?  What is the structure of the communication network? 4

The record of communication  User demographic data (self-reported):  Age  Gender  Location (Country, ZIP)  Language  Communication data:  For each conversation: participants and time  No message text 5

 We collected the data for June 2006  Log size: 150Gb/day (compressed)  Total: 1 month of communication data: 4.5Tb of compressed data  Activity over June 2006 (30 days)  245 million users logged in  180 million users engaged in conversations  17,5 million new accounts activated  More than 30 billion conversations  More than 255 billion messages 6

Activity on June :  1 billion conversations  93 million users login  65 million different users talk (exchange messages)  1.5 million invitations for new accounts sent 7

How does user demographics influence communication? 8

9

10

Correlation 11 People tend to talk to similar people (except gender)  How do people’s attributes (age, gender) influence communication? Probability

12 User self reported age High Low 1) Young people communicate with same age 2) Older people communicate uniformly across ages

13 User self reported age High Low 1) Old people talk long 2) Working ages (25-40) talk short

14 User self reported age High Low 1) Old people talk long 2) Working ages (25-40) talk quick

15 User self reported age High Low 1) Old people talk slow 2) Young talk fast

 Is gender communication biased?  Do female talk more among themselves?  Do male-female conversations took longer?  Findings:  No of. conversations is not biased (follows chance)  Cross gender conversations take longer and are mo intense (people put more attention) 16 MF 49% 21% 20% Conversations MF 5mi n 4.5 min 4min Duration MF Messages/conversation

 Longer links are used more 17

Each dot represents number of users at geo location 18 Map of the world appears! Costal regions dominate

Users per capita 19 Mid-USA and Scandinavia have highest per capita Fraction of country’s population on MSN: Iceland: 35% Spain: 28% Netherlands, Canada, Sweden, Norway: 26% France, UK: 18% USA, Brazil: 8%

20 Northern hemisphere dominates communication For each conversation between geo points (A,B) we increase the intensity on the line between A and B

21 At least 1 message exchanged

 Buddy graph (estimate)  240 million people (people that login in June ’06)  9.1 billion buddy edges (friendship links)  Communication graph  Edge if the users exchanged at least 1 message  180 million people  1.3 billion edges  30 billion conversations 22

23 Number of buddies follows power- law with exponential cutoff distribution Limit of 600 buddies

Milgram’s small world experiment (i.e., hops + 1) 24  Small-world experiment [Milgram ‘67]  People send letters from Nebraska to Boston  How many steps does it take?  Messenger social network of the whole planet Eart  240M people, 1.3B edges MSN Messenger network Number of steps between pairs of people Avg. path length % of the people can be reached in < 8 hops

 How many triangles are closed?  Clustering normally decays as k High clusteringLow clustering Communication network is highly clustered: k -0.37

26

 What is the structure of the core of the network? 27 [Batagelj & Zaveršnik, 2002]

 People with k<20 are the periphery  Core is composed of 79 people, each having 68 edges among them 28

 Remove nodes (in some order) and observe how network falls apart:  Number of edges deleted  Size of largest connected component 29 Order nodes by:  Number of links  Total conversations  Total conv. Duration  Messages/conversation  Avg. sent, avg. duration

30

31

 Social network of the whole planet Earth  The largest social network analyzed  Strong presence of homophily  people that communicate are similar (except gender)  Well connected Small-world  in only few hops one can research most of the network  Very robust:  many (random) people can be removed and the network is still connected 32