J.P. Onnela. Claim and Roadmap We examine the communication patterns of millions of mobile phone users, allowing us to simultaneously study the local.

Slides:



Advertisements
Similar presentations
Mobile Communication Networks Vahid Mirjalili Department of Mechanical Engineering Department of Biochemistry & Molecular Biology.
Advertisements

The Strength of Weak Ties
Network Matrix and Graph. Network Size Network size – a number of actors (nodes) in a network, usually denoted as k or n Size is critical for the structure.
Λ14 Διαδικτυακά Κοινωνικά Δίκτυα και Μέσα Strong and Weak Ties Chapter 3, from D. Easley and J. Kleinberg book.
Analysis and Modeling of Social Networks Foudalis Ilias.
Based on chapter 3 in Networks, Crowds and markets (by Easley and Kleinberg) Roy Mitz Supervised by: Prof. Ronitt Rubinfeld November 2014 Strong and weak.
Relationship Mining Network Analysis Week 5 Video 5.
A.k.a. “bell curve”.  If a characteristic is normally distributed in a population, the distribution of scores measuring that characteristic will form.
Social Networks 101 P ROF. J ASON H ARTLINE AND P ROF. N ICOLE I MMORLICA.
Nodes, Ties and Influence
Advanced Topics in Data Mining Special focus: Social Networks.
CONNECTIVITY “The connectivity of a network may be defined as the degree of completeness of the links between nodes” (Robinson and Bamford, 1978).
Identity and search in social networks Presented by Pooja Deodhar Duncan J. Watts, Peter Sheridan Dodds and M. E. J. Newman.
Correlation Chapter 9.
Stress on Social Networks formed in a Mobile and Diverse World CS 460 December 8, 2005.
Computing Trust in Social Networks
Graphs and Topology Yao Zhao. Background of Graph A graph is a pair G =(V,E) –Undirected graph and directed graph –Weighted graph and unweighted graph.
Using Social Networks to Analyze Sexual Relations by Amanda Dargie.
Network analysis and applications Sushmita Roy BMI/CS 576 Dec 2 nd, 2014.
Lorem Ipsum Dolor Structure and tie strengths in mobile communication network.
Chapter 9 Hypothesis Testing.
A Measurement-driven Analysis of Information Propagation in the Flickr Social Network WWW09 报告人: 徐波.
Network Measures Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Measures Klout.
Models of Influence in Online Social Networks
Week 12 Chapter 13 – Association between variables measured at the ordinal level & Chapter 14: Association Between Variables Measured at the Interval-Ratio.
Inference for regression - Simple linear regression
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
The United States air transportation network analysis Dorothy Cheung.
CENTRE FOR INNOVATION, RESEARCH AND COMPETENCE IN THE LEARNING ECONOMY Session 2: Basic techniques for innovation data analysis. Part I: Statistical inferences.
Developing Analytical Framework to Measure Robustness of Peer-to-Peer Networks Niloy Ganguly.
Percolation in Living Neural Networks Ilan Breskin, Jordi Soriano, Tsvi Tlusty, and Elisha Moses Department of Physics of Complex Systems, The Weizmann.
Jure Leskovec, CMU Eric Horwitz, Microsoft Research.
CS 599: Social Media Analysis University of Southern California1 Social Ties and Information Diffusion Kristina Lerman University of Southern California.
Λ14 Διαδικτυακά Κοινωνικά Δίκτυα και Μέσα Strong and Weak Ties Chapter 3, from D. Easley and J. Kleinberg book.
Psychology’s Statistics Statistical Methods. Statistics  The overall purpose of statistics is to make to organize and make data more meaningful.  Ex.
Principles of Social Network Analysis. Definition of Social Networks “A social network is a set of actors that may have relationships with one another”
Presented by Qian Zou.  The purpose of conducting the experiments.  The methodology for the experiments.  The Experimental Design : Cohesion Experiments.
Visual Search: examining relationship between cropping and image size for limited-screen devices Paul Lin Andres Odio.
Research & Statistics Looking for Conclusions. Statistics Mathematics is used to organize, summarize, and interpret mathematical data 2 types of statistics.
Using Pattern of Social Dynamics in the Design of Social Networks of Sensors - Marello Tomasini, Franco Zambonelli, Ronaldo Menezes 한국기술교육대학교 전기전자통신 공학부.
Murtaza Abbas Asad Ali. NETWORKOLOGY THE SCIENCE OF NETWORKS.
Special Topics in Educational Data Mining HUDK5199 Spring 2013 March 25, 2012.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Project funded by the Future and Emerging Technologies arm of the IST Programme Analytical Insights into Immune Search Niloy Ganguly Center for High Performance.
Selfishness, Altruism and Message Spreading in Mobile Social Networks September 2012 In-Seok Kang
Objectives 2.1Scatterplots  Scatterplots  Explanatory and response variables  Interpreting scatterplots  Outliers Adapted from authors’ slides © 2012.
Robustness of complex networks with the local protection strategy against cascading failures Jianwei Wang Adviser: Frank,Yeong-Sung Lin Present by Wayne.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Online Social Networks and Media
Analyzing the Vulnerability of Superpeer Networks Against Attack Niloy Ganguly Department of Computer Science & Engineering Indian Institute of Technology,
Slides are modified from Lada Adamic
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Community Discovery in Social Network Yunming Ye Department of Computer Science Shenzhen Graduate School Harbin Institute of Technology.
OPTIMAL CONNECTIONS: STRENGTH AND DISTANCE IN VALUED GRAPHS Yang, Song and David Knoke RESEARCH QUESTION: How to identify optimal connections, that is,
1 Finding Spread Blockers in Dynamic Networks (SNAKDD08)Habiba, Yintao Yu, Tanya Y., Berger-Wolf, Jared Saia Speaker: Hsu, Yu-wen Advisor: Dr. Koh, Jia-Ling.
Community structure in graphs Santo Fortunato. More links “inside” than “outside” Graphs are “sparse” “Communities”
1 Discovering Web Communities in the Blogspace Ying Zhou, Joseph Davis (HICSS 2007)
Social Networks Strong and Weak Ties
1 Link Privacy in Social Networks Aleksandra Korolova, Rajeev Motwani, Shubha U. Nabar CIKM’08 Advisor: Dr. Koh, JiaLing Speaker: Li, HueiJyun Date: 2009/3/30.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Copyright © 2012 by Nelson Education Limited. Chapter 12 Association Between Variables Measured at the Ordinal Level 12-1.
Alan Mislove Bimal Viswanath Krishna P. Gummadi Peter Druschel.
Connectivity and the Small World
Social Networks Analysis
Applications of graph theory in complex systems research
The Strength of Weak Ties
Software Reliability Models.
Chapter 9 Hypothesis Testing.
Advanced Algebra Unit 1 Vocabulary
Advanced Topics in Data Mining Special focus: Social Networks
Presentation transcript:

J.P. Onnela

Claim and Roadmap We examine the communication patterns of millions of mobile phone users, allowing us to simultaneously study the local and the global structure of a society-wide communication network.  Furthermore the mobile phone data are skewed toward trusted interactions (that is, people tend to share their mobile numbers only with individuals they trust). The data set allows us to explore the relationship between the topology of the network and the tie strengths between individuals. We demonstrate a local coupling between tie strengths and network topology, and show that this coupling has important consequences for the network’s global stability if ties are removed, as well as for the spread of news and ideas within the network.

Data Collected on Mobile Phone Users A significant portion of a country’s communication network was reconstructed from 18 weeks of all mobile phone call records among 20% of the country’s entire population, 90% of whose inhabitants had a mobile phone subscription. To translate the phone log data into a network representation that captures the characteristics of the underlying communication network,  we connected two users with an undirected link if there had been at least one reciprocated pair of phone calls between them (i.e., A called B, and B called A) and defined the strength, wt(AB) = wt(BA), of a tie aggregated duration of calls between users A and B. The resulting mobile call graph “MCG” contains N 4.6*(10)^6 nodes and 7.0*(10)^6 links, the vast majority (84.1%) of these nodes belonging to a single connected cluster Giant Component(GC).

Analysis 1 : MCG The MCG has a skewed degree distribution with a fat tail (Fig.A), indicating that although most users communicate with only a few individuals, a small minority talks with dozens.This decay(k = 8.4 exponential distribution decaying degree distribution) is probably rooted in the fact that institutional phone numbers, corresponding to the vast majority of large hubs in the case of land lines, are absent, and in contrast with , in which a single can be sent to many recipients, resulting in well-connected hubs. The tie strength distribution is broad (Fig. B), however, decaying with an exponent w 1.9, so that although the majority of ties correspond to a few minutes of airtime, a small fraction of users spend hours chatting with each other.

Hypothesis and Definition Global efficiency principle  Meaning that the tie strengths are optimized to maximize the overall flow in the network. In this case the weight of a link should correlate with its betweenness centrality, which is proportional to the number of shortest paths between all pairs of nodes passing through it. Dyadic hypothesis  The strength of a particular tie depends only on the nature of the relationship between two individuals and is thus independent of the network surrounding the tie Weak ties hypothesis  States that the strength of a tie between A and B increases with the overlap of their friendship circles, resulting in the importance of weak ties in connecting communities. The hypothesis leads to high betweenness centrality for weak links, which can be seen as the mirror image of the global efficiency principle

Plot Global efficiency principle, Dyadic hypothesis and Weak ties hypothesis Weak Hypothesis:(Fig A) Selected individual, where the link color corresponds to the strength of each tie the network consists of small local clusters, typically grouped around a high-degree individual. The majority of the strong ties are found within the clusters, indicating that users spend most of their on-air time talking to members of their immediate circle of friends. Dyadic hypothesis:(Fig B) we observe dramatically more weak ties within the communities and more strong ties connecting distinct communities. Global efficiency principle(Fig C) As predicted by the global efficiency principle and betweenness centrality, intercommunity ties (‘‘bridges’’) were strong and intracommunity ties (‘‘local roads’’) weak

Quantify the Hypothesis Calculate Overlap: We measured the relative topological overlap of the neighborhood of two users vi and vj, representing the proportion of their common friends Oij = nij/((ki 1) (kj 1) nij), where nij is the number of common neighbors of vi and vj Obervation:If vi and vj have no common acquaintances, then we have Oij = 0, the link between i and j representing potential bridges between two different communities. If i and j are part of the same circle of friends, then Oij = 1 Weak ties hypothesis:(Blue circles:Fig D) Support for the real communication network, Ow increases as a function of the percentage of links with weights smaller than Constant Weight w, demonstrating that the stronger the tie between two users, the more their friends overlap, a correlation that is valid for 95% of the links Contradiction of dyadic Hypothesis (Red circles:Fig D) and Th e absence of a relationship between the local network topology and weights, and, indeed, we find that permuting randomly the tie strengths between the links results in Oij that is independent of wij. Global efficiency principle(Black Circle) According to the global efficiency principle Ob decreases with the betweenness centrality bij, indicating that on average the links with the highest betweenness centrality bij have the smallest overlap.

To understand the systemic or global implications of this local relationship between tie strength and network structure, we explore the network’s ability to withstand the removal of either strong or weak ties.  Rgc( f )=> the fraction of nodes that can all reach each other through connected paths as a function of the fraction of removed links,f S˜ (¥ ssmax n s s 2 /N) => The precise point at which the network disintegrates can be determined by monitoring, where n s is the number of clusters containing s nodes. According to percolation theory, if the network collapses because of a phase transition at f c, then S˜ diverges as f approaches Fc. Indeed, we find that S develops a peak if we start with the weakest (or smallest overlap) links. By Red line  We find that removing in rank order the smallest link(Fig. A and B) to strongest (greatest overlap) ties leads to the network’s sudden disintegration. sudden, phase transition-driven collapse of the whole network than removing strong links. By Black line  The strong ties are predominantly within the communities, their removal will only locally disintegrate a community but not affect the network’s overall integrity. In contrast, the removal of the weak links will delete the bridges that connect different communities, leading to a phase transition driven network collapse.

By Red line  We find that removing in rank order the smallest overlap (Fig. A and B) to strongest (greatest overlap) ties leads to the network’s sudden disintegration. sudden, phase transition-driven collapse of the whole network than removing strong links. By Black line  The greatest overlap are predominantly within the communities, their removal will only locally disintegrate a community but not affect the network’s overall integrity. In contrast, the removal of the smallest overlap will delete the bridges that connect different communities, leading to a phase transition driven network collapse.

Mobile Phone Diffusion problem To see whether the observed local relationship between the network topology and tie strength affects global information diffusion, at time we infected a randomly selected individual with some novel information. We assumed that at each time step, each infected individual, vi, can pass the information to his/her contact, vj, with effective. Therefore, the more time two individuals spend on the phone, the higher the chance that they will pass on the monitored information. probability Pij = x(wij).

Message Diffusion Analysis Analysis A We find that information transfer is significantly faster on the network for which all weights are equal. the bridges between communities are strengthened, and the spreading becomes a predominantly global process, rapidly reaching all nodes through a hierarchy of hubs. Analysis B We observe rapid diffusion within a single community, corresponding to fast increases in the number of infected users, followed by plateaus, corresponding to time intervals during which no new nodes are infected before the news escapes the community.

Diffusion Analysis 2 Analysis C The distribution of the tie strengths through which each individual was first infected (Fig. C) has a prominent peak at w (10)2 seconds, indicating that, in the vast majority of cases, an individual learns about the news through ties of intermediate strength.Analysis D. When all tie strengths are taken to be equal during the spreading process. In this case, the majority of infections take place along the ties that are otherwise weak Conclusion We find that both weak and strong ties have a relatively insignificant role as conduits for information (‘‘the weakness of weak and strong ties’’), the former because the small amount of on- air time offers little chance of information transfer and the latter because they are mostly confined within communities, with little access to new information

Difference real and control Diffusion In the control runs, the information mainly follows the shortest paths. When the weights are taken into account, however, information flows along a strong tie backbone, and large regions of the network, connected to the rest of the network by weak ties, are only rarely infected. to the rest of the network by weak ties, are only rarely infected. For example, the lower half of the network is rarely infected in the real simulation but is always infected in the control run. Therefore, the diffusion mechanism in the network is drastically altered when we neglect the tie strengths responsible for the differences between the curves seen in Fig. A and B.

Discussion We show that tie strengths correlate with the local network structure around the tie, and both the dyadic hypothesis and the global efficiency principle are unable to account this observation. Our analyses document specifies opposite effect in communication networks: The removal of the weak ties results in a phase transition-like network collapse,although the removal of strong ties has little impact on the network’s overall integrity. Successful searches in social networks are conducted primarily through intermediate- to weak-strength ties while avoiding the hubs. Link weights and betweenness centrality are negatively correlated in mobile communication. communication networks are better suited to local information processing than global information transfer, a result that has the makings of a paradox