Measurement and Analysis of Online Social Networks Alan Mislove,Massimiliano Marcon, Krishna P. Gummadi, Peter Druschel, Bobby Bhattacharjee Presented.

Slides:



Advertisements
Similar presentations
Measurement and Analysis of Online Social Networks 1 A. Mislove, M. Marcon, K Gummadi, P. Druschel, B. Bhattacharjee Presentation by Shahan Khatchadourian.
Advertisements

Stelios Lelis UAegean, FME: Special Lecture Social Media & Social Networks (SM&SN)
Analysis and Modeling of Social Networks Foudalis Ilias.
Web as Network: A Case Study Networked Life CIS 112 Spring 2010 Prof. Michael Kearns.
Information Networks Generative processes for Power Laws and Scale-Free networks Lecture 4.
Advanced Topics in Data Mining Special focus: Social Networks.
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
Asking Questions on the Internet
4. PREFERENTIAL ATTACHMENT The rich gets richer. Empirical evidences Many large networks are scale free The degree distribution has a power-law behavior.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.
Masters Thesis Defense Amit Karandikar Advisor: Dr. Anupam Joshi Committee: Dr. Finin, Dr. Yesha, Dr. Oates Date: 1 st May 2007 Time: 9:30 am Place: ITE.
Network Models Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Models Why should I use network models? In may 2011, Facebook.
Monday, June 01, 2015 Online Social Networks: An Introduction Prensenter: IengFat Lam.
Flickr Information propagation in the Flickr social network Meeyoung Cha Max Planck Institute for Software Systems With Alan Mislove.
Mining and Searching Massive Graphs (Networks)
Networks FIAS Summer School 6th August 2008 Complex Networks 1.
By: Roma Mohibullah Shahrukh Qureshi
UNDERSTANDING VISIBLE AND LATENT INTERACTIONS IN ONLINE SOCIAL NETWORK Presented by: Nisha Ranga Under guidance of : Prof. Augustin Chaintreau.
CS 728 Lecture 4 It’s a Small World on the Web. Small World Networks It is a ‘small world’ after all –Billions of people on Earth, yet every pair separated.
Web as Graph – Empirical Studies The Structure and Dynamics of Networks.
Peer-to-Peer and Grid Computing Exercise Session 3 (TUD Student Use Only) ‏
Maciej Kurant (EPFL / UCI) Joint work with: Athina Markopoulou (UCI),
Common Properties of Real Networks. Erdős-Rényi Random Graphs.
CS Lecture 6 Generative Graph Models Part II.
Sampling from Large Graphs. Motivation Our purpose is to analyze and model social networks –An online social network graph is composed of millions of.
Measurement and Analysis of Online Social Networks By Alan Mislove, Massimiliano Marcon, Krishna P. Gummadi, Peter Druschel, Bobby Bhattacharjee Attacked.
Advanced Topics in Data Mining Special focus: Social Networks.
Graphs and Topology Yao Zhao. Background of Graph A graph is a pair G =(V,E) –Undirected graph and directed graph –Weighted graph and unweighted graph.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
On Distinguishing between Internet Power Law B Bu and Towsley Infocom 2002 Presented by.
Computer Science 1 Web as a graph Anna Karpovsky.
1 Measurement and Analysis of Online Social Networks A. Mislove, M. Marcon, K Gummadi, P. Druschel, B. Bhattacharjee Presentation by Yong Wang (Defense.
Analysis of Topological Characteristics of Huge Online Social Networking Services Y.-Y. Ahn, S. Han, H. Kwak, S. Moon, H. Jeong KAIST, Deajeon, South Korea.
Introduction to compact routing Dmitri Krioukov UCSD/CAIDA IDRWS 2004.
Social Media: YouTube as a Case. 2 New generation of video sharing service Feb.15th, 2005 Some statistics: 60 hours video uploaded very minute 4 billion.
A Measurement-driven Analysis of Information Propagation in the Flickr Social Network WWW09 报告人: 徐波.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Alan Frieze Charalampos (Babis) E. Tsourakakis WAW June ‘12 WAW '121.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
University of California at Santa Barbara Christo Wilson, Bryce Boe, Alessandra Sala, Krishna P. N. Puttaswamy, and Ben Zhao.
Analysis of Topological Characteristics of Huge Online Social Networking Services Friday 10am Telefonica Barcelona Yong-Yeol Ahn Seungyeop Han.
Network properties Slides are modified from Networks: Theory and Application by Lada Adamic.
Data Analysis in YouTube. Introduction Social network + a video sharing media – Potential environment to propagate an influence. Friendship network and.
WALKING IN FACEBOOK: A CASE STUDY OF UNBIASED SAMPLING OF OSNS junction.
Network Characterization via Random Walks B. Ribeiro, D. Towsley UMass-Amherst.
M EASUREMENT AND A NALYSIS OF O NLINE S OCIAL N ETWORKS Professor : Dr Sheykh Esmaili Presenters: Pourya Aliabadi Boshra Ardallani Paria Rakhshani 1.
A Measurement-driven Analysis of Information Propagation in the Flickr Social Network author: Meeyoung Cha Alan Mislove Krishna P. Gummadi From Saarbrucken,
COM1721: Freshman Honors Seminar A Random Walk Through Computing Lecture 2: Structure of the Web October 1, 2002.
COLOR TEST COLOR TEST. Social Networks: Structure and Impact N ICOLE I MMORLICA, N ORTHWESTERN U.
Murtaza Abbas Asad Ali. NETWORKOLOGY THE SCIENCE OF NETWORKS.
Understanding Crowds’ Migration on the Web Yong Wang Komal Pal Aleksandar Kuzmanovic Northwestern University
Shi Zhou University College London Second-order mixing in networks Shi Zhou University College London.
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
1. 2 CIShell Features A framework for easy integration of new and existing algorithms written in any programming language. CIShell Sci2 Tool NWB Tool.
A measurement-driven Analysis of Information Propagation in the Flickr Social Network Meeyoung Cha Alan Mislove Krisnna P. Gummadi.
User Interactions in Social Networks and their Implications Christo Wilson, Bryce Boe, Alessandra Sala, Krishna P. N. Puttaswamy, Ben Y. Zhao (UC Santa.
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Social World Connectivity among Indian Celebrities Made and Presented By : Harshit Bhatt.
Most of contents are provided by the website Network Models TJTSD66: Advanced Topics in Social Media (Social.
GRAPHS. Graph Graph terminology: vertex, edge, adjacent, incident, degree, cycle, path, connected component, spanning tree Types of graphs: undirected,
Models of Web-Like Graphs: Integrated Approach
1 Link Privacy in Social Networks Aleksandra Korolova, Rajeev Motwani, Shubha U. Nabar CIKM’08 Advisor: Dr. Koh, JiaLing Speaker: Li, HueiJyun Date: 2009/3/30.
GRAPH AND LINK MINING 1. Graphs - Basics 2 Undirected Graphs Undirected Graph: The edges are undirected pairs – they can be traversed in any direction.
Randolph’s Community Health Network RANDOLPH HEALTH SERVICE AREA JULY 2015.
Generative Model To Construct Blog and Post Networks In Blogosphere
The likelihood of linking to a popular website is higher
Department of Computer Science University of York
Graphs G = (V,E) V is the vertex set.
Presentation transcript:

Measurement and Analysis of Online Social Networks Alan Mislove,Massimiliano Marcon, Krishna P. Gummadi, Peter Druschel, Bobby Bhattacharjee Presented by Aleksandra Potapova

Focus graphs of online social networks – how they were obtained – how they were verified how measurement and analysis was performed properties of obtained graphs why these properties are relevant

What was studied? Flickr YouTube LiveJournal, Orkut

Why should we perform measurements and analysis in social networks? To design future online social network based systems To understand the impact of online social networks on the Internet To reduce the number of spam To improve security aspect

5 Summary of graph properties small-world power-law scale-free correlation between indegree and outdegree large strongly connected core of high-degree nodes surrounded by small clusters of low- degree nodes

Crawling Algorithms for large graphs BFS and DFS Snowball method(crawling only small subset of a graph by ending BFS early): – Partial BFS craws overestimate node degree and underestimate the level of symmetry. – In social networks, they underestimate the power- law coefficient, but closely match other metrics such as overall clustering coefficient.

How social networks should be crawled? The focus of the paper – WCC – Forward and reverse links should be used

How the graphs were obtained? API – users – groups – forward/backward links HTML Screen Scraping

How to Verify Samples 1.Obtain a random user sample – LJ: feature which returns 5,000 random users – Flickr: random 8-digit user id generation 2.Conduct a crawl using these random users as seeds 3.See if these random nodes connect to the original WCC 4.See what the graph structure of the newly crawled graph compares to original

Crawling Concerns – FW links no effect on largest WCC

11 Crawling Concerns – FW links increasing the size of the WCC by starting at a different seed

12 SiteYTFlickrLJOrkut Users(mill) Links(mill) symmetry79.1%62.0%73.5%100.0% Access (FW: Forward- only) (SS: HTML screen- scraping) API (users only) FW SS for group info API (users + groups) FW API (users + groups) FW + BW SS for users + groups

13 Link Symmetry even with directed links, there is a high level of symmetry possibly contributed to by informing users of new incoming links makes it harder to identify reputable sources due to dilution

14 Power-law node degrees Orkut deviates: – only 11.3% of network reached (effect of partial BFS crawl – Snowball method) – artificial cap of user’s number of outgoing links, leads to a distortion in distribution of high degrees differs from Web

15 Power-law node degrees

17 Correlation of indegree and outdegree over 50% of nodes have indegree within 20% of their outdegree

18 Path lengths and diameter all four networks have short path length

19 Link degree correlations JDD: joint degree distribution(how often nodes of different degree connect to each other) K nn --- mapping between outdegree and average indegree of all nodes connected to nodes of that outdegree – Used for aproxmation of JDD YouTube different due to extremely popular users being connected to by many unpopular users Orkut shows bump due to undersampling

Measurement and Analysis of Online Social Networks 20 Joint degree distribution and Scale-free behaviour undersampling of low-degree nodes celebrity-driven nature cap on links

Measurement and Analysis of Online Social Networks 21 Densely connected core removing 10% of core nodes results in breaking up graph into millions of very small SCCs graphs below show results as nodes are removed starting with highest- degree nodes (left) and path length as graph is constructed beginning with highest-degree nodes(right) Sub logarithmic growth

Measurement and Analysis of Online Social Networks 22 Tightly clustered fringe based on clustering coefficient social network graphs show stronger clustering, most likely due to mutual friends Possibly because personal content is not shared

Measurement and Analysis of Online Social Networks 23 Groups group sizes follow power-law distribution represent tightly clustered communities

Measurement and Analysis of Online Social Networks 24 Groups Orkut special case maybe because of partial crawl

Measurement and Analysis of Online Social Networks 25 Node Value Determination Directed Graph, current model nodes with many incoming links (hubs) have value due to their connection to many users it becomes easy to spread important information to the other nodes, e.g. DNS unhealthy in case of spam or viruses in order for a user to send spam, they have become a more important node, amass friends

Questions?