Analysis of Social Media MLD 10-802, LTI 11-772 William Cohen 9-25-12.

Slides:



Advertisements
Similar presentations
Scale Free Networks.
Advertisements

Algorithmic and Economic Aspects of Networks Nicole Immorlica.
Analysis and Modeling of Social Networks Foudalis Ilias.
Lecture 21 Network evolution Slides are modified from Jurij Leskovec, Jon Kleinberg and Christos Faloutsos.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Analysis of Social Media MLD , LTI William Cohen
Synopsis of “Emergence of Scaling in Random Networks”* *Albert-Laszlo Barabasi and Reka Albert, Science, Vol 286, 15 October 1999 Presentation for ENGS.
Models of Network Formation Networked Life NETS 112 Fall 2013 Prof. Michael Kearns.
Generative Models for the Web Graph José Rolim. Aim Reproduce emergent properties: –Distribution site size –Connectivity of the Web –Power law distriubutions.
Information Retrieval Lecture 8 Introduction to Information Retrieval (Manning et al. 2007) Chapter 19 For the MSc Computer Science Programme Dell Zhang.
Advanced Topics in Data Mining Special focus: Social Networks.
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
CSE 522 – Algorithmic and Economic Aspects of the Internet Instructors: Nicole Immorlica Mohammad Mahdian.
1 Evolution of Networks Notes from Lectures of J.Mendes CNR, Pisa, Italy, December 2007 Eva Jaho Advanced Networking Research Group National and Kapodistrian.
Emergence of Scaling in Random Networks Barabasi & Albert Science, 1999 Routing map of the internet
Graphs (Part I) Shannon Quinn (with thanks to William Cohen of CMU and Jure Leskovec, Anand Rajaraman, and Jeff Ullman of Stanford University)
Directional triadic closure and edge deletion mechanism induce asymmetry in directed edge properties.
Network Models Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Network Models Why should I use network models? In may 2011, Facebook.
The Structure of Networks with emphasis on information and social networks RU T-214-SINE Summer 2011 Ýmir Vigfússon.
The Barabási-Albert [BA] model (1999) ER Model Look at the distribution of degrees ER ModelWS Model actorspower grid www The probability of finding a highly.
1 Complex systems Made of many non-identical elements connected by diverse interactions. NETWORK New York Times Slides: thanks to A-L Barabasi.
Network Statistics Gesine Reinert. Yeast protein interactions.
Peer-to-Peer and Grid Computing Exercise Session 3 (TUD Student Use Only) ‏
Common Properties of Real Networks. Erdős-Rényi Random Graphs.
CS Lecture 6 Generative Graph Models Part II.
Advanced Topics in Data Mining Special focus: Social Networks.
1 Algorithms for Large Data Sets Ziv Bar-Yossef Lecture 7 May 14, 2006
Peer-to-Peer and Social Networks Random Graphs. Random graphs E RDÖS -R ENYI MODEL One of several models … Presents a theory of how social webs are formed.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Web Science Course Lecture: Social Networks - * Dr. Stefan Siersdorfer 1 * Figures from Easley and Kleinberg 2010 (
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
COLOR TEST COLOR TEST. Social Networks: Structure and Impact N ICOLE I MMORLICA, N ORTHWESTERN U.
Emergence of Scaling and Assortative Mixing by Altruism Li Ping The Hong Kong PolyU
Online Social Networks and Media
Social Network Analysis Prof. Dr. Daning Hu Department of Informatics University of Zurich Mar 5th, 2013.
Graph Algorithms: Properties of Graphs? William Cohen.
Professor Yashar Ganjali Department of Computer Science University of Toronto
Class 9: Barabasi-Albert Model-Part I
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Analysis of Social Media MLD , LTI William Cohen
Most of contents are provided by the website Network Models TJTSD66: Advanced Topics in Social Media (Social.
Analysis of Social Media MLD , LTI William Cohen
How Do “Real” Networks Look?
1 CIS 4930/6930 – Recent Advances in Bioinformatics Spring 2014 Network models Tamer Kahveci.
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
Hierarchical Organization in Complex Networks by Ravasz and Barabasi İlhan Kaya Boğaziçi University.
Network (graph) Models
Lecture 23: Structure of Networks
Topics In Social Computing (67810)
How Do “Real” Networks Look?
Lecture 23: Structure of Networks
Generative Model To Construct Blog and Post Networks In Blogosphere
How Do “Real” Networks Look?
How Do “Real” Networks Look?
Models of Network Formation
Lecture 13 Network evolution
The likelihood of linking to a popular website is higher
Models of Network Formation
Peer-to-Peer and Social Networks Fall 2017
Models of Network Formation
How Do “Real” Networks Look?
Models of Network Formation
Peer-to-Peer and Social Networks
Lecture 23: Structure of Networks
Lecture 9: Network models CS 765: Complex Networks
Lecture 21 Network evolution
Network Science: A Short Introduction i3 Workshop
Diffusion in Networks
Network Models Michael Goodrich Some slides adapted from:
Presentation transcript:

Analysis of Social Media MLD , LTI William Cohen

Administrivia William’s office hours: 1:00-2:00 Friday or by appointment but not this week, 9/21 Wiki access: send to Katie Rivard First assignment….

Assignments Monday 9/24, by 10am: – send us 3 papers you plan to summarize title, authors, ptr to on-line version – any not-yet-summarized papers on this years or last year’s syllabus are pre-approved search the wiki to make sure it’s new! also, don’t propose papers I’ve presented in class or, you can propose another paper I don’t know about – you can work in groups if you like - but you still need to do 3 papers per person eg if this will be a group for the project – hint: short papers are not necessarily easier to read Thus 9/27 at 10:30am: – first summary is due – structured summary + the study plan you followed Example of a structured summary: Example of a study plan: to be posted – if the paper contains a novel Problem, Method, or Dataset then also add a page for it to the wiki Tues 10/2: – three summaries per student on the wiki Thus 10/4: – project proposal - first draft - one per student – we will allow lit review projects - more on this later Tues 10/9: – form project teams of 2-3 people Now

Syllabus 1. Background(6-8 wks, mostly me): – Opinion mining and sentiment analysis. Pang & Li, FnTIR – Properties of social networks. Easley & Kleinberg, ch 1-5, ; plus some other stuff. – Stochastic graph models. Goldenberg et al, – Models for graphs and text. [Recent papers] Analysis Language People Networks Social Media

Schoolkids

Heart study participants

citations

What are we trying to do? Normal curve: Data to fit Underlying process that “explains” the data Closed-form parametric form Only models a small part of the data

Graphs Some common properties of graphs: – Distribution of node degrees – Distribution of cliques (e.g., triangles) – Distribution of paths Diameter (max shortest- path) Effective diameter (90 th percentile) Connected components – … Some types of graphs to consider: – Real graphs (social & otherwise) – Generated graphs: Erdos-Renyi “Bernoulli” or “Poisson” Watts-Strogatz “small world” graphs Barbosi-Albert “preferential attachment” …

Random graph (Erdos Renyi) Take n nodes, and connect each pair with probability p – Mean degree is z=p(n-1) – More precisely, the distribution of degrees is approximately Gaussian

Erdos-Renyi graphs Take n nodes, and connect each pair with probability p – Mean degree is z=p(n-1) – Mean number of neighbors distance d from v is z d – How large does d need to be so that z d >=n ? If z>1, d = log(n)/log(z) If z<1, you can’t do it – So: If z>1, diameters tend to be small (relative to n)

Graphs Some common properties of graphs: – Distribution of node degrees – Distribution of cliques (e.g., triangles) – Distribution of paths Diameter (max shortest- path) Effective diameter (90 th percentile) Connected components – … Some types of graphs to consider: – Real graphs (social & otherwise) – Generated graphs: Erdos-Renyi “Bernoulli” or “Poisson” Watts-Strogatz “small world” graphs Barbosi-Albert “preferential attachment” …

Degree distribution Plot cumulative degree – X axis is degree – Y axis is #nodes that have degree at least k Typically use a log-log scale – Straight lines are a power law; normal curve dives to zero at some point – Left: trust network in epinions web site from Richardson & Domingos

Degree distribution Plot cumulative degree – X axis is degree – Y axis is #nodes that have degree at least k Typically use a log-log scale – Straight lines are a power law; normal curve dives to zero at some point This defines a “scale” for the network – Left: trust network in epinions web site from Richardson & Domingos

Friendship network in Framington Heart Study

Graphs Some common properties of graphs: – Distribution of node degrees: often scale-free – Distribution of cliques (e.g., triangles) – Distribution of paths Diameter (max shortest- path) Effective diameter (90 th percentile) often small Connected components usually one giant CC – … Some types of graphs to consider: – Real graphs (social & otherwise) – Generated graphs: Erdos-Renyi “Bernoulli” or “Poisson” Watts-Strogatz “small world” graphs Barbosi-Albert “preferential attachment” generates scale-free graphs …

Barabasi-Albert Networks Science 286 (1999) Start from a small number of node, add a new node with m links Preferential Attachment Probability of these links to connect to existing nodes is proportional to the node’s degree ‘Rich gets richer’ This creates ‘hubs’: few nodes with very large degrees

Preferential attachment (Barabasi-Albert) Random graph (Erdos Renyi)

Graphs Some common properties of graphs: – Distribution of node degrees: often scale-free – Distribution of cliques (e.g., triangles) – Distribution of paths Diameter (max shortest- path) Effective diameter (90 th percentile) often small Connected components usually one giant CC – … Some types of graphs to consider: – Real graphs (social & otherwise) – Generated graphs: Erdos-Renyi “Bernoulli” or “Poisson” Watts-Strogatz “small world” graphs Barbosi-Albert “preferential attachment” generates scale-free graphs …

Homophily One definition: excess edges between similar nodes – E.g., assume nodes are male and female and Pr(male)=p, Pr(female)=q. – Is Pr(gender(u)≠ gender(v) | edge (u,v)) >= 2pq? Another def’n: excess edges between common neighbors of v

Homophily Another def’n: excess edges between common neighbors of v

Homophily In a random Erdos-Renyi graph: In natural graphs two of your mutual friends might well be friends: Like you they are both in the same class (club, field of CS, …) You introduced them

Watts-Strogatz model Start with a ring Connect each node to k nearest neighbors  homophily Add some random shortcuts from one point to another  small diameter Degree distribution not scale-free Generalizes to d dimensions

A big question Which is cause and which is effect? – Do birds of a feather flock together? – Do you change your behavior based on the behavior of your peers? In natural graphs two of your mutual friends might well be friends: Like you they are both in the same class (club, field of CS, …) You introduced them

A big question Which is cause and which is effect? – Do birds of a feather flock together? – Do you change your behavior based on the behavior of your peers? How can you tell? – Look at when links are added and see what patterns emerge: Pr(new link btwn u and v | #common friends)

Final example: spatial segregation How picky do people have to be about their neighbors for homophily to arise?

Thomas Schelling - spatial segregation models

Final example: spatial segregation Imagine a grid world where – Agents are red or blue – Agents move to a random location if they are unhappy Agents are happy unless <k neighbors are the same color they are (k= i.e., they prefer not to be in a small minority – What’s the result over time? –

First guest lecture on Thursday…. Cosma Shalizi “it does give an accurate impression of my desk (and hair), and people recognize me from it at meetings.”