Social Media and Social Computing

Slides:



Advertisements
Similar presentations
Workshop on Online Social Networks Microsoft Research Cambridge December 7, 2007.
Advertisements

Performance in Decentralized Filesharing Networks Theodore Hong Freenet Project.
An analysis of Social Network-based Sybil defenses Bimal Viswanath § Ansley Post § Krishna Gummadi § Alan Mislove ¶ § MPI-SWS ¶ Northeastern University.
Peer-to-peer and agent-based computing Peer-to-Peer Computing: Introduction.
Measurement and Analysis of Online Social Networks 1 A. Mislove, M. Marcon, K Gummadi, P. Druschel, B. Bhattacharjee Presentation by Shahan Khatchadourian.
Social Media Marketing Presented by Jacob Richman TechShoret Conference, February 26, 2009.
- A Powerful Computing Technology Department of Computer Science Wayne State University 1.
SOCIAL NETWORKS ANALYSIS SEMINAR INTRODUCTORY LECTURE Danny Hendler and Yehonatan Cohen Advanced Topics in on-line Social Networks Analysis.
Complex Network Theory
Options for communicating. A social network is a social structure made of individuals (or organizations) called "nodes," which are tied (connected) by.
Location-Based Social Networks Yu Zheng and Xing Xie Microsoft Research Asia Chapter 8 and 9 of the book Computing with Spatial Trajectories.
Community Detection and Graph-based Clustering
Scale Free Networks.
1 Dynamics of Real-world Networks Jure Leskovec Machine Learning Department Carnegie Mellon University
25 seconds left…...
Profile. 1.Open an Internet web browser and type into the web browser address bar. 2.You will see a web page similar to the one on.
The Small World Phenomenon: An Algorithmic Perspective Speaker: Bradford Greening, Jr. Rutgers University – Camden.
1 Analyzing Kleinberg’s Small-world Model Chip Martel and Van Nguyen Computer Science Department; University of California at Davis.
Small-world networks.
Social Media and Social Computing Chapter 1 1 Chapter 1, Community Detection and Mining in Social Media. Lei Tang and Huan Liu, Morgan & Claypool, September,
Social Media and Teaching Tools by Hongmei Chi
Analysis and Modeling of Social Networks Foudalis Ilias.
Social Media Mining Chapter 5 1 Chapter 5, Community Detection and Mining in Social Media. Lei Tang and Huan Liu, Morgan & Claypool, September, 2010.
Nodes, Ties and Influence
Advanced Topics in Data Mining Special focus: Social Networks.
Social Media Networking Sites Charlotte Jenkins Designing the Social Web
CS 599: Social Media Analysis University of Southern California1 The Basics of Network Analysis Kristina Lerman University of Southern California.
Complex network of the brain I Small world vs. scale-free networks Jaeseung Jeong, Ph.D. Department of Bio and Brain Engineering, KAIST.
Emergence of Scaling in Random Networks Barabasi & Albert Science, 1999 Routing map of the internet
Communities in Heterogeneous Networks Chapter 4 1 Chapter 4, Community Detection and Mining in Social Media. Lei Tang and Huan Liu, Morgan & Claypool,
Discovering Overlapping Groups in Social Media Xufei Wang, Lei Tang, Huiji Gao, and Huan Liu Arizona State University.
Sampling from Large Graphs. Motivation Our purpose is to analyze and model social networks –An online social network graph is composed of millions of.
Peer-to-Peer and Social Networks Introduction. What is a P2P network Uses the vast resource of the machines at the edge of the Internet to build a network.
COMMUNITIES IN MULTI-MODE NETWORKS 1. Heterogeneous Network Heterogeneous kinds of objects in social media – YouTube Users, tags, videos, ads – Del.icio.us.
Research Meeting Seungseok Kang Center for E-Business Technology Seoul National University Seoul, Korea.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
AVI/Psych 358/IE 340: Human Factors Web 2.0 November
Jure Leskovec Computer Science Department Cornell University / Stanford University Joint work with: Eric Horvitz, Michael Mahoney,
Section 8 – Ec1818 Jeremy Barofsky March 31 st and April 1 st, 2010.
Chapter 1. Social Media and Social Computing October 2012 Youn-Hee Han
Web Science Course Lecture: Social Networks - * Dr. Stefan Siersdorfer 1 * Figures from Easley and Kleinberg 2010 (
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
Small-world networks. What is it? Everyone talks about the small world phenomenon, but truly what is it? There are three landmark papers: Stanley Milgram.
COM1721: Freshman Honors Seminar A Random Walk Through Computing Lecture 2: Structure of the Web October 1, 2002.
COLOR TEST COLOR TEST. Social Networks: Structure and Impact N ICOLE I MMORLICA, N ORTHWESTERN U.
Complex Network Theory – An Introduction Niloy Ganguly.
Social Network Analysis. Outline l Background of social networks –Definition, examples and properties l Data in social networks –Data creation, flow and.
Complex Network Theory – An Introduction Niloy Ganguly.
Most of contents are provided by the website Network Models TJTSD66: Advanced Topics in Social Media (Social.
What Is A Network? (and why do we care?). An Introduction to Network Theory | Kyle Findlay | SAMRA 2010 | 2 “A collection of objects (nodes) connected.
CS:4980:0005 Peer-to-Peer and Social Networks Fall 2015 Introduction.
Mining information from social media
Web 2.0 Ali Ghandour Based on slides from: Clara Ko, EuropeanPWN Amsterdam.
Small World Social Networks With slides from Jon Kleinberg, David Liben-Nowell, and Daniel Bilar.
Social Information Processing March 26-28, 2008 AAAI Spring Symposium Stanford University
Analyzing Networks. Milgram’s Experiments “Six degrees of Separation” Milgram’s letters to various recruits in Nebraska who were asked to forward the.
Class 4: It’s a Small World After All Network Science: Small World February 2012 Dr. Baruch Barzel.
Netlogo demo. Complexity and Networks Melanie Mitchell Portland State University and Santa Fe Institute.
Social Media & Social Networking 101 Canadian Society of Safety Engineering (CSSE)
Topics In Social Computing (67810) Module 1 Introduction & The Structure of Social Networks.
GRAPH AND LINK MINING 1. Graphs - Basics 2 Undirected Graphs Undirected Graph: The edges are undirected pairs – they can be traversed in any direction.
CS:4980:0001 Peer-to-Peer and Social Networks Fall 2017
Social Networks Some content from Ding-Zhu Du, Lada Adamic, and Eytan Adar.
Social Networks Analysis
Social Media and Social Computing
SOCIAL Networks Introduction Modified from
Network Science: A Short Introduction i3 Workshop
Lecture 9: Network models CS 765: Complex Networks
Graph and Link Mining.
Advanced Topics in Data Mining Special focus: Social Networks
Presentation transcript:

Social Media and Social Computing Adapted from Chapter 1 Of Lei Tang and Huan Liu’s Book Chapter 1, Community Detection and Mining in Social Media.  Lei Tang and Huan Liu, Morgan & Claypool, September, 2010. 

Social Media: Many-to-Many Social Networking Blogs Microblogging Forum Wiki Sharing Content There are various of social media: like social network , blogs, wiki, forums content sharing. Most of them are an open platform so users can connect with each other, generate content. These web 2.0 applications gained a lot of online traffic recently. They provide rich information about online users, also new tasks and challenges.

Various forms of Social Media Blog: Wordpress, blogspot, LiveJournal Forum: Yahoo! Answers, Epinions Media Sharing: Flickr, YouTube, Scribd Microblogging: Twitter, FourSquare Social Networking: Facebook, LinkedIn, Orkut Social Bookmarking: Del.icio.us, Diigo Wikis: Wikipedia, scholarpedia, AskDrWiki

Characteristics of Social Media “Consumers” become “Producers” Rich User Interaction User-Generated Contents Collaborative environment Collective Wisdom Long Tail Broadcast Media Filter, then Publish Social Media Publish, then Filter

Top 20 Websites at USA 1 Google.com 11 Blogger.com 2 Facebook.com 12 msn.com 3 Yahoo.com 13 Myspace.com 4 YouTube.com 14 Go.com 5 Amazon.com 15 Bing.com 6 Wikipedia.org 16 AOL.com 7 Craigslist.org 17 LinkedIn.com 8 Twitter.com 18 CNN.com 9 Ebay.com 19 Espn.go.com 10 Live.com 20 Wordpress.com This is based on the Alex Report as of August 3, 2010 40% of websites are social media sites

Networks and Representation Social Network: A social structure made of nodes (individuals or organizations) and edges that connect nodes in various relationships like friendship, kinship etc. Graph Representation Matrix Representation

Basic Concepts A: the adjacency matrix V: the set of nodes E: the set of edges vi: a node vi e(vi, vj): an edge between node vi and vj Ni: the neighborhood of node vi di: the degree of node vi geodesic: a shortest path between two nodes geodesic distance

Properties of Large-Scale Networks Networks in social media are typically huge, involving millions of actors and connections. Large-scale networks in real world demonstrate similar patterns Scale-free distributions Small-world effect Strong Community Structure

Scale-free Distributions Degree distribution in large-scale networks often follows a power law. Nodes A.k.a. long tail distribution, scale-free distribution Degrees

log-log plot Power law distribution becomes a straight line if plot in a log-log scale Friendship Network in Flickr Friendship Network in YouTube

Small-World Effect “Six Degrees of Separation” A famous experiment conducted by Travers and Milgram (1969) Subjects were asked to send a chain letter to his acquaintance in order to reach a target person The average path length is around 5.5 Verified on a planetary-scale IM network of 180 million users (Leskovec and Horvitz 2008) The average path length is 6.6

The Milgram Experiment (Wikipedia) Basic procedure 1. Milgram typically chose individuals in the U.S. cities of Omaha, Nebraska and Wichita, Kansas to be the starting points and Boston, Massachusetts to be the end point of a chain of correspondence, because they were thought to represent a great distance in the United States, both socially and geographically. 2. Information packets were initially sent to "randomly" selected individuals in Omaha or Wichita. They included letters, which detailed the study's purpose, and basic information about a target contact person in Boston. It additionally contained a roster on which they could write their own name, as well as business reply cards that were pre-addressed to Harvard.

The Milgram Experiment (cont.) 3. Upon receiving the invitation to participate, the recipient was asked whether he or she personally knew the contact person described in the letter. If so, the person was to forward the letter directly to that person. For the purposes of this study, knowing someone "personally" was defined as knowing them on a first-name basis. 4. In the more likely case that the person did not personally know the target, then the person was to think of a friend or relative they know personally that is more likely to know the target. A postcard was also mailed to the researchers at Harvard so that they could track the chain's progression toward the target. 5. When and if the package eventually reached the contact person in Boston, the researchers could examine the roster to count the number of times it had been forwarded from person to person. Additionally, for packages that never reached the destination, the incoming postcards helped identify the break point in the chain.

Result of the Experiment However, a significant problem was that often people refused to pass the letter forward, and thus the chain never reached its destination. In one case, 232 of the 296 letters never reached the destination.[3] However, 64 of the letters eventually did reach the target contact. Among these chains, the average path length fell around 5.5 or six.

Diameter Measures used to calibrate the small world effect Diameter: the longest shortest path in a network Average shortest path length The shortest path between two nodes is called geodesic. The number of hops in the geodesic is the geodesic distance. The geodesic distance between node 1 and node 9 is 4. The diameter of the network is 5, corresponding to the geodesic distance between nodes 2 and 9.

Community Structure Community: People in a group interact with each other more frequently than those outside the group ki = number of edges among node Ni’s neighbors Friends of a friend are likely to be friends as well Measured by clustering coefficient: density of connections among one’s friends

Clustering Coefficient d6=4, N6= {4, 5, 7,8} k6=4 as e(4,5), e(5,7), e(5,8), e(7,8) C6 = 4/(4*3/2) = 2/3 Average clustering coefficient C = (C1 + C2 + … + Cn)/n C = 0.61 for the left network In a random graph, the expected coefficient is 14/(9*8/2) = 0.19.

Challenges Scalability Heterogeneity Evolution Collective Intelligence Social networks are often in a scale of millions of nodes and connections Traditional Network Analysis often deals with at most hundreds of subjects Heterogeneity Various types of entities and interactions are involved Evolution Timeliness is emphasized in social media Collective Intelligence How to utilize wisdom of crowds in forms of tags, wikis, reviews Evaluation Lack of ground truth, and complete information due to privacy

Social Computing Tasks Social Computing: a young and vibrant field Conferences: KDD, WSDM, WWW, ICML, AAAI/IJCAI, SocialCom, etc. Tasks Network Modeling Centrality Analysis and Influence Modeling Community Detection Classification and Recommendation Privacy, Spam and Security

Network Modeling Large Networks demonstrate statistical patterns: Small-world effect (e.g., 6 degrees of separation) Power-law distribution (a.k.a. scale-free distribution) Community structure (high clustering coefficient) Model the network dynamics Reproducing large-scale networks Examples: random graph, preferential attachment process, Watts and Strogatz model Simulation to understand network properties Thomas Shelling’s famous simulation: What could cause the segregation of white and black people Network robustness under attack He wondered if racial segregation might, in principle, have absolutely nothing to do with racism. First experiment – racism can cause segregation Second experiment – racially tolerant people might prefer to avoid being part of extreme minority (< 30%)

Centrality Analysis and Influence Modeling Identify the most important actors or edges E.g. PageRank in Google Various other criteria Influence modeling: How is information diffused? How does one influence each other? Related Problems Viral marketing: word-of-mouth effect Influence maximization

Community Detection A community is a set of nodes between which the interactions are (relatively) frequent A.k.a., group, cluster, cohesive subgroups, modules Applications: Recommendation based communities, Network Compression, Visualization of a huge network New lines of research in social media Community Detection in Heterogeneous Networks Community Evolution in Dynamic Networks Scalable Community Detection in Large-Scale Networks

Classification and Recommendation Common in social media applications Tag suggestion, Product/Friend/Group Recommendation Link prediction Network-Based Classification

Privacy, Spam and Security Privacy is a big concern in social media Facebook, Google buzz often appear in debates about privacy NetFlix Prize Sequel cancelled due to privacy concern Simple annoymization does not necessarily protect privacy Spam blog (splog), spam comments, Fake identity, etc., all requires new techniques As private information is involved, a secure and trustable system is critical Need to achieve a balance between sharing and privacy

Two Books: Huan Liu and Lei Tang Book Available at Morgan & claypool Publishers Amazon If you have any comments, please feel free to contact: Lei Tang, Yahoo! Labs, ltang@yahoo-inc.com Huan Liu, ASU huanliu@asu.edu

Book 2: available online Networks, Crowds, and Markets: Reasoning About a Highly Connected World By David Easley and Jon Kleinberg