Mining information from social media

Slides:



Advertisements
Similar presentations
Community Detection and Graph-based Clustering
Advertisements

Social Media and Social Computing Chapter 1 1 Chapter 1, Community Detection and Mining in Social Media. Lei Tang and Huan Liu, Morgan & Claypool, September,
Social Media and Social Computing
Analysis and Modeling of Social Networks Foudalis Ilias.
Cognitive Publish/Subscribe for Heterogeneous Clouds Šarūnas Girdzijauskas, Swedish Institute of Computer Science (SICS) Joint work with:
Community Detection and Evaluation
Social Media Mining Chapter 5 1 Chapter 5, Community Detection and Mining in Social Media. Lei Tang and Huan Liu, Morgan & Claypool, September, 2010.
Intelligent Systems Lab. Recognizing Human actions from Still Images with Latent Poses Authors: Weilong Yang, Yang Wang, and Greg Mori Simon Fraser University,
Nodes, Ties and Influence
4. PREFERENTIAL ATTACHMENT The rich gets richer. Empirical evidences Many large networks are scale free The degree distribution has a power-law behavior.
1 Yuxiao Dong *$, Jie Tang $, Sen Wu $, Jilei Tian # Nitesh V. Chawla *, Jinghai Rao #, Huanhuan Cao # Link Prediction and Recommendation across Multiple.
Emergence of Scaling in Random Networks Barabasi & Albert Science, 1999 Routing map of the internet
Presentation Topic : Modeling Human Vaccinating Behaviors On a Disease Diffusion Network PhD Student : Shang XIA Supervisor : Prof. Jiming LIU Department.
Funding Networks Abdullah Sevincer University of Nevada, Reno Department of Computer Science & Engineering.
Lecture 8 Communities Slides modified from Huan Liu, Lei Tang, Nitin Agarwal.
Communities in Heterogeneous Networks Chapter 4 1 Chapter 4, Community Detection and Mining in Social Media. Lei Tang and Huan Liu, Morgan & Claypool,
Discovering Overlapping Groups in Social Media Xufei Wang, Lei Tang, Huiji Gao, and Huan Liu Arizona State University.
CS 728 Lecture 4 It’s a Small World on the Web. Small World Networks It is a ‘small world’ after all –Billions of people on Earth, yet every pair separated.
Web as Graph – Empirical Studies The Structure and Dynamics of Networks.
The Social Web: A laboratory for studying s ocial networks, tagging and beyond Kristina Lerman USC Information Sciences Institute.
Overview of Web Data Mining and Applications Part I
Leveraging Big Data: Lecture 11 Instructors: Edith Cohen Amos Fiat Haim Kaplan Tova Milo.
COMMUNITIES IN MULTI-MODE NETWORKS 1. Heterogeneous Network Heterogeneous kinds of objects in social media – YouTube Users, tags, videos, ads – Del.icio.us.
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
(Social) Networks Analysis III Prof. Dr. Daning Hu Department of Informatics University of Zurich Oct 16th, 2012.
Using Friendship Ties and Family Circles for Link Prediction Elena Zheleva, Lise Getoor, Jennifer Golbeck, Ugur Kuter (SNAKDD 2008)
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
Social Networking and On-Line Communities: Classification and Research Trends Maria Ioannidou, Eugenia Raptotasiou, Ioannis Anagnostopoulos.
Introduction The large amount of traffic nowadays in Internet comes from social video streams. Internet Service Providers can significantly enhance local.
Mark Levene, An Introduction to Search Engines and Web Navigation © Pearson Education Limited 2005 Slide 9.1 Chapter 9 : Social Networks What is a social.
Lecture 18 Community structures Slides modified from Huan Liu, Lei Tang, Nitin Agarwal.
Kristina Lerman Aram Galstyan USC Information Sciences Institute Analysis of Social Voting Patterns on Digg.
Popularity versus Similarity in Growing Networks Fragiskos Papadopoulos Cyprus University of Technology M. Kitsak, M. Á. Serrano, M. Boguñá, and Dmitri.
1 Computing with Social Networks on the Web (2008 slide deck) Jennifer Golbeck University of Maryland, College Park Jim Hendler Rensselaer Polytechnic.
Using Transactional Information to Predict Link Strength in Online Social Networks Indika Kahanda and Jennifer Neville Purdue University.
Artificial Neural Nets and AI Connectionism Sub symbolic reasoning.
Data Mining and Machine Learning Lab Network Denoising in Social Media Huiji Gao, Xufei Wang, Jiliang Tang, and Huan Liu Data Mining and Machine Learning.
Data Analysis in YouTube. Introduction Social network + a video sharing media – Potential environment to propagate an influence. Friendship network and.
Page 1 Ming Ji Department of Computer Science University of Illinois at Urbana-Champaign.
To Blog or Not to Blog: Characterizing and Predicting Retention in Community Blogs Imrul Kayes 1, Xiang Zuo 1, Da Wang 2, Jacob Chakareski 3 1 University.
A Graph-based Friend Recommendation System Using Genetic Algorithm
 Building Networks. First Decisions  What do the nodes represent?  What do the edges represent?  Know this before doing anything with data!
Emergence of Scaling and Assortative Mixing by Altruism Li Ping The Hong Kong PolyU
Class 10: Introduction to CINET Using CINET for network analysis and visualization Network Science: Introduction to CINET 2015 Prof. Boleslaw K. Szymanski.
Complexity Bruce Kogut October We are entering the epoch of the digitalization of knowledge: past, present, and future Sciences bring to this new.
Yongqin Gao, Greg Madey Computer Science & Engineering Department University of Notre Dame © Copyright 2002~2003 by Serendip Gao, all rights reserved.
Xintao Wu Jan 18, 2013 Retweeting Behavior and Spectral Graph Analysis in Social Media.
Lecture 10: Network models CS 765: Complex Networks Slides are modified from Networks: Theory and Application by Lada Adamic.
Comparing Snapshots of Networks Shah Jamal Alam and Ruth Meyer Centre for Policy Modelling 28 th March, 2007 – CAVES Bi-annual Meeting, IIASA,
Social Network Analysis. Outline l Background of social networks –Definition, examples and properties l Data in social networks –Data creation, flow and.
Network Community Behavior to Infer Human Activities.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
Lecture 7 Communities Slides modified from Huan Liu, Lei Tang, Nitin Agarwal.
Scalable Learning of Collective Behavior Based on Sparse Social Dimensions Lei Tang, Huan Liu CIKM ’ 09 Speaker: Hsin-Lan, Wang Date: 2010/02/01.
A Connectivity-Based Popularity Prediction Approach for Social Networks Huangmao Quan, Ana Milicic, Slobodan Vucetic, and Jie Wu Department of Computer.
Netlogo demo. Complexity and Networks Melanie Mitchell Portland State University and Santa Fe Institute.
Response network emerging from simple perturbation Seung-Woo Son Complex System and Statistical Physics Lab., Dept. Physics, KAIST, Daejeon , Korea.
Alan Mislove Bimal Viswanath Krishna P. Gummadi Peter Druschel.
CS570: Data Mining Spring 2010, TT 1 – 2:15pm Li Xiong.
Cmpe 588- Modeling of Internet Emergence of Scale-Free Network with Chaotic Units Pulin Gong, Cees van Leeuwen by Oya Ünlü Instructor: Haluk Bingöl.
Automated Experiments on Ad Privacy Settings
E-Commerce Theories & Practices
Social Media and Social Computing
Dieudo Mulamba November 2017
Using Friendship Ties and Family Circles for Link Prediction
Peer-to-Peer and Social Networks
Slides modified from Huan Liu, Lei Tang, Nitin Agarwal, Reza Zafarani
GANG: Detecting Fraudulent Users in OSNs
Presentation transcript:

Mining information from social media Contains: Classification of Social Behavior Link Prediction Viral Marketing/Outbreak Detection Network Modeling Social Dimensions

Classification of Social Behavior User Preference or Behavior can be represented as class labels In a given Web-Page, we can classify user behavior as: Whether or not clicking on an ad Whether or not interested in certain topics Like/Dislike a product In a social network sites Comments by friends Like/dislike of any posts/status In commercial web sites Like/dislike of product Views toward the product etc.

Visualization after Prediction Predictions 6: Non-Smoking 7: Non-Smoking 8: Smoking 9: Non-Smoking 10: Smoking : Smoking : Non-Smoking : ? Unknown Content Reference: Huan Liu, Lei Tang and Nitin Agarwal. Tutorial on Community Detection and Behavior Study for Social Computing. Presented in The 1st IEEE International Conference on Social Computing (SocialCom'09), 2009.

Link Prediction Given a social network, predict which nodes are likely to get connected Output a list of (ranked) pairs of nodes Example: Friend recommendation in Facebook Link Prediction (2, 3) (4, 12) (5, 7) (7, 13) Content Reference: Huan Liu, Lei Tang and Nitin Agarwal. Tutorial on Community Detection and Behavior Study for Social Computing. Presented in The 1st IEEE International Conference on Social Computing (SocialCom'09), 2009.

Viral Marketing/Outbreak Detection Users have different social capital (or network values) within a social network, hence, how can one make best use of this information? Viral Marketing: find out a set of users to provide coupons and promotions to influence other people in the network so my benefit is maximized Outbreak Detection: monitor a set of nodes that can help detect outbreaks or interrupt the infection spreading (e.g., H1N1 flu) Goal: given a limited budget, how to maximize the overall benefit? Content Reference: Huan Liu, Lei Tang and Nitin Agarwal. Tutorial on Community Detection and Behavior Study for Social Computing. Presented in The 1st IEEE International Conference on Social Computing (SocialCom'09), 2009.

An Example of Viral Marketing Find the coverage of the whole network of nodes with the minimum number of nodes How to realize it – an example Basic Greedy Selection: Select the node that maximizes the utility, remove the node and then repeat Select Node 1 Select Node 8 Select Node 7 Node 7 is not a node with high centrality! Content Reference: Huan Liu, Lei Tang and Nitin Agarwal. Tutorial on Community Detection and Behavior Study for Social Computing. Presented in The 1st IEEE International Conference on Social Computing (SocialCom'09), 2009.

Network Modeling Large Networks demonstrate statistical patterns: Small-world effect (e.g., 6 degrees of separation) Power-law distribution (a.k.a. scale-free distribution) Community structure (high clustering coefficient) Model the network dynamics Find a mechanism such that the statistical patterns observed in large-scale networks can be reproduced. Examples: random graph, preferential attachment process Used for simulation to understand network properties Thomas Shelling’s famous simulation: What could cause the segregation of white and black people Network robustness under attack He wondered if racial segregation might, in principle, have absolutely nothing to do with racism. First experiment – racism can cause segregation Second experiment – racially tolerant people might prefer to avoid being part of extreme minority (< 30%) Content Reference: Huan Liu, Lei Tang and Nitin Agarwal. Tutorial on Community Detection and Behavior Study for Social Computing. Presented in The 1st IEEE International Conference on Social Computing (SocialCom'09), 2009.

Comparing Network Models Last figure is a model of similar size to A. observations over various real-word large-scale networks outcome of a network model (Figures borrowed from “Emergence of Scaling in Random Networks”) Content Reference: Huan Liu, Lei Tang and Nitin Agarwal. Tutorial on Community Detection and Behavior Study for Social Computing. Presented in The 1st IEEE International Conference on Social Computing (SocialCom'09), 2009.

Social Dimensions 2 1 1 3 Actor Affiliation 1 Affiliation 2 1 2 3 … …… Affiliation 1 Affiliation 2 Affiliations of actors are represented as social dimensions Each Dimension represents one potential affiliation Social dimensions capture prominent interaction patterns presented in the network Based on this affiliation concept, we can define social dimensions. Social dimension captures how actors is interacted with others. For instance, here we have two affiliations. (Affiliation 1 and 2) then we can represent this information as social dimensions on the right. Each dimension in the table represents one potential affiliation. Actor 1 is involved in both affiliations, thus he has 1 marked on both dimensions. Then, these social dimensions can be considered as features for learning. Content Reference: Huan Liu, Lei Tang and Nitin Agarwal. Tutorial on Community Detection and Behavior Study for Social Computing. Presented in The 1st IEEE International Conference on Social Computing (SocialCom'09), 2009.

Approach II: Social-Dimension Approach (SocDim) Labels Training classifier Extract Potential Affiliations Prediction Predicted Labels Social Dimensions Training: Extract social dimensions to represent potential affiliations of actors Any community detection methods is applicable (block model, spectral clustering) Build a classifier to select those discriminative dimensions Any discriminative classifier is acceptable (SVM, Logistic Regression) Prediction: Predict labels based on one actor’s latent social dimensions No collective inference is necessary Based on the concept of social dimensions, we have the following framework to do relational learning. Given the network and some label information, we first extract actors’ social dimensions to represent their potential affiliations. This can be done using any soft clustering methods. Then, we consider the extracted social dimension as features and build a discriminative classifier. This step will automatically determine which affiliation are more correlated with the label and assign proper weights. For prediction, since the social dimensions for all the nodes are available, we just apply the classifier directly to make the prediction. No collective inference is necessary. Just one shot. Content Reference: Huan Liu, Lei Tang and Nitin Agarwal. Tutorial on Community Detection and Behavior Study for Social Computing. Presented in The 1st IEEE International Conference on Social Computing (SocialCom'09), 2009.

Underlying Assumption the label of one node is determined by its social dimension P(yi|A) = P(yi|Si) Community membership serves as latent features Content Reference: Huan Liu, Lei Tang and Nitin Agarwal. Tutorial on Community Detection and Behavior Study for Social Computing. Presented in The 1st IEEE International Conference on Social Computing (SocialCom'09), 2009.

Reference Huan Liu, Lei Tang and Nitin Agarwal. Tutorial on Community Detection and Behavior Study for Social Computing. Presented in The 1st IEEE International Conference on Social Computing (SocialCom'09), 2009. Lei Tang and Huan Liu. Community Detection and Mining in Social Media, Morgan & Claypool Publishers, 2010