1 Social Influence and Sentiment Analysis —From Sentiment to Emotion Analysis in Social Networks Jie Tang Department of Computer Science and Technology.

Slides:



Advertisements
Similar presentations
1 From Sentiment to Emotion Analysis in Social Networks Jie Tang Department of Computer Science and Technology Tsinghua University, China.
Advertisements

Entity-Centric Topic-Oriented Opinion Summarization in Twitter Date : 2013/09/03 Author : Xinfan Meng, Furu Wei, Xiaohua, Liu, Ming Zhou, Sujian Li and.
Multi-label Relational Neighbor Classification using Social Context Features Xi Wang and Gita Sukthankar Department of EECS University of Central Florida.
Active Learning for Streaming Networked Data Zhilin Yang, Jie Tang, Yutao Zhang Computer Science Department, Tsinghua University.
One Theme in All Views: Modeling Consensus Topics in Multiple Contexts Jian Tang 1, Ming Zhang 1, Qiaozhu Mei 2 1 School of EECS, Peking University 2 School.
Confluence: Conformity Influence in Large Social Networks
Modelling Paying Behavior in Game Social Networks Zhanpeng Fang +, Xinyu Zhou +, Jie Tang +, Wei Shao #, A.C.M. Fong *, Longjun Sun #, Ying Ding -, Ling.
Social Media Mining Chapter 5 1 Chapter 5, Community Detection and Mining in Social Media. Lei Tang and Huan Liu, Morgan & Claypool, September, 2010.
1 Social Influence Analysis in Large-scale Networks Jie Tang 1, Jimeng Sun 2, Chi Wang 1, and Zi Yang 1 1 Dept. of Computer Science and Technology Tsinghua.
1 Yuxiao Dong *$, Jie Tang $, Sen Wu $, Jilei Tian # Nitesh V. Chawla *, Jinghai Rao #, Huanhuan Cao # Link Prediction and Recommendation across Multiple.
Graph Data Management Lab School of Computer Science , Bristol, UK.
1 1 Chenhao Tan, 1 Jie Tang, 2 Jimeng Sun, 3 Quan Lin, 4 Fengjiao Wang 1 Department of Computer Science and Technology, Tsinghua University, China 2 IBM.
Funding Networks Abdullah Sevincer University of Nevada, Reno Department of Computer Science & Engineering.
Forecasting with Twitter data Presented by : Thusitha Chandrapala MARTA ARIAS, ARGIMIRO ARRATIA, and RAMON XURIGUERA.
1 Computational Models for Micro-level Social Network Analysis Jie Tang Tsinghua University, China.
1 Social Influence and Information Diffusion Jie Tang Department of Computer Science and Technology Tsinghua University.
Social Network Analysis via Factor Graph Model
Modelling Paying Behavior in Game Social Networks Zhanpeng Fang +, Xinyu Zhou +, Jie Tang +, Wei Shao #, A.C.M. Fong *, Longjun Sun #, Ying Ding -, Ling.
1 1 Chenhao Tan, 1 Jie Tang, 2 Jimeng Sun, 3 Quan Lin, 4 Fengjiao Wang 1 Department of Computer Science and Technology, Tsinghua University, China 2 IBM.
Active Learning for Networked Data Based on Non-progressive Diffusion Model Zhilin Yang, Jie Tang, Bin Xu, Chunxiao Xing Dept. of Computer Science and.
SOCIAL NETWORKS AND THEIR IMPACTS ON BRANDS Edwin Dionel Molina Vásquez.
Modeling Relationship Strength in Online Social Networks Rongjing Xiang: Purdue University Jennifer Neville: Purdue University Monica Rogati: LinkedIn.
Data Mining and Machine Learning Lab Network Denoising in Social Media Huiji Gao, Xufei Wang, Jiliang Tang, and Huan Liu Data Mining and Machine Learning.
1 From Sentiment to Emotion Analysis in Social Networks Jie Tang Department of Computer Science and Technology Tsinghua University, China.
Influence Maximization in Dynamic Social Networks Honglei Zhuang, Yihan Sun, Jie Tang, Jialin Zhang, Xiaoming Sun.
Understanding the Emotional Impact of Images Jia JIA Computer Science, Tsinghua University Joint work with Xiaohui WANG, Peiyun HU, Sen WU, Jie TANG and.
Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/ :09 PM.
1 Yang Yang *, Yizhou Sun +, Jie Tang *, Bo Ma #, and Juanzi Li * Entity Matching across Heterogeneous Sources *Tsinghua University + Northeastern University.
Mining Social Network for Personalized Prioritization Language Techonology Institute School of Computer Science Carnegie Mellon University Shinjae.
The Database and Info. Systems Lab. University of Illinois at Urbana-Champaign User Profiling in Ego-network: Co-profiling Attributes and Relationships.
Department of Electrical Engineering and Computer Science Kunpeng Zhang, Yu Cheng, Yusheng Xie, Doug Downey, Ankit Agrawal, Alok Choudhary {kzh980,ych133,
1 Panther: Fast Top-K Similarity Search on Large Networks Jing Zhang 1, Jie Tang 1, Cong Ma 1, Hanghang Tong 2, Yu Jing 1, and Juanzi Li 1 1 Department.
Click to Add Title A Systematic Framework for Sentiment Identification by Modeling User Social Effects Kunpeng Zhang Assistant Professor Department of.
1 From Sentiment to Emotion Analysis in Social Networks Jie Tang Department of Computer Science and Technology Tsinghua University, China.
Towards Social User Profiling: Unified and Discriminative Influence Model for Inferring Home Locations Rui Li, Shengjie Wang, Hongbo Deng, Rui Wang, Kevin.
1 Yang Yang *, Yizhou Sun +, Jie Tang *, Bo Ma #, and Juanzi Li * Entity Matching across Heterogeneous Sources *Tsinghua University + Northeastern University.
Lecture 5-1 Positive Influence Dominating Set Ding-Zhu Du Univ of Texas at Dallas.
Unsupervised Streaming Feature Selection in Social Media
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Mining Advisor-Advisee Relationships from Research Publication.
A Connectivity-Based Popularity Prediction Approach for Social Networks Huangmao Quan, Ana Milicic, Slobodan Vucetic, and Jie Wu Department of Computer.
1 Zi Yang Tsinghua University Joint work with Prof. Jie Tang, Prof. Juanzi Li, Dr. Keke Cai, Jingyi Guo, Chi Wang, etc. July 21, 2011, CASIN 2011, Tsinghua.
1 Zi Yang Tsinghua University Joint work with Prof. Jie Tang, Prof. Juanzi Li, Dr. Keke Cai, Jingyi Guo, Chi Wang, etc. July 21, 2011, CASIN 2011, Tsinghua.
 DM-Group Meeting Liangzhe Chen, Oct Papers to be present  RSC: Mining and Modeling Temporal Activity in Social Media  KDD’15  A. F. Costa,
Topic Modeling for Short Texts with Auxiliary Word Embeddings
Social networks that matter: Twitter under the microscope
MINING DEEP KNOWLEDGE FROM SCIENTIFIC NETWORKS
Emotion propagation in online communities
Greedy & Heuristic algorithms in Influence Maximization
DM-Group Meeting Liangzhe Chen, Nov
Cross-lingual Knowledge Linking Across Wiki Knowledge Bases
Rosta Farzan and Keyang Zheng, School of Computing and Information
Personalized Social Image Recommendation
Summary Presented by : Aishwarya Deep Shukla
Social Role-Aware Emotion Contagion in Image Social Networks
E-Commerce Theories & Practices
Data Science Process Chapter 2 Rich's Training 11/13/2018.
Weakly Learning to Match Experts in Online Community
MEgo2Vec: Embedding Matched Ego Networks for User Alignment Across Social Networks Jing Zhang+, Bo Chen+, Xianming Wang+, Fengmei Jin+, Hong Chen+, Cuiping.
Structural influence:
Binghui Wang, Le Zhang, Neil Zhenqiang Gong
Socialized Word Embeddings
Graph-based Security and Privacy Analytics via Collective Classification with Joint Weight Learning and Propagation Binghui Wang, Jinyuan Jia, and Neil.
GANG: Detecting Fraudulent Users in OSNs
Hierarchical Relational Models for Document Networks
“The Spread of Physical Activity Through Social Networks”
GhostLink: Latent Network Inference for Influence-aware Recommendation
Yingze Wang and Shi-Kuo Chang University of Pittsburgh
Modeling Topic Diffusion in Scientific Collaboration Networks
Presentation transcript:

1 Social Influence and Sentiment Analysis —From Sentiment to Emotion Analysis in Social Networks Jie Tang Department of Computer Science and Technology Tsinghua University, China

2 Networked World 1.65 billion MAU (users) 2.5 trillion minutes/month 1.65 billion MAU (users) 2.5 trillion minutes/month 220 million users influencing our daily life 220 million users influencing our daily life QQ: 800 million MAU WeChat: 700 million MAU QQ: 800 million MAU WeChat: 700 million MAU 255 million MAU Peak: 143K tweets/s 255 million MAU Peak: 143K tweets/s 304 million active users 14 billion items/year 304 million active users 14 billion items/year 710 million trans. on 11/ billion USD in 24 hrs 710 million trans. on 11/ billion USD in 24 hrs

3 The Era of Big Social Data We generate 2.5x10 18 byte big data per day. Big social data: –90% of the data was generated in the past 2 yrs –Mining in single data center  mining deep knowledge from multiple data sources 28% Global Population Penetration Rate Hours Spent on Social Media each day

4 User Opinion and Influence: “Love Obama” I love Obama Obama is great! Obama is fantastic I hate Obama, the worst president ever He cannot be the next president! No Obama in 2012! Positive Negative

5 Does Social Influence really matter? Case 1: Social influence and political mobilization [1] –Will online political mobilization really work? [1] R. M. Bond, C. J. Fariss, J. J. Jones, A. D. I. Kramer, C. Marlow, J. E. Settle and J. H. Fowler. A 61-million-person experiment in social influence and political mobilization. Nature, 489: , A controlled trial (with 61M users on FB) -Social msg group: was shown with msg that indicates one’s friends who have made the votes. -Informational msg group: was shown with msg that indicates how many other. -Control group: did not receive any msg. A controlled trial (with 61M users on FB) -Social msg group: was shown with msg that indicates one’s friends who have made the votes. -Informational msg group: was shown with msg that indicates how many other. -Control group: did not receive any msg.

6 Case 1: Social Influence and Political Mobilization Social msg group v.s. Info msg group Result: The former were 2.08% (t- test, P<0.01) more likely to click on the “I Voted” button Social msg group v.s. Info msg group Result: The former were 2.08% (t- test, P<0.01) more likely to click on the “I Voted” button Social msg group v.s. Control group Result: The former were 0.39% (t- test, P=0.02) more likely to actually vote (via examination of public voting records) Social msg group v.s. Control group Result: The former were 0.39% (t- test, P=0.02) more likely to actually vote (via examination of public voting records) [1] R. M. Bond, C. J. Fariss, J. J. Jones, A. D. I. Kramer, C. Marlow, J. E. Settle and J. H. Fowler. A 61-million-person experiment in social influence and political mobilization. Nature, 489: , 2012.

7 Twitter Data Twitter –1,414,340 users and 480,435,500 tweets –274,644,047 t-follow edges and edges [1] Chenhao Tan, Lillian Lee, Jie Tang, Long Jiang, Ming Zhou, and Ping Li. User-level sentiment analysis incorporating social networks. In KDD’11, pages 1397–1405, 2011.

8 From text sentiment to user sentiment Obama is making the repubs look silly and petty. Classifier with dictionary Positive Negative Only thing we have 2 fear is Obama himself & Pelosi & Cong & liberal news & Dems &... Barack Obama can no more disown ACORN than he could disown his own grandmother. However, the social text is really short and noisy … Classifier Positive Negative User-level Sentiment Analysis User A

9 From user sentiment to network sentiment I love Obama Obama is great! Obama is fantastic I hate Obama, the worst president ever He cannot be the next president! No Obama in 2012! Positive Negative Who influenced who? What is the influence probability? Can we leverage the social influence to help sentiment analysis? 2

10 Sentiment Influence in Twitter Shared sentiment conditioned on type of connection. —people tend to follow the opinion of their friends [1] Chenhao Tan, Lillian Lee, Jie Tang, Long Jiang, Ming Zhou, and Ping Li. User-level sentiment analysis incorporating social networks. In KDD’11, pages 1397–1405, 2011.

11 Selection Connectedness conditioned on labels —people tend to create relationships with other people who share the same opinion with them

12 Learning for network sentiment analysis I love Obama Obama is great! Obama is fantastic I hate Obama, the worst president ever He cannot be the next president! No Obama in 2012! Positive Negative Networked Classification Model: Learning for sentiment analysis by considering the network information Another challenge: labeled data is very limited…

13 [1] Chenhao Tan, Lillian Lee, Jie Tang, Long Jiang, Ming Zhou, and Ping Li. User-level sentiment analysis incorporating social networks. In KDD’11, pages 1397–1405, Semi-supervised Factor Graph Model Tweets by user v 3 Semi-FGM: learning to classify sentiments by considering both content and network structure in a semi-supervised fashion. Social link User-specific attributes indicate our confidence level in labeled/unlabeled users

14 [1] Chenhao Tan, Lillian Lee, Jie Tang, Long Jiang, Ming Zhou, and Ping Li. User-level sentiment analysis incorporating social networks. In KDD’11, pages 1397–1405, Semi-supervised Factor Graph Model Tweets by user v 3 Semi-FGM: learning to classify sentiments by considering both content and network structure in a semi-supervised fashion. Social link indicate our confidence level in network-based influence User-user factor

15 Semi-supervised Factor Graph Model +

16 Parameter Estimation for Semi-FGM “NoLearning”: simply use counts from the labeled subset of the data SampleRank (“Learning”): A sampling-based learning algorithm using Metropolis–Hastings the subset of edges in our dataset in which both endpoints are labeled indicator function

17 SampleRank (“Learning”) likelihood ratio of new sample Y new and previous label Y for all users Relative performance between new sample Y new and previous label Y on labeled user only. Update model parameters when two results are inconsistent

18 Results of network sentiment analysis Twitter –1,414,340 users and 480,435,500 tweets –274,644,047 t-follow edges and edges Methods –SVM Vote –Semi-FGM (NoLearning) –Semi-FGM (SampleRank) Measures –Accuracy and Macro F1

19

20 Performance

21 Performance Analysis in Different Topics

22 Results of Different Learning Algorithms

23 Twitter to Weibo

24 We have a picture of sentiment analysis in social networks… From text sentiment to user sentiment From user sentiment to network sentiment Challenges: –Short text and noisy data –Limited labeled data –Networked user sentiments Proposal of a Semi-supervised Factor Graph Model (Semi-FGM) to learn to classify sentiments by considering both content and network structure

25 Now, let us think… What are the fundamental factors behind –What is behind the network of social users? –What is behind the sentiment of social users?

26 Info. Space Social Space Interaction Understanding the mechanism of interaction dynamics Info. Space vs. Social Space Well, what is the fundamental factor… From the social network research perspective, what are the fundamental factors behind?

27 Topic-based Social Influence Analysis Positive Negative output I love Obama I hate Obama Politics Entertainment Trademarks Market Strategy Politics How to? Topics

28 The Solution: Topical Affinity Propagation [1] Jie Tang, Jimeng Sun, Chi Wang, and Zi Yang. Social Influence Analysis in Large-scale Networks. In KDD, pages , Market Strategy Basic Idea: If a user is located in the center of a “Market” community, and is “similar” to the other users, then she/he would have a strong influence on the other users. —Homophily theory

29 The Solution: Topical Affinity Propagation [1] Jie Tang, Jimeng Sun, Chi Wang, and Zi Yang. Social Influence Analysis in Large-scale Networks. In KDD, pages , Politics Market Strategy Define a function to quantify the similarity between neighborhood users Estimate how a user can represent his neighbors The topic information can be obtained by any tagging system or topic modeling approach How “Ada” thought he influenced “Bob”? How “Bob” thought he was influenced by “Ada”?

30 Topical Factor Graph (TFG) Model Node/user Nodes that have the highest influence on the current node The problem is cast as identifying which node has the highest probability to influence another node on a specific topic along with the edge. Social link User-specific attributes Asymmetric similarity Topological feature or global constraint

31 The learning task is to find a configuration for all {y i } to maximize the joint probability. Topical Factor Graph (TFG) Objective function: 1. How to define? 2. How to optimize?

32 How to define (topical) feature functions? –Node feature function –Edge feature function –Global feature function similarity or simply binary

33 Model Learning Algorithm Sum-product: - Low efficiency! - Not easy for distributed learning!

34 New TAP Learning Algorithm 1. Introduce two new variables r and a, to replace the original message m. 2. Design new update rules: m ij [1] Jie Tang, Jimeng Sun, Chi Wang, and Zi Yang. Social Influence Analysis in Large-scale Networks. In KDD, pages , How user i thought he influenced user j? How user j thought he was influenced by user i?

35 The TAP Learning Algorithm

36 Experiments Data&Codes: ( Evaluation measures –CPU time –Case study –Application Data set#Nodes#Edges Coauthor640,1341,554,643 Citation2,329,76012,710,347 Film (Wikipedia) 18,518 films 7,211 directors 10,128 actors 9,784 writers 142,426

37 Social Influence Sub-graph on “Data mining” On “Data Mining” in 2009

38 Now, let us think… What are the fundamental factors behind –What is behind the network of social users? –What is behind the sentiment of social users? What drives users’ sentiments?

39 Sentiment vs. Emotion Charles Darwin: –Emotion serves as a purpose for humans in aiding their survival during the evolution. [1] Emotion is the driving force of user’s sentiments… Emotion stimulates the mind 3000 times quicker than rational thought! [1] Charles Darwin. The Expression of Emotions in Man and Animals. John Murray, 1872.

40 Potential Directions From sentiment to emotion analysis? Add social theories into emotion analysis? Sentiment/emotion analysis for “Social Good”?

41 Was Anna Happy When She Published This Photo On Flickr? A lovely doorplate Anna: a girl who just graduated

42 Was Anna Happy When She Published This Photo On Flickr?

43 Problem [1] Yang Yang, Jia Jia, Shumei Zhang, Boya Wu, Qicong Chen, Juanzi Li, Chunxiao Xing, and Jie Tang. How Do Your Friends on Social Media Disclose Your Emotions? In AAAI'14. pp

44 Emotion Learning Method [1] Yang Yang, Jia Jia, Shumei Zhang, Boya Wu, Qicong Chen, Juanzi Li, Chunxiao Xing, and Jie Tang. How Do Your Friends on Social Media Disclose Your Emotions? In AAAI'14. pp

45 Flickr Data 354,192 images posted by 4,807 users –For each image, we also collect its tags and all comments. –Thus we get 557,177 comments posted by 6,735 users in total Infer emotion of users by considering both image and tag/comments

46 Emotion Inference SVM: regards the visual features of images as inputs and uses a SVM as a classifier. PFG: considers both color features and social correlations among images. LDA+SVM: first uses LDA to extract latent topics from comments, then uses visual features, topic distributions, and social ties as features to train a SVM. Averagely +37.4% in terms of F1

47 To What Extend Your Friends Can Disclose Your Emotions? -Comments stands for the proposed method ignoring comment information -Tie ignores social tie information Fear images have similar visual features with Sadness and Anger. Homophily suggests that friends with similar interests tend to have similar understanding of disgust

48 Image Interpretations Our model demonstrates how visual features distribute over different emotions. (e.g., images representing Happiness have high saturation) Positive emotions attract more response (+4.4 times) and more easily to influence others compared with negative emotions.

49 Potential Directions From sentiment to emotion analysis? Add social theories into emotion analysis? Sentiment/emotion analysis for “Social Good”?

50 Summary From text sentiment to user sentiment From user sentiment to network sentiment From sentiment analysis to emotion analysis From network interaction to social influence

51 Related Publications Jie Tang, Jimeng Sun, Chi Wang, and Zi Yang. Social Influence Analysis in Large-scale Networks. In KDD'09, pages , Chenhao Tan, Lillian Lee, Jie Tang, Long Jiang, Ming Zhou, and Ping Li. User-level sentiment analysis incorporating social networks. In KDD’11, pages 1397–1405, Jie Tang, Sen Wu, and Jimeng Sun. Confluence: Conformity Influence in Large Social Networks. In KDD'13, pages , Yang Yang, Jia Jia, Shumei Zhang, Boya Wu, Qicong Chen, Juanzi Li, Chunxiao Xing, and Jie Tang. How Do Your Friends on Social Media Disclose Your Emotions? In AAAI'14. pp Jie Tang, Yuan Zhang, Jimeng Sun, Jinghai Rao, Wenjing Yu, Yiran Chen, and ACM Fong. Quantitative Study of Individual Emotional States in Social Networks. IEEE Transactions on Affective Computing (TAC), 2012, Volume 3, Issue 2, Pages (Selected as the Spotlight Paper) Xiaohui Wang, Jia Jia, Jie Tang, Boya Wu, Lianhong Cai, and Lexing Xie. Modeling Emotion Influence in Image Social Networks. IEEE Transactions on Affective Computing (TAC), Volume 6, Issue 3, 2015, Pages Yuan Zhang, Jie Tang, Jimeng Sun, Yiran Chen, and Jinghai Rao. MoodCast: Emotion Prediction via Dynamic Continuous Factor Graph Model. In ICDM’10. pp Jia Jia, Sen Wu, Xiaohui Wang, Peiyun Hu, Lianhong Cai, and Jie Tang. Can We Understand van Gogh’s Mood? Learning to Infer Affects from Images in Social Networks. In ACM MM, pages , Xiaohui Wang, Jia Jia, Peiyun Hu, Sen Wu, Lianhong Cai, and Jie Tang. Understanding the Emotional Impact of Images. (Grand Challenge) In ACM MM. pp (Grand Challenge 2nd Prize Award)

52 Thank you ! Collaborators: Lillian Lee, Chenhao Tan (Cornell) Jinghai Rao (Nokia) Jimeng Sun (IBM/GIT) Ming Zhou, Long Jiang (Microsoft) Yuan Zhang, Jia Jia, Yang Yang, Boya Wu, Xiaohui Wang (THU) Jie Tang, KEG, Tsinghua U, Download all data & Codes,