1 I am here… Lu Chen Kno.e.sis Center Amit Sheth Wright State University advisor of has PhD student is in director of.

Slides:

Advertisements

Similar presentations

Dan Jurafsky Lecture 4: Sarcasm, Alzheimers, +Distributional Semantics Computational Extraction of Social and Interactional Meaning SSLST, Summer 2011.

Advertisements

Farag Saad i-KNOW 2014 Graz- Austria,

Distant Supervision for Emotion Classification in Twitter posts 1/17.

Problem Semi supervised sarcasm identification using SASI

Pollyanna Gonçalves (UFMG, Brazil) Matheus Araújo (UFMG, Brazil) Fabrício Benevenuto (UFMG, Brazil) Meeyoung Cha (KAIST, Korea) Comparing and Combining.

Extract from various presentations: Bing Liu, Aditya Joshi, Aster Data … Sentiment Analysis January 2012.

Sentiment Analysis An Overview of Concepts and Selected Techniques.

A Joint Model of Text and Aspect Ratings for Sentiment Summarization Ivan Titov (University of Illinois) Ryan McDonald (Google Inc.) ACL 2008.

S ENTIMENTAL A NALYSIS O F B LOGS B Y C OMBINING L EXICAL K NOWLEDGE W ITH T EXT C LASSIFICATION. 1 By Prem Melville, Wojciech Gryc, Richard D. Lawrence.

Annotating Topics of Opinions Veselin Stoyanov Claire Cardie.

Joint Sentiment/Topic Model for Sentiment Analysis Chenghua Lin & Yulan He CIKM09.

CIS630 Spring 2013 Lecture 2 Affect analysis in text and speech.

Peiti Li 1, Shan Wu 2, Xiaoli Chen 1 1 Computer Science Dept. 2 Statistics Dept. Columbia University 116th Street and Broadway, New York, NY 10027, USA.

Sentiment Lexicon Creation from Lexical Resources BIS 2011 Bas Heerschop Erasmus School of Economics Erasmus University Rotterdam

OCFS: Optimal Orthogonal Centroid Feature Selection for Text Categorization Jun Yan, Ning Liu, Benyu Zhang, Shuicheng Yan, Zheng Chen, and Weiguo Fan et.

1 A Survey of Affect Recognition Methods: Audio, Visual, and Spontaneous Expressions Zhihong Zeng, Maja Pantic, Glenn I. Roisman, Thomas S. Huang Reported.

Analyzing Sentiment in a Large Set of Web Data while Accounting for Negation AWIC 2011 Bas Heerschop Erasmus School of Economics Erasmus University Rotterdam.

Coletto, Lucchese, Orlando, Perego ELECTORAL PREDICTIONS WITH TWITTER: A MACHINE-LEARNING APPROACH M. Coletto 1,3, C. Lucchese 1, S. Orlando 2, and R.

Models of Influence in Online Social Networks

More than words: Social networks’ text mining for consumer brand sentiments A Case on Text Mining Key words: Sentiment analysis, SNS Mining Opinion Mining,

Opinion mining in social networks Student: Aleksandar Ponjavić 3244/2014 Mentor: Profesor dr Veljko Milutinović.

(ACM KDD 09’) Prem Melville, Wojciech Gryc, Richard D. Lawrence

U & I: Users & Information Lab Sept 2008  Alice Oh 

Introduction to Text and Web Mining. I. Text Mining is part of our lives.

1 A Unified Relevance Model for Opinion Retrieval (CIKM 09’) Xuanjing Huang, W. Bruce Croft Date: 2010/02/08 Speaker: Yu-Wen, Hsu.

1 Emotion Classification Using Massive Examples Extracted from the Web Ryoko Tokuhisa, Kentaro Inui, Yuji Matsumoto Toyota Central R&D Labs/Nara Institute.

1 Statistical NLP: Lecture 9 Word Sense Disambiguation.

Human Learning Asma Marghalani.

Image Classification 영상분류

Multimodal Information Analysis for Emotion Recognition

Bo Pang , Lillian Lee Department of Computer Science

Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.

Extending the Multi- Instance Problem to Model Instance Collaboration Anjali Koppal Advanced Machine Learning December 11, 2007.

Prediction of Influencers from Word Use Chan Shing Hei.

Sentiment Analysis with Incremental Human-in-the-Loop Learning and Lexical Resource Customization Shubhanshu Mishra 1, Jana Diesner 1, Jason Byrne 2, Elizabeth.

Creating Subjective and Objective Sentence Classifier from Unannotated Texts Janyce Wiebe and Ellen Riloff Department of Computer Science University of.

Computational Approaches for Biomarker Discovery SubbaLakshmiswetha Patchamatla.

CSC 594 Topics in AI – Text Mining and Analytics

Measuring Behavioral Trust in Social Networks

Recognizing Stances in Online Debates Unsupervised opinion analysis method for debate-side classification. Mine the web to learn associations that are.

DeepDive Model Dongfang Xu Ph.D student, School of Information, University of Arizona Dec 13, 2015.

Post-Ranking query suggestion by diversifying search Chao Wang.

Subjectivity Recognition on Word Senses via Semi-supervised Mincuts Fangzhong Su and Katja Markert School of Computing, University of Leeds Human Language.

From Words to Senses: A Case Study of Subjectivity Recognition Author: Fangzhong Su & Katja Markert (University of Leeds, UK) Source: COLING 2008 Reporter:

Extracting and Ranking Product Features in Opinion Documents Lei Zhang #, Bing Liu #, Suk Hwan Lim *, Eamonn O’Brien-Strain * # University of Illinois.

Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:

STAR LESSONS: Comprehension Strategies. Comprehension Strategies Reading Process Skills Reading Process Skills Make predictions Make predictions Identify.

2014 Lexicon-Based Sentiment Analysis Using the Most-Mentioned Word Tree Oct 10 th, 2014 Bo-Hyun Kim, Sr. Software Engineer With Lina Chen, Sr. Software.

Sentiment Analysis Using Common- Sense and Context Information Basant Agarwal 1,2, Namita Mittal 2, Pooja Bansal 2, and Sonal Garg 2 1 Department of Computer.

Multi-Class Sentiment Analysis with Clustering and Score Representation Yan Zhu.

Project Deliverable-1 -Prof. Vincent Ng -Girish Ramachandran -Chen Chen -Jitendra Mohanty.

A Document-Level Sentiment Analysis Approach Using Artificial Neural Network and Sentiment Lexicons Yan Zhu.

A Sentiment-Based Approach to Twitter User Recommendation BY AJAY ABDULPUR RAJARAM NIKKAM.

IDENTIFYING GREAT TEACHERS THROUGH THEIR ONLINE PRESENCE Evanthia Faliagka, Maria Rigou, Spiros Sirmakessis.

Automatically Labeled Data Generation for Large Scale Event Extraction

Negative Link Prediction and Its Applications in Online Political Networks Mehmet Yigit Yildirim Mert Ozer Hasan Davulcu.

Lecture: Sentiment Analysis

Sentiment analysis algorithms and applications: A survey

Sentiment Analysis Seminar Social Media Mining University UC3M

Grey Sentiment Analysis

Aspect-Based Sentiment Analysis on the Web using Rhetorical Structure Theory Rowan Hoogervorst1, Erik Essink1, Wouter Jansen1, Max van den Helder1 Kim.

Social Dynamics: Coding Blame and Emotion in Social Media

Summary Presented by : Aishwarya Deep Shukla

Aspect-based sentiment analysis

Proportion of Original Tweets

Quanzeng You, Jiebo Luo, Hailin Jin and Jianchao Yang

Statistical NLP: Lecture 9

An Overview of Concepts and Selected Techniques

Analyzing Influence of Social Media Through Twitter

Presentation transcript:

1 I am here… Lu Chen Kno.e.sis Center Amit Sheth Wright State University advisor of has PhD student is in director of

Extracting What We Think and How We Feel from What We Say in Social Media ---- Subjective Information Extraction Subjective Information Extraction, Lu Chen 2 Lu Chen Kno.e.sis Center Wright State University

Subjectivity refers to the subject and his or her perspective, feelings, beliefs, and desires. in philosophy, the term is usually contrasted with objectivity. [1] Subjective Information Extraction, Lu Chen 3 [1] Block, Ned; Flanagan, Owen J.; & Gzeldere, Gven (Eds.) The Nature of Consciousness: Philosophical Debates. Cambridge, MA: MIT Press. Extraction of subjective information: Extracting structured subjective information from unstructured content Allowing computation to be done on “what people think” and “how people feel”

Directions From coarse-grained to fine-grained –Document level -> sentence level -> expression level –General sentiment -> domain-dependent sentiment -> target-dependent sentiment –Sentiment  Subjective information Sentiment (positive/negative/neutral) -> emotion (happy, sad, angry, surprise, etc.) Other types of subjective information: Intent, suggestion/recommendation, wish/expectation, outlook, viewpoint, etc. From static to dynamic –Our attitude can be changed during social communication. Modeling, detecting, and tracking the change of attitude What leads to the change of attitude? E.g., persuasion campaign Subjective Information Extraction, Lu Chen 4 static dynamic coarse-grained fine-grained subjective information

Subjective Information Extraction, Lu Chen 5 Jan, 2012 Aug, 2011May, 2012 Discovering Fine- grained Sentiment in Suicide Notes Extracting Sentiment Expressions from Twitter Electoral Prediction Understanding and Modeling Emotions with Tweets Progress

Subjective Information Extraction, Lu Chen 6 Extracting a diverse and richer set of sentiment-bearing expressions, including formal and slang words/phrases Assessing the target-dependent polarity of each sentiment expression A novel formulation of assigning polarity to a sentiment expression as a constrained optimization problem over the tweet corpus Extracting Diverse Sentiment Expressions With Target-dependent Polarity from Twitter Lu Chen, Wenbo Wang, Meenakshi Nagarajan, Shaojun Wang, and Amit P. Sheth

Challenges Sentiment expressions in tweets can be very diverse. Subjective Information Extraction, Lu Chen 7 Quantitative Study of 3000 Tweets: Distributions of N-grams and Part-of-speech of the Sentiment Expressions

Challenges The polarity of a sentiment expression is sensitive to its target. Subjective Information Extraction, Lu Chen 8 predictable predictable movie predictable stock long long river long battery life long time for downloading

Approach Subjective Information Extraction, Lu Chen 9 Extracting Candidate Expressions Identifying Inter-Expression Relations Assessing Target-dependent Polarity

Extracting Candidate Expressions Root word: a word that is considered sentiment-bearing in general sense. Collecting root words from –General-purpose sentiment lexicons: MPQA, General Inquirer, and SentiWordNet –Slang dictionary: Urban Dictionary For each tweet, selecting the “on-target” root words, and extracting all the n-grams that contain at least one selected root word as candidates Subjective Information Extraction, Lu Chen 10

Identifying Inter-Expression Relations Connecting the candidate expressions via two types of inter- expression relations – consistency relation and inconsistency relation Basic ideas: –A sentiment expression is inconsistent with its negation; two sentiment expressions linked by contrasting conjunctions are likely to be inconsistent. –Two adjacent expressions are consistent if they do not overlap, and there is no extra negation applied to them or no contrasting conjunction connecting them. Subjective Information Extraction, Lu Chen 11

An Example 1.I saw The Avengers yesterday evening. It was long but it was very good! 2.I do enjoy The Avengers, but it's both overrated and problematic. 3.Saw the avengers last night. Mad overrated. Cheesy lines and horrible writing. Very predictable. 4.The avengers was good but the plot was just simple minded and predictable. 5.The Avengers was good. I was not disappointed. Subjective Information Extraction, Lu Chen 12

Assessing Target-dependent Polarity For each candidate expression, –P-Probability – the probability that indicates positive sentiment –N-Probability – the probability that indicates negative sentiment For each pair of candidate expressions and, –Consistency probability – the probability that and have the same polarity: –Inconsistency probability – the probability that and have different polarities: Subjective Information Extraction, Lu Chen 13

An Optimization Model We want the consistency and inconsistency probabilities derived from the the P-Probabilities and N-Probabilities of the candidates will be closest to their expectations suggested by the relation networks. Objective Function: Subjective Information Extraction, Lu Chen 14 where and are the weights of the edges (the frequency of the relations) between and in the consistency and inconsistency relation networks, and n is the total number of candidate expressions.

The Example Subjective Information Extraction, Lu Chen 15

Evaluation Datasets: –168,005 tweets about movies –258,655 tweets about persons Gold standard: –1,500 tweets labeled with sentiment expressions and overall polarities for the movie targets –1,500 tweets labeled with sentiment expressions and overall polarities for the person targets Baseline methods: –MPQA, GI, SWN: For each extracted root word regarding the target, simply look up its polarity in MPQA, General Inquirer and SentiWordNet, respectively. –PROP: a propagation approach proposed by Qiu et al. (2009) –COM-const: Assign 0.5 to all the candidates as their initial P-Probabilities. –COM-gelex: Initialize the candidates’ polarities according to the root word set. Subjective Information Extraction, Lu Chen 16 Reference: Qiu, G.; Liu, B.; Bu, J.; and Chen, C Expanding domain sentiment lexicon through double propagation. In Proc. of IJCAI.

Subjective Information Extraction, Lu Chen 17

Subjective Information Extraction, Lu Chen 18

Application Subjective Information Extraction, Lu Chen 19

Subjective Information Extraction, Lu Chen 20 Relevance of User Groups Based on Demographics and Participation to Social Media Based Prediction A Case Study of 2012 U.S. Republican Presidential Primaries Lu Chen, Wenbo Wang, and Amit P. Sheth Existing studies on predicting election result are under the assumption that all the users should be treated equally. How could different groups of users be different in predicting election results? 1. Providing a detailed analysis of the social media users on different dimensions 2. Estimating the “vote” of each user by analyzing his/her tweets, and predicted the results based on “vote-counting” 3. Examining the predictive power of different user groups in predicting the results of Super Tuesday races in 10 states

User Categorization Subjective Information Extraction, Lu Chen 21 Engagement Degree Tweet Mode Content Type Political Preference Location

Electoral Prediction with Different User Groups Subjective Information Extraction, Lu Chen 22 Revealing the challenge of identifying the vote intent of “silent majority” Retweets may not necessarily reflect users' attitude. Prediction of user’s vote based on more opinion tweets is not necessarily more accurate than the prediction using more information tweets The right-leaning user group provides the most accurate prediction result. In the best case (56-day time window), it correctly predict the winners in 8 out of 10 states with an average prediction error of 0.1. To some extent, it demonstrates the importance of identifying likely voters in electoral prediction.

Emotion Discovering Fine-grained Sentiment in Suicide Notes: Classify each sentence from suicide notes into 15 emotional categories, e.g., love, pride, guilt, blame, hopelessness, etc. Emotion Identification from Twitter Data: 7 emotion categories, including joy, sadness, anger, lover, fear, thankfulness, and surprise –Can we automatically create a large emotion dataset with high quality labels from Twitter? How? –What features can effectively improve the performance of supervised machine learning algorithms? –How much performance will be gained by increasing the size of the training data? –Can the system developed on Twitter data be directly applied to identify emotions from other datasets? Subjective Information Extraction, Lu Chen 23

What’s next? Subjective Information Extraction, Lu Chen 24 static dynamic coarse-grained fine-grained subjective information Detecting the change of attitude during persuasive communication Discriminating other types of subjective information from sentiment, e.g., wish, intent

Thank you ! Subjective Information Extraction, Lu Chen 25