Opinion Analysis
Sudeshna Sarkar, IIT Kharagpur



Fine-grained opinion analysis

Basic components
- Opinion Holder: a person or organization that holds a specific opinion on a particular object.
- Opinion Target / Object: the entity on which an opinion is expressed; it may have Aspects / Features.
- Opinion Expression: a view, attitude, or appraisal of an object from an opinion holder.

Opinion Analysis
Pattern-based or proximity-based approach: use predefined extraction patterns, e.g., lexico-syntactic patterns (Riloff & Wiebe 2003):
- way with <np>: "... to ever let China use force to have its way with ..."
- expense of <np>: "at the expense of the world's security and stability"
- underlined <dobj>: "Jiang's subdued tone ... underlined his desire to avoid disputes ..."

Riloff & Wiebe 2003: learning extraction patterns for subjective expressions
Observation: subjectivity comes in many (low-frequency) forms, so it is better to have more data; bootstrapping produces cheap data.
- High-precision classifiers label sentences as subjective or objective.
- An extraction pattern learner gathers patterns biased towards subjective texts.
- Learned patterns are fed back into the high-precision classifier.
Extraction patterns are the kinds of things we saw earlier ("have X's way with"); they capture role relationships of nouns and verbs with entities referred to in the text.

Subjective expressions as IE patterns

PATTERN                  FREQ  P(Subj | Pattern)
<subj> asked              128  0.63
<subj> was asked           11  1.00
<subj> was expected        45  0.42
was expected from <np>      5  1.00
<subj> put                187  0.67
<subj> put end             10  0.90
<subj> talk                28  0.71
talk of <np>               10  0.90
<subj> is talk              5  1.00
<subj> is fact             38  1.00
fact is <dobj>             12  1.00
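As a rough illustration of how such patterns are selected (hypothetical counts and thresholds; not the authors' code), a Riloff & Wiebe-style learner keeps patterns whose frequency and conditional probability of subjectivity both clear a threshold:

```python
# Sketch of Riloff & Wiebe-style pattern selection (illustrative only).
# Each pattern maps to (freq, n_subjective): total matches and matches
# inside sentences the high-precision classifier labeled subjective.
pattern_counts = {
    "<subj> asked": (128, 81),
    "<subj> was asked": (11, 11),
    "<subj> was expected": (45, 19),
    "fact is <dobj>": (12, 12),
}

FREQ_MIN = 5      # hypothetical threshold on raw frequency
PROB_MIN = 0.90   # hypothetical threshold on P(subjective | pattern)

def select_patterns(counts):
    """Keep patterns that are both frequent and strongly subjective."""
    selected = []
    for pattern, (freq, n_subj) in counts.items():
        p_subj = n_subj / freq
        if freq >= FREQ_MIN and p_subj >= PROB_MIN:
            selected.append((pattern, freq, p_subj))
    return selected

print(select_patterns(pattern_counts))
# -> [('<subj> was asked', 11, 1.0), ('fact is <dobj>', 12, 1.0)]
```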

Opinion Analysis: some publications
- Extracting Opinions, Opinion Holders, and Topics Expressed in Online News Media Text [Kim and Hovy 2006]: exploits the semantic structure of a sentence, anchored to an opinion-bearing verb or adjective.
- Fine-grained opinion topic and polarity identification [Cheng & Xu]: joint extraction of entities and relations for opinion recognition.
- Opinion Mining from Web Documents, Nozomi Kobayashi, doctoral dissertation, Nara Institute of Science & Technology.

Graphical models for fine-grained opinion analysis
Obtain a corpus with sentences annotated with target, holder, opinion, aspect, etc., then train a graphical model: MaxEnt, CRF, semi-CRF, etc. (A sketch follows.)
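A minimal sketch of such a setup, assuming the third-party sklearn-crfsuite package and a single toy BIO-annotated sentence (the feature set, labels, and data are illustrative assumptions, not from the slides):

```python
# Sketch: CRF for fine-grained opinion tagging with BIO labels for
# holder/target/expression. Assumes: pip install sklearn-crfsuite
import sklearn_crfsuite

def word_features(tokens, i):
    # Minimal feature set; a real system would add POS tags, parse features, etc.
    return {
        "word": tokens[i].lower(),
        "is_title": tokens[i].istitle(),
        "prev": tokens[i - 1].lower() if i > 0 else "<s>",
        "next": tokens[i + 1].lower() if i < len(tokens) - 1 else "</s>",
    }

# One toy training sentence: "John praised the camera lens"
tokens = ["John", "praised", "the", "camera", "lens"]
labels = ["B-HOLDER", "B-EXPR", "O", "B-TARGET", "I-TARGET"]

X, y = [[word_features(tokens, i) for i in range(len(tokens))]], [labels]

crf = sklearn_crfsuite.CRF(algorithm="lbfgs", max_iterations=50)
crf.fit(X, y)
print(crf.predict(X))  # per-token tags for each sentence
```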

Opinion holder identification [Kim and Hovy 2005, 2006]
Using parses: train a Maximum Entropy ranker to identify sentiment holders based on parse features.
[Kim and Hovy 2006]: collect and label opinion words; find opinion-related frames (FrameNet); use semantic role labelling to identify fillers for the frames, based on manual mapping tables.

Subjectivity analysis
Subjective vs. objective language: presence of opinion phrases; word sense disambiguation (WSD).
Joint classification at the sentence and document levels improves sentence-level classification significantly (McDonald et al. 2007).

The role of linguistic analysis
Polarity classification [Bayen et al. 1996], [Gamon 2004]: syntactic analysis features help in the noisy customer-feedback domain.
Holder and target identification: patterns, semantic role labelling, semantic resources for synonymy and antonymy (FrameNet, WordNet).
Opinion strength: syntactic analysis.

Feature-based opinion mining and summarization
Focus on reviews (easier to work in a concrete domain!).
Objective: find what reviewers (opinion holders) liked and disliked, i.e., the product features and the opinions on those features.
Since the number of reviews on an object can be large, an opinion summary should be produced, preferably a structured one that is easy to visualize and compare. This is analogous to, but different from, multi-document summarization.

The tasks: three tasks in the model
Task 1: extract the object features that have been commented on in each review.
Task 2: determine whether the opinions on the features are positive, negative, or neutral.
Task 3: group feature synonyms and produce a summary.
Task 2 may not be needed depending on the format of the reviews.

Target / feature identification
Product review: what is the opinion about?
- Digital camera: lens, resolution, stability, battery life, ease of use, etc.
- Hotel: room comfort, service, noise, cleanliness, budget, etc.
How to obtain targets/features?
- A manual ontology.
- "Meronymy patterns": X contains Y, X has Y, etc. A high PMI between a noun phrase and a pattern indicates a feature candidate (see the sketch below).
- Supervised learning to discover product features.
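A rough sketch of the PMI scoring idea, with made-up co-occurrence counts standing in for corpus or Web hit counts (the counts, total, and phrases are all hypothetical):

```python
import math

def pmi(count_joint, count_phrase, count_pattern, total):
    """Pointwise mutual information between a noun phrase and a pattern."""
    p_joint = count_joint / total
    p_phrase = count_phrase / total
    p_pattern = count_pattern / total
    return math.log(p_joint / (p_phrase * p_pattern))

# Hypothetical counts: how often a phrase co-occurs with "camera has ...".
TOTAL = 1_000_000
print(pmi(count_joint=500, count_phrase=2_000, count_pattern=10_000, total=TOTAL))
# "lens": high (positive) PMI -> likely feature
print(pmi(count_joint=5, count_phrase=2_000, count_pattern=10_000, total=TOTAL))
# "holiday": low (negative) PMI -> unlikely feature
```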

Different review formats
Format 1 (Pros, Cons and detailed review): the reviewer is asked to describe Pros and Cons separately and also write a detailed review. Epinions.com uses this format.
Format 2 (Pros and Cons): the reviewer is asked to describe Pros and Cons separately. C|net.com used to use this format.
Format 3 (free format): the reviewer writes freely, i.e., no separation of Pros and Cons. Amazon.com uses this format.

[Slide shows example reviews in Formats 1, 2 and 3. The visible free-format example:]
GREAT Camera., Jun 3, 2004
Reviewer: jprice174 from Atlanta, Ga.
I did a lot of research last year before I bought this camera... It kinda hurt to leave behind my beloved nikon 35mm SLR, but I was going to Italy, and I needed something smaller, and digital. The pictures coming out of this camera are amazing. The 'auto' feature takes great pictures most of the time. And with digital, you're not wasting film if the picture doesn't come out.

Feature-based Summary (Hu and Liu, KDD-04)
Input: the free-format camera review above.
Feature-based summary:
Feature 1: picture
  Positive: 12
    "The pictures coming out of this camera are amazing."
    "Overall this is a good camera with a really good picture clarity."
    ...
  Negative: 2
    "The pictures come out hazy if your hands shake even for a moment during the entire process of taking a picture."
    "Focusing on a display rack about 20 feet away in a brightly lit room during day time, pictures produced by this camera were blurry and in a shade of orange."
Feature 2: battery life
...

Visual summarization & comparison
[Slide shows two bar charts: (1) a summary of reviews of Digital camera 1 with positive/negative counts per feature (Picture, Battery, Size, Weight, Zoom); (2) a comparison of the reviews of Digital camera 1 and Digital camera 2 along the same features.]

Extraction using label sequential rules
Label sequential rules (LSRs) are a special kind of sequential pattern discovered from sequences; LSR mining is supervised (Liu's Web mining book, 2006).
The training data set is a set of sequences. E.g., "Included memory is stingy" is turned into a sequence with POS tags:
{included, VB}{memory, NN}{is, VB}{stingy, JJ}
and then, with the feature labeled:
{included, VB}{$feature, NN}{is, VB}{stingy, JJ}

Using LSRs for extraction
Based on a set of training sequences, we can mine label sequential rules, e.g.,
{easy, JJ}{to}{*, VB} -> {easy, JJ}{to}{$feature, VB} [sup = 10%, conf = 95%]
Feature extraction: only the right-hand side of each rule is needed. The word in a new review's sentence segment that matches $feature is extracted. Conflict resolution is also needed when multiple rules are applicable (see the sketch below).
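A toy sketch of applying an LSR right-hand side to a tagged segment; the rule encoding and the pick-the-highest-confidence conflict resolution are simplifying assumptions:

```python
# Sketch: match an LSR right-hand side against a POS-tagged word sequence.
# A sequence element is (word, tag); a rule element is (word_or_slot, tag),
# where "*" matches any word, a None tag matches any tag, and "$feature"
# marks the slot whose matching word is extracted.

def match_rule(rhs, sequence):
    """Return the word filling $feature if the rule matches contiguously."""
    n, m = len(sequence), len(rhs)
    for start in range(n - m + 1):
        extracted = None
        for (rw, rt), (sw, st) in zip(rhs, sequence[start:start + m]):
            if rt is not None and rt != st:
                break
            if rw not in ("*", "$feature") and rw != sw:
                break
            if rw == "$feature":
                extracted = sw
        else:  # every element matched
            if extracted:
                return extracted
    return None

# Rules with confidences; conflict resolution = highest confidence wins.
rules = [([("easy", "JJ"), ("to", None), ("$feature", "VB")], 0.95)]
sentence = [("easy", "JJ"), ("to", "TO"), ("use", "VB")]

for rhs, conf in sorted(rules, key=lambda r: -r[1]):
    feature = match_rule(rhs, sentence)
    if feature:
        print(feature, conf)  # -> use 0.95
        break
```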

Extraction of features in formats 2 and 3
Reviews of these formats are usually complete sentences, e.g.:
"The pictures are very clear." Explicit feature: picture.
"It is small enough to fit easily in a coat pocket or purse." Implicit feature: size.
Extraction uses a frequency-based approach, covering frequent features and infrequent features.

Frequency-based approach (Hu and Liu, KDD-04)
Frequent features are those talked about by many reviewers; they are found with sequential pattern mining.
Why a frequency-based approach? Different reviewers tell different (irrelevant) stories, but when product features are discussed, the words they use converge; those are the main features. Sequential pattern mining finds such frequent phrases. Froogle has an implementation of the approach (with no POS restriction). (A toy sketch follows.)
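A much-simplified stand-in for sequential pattern mining: count candidate phrases across reviews and keep those above a minimum support (the corpus, candidate list, and threshold are all illustrative):

```python
from collections import Counter

# Toy corpus: one string per review sentence.
reviews = [
    "the picture quality is amazing",
    "picture quality could be better in low light",
    "battery life is great",
    "great battery life and nice picture quality",
    "the zoom is weak",
]

# Pretend candidate noun phrases are known; a real system generates them
# from POS-tagged n-grams via sequential pattern mining.
candidates = ["picture quality", "battery life", "zoom", "low light"]

MIN_SUPPORT = 2  # hypothetical: a feature must appear in >= 2 reviews

counts = Counter()
for review in reviews:
    for phrase in candidates:
        if phrase in review:
            counts[phrase] += 1

frequent_features = [p for p, c in counts.items() if c >= MIN_SUPPORT]
print(frequent_features)  # -> ['picture quality', 'battery life']
```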

Using the part-of relationship and the Web (Popescu and Etzioni, EMNLP-05)
Improves on (Hu and Liu, KDD-04) by removing frequent noun phrases that may not be features: better precision (with a small drop in recall).
It identifies the part-of relationship: each noun phrase is given a pointwise mutual information score between the phrase and the part discriminators associated with the product class, e.g., the scanner class. The part discriminators for the scanner class are "of scanner", "scanner has", "scanner comes with", etc., which are used to find components or parts of scanners by searching the Web: the KnowItAll approach (Etzioni et al., WWW-04).

Infrequent feature extraction
How do we find the infrequent features? Observation: the same opinion word can be used to describe different features and objects.
"The pictures are absolutely amazing."
"The software that comes with it is amazing."
So opinion words found alongside frequent features can be used to locate infrequent features (a sketch follows).
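A toy sketch of that observation: opinion words learned near frequent features are used, in sentences with no known feature, to pull out the nearest noun-like token as an infrequent feature candidate (the stopword list and heuristic are assumptions):

```python
# Sketch: use known opinion words to surface infrequent features.
frequent_features = {"picture", "pictures", "battery"}
opinion_words = {"amazing", "great"}   # e.g., learned next to frequent features
STOPWORDS = {"is", "are", "the", "that", "it", "with", "comes", "absolutely"}

def infrequent_feature(tokens):
    """If a sentence contains an opinion word but no frequent feature,
    return the nearest preceding non-stopword as a candidate feature."""
    if frequent_features & set(tokens):
        return None
    for i, tok in enumerate(tokens):
        if tok in opinion_words:
            for j in range(i - 1, -1, -1):
                if tokens[j] not in STOPWORDS:
                    return tokens[j]
    return None

print(infrequent_feature("the pictures are absolutely amazing".split()))
# -> None (covered by the frequent feature "pictures")
print(infrequent_feature("the software that comes with it is amazing".split()))
# -> 'software'
```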

Identifying feature synonyms
Liu et al. (WWW-05) made an attempt using only WordNet.
Carenini et al. (K-CAP-05) proposed a more sophisticated method based on several similarity metrics, but it requires a taxonomy of features to be given. The system merges each discovered feature into a feature node in the taxonomy. The similarity metrics are defined on string similarity, synonyms, and other distances measured using WordNet. Experimental results on digital camera and DVD reviews are promising. Many ideas from information integration are applicable.

Identifying opinion orientation on a feature
For each feature, we identify the sentiment or opinion orientation expressed by the reviewer. We work at the sentence level, but must account for the fact that a sentence may contain multiple features, and different features may carry different opinions. E.g., "The battery life and picture quality are great (+), but the view finder is small (-)."
Almost all approaches make use of opinion words and phrases. But note: some opinion words have context-independent orientations (e.g., "great"), while others have context-dependent orientations (e.g., "small"). There are many ways to use them.

Aggregation of opinion words (Hu and Liu, KDD-04; Ding and Liu, SIGIR-07)
Input: a pair (f, s), where f is a feature and s is a sentence containing f.
Output: whether the opinion on f in s is positive, negative, or neutral.
Two steps:
- Step 1: split the sentence if needed on BUT words (but, except that, etc.).
- Step 2: work on the segment sf containing f. Let the set of opinion words in sf be w1, ..., wn. Sum their orientations (1, -1, 0) and assign the resulting orientation to (f, s).
In (Ding and Liu, SIGIR-07), step 2 is changed to a distance-weighted sum, with better results:
score(f, s) = Σi wi.o / d(wi, f)
where wi.o is the opinion orientation of wi and d(wi, f) is the distance from f to wi.
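A compact sketch of both steps with a tiny hand-built lexicon and token-index distance (the lexicon, tokenization, and BUT-word list are assumptions):

```python
# Sketch: distance-weighted opinion aggregation for a feature in a sentence.
opinion_lexicon = {"great": 1, "amazing": 1, "small": -1, "hazy": -1}
BUT_WORDS = {"but", "except"}

def segment_with_feature(tokens, feature):
    """Step 1: split on BUT words and keep the segment containing the feature."""
    segment = []
    for tok in tokens:
        if tok in BUT_WORDS:
            if feature in segment:
                return segment
            segment = []
        else:
            segment.append(tok)
    return segment

def orientation(sentence, feature):
    """Step 2: score(f, s) = sum_i wi.o / d(wi, f) over the segment."""
    tokens = sentence.lower().replace(",", "").split()
    seg = segment_with_feature(tokens, feature)
    f_pos = seg.index(feature)
    score = sum(opinion_lexicon[tok] / abs(i - f_pos)
                for i, tok in enumerate(seg)
                if tok in opinion_lexicon and i != f_pos)
    return "positive" if score > 0 else "negative" if score < 0 else "neutral"

print(orientation("The battery life and picture quality are great", "battery"))
# -> 'positive'
print(orientation("The pictures are great, but the viewfinder is small", "viewfinder"))
# -> 'negative'
```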

Context-dependent opinions
Popescu and Etzioni (2005) used the connective constraints of (Hatzivassiloglou and McKeown, ACL-97), some additional constraints (e.g., morphological relationships, synonymy and antonymy), and relaxation labeling to propagate opinion orientations to words and features.
Ding and Liu (2007) used connective constraints at both the intra-sentence and inter-sentence levels, plus additional constraints for, e.g., TOO, BUT, and NEGATION, to directly assign opinions to (f, s), with good results (F-score > 0.85).

Extraction of Comparatives
Comparative sentence mining:
- Identify comparative sentences.
- Extract comparative relations from them.

Linguistic Perspective
Comparative sentences use morphemes like more/most, -er/-est, less/least; "than" and "as" mark the standard against which an entire entity is compared.
Limitations:
- Limited coverage: "In market capital, Intel is way ahead of AMD."
- Non-comparatives with comparative words: "In the context of speed, faster means better."

Types of Comparatives: gradable
- Non-equal gradable: relations of the type greater or less than. Keywords like better, ahead, beats, etc. "Optics of camera A is better than that of camera B."
- Equative: relations of the type equal to. Keywords and phrases like equal to, same as, both, all. "Camera A and camera B both come in 7MP."
- Superlative: relations of the type greater or less than all others. Keywords and phrases like best, most, better than all. "Camera A is the cheapest camera available in the market."

Types of Comparatives: non-gradable
Non-gradable: sentences that compare features of two or more objects but do not grade them. They imply:
- Object A is similar to or different from object B with regard to some features.
- Object A has feature F1, object B has feature F2.
- Object A has feature F, but object B does not.

Comparative Relation: gradable
Definition: a gradable comparative relation captures the essence of a gradable comparative sentence and is represented as:
(relationWord, features, entityS1, entityS2, type)
- relationWord: the keyword used to express the comparative relation in the sentence.
- features: the set of features being compared.
- entityS1 and entityS2: the sets of entities being compared.
- type: non-equal gradable, equative, or superlative.

Examples: comparative relations
"car X has better controls than car Y"
(relationWord = better, features = {controls}, entityS1 = {carX}, entityS2 = {carY}, type = non-equal gradable)
"car X and car Y have equal mileage"
(relationWord = equal, features = {mileage}, entityS1 = {carX}, entityS2 = {carY}, type = equative)
"car X is cheaper than both car Y and car Z"
(relationWord = cheaper, features = null, entityS1 = {carX}, entityS2 = {carY, carZ}, type = non-equal gradable)
"company X produces a variety of cars, but still best cars come from company Y"
(relationWord = best, features = {cars}, entityS1 = {companyY}, entityS2 = {companyX}, type = superlative)
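For concreteness, the relation tuple could be modeled as a small data structure (this representation is ours, not from the paper):

```python
from dataclasses import dataclass, field

@dataclass
class ComparativeRelation:
    """(relationWord, features, entityS1, entityS2, type) from the slide."""
    relation_word: str
    features: set = field(default_factory=set)
    entity_s1: set = field(default_factory=set)
    entity_s2: set = field(default_factory=set)
    type: str = "non-equal gradable"  # or "equative", "superlative"

# "car X has better controls than car Y"
rel = ComparativeRelation(
    relation_word="better",
    features={"controls"},
    entity_s1={"carX"},
    entity_s2={"carY"},
    type="non-equal gradable",
)
print(rel)
```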

Tasks
Given a collection of evaluative texts:
Task 1: identify comparative sentences.
Task 2: categorize the different types of comparative sentences.
Task 3: extract comparative relations from the sentences.

Identifying comparative sentences: keyword strategy
Observation: it is easy to find a small set of keywords that covers almost all comparative sentences, i.e., with very high recall and reasonable precision.
A list of 83 keywords used in comparative sentences was compiled by (Jindal and Liu, SIGIR-06), including:
- Words with the POS tags JJR, JJS, RBR, RBS. The POS tags are used as keywords instead of the individual words, with the exceptions more, less, most, least.
- Other indicative words like beat, exceed, ahead, etc.
- Phrases like "in the lead", "on par with", etc. (A detection sketch follows.)
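A minimal sketch of the keyword strategy, assuming NLTK's POS tagger (the keyword subset below is illustrative, not the 83-keyword list):

```python
# Sketch: flag candidate comparative sentences via POS tags and keywords.
# Assumes: pip install nltk, plus the tagger data download below
# (data package names vary across NLTK versions).
import nltk
nltk.download("averaged_perceptron_tagger", quiet=True)

COMPARATIVE_TAGS = {"JJR", "JJS", "RBR", "RBS"}
EXTRA_KEYWORDS = {"beat", "beats", "exceed", "exceeds", "ahead"}
PHRASES = ["in the lead", "on par with"]

def is_candidate(sentence):
    tokens = sentence.lower().rstrip(".").split()
    tagged = nltk.pos_tag(tokens)
    return (any(tag in COMPARATIVE_TAGS for _, tag in tagged)
            or bool(EXTRA_KEYWORDS & set(tokens))
            or any(p in sentence.lower() for p in PHRASES))

print(is_candidate("Optics of camera A is better than that of camera B"))  # True
print(is_candidate("In market capital, Intel is way ahead of AMD."))       # True
print(is_candidate("The camera takes a picture."))                         # False
```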

2-step learning strategy
Step 1: extract sentences that contain at least one keyword (recall = 98%, precision = 32% on their data set of gradables).
Step 2: use a Naïve Bayes classifier to classify sentences into two classes, comparative and non-comparative. Attributes: class sequential rules (CSRs) generated from the sentences of step 1.

Sequence data preparation: use the words within a radius r of a keyword to form a sequence (words are replaced with their POS tags).
CSR generation: use different minimum supports for different keywords; 13 manual rules, which were hard to generate automatically, are added.
Learning: a Naïve Bayes classifier is built using the CSRs and manual rules as attributes. (A data-preparation sketch follows.)
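A toy sketch of the radius-r window preparation (the tagged input and radius are illustrative; tags would come from a POS tagger):

```python
# Sketch: build a POS-tag sequence from a window of radius r around a keyword.
def keyword_window(tagged, keyword, r=3):
    """tagged: list of (word, tag). Returns the tags in the radius-r window,
    with the keyword itself kept as a word."""
    idx = next(i for i, (w, _) in enumerate(tagged) if w == keyword)
    lo, hi = max(0, idx - r), min(len(tagged), idx + r + 1)
    return [w if w == keyword else t for w, t in tagged[lo:hi]]

tagged = [("camera", "NN"), ("A", "NN"), ("has", "VBZ"),
          ("better", "JJR"), ("optics", "NNS")]
print(keyword_window(tagged, "better", r=2))
# -> ['NN', 'VBZ', 'better', 'NNS']
```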

Classifying the different types of comparatives
Classify comparative sentences into the three types: non-equal gradable, equative, and superlative. An SVM learner gives the best result.
The attribute set is the set of keywords: if the sentence contains a particular keyword from the attribute set, the corresponding attribute value is 1, and 0 otherwise.

Extraction of comparative relations
Assumptions: there is only one relation per sentence; entities and features are nominals; only adjectival comparatives are handled (not adverbial comparatives).
Three steps:
1. Sequence data generation.
2. Label sequential rule (LSR) generation.
3. Building a sequential cover/extractor from the LSRs.

Sequence data generation
Label set = {$entityS1, $entityS2, $feature}. The three labels are used as pivots to generate sequences; a radius of 4 gives optimal results.
The following words are also added:
- Distance words: {l1, l2, l3, l4, r1, r2, r3, r4}.
- Special words #start and #end mark the start and the end of a sentence.

Sequence data generation example
The comparative sentence "Canon/NNP has/VBZ better/JJR optics/NNS" has $entityS1 "Canon" and $feature "optics". The generated sequences are:
<{#start}{l1}{$entityS1, NNP}{r1}{has, VBZ}{r2}{better, JJR}{r3}{$feature, NNS}{r4}{#end}>
<{#start}{l4}{$entityS1, NNP}{l3}{has, VBZ}{l2}{better, JJR}{l1}{$feature, NNS}{r1}{#end}>
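A sketch that reproduces the interleaving of distance words in the example above (the encoding details are inferred from the slide, not taken from the paper):

```python
# Sketch: generate a pivot sequence with distance words l1..l4 / r1..r4.
def pivot_sequence(items, pivot_index, radius=4):
    """items: '{word, TAG}' strings including {#start}/{#end} sentinels.
    Left-of-pivot items are followed by their distance word; right-of-pivot
    items are preceded by theirs, matching the slide's example."""
    out = []
    for i, item in enumerate(items):
        d = i - pivot_index
        if d == 0:                      # the pivot itself
            out.append(item)
        elif d < 0 and -d <= radius:    # left of the pivot
            out.extend([item, "{l%d}" % -d])
        elif d > 0 and d <= radius:     # right of the pivot
            out.extend(["{r%d}" % d, item])
        # items beyond the radius are dropped
    return "<" + "".join(out) + ">"

items = ["{#start}", "{$entityS1, NNP}", "{has, VBZ}",
         "{better, JJR}", "{$feature, NNS}", "{#end}"]
print(pivot_sequence(items, 1))  # pivot on $entityS1
print(pivot_sequence(items, 4))  # pivot on $feature
```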

Building a sequential cover from LSRs
Example LSR: {*, NN}{VBZ} -> {$entityS1, NN}{VBZ}
1. Select the LSR with the highest confidence.
2. Replace the matched elements in the sentences that satisfy the rule with the labels in the rule.
3. Recalculate the confidence of each remaining rule based on the data modified in step 2.
4. Repeat steps 1-3 until no rule is left with confidence higher than the minconf value (they used 90%).
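A schematic of the greedy sequential-cover loop on toy data; rule matching is reduced to a prediction function, so this shows the control flow rather than real LSR matching:

```python
# Sketch: greedy sequential cover over LSRs (control flow only).
MINCONF = 0.90  # the slide reports a 90% minconf

# Toy data: (POS tag sequence, gold index of $entityS1).
data = [
    (("NN", "VBZ", "JJR", "NNS"), 0),
    (("NN", "VBZ", "JJ"), 0),
    (("DT", "NN", "VBZ"), 1),
]

# Each rule predicts the $entityS1 position from the tags (None = no match).
rules = [
    ("NN-before-VBZ",
     lambda tags: tags.index("NN") if "NN" in tags and "VBZ" in tags else None),
    ("first-token", lambda tags: 0),
]

def confidence(rule, data):
    _, predict = rule
    applicable = [(predict(t), g) for t, g in data if predict(t) is not None]
    return sum(p == g for p, g in applicable) / len(applicable) if applicable else 0.0

def sequential_cover(rules, data):
    extractors, remaining, candidates = [], list(data), list(rules)
    while candidates and remaining:
        best = max(candidates, key=lambda r: confidence(r, remaining))
        if confidence(best, remaining) < MINCONF:
            break
        extractors.append(best[0])
        # "Replace matched elements": here, drop the sequences the rule covers,
        # then re-score the remaining rules on the modified data.
        remaining = [(t, g) for t, g in remaining if best[1](t) is None]
        candidates.remove(best)
    return extractors

print(sequential_cover(rules, data))  # -> ['NN-before-VBZ']
```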

Experimental Results (Jindal and Liu, AAAI-06)
Identifying gradable comparative sentences: precision = 82%, recall = 81%.
Classification into the three gradable types: SVM gave 96% accuracy.
Extraction of comparative relations: LSR F-score = 72%.

Summary
We studied two types of evaluations:
- Direct opinions: the problem abstraction; sentiment analysis at the document, sentence, and feature levels.
- Comparisons.
These are very hard problems, but very useful. The current techniques are still in their infancy, and industrial applications are coming up.

References
Sentiment Detection and its Applications. Michael Gamon, Microsoft Research, USA. Talk at the summer school on NLP & Text Mining, IIT Kharagpur (http://cse.iitkgp.ac.in/nlpschool).
Bing Liu's tutorials on opinion mining: http://www.cs.uic.edu/~liub/