
Creating Subjective and Objective Sentence Classifiers from Unannotated Texts Ellen Riloff University of Utah (Joint work with Janyce Wiebe at the University of Pittsburgh)

What is Subjectivity?

Subjective language includes opinions, rants, allegations, accusations, suspicions, and speculation. Distinguishing factual information from subjective information could benefit many applications, including:
– information extraction
– question answering
– summarization
– spam filtering

Previous Work on Subjectivity Classification

Document-level subjectivity classification (e.g., [Turney 2002; Pang et al. 2002; Spertus 1997]). But most documents contain both subjective and objective sentences: [Wiebe et al. 01] reported that 44% of sentences in their news corpus were subjective!

Sentence-level subjectivity classification [Dave et al. 2003; Yu et al. 2003; Riloff, Wiebe, & Wilson 2003].

Goals of our research
– Create classifiers that label sentences as subjective or objective.
– Learn subjectivity and objectivity clues from unannotated corpora.
– Use information extraction techniques to learn subjective nouns.
– Use information extraction techniques to learn subjective and objective patterns.

Outline of Talk
– Learning subjective nouns with extraction patterns
– Automatically generating training data with high-precision classifiers
– Learning subjective and objective extraction patterns
– Naïve Bayes classification and self-training

Information Extraction

Information extraction (IE) systems identify facts related to a domain of interest. Extraction patterns are lexico-syntactic expressions that identify the role of an object, for example: <x> was killed, assassinated <x>, murder of <x>.

Learning Subjective Nouns

Goal: learn subjective nouns from unannotated texts.
Method: apply IE-based bootstrapping algorithms that were designed to learn semantic categories.
Hypothesis: extraction patterns can identify subjective contexts that co-occur with subjective nouns.
Example: "expressed <x>" extracts concern, hope, support.

Extraction Examples

expressed <x>: condolences, hope, grief, views, worries
indicative of <x>: compromise, desire, thinking
inject <x>: vitality, hatred
reaffirmed <x>: resolve, position, commitment
voiced <x>: outrage, support, skepticism, opposition, gratitude, indignation
show of <x>: support, strength, goodwill, solidarity
<x> was shared: anxiety, view, niceties, feeling

Meta-Bootstrapping [Riloff & Jones 99]

From unannotated texts, each cycle selects the best extraction pattern (e.g., expressed <x>) and adds its extractions, nouns such as hope, grief, joy, concern, and worries, to the growing lexicon; later cycles contribute further words (e.g., happiness, relief, condolences).
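A minimal sketch of one such cycle, assuming `patterns` maps each extraction pattern to the set of nouns it extracts and `score` stands in for the RlogF-style pattern ranking shown later in this talk; the names are illustrative, not the original implementation.

```python
# Illustrative sketch of one meta-bootstrapping cycle: pick the single best
# extraction pattern and add all of its extracted nouns to the lexicon.
# `score` stands in for the pattern-ranking metric described later.

def meta_bootstrap_cycle(patterns, lexicon, used, score):
    candidates = [p for p in patterns if p not in used]
    best = max(candidates, key=lambda p: score(p, patterns[p], lexicon))
    lexicon.update(patterns[best])   # e.g., "expressed <x>" adds hope, grief, ...
    used.add(best)
    return lexicon, used
```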

Basilisk [Thelen & Riloff 02]

Starting from a corpus and seed words, Basilisk generates extraction patterns and their extractions. The best patterns enter a Pattern Pool, their extractions enter a Candidate Word Pool, and the 5 best candidate words are added to the semantic lexicon before the cycle repeats (a code sketch follows below).
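A minimal skeleton of that loop under the same assumptions; `score_pattern` and `score_word` stand in for the RlogF and AvgLog functions defined later in this talk, and the pool size is an illustrative choice.

```python
# Illustrative skeleton of the Basilisk loop. `patterns` maps each extraction
# pattern to the set of nouns it extracts; `score_pattern` and `score_word`
# stand in for the RlogF and AvgLog scoring functions defined later.

def basilisk(patterns, seed_words, score_pattern, score_word,
             cycles=400, pool_size=20, words_per_cycle=5):
    lexicon = set(seed_words)
    for _ in range(cycles):
        # Put the best-scoring patterns into the Pattern Pool.
        pool = sorted(patterns,
                      key=lambda p: score_pattern(p, patterns[p], lexicon),
                      reverse=True)[:pool_size]
        # The Candidate Word Pool holds everything those patterns extract
        # that is not already in the lexicon.
        candidates = {w for p in pool for w in patterns[p]} - lexicon
        # Add the 5 best candidate words per cycle and repeat.
        lexicon.update(sorted(candidates,
                              key=lambda w: score_word(w, patterns, lexicon),
                              reverse=True)[:words_per_cycle])
    return lexicon
```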

Subjective Seed Words

cowardice, embarrassment, hatred, outrage, crap, fool, hell, slander, delight, gloom, hypocrisy, sigh, disdain, grievance, love, twit, dismay, happiness, nonsense, virtue

Subjective Noun Results
– Bootstrapping corpus: 950 unannotated FBIS documents (English-language foreign news).
– We ran each bootstrapping algorithm for 400 cycles, generating ~2000 words.
– We manually reviewed the words and labeled them as strongly subjective or weakly subjective.
– Together, the two algorithms learned 1052 subjective nouns (454 strong, 598 weak).

Examples of Strong Subjective Nouns

anguish, exploitation, pariah, antagonism, evil, repudiation, apologist, fallacies, revenge, atrocities, genius, rogue, barbarian, goodwill, sanctimonious, belligerence, humiliation, scum, bully, ill-treatment, smokescreen, condemnation, injustice, sympathy, denunciation, innuendo, tyranny, devil, insinuation, venom, diatribe, liar, exaggeration, mockery

Examples of Weak Subjective Nouns

aberration, eyebrows, resistant, allusion, failures, risk, apprehensions, inclination, sincerity, assault, intrigue, slump, beneficiary, liability, spirit, benefit, likelihood, success, blood, peaceful, tolerance, controversy, persistent, trick, credence, plague, trust, distortion, pressure, unity, drama, promise, eternity, rejection

Outline of Talk
– Learning subjective nouns with extraction patterns
– Automatically generating training data with high-precision classifiers
– Learning subjective and objective extraction patterns
– Naïve Bayes classification and self-training

Initial Training Data Creation

Subjective clues and unlabeled texts feed a rule-based subjective sentence classifier and a rule-based objective sentence classifier, which together yield a collection of labeled subjective and objective sentences.

Subjective Clues
– entries from manually developed resources [Levin 93; Ballmer & Brennenstuhl 81]
– FrameNet lemmas with frame element "experiencer" [Baker et al. 98]
– adjectives manually annotated for polarity [Hatzivassiloglou & McKeown 97]
– n-grams learned from corpora [Dave et al. 03; Wiebe et al. 01]
– words distributionally similar to subjective seed words [Wiebe 00]
– subjective nouns learned from extraction pattern bootstrapping [Riloff et al. 03]

Creating High-Precision Rule-Based Classifiers

GOAL: use subjectivity clues from previous research to build high-precision (low-recall) rule-based classifiers.

A sentence is subjective if it contains ≥ 2 strong subjective clues.

A sentence is objective if:
– it contains no strong subjective clues
– the previous and next sentences contain ≤ 1 strong subjective clue combined
– the current, previous, and next sentences together contain ≤ 2 weak subjective clues

(A code sketch of these rules follows below.)
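A sketch of the rules in code, assuming `strong[i]` and `weak[i]` hold the number of strong and weak subjective clue matches in sentence i; this is a rendering of the rules above, not the original implementation.

```python
# Sketch of the high-precision rules above. `strong` and `weak` are lists with
# the number of strong/weak subjective clue matches per sentence; sentences
# matching neither rule are left unlabeled (None).

def classify_sentence(strong, weak, i):
    def at(counts, j):                       # treat out-of-range sentences as 0
        return counts[j] if 0 <= j < len(counts) else 0

    if strong[i] >= 2:                       # >= 2 strong clues -> subjective
        return "subjective"
    if (strong[i] == 0                                        # no strong clues here
            and at(strong, i - 1) + at(strong, i + 1) <= 1    # <= 1 strong nearby
            and at(weak, i - 1) + weak[i] + at(weak, i + 1) <= 2):  # <= 2 weak in window
        return "objective"
    return None
```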

Data Set
– The MPQA Corpus contains 535 FBIS texts that have been manually annotated for subjectivity.
– Our test set consisted of 9,289 sentences from the MPQA corpus.
– We consider a sentence to be subjective if it has at least one private state of strength medium or higher.
– 54.9% of the sentences in our test set are subjective.

Accuracy of Rule-Based Classifiers

[Table: SubjRec / SubjPrec / SubjF for the Subj RBC, and ObjRec / ObjPrec / ObjF for the Obj RBC; the numeric scores were not preserved in this transcript.]

Generated Data
– We applied the rule-based classifiers to 298,809 sentences from (unannotated) FBIS documents.
– 52,918 were labeled subjective.
– 47,528 were labeled objective.
– That gives a training set of over 100,000 labeled sentences!

Outline of Talk Learning subjective nouns with extraction patterns Automatically generating training data with high-precision classifiers Learning subjective and objective extraction patterns Naïve Bayes classification and self-training

Representing Subjective Expressions with Extraction Patterns

Extraction patterns can represent linguistic expressions that are not fixed word sequences:

drove [NP] up the wall
– drove him up the wall
– drove George Bush up the wall
– drove George Herbert Walker Bush up the wall

step on [modifiers] toes
– step on her toes
– step on the mayor's toes
– step on the newly elected mayor's toes

gave [NP] a [modifiers] look
– gave his annoying sister a really really mean look
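For intuition only: even a crude surface approximation shows why fixed word sequences fall short. Real extraction patterns match syntactic roles in a parse, not regular expressions; the sketch below is a deliberately simplified stand-in.

```python
import re

# Toy surface approximation of "drove [NP] up the wall": a real extraction
# pattern matches the verb and its syntactic arguments in a parse, but even
# this regex shows why a fixed n-gram cannot cover variable-length fillers.
drove_up_the_wall = re.compile(r"\bdrove\b(?:\s+\S+){1,6}\s+up the wall")

for s in ["He drove him up the wall.",
          "It drove George Herbert Walker Bush up the wall."]:
    print(bool(drove_up_the_wall.search(s)))   # True for both sentences
```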

The Extraction Pattern Learner
– We used AutoSlog-TS [Riloff 96] to learn extraction patterns.
– AutoSlog-TS needs relevant and irrelevant texts as input.
– Statistics are generated measuring each pattern's association with the relevant texts.
– The subjective sentences were called relevant, and the objective sentences were called irrelevant.

Syntactic Templates and Example Patterns

<subj> passive-vp: <subj> was satisfied
<subj> active-vp: <subj> complained
<subj> active-vp dobj: <subj> dealt blow
<subj> active-vp infinitive: <subj> appears to be
<subj> passive-vp infinitive: <subj> was thought to be
<subj> auxiliary dobj: <subj> has position
active-vp <dobj>: endorsed <dobj>
infinitive <dobj>: to condemn <dobj>
active-vp infinitive <dobj>: get to know <dobj>
passive-vp infinitive <dobj>: was meant to show <dobj>
subject auxiliary <dobj>: fact is <dobj>
passive-vp prep <np>: opinion on <np>
active-vp prep <np>: agrees with <np>
infinitive prep <np>: was worried about <np>
noun prep <np>: to resort to <np>

AutoSlog-TS (Step 1)

The training corpus is split into relevant and irrelevant texts. Each sentence is parsed and the syntactic templates are applied to produce extraction patterns. For example, "[The World Trade Center], [an icon] of [New York City], was intentionally attacked very early on [September 11, 2001]" yields the patterns: <x> was attacked, icon of <x>, was attacked on <x>.

AutoSlog-TS (Step 2)

In a second pass over the relevant and irrelevant texts, AutoSlog-TS computes a frequency and a probability for every extraction pattern gathered in Step 1 (e.g., <x> was attacked, icon of <x>, was attacked on <x>). [Table: Extraction Patterns / Freq / Prob; the numeric values were not preserved in this transcript.]

Identifying Subjective and Objective Patterns

AutoSlog-TS generates 2 statistics for each pattern:
F = pattern frequency
P = relevant frequency / pattern frequency

We call a pattern subjective if F ≥ 5 and P ≥ .95 (6364 subjective patterns were learned).
We call a pattern objective if F ≥ 5 and P ≤ .15 (832 objective patterns were learned).
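A sketch of this selection step, assuming `pattern_counts` maps each pattern to its frequency in the subjective (relevant) sentences and its total frequency, the two statistics AutoSlog-TS produces.

```python
# Sketch of the pattern selection step. `pattern_counts` maps each pattern to
# (freq_in_subjective_sentences, total_freq); thresholds follow the slide above.

def label_patterns(pattern_counts, min_freq=5, subj_prob=0.95, obj_prob=0.15):
    subjective, objective = [], []
    for pattern, (rel_freq, total_freq) in pattern_counts.items():
        if total_freq < min_freq:            # require F >= 5 either way
            continue
        p = rel_freq / total_freq            # P = relevant freq / pattern freq
        if p >= subj_prob:
            subjective.append(pattern)
        elif p <= obj_prob:
            objective.append(pattern)
    return subjective, objective
```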

Examples of Learned Extraction Patterns

Subjective patterns: believes, was convinced, aggression against, to express, support for
Objective patterns: increased production, took effect, delegation from, occurred on, plans to produce

Patterns with Interesting Behavior

Very similar patterns can behave quite differently: asked vs. was asked, was expected vs. was expected from, talk of vs. is talk, put vs. put end, is fact vs. fact is. [Table: PATTERN / FREQ / P(Subj | Pattern); the numeric values were not preserved in this transcript.]

Augmenting the Rule-Based Classifiers with Extraction Patterns

[Table: SubjRec / SubjPrec / SubjF for the Subj RBC with and without patterns, and ObjRec / ObjPrec / ObjF for the Obj RBC with and without patterns; the numeric scores were not preserved in this transcript.]

Outline of Talk
– Learning subjective nouns with extraction patterns
– Automatically generating training data with high-precision classifiers
– Learning subjective and objective extraction patterns
– Naïve Bayes classification and self-training

Naïve Bayes Classifier

We created an NB classifier using the initial training set and several set-valued features:
– strong & weak subjective clues from the RBCs
– subjective & objective extraction patterns
– POS tags (pronouns, modals, adjectives, cardinal numbers, adverbs)
– separate features for each of the current, previous, and next sentences

(A feature-encoding sketch follows below.)
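A minimal sketch of how such set-valued features might be encoded and trained, using scikit-learn's MultinomialNB as a stand-in; the talk does not specify the original implementation, and `sents[j]` is assumed to hold the precomputed feature strings (clues, patterns, POS tags) for sentence j.

```python
from sklearn.feature_extraction import DictVectorizer
from sklearn.naive_bayes import MultinomialNB

# Sketch only: each sentence is represented by counts of its feature strings,
# with separate copies of the features for the previous, current, and next
# sentences (the prefixes keep them distinct).

def features(sents, i):
    feats = {}
    for offset, tag in [(-1, "prev"), (0, "cur"), (1, "next")]:
        j = i + offset
        if 0 <= j < len(sents):
            for f in sents[j]:
                key = f"{tag}:{f}"
                feats[key] = feats.get(key, 0) + 1
    return feats

def train_nb(sents, labels):
    vec = DictVectorizer()
    X = vec.fit_transform(features(sents, i) for i in range(len(sents)))
    return vec, MultinomialNB().fit(X, labels)
```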

Naïve Bayes Training

The training set feeds the extraction pattern learner, which produces the subjective and objective patterns; these patterns, together with the subjective clues and POS features, are then used to train the Naïve Bayes classifier.

Naïve Bayes Results

[Table: SubjRec / SubjPrec / SubjF and ObjRec / ObjPrec / ObjF for the Naïve Bayes classifier and for RWW (supervised); the numeric scores were not preserved in this transcript.]

Self-Training Process

The trained Naïve Bayes classifier labels the unlabeled sentences, and the best N sentences are added to the training set. The extraction pattern learner and the Naïve Bayes classifier are then retrained on the enlarged training set (again with the subjective clues and POS features), and the process repeats.
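One self-training round might look like the sketch below, reusing `vec` and `clf` from the Naïve Bayes sketch above; `n_best` and the confidence measure (maximum posterior probability) are assumptions, and after each round the patterns and classifier would be retrained on the enlarged set.

```python
import numpy as np

# Sketch of one self-training round: move the most confidently labeled
# sentences (highest posterior) from the unlabeled pool into the training set.

def self_train_round(vec, clf, train_feats, train_labels, unlabeled_feats,
                     n_best=1000):
    probs = clf.predict_proba(vec.transform(unlabeled_feats))
    chosen = set(np.argsort(probs.max(axis=1))[-n_best:].tolist())
    for i in chosen:
        train_feats.append(unlabeled_feats[i])
        train_labels.append(clf.classes_[probs[i].argmax()])
    remaining = [f for i, f in enumerate(unlabeled_feats) if i not in chosen]
    return train_feats, train_labels, remaining
```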

Self-Training Results

[Table: SubjRec / SubjPrec / SubjF and ObjRec / ObjPrec / ObjF for the Subj/Obj RBCs with patterns and the Naïve Bayes classifier (two rows each, apparently before and after self-training), plus RWW03 (supervised); the numeric scores were not preserved in this transcript.]

Conclusions
– We can build effective subjective sentence classifiers using only unannotated texts.
– Extraction pattern bootstrapping can learn subjective nouns.
– Extraction patterns can represent richer subjective expressions.
– Learning methods can discover subtle distinctions between very similar expressions.

THE END Thank you!

Related Work
– Genre classification (e.g., [Karlgren and Cutting 1994; Kessler et al. 1997; Wiebe et al. 2001])
– Learning adjectives, adjective phrases, verbs, and n-grams [Turney 2002; Hatzivassiloglou & McKeown 1997; Wiebe et al. 2001]
– Semantic lexicon learning [Hearst 1992; Riloff & Shepherd 1997; Roark & Charniak 1998; Caraballo 1999]
  – Meta-Bootstrapping [Riloff & Jones 99]
  – Basilisk [Thelen & Riloff 02]

What is Information Extraction?

Extracting facts relevant to a specific topic from narrative text. Example domains:
– Terrorism: perpetrator, victim, target, date, location
– Management succession: person fired, successor, position, organization, date
– Infectious disease outbreaks: disease, organism, victim, symptoms, location, date

Information Extraction from Narrative Text

Role relationships define the information of interest; keywords and named entities are not sufficient. Compare "Researchers have discovered how anthrax toxin destroys cells and rapidly causes death..." with "Troops were vaccinated against anthrax, cholera, ...": both mention anthrax, but they describe very different events.

Ranking and Manual Review

The patterns are ranked using the metric:

RlogF(pattern_i) = (F_i / N_i) * log2(F_i)

where F_i is the number of instances of pattern_i in relevant texts and N_i is the number of instances of pattern_i in all texts. A domain expert reviews the top-ranked patterns and assigns thematic roles to the good ones.
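The metric is direct to compute; a small sketch from the definitions above:

```python
import math

# RlogF as defined above: (F_i / N_i) * log2(F_i), where F_i is the pattern's
# frequency in relevant texts and N_i its frequency in all texts.
def rlogf(freq_relevant, freq_total):
    if freq_relevant == 0:
        return float("-inf")      # log2(0) is undefined; rank such patterns last
    return (freq_relevant / freq_total) * math.log2(freq_relevant)
```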

Semantic Lexicons

A semantic lexicon assigns categories to words, e.g., politician → human, truck → vehicle, grenade → weapon. Semantic dictionaries are hard to come by, especially for specialized domains. WordNet [Miller 90] is popular but is not always sufficient: [Roark & Charniak 98] found that 3 of every 5 words learned by their system were not present in WordNet.

The Bootstrapping Era

Unannotated Texts + Bootstrapping = KNOWLEDGE!

Meta-Bootstrapping

The same loop in the disease domain: the best extraction pattern (e.g., outbreak of <x>) yields noun extractions such as anthrax, ebola, cholera, flu, and plague; later cycles add smallpox, tularemia, and botulism.

Semantic Lexicon (NP) Results

Iter  Company(Web)  Location(Web)  Title(Web)    Location(Terror)  Weapon(Terror)
1     5/5 (1.0)     5/5 (1.0)      0/1 (0)       5/5 (1.0)         4/4 (1.0)
10    25/32 (.78)   46/50 (.92)    22/31 (.71)   32/50 (.64)       31/44 (.70)
20    52/65 (.80)   88/100 (.88)   63/81 (.78)   66/100 (.66)      68/94 (.72)
30    72/113 (.64)  129/150 (.86)  86/131 (.66)  100/150 (.67)     85/144 (.59)

Basilisk

Starting from a corpus and seed words, Basilisk generates extraction patterns and their extractions. The best patterns enter a Pattern Pool, their extractions enter a Candidate Word Pool, and the 5 best candidate words are added to the semantic lexicon before the cycle repeats.

The Pattern Pool

Every extraction pattern is scored and the best patterns are put into a Pattern Pool. The scoring function is:

RlogF(pattern_i) = (F_i / N_i) * log2(F_i)

where F_i is the number of category members extracted by pattern_i and N_i is the total number of nouns extracted by pattern_i.

Scoring Candidate Words

Each candidate word is scored by:
1. collecting all the patterns that extracted it
2. computing the average number of category members extracted by those patterns

AvgLog(word_i) = ( Σ_{j=1..N_i} log2(F_j + 1) ) / N_i

where N_i is the number of patterns that extracted word_i and F_j is the number of category members extracted by pattern j.
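A small sketch from the definition above, where `member_counts` holds F_j for each pattern that extracted the word:

```python
import math

# AvgLog as defined above: the mean of log2(F_j + 1) over the patterns that
# extracted the word, where F_j is the number of known category members
# extracted by pattern j. Assumes the word was extracted by >= 1 pattern.
def avglog(member_counts):
    return sum(math.log2(f + 1) for f in member_counts) / len(member_counts)
```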

Bootstrapping a Single Category

Bootstrapping Multiple Categories

A Smarter Scoring Function

We incorporated knowledge about competing semantic categories directly into the scoring function. The modified function computes the difference between the word's score for the target category and its best score among the competing categories:

diff(w_i, c_a) = AvgLog(w_i, c_a) - max_{b ≠ a} AvgLog(w_i, c_b)
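A sketch from this definition, where `avglog_fn(word, category)` is assumed to compute the AvgLog score of a word for a given category:

```python
# diff as defined above: the word's AvgLog score for the target category minus
# its best AvgLog score among all competing categories.
def diff_score(word, target, categories, avglog_fn):
    competing = max(avglog_fn(word, c) for c in categories if c != target)
    return avglog_fn(word, target) - competing
```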