Subjectivity Annotation Update Josef Ruppenhofer Jan Wiebe.

Slides:



Advertisements
Similar presentations
Critical Reading Strategies: Overview of Research Process
Advertisements

Recognizing Question Stems So you’ve read the text… now what?
Professor Sanjoy Bandopadhyay Department of Instrumental Music, Rabindra Bharati University.

1 Extraction and Summarization of Opinions. 2 Subjectivity: opinions, emotions, motivations, speculations, sentiments Information Extraction of –NL expressions.
Manual Subjectivity Analysis. EUROLAN July 30, Preliminaries What do we mean by subjectivity? The linguistic expression of somebody’s emotions,
A Brief Overview. Contents Introduction to NLP Sentiment Analysis Subjectivity versus Objectivity Determining Polarity Statistical & Linguistic Approaches.
The JDPA Sentiment Corpus for the Automotive Domain Miriam Eckert, Lyndsie Clark, Nicolas Nicolov J.D. Power and Associates Jason S. Kessler Indiana University.
Annotating Topics of Opinions Veselin Stoyanov Claire Cardie.
Reading Strategies For ACTIVE reading.
April 26th, 2007 Workshop on Treebanking, HLT/NAACL, Rochester 1 Layering of Annotations in the Penn Discourse TreeBank (PDTB) Rashmi Prasad Institute.
Recognizing Implicit Discourse Relations in the Penn Discourse Treebank Ziheng Lin, Min-Yen Kan, and Hwee Tou Ng Department of Computer Science National.
Active ReadingStrategies. Reader Reception Theory emphasizes that the reader actively interprets the text based on his or her particular cultural background.
ISHER: Integrated Social History Environment for Research Sophia Ananiadou National Centre for Text Mining School of Computer Science.
Multi-Perspective Question Answering Using the OpQA Corpus Veselin Stoyanov Claire Cardie Janyce Wiebe Cornell University University of Pittsburgh.
Intellectual Challenge of Teaching
Annotating Expressions of Opinions and Emotions in Language Wiebe, Wilson, Cardie.
Position Papers Drafting. Drafting n Developing Your Topic –Draw on personal experience. –Use secondary sources.
1 Attributions and Private States Jan Wiebe (U. Pittsburgh) Theresa Wilson (U. Pittsburgh) Claire Cardie (Cornell U.)
Combining Low-Level and Summary Representations of Opinions for Multi- Perspective Question Answering Claire Cardie, Janyce Wiebe, Theresa Wilson, Diane.
Learning Subjective Adjectives from Corpora Janyce M. Wiebe Presenter: Gabriel Nicolae.
Chapter 6 Consumer Attitudes Consumer Attitudes.
JOHN F. SHEEHAN, PH.D. EVIDENCE-BASED OBSERVATIONS.
Print slide 8,9,10 handouts.
Academic English Seminar Skills “An Introduction to EAP – Academic Skills in English” Lesson 1.
--- Hephizibah Roskelly and David A. Jolliffee, Everyday Use
NSW Curriculum and Learning Innovation Centre Draft Senior Secondary Curriculum ENGLISH May, 2012.
Scott Duvall, Brett South, Stéphane Meystre A Hands-on Introduction to Natural Language Processing in Healthcare Annotation as a Central Task for Development.
Developing reading skills Factors involved in effective reading
RESEARCH POSTER PRESENTATION DESIGN © © 2012 PosterPresentations.com 2117 Fourth Street, Unit C Berkeley CA 94710
Summary-Response Essay Responding to Reading. Reading Critically Not about finding fault with author Rather engaging author in a discussion by asking.
Josef Ruppenhofer, Swapna Somasundaran, Janyce Wiebe University of Pittsburgh Finding the sources and targets of subjective expressions.
Attribution: speech and thought representation Bringing other voices into a text.
Some Ways of Taking Notes. Before reading… Look at the table of contents. Skim introductions and conclusions. Avoid isolating facts without considering.
Sentiment and Opinion Sep18, 2012 Analysis of Social Media Seminar William Cohen.
Minimally Supervised Event Causality Identification Quang Do, Yee Seng, and Dan Roth University of Illinois at Urbana-Champaign 1 EMNLP-2011.
Globally Speaking: The Effect of Internal Message Frames on Attitudes and Cognitive Processing Focused on Internationalizing Agricultural Extension within.
1 Discussion of: Discourse Referents and External Anchors in Developmental Thought by Josef Perner Alan Garnham Psychology University of Sussex
Blog Summarization We have built a blog summarization system to assist people in getting opinions from the blogs. After identifying topic-relevant sentences,
Multilingual Opinion Holder Identification Using Author and Authority Viewpoints Yohei Seki, Noriko Kando,Masaki Aono Toyohashi University of Technology.
Academic Reading ENG 115.
Creating Subjective and Objective Sentence Classifier from Unannotated Texts Janyce Wiebe and Ellen Riloff Department of Computer Science University of.
REVIEW: 6 Elements of an Argument
Topic: Opinion Extraction and Summarization. Opinion Extraction and Summarization What follows: perspective of Cardie, Riloff, Wiebe We can see similar.
Annotating Opinions in the World Press Theresa Wilson and Janyce Wiebe University of Pittsburgh Intelligent Systems Program and Department of Computer.
7/2003EMNLP031 Learning Extraction Patterns for Subjective Expressions Ellen Riloff Janyce Wiebe University of Utah University of Pittsburgh.
From Words to Senses: A Case Study of Subjectivity Recognition Author: Fangzhong Su & Katja Markert (University of Leeds, UK) Source: COLING 2008 Reporter:
CS3730 Fall 2008 Subjectivity and Sentiment Analysis Lecture (Day 2): Introduction to linguistic subjectivity.
Annotating and measuring Temporal relations in texts Philippe Muller and Xavier Tannier IRIT,Université Paul Sabatier COLING 2004.
Word Sense and Subjectivity (Coling/ACL 2006) Janyce Wiebe Rada Mihalcea University of Pittsburgh University of North Texas Acknowledgements: This slide.
What is rhetoric? What you need to know for AP Language.
Finding strong and weak opinion clauses Theresa Wilson, Janyce Wiebe, Rebecca Hwa University of Pittsburgh Just how mad are you? AAAI-2004.
CAS Managebac update CAS opportunity for someone with a scanner. Cambodia?
STEPS FOR PASSING THE AP RHETORICAL ESSAY 4 Components 4 Components 1) What is the author’s purpose? What does the author hope to achieve? 1) What is the.
LO: To understand stereotypes. What is a stereotype? a widely held but fixed and oversimplified image or idea of a particular type of person or thing.
Lobbying your MP: When you’d rather meet than tweet! Training Materials on Refugee Rights March 23, 2016.
A Change of Heart About Animals
Identifying Expressions of Opinion in Context Eric Breck and Yejin Choi and Claire Cardie IJCAI 2007.
Non-fiction and Media Higher Tier.
English Hub School networks A-level English Language
Manual Subjectivity Analysis
SEMANTICS VS PRAGMATICS
Effective Reading Strategies
Improving a Pipeline Architecture for Shallow Discourse Parsing
Language Component 1: 20th Century Reading
The meaning, association, or emotion that has come to be attached to a word is its connotation.
TEACHING READING Indawan Syahri 12/8/2018 indawansyahri.
READING NON-FICTION QUESTION ONE QUESTION TWO QUESTION THREE
READING NON-FICTION QUESTION ONE QUESTION TWO QUESTION THREE
Is murder wrong? A: What is murder? B: What is the law on murder in the UK? A: Do you think murder is wrong? B: Do you think murder is wrong? ‘Garment.
Presentation transcript:

Subjectivity Annotation Update Josef Ruppenhofer Jan Wiebe

Outline Update on our annotations Update on our annotations Exploration of subjectivity and Discourse Treebank annotations Exploration of subjectivity and Discourse Treebank annotations

Subjectivity Annotation Before this year: MPQA annotation scheme and corpus Before this year: MPQA annotation scheme and English language versions of articles from the world press (187 news sources) English language versions of articles from the world press (187 news sources) 535 Documents; 11,114 sentences 535 Documents; 11,114 sentences Wiebe, Wilson, Cardie. Annotating Expressions of Opinions and Emotions in Language. LRE 2005.

Subjectivity Annotation Current work (goals have been expanded schemes, not high volume annotation) Current work (goals have been expanded schemes, not high volume annotation) Extended MPQA Extended MPQA Additional data annotated Additional data annotated 2005 LRE scheme plus extensions 2005 LRE scheme plus extensions Theresa Wilson’s PhD dissertation (2008) Theresa Wilson’s PhD dissertation (2008) [not-yet-released extensions added to the MPQA corpus] [not-yet-released extensions added to the MPQA corpus] Discourse level relations between opinions Discourse level relations between opinions Subjectivity in health surveillance texts Subjectivity in health surveillance texts Word sense subjectivity and polarity Word sense subjectivity and polarity

Extended MPQA Scheme Documents Documents 85 Xbank files 85 Xbank files “Boyan” subset of ULA data “Boyan” subset of ULA data 1/3 completed 1/3 completed expected completion: early summer expected completion: early summer [MPQA] [MPQA]

Extended MPQA Scheme Annotators Annotators Josef Josef two undergraduates two undergraduates

Training Time & Effort Training Training Josef’s effort: 75 hours (~ 2 weeks) Josef’s effort: 75 hours (~ 2 weeks) 10 preparing materials 10 preparing materials 40 basic training 40 basic training 25 extensions 25 extensions Annotators: 120 hours combined (~1.5 weeks each) Annotators: 120 hours combined (~1.5 weeks each) Problems with scheduling arose (annotators did not work planned hours per week; redundant one-on-one meetings) Problems with scheduling arose (annotators did not work planned hours per week; redundant one-on-one meetings) With perfect scheduling, estimate 1 week to train two annotators (though Josef is involved in production annotation) With perfect scheduling, estimate 1 week to train two annotators (though Josef is involved in production annotation)

Production annotation Single annotator per document Single annotator per document Annotator time per document (very rough est) Annotator time per document (very rough est) 2 hours 45 mins 2 hours 45 mins 45 mins of which is time spent on consultation, 15 with each other, 30 with Josef 45 mins of which is time spent on consultation, 15 with each other, 30 with Josef

Periodic Agreement Testing documents with known gold standard documents with known gold standard no consultation no consultation ~every 5 documents ~every 5 documents post-mortem meetings (one on one, group) post-mortem meetings (one on one, group) Four annotations to compare (Theresa Wilson, Josef, two undergraduate annotators) Four annotations to compare (Theresa Wilson, Josef, two undergraduate annotators) [Results of previous agreement studies in previous papers] [Results of previous agreement studies in previous papers]

Agreement measurement So far, average pair-wise agreement calculated per document So far, average pair-wise agreement calculated per document Full analysis forthcoming Full analysis forthcoming Relative label reliability: agent > direct-subjective > target > attitude > objective-speech-event > expressive-subjective- element Relative label reliability: agent > direct-subjective > target > attitude > objective-speech-event > expressive-subjective- element Given the interactions between the labels, errors are interrelated Given the interactions between the labels, errors are interrelated

Annotation Schemes

What is Subjectivity? The linguistic expression of somebody’s opinions, sentiments, emotions, evaluations, beliefs, speculations (private states) Private state: state that is not open to objective observation or verification Quirk, Greenbaum, Leech, Svartvik (1985). A Comprehensive Grammar of the English Language.

Overview Fine-grained: expression-level rather than sentence or document level Annotate – Subjective xpressions – material attributed to a source, but presented objectively

Overview Focus on three ways private states are expressed in language

Direct Subjective Expressions Direct mentions of private states The United States fears a spill-over from the anti- terrorist campaign. Private states expressed in speech events “We foresaw electoral fraud but not daylight robbery,” Tsvangirai said.

Expressive Subjective Elements [Banfield 1982] “ We foresaw electoral fraud but not daylight robbery, ” Tsvangirai said The part of the US human rights report about China is full of absurdities and fabrications

Objective Speech Events Material attributed to a source, but presented as objective fact The government, it added, has amended the Pakistan Citizenship Act 10 of 1951 to enable women of Pakistani descent to claim Pakistani nationality for their children born to foreign husbands.

Nested Sources “The US fears a spill-over’’, said Xirao-Nima, a professor of foreign affairs at the Central University for Nationalities. (writer, Xirao-Nima, US) (writer, Xirao-Nima) (writer) “The report is full of absurdities,’’ he continued. (writer, Xirao-Nima) (writer)

“The report is full of absurdities,” Xirao-Nina said. Objective speech event anchor: the entire sentence source: implicit: true Direct subjective anchor: said source: intensity: high expression intensity: neutral attitude type: negative target: report Expressive subjective element anchor: full of absurdities source: intensity: high attitude type: negative

Objective speech event anchor: the entire sentence source: implicit: true Objective speech event anchor: said source: Direct subjective anchor: fears source: intensity: medium expression intensity: medium attitude type: negative target: [new work] “The US fears a spill-over’’, said Xirao-Nima, a professor of foreign affairs at the Central University for Nationalities.

Extensions Wilson 2008 I think people are happy because Chavez has fallen. direct subjective span: are happy source: attitude: inferred attitude span: are happy because Chavez has fallen type: neg sentiment intensity: medium target: target span: Chavez has fallen target span: Chavez attitude span: are happy type: pos sentiment intensity: medium target: direct subjective span: think source: attitude: attitude span: think type: positive arguing intensity: medium target: target span: people are happy because Chavez has fallen

Subjectivity Types Wilson 2008 Other (esp. general cognition)

Discourse-Level Opinion Frames in Task-Oriented Dialogs (AMI) Frames are defined in terms of their components Frames are defined in terms of their components Opinion spans Opinion spans Opinion type Opinion type Sentiment Sentiment Arguing Arguing Opinion Polarity Opinion Polarity Targets Targets Sources Sources Relationships between targets Relationships between targets Same or alternative Same or alternative Example motivation: polarity and targets interact Example motivation: polarity and targets interact E.g. an argument for one design that is simultaneously an argument against an alternative design E.g. an argument for one design that is simultaneously an argument against an alternative design

Subjectivity in Health Surveillance Texts Types Types Sentiment Sentiment Belief Belief Belief about what is the case Belief about what is the case Belief about what should or should not be done Belief about what should or should not be done Knowledge/Awareness of facts Knowledge/Awareness of facts Agreement/Disagreement between sources in the text Agreement/Disagreement between sources in the text … Sources Sources Writer Writer Media Media Non-media organizations Non-media organizations Members of the general public Members of the general public … Targets Targets Occurrence of a disease outbreak Occurrence of a disease outbreak Danger/severity of an outbreak Danger/severity of an outbreak Cause of a disease Cause of a disease Symptoms Symptoms …

Exploring the relationship between PDTB (2) and Extended MPQA

Penn Discourse TreeBank Hierarchy of discourse relations with 4 top nodes – Temporal – Contingency – Comparison – Expansion

Overview Richer interpretations via combination Potential disambiguation both ways

A connective is marked as “Restatement” when it indicates that the semantics of Arg2 restates the semantics of Arg1. It is inferred that the situations described in Arg1 and Arg2 hold true at the same time. Expansion Restatement GeneralizationSpecificationEquivalence

Subjectivity preserved in a restatement “This means Nestle is now in the candybar business in a big way," said Lisbeth Echeandia, publisher of Orlando, Fla.-based Confectioner Magazine. “For them, it makes all kinds of sense.”

Subjectivity preserved in a restatement: PDTB “[This means Nestle is now in the candybar business in a big way ARG1]," said Lisbeth Echeandia, publisher of Orlando, Fla.-based Confectioner Magazine. “IMPLICIT_IN SHORT [For them, it makes all kinds of sense ARG2].

Subjectivity preserved in a restatement "This [means Nestle is now in the candybar business in a big way ARGUING-POS]," said Lisbeth Echeandia, publisher of Orlando, Fla.- based Confectioner Magazine. “[For them, it makes all kinds of sense ARGUING-POS]. Related opinions; part of the same larger opinion

Subjectivity preserved in a restatement "This [means Nestle is now in the candybar business in a big way ARGUING-POS]," said Lisbeth Echeandia, publisher of Orlando, Fla.- based Confectioner Magazine. “[For them, it makes all kinds of sense ARGUING-POS]. Same polarity, type, source; Hyp: common pattern with restatement

Subjectivity preserved in a restatement "This [means Nestle is now in the candybar business in a big way ARGUING-POS]," said Lisbeth Echeandia, publisher of Orlando, Fla.- based Confectioner Magazine. “[For them, it makes all kinds of sense ARGUING-POS]. Semantics of restatement: sameness includes subjectivity

Note: Sentiment "This means Nestle is now in the candybar business in a big way," said Lisbeth Echeandia, publisher of Orlando, Fla.-based Confectioner Magazine. “For them, it [makes all kinds of sense SENTIMENT-POS].” Not directly part of the restatement relation

The type “Cause” is used when the connective indicates that the situations described in Arg1 and Arg2 are causally influenced and the two are not in a conditional relation … Contingency Cause ReasonResult

Subjectivity preserved across Reason relation But Mr. Schwarz welcomes the competition in U.S. Trust's flagship businesses, calling it "flattery." Mr. Schwarz says the competition "broadens the base of opportunity for us." Other firms "are dealing with the masses…”

Subjectivity preserved across Reason relation: PDTB [But Mr. Schwarz welcomes the competition in U.S. Trust's flagship businesses ARG1], [calling it "flattery SUP1]." Mr. Schwarz says IMPLICIT_BECAUSE [the competition "broadens the base of opportunity for us ARG2]." Other firms "are dealing with the masses…” ARG2 is a reason for ARG1

Subjectivity preserved across Reason relation: subjectivity But Mr. Schwarz [welcomes SENTIMENT-POS] the competition in U.S. Trust's flagship businesses, calling it "flattery." Mr. Schwarz says the competition “[broadens the base of opportunity for us SENTIMENT-POS]." Other firms "are dealing with the masses. Positive evaluation which is a reason for a positive feeling; same overall opinion “I like it because it is so good”

Subjectivity preserved across Reason relation: subjectivity But Mr. Schwarz [welcomes SENTIMENT-POS] the competition in U.S. Trust's flagship businesses, calling it "flattery." Mr. Schwarz says the competition “[broadens the base of opportunity for us SENTIMENT-POS]." Other firms "are dealing with the masses. Subjectivity: same source, target, polarity, type; Hyp: common with reason; Help with target recognition, for example.

Subjectivity preserved across Reason relation: subjectivity But Mr. Schwarz [welcomes SENTIMENT-POS] the competition in U.S. Trust's flagship businesses, calling it "flattery." Mr. Schwarz says the competition “[broadens the base of opportunity for us SENTIMENT-POS]." Other firms "are dealing with the masses. Semantics of reason: specific subtype, where an evaluation is a reason for an attitude

The type “Cause” is used when the connective indicates that the situations described in Arg1 and Arg2 are causally influenced and the two are not in a conditional relation … Contingency Cause ReasonResult

Polarity preserved across Result relation Other firms "are dealing with the masses. I don't believe they have the culture" to adequately service high-net-worth individuals, he adds.

Polarity preserved across Result relation: PDTB [Other firms "are dealing with the masses ARG1]. I don't believe IMPLICIT_SO [they have the culture" to adequately service high-net- worth individuals ARG2], he adds. ARG2 is a result of ARG1

Polarity preserved across Result relation: PDTB [Other firms "are dealing with the masses ARG1]. I don't believe IMPLICIT_SO [they have the culture" to adequately service high-net- worth individuals ARG2], he adds. X said Y: “X said”  X’s belief space “I don’t believe” explicit in second sentence “Swartz said” implicit in first sentence ARG spans: Dis. Rel within Swartz’s belief space

Polarity preserved across Result relation: subjectivity Other firms “[are dealing with the masses SENTIMENT-NEG]. I [don't believe they have the culture" to adequately service high-net- worth individuals SENTIMENT-NEG], he adds. Attitude span includes “don’t believe”; schemes require different notions of spans

Polarity preserved across Result relation: subjectivity Other firms “[are dealing with the masses SENTIMENT-NEG]. I [don't believe they have the culture" to adequately service high-net- worth individuals SENTIMENT-NEG], he adds. Two negative properties, where the second is a result of the first

Polarity preserved across Result relation: subjectivity Other firms “[are dealing with the masses SENTIMENT-NEG]. I [don't believe they have the culture" to adequately service high-net- worth individuals SENTIMENT-NEG], he adds. Dis Rel between ARGS inside his belief space

Polarity preserved across Result relation: subjectivity Other firms “[are dealing with the masses SENTIMENT-NEG]. I [don't believe they have the culture" to adequately service high-net- worth individuals SENTIMENT-NEG], he adds. Semantics of result: specific subtype, where a negative state of affairs is the result of another one

The class tag “COMPARISON” applies when the connective indicates that a discourse relation is established between Arg1 and Arg2 in order to highlight prominent differences between the two situations. Semantically, the truth of both arguments is independent of the connective or the established relation. Comparison Contrast JuxtapositionOpposition Concession expectation Contra- expectation

In that suit, the SEC accused Mr. Antar of engaging in a "massive financial fraud" to overstate the earnings of Crazy Eddie, Edison, N.J., over a three-year period. Through his lawyers, Mr. Antar has denied allegations in the SEC suit and in civil suits previously filed by shareholders against Mr. Antar and others.

PDTB [In that suit, the SEC accused Mr. Antar of engaging in a "massive financial fraud" to overstate the earnings of Crazy Eddie, Edison, N.J., over a three- year period. ARG1] IMPLICIT_HOWEVER [ Through his lawyers, Mr. Antar has denied allegations in the SEC suit and in civil suits previously filed by shareholders against Mr. Antar and others. ARG2] Contrast between the SEC accusing Mr. Antar of something, and his denying the accusation

Subjectivity In that suit, the SEC [[accused SENTIMENT-NEG] Mr. Antar of engaging in a "massive financial fraud" to overstate the earnings of Crazy Eddie, Edison, N.J. ARGUING-POS], over a three-year period. Through his lawyers, Mr. Antar [has denied AGREE- NEG] allegations in the SEC suit and in civil suits previously filed by shareholders against Mr. Antar and others. Two attitudes combined into one large disagreement between two parties

Subjectivity In that suit, the SEC [[accused SENTIMENT-NEG] Mr. Antar of engaging in a "massive financial fraud" to overstate the earnings of Crazy Eddie, Edison, N.J. ARGUING-POS], over a three-year period. Through his lawyers, Mr. Antar [has denied AGREE-NEG] allegations in the SEC suit and in civil suits previously filed by shareholders against Mr. Antar and others. Subjectivity: arguing-pos and agree-neg with different sources; Hyp: common with contrast. Help recognize the implicit contrast.

Subjectivity In that suit, the SEC [[accused SENTIMENT-NEG] Mr. Antar of engaging in a "massive financial fraud" to overstate the earnings of Crazy Eddie, Edison, N.J. ARGUING-POS], over a three-year period. Through his lawyers, Mr. Antar [has denied AGREE-NEG] allegations in the SEC suit and in civil suits previously filed by shareholders against Mr. Antar and others. Semantics of comparison: specific case of highlighting prominent differences in attitudes of different people

Discourse and opinion relations are not redundant Compare (PDTB) – Its like [people hate him ARG1] because Reason [people love him so much ARG2] and people love him so much because people hate him so much. ARG2 reason for ARG1 – [Some people hate him ARG1]. IMPLICIT Contrast [ Others love him ARG2]. – Subjectivity is the same: contrasting polarities, the same target, different sources.

Discourse and opinion relations are not redundant [Some people hate him ARG1]. IMPLICIT Contrast [ Others love him ARG2]. – Sentiment-neg Sentiment-pos – Source and polarity contrasts [I like the Lexus ARG1] but Contrast [my wife likes the Prius ARG2]. – Sentiment-pos Sentiment-pos – Source and target contrasts

Discourse and opinion relations are not redundant ARGument and subjectivity spans often do not match exactly

Etc. Attributions and nested sources Though the schemes are not redundant, some relations seem to imply subjectivity – E.g., Pragmatic cause; implicit assertions Discourse relations may help uncover inferred attitudes