Identifying Opinion Holders for Question Answering in Opinion Texts Soo-Min Kim and Eduard Hovy Information Sciences Institute University of Southern California.

Slides:



Advertisements
Similar presentations
1 Opinion Summarization Using Entity Features and Probabilistic Sentence Coherence Optimization (UIUC at TAC 2008 Opinion Summarization Pilot) Nov 19,
Advertisements

Collaborative Filtering in Social Tagging System on Joint Item-Tag Recommendations Date : 2011/11/7 Source : Jing Peng et. al (CIKM’10) Speaker : Chiu.
Playing the Telephone Game: Determining the Hierarchical Structure of Perspective and Speech Expressions Eric Breck and Claire Cardie Department of Computer.
Annotating Topics of Opinions Veselin Stoyanov Claire Cardie.
第三講 Recode、missing value、假設檢定與信賴區間
: Arrange the Numbers ★★★☆☆ 題組: Contest Archive with Online Judge 題號: 11481: Arrange the Numbers 解題者:李重儀 解題日期: 2008 年 9 月 13 日 題意: 將數列 {1,2,3, …,N}
布林代數的應用--- 全及項(最小項)和全或項(最大項)展開式
Mathcad 基本認識 再mathcad中等於(=)的符號有區分為三種: 第一種:冒號等於(:=)是代表我們要定義ㄧ個參數
Mar. 14, :Vito ’ s family ★☆☆☆☆ 題組: Problem Set Archive with Online Judge 題號: 10014:Vito’s family 解題者:劉淑惠、侯沛彣 解題日期: 2006 年 3 月 9 日 題意: Vito.
HSR 課程介紹. 指定用書 Health Services Research Method Leiyu Shi 2008.
核心能力意見調查 計畫主持人:劉義周教授 研究助理: 林珮婷 報告日期: 調查案的目標與性質 調查的主要目的在進行宣傳,讓全校師生可以瞭 解何謂「課程地圖」與「核心能力」。 通識中心將核心能力主要區分為「學術訓練」、 「就業準備」、「公民文化養成」、「個人特質 提升」等四大面向,本調查依據此四大面向進一.
Self proxy signature scheme IJCSNS International Journal of Computer Science and Network Security,VOL.7 No.2,Februry 2007 Author:Young-seol Kim,Jik Hyun.
: Boxes ★★★☆☆ 題組: Problem Set Archive with Online Judge 題號: 11003: Boxes 解題者:蔡欣燁 解題日期: 2007 年 3 月 19 日.
: Y-game ★★★☆☆ 題組: Contest Volumes with Online Judge 題號: 11841: Y-game 解題者:林雨姍 解題日期: 2011 年 5 月 19 日 題意: Y-game 是一個在三角型網格上放白 子與黑子的遊戲。根據題目給的點,判斷.
: OPENING DOORS ? 題組: Problem Set Archive with Online Judge 題號: 10606: OPENING DOORS 解題者:侯沛彣 解題日期: 2006 年 6 月 11 日 題意: - 某間學校有 N 個學生,每個學生都有自己的衣物櫃.
程式註解說明. 2 程式註解格式 塊狀註解 對檔案、 class 、 method 、資料結構、一段程式 …. 等程式區塊 做說明。 第一行的開頭必需為 “/*” 且沒有其他文字,最後一行的開頭 必需以 “*/” 做為結束,在中間每一行的開頭都必需是一個 “*” 。 單行註解 佔據一整行的說明。 以.
: The Playboy Chimp ★★☆☆☆ 題組: Problem Set Archive with Online Judge 題號: 10611: The Playboy Chimp 解題者:蔡昇宇 解題日期: 2010 年 2 月 28 日 題意:給一已排序的數列 S( 升冪.
:Problem D: Bit-wise Sequence ★★★☆☆ 題組: Problem Set Archive with Online Judge 題號: 10232: Problem D: Bit-wise Sequence 解題者:李濟宇 解題日期: 2006 年 4 月 16.
: The largest Clique ★★★★☆ 題組: Contest Archive with Online Judge 題號: 11324: The largest Clique 解題者:李重儀 解題日期: 2008 年 11 月 24 日 題意: 簡單來說,給你一個 directed.
: Point of View in Flatland ★★☆☆☆ 題組: Problem Set Archive with Online Judge 題號: 11037: Point of View in Flatland 解題者:黃士庭 解題日期: 2007 年 5 月 15 日 題意:在座標平面上給三個圓,找一個點使從這.
: Happy Number ★ ? 題組: Problem Set Archive with Online Judge 題號: 10591: Happy Number 解題者:陳瀅文 解題日期: 2006 年 6 月 6 日 題意:判斷一個正整數 N 是否為 Happy Number.
一、單選題: ( 共 25 題。每題 2 分,每題只能選擇單一答案,若選兩項或兩項 以上者不予計分。註:此部份本學期改用中文命題 ) 1.__ _ Among the following workers, who are not contingent workers? a. Part-time employees.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. 參 資料蒐集的方法.
: Problem A : MiniMice ★★★★☆ 題組: Contest Archive with Online Judge 題號: 11411: Problem A : MiniMice 解題者:李重儀 解題日期: 2008 年 9 月 3 日 題意:簡單的說,題目中每一隻老鼠有一個編號.
: Count DePrimes ★★★★☆ 題組: Contest Archive with Online Judge 題號: 11408: Count DePrimes 解題者:李育賢 解題日期: 2008 年 9 月 2 日 題意: 題目會給你二個數字 a,b( 2 ≦ a ≦ 5,000,000,a.
North Point Government Primary PM School 北角官立下午小學 應用 ‘ 基本能力學生評估 ’ 及 ‘ 網上學與教支援系統 ’ 經驗分享.
:Nuts for nuts..Nuts for nuts.. ★★★★☆ 題組: Problem Set Archive with Online Judge 題號: 10944:Nuts for nuts.. 解題者:楊家豪 解題日期: 2006 年 2 月 題意: 給定兩個正整數 x,y.
: A-Sequence ★★★☆☆ 題組: Problem Set Archive with Online Judge 題號: 10930: A-Sequence 解題者:陳盈村 解題日期: 2008 年 5 月 30 日 題意: A-Sequence 需符合以下的條件, 1 ≤ a.
1 100: The 3n+1 Problem ★★★☆☆ 題組: VOLUME CII 題號: 10721: Problem C-Chopsticks 陳冠男 解題者:陳冠男、侯沛彣 解題日期: 2006 年 4 月 23 日 給定一個正整數 n (n>1) ,當 n 為奇數時令 n  3n+1.
: THE SAMS' CONTEST ☆☆★★★ 題組: Problem Set Archive with Online Judge 題號: 10520: THE SAMS' CONTEST 解題者:陳相廷,林祺光 解題日期: 2006 年 5 月 22 日 題意:依以下式子,給定 n.
: 05-2 Rendezvous ★★★☆☆ 題組: Problem Set Archive with Online Judge 題號: 11015: 05-2 Rendezvous 解題者:池明洋 解題日期: 2006 年 4 月 16 日 題意: 給 node N 個 , edge.
: Lucky Number ★★★★☆ 題組: Proble Set Archive with Online Judge 題號: 10909: Lucky Number 解題者:李育賢 解題日期: 2008 年 4 月 25 日 題意:給一個奇數數列 1,3,5,7,9,11,13,15…
: Flip Sort ★★☆☆☆ 題組: Problem Set Archive with Online Judge 題號: 10327: Flip Sort 解題者:歐子揚 解題日期: 2010 年 2 月 26 日 題意:在這個問題中使用一種排序方式 (Flip) ,意思就是 只能交換相鄰的.
845: Gas Station Numbers ★★★ 題組: Problem Set Archive with Online Judge 題號: 845: Gas Station Numbers. 解題者:張維珊 解題日期: 2006 年 2 月 題意: 將輸入的數字,經過重新排列組合或旋轉數字,得到比原先的數字大,
DECISION TREE LEARNING. Outlook Sunny Overcast Rain Humidity Yes Wind High Normal Strong Weak No Yes No Yes.
1 523: Minimum Transport Cost ★★★☆☆ 題組: Problem Set Archive with Online Judge 題號: 523: Minimum Transport Cost 解題者:林祺光 解題日期: 2006 年 6 月 12 日 題意:計算兩個城市之間最小的運輸成本,運輸.
Optimization And Differential Equations 最佳化與微分方程 Peng-Jen Lai ( 賴鵬仁 ) Department of Mathematics National Kaohsiung Normal University ( 高雄師範大學數學系 ) ( 高雄師範大學數學系.
Extreme Discrete Summation ★★★★☆ 題組: Contest Archive with Online Judge 題號: Extreme Discrete Summation 解題者:蔡宗翰 解題日期: 2008 年 10 月 13 日.
: Expressions ★★★☆☆ 題組: Problem Set Archive with Online Judge 題號: 10157: Expressions 解題者:張庭愿 解題日期: 2009 年 8 月 16 日 題意:所有的括號必須成對,且必須先出現過左 括號後才能出現右括號,如果有.
1 威斯康辛「學生教育成就保證」計劃 ( SAGE ) ( Ehrenberg, Brewer, Gamoran & Willms, 2001 ) 年間試行 對象是幼稚園到小三 (K-3) 30% 學生來自貧困家庭 每班人數 人.
: Sum-up the Primes ★★★★☆ 題組: Problem A 題號: 10419: Sum-up the Primes 解題者:林一帆 解題日期: 2006 年 5 月 15 日 題意: 評斷一個給予的數字是否是由給予個數 的質數所組成的.
連續隨機變數 連續變數:時間、分數、重量、……
資料結構實習-六.
: SAM I AM ★★★★☆ 題組: Contest Archive with Online Judge 題號: 11419: SAM I AM 解題者:李重儀 解題日期: 2008 年 9 月 11 日 題意: 簡單的說,就是一個長方形的廟裡面有敵人,然 後可以橫的方向開砲或縱向開砲,每次開砲可以.
:Count the Trees ★★★☆☆ 題組: Problem Set Archive with Online Judge 題號: 10007:Count the Trees 解題者:楊家豪 解題日期: 2006 年 3 月 題意: 給 n 個點, 每一個點有自己的 Label,
多媒體技術與應用 實習作業 Part II. 實習作業 利用 Corel Paint Shop Pro X2 完成作業。 作業一:利用影像處理的技巧,讓這張影像變 的更清晰。
第12章 團體溝通情境中的領導者.
幼兒行為觀察與記錄 第八章 事件取樣法.
Extracting Opinions, Opinion Holders, and Topics Expressed in Online News Media Text Soo-Min Kim and Eduard Hovy USC Information Sciences Institute 4676.
Agile methods: a comparative analysis Diane Strode-University of Wellington the 19 th Annual Conference of the National Advisory Committee on Computing.
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Illinois-Coref: The UI System in the CoNLL-2012 Shared Task Kai-Wei Chang, Rajhans Samdani, Alla Rozovskaya, Mark Sammons, and Dan Roth Supported by ARL,
1 Determining the Hierarchical Structure of Perspective and Speech Expressions Eric Breck and Claire Cardie Cornell University Department of Computer Science.
1 A preliminary study on unknown word problem in Chinese word segmentation Authors: Ming -Yu Lin Tung – Hui Chiang Keh-Yih Su Speaker: Jbc.
Opinion Holders in Opinion Text from Online Newspapers Youngho Kim, Yuchul Jung and Sung-Hyon Myaeng Reporter: Chia-Ying Lee Advisor: Prof. Hsin-Hsi Chen.
Finding What Matters in Questions Xiaoqiang Luo, Hema Raghavan, Vittorio Castelli, Sameer Maskey and Radu Florian IBM T.J. Watson Research Center NAACL-HLT.
1 Multi-Perspective Question Answering Using the OpQA Corpus (HLT/EMNLP 2005) Veselin Stoyanov Claire Cardie Janyce Wiebe Cornell University University.
Face Recognition by Support Vector Machines 指導教授 : 王啟州 教授 學生 : 陳桂華 Guodong Guo, Stan Z. Li, and Kapluk Chan School of Electrical and Electronic Engineering.
1 Toward Opinion Summarization: Linking the Sources Veselin Stoyanov and Claire Cardie Department of Computer Science Cornell University Ithaca, NY 14850,
! !美洲華語 李雅莉老師製作 TextVocabularyidiomStoryChallenge $100 $200 $300 $400 $500 $600 $100 $200 $300 $400 $500 $600 $100 $200 $300 $400 $500 $600 $100 $200.
Automatic Identification of Pro and Con Reasons in Online Reviews Soo-Min Kim and Eduard Hovy USC Information Sciences Institute Proceedings of the COLING/ACL.
Yang Liu State Key Laboratory of Intelligent Technology and Systems Tsinghua National Laboratory for Information Science and Technology Department of Computer.
Multilingual Opinion Holder Identification Using Author and Authority Viewpoints Yohei Seki, Noriko Kando,Masaki Aono Toyohashi University of Technology.
Creating Subjective and Objective Sentence Classifier from Unannotated Texts Janyce Wiebe and Ellen Riloff Department of Computer Science University of.
Evaluating an Opinion Annotation Scheme Using a New Multi- perspective Question and Answer Corpus (AAAI 2004 Spring) Veselin Stoyanov Claire Cardie Diane.
1 Measuring the Semantic Similarity of Texts Author : Courtney Corley and Rada Mihalcea Source : ACL-2005 Reporter : Yong-Xiang Chen.
Easy Teaching Tasks/Activities for EFL Low Achievers Joe Yi-ming Lee Taipei Municipal Zhongzheng Senior High School.
1 Reversible and lossless data hiding in the integer wavelet transform domain (Review) Authors: S. Yousefi, H. R. Rabiee, E. Yousefi, and M. Ghanbari Speaker:
Identifying Expressions of Opinion in Context Eric Breck and Yejin Choi and Claire Cardie IJCAI 2007.
Improving Automated Patent Claim Parsing:
Presentation transcript:

Identifying Opinion Holders for Question Answering in Opinion Texts Soo-Min Kim and Eduard Hovy Information Sciences Institute University of Southern California 4676 Admiralty Way Marina del Rey, CA {skim, Advisor: Hsin-Hsi Chen Speaker: Yong-Sheng Lo Date: 2007/08/16 AAAI

Introduction 1/2 Question answering in opinion texts “ Who strongly believes in Y ” A system to recognize the holder of opinion Y Application Stock market predictors Earlier work (Kim and Hovy,2004) Focus on identifying opinion expressions within text 現在進一步要找出 opinion holder Example 小叮噹認為銅鑼燒很好吃 Opinion holder :小叮噹 Opinion expression :認為 Opinion :銅鑼燒很好吃

Introduction 2/2 Define the opinion holder as an entity who expresses explicitly or implicitly the opinion contained in a sentence Entity =(person, country, organization, or special group of people) 一個 opinion expression 對應一個 holder “ A think B ’ s criticism of T is wrong ” B is the holder of “ the criticism of T ” A is the person who has an opinion that B ’ s criticism is wrong

辨別 opinion holder 的困難點 1. The opinion sentence contains more than one likely holder entity “ Russia ’ s defense minister said Sunday that his country disagrees with the U.S. view of Iraq, Iran and North Korea as an ‘ axis of evil ’”. The candidate holders : “ Russia ”, “ Russia ’ s defense minister ”, “ U.S. ”, “ Iraq ”, “ Iran ”, “ North Korea ” 2. There is more than one opinion in a sentence “ In relation to Bush ’ s axis of evil remarks, the German Foreign Minister also said, Allies are not satellites, and the French Foreign Minister caustically criticized that the United States ’ unilateral, simplistic worldview poses a new threat to the world ”.

本文提的解法 Automatic method for identifying opinion holders (OH) 1. Identify all possible opinion holder entities in a sentence 使用現有工具找出句子中的 Name entities 和 Noun phrases 2. Apply the Maximum Entropy (ME) ranking algorithm to select the most probable entity

System architecture

Holder candidate set Named entities (NE) Using BBN ’ s named entity tagger IdentiFinder Noun phrases (NP) Using Charniak ’ s parser For example

Maximum Entropy ranking algorithm A machine learning approach Maximum Entropy modeling Classification Select many candidates as answers as long as they are marked as true and does not select any candidate if every one is marked as false Poor performance Ranking Select the most probable candidate as an answer To maximize a given conditional probability distribution

Training data MPQA corpus (Wiebe et al., 2003) 535 documents (10657 sentences) 以下是標記者的標記例子: 只選意見強度 (Strength) 為 high or extreme 的句子 Opinion Holder

Training 流程

Feature selection for ME 1. Full parsing features (f2,f3,f4,f6) 2. Partial parsing features (f7,f8,f9) 3. Others (f1,f5)

Full parsing features 1/5 Using charniak ’ s parser For example: China ’ s official Xinhua news agency Form MPQA accusing From Earlier work (Kim and Hovy,2004)

Full parsing features 2/5

Full parsing features 3/5 To express tree structure for ME training “ NP S VP S S VP VBG ” Data sparseness problem

Full parsing features 4/5 Solution: 分成三條 path(f2,f3,f4) For example “ NP H S HE VP E S E S E VP E VBG E ”

Full parsing features 5/5 f6: The top two levels below a child node of HEhead on the path toward Hhead For example P1 = “ NPH PPH NPH ” P2 = “ NPH NPH PPH VPH NPH PPH NPH ” P1 and P2 as the same because they share “ PPH NPH ” at the top

Partial parsing features Using CASS parser f7 : (vgp … ) f8 : (c … ) f9 : Yes or No

Other features Non-structural features f1 : Type of The type of the candidate, with values NP, PERSON, ORGANIZATION, and LOCATION This feature enables ME to determine the most probable one among them automatically f5 : The distance between and, counted in parse tree words

Answer selection for evaluation 1/2 Strict selection For example 標準答案:小叮噹 System :小叮噹 Lenient selection For example 標準答案: “ Michel Sidibe, Director of the Country and Regional Support Department of UNAIDS ” System : “ Michel Sidibe ” Accept candidates with priority 1 & 2 & 3

Answer selection for evaluation 2/2 Threshold 1 = 0.5 Allow a candidate as an answer in case half of the words in a holder appear in the candidate as well Threshold 2 = 4 The average number words in human annotated holders is 3.71

Experiments 1/3 961 pairs of (, ) 863 for training 98 for testing Baseline The system choose the closest candidate to the expression as a holder without ME decision

Experiments 2/3

Experiments 3/3

Conclusions The importance of opinion holder identification was noticed yet it has not been much studied to date, partly because of the lack of annotated data. Using Maximum Entropy ranking to select the most probable holder among multiple candidates. Adopting parsing features significantly improved system performance.