Question Answering Passage Retrieval Using Dependency Relations (SIGIR 2005) (National University of Singapore) Hang Cui, Renxu Sun, Keya Li, Min-Yen Kan,

Slides:



Advertisements
Similar presentations
Introduction to Information Retrieval
Advertisements

Statistical Machine Translation Part II: Word Alignments and EM Alexander Fraser ICL, U. Heidelberg CIS, LMU München Statistical Machine Translation.
Improved TF-IDF Ranker
QA-LaSIE Components The question document and each candidate answer document pass through all nine components of the QA-LaSIE system in the order shown.
Statistical Machine Translation Part II – Word Alignments and EM Alex Fraser Institute for Natural Language Processing University of Stuttgart
SEARCHING QUESTION AND ANSWER ARCHIVES Dr. Jiwoon Jeon Presented by CHARANYA VENKATESH KUMAR.
DOMAIN DEPENDENT QUERY REFORMULATION FOR WEB SEARCH Date : 2013/06/17 Author : Van Dang, Giridhar Kumaran, Adam Troy Source : CIKM’12 Advisor : Dr. Jia-Ling.
A Maximum Coherence Model for Dictionary-based Cross-language Information Retrieval Yi Liu, Rong Jin, Joyce Y. Chai Dept. of Computer Science and Engineering.
Passage Retrieval & Re-ranking Ling573 NLP Systems and Applications May 5, 2011.
Information Retrieval Ling573 NLP Systems and Applications April 26, 2011.
Query Operations: Automatic Local Analysis. Introduction Difficulty of formulating user queries –Insufficient knowledge of the collection –Insufficient.
Heuristic alignment algorithms and cost matrices
A Markov Random Field Model for Term Dependencies Donald Metzler and W. Bruce Croft University of Massachusetts, Amherst Center for Intelligent Information.
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) IR Queries.
The Informative Role of WordNet in Open-Domain Question Answering Marius Paşca and Sanda M. Harabagiu (NAACL 2001) Presented by Shauna Eggers CS 620 February.
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
A Web-based Question Answering System Yu-shan & Wenxiu
Evaluating the Contribution of EuroWordNet and Word Sense Disambiguation to Cross-Language Information Retrieval Paul Clough 1 and Mark Stevenson 2 Department.
Title Extraction from Bodies of HTML Documents and its Application to Web Page Retrieval Microsoft Research Asia Yunhua Hu, Guomao Xin, Ruihua Song, Guoping.
Learning Information Extraction Patterns Using WordNet Mark Stevenson and Mark A. Greenwood Natural Language Processing Group University of Sheffield,
Query Rewriting Using Monolingual Statistical Machine Translation Stefan Riezler Yi Liu Google 2010 Association for Computational Linguistics.
Probabilistic Model for Definitional Question Answering Kyoung-Soo Han, Young-In Song, and Hae-Chang Rim Korea University SIGIR 2006.
Hang Cui et al. NUS at TREC-13 QA Main Task 1/20 National University of Singapore at the TREC- 13 Question Answering Main Task Hang Cui Keya Li Renxu Sun.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
1 Cross-Lingual Query Suggestion Using Query Logs of Different Languages SIGIR 07.
Concept Unification of Terms in Different Languages for IR Qing Li, Sung-Hyon Myaeng (1), Yun Jin (2),Bo-yeong Kang (3) (1) Information & Communications.
AnswerBus Question Answering System Zhiping Zheng School of Information, University of Michigan HLT 2002.
A Markov Random Field Model for Term Dependencies Donald Metzler W. Bruce Croft Present by Chia-Hao Lee.
A Probabilistic Graphical Model for Joint Answer Ranking in Question Answering Jeongwoo Ko, Luo Si, Eric Nyberg (SIGIR ’ 07) Speaker: Cho, Chin Wei Advisor:
April 14, 2003Hang Cui, Ji-Rong Wen and Tat- Seng Chua 1 Hierarchical Indexing and Flexible Element Retrieval for Structured Document Hang Cui School of.
Structured Use of External Knowledge for Event-based Open Domain Question Answering Hui Yang, Tat-Seng Chua, Shuguang Wang, Chun-Keat Koh National University.
Retrieval Models for Question and Answer Archives Xiaobing Xue, Jiwoon Jeon, W. Bruce Croft Computer Science Department University of Massachusetts, Google,
QUALIFIER in TREC-12 QA Main Task Hui Yang, Hang Cui, Min-Yen Kan, Mstislav Maslennikov, Long Qiu, Tat-Seng Chua School of Computing National University.
21/11/2002 The Integration of Lexical Knowledge and External Resources for QA Hui YANG, Tat-Seng Chua Pris, School of Computing.
Weighting and Matching against Indices. Zipf’s Law In any corpus, such as the AIT, we can count how often each word occurs in the corpus as a whole =
INTERESTING NUGGETS AND THEIR IMPACT ON DEFINITIONAL QUESTION ANSWERING Kian-Wei Kor, Tat-Seng Chua Department of Computer Science School of Computing.
Adding Semantics to Clustering Hua Li, Dou Shen, Benyu Zhang, Zheng Chen, Qiang Yang Microsoft Research Asia, Beijing, P.R.China Department of Computer.
Using a Named Entity Tagger to Generalise Surface Matching Text Patterns for Question Answering Mark A. Greenwood and Robert Gaizauskas Natural Language.
A Semantic Approach to IE Pattern Induction Mark Stevenson and Mark A. Greenwood Natural Language Processing Group University of Sheffield, UK.
Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.
LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.
A Novel Pattern Learning Method for Open Domain Question Answering IJCNLP 2004 Yongping Du, Xuanjing Huang, Xin Li, Lide Wu.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. An information-pattern-based approach to novelty detection Presenter : Lin, Shu-Han Authors : Xiaoyan.
Iterative Translation Disambiguation for Cross Language Information Retrieval Christof Monz and Bonnie J. Dorr Institute for Advanced Computer Studies.
Department of Software and Computing Systems Research Group of Language Processing and Information Systems The DLSIUAES Team’s Participation in the TAC.
1 A Web Search Engine-Based Approach to Measure Semantic Similarity between Words Presenter: Guan-Yu Chen IEEE Trans. on Knowledge & Data Engineering,
A Word Clustering Approach for Language Model-based Sentence Retrieval in Question Answering Systems Saeedeh Momtazi, Dietrich Klakow University of Saarland,Germany.
Using a Named Entity Tagger to Generalise Surface Matching Text Patterns for Question Answering Mark A. Greenwood and Robert Gaizauskas Natural Language.
August 17, 2005Question Answering Passage Retrieval Using Dependency Parsing 1/28 Question Answering Passage Retrieval Using Dependency Parsing Hang Cui.
Automatic Question Answering  Introduction  Factoid Based Question Answering.
Multi-level Bootstrapping for Extracting Parallel Sentence from a Quasi-Comparable Corpus Pascale Fung and Percy Cheung Human Language Technology Center,
Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.
1 Evaluating High Accuracy Retrieval Techniques Chirag Shah,W. Bruce Croft Center for Intelligent Information Retrieval Department of Computer Science.
Answer Mining by Combining Extraction Techniques with Abductive Reasoning Sanda Harabagiu, Dan Moldovan, Christine Clark, Mitchell Bowden, Jown Williams.
Context-Aware Query Classification Huanhuan Cao, Derek Hao Hu, Dou Shen, Daxin Jiang, Jian-Tao Sun, Enhong Chen, Qiang Yang Microsoft Research Asia SIGIR.
Finding document topics for improving topic segmentation Source: ACL2007 Authors: Olivier Ferret (18 route du Panorama, BP6) Reporter:Yong-Xiang Chen.
Discovering Relations among Named Entities from Large Corpora Takaaki Hasegawa *, Satoshi Sekine 1, Ralph Grishman 1 ACL 2004 * Cyberspace Laboratories.
Natural Language Generation with Tree Conditional Random Fields Wei Lu, Hwee Tou Ng, Wee Sun Lee Singapore-MIT Alliance National University of Singapore.
Event-Based Extractive Summarization E. Filatova and V. Hatzivassiloglou Department of Computer Science Columbia University (ACL 2004)
Learning in a Pairwise Term-Term Proximity Framework for Information Retrieval Ronan Cummins, Colm O’Riordan Digital Enterprise Research Institute SIGIR.
Statistical Machine Translation Part II: Word Alignments and EM Alex Fraser Institute for Natural Language Processing University of Stuttgart
(Pseudo)-Relevance Feedback & Passage Retrieval Ling573 NLP Systems & Applications April 28, 2011.
Learning to Rank: From Pairwise Approach to Listwise Approach Authors: Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, and Hang Li Presenter: Davidson Date:
ENHANCING CLUSTER LABELING USING WIKIPEDIA David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab SIGIR’09.
Feature Assignment LBSC 878 February 22, 1999 Douglas W. Oard and Dagobert Soergel.
Automatic Question Answering Beyond the Factoid Radu Soricut Information Sciences Institute University of Southern California Eric Brill Microsoft Research.
CS791 - Technologies of Google Spring A Web­based Kernel Function for Measuring the Similarity of Short Text Snippets By Mehran Sahami, Timothy.
LEARNING IN A PAIRWISE TERM-TERM PROXIMITY FRAMEWORK FOR INFORMATION RETRIEVAL Ronan Cummins, Colm O’Riordan (SIGIR’09) Speaker : Yi-Ling Tai Date : 2010/03/15.
Statistical Machine Translation Part II: Word Alignments and EM
Compact Query Term Selection Using Topically Related Text
Presentation transcript:

Question Answering Passage Retrieval Using Dependency Relations (SIGIR 2005) (National University of Singapore) Hang Cui, Renxu Sun, Keya Li, Min-Yen Kan, Tat-Seng Chua

2 Abstract Problems: Relationships among Q terms are not considered. Strict matching is used, but semantically equivalent relationships are phrased differently. Solution: fuzzy relation matching based on statistical models. Mutual information Expectation maximization Results: Significantly outperforms by up to 78% in MRR. Brings a 50% improvement enhanced by query expansion.

3 Introduction (1/3) QA steps: Locating the relevant documents. Retrieving passages that may contain the A. Pinpointing the exact A from candidate passages. Passage retrieval greatly affects the performance of a QA system. Too many irrelevant passages  fail to pinpoint the correct A due to too much noise. Passages provide sufficient context to understand the A.

4 Introduction (2/3) Importance of relationships: Neglecting crucial relations b/w words is a major source of false positives for current lexical matching based retrieval techniques. Many irrelevant passages share the same question terms w/ correct ones, but different relationships. Example

5 Introduction (3/3) Fuzzy relation matching method: Examines grammatical dependency relations b/w Q terms. Minipar: dependency parser. Statistical technique: measuring the degree of match of candidate sentences and Q. Fuzzy matching: b/c the same relationship is often phrased differently of Q&A. Training: MI & EM.

6 Related Work Passage retrieval: Count the # of matched Q terms. Consider the distance b/w Q terms in passages.  Dependency relations exist b/w words. Approximate such relations statistically: Using bigrams.  Only b/w adjacent words. Select answers: Strict matching.  Words often differ b/w Q and A sentences.

7 Fuzzy Relation Matching for Passage Retrieval Extracting and pairing relation paths. Present how relation paths are extracted and paired from parse trees. Measuring path matching score. A variation of IBM translation model. Model training: learn the mapping scores b/w relations in Q and potential A sentences. MI: capture the bipartite co-occurrences of 2 relations. EM: iterative training process.

8 1. Extracting and Pairing Relation Paths Dependency trees: generated by Minipar. Dependency trees Node: a word / a chunked phrase. Link: relation Ignore the directions of relations. (roles can change in Q & A) Paths: 2 constraints Paths The path length cannot exceed a pre-defined threshold (7). Minipar only resolves nearby dependencies reliably. Ignore relation paths b/w 2 words if they belong to the same chunk. “ 28 percent ”, “ New York ”

9 2. Measuring Path Matching Score Measure the matching score of the paths extracted from the sentence according to those from the question. Matching their nodes at both ends. Extending IBM statistical translation model 1. Matching score of a relation path from a candidate sentence = the probability of translating to it from it ’ s corresponding path in the Q. Sum up the matching scores of each path from the sentence which has a corresponding path in the Q.

10 3. Model Training Relation translation probability 2 methods (MI & EM). Mutual information Measured by their bipartite co-occurrences. Expectation maximization: Employ GIZA, a publicly available statistical translation package. Perform an iterative training process using EM to learn pair-wise translation probabilities.

11 Evaluations (1/5) Experiment setup: Training data: 10,255 factoid QA pairs from TREC- 8 and 9 QA tasks. Test data: TREC-12. (324 QA pairs) 4 systems for comparison: MITRE (baseline) Strict Matching of Relations SiteQ NUS 3 performance metrics: MRR. % of Q that have NO correct A. Precision at the top one passage.

12 Evaluations (2/5) Correct: Must be matched by the exact answer pattern. Must have a cosine similarity equal to or above.75 w/ any standard answer passage. Evaluation observations: Overall performance. Performance variation to Q length. Error Analysis for Relation Matching. Performance w/ Query Expansion.

13 Evaluations (3/5) Overall performance of relation matching technique compared to other passage retrieval systems. Overall performance Relation matching HELPS. Fuzzy outperforms strict matching. MI ≈ EM Performance variation to Q length: longer Qs are likely to benefit more from relation matching than shorter Q. Performance variation to Q length

14 Evaluations (4/5) Error Analysis for Relation Matching: Mismatch of Q terms: paths are incorrectly paired due to the mismatch of Q terms. Q: “ In which city is the River Seine? ” A: “ Paris. ” Paraphrasing b/w Q and A sentences: some correct sentences are paraphrases of the given Q. Q: “ What company manufactures X? ” A: “… C, the manufacturer of X …” Paraphrase: “ C is the manufacturer of X  C manufactures X ”

15 Evaluations (5/5) Performance w/ Query Expansion (QE): Short Q and paraphrases are obstacles in enhancing performance using relation matching. Submit Q to Google and select expansion terms based on their co-occurrences w/ Q terms in result snippets. External expansion terms do not have relation paths w/ the original Q terms in the Q.

16 Future Work Future work: Larger amount of training data. (scalability of MI & EM) Larger amount of test data. (effect of # of relation paths on matching) Question analysis and typing. Seamless integration of query expansion w/ relation matching. (take advantage of expanded terms in relation matching)

17 Conclusion Present a novel fuzzy relation matching technique (based on the degree of match b/w relation paths in candidate sentences and Q) for factoid QA passage retrieval. Produce significant improvements in retrieval performance in current systems: a vast 50~138% improvement in MRR, and over 95% in precision at top one passage.

18 ThanQ

19 Lexical Matching

20 Dependency Tree

21 Relation Paths

22 Note: Measuring Path Matching Score (1/2) Translation probability Prob(P S | P Q ): the sum over all possible alignments. P Q : corresponding paths from question Q. (length: m) P S : corresponding paths from sentence S. (length: n) Rel i (S) : ith relation in path P S. Relα i (Q) : corresponding relation in path P Q. P t (Rel i (S) |Relα i (Q) ): relation mapping scores.

23 Note: Measuring Path Matching Score (2/2) Consider only the most probable alignment: A i : the most probable alignment. Use only n for normalization. m is constant for the same Q.

24 Note: Model Training - MI P t (Rel i (S) |Relα i (Q) ): 2 methods (MI & EM) Mutual information: measured by their bipartite co- occurrences. : 1 when Rel i (S) & Rel j (Q) appear together in a training path pair, and 0 otherwise. γ: the inverse proportion of the sum of the lengths of the 2 paths. | Rel (Q) |: the # of paths extracted from all Q in which relation Rel occurs. | Rel (S) |: the # of paths extracted from all A sentences in which relation Rel occurs.

25 Overall Performance

26 Performance Variation to Q Length (Along w/ 95% error bars)

27 Query Expansion