Date: 2014/02/25 Author: Aliaksei Severyn, Massimo Nicosia, Aleessandro Moschitti Source: CIKM’13 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Building.

Slides:



Advertisements
Similar presentations
School of something FACULTY OF OTHER School of Computing FACULTY OF ENGINEERING Chunking: Shallow Parsing Eric Atwell, Language Research Group.
Advertisements

ThemeInformation Extraction for World Wide Web PaperUnsupervised Learning of Soft Patterns for Generating Definitions from Online News Author Cui, H.,
QA-LaSIE Components The question document and each candidate answer document pass through all nine components of the QA-LaSIE system in the order shown.
Proceedings of the Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2007) Learning for Semantic Parsing Advisor: Hsin-His.
Time-sensitive Personalized Query Auto-Completion
DOMAIN DEPENDENT QUERY REFORMULATION FOR WEB SEARCH Date : 2013/06/17 Author : Van Dang, Giridhar Kumaran, Adam Troy Source : CIKM’12 Advisor : Dr. Jia-Ling.
Author : Zhen Hai, Kuiyu Chang, Gao Cong Source : CIKM’12 Speaker : Wei Chang Advisor : Prof. Jia-Ling Koh ONE SEED TO FIND THEM ALL: MINING OPINION FEATURES.
Recognizing Implicit Discourse Relations in the Penn Discourse Treebank Ziheng Lin, Min-Yen Kan, and Hwee Tou Ng Department of Computer Science National.
Mining Query Subtopics from Search Log Data Date : 2012/12/06 Resource : SIGIR’12 Advisor : Dr. Jia-Ling Koh Speaker : I-Chih Chiu.
47 th Annual Meeting of the Association for Computational Linguistics and 4 th International Joint Conference on Natural Language Processing Of the AFNLP.
Passage Retrieval & Re-ranking Ling573 NLP Systems and Applications May 5, 2011.
Shallow Processing: Summary Shallow Processing Techniques for NLP Ling570 December 7, 2011.
An SVM Based Voting Algorithm with Application to Parse Reranking Paper by Libin Shen and Aravind K. Joshi Presented by Amit Wolfenfeld.
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Learning Table Extraction from Examples Ashwin Tengli, Yiming Yang and Nian Li Ma School of Computer Science Carnegie Mellon University Coling 04.
Information Retrieval in Practice
Web Usage Mining with Semantic Analysis Date: 2013/12/18 Author: Laura Hollink, Peter Mika, Roi Blanco Source: WWW’13 Advisor: Jia-Ling Koh Speaker: Pei-Hao.
Leveraging Conceptual Lexicon : Query Disambiguation using Proximity Information for Patent Retrieval Date : 2013/10/30 Author : Parvaz Mahdabi, Shima.
1 A study on automatically extracted keywords in text categorization Authors:Anette Hulth and Be´ata B. Megyesi From:ACL 2006 Reporter: 陳永祥 Date:2007/10/16.
Authors: Ting Wang, Yaoyong Li, Kalina Bontcheva, Hamish Cunningham, Ji Wang Presented by: Khalifeh Al-Jadda Automatic Extraction of Hierarchical Relations.
Natural Language Processing Group Department of Computer Science University of Sheffield, UK Improving Semi-Supervised Acquisition of Relation Extraction.
Discriminative Syntactic Language Modeling for Speech Recognition Michael Collins, Brian Roark Murat, Saraclar MIT CSAIL, OGI/OHSU, Bogazici University.
Detecting Semantic Cloaking on the Web Baoning Wu and Brian D. Davison Lehigh University, USA WWW 2006.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
Ling 570 Day 17: Named Entity Recognition Chunking.
A search-based Chinese Word Segmentation Method ——WWW 2007 Xin-Jing Wang: IBM China Wen Liu: Huazhong Univ. China Yong Qin: IBM China.
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
A Probabilistic Graphical Model for Joint Answer Ranking in Question Answering Jeongwoo Ko, Luo Si, Eric Nyberg (SIGIR ’ 07) Speaker: Cho, Chin Wei Advisor:
1 Boosting-based parse re-ranking with subtree features Taku Kudo Jun Suzuki Hideki Isozaki NTT Communication Science Labs.
1 Learning Sub-structures of Document Semantic Graphs for Document Summarization 1 Jure Leskovec, 1 Marko Grobelnik, 2 Natasa Milic-Frayling 1 Jozef Stefan.
NTCIR /21 ASQA: Academia Sinica Question Answering System for CLQA (IASL) Cheng-Wei Lee, Cheng-Wei Shih, Min-Yuh Day, Tzong-Han Tsai, Tian-Jian Jiang,
Tuning Before Feedback: Combining Ranking Discovery and Blind Feedback for Robust Retrieval* Weiguo Fan, Ming Luo, Li Wang, Wensi Xi, and Edward A. Fox.
Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.
LOGO Finding High-Quality Content in Social Media Eugene Agichtein, Carlos Castillo, Debora Donato, Aristides Gionis and Gilad Mishne (WSDM 2008) Advisor.
Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.
Date : 2013/03/18 Author : Jeffrey Pound, Alexander K. Hudek, Ihab F. Ilyas, Grant Weddell Source : CIKM’12 Speaker : Er-Gang Liu Advisor : Prof. Jia-Ling.
Algorithmic Detection of Semantic Similarity WWW 2005.
TimeML compliant text analysis for Temporal Reasoning Branimir Boguraev and Rie Kubota Ando.
1/21 Automatic Discovery of Intentions in Text and its Application to Question Answering (ACL 2005 Student Research Workshop )
A Word Clustering Approach for Language Model-based Sentence Retrieval in Question Answering Systems Saeedeh Momtazi, Dietrich Klakow University of Saarland,Germany.
Date: 2013/10/23 Author: Salvatore Oriando, Francesco Pizzolon, Gabriele Tolomei Source: WWW’13 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang SEED:A Framework.
Liangjie Hong and Brian D. Davison Department of Computer Science and Engineering Lehigh University SIGIR 2009.
A Classification-based Approach to Question Answering in Discussion Boards Liangjie Hong, Brian D. Davison Lehigh University (SIGIR ’ 09) Speaker: Cho,
Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.
Support Vector Machines and Kernel Methods for Co-Reference Resolution 2007 Summer Workshop on Human Language Technology Center for Language and Speech.
1 Adaptive Subjective Triggers for Opinionated Document Retrieval (WSDM 09’) Kazuhiro Seki, Kuniaki Uehara Date: 11/02/09 Speaker: Hsu, Yu-Wen Advisor:
LINDEN : Linking Named Entities with Knowledge Base via Semantic Knowledge Date : 2013/03/25 Resource : WWW 2012 Advisor : Dr. Jia-Ling Koh Speaker : Wei.
Text segmentation Amany AlKhayat. Before any real processing is done, text needs to be segmented at least into linguistic units such as words, punctuation,
Using Wikipedia for Hierarchical Finer Categorization of Named Entities Aasish Pappu Language Technologies Institute Carnegie Mellon University PACLIC.
Question Classification using Support Vector Machine Dell Zhang National University of Singapore Wee Sun Lee National University of Singapore SIGIR2003.
CONTEXTUAL SEARCH AND NAME DISAMBIGUATION IN USING GRAPHS EINAT MINKOV, WILLIAM W. COHEN, ANDREW Y. NG SIGIR’06 Date: 2008/7/17 Advisor: Dr. Koh,
SEMANTIC VERIFICATION IN AN ONLINE FACT SEEKING ENVIRONMENT DMITRI ROUSSINOV, OZGUR TURETKEN Speaker: Li, HueiJyun Advisor: Koh, JiaLing Date: 2008/5/1.
Date: 2013/9/25 Author: Mikhail Ageev, Dmitry Lagun, Eugene Agichtein Source: SIGIR’13 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Improving Search Result.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
Short Text Similarity with Word Embedding Date: 2016/03/28 Author: Tom Kenter, Maarten de Rijke Source: CIKM’15 Advisor: Jia-Ling Koh Speaker: Chih-Hsuan.
QUERY-PERFORMANCE PREDICTION: SETTING THE EXPECTATIONS STRAIGHT Date : 2014/08/18 Author : Fiana Raiber, Oren Kurland Source : SIGIR’14 Advisor : Jia-ling.
DeepWalk: Online Learning of Social Representations
INAGO Project Automatic Knowledge Base Generation from Text for Interactive Question Answering.
Relation Extraction CSCI-GA.2591
Social Knowledge Mining
Clustering Algorithms for Noun Phrase Coreference Resolution
Kateryna Tymoshenko , Alessandro Moschitti University of Trento
Automatic Detection of Causal Relations for Question Answering
How to publish in a format that enhances literature-based discovery?
Intent-Aware Semantic Query Annotation
Enriching Taxonomies With Functional Domain Knowledge
Date: 2012/11/15 Author: Jin Young Kim, Kevyn Collins-Thompson,
Extracting Why Text Segment from Web Based on Grammar-gram
Presentation transcript:

Date: 2014/02/25 Author: Aliaksei Severyn, Massimo Nicosia, Aleessandro Moschitti Source: CIKM’13 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Building Structures from Classifiers for Passage Reranking

Outline Introduction Learning to rank with kernels Structural models of Q/A pairs Feature vector representation Experiment Conclusion 2

Introduction 3 Search engine What is Mark Twain‘s real name ? Samuel Langhorne Clemens, better known as Mark Twain. Mark Twain couldn't have put it any better. Roll over, Mark Twain, because Mark McGwire is on the scene. Mark Twain couldn't have put it any better. Samuel Langhorne Clemens, better known as Mark Twain.

Automated question answering(QA) is a complex task that often requires manual definition of rules and syntactic patterns to detect the relations between a question and its candidate answers Compute a textual similarity Machine learning Build a answer passage reranking model encodes powerful syntactic patterns relating q/a pairs Automatically extracted and learned the feature not require manual feature engineering Introduction 4

Outline Introduction Learning to rank with kernels Tree kernel Preference reranking with kernels Structural models of Q/A pairs Feature vector representation Experiment Conclusion 5

Kernel-based answer passage reranking system 6

Model q/a pairs as linguistic structure Automatically extracted and learn powerful syntactic patterns 7 What is Mark Twain‘s real name ? Samuel Langhorne Clemens, better known as Mark Twain.

Syntactic Tree Kernels (M. Collins, 2002) Partial Tree Kernel (A. Moschitti, 2006) A function that can be effectively applied to both constituency and dependency parse trees. Tree kernels 8

To enable the use of kernels for learning to rank with SVMs, use preference reranking (T. Joachims, 2002) A set of q/a pairs {a, b, c, d, e} a>b, a>c, b>c, c > e … Preference kernel Preference reranking with kernels 9

q/a pair:a, c K TK (Q a, Q c ) = 1.0, K TK (A a, A c ) = 0.7, K fvec (a, c) = 0.3 K(a, c) = = 2.0 Preference reranking with kernels 10

Outline Introduction Learning to rank with kernels Structural models of Q/A pairs Structural representations Relational tag using question focus and question category Feature vector representation Experiment Conclusion 11

Shallow tree structure This is a essentially a shallow syntactic structure built from part-of-speech tags grouped into chunks A special REL tag links the related structures Structural representation 12

Use the question category to link the focus word of a question with the name entities extracted form the candidate answer Six non overlapping classes abbreviations(ABBR) Descriptions(DESC) Entity(ENTY) Human(HUM) Location(LOC) Numeric(NUM) Relational tag using question focus and question category 13

Relational tag using question focus and question category 14

Outline Introduction Learning to rank with kernels Structural models of Q/A pairs Feature vector representation Experiment Conclusion 15

The total number of basic feature is 24 N-gram of part-of-speech NER relatedness LDA … Feature vector representation 16

Outline Introduction Learning to rank with kernels Structural models of Q/A pairs Feature vector representation Experiment Conclusion 17

Corpora Train data: TREC QA data 2002 and 2003 years (824 questions) Test data: TREC 13 Search engines TREC and Answerbag 50 candidate answers for each questions Experiments

Baseline BM25 CH (shallow tree) V (reranker model) Approach F ( semantic linking) TF ( adding a question category) Experiments

Corpora Train data: TREC 8-12 Test data: TREC 13 Advanced feature ( V adv ) Greedy String Tiling Resnik similarity … Experiments

Outline Introduction Learning to rank with kernels Structural models of Q/A pairs Feature vector representation Experiment Conclusion 21

Use of structural kernel technology to induce rich feature spaces form structural sematic representations of question and answer passage pairs. Provide adaptable question and focus classifiers. Conceptually simpler than previous system. Conclusion 22

23 MRR( Mean Reciprocal Rank) mean reciprocal rank = (1/3 + 1/2 + 1)/3 = 11/18