Question Answer System Deliverable #2

Slides:



Advertisements
Similar presentations
1 Opinion Summarization Using Entity Features and Probabilistic Sentence Coherence Optimization (UIUC at TAC 2008 Opinion Summarization Pilot) Nov 19,
Advertisements

Search in Source Code Based on Identifying Popular Fragments Eduard Kuric and Mária Bieliková Faculty of Informatics and Information.
Chapter 5: Introduction to Information Retrieval
Elliot Holt Kelly Peterson. D4 – Smells Like D3 Primary Goal – improve D3 MAP with lessons learned After many experiments: TREC 2004 MAP = >
Group 3 Chad Mills Esad Suskic Wee Teck Tan. Outline  System and Data  Document Retrieval  Passage Retrieval  Results  Conclusion.
Lecture 11 Search, Corpora Characteristics, & Lucene Introduction.
Answer Extraction Ling573 NLP Systems and Applications May 19, 2011.
Information Retrieval in Practice
 How many pages does it search?  How does it access all those pages?  How does it give us an answer so quickly?  How does it give us such accurate.
Personalizing Search via Automated Analysis of Interests and Activities Jaime Teevan Susan T.Dumains Eric Horvitz MIT,CSAILMicrosoft Researcher Microsoft.
’ strict ’ strict ’ strict ’ lenient ‘ lenient ‘ lenient
Information Retrieval Ling573 NLP Systems and Applications April 26, 2011.
Ang Sun Ralph Grishman Wei Xu Bonan Min November 15, 2011 TAC 2011 Workshop Gaithersburg, Maryland USA.
Creating Concept Hierarchies in a Customer Self-Help System Bob Wall CS /29/05.
Information Retrieval and Question-Answering
1 Question-Answering via the Web: the AskMSR System Note: these viewgraphs were originally developed by Professor Nick Kushmerick, University College Dublin,
Chapter 5: Information Retrieval and Web Search
Overview of Search Engines
Improving Software Package Search Quality Dan Fingal and Jamie Nicolson.
CS344: Introduction to Artificial Intelligence Vishal Vachhani M.Tech, CSE Lecture 34-35: CLIR and Ranking in IR.
CSC 9010 Spring Paula Matuszek A Brief Overview of Watson.
A Technical Seminar on Question Answering SHRI RAMDEOBABA COLLEGE OF ENGINEERING & MANAGEMENT Presented By: Rohini Kamdi Guided By: Dr. A.J.Agrawal.
AnswerBus Question Answering System Zhiping Zheng School of Information, University of Michigan HLT 2002.
Chapter 2 Architecture of a Search Engine. Search Engine Architecture n A software architecture consists of software components, the interfaces provided.
Question Answering.  Goal  Automatically answer questions submitted by humans in a natural language form  Approaches  Rely on techniques from diverse.
Question Answering From Zero to Hero Elena Eneva 11 Oct 2001 Advanced IR Seminar.
A search-based Chinese Word Segmentation Method ——WWW 2007 Xin-Jing Wang: IBM China Wen Liu: Huazhong Univ. China Yong Qin: IBM China.
Querying Structured Text in an XML Database By Xuemei Luo.
Presenter: Lung-Hao Lee ( 李龍豪 ) January 7, 309.
Day 14 Information Retrieval Question Answering 1.
Term Frequency. Term frequency Two factors: – A term that appears just once in a document is probably not as significant as a term that appears a number.
INTERESTING NUGGETS AND THEIR IMPACT ON DEFINITIONAL QUESTION ANSWERING Kian-Wei Kor, Tat-Seng Chua Department of Computer Science School of Computing.
Chapter 6: Information Retrieval and Web Search
Search Engines. Search Strategies Define the search topic(s) and break it down into its component parts What terms, words or phrases do you use to describe.
Dataware’s Document Clustering and Query-By-Example Toolkits John Munson Dataware Technologies 1999 BRS User Group Conference.
NTCIR /21 ASQA: Academia Sinica Question Answering System for CLQA (IASL) Cheng-Wei Lee, Cheng-Wei Shih, Min-Yuh Day, Tzong-Han Tsai, Tian-Jian Jiang,
Alexey Kolosoff, Michael Bogatyrev 1 Tula State University Faculty of Cybernetics Laboratory of Information Systems.
 Examine two basic sources for implicit relevance feedback on the segment level for search personalization. Eye tracking Display time.
By: Namrata Lele Mentors: Dave Vieglais Bruce Wilson 1 VDC/TWG Meeting August 09.
Lucene. Lucene A open source set of Java Classses ◦ Search Engine/Document Classifier/Indexer 
LING 573 Deliverable 3 Jonggun Park Haotian He Maria Antoniak Ron Lockwood.
August 17, 2005Question Answering Passage Retrieval Using Dependency Parsing 1/28 Question Answering Passage Retrieval Using Dependency Parsing Hang Cui.
ITCS 6265 Details on Project & Paper Presentation.
Comparing Document Segmentation for Passage Retrieval in Question Answering Jorg Tiedemann University of Groningen presented by: Moy’awiah Al-Shannaq
Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.
 General domain question answering system.  The starting point was the architecture described in Brill, Eric. ‘Processing Natural Language without Natural.
Shallow & Deep QA Systems Ling 573 NLP Systems and Applications April 9, 2013.
Consumer Health Question Answering Systems Rohit Chandra Sourabh Singh
CS520 Web Programming Full Text Search Chengyu Sun California State University, Los Angeles.
1 Predicting Answer Location Using Shallow Semantic Analogical Reasoning in a Factoid Question Answering System Hapnes Toba, Mirna Adriani, and Ruli Manurung.
A Trainable Multi-factored QA System Radu Ion, Dan Ştefănescu, Alexandru Ceauşu, Dan Tufiş, Elena Irimia, Verginica Barbu-Mititelu Research Institute for.
Information Retrieval in Practice
CSCE 590 Web Scraping – Information Extraction II
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Search Engine Architecture
Query Reformulation & Answer Extraction
Clustering of Web pages
Ling573 NLP Systems and Applications May 16, 2013
Aidan Hogan CC Procesamiento Masivo de Datos Otoño 2017 Lecture 7: Information Retrieval II Aidan Hogan
Aidan Hogan CC Procesamiento Masivo de Datos Otoño 2018 Lecture 7 Information Retrieval: Ranking Aidan Hogan
Web IR: Recent Trends; Future of Web Search
Applying Key Phrase Extraction to aid Invalidity Search
Information Retrieval
ABDULLAH ALOTAYQ, DONG WANG, ED PHAM PROJECT BY:
Introduction to Search Engines
Discussion Class 7 Lucene.
Searching and browsing through fragments of TED Talks
CMU Y2 Rosetta GnG Distillation
Information Retrieval and Web Design
Introduction to Search Engines
Presentation transcript:

Question Answer System Deliverable #2 Jonggun Park Haotian He Maria Antoniak Ron Lockwood

System architecture Two modules: Indexing Querying query processing passage retrieval answer processing/ranking

Document Indexing/Retrieval Apache Lucene Two indices Full text (used for idf calculations) Paragraphs (used for scoring results)

Why did Chuck Norris uppercut a horse? Query processing Why did Chuck Norris uppercut a horse? chuck, norris, uppercut, horse Search

Query processing + POS + NER + Chunking + Stemming

Chuck Norris uppercut a horse to make a giraffe

Answer Extraction/Processing Initial solution is a redundancy-based strategy With one big difference Instead of using web queries for snippets We are using results (top 100) from a Lucene query

Answer Extraction Details Input to the Extraction Engine Query word list Stop-word list Focus-word list (e.g. meters, liters, miles, etc.) Passage list – the paragraph results of the query N-gram generation and occurrence counting Filtering out stop words and query words Combining unigram counts with n-gram counts Weighting candidates with idf scores Verifying candidates in documents Lin, J. 2007. An exploration of the principles underlying redundancy-based factoid question answering. Penn Plaza, Suite 701, New York, NY.

D2 Results Strict = 0.01 Lenient = 0.064 Low results… but improvements are coming!

Future work NER Web boosting Query/answer classification

Thank you! Questions?