Going Beyond Simple Question Answering Bahareh Sarrafzadeh CS 886 – Spring 2015.

Slides:



Advertisements
Similar presentations
SEARCHING QUESTION AND ANSWER ARCHIVES Dr. Jiwoon Jeon Presented by CHARANYA VENKATESH KUMAR.
Advertisements

UCLA : GSE&IS : Department of Information StudiesJF : 276lec1.ppt : 5/2/2015 : 1 I N F S I N F O R M A T I O N R E T R I E V A L S Y S T E M S Week.
Web Mining Research: A Survey Authors: Raymond Kosala & Hendrik Blockeel Presenter: Ryan Patterson April 23rd 2014 CS332 Data Mining pg 01.
 Andisheh Keikha Ryerson University Ebrahim Bagheri Ryerson University May 7 th
Dialogue – Driven Intranet Search Suma Adindla School of Computer Science & Electronic Engineering 8th LANGUAGE & COMPUTATION DAY 2009.
Search Engines and Information Retrieval
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
WebMiningResearch ASurvey Web Mining Research: A Survey Raymond Kosala and Hendrik Blockeel ACM SIGKDD, July 2000 Presented by Shan Huang, 4/24/2007.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Web Mining Research: A Survey
Queensland University of Technology An Ontology-based Mining Approach for User Search Intent Discovery Yan Shen, Yuefeng Li, Yue Xu, Renato Iannella, Abdulmohsen.
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
Information Retrieval
Data Mining – Intro.
REFLECTIONS ON NOTECARDS: SEVEN ISSUES FOR THE NEXT GENERATION OF HYPERMEDIA FRANK G. HALASZ.
Query session guided multi- document summarization THESIS PRESENTATION BY TAL BAUMEL ADVISOR: PROF. MICHAEL ELHADAD.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Result presentation. Search Interface Input and output functionality – helping the user to formulate complex queries – presenting the results in an intelligent.
Integrative Model Designed to help students develop a deep understanding of organized bodies of knowledge while simultaneously developing critical thinking.
Rui Yan, Yan Zhang Peking University
Query Relevance Feedback and Ontologies How to Make Queries Better.
CSC 9010 Spring Paula Matuszek A Brief Overview of Watson.
Search Engines and Information Retrieval Chapter 1.
Guided Interactive Discovery of e-Government Services Giovanni Maria Sacco Dipartimento di Informatica, Università di Torino Corso Svizzera 185,
University of Dublin Trinity College Localisation and Personalisation: Dynamic Retrieval & Adaptation of Multi-lingual Multimedia Content Prof Vincent.
Author: William Tunstall-Pedoe Presenter: Bahareh Sarrafzadeh CS 886 Spring 2015.
Information retrieval wed sept data…. -start at 6.45.
Information Retrieval and Knowledge Organisation Knut Hinkelmann.
Question Answering.  Goal  Automatically answer questions submitted by humans in a natural language form  Approaches  Rely on techniques from diverse.
INTRODUCTION TO RESEARCH. Learning to become a researcher By the time you get to college, you will be expected to advance from: Information retrieval–
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Information Retrieval Evaluation and the Retrieval Process.
CORPORUM-OntoExtract Ontology Extraction Tool Author: Robert Engels Company: CognIT a.s.
Mining Structured vs. Unstructured Data Where is the structure and where did the semantics go? Rahim Yaseen SAP Labs LLC.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.
For: CS590 Intelligent Systems Related Subject Areas: Artificial Intelligence, Graphs, Epistemology, Knowledge Management and Information Filtering Application.
The Structure of Information Retrieval Systems LBSC 708A/CMSC 838L Douglas W. Oard and Philip Resnik Session 1: September 4, 2001.
Algorithmic Detection of Semantic Similarity WWW 2005.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Working Memory and Learning Underlying Website Structure
Automatic Question Answering  Introduction  Factoid Based Question Answering.
Jane Reid, AMSc IRIC, QMUL, 30/10/01 1 Information seeking Information-seeking models Search strategies Search tactics.
Date: 2013/10/23 Author: Salvatore Oriando, Francesco Pizzolon, Gabriele Tolomei Source: WWW’13 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang SEED:A Framework.
1 Information Retrieval LECTURE 1 : Introduction.
Information Retrieval
HITIQA: Scenario Based Question Answering Tomek Strzalkowski, et al The State University of New York at Albany Paul Kantor, et al Rutgers University Boris.
Information Retrieval Transfer Cycle Dania Bilal IS 530 Fall 2007.
Relevance Models and Answer Granularity for Question Answering W. Bruce Croft and James Allan CIIR University of Massachusetts, Amherst.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Strategies for Advanced Question Answering Sanda Harabagiu & Finley Lacatusu Language Computer Corporation HLT-NAACL2004 Workshop.
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
IS 530: Information Access & Retrieval Class Activity Examples Fall 2012 Dr. Andrea Baer.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Information Seeking from the User’s Perspective CMPT 455/826 - Week 11, Day 1 (Based on Kuhlthau) 1.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
INFORMATION RETRIEVAL MEASUREMENT OF RELEVANCE EFFECTIVENESS 1Adrienn Skrop.
Big Data Quality the next semantic challenge
Lecture 19: Question Answering
Web IR: Recent Trends; Future of Web Search
Clustering Semantically Enhanced Web Search Results
Exploratory Search Beyond the Query–Response Paradigm
CSE 635 Multimedia Information Retrieval
Introduction to Information Retrieval
Lecture 8 Information Retrieval Introduction
Search Engine Architecture
Information Retrieval and Web Design
Presentation transcript:

Going Beyond Simple Question Answering Bahareh Sarrafzadeh CS 886 – Spring 2015

Current Challenges Information Overload Lack of Interaction Lack of Overviews 2

Current Efforts Ranking Methods Ordering retrieved documents Novelty Diversity Going Beyond Documents QA Summarization IE 3

Motivation Web Search and Increasing Expectations Largest source of information Satisfies different types of information needs Search engines facilitated finding information But … the expectations exceeds the capabilities

5

Motivation – Contd. Search Engines do NOT fully support Information need is not well defined; The nature of Search is Complex; It involves Learning; … 6

Simple v.s. Complex People’s day-to-day search activities can vary greatly motivations, objectives, and outcomes. These search activities can be broadly classified into two groups: “Simple” and “Complex”. Simple search tasks are similar to “known-item” search tasks involve looking up some discrete, well-structured information object; e.g., numbers, names and facts; Complex search tasks, are seen to be more exploratory involve investigating, learning and synthesis of information 7

Supporting Simple Questions IR-based approaches QUESTION PROCESSING Detect question type, answer type, focus, relations Formulate queries to send to a search engine PASSAGE RETRIEVAL Retrieve ranked documents Break into suitable passages and re-rank ANSWER PROCESSING Extract candidate answers Rank candidates using evidence from the text and external sources Knowledge-based and Hybrid approaches Build a semantic representation of the query Times, dates, locations, entities, numeric quantities Map from this semantics to query structured data or resources Geospatial databases Ontologies (Wikipedia infoboxes, dbPedia, WordNet, Yago) 8

Supporting Complex Search Finding relevant information Making sense of the information Offloading and Organizing the gathered information 9

How Complex Questions are Different? Seek multiple different types of information simultaneously One single answer will not suffice General (rather than specific) Open-ended Target multiple items/documents Involve Uncertainty Motivated by ill-defined problems Dynamic and evolve over time Multi-faceted 10

Example “ How accurate are HIV tests? ” answer: a number or a range “ What are the causes of AIDS? ” wider focus; less well-defined information need; undefined informational goal; 11

Why Complex QA is more Challenging? The Information Need is often Subjective and Ill- defined; The answers do not necessarily fall into predictable semantic classes; They may need some intellectual and cognitive development at the user side; It’s not always possible to formulate a Query 12

Automatic Complex QA Returning unstructured lists of candidate answers e.g. “ What effect does steroid use have on athletes’ performance? ” {“ steroid helps boost athletic performance by improving muscle mass ”, “ steroids can cause many harmful effects ”} Returning paragraph-length answers responsive; relevant; coherent; e.g. “ Describe steps taken and worldwide reaction prior to the introduction of Euro on January 1, Include predictions and expectations reported in the press. ” “ Despite skepticism about the actual realization of a single European currency as scheduled on January 1, 1999, preparations for the design of the Euro note have already begun. Europe’s new currency, the euro, will rival the U.S. dollar as an international currency over the long term, Der Spiegel magazine reported Sunday. ” 13

A Simple Paradigm 1. Question Decomposition 2. Factoid QA Techniques 3. Multi-Document Summarization (MSD) 14

Example 15 How have thefts impacted on the safety of Russia’s nuclear navy, and has the theft problem been increased or reduced over time? Need of Domain Knowledge Question Decomposition To what degree do different thefts put nuclear or radioactive materials at risk? Definition questions: What is meant by nuclear navy? What does ‘impact’ mean? How does one define the increase or decrease of a problem? Factoid questions: What is the number of thefts that are likely to be reported? What sort of items have been stolen? Alternative questions: What is meant by Russia? Only Russia, or also former Soviet facilities in non-Russian republics? Definition questions: What is meant by nuclear navy? What does ‘impact’ mean? How does one define the increase or decrease of a problem? Factoid questions: What is the number of thefts that are likely to be reported? What sort of items have been stolen? Alternative questions: What is meant by Russia? Only Russia, or also former Soviet facilities in non-Russian republics?

MDS and Complex QA Sentence Retrieval Translation Models Highly Frequent Terms Graph-based Random Walks … Sentence Ranking Relevance Informativeness / Interestingness Novelty Diversity … Sentence Ordering Coherence Cohesion Sentence Similarity Clustering Textual Entailment Sentence Alignment 16

Rather than seeking an exact answer, a user might be interested in related information; It may not be possible to formulate the information need as a question; Information is gathered in bits and pieces; The user has to assimilate this new Knowledge into his existing one; “Intellectual Development” Is the Perfect QA System enough? 17 It’s about the Journey; not the Destination!

Supporting Complex Search Finding relevant information Making sense of the information Offloading and Organizing the gathered information 18

Supporting the Exploration 19

20

Thank You! 21