(Spoken) Dialogue and Information Retrieval Antoine Raux Dialogs on Dialogs Group 10/24/2003.

Slides:

Advertisements

Similar presentations

Non-Native Users in the Let s Go!! Spoken Dialogue System: Dealing with Linguistic Mismatch Antoine Raux & Maxine Eskenazi Language Technologies Institute.

Advertisements

Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.

SEARCHING QUESTION AND ANSWER ARCHIVES Dr. Jiwoon Jeon Presented by CHARANYA VENKATESH KUMAR.

Developing and Evaluating a Query Recommendation Feature to Assist Users with Online Information Seeking & Retrieval With graduate students: Karl Gyllstrom,

Information Retrieval: Human-Computer Interfaces and Information Access Process.

Information Retrieval in Practice

Search Engines and Information Retrieval

IR Challenges and Language Modeling. IR Achievements Search engines  Meta-search  Cross-lingual search  Factoid question answering  Filtering Statistical.

ADVISE: Advanced Digital Video Information Segmentation Engine

Information Retrieval February 24, 2004

Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.

Speech recognition, understanding and conversational interfaces Alexander Rudnicky School of Computer Science

INFO 624 Week 3 Retrieval System Evaluation

A Task Oriented Non- Interactive Evaluation Methodology for IR Systems By Jane Reid Alyssa Katz LIS 551 March 30, 2004.

FACT: A Learning Based Web Query Processing System Hongjun Lu, Yanlei Diao Hong Kong U. of Science & Technology Songting Chen, Zengping Tian Fudan University.

Information Retrieval: Human-Computer Interfaces and Information Access Process.

ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.

1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.

Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang National Central University

Web Search – Summer Term 2006 II. Information Retrieval (Basics Cont.) (c) Wolfgang Hürst, Albert-Ludwigs-University.

An Overview of Relevance Feedback, by Priyesh Sudra 1 An Overview of Relevance Feedback PRIYESH SUDRA.

Web Search – Summer Term 2006 II. Information Retrieval (Basics Cont.) (c) Wolfgang Hürst, Albert-Ludwigs-University.

Creating and Visualizing Document Classification J. Gelernter, D. Cao, R. Lu, E. Fink, J. Carbonell.

Overview of Search Engines

DIVINES – Speech Rec. and Intrinsic Variation W.S.May 20, 2006 Richard Rose DIVINES SRIV Workshop The Influence of Word Detection Variability on IR Performance.

Information Retrieval in Practice

AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.

Faculty of Informatics and Information Technologies Slovak University of Technology Personalized Navigation in the Semantic Web Michal Tvarožek Mentor:

Learner Modelling in a Multi-Agent System through Web Services Katerina Kabassi, Maria Virvou Department of Informatics, University of Piraeus.

Search Engines and Information Retrieval Chapter 1.

Evaluation Experiments and Experience from the Perspective of Interactive Information Retrieval Ross Wilkinson Mingfang Wu ICT Centre CSIRO, Australia.

Online Autonomous Citation Management for CiteSeer CSE598B Course Project By Huajing Li.

Spoken dialog for e-learning supported by domain ontologies Dario Bianchi, Monica Mordonini and Agostino Poggi Dipartimento di Ingegneria dell’Informazione.

Master Thesis Defense Jan Fiedler 04/17/98

A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.

Chapter 2 Architecture of a Search Engine. Search Engine Architecture n A software architecture consists of software components, the interfaces provided.

UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.

Topical Crawlers for Building Digital Library Collections Presenter: Qiaozhu Mei.

1 Information Retrieval Acknowledgements: Dr Mounia Lalmas (QMW) Dr Joemon Jose (Glasgow)

Information in the Digital Environment Information Seeking Models Dr. Dania Bilal IS 530 Spring 2006.

Dept. of Computer Science University of Rochester Rochester, NY By: James F. Allen, Donna K. Byron, Myroslava Dzikovska George Ferguson, Lucian Galescu,

1 Boostrapping language models for dialogue systems Karl Weilhammer, Matthew N Stuttle, Steve Young Presenter: Hsuan-Sheng Chiu.

Search Engine Architecture

Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.

Recuperação de Informação B Cap. 10: User Interfaces and Visualization , , 10.9 November 29, 1999.

Faculty of Informatics and Information Technologies Slovak University of Technology Personalized Navigation in the Semantic Web Michal Tvarožek Mentor:

Information in the Digital Environment Information Seeking Models Dr. Dania Bilal IS 530 Spring 2005.

Material from Authors of Human Computer Interaction Alan Dix, et al

Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.

Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.

Measuring How Good Your Search Engine Is. *. Information System Evaluation l Before 1993 evaluations were done using a few small, well-known corpora of.

Jane Reid, AMSc IRIC, QMUL, 30/10/01 1 Information seeking Information-seeking models Search strategies Search tactics.

Information Retrieval

For Friday Finish chapter 23 Homework –Chapter 23, exercise 15.

1 Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents S. Kawamoto, et al. October 27, 2004.

WIRED Future Quick review of Everything What I do when searching, seeking and retrieving Questions? Projects and Courses in the Fall Course Evaluation.

Chapter. 3: Retrieval Evaluation 1/2/2016Dr. Almetwally Mostafa 1.

User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d)

Speech Processing 1 Introduction Waldemar Skoberla phone: fax: WWW:

Document Clustering for Natural Language Dialogue-based IR (Google for the Blind) Antoine Raux IR Seminar and Lab Fall 2003 Initial Presentation.

Information Retrieval in Practice

Information Retrieval in Practice

Web Search – Summer Term 2006 II. Information Retrieval (Basics Cont.)

Search Engine Architecture

Proposal for Term Project

Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin

Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.

Assoc. Prof. Dr. Syed Abdul-Rahman Al-Haddad

John Lafferty, Chengxiang Zhai School of Computer Science

CSE 635 Multimedia Information Retrieval

Presentation transcript:

(Spoken) Dialogue and Information Retrieval Antoine Raux Dialogs on Dialogs Group 10/24/2003

Outline Interactive Information Retrieval Systems (Belkin et al) Interactive Information Retrieval Systems (Belkin et al) EUREKA: Dialogue-based IR for Low Bandwidth Devices EUREKA: Dialogue-based IR for Low Bandwidth Devices Voice Access to IR Voice Access to IR

Cases, Scripts, and Information- Seeking Strategies Belkin, Cool (Rutgers) Stein, Thiel (GMD-IPSI) Belkin, Cool (Rutgers) Stein, Thiel (GMD-IPSI) Long journal article (1995) Long journal article (1995) From the IR community (Expert Systems) From the IR community (Expert Systems)

IR as Interaction Traditional IR research focuses on document/query representation and comparison Traditional IR research focuses on document/query representation and comparison Need to focus on the user Need to focus on the user Represent IR as a dialogue between an information seeker and an information provider Represent IR as a dialogue between an information seeker and an information provider

Information-Seeking Strategies Represent information-seeking behavior along 4 dimensions: Represent information-seeking behavior along 4 dimensions: Method of Interaction (scanning vs searching) Method of Interaction (scanning vs searching) Goal of Interaction (learning vs selecting) Goal of Interaction (learning vs selecting) Mode of Retrieval (recognition vs specification) Mode of Retrieval (recognition vs specification) Resource Considered (information vs meta-info) Resource Considered (information vs meta-info) Binary values  16 strategies (ISS) Binary values  16 strategies (ISS)

Dialogue Structures for Information Seeking Mix of different formalisms: Mix of different formalisms: Recursive state-based schemas (COR) e.g. Request  Promise  Inform  Be contented Recursive state-based schemas (COR) e.g. Request  Promise  Inform  Be contented Scripts: prototypical interaction for each ISS Scripts: prototypical interaction for each ISS Goal trees Goal trees Retrieve Specified Items Specify CharacteristicRecognize Desired Items Offer choiceSelect and Specify

Deriving Scripts from Data Case-based approach: problem solving using previously stored solved instances Case-based approach: problem solving using previously stored solved instances Match a sequence of action to a state- based schema Match a sequence of action to a state- based schema Extract goal tree Extract goal tree Identify goal (which ISS?) Identify goal (which ISS?)

The MERIT System Theory vs Practice… Theory vs Practice… Graphical interface (not NL dialogue) Graphical interface (not NL dialogue) User does case selection (for eventual case-based reasoning) User does case selection (for eventual case-based reasoning) Example task is relational database (not free text IR): uses form filling (!) Example task is relational database (not free text IR): uses form filling (!)

Discussion Contribution to IR: user-centered view, application of many non-IR theories (discourse, CBR) Contribution to IR: user-centered view, application of many non-IR theories (discourse, CBR) BUT: too complicated for the user? BUT: too complicated for the user?

Discussion Contribution to Dialogue Systems: difficult task (not often dealt with in DS), CBR (can we learn dialogue structure from data?) Contribution to Dialogue Systems: difficult task (not often dealt with in DS), CBR (can we learn dialogue structure from data?) BUT: lacks a good, unified, practical framework (too many different paradigms applied…) BUT: lacks a good, unified, practical framework (too many different paradigms applied…)

Dialogue-based IR: Why? Google-like interface still predominant (despite MERIT) Google-like interface still predominant (despite MERIT) Why? Why? Users receives a lot of information (document titles, summaries) and use it as they want Users receives a lot of information (document titles, summaries) and use it as they want Very simple to learn Very simple to learn Very flexible Very flexible BUT: works on high bandwidth devices BUT: works on high bandwidth devices

Dialogue-based IR: Why? For low bandwidth devices (PDA, phone), information-rich interface don’t work For low bandwidth devices (PDA, phone), information-rich interface don’t work Only small pieces of information exchanged at a time Only small pieces of information exchanged at a time System has to select System has to select Less information, more interaction Less information, more interaction

EUREKA: Idea Use dialogue to submit queries to a web search engine, browse through the hierarchically clustered results, perform query reformulation/refinement, etc… Use dialogue to submit queries to a web search engine, browse through the hierarchically clustered results, perform query reformulation/refinement, etc…

EUREKA: Overview Backend: Vivisimo (through web scraper) Backend: Vivisimo (through web scraper) Dialogue Management: RavenClaw (successor of CMU Communicator) Dialogue Management: RavenClaw (successor of CMU Communicator) Language Understanding: Light Open Vocabulary Parser Language Understanding: Light Open Vocabulary Parser NLG/TTS: template-based & Festival NLG/TTS: template-based & Festival

Backend: Vivisimo Available clustering meta-search engine Available clustering meta-search engine Hand-written Perl web scraper (hope Vivisimo doesn’t change their page design by the end of the semester…) Hand-written Perl web scraper (hope Vivisimo doesn’t change their page design by the end of the semester…)

LOV Parser Problem: traditional NL parsers require a dictionary  not applicable to open domain IR Problem: traditional NL parsers require a dictionary  not applicable to open domain IR Solution (implemented in C++): Solution (implemented in C++): fix a small number of one-word commands (new_query, open, list_clusters) fix a small number of one-word commands (new_query, open, list_clusters) parse each line as “[command] [arguments]” or “[command]” or “[arguments]” parse each line as “[command] [arguments]” or “[command]” or “[arguments]”

Dialogue Management: RavenClaw Hierarchical agent architecture: Hierarchical agent architecture: EUREKA Greet User Prompt Query New Query Open Cluster … Submit Query Get Cluster List Get Doc List Inform Results Close Cluster

NLG/TTS Template-based Language Generation (e.g. “I found documents.”) Template-based Language Generation (e.g. “I found documents.”) General purpose Festival voice for TTS General purpose Festival voice for TTS NB: Browsing through lists is not efficient with speech, even for lists of clusters

Already Implemented Working prototype Working prototype Commands: Commands: new_query new_query list_clusters, list_documents list_clusters, list_documents open, close (cluster) open, close (cluster) more, back (list of clusters/documents) more, back (list of clusters/documents)

Demo

Future Work Add more functionalities (query refinement, summarization…) Add more functionalities (query refinement, summarization…) Make clever use of the dialogue (not only command and control + browsing) Make clever use of the dialogue (not only command and control + browsing) System can provide advice to user on search strategies (e.g. “you need to refine the query”) System can provide advice to user on search strategies (e.g. “you need to refine the query”) User and system can negotiate to specify the user’s information need (cf Belkin: overview vs specific document) User and system can negotiate to specify the user’s information need (cf Belkin: overview vs specific document)

Future Work/Discussion Advantage of dialogue: more feedback from the user Advantage of dialogue: more feedback from the user How can dialogue improve the efficiency of low bandwidth IR? How can dialogue improve the efficiency of low bandwidth IR? Do we need to tailor IR techniques (e.g. clustering) for dialogue, or even design new techniques? Do we need to tailor IR techniques (e.g. clustering) for dialogue, or even design new techniques?

Vocal Access to IR Problem: ASR introduces a lot of erroneous words in a spoken query (for an open domain, speaker independent system) Problem: ASR introduces a lot of erroneous words in a spoken query (for an open domain, speaker independent system) However, in an IR system: access to many text documents to help language modeling… However, in an IR system: access to many text documents to help language modeling…

Vocal Access to a Newspaper Archive (Crestani 02) Presents studies for a full voice-controlled IR system Presents studies for a full voice-controlled IR system No dialogue: user query  list of summaries No dialogue: user query  list of summaries Focuses on issues of: Focuses on issues of: TTS: can user make relevance judgments when they hear document descriptions synthesized over the phone? (answer: yes) TTS: can user make relevance judgments when they hear document descriptions synthesized over the phone? (answer: yes) ASR: how does IR perform with recognized queries? ASR: how does IR perform with recognized queries?

Using IR Techniques to Deal with Recognition Errors WER does have an impact on precision, although not much variation for WER in 27%-47% WER does have an impact on precision, although not much variation for WER in 27%-47% Relevance feedback: use documents judged relevant by the user as query Relevance feedback: use documents judged relevant by the user as query Use prosodic stress to estimate information content of query terms Use prosodic stress to estimate information content of query terms Include semantically/phonetically close terms in the query Include semantically/phonetically close terms in the query

Improving ASR (Fujii et al 02) Fujii et al propose LM adaptation based on the IR corpus: Fujii et al propose LM adaptation based on the IR corpus: Offline “adaptation”: train on the whole corpus Offline “adaptation”: train on the whole corpus Online adaptation: adapt on the top retrieved documents (then reperform ASR and IR) Online adaptation: adapt on the top retrieved documents (then reperform ASR and IR) Good results with offline trained LM (WER < 20%, AP loss of 20-30% from text IR) Good results with offline trained LM (WER < 20%, AP loss of 20-30% from text IR) No evaluation of online adaptation… No evaluation of online adaptation…

Vocal Access to IR: Discussion Seems to work ok for some tasks Seems to work ok for some tasks Clever use of IR techniques Clever use of IR techniques BUT queries are not spontaneous nor natural (maybe) BUT queries are not spontaneous nor natural (maybe) LM for Web queries?? LM for Web queries?? What about dialogue? What about dialogue?