Siridus Specification, Interaction and Reconfiguration in Dialogue Understanding Systems an information state approach to flexible spoken dialogue systems.

Slides:



Advertisements
Similar presentations
CONCEPTUAL WEB-BASED FRAMEWORK IN AN INTERACTIVE VIRTUAL ENVIRONMENT FOR DISTANCE LEARNING Amal Oraifige, Graham Oakes, Anthony Felton, David Heesom, Kevin.
Advertisements

1 © 2005 CHIL KTH ASIDE 2005, Aalborg, Applications of distributed dialogue systems: The KTH Connector Jens Edlund & Anna Hjalmarsson Applications.
Chapter 11 user support. Issues –different types of support at different times –implementation and presentation both important –all need careful design.
Talk and Look: Tools for Ambient Linguistic Knowledge A Project funded by the European Community under the Sixth Framework Programme for Research and Technological.
Specification, Partitioning, and Composition Techniques for Web Applications in the Context of Event-B Abdolbaghi Rezazadeh Michael Butler University of.
Natural Language Understanding Difficulties: Large amount of human knowledge assumed – Context is key. Language is pattern-based. Patterns can restrict.
Probabilistic Adaptive Real-Time Learning And Natural Conversational Engine Seventh Framework Programme FP7-ICT
ABSTRACT The goal of this project was to create a more realistic and interactive appliance interface for a Usability Science class here at Union. Usability.
Towards an NLP `module’ The role of an utterance-level interface.
Object-Oriented Analysis and Design
U1, Speech in the interface:2. Dialogue Management1 Module u1: Speech in the Interface 2: Dialogue Management Jacques Terken HG room 2:40 tel. (247) 5254.
LE TRINDIKIT A toolkit for building and experimenting with dialogue move engines and systems, based on the information state approach.
Goteborg University Dialogue Systems Lab WP1: GoDiS VCR application Edinburgh TALK meeting 7/
Research about dialogue and dialogue systems and the department of linguistics goal: –develop theories about human dialogue which can be used when building.
1 error handling – Higgins / Galatea Dialogs on Dialogs Group July 2005.
TrindiKit A toolkit for building and experimenting with dialogue move engines and systems, based on the information state approach.
Help and Documentation CSCI324, IACT403, IACT 931, MCS9324 Human Computer Interfaces.
LE A toolkit for building and experimenting with dialogue move engines and systems, based on the information state approach TrindiKit.
Goteborg University Dialogue Systems Lab GoDiS and TrindiKit MITRE workshop 27/10-03 Staffan Larsson Göteborg University Sweden.
Siridus an information state approach to flexible spoken dialogue systems David Milward Staffan Larsson
1 An introduction to design patterns Based on material produced by John Vlissides and Douglas C. Schmidt.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
VoiceXML Builder Arturo Ramirez ACS 494 Master’s Graduate Project May 04, 2001.
9/8/20151 Natural Language Processing Lecture Notes 1.
Role-plays for CALL: System Architecture and Resources Sabrina Wilske & Magdalena Wolska Saarland University ICL, Villach, September.
Some Thoughts on HPC in Natural Language Engineering Steven Bird University of Melbourne & University of Pennsylvania.
1. 2 Purpose of This Presentation ◆ To explain how spacecraft can be virtualized by using a standard modeling method; ◆ To introduce the basic concept.
System Design: Designing the User Interface Dr. Dania Bilal IS582 Spring 2009.
1 CMPT 275 Software Engineering Requirements Analysis Phase Requirements Analysis Activity (Identifying Objects, Scenarios) Janice Regan,
circle Adding Spoken Dialogue to a Text-Based Tutorial Dialogue System Diane J. Litman Learning Research and Development Center & Computer Science Department.
RELATIONAL FAULT TOLERANT INTERFACE TO HETEROGENEOUS DISTRIBUTED DATABASES Prof. Osama Abulnaja Afraa Khalifah
McIlraith - Knowledge Systems Laboratory DAML/Horus Meeting 02/16/2001 Mobilizing the Web with DAML-Enabled Web Services Sheila McIlraith Knowledge Systems.
The roots of innovation Future and Emerging Technologies (FET) Future and Emerging Technologies (FET) The roots of innovation Proactive initiative on:
Dept. of Computer Science University of Rochester Rochester, NY By: James F. Allen, Donna K. Byron, Myroslava Dzikovska George Ferguson, Lucian Galescu,
Issues in Multiparty Dialogues Ronak Patel. Current Trend  Only two-party case (a person and a Dialog system  Multi party (more than two persons Ex.
Agenda 1. What we have done on which tasks 2. Further specification of work on all our tasks 3. Planning for deliverable writing this autumn (due in December)
17 th October 2005CCP4 Database Meeting (York) CCP4(i)/BIOXHIT Database Project: Scope, Aims, Plans, Status and all that jazz Peter Briggs, Wanjuan Yang.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
E.g.: MS-DOS interface. DIR C: /W /A:D will list all the directories in the root directory of drive C in wide list format. Disadvantage is that commands.
Evaluation of Agent Building Tools and Implementation of a Prototype for Information Gathering Leif M. Koch University of Waterloo August 2001.
Introduction to Computational Linguistics
Cognitive Systems Foresight Language and Speech. Cognitive Systems Foresight Language and Speech How does the human system organise itself, as a neuro-biological.
Artificial Intelligence: Natural Language
Information state and dialogue management in the TRINDI Dialogue Move Engine Toolkit, Larsson and Traum 2000 D&QA Reading Group, Feb 20 th 2007 Genevieve.
Information-State Dialogue Modelling in Several Versions HS Dialogmanagement, SS 2002 Universität Saarbrücken Michael Götze.
Conditional Random Fields for ASR Jeremy Morris July 25, 2006.
LanguageLab A Meta-modelling Environment Terje Gjøsæter and Andreas Prinz, University of Agder, Norway SDL Forum 2015, Berlin, Germany.
CS 4705 Lecture 17 Semantic Analysis: Robust Semantics.
Slide 1 2/22/2016 Policy-Based Management With SNMP SNMPCONF Working Group - Interim Meeting May 2000 Jon Saperia.
Speech Processing 1 Introduction Waldemar Skoberla phone: fax: WWW:
A Speech Interface to Virtual Environment Authors Scott McGlashan and Tomas Axling Swedish Institute of Computer Science.
Agent-Based Dialogue Management Discourse & Dialogue CMSC November 10, 2006.
Objectives of session By the end of today’s session you should be able to: Define and explain pragmatics and prosody Draw links between teaching strategies.
How to complete and submit a Final Report through Mobility Tool+ Technical guidelines Authentication, Completion and Submission 1 Antonia Gogaki IT Officer.
Metadata Schema Registries: background and context MEG Registry Workshop, Bath, 21 January 2003 Rachel Heery UKOLN, University of Bath Bath, BA2 7AY UKOLN.
Human Computer Interaction Lecture 21 User Support
IP Telephony (VoIP).
Human Computer Interaction Lecture 21,22 User Support
Program Management Portal: Request Management, PCRs and the Client
Intelligent Agent Solution
Conditional Random Fields for ASR
Home Automation System
Tomás Murillo-Morales and Klaus Miesenberger
An Overview of Simultaneous Remote Interpretation By Telephone
Modular design of domain-specific languages
Chapter 11 user support.
Chapter 12 cognitive models.
Network Training for Continuous Speech Recognition
Ponder policy toolkit Jovana Balkoski, Rashid Mijumbi
Presentation transcript:

Siridus Specification, Interaction and Reconfiguration in Dialogue Understanding Systems an information state approach to flexible spoken dialogue systems

Overview Siridus project Information State Approach Main research areas Demonstrators

Siridus Project Partners –University of Gothenburg (technical coordinator) –University of the Saarland (administrative coordinator) –University of Seville –Telefonica I+D –Linguamatics –SRI (original coordinator) Duration –Jan Dec 2002

Motivation Provide flexible, user-centred dialogue systems Improve reconfigurability of dialogue systems for – new tasks – new domains Provide architectures allowing the dialogue state to be accessible for speech recognition, generation of prosody etc.

IS transitions IS for System IS for System dialogue as transitions between IS states what information do we need in the IS? how should it be structured? can this information be reused e.g. synthesis/recognition? examples –underspecified commands (DELFOS) –questions under discussion (GODIS) Interpret U Generate U’ IS for System

TrindiKit GoDiS GoDiS-IGoDiS-A Travel Agency VCR manager IBDM / KOS ISU approach Delfos basic system Delfos framework DELFOS-NCL Home manager ISU approach TeleDELFOS Home manager

Research Areas Dialogue types Dialogue phenomena Enhancing Contextual Appropriateness of System Output Enhancing speech recognition through IS Robust interpretation

Dialogue Types Natural command dialogues –user initiated commands e.g. transfer my calls to Mr. Jones turn off all the lights upstairs and lock the front door Negotiative dialogues –Propose, negotiate & agree parameters for a task e.g. arrival-time, destination-time alternative solutions to a problem e.g. different flight options Tutorial dialogues –guide student through task –non collaborative –hinting (to encourage active learning)

Dialogue phenomena Conditional responses –Enhancing collaboration by: Indicating source of failure: –U: Can I fly on the second? –S: Not if you want to fly economy class. Indicating contingency of success: –S: Yes, if you fly with SAS. Clarifications –Lack of specificity S: Which account do you want to transfer to? U: My bank account S: Do you mean your deposit account or current account? –Unspecified task e.g. U: Channel five S: Do you want to add a program or change channel?

Enhancing Contextual Appropriateness of System Output In order to make dialogue system utterances sound natural and contextually appropriate, it is important –to control intonation of spoken output –to control other aspects of realization, such as generation of short utterances We handle these aspects using Information Structure –what speaker means to address vs. what she wants to say about it –what is the same vs. what discriminates between similar bits of information in context Information Structure is determined from the IS

Need for Varied Intonation U: Which devices are switched on? S: The KITCHEN LIGHT is switched on. U: What is the status of the kitchen light? S: The kitchen light is switched ON. Die Küchenleuchte ist EINGESCHALTET. Die KÜCHENLEUCHTE ist eingeschaltet.

Enhancing speech recognition using IS In flexible dialogue, may have dynamically generated prompts Can’t rely on pre-defined grammars to recognise the response for each prompt Potential sources of information: –previous move/dialogue history (choice from n-best) –syntactic/semantic coherence of fragments (direct parsing of word lattices) –reference resolution –state of the world Combine/contrast sources of evidence to decide –most likely utterance in word lattice / n-best –when to clarify

Robust Interpretation Phrase spotters go to a particular depth of analysis even if a full parse is possible When syntactic information is available should use it When not, should do at least as well as keyword/phrase spotting Two novel techniques: –distribute the semantic representation using a semantic chart –ontological knowledge to propose combinations via semantics in competition with syntax: semantic-based composition turn on the light and the kitchen Reconciling traditional syntax and compositional semantics with robust approaches

Main Demonstrators TrindiKit 3 Automated Telephone Operator

Trindikit 3 Toolkit for dialogue researchers Developed from TrindiKit 1 built in EU Trindi Project IS update rules allow you to write your own dialogue manager –IS +move(s)  IS Update rules suited for domain and language independent dialogue management Global information state –all modules can access all information –composed from library of datatypes

Trindikit advantages Theoretical –theory-independent allows implementation and comparison of competing theories promotes exploration of middle ground between simplistic and very complex theories of dialogue –intuitive formalisation and implementation of dialogue theories; the implementation is close to the theory Practical –promotes reuse and reconfigurability on multiple levels –general solutions to general phenomena enables rapid prototyping of applications –allows dealing with more complex dialogue phenomena not handled by current commercial systems

Automated telephone operator Natural language telephone-based access to company telephone directory/PABX Functionality –Dial by name –Multi-party conference –Call transfers –Database search ( address, office, phone number) KQML based message passing Delfos dialogue management User trials at Telefónica I+D

Telephone Operator GUI Corporate Directory / PABX

Further Demonstrations Specific modules –Intonation –Conditional responses –Robust interpretation Applications to illustrate reconfigurability –Travel agency –VCR –Command and control of multiple devices (D’Homme project)

Demonstrators: VCR S> “What can I do for you? " U> five thirty S> "five thirty. I dont quite understand. Do you want to add a program or set the clock?“ U> add a program today S> "Okay. add a program…. Lets see. today… Do you want from five thirty or until five thirty?“ U> from five thirty S> "Okay. from five thirty… What channel do you want? " U> set the clock S> "Okay. Do you want to set the clock?“ U> yes S> "Lets see. What time? " U> four fifteen S> "Okay. four fifteen… The clock was set…Returning to add a program. What channel do you want? “ U> channel five S> "Okay. channel five." U> pardon S> "Okay. channel five." U> yes S> "What time do you want to stop recording? " U> turn on the t v S> "turn on the TV… Sorry, I cannot control the TV set. What time do you want to stop recording? “ U> seven thirty S> "Okay. until seven thirty…The program has been added. " + Feedback Task clarification Load plan Issue clarification Requesting other action ? Feedback Reraising - Feedback (by User) Rejection

Demonstrators: Home device control

Siridus Summary IS approach has provided a stimulus for theoretical work –use of IS to improve intonation of spoken output recognition –exploration of different dialogue genres and phenomena systems which are –modular –reconfigurable –between FS/form filling and BDI/planning an automated telephone operator demonstrator research tool for dialogue developers (TrindiKit 3)