Developing Spoken Dialogue Systems in the Communicator / RavenClaw Framework Sphinx Lunch Talk Carnegie Mellon University, October 2004 Presented by:Dan.

Slides:



Advertisements
Similar presentations
The 4 T’s of Test Automation:
Advertisements

Key architectural details RavenClaw: Dialog Management Using Hierarchical Task Decomposition and an Expectation Agenda Dan BohusAlex Rudnicky School of.
.NET Technology. Introduction Overview of.NET What.NET means for Developers, Users and Businesses Two.NET Research Projects:.NET Generics AsmL.
Configuration management
Request Tracker IT Partners Conference Oliver Thomas 19 April 2005.
Non-Native Users in the Let s Go!! Spoken Dialogue System: Dealing with Linguistic Mismatch Antoine Raux & Maxine Eskenazi Language Technologies Institute.
ICS 434 Advanced Database Systems
Database System Concepts and Architecture
System Development Life Cycle (SDLC)
Executional Architecture
Visual Scripting of XML
Chapter 13 Review Questions
COEN/ELEC I Performance Testing slides 1-12 II User Documentation slides 13 - end 2.
Manuela Veloso, Anthony Stentz, Alexander Rudnicky Brett Browning, M. Bernardine Dias Faculty Thomas Harris, Brenna Argall, Gil Jones Satanjeev Banerjee.
1 Team Talk – A Report. 2 Introduction Project done as part of , Spring ’03. Design and implementation of a spoken dialog system. Thrust of project:
Software Modeling SWE5441 Lecture 3 Eng. Mohammed Timraz
Error Handling in the RavenClaw Dialog Management Framework Dan Bohus, Alexander I. Rudnicky Computer Science Department, Carnegie Mellon University (
Data Model driven applications using CASE Data Models as the nucleus of software development in a Computer Aided Software Engineering environment.
(Spoken) Dialogue and Information Retrieval Antoine Raux Dialogs on Dialogs Group 10/24/2003.
Sean Powers Florida Institute of Technology ECE 5525 Final: Dr. Veton Kepuska Date: 07 December 2010 Controlling your household appliances through conversation.
An Investigation into Recovering from Non-understanding Errors Dan Bohus Dialogs on Dialogs Reading Group Talk Carnegie Mellon University, October 2004.
T-FLEX DOCs PLM, Document and Workflow Management.
Building Enterprise Applications Using Visual Studio ®.NET Enterprise Architect.
1 CS 106, Winter 2009 Class 4, Section 4 Slides by: Dr. Cynthia A. Brown, Instructor section 4: Dr. Herbert G. Mayer,
Learning in the Wild Satanjeev “Bano” Banerjee Dialogs on Dialog March 18 th, 2005 In the Meeting Room Scenario.
What can humans do when faced with ASR errors? Dan Bohus Dialogs on Dialogs Group, October 2003.
RavenClaw Yet another (please read “An improved”) dialog management architecture for task-oriented spoken dialog systems Presented by: Dan Bohus
Extending VERA (Conference Information) Design Specification & Schedules Arthur Chan (AC) Rohit Kumar (RK) Lingyun Gao (LG)
Multiple-Agent Dialogue Thomas K Harris, Satanjeev (Bano) Banerjee Alex Rudnicky Boeing Kick-off Meeting June 8, 2004.
Spoken Dialog Management for an Astronaut’s Procedure Assistant Presented by: Dan Bohus Collaborators: Gregory Aist, RIALIST Group.
Madeleine, a RavenClaw Exercise in the Medical Diagnosis Domain Dan Bohus, Alex Rudnicky MITRE Workshop on Dialog Management, Boston, October 2003.
Extending VERA (Conference Information) Design Specification & Schedules Arthur Chan (AC) Rohit Kumar (RK) Lingyun Gao (LG)
CS 330 Programming Languages 09 / 16 / 2008 Instructor: Michael Eckmann.
Modules, Hierarchy Charts, and Documentation
CONVERSE Intelligent Research Ltd. David Levy, Bobby Batacharia University of Sheffield Yorick Wilks, Roberta Catizone, Alex Krotov.
Goteborg University Dialogue Systems Lab GoDiS and TrindiKit MITRE workshop 27/10-03 Staffan Larsson Göteborg University Sweden.
Course Instructor: Aisha Azeem
MSF Testing Introduction Functional Testing Performance Testing.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 18 Slide 1 Software Reuse.
TESTING STRATEGY Requires a focus because there are many possible test areas and different types of testing available for each one of those areas. Because.
DEV334 Creating Application Starting Points & Sharing Best Practices with Enterprise Templates Marc Gusmano Director of Emerging Technologies The Information.
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
Some Thoughts on HPC in Natural Language Engineering Steven Bird University of Melbourne & University of Pennsylvania.
COMP 523 DIANE POZEFSKY 20 August AGENDA Introductions Logistics Software Engineering Overview Selecting a project Working with a client.
DataFlex Web Framework Symposium – Part 2 Web Framework Overview John Tuohy Development Team
Spoken dialog for e-learning supported by domain ontologies Dario Bianchi, Monica Mordonini and Agostino Poggi Dipartimento di Ingegneria dell’Informazione.
21 August Agenda  Introductions  Logistics  Selecting a project  Working with a client.
Cracow Grid Workshop, October 27 – 29, 2003 Institute of Computer Science AGH Design of Distributed Grid Workflow Composition System Marian Bubak, Tomasz.
LCG Middleware Testing in 2005 and Future Plans E.Slabospitskaya, IHEP, Russia CERN-Russia Joint Working Group on LHC Computing March, 6, 2006.
Workflow Project Status Update Luciano Piccoli - Fermilab, IIT Nov
Sports Scores Speech Recognition System Major League Baseball Score System.
CE Operating Systems Lecture 3 Overview of OS functions and structure.
Systems Analysis and Design in a Changing World, Fourth Edition
ECE450 - Software Engineering II1 ECE450 – Software Engineering II Today: Introduction to Software Architecture.
Extending VERA (Conference Information) Design Specification & Schedules Arthur Chan (AC) Rohit Kumar (RK) Lingyun Gu (LG)
CSC480 Software Engineering Lecture 10 September 25, 2002.
1 / 26 CS 425/625 Software Engineering Architectural Design Based on Chapter 10 of the textbook [Somm00] Ian Sommerville, Software Engineering, 6 th Ed.,
CASE (Computer-Aided Software Engineering) Tools Software that is used to support software process activities. Provides software process support by:- –
Talking to Robots Using MS Speech SDK 5.1 in C# Sebastian van Delden USC Upstate
Mantid Stakeholder Review Nick Draper 01/11/2007.
Rapid Development in new languages Limited training data (6hrs) provided by NECTEC from 34 speakers, + 8 spks for development and test Romanization of.
CSC 480 Software Engineering High Level Design. Topics Architectural Design Overview of Distributed Architectures User Interface Design Guidelines.
Website Design, Development and Maintenance ONLY TAKE DOWN NOTES ON INDICATED SLIDES.
Designing and Implementing an ETL Framework
PLM, Document and Workflow Management
Architecture Components
Sphinx Lunch Talk Carnegie Mellon University, October 2004
TechEd /23/2019 9:23 AM © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks.
Modern Systems Analysis and Design Third Edition
T-FLEX DOCs PLM, Document and Workflow Management.
Presentation transcript:

Developing Spoken Dialogue Systems in the Communicator / RavenClaw Framework Sphinx Lunch Talk Carnegie Mellon University, October 2004 Presented by:Dan Bohus Special appearances: Antoine Raux, Jahanzeb Sherwani, Thomas Harris

Examples  RoomLine conference room reservations within SCS; system can access schedules of 13 conf rooms in Wean-Hall and NSH  Let’s Go! Bus Information System bus schedule information system for Port Authority buses in Oakland and Squirrel Hill [Let’s Go! Project]  Sublime personalized information management system  TeamTalk an investigation into human and multi-robot spoken language communication in unstructured environments

Examples  RoomLine conference room reservations within SCS; system can access schedules of 13 conf rooms in Wean-Hall and NSH  Let’s Go! Bus Information System bus schedule information system for Port Authority buses in Oakland and Squirrel Hill [Let’s Go! Project]  Sublime personalized information management system  TeamTalk an investigation into human and multi-robot spoken language communication in unstructured environments

Examples  RoomLine conference room reservations within SCS; system can access schedules of 13 conf rooms in Wean-Hall and NSH  Let’s Go! Bus Information System bus schedule information system for Port Authority buses in Oakland and Squirrel Hill [Let’s Go! Project]  Sublime personalized information management system  TeamTalk an investigation into human and multi-robot spoken language communication in unstructured environments

Examples  RoomLine conference room reservations within SCS; system can access schedules of 13 conf rooms in Wean-Hall and NSH  Let’s Go! Bus Information System bus schedule information system for Port Authority buses in Oakland and Squirrel Hill [Let’s Go! Project]  Sublime personalized information management system  TeamTalk an investigation into human and multi-robot spoken language communication in unstructured environments

More Systems  LARRI multimodal system that assists F/A-18 aircraft maintenance personnel throughout the execution of procedural tasks [Symphony]  Madeleine text-based prototype for medical diagnosis system [MITRE workshop]  Eureka dialogue interface to the Vivisimo web search engine

The Communicator / RavenClaw Spoken Dialogue Systems Framework  Examples  Overall Architecture  System Development  Components & Resources  Miscellaneous  Current Research examples : architecture : development : components : miscellaneous : research

Overall Architecture  Classical pipeline architecture Lang. Understand. PHOENIX/HELIOS Dialog Manag. RAVENCLAW Back-end (various) Lang. Generation ROSETTA Recognition SPHINX Synthesis THETA examples : architecture : development : components : miscellaneous : research

Galaxy HUB Lang. Understand. PHOENIX/HELIOS Dialog Manag. RAVENCLAW Back-end (various) Lang. Generation ROSETTA HUB Recognition SPHINX Synthesis THETA Galaxy -Generic centralized, message- passing communication architecture -Developed at MIT, used in Communicator program -Competitor: OAA examples : architecture : development : components : miscellaneous : research

Getting Even Closer Lang. Understand. PHOENIX/HELIOS Dialog Manag. RAVENCLAW Back-end (perl) Language Gen. ROSETTA HUB Recognition SPHINX Synthesis THETA examples : architecture : development : components : miscellaneous : research

PROCESS MONITOR SPHINX Getting Even Closer Dialog Manag. RAVENCLAW Back-end (perl) Lang. Generation ROSETTA HUB Lang. Understand. PHOENIX/HELIOS Recognition Server Synthesis THETA Multiple, parallel decoders DateTime Other domain agents Back-end Galaxy Stub Actual Perl Back-end Lang. Generation ROSETTA (Perl) Lang. Generation Galaxy Stub Text I/O TTYServer Parsing PHOENIX Confidence HELIOS examples : architecture : development : components : miscellaneous : research Inputs from other modalities

The Communicator / RavenClaw Spoken Dialogue Systems Framework  Examples  Overall Architecture  System Development  Components & Resources  Miscellaneous examples : architecture : development : components : miscellaneous : research

Building a Spoken Dialogue System Lang. Understand. PHOENIX/HELIOS Dialog Manag. RAVENCLAW Back-end (perl) Lang. Generation ROSETTA Recognition SPHINX Synthesis THETA Grammar Templates RavenClaw Dialog Task Specification Back-end (perl) Language, Acoustic, Lexical Models (Limited Domain) Voice examples : architecture : development : components : miscellaneous : research

Language, Acoustic, Lexical Models (Limited Domain) Voice So How Long Will It Take? Lang. Understand. PHOENIX/HELIOS Dialog Manag. RAVENCLAW Back-end (perl) Lang. Generation ROSETTA Recognition SPHINX Synthesis THETA Grammar Templates RavenClaw Dialog Task Specification Back-end (perl) -MITRE Workshop on Dialogue Management (Fall 2003) -Develop a Text-based SDS for medical diagnosis (provided backend) -Madeleine (22 hours) examples : architecture : development : components : miscellaneous : research

Okay, How Long Will It Really Take?  To get a system running with a reasonable performance [poll amongst 3 RavenClaw developers]  1 month to get a working system up and running  1 month to fine-tune performance  Further iterative improvements will continue as more data accumulates examples : architecture : development : components : miscellaneous : research

The Communicator / RavenClaw Spoken Dialogue Systems Framework  Examples  Overall Architecture  System Development  Components & Resources  Miscellaneous examples : architecture : development : components : miscellaneous : research

Components & Resources Lang. Understand. PHOENIX/HELIOS Dialog Manag. RAVENCLAW Back-end (perl) Lang. Generation ROSETTA Recognition SPHINX Synthesis THETA Grammar Templates RavenClaw Dialog Task Specification Back-end (perl) Language, Acoustic Models Limited Domain Voice examples : architecture : development : components : miscellaneous : research

Components & Resources Lang. Understand. PHOENIX/HELIOS Dialog Manag. RAVENCLAW Back-end (perl) Lang. Generation ROSETTA Recognition SPHINX Synthesis THETA Grammar Templates RavenClaw Dialog Task Specification Back-end (perl) Language, Acoustic Models Limited Domain Voice examples : architecture : development : components : miscellaneous : research

SPHINX II  Semi-continuous acoustic models  Off-the-shelf 8kHz, kHz, 16kHz models  Scripts for building your own PLSA adapted models perform better  Language models  2-gram & 3-gram model CMU-Cambridge SLM Toolkit Generate from Phoenix Grammar  Finite state grammar  Sphinx supports state-specific LMs  Dictionary (lexical models)  CMU Dictionary examples : architecture : development : components : miscellaneous : research

Sphinx II - continued  Multiple parallel decoders [e.g., male + female]  Multiple hypothesis forwarded, selection done later  Typical WER: 15-30%  With pronounced differences native vs. non-native  Lowered by retuning acoustic and language models to the domain  Migration to SPHINX 3.x in the near future  Expected: big improvement in WER  Concern: real-time performance

Components & Resources Lang. Understand. PHOENIX/HELIOS Dialog Manag. RAVENCLAW Back-end (perl) Lang. Generation ROSETTA Recognition SPHINX Synthesis THETA Grammar Templates RavenClaw Dialog Task Specification Back-end (perl) Language, Acoustic Models Limited Domain Voice examples : architecture : development : components : miscellaneous : research

Phoenix Parser / Grammar  Phoenix: Robust Parser  CFG Grammar  Manually-generated domain- specific grammar rules  Reusable, generic sub-grammars [Yes], [No], [Number], [DateTime], [Help], [Repeat], [Suspend], etc… [room_size_spec] ([rss_large]) ([rss_small]) ([rss_larger]) ([rss_smaller]) ([rss_smallest]) ([rss_largest]) ; [rss_large] (large) (big) (huge) ; [rss_larger] (*the larger) (*the bigger) (too small) ; [rss_largest] (*the largest) (*the biggest) ; [rss_small] (small) (little) ; examples : architecture : development : components : miscellaneous : research DO YOU HAVE SOMETHING A BIT LARGER? [NeedRoom] ( [_i_want] (DO YOU HAVE SOMETHING) ) [RoomSizeSpec] ( [room_size_spec] ( [rss_larger] (LARGER)))  Parses all incoming hypotheses and passes all parses along…

Helios / Confidence Annotation  Builds accurate confidence scores using features from 3 sources of knowledge:  Speech recognition  Language understanding  Dialogue management  Selects hypothesis with maximum confidence score  Research in progress on hypothesis- selection, and transferability across domains examples : architecture : development : components : miscellaneous : research

Components & Resources Lang. Understand. PHOENIX/HELIOS Dialog Manag. RAVENCLAW Back-end (perl) Lang. Generation ROSETTA Recognition SPHINX Synthesis THETA Grammar Templates RavenClaw Dialog Task Specification Back-end (perl) Language, Acoustic Models Limited Domain Voice examples : architecture : development : components : miscellaneous : research

RavenClaw Architecture  Captures all domain-specific dialog (task) logic using a hierarchical description  The authoring effort is focused entirely here Dialog Task (Specification) Domain-independent Dialog Engine  Manages dialog by executing the dialog task specification  Provides a large number of domain-independent conversational strategies examples : architecture : development : components : miscellaneous : research

RavenClaw Architecture  Captures all domain-specific dialog (task) logic with a hierarchical description  The authoring effort is focused entirely here Dialog Task (Specification) Domain-independent Dialog Engine  Manages dialog by executing the dialog task specification  Provides a large number of domain-independent conversational strategies examples : architecture : development : components : miscellaneous : research

RavenClaw: Dialogue Task Specification Madeleine E:LoadSymptomsGeneralFeel R:HowAreYou?I:GladI:Sorry Diagnose FeverTravel R:AskFeverE:MeasureTempI:InformFever I:Welcome general_feeling have_fever diagnostic  Tree of dialog agents  Terminals: Inform, Request, Expect, Execute  Non-terminals / Dialog agency: plans execution of child nodes  Basically a Hierarchical Task Execution Network; each agent:  Preconditions & effects  Success & failure criteria  Trigger (focus) criteria  Effects examples : architecture : development : components : miscellaneous : research

Sample DTS Code // /Madeleine/GeneralFeel DEFINE_AGENCY(CGeneralFeel, DEFINE_CONCEPTS( STRING_USER_CONCEPT(general_feeling, none)) DEFINE_SUBAGENTS( SUBAGENT(HowAreYou, CHowAreYou) SUBAGENT(Glad, CGlad) SUBAGENT(Sorry, CSorry)) SUCCEEDS_WHEN(COMPLETED(Glad) || COMPLETED(Sorry))) // /Madeleine/GeneralFeel/HowAreYou DEFINE_REQUEST_AGENT(CHowAreYou, REQUEST_CONCEPT(general_feeling) GRAMMAR_MAPPING("![Yes]>good, ![FeelingGood]>good, " "![FeelingSoSo]>soso, ![FeelingBad]>bad"))) // /Madeleine/GeneralFeel/Glad DEFINE_INFORM_AGENT(CGlad, PRECONDITION(C("general_feeling") == CString("good")) PROMPT("inform glad_youre_good") ON_COMPLETION(FINISH(/Madeleine))) // /Madeleine/GeneralFeel/Sorry DEFINE_INFORM_AGENT(CSorry, PRECONDITION(C("general_feeling") != CString("good")) PROMPT("inform sorry_youre_bad")) R:HowAreYou? general_feeling GeneralFeel I:GladI:Sorry examples : architecture : development : components : miscellaneous : research

RavenClaw Execution Dialog Stack Madeleine E:LoadSymptomsGeneralFeel R:HowAreYou?I:GladI:Sorry Diagnose FeverTravel R:AskFeverE:MeasureTempI:InformFever I:Welcome Expectation Agenda general_feeling chart have_fever diagnostic examples : architecture : development : components : miscellaneous : research

RavenClaw Execution Dialog Stack Madeleine E:LoadSymptomsGeneralFeel R:HowAreYou?I:GladI:Sorry Diagnose FeverTravel R:AskFeverE:MeasureTempI:InformFever I:Welcome Expectation Agenda general_feeling chart have_fever diagnostic examples : architecture : development : components : miscellaneous : research

RavenClaw Execution Dialog Stack Madeleine Welcome Madeleine E:LoadSymptomsGeneralFeel R:HowAreYou?I:GladI:Sorry Diagnose FeverTravel R:AskFeverE:MeasureTempI:InformFever I:Welcome Expectation Agenda general_feeling chart have_fever diagnostic examples : architecture : development : components : miscellaneous : research

RavenClaw Execution Dialog Stack Madeleine Hi, this is Madeleine, the automated… Madeleine E:LoadSymptomsGeneralFeel R:HowAreYou?I:GladI:Sorry Diagnose FeverTravel R:AskFeverE:MeasureTempI:InformFever I:Welcome Expectation Agenda general_feeling chart have_fever diagnostic examples : architecture : development : components : miscellaneous : research

RavenClaw Execution Dialog Stack Madeleine Hi, this is Madeleine, the automated… Madeleine E:LoadSymptomsGeneralFeel R:HowAreYou?I:GladI:Sorry Diagnose FeverTravel R:AskFeverE:MeasureTempI:InformFever I:Welcome LoadSymptoms R:HeadacheR: Expectation Agenda general_feeling chart have_fever diagnostic headache examples : architecture : development : components : miscellaneous : research

RavenClaw Execution Dialog Stack Madeleine Hi, this is Madeleine, the automated… Madeleine E:LoadSymptomsGeneralFeel R:HowAreYou?I:GladI:Sorry Diagnose FeverTravel R:AskFeverE:MeasureTempI:InformFever I:Welcome R:HeadacheR: Expectation Agenda general_feeling chart have_fever diagnostic headache examples : architecture : development : components : miscellaneous : research

RavenClaw Execution Dialog Stack Madeleine Hi, this is Madeleine, the automated… Madeleine E:LoadSymptomsGeneralFeel R:HowAreYou?I:GladI:Sorry Diagnose FeverTravel R:AskFeverE:MeasureTempI:InformFever I:Welcome R:HeadacheR: GeneralFeel Expectation Agenda general_feeling chart have_fever diagnostic headache examples : architecture : development : components : miscellaneous : research

RavenClaw Execution / Input Pass Dialog Stack Madeleine Hi, this is Madeleine, the automated… Madeleine E:LoadSymptomsGeneralFeel R:HowAreYou?I:GladI:Sorry Diagnose FeverTravel R:AskFeverE:MeasureTempI:InformFever I:Welcome R:HeadacheR: GeneralFeel How are you feeling today? general_feeling chart have_fever diagnostic HowAreYou Expectation Agenda general_feeling: [good], [bad], [soso] GeneralFeel I:GladI:Sorry Not so good, I think I have a fever [soso](not so good) [fever](I think I have a fever) headache GeneralFeel examples : architecture : development : components : miscellaneous : research

RavenClaw Execution Dialog Stack Madeleine Hi, this is Madeleine, the automated… Madeleine E:LoadSymptomsGeneralFeel R:HowAreYou?I:GladI:Sorry Diagnose FeverTravel R:AskFeverE:MeasureTempI:InformFever I:Welcome R:HeadacheR: GeneralFeel Expectation Agenda general_feeling chart have_fever diagnostic headache examples : architecture : development : components : miscellaneous : research How are you feeling today? Not so good, I think I have a fever [soso](not so good) [fever](I think I have a fever)

RavenClaw Execution Dialog Stack Madeleine Hi, this is Madeleine, the automated… Madeleine E:LoadSymptomsGeneralFeel R:HowAreYou?I:GladI:Sorry Diagnose FeverTravel R:AskFeverE:MeasureTempI:InformFever I:Welcome R:HeadacheR: GeneralFeel Expectation Agenda general_feeling chart have_fever diagnostic headache examples : architecture : development : components : miscellaneous : research How are you feeling today? Not so good, I think I have a fever [soso](not so good) [fever](I think I have a fever) Sorry Oh, I’m sorry to hear that… Let me take your temperature…

RavenClaw – Other features  Dialogue Engine transparently provides a set of conversational skills  Universal dialogue mechanisms: Repeat, Suspend / Resume, Quit  Help: Help!, Where are we?, What can I say?  Error handling: Explicit and implicit confirmations Strategies for recovering from non-understandings  Dynamic dialogue task generation  Dynamic dialogue control policy

Components & Resources Lang. Understand. PHOENIX/HELIOS Dialog Manag. RAVENCLAW Back-end (perl) Lang. Generation ROSETTA Recognition SPHINX Synthesis THETA Grammar Templates RavenClaw Dialog Task Specification Back-end (perl) Language, Acoustic Models Limited Domain Voice examples : architecture : development : components : miscellaneous : research

Backend & Domain Agents  Various problem-specific solutions  RoomLine Connects to a static Perl database or to the CMU CorporateTime server;  Let’s Go! Bus Information system Connects to a PostGRES database  Sublime Connects to a MySQL database; also functions as a web-server; DTW search domain agent  Basically, build your own; we provide a stub for interfacing with the Galaxy-Hub examples : architecture : development : components : miscellaneous : research

Components & Resources Lang. Understand. PHOENIX/HELIOS Dialog Manag. RAVENCLAW Back-end (perl) Lang. Generation ROSETTA Recognition SPHINX Synthesis THETA Grammar Templates RavenClaw Dialog Task Specification Back-end (perl) Language, Acoustic Models Limited Domain Voice examples : architecture : development : components : miscellaneous : research

Rosetta Language Generation  Template- and stochastic-based language generation  Input: (act, object, {slot=value})  Output: text (tagged with concepts) # welcome to the system “welcome” => “Welcome to RoomLine, the automated conference room “. “reservation system.”, # greet user “greet_user” => (“Hi,.”, “Hi,, good to hear from you again.”), # inform the user that the system has misunderstood the times (order) “wrong_time_order” => sub { my %args my $time_interval_as_string = get_wrong_time_interval_as_string(\%args, “room_query.date_time.time”); my $answer = “I'm sorry, I must have misunderstood the “. “time you needed the room. “; $answer.= “I heard $time_interval_as_string. “; return [“$answer So, let's see... “, “$answer So, let's try this again... “, “$answer So, let's try this once more... “]; }, examples : architecture : development : components : miscellaneous : research

Components & Resources Lang. Understand. PHOENIX/HELIOS Dialog Manag. RAVENCLAW Back-end (perl) Lang. Generation ROSETTA Recognition SPHINX Synthesis THETA Grammar Templates RavenClaw Dialog Task Specification Back-end (perl) Language, Acoustic Models Limited Domain Voice examples : architecture : development : components : miscellaneous : research

Synthesis  Cepstral Theta synthesis  Open-domain unit-selection synthesis  SSML tags  [Currently working on barge-in location]  Festival synthesis  Diphone synthesis; Open-domain, Limited-domain unit-selection synthesis  SABLE tags  Server running separately on a Linux box examples : architecture : development : components : miscellaneous : research

The Communicator / RavenClaw Spoken Dialogue Systems Framework  Examples  Overall Architecture  System Development  Components & Resources  Miscellaneous  Current Research examples : architecture : development : components : miscellaneous : research

Miscellaneous – Documentation  Transmitted largely by oral tradition :)  A bit of documentation available  Research papers, slides  WIKI: mostly for developers, postings of updates, recent developments; hopefully more introductory materials soon.  More under work  Tutorials: 2 available, but a bit outdated examples : architecture : development : components : miscellaneous : research

Miscellaneous – Portability  Current systems work on PC Windows platforms  Galaxy has Linux version  Components are C, C++, (Visual Studio 6.0, Visual Studio.NET), Perl  How about using different input / output components?  Modify RavenClaw DMInterface class Has been done for the Gemini parser / language generator examples : architecture : development : components : miscellaneous : research

Miscellaneous – Research Platform  Communicator / RavenClaw framework is a research platform!  Constantly evolving  Modular Easy to change, develop and test new technologies  Research on variety of topics in a real-world, full- blown system: Recognition, Language understanding, Dialogue management, Language generation, Synthesis  Your work can be evaluated / reused easily across multiple existing systems examples : architecture : development : components : miscellaneous : research

Miscellaneous - Download   Download a version of RoomLine  An installation script can seed your own project from this RoomLine version examples : architecture : development : components : miscellaneous : research

Miscellaneous – RavenClaw Team  RavenClaw Team  Dan  Antoine  Jahanzeb  Thomas  Satanjeev  Brian  More users / developers / documentation writers are always welcome!!  Dialogs on Dialogs Reading Group  examples : architecture : development : components : miscellaneous : research

The Communicator / RavenClaw Spoken Dialogue Systems Framework  Examples  Overall Architecture  System Development  Components & Resources  Miscellaneous  Current Research examples : architecture : development : components : miscellaneous : research

Error awareness and recovery  Problem: lack of robustness when faced with understanding errors  Solution: build mechanisms for acting robustly at the dialogue management level  Error awareness Building better confidence annotators, hypothesis selection; transference across domains  Error recovery strategies Recovery from non-understandings  Error handling decision process Scalable, adaptable, task-independent architecture for making error handling decisions examples : architecture : development : components : miscellaneous : research

Let’s Go! Research  Speech Recognition: acoustic adaptation on non-native speech WER: 50%  30%  Speech Synthesis: flexible and natural F0 modeling (F0 unit selection) Emphasis on erroneous/uncertain words for utterance confirmation examples : architecture : development : components : miscellaneous : research

Sublime  Interface for personalized information management  Narrow functionality in unrestricted domains  Currently, handle information without understanding it  Eventually, learn relationships and a shallow ontology examples : architecture : development : components : miscellaneous : research

That’s all, folks! THANK YOU!