TAUCHI – Tampere Unit for Computer-Human Interaction Markku Turunen Tampere Unit for Human-Computer Interaction University of Tampere MUMIN PhD course,

Slides:



Advertisements
Similar presentations
Database System Concepts and Architecture
Advertisements

CS Body of Knowledge (ACM) Discrete Structures Programming Fundamentals Algorithms & Complexity Operating Systems Architecture & Organization Social &
A component- and message-based architectural style for GUI software
Key-word Driven Automation Framework Shiva Kumar Soumya Dalvi May 25, 2007.
Empirical and Data-Driven Models of Multimodality Advanced Methods for Multimodal Communication Computational Models of Multimodality Adequate.
MediaHub: An Intelligent Multimedia Distributed Hub Student: Glenn Campbell Supervisors: Dr. Tom Lunney Prof. Paul Mc Kevitt School of Computing and Intelligent.
PGNET, Liverpool JMU, June 2005 MediaHub: An Intelligent MultiMedia Distributed Platform Hub Glenn Campbell, Tom Lunney, Paul Mc Kevitt School of Computing.
XISL language XISL= eXtensible Interaction Sheet Language or XISL=eXtensible Interaction Scenario Language.
Asa MacWilliams Lehrstuhl für Angewandte Softwaretechnik Institut für Informatik Technische Universität München Dec Software.
Lecturer: Sebastian Coope Ashton Building, Room G.18 COMP 201 web-page: Lecture.
Introduction To System Analysis and Design
© 2005 Prentice Hall7-1 Stumpf and Teague Object-Oriented Systems Analysis and Design with UML.
John Hu Nov. 9, 2004 Multimodal Interfaces Oviatt, S. Multimodal interfaces Mankoff, J., Hudson, S.E., & Abowd, G.D. Interaction techniques for ambiguity.
Introduction to HCC and HCM. Human Centered Computing Philosophical-humanistic position regarding the ethics and aesthetics of a workplace Any system.
A CHAT CLIENT-SERVER MODULE IN JAVA BY MAHTAB M HUSSAIN MAYANK MOHAN ISE 582 FALL 2003 PROJECT.
Software Architecture Design Instructor: Dr. Jerry Gao.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
Lecture 23: Software Architectures
TAUCHI – Tampere Unit for Computer-Human Interaction Tampere Unit for Human-Computer Interaction University of Tampere Markku Turunen MUMIN workshop, Helsinki,
Interpret Application Specifications
WNT Client/Server SDK Tony Vaccaro CS699 Project Presentation.
Multimodal Architecture for Integrating Voice and Ink XML Formats Under the guidance of Dr. Charles Tappert By Darshan Desai, Shobhana Misra, Yani Mulyani,
WP1 UGOT demos 2nd year review Saarbrucken Mar 2006.
Course Instructor: Aisha Azeem
Architectural Design Establishing the overall structure of a software system Objectives To introduce architectural design and to discuss its importance.
System Design & Software Architecture
Mobile Multimodal Applications. Dr. Roman Englert, Gregor Glass March 23 rd, 2006.
UNIT-V The MVC architecture and Struts Framework.
1 Introduction to Web Development. Web Basics The Web consists of computers on the Internet connected to each other in a specific way Used in all levels.
Copyright 2001 Prentice-Hall, Inc. Essentials of Systems Analysis and Design Joseph S. Valacich Joey F. George Jeffrey A. Hoffer Chapter 1 The Systems.
Introduction To System Analysis and design
1. Human – the end-user of a program – the others in the organization Computer – the machine the program runs on – often split between clients & servers.
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 12 Slide 1 Distributed Systems Architectures.
What is Enterprise Architecture?
Some Thoughts on HPC in Natural Language Engineering Steven Bird University of Melbourne & University of Pennsylvania.
1 CMPT 275 High Level Design Phase Architecture. Janice Regan, Objectives of Design  The design phase takes the results of the requirements analysis.
MediaHub: An Intelligent MultiMedia Distributed Platform Hub Glenn Campbell, Tom Lunney & Paul Mc Kevitt School of Computing and Intelligent Systems Faculty.
Experiments on Building Language Resources for Multi-Modal Dialogue Systems Goals identification of a methodology for adapting linguistic resources for.
4/2/03I-1 © 2001 T. Horton CS 494 Object-Oriented Analysis & Design Software Architecture and Design Readings: Ambler, Chap. 7 (Sections to start.
1 Another group of Patterns Architectural Patterns.
Introduction To System Analysis and Design
OBJECT ORIENTED SYSTEM ANALYSIS AND DESIGN. COURSE OUTLINE The world of the Information Systems Analyst Approaches to System Development The Analyst as.
1 Software Design Reference: Software Engineering, by Ian Sommerville, Ch. 12 & 13, 5 th edition and Ch. 10, 6 th edition.
1-1 System Development Process System development process – a set of activities, methods, best practices, deliverables, and automated tools that stakeholders.
1 Software Design Overview Reference: Software Engineering, by Ian Sommerville, Ch. 12 & 13.
Spoken Dialog Systems and Voice XML Lecturer: Prof. Esther Levin.
1 Welcome to CSC 301 Web Programming Charles Frank.
©Ian Sommerville 2000 Software Engineering, 6th edition. Chapter 10Slide 1 Architectural Design l Establishing the overall structure of a software system.
Advanced Computer Networks Topic 2: Characterization of Distributed Systems.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
1 CMPT 275 High Level Design Phase Modularization.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
CSC480 Software Engineering Lecture 10 September 25, 2002.
Evaluating the User Interface of a Ubiquitous Computing system Doorman Kaj Mäkelä Tampere University Computer Human Interaction Group.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
SEA Side – Extreme Programming 1 SEA Side Software Engineering Annotations Architectural Patterns Professor Sara Stoecklin Director of Software Engineering-
Presentation Title 1 1/27/2016 Lucent Technologies - Proprietary Voice Interface On Wireless Applications Protocol A PDA Implementation Sherif Abdou Qiru.
Slide 1 Lecture 15 Enterprise Systems Development ( CSC447 ) COMSATS Islamabad Muhammad Usman, Assistant Professor.
PGNET, Liverpool JMU, June 2005 MediaHub: An Intelligent MultiMedia Distributed Platform Hub Glenn Campbell, Tom Lunney, Paul Mc Kevitt School of Computing.
W3C Multimodal Interaction Activities Deborah A. Dahl August 9, 2006.
Architectural Mismatch: Why reuse is so hard? Garlan, Allen, Ockerbloom; 1994.
Software Architecture for Multimodal Interactive Systems : Voice-enabled Graphical Notebook.
LetItFlow Architecture Specification Project Meeting Vienna, – Victor Carmocanu SIVECO Romania.
©Ian Sommerville 2000 Software Engineering, 6th edition. Chapter 10Slide 1 Chapter 5:Architectural Design l Establishing the overall structure of a software.
Chapter 2: Database System Concepts and Architecture - Outline
CHAPTER 3 Architectures for Distributed Systems
Architectural Mismatch: Why reuse is so hard?
Presentation transcript:

TAUCHI – Tampere Unit for Computer-Human Interaction Markku Turunen Tampere Unit for Human-Computer Interaction University of Tampere MUMIN PhD course, Tampere, Speech Application Architectures

TAUCHI – Tampere Unit for Computer-Human Interaction Outline Topics Background Architecture types Example architectures Topics for research Jaspis architecture

TAUCHI – Tampere Unit for Computer-Human Interaction Software architectures 1 Definitions “software architecture defines the system in terms of components and interactions between them. Connectors are used to mediate interaction between the components” [Garlan & Shaw, 1994] several views can be used to describe different aspects of software architectures: design view, run-time view, module view, logical view, control view, class view, … human-computer interaction viewpoint: support for interaction methods and techniques

TAUCHI – Tampere Unit for Computer-Human Interaction Software architectures 2 Software development tools support and tools for the construction of practical applications core architecture: basic infrastructure (hub/facilitator, communication libraries, blackboard) complete architecture: technology components (ASR, TTS), dialogue manager, database, … toolkit: dialogue editor, ASR grammar builder, corpus collection tool, annotation editor, …

TAUCHI – Tampere Unit for Computer-Human Interaction Speech system components speech recognition natural language processing speech synthesis natural language generation database dialogue management telephone interface user

TAUCHI – Tampere Unit for Computer-Human Interaction Architecture types 1 Pipelines and dialogue management architectures pipeline (batch-sequence) architectures –data flow –one-way interfaces –fixed processing order dialogue manager architectures –function calls –dialogue manager as controller –relaxed processing order TTS ASR NLU DM NLG TTS ASR NLU DM UM NLG DB

TAUCHI – Tampere Unit for Computer-Human Interaction client-server and blackboard architectures client-server architectures –two-way messages –hub as coordinator (star topology) –free processing order blackboard (DB) architectures –data events / db operations –shared information –free processing order IS Architecture types 2 TTS ASR NLU DM UM NLG DB HUB TTS ASR NLU DM UM NLG

TAUCHI – Tampere Unit for Computer-Human Interaction agent architectures independent agents –independent agents –facilitator –collaborative processing compact agents –compact agents –shared knowledge –distributed processing Architecture types 3 TTS ASR NLU DM UM NLG DB Facilitator TTS ASR Facilitator IS PE DA DE PA UA DE IA DA PE PA DA IA PA NLG NLU

TAUCHI – Tampere Unit for Computer-Human Interaction Example architectures 1 GALAXY-II MIT / MITRE DARPA Communicator reference architecture freely available HUB and servers frames (messages) hub scripts route messages [Seneff et al., 1998]

TAUCHI – Tampere Unit for Computer-Human Interaction Example architectures 2 Open Agent Architecture general agent architecture Facilitator as coordinator requesters (tasks) services (solutions) Interagent Communication Language (ICL) freely available used in speech applications [Martin et al., 1999]

TAUCHI – Tampere Unit for Computer-Human Interaction Example architectures 3 WITAS dialogue manager agent reacts to events send by other agents dialogue manager acts as blackboard multimodal inputs are coordinated by DM based on OAA [Lemon et al., 2001]

TAUCHI – Tampere Unit for Computer-Human Interaction Example architectures 4 MITRE architecture dialogue manager as controller default processing order dialogue manager monitors other components dialogue manager is a kind of blackboard based on OAA [Luperfoy et al., 1998]

TAUCHI – Tampere Unit for Computer-Human Interaction Example architectures 5 TRIPS agents, managers and shared databases loosely coupled components no dialogue manager KQML messages facilitator does not contain control logic [Allen et al., 2001]

TAUCHI – Tampere Unit for Computer-Human Interaction Current vs. new application areas Traditional speech applicationsFuture speech applications single usermultiple users desktop and telephony office and home environments, mobile settings Single deterministic dialogue open-ended, dynamically constructed concurrent dialogues active user(pro)active computer centralized dialogue managementdistributed interaction management

TAUCHI – Tampere Unit for Computer-Human Interaction Current vs. new application areas Traditional speech applicationsFuture speech applications mostly unimodalmostly multimodal alternative / exclusive (sequential) modalities concurrent / synergistic (parallel) modalities speech, text, graphicsspeech, sensors, haptics, gestures… monolingualMultilingual ”natural” interaction methods based on human-human interaction ”innovative” interaction methods based on human-computer interaction

TAUCHI – Tampere Unit for Computer-Human Interaction Adaptive systems Need for adaptive applications different users: speech-based communication can differ greatly between individual users and situations –speech is language and culture dependent –preferences and needs between user groups can be large different approaches: people from different backgrounds have different solutions for same problems we need interaction methods and architectures that adapt to the different users and situations and support multiple approaches

TAUCHI – Tampere Unit for Computer-Human Interaction Future example U: S: welcome to the bus timetable system? How may I help you? U: I want to go to hospital. S: Which hospital do you mean? There are three hospitals? U: The northern one. S: what is your departure location? U: railway station. S: bus number 17 leaves 10:12 U: bye. time passes, no bus is coming U: S: welcome to the taxi service. How may I help you? … S: You have appointment with your doctor. You need to hurry to catch the bus. It leaves the central station 10:12. U: thanks. time passes, bus strike hits the city S: I’m really sorry, there is no bus coming. The next train leaves seven minutes from now.. U: no, I take a taxi. S: Please wait… It is coming, please wait in front of the opera building. U: thanks.

TAUCHI – Tampere Unit for Computer-Human Interaction Topics for research Topics for speech systems adaptivity: how to support adaptive methods? how to make systems to be adaptive? reusability: components, interaction methods, … distributed systems: communication protocols, resource sharing, ubiquitous applications distributed interaction management: centralized dialogue manager is not suitable for many tasks shared knowledge: dialogue, user etc. development and evaluation tools: WOZ, corpora, …

TAUCHI – Tampere Unit for Computer-Human Interaction Jaspis architecture speech application development framework implementation of core architecture with extensions designed especially for multilingual and distributed applications overall focus on system level adaptivity current focus on ubiquitous and multimodal applications Java and XML, freely available used in several projects and applications

TAUCHI – Tampere Unit for Computer-Human Interaction Jaspis architecture overview NGLNLU DBUM

TAUCHI – Tampere Unit for Computer-Human Interaction Agents, evaluators and managers agents handle various interaction situations, such as speech input interpretations, dialogue decisions and speech output presentations evaluators measure how well agents can handle current interaction situation managers are used to coordinate agents and evaluators, especially to try to choose the best possible agents to handle each interaction situation Jaspis components

TAUCHI – Tampere Unit for Computer-Human Interaction Jaspis interaction management

TAUCHI – Tampere Unit for Computer-Human Interaction information storing method is not fixed (XML, DB) information access protocol is defined (DTD) Information Managers are used to access the Information Storage – these can be implemented in any language and they can use TCP/IP, XML-RPC or method calls Information management in Jaspis

TAUCHI – Tampere Unit for Computer-Human Interaction presentation agents convert conceptual messages to speech outputs for every output the most suitable agent is selected by presentation evaluators multiple presentation management modules for different phases Presentation management in Jaspis

TAUCHI – Tampere Unit for Computer-Human Interaction different dialogue agents for different dialogue tasks alternative dialogue agents for same dialogue tasks dialogue evaluators select dialogue agents no single controller (the dialogue manager) multiple dialogue management modules Dialogue management in Jaspis

TAUCHI – Tampere Unit for Computer-Human Interaction i/o-agents and evaluators handle, combine and coordinate different input streams devices – clients – servers – engines run-time interpretation and multimodal fusion separate module for selection of input modalities Communication (I/O) management in Jaspis

TAUCHI – Tampere Unit for Computer-Human Interaction Jaspis extensions Beyond core infrastructure XML-based linguistic information (Annotation Graphs) and log formats (corpus collection, usability tests) visualization components (blackboard, interaction) speech technology interfaces for common telephony cards, synthesizer and recognizers reusable components: error handling, general tasks SMS interface, graphical components Wizard Of Oz tools

TAUCHI – Tampere Unit for Computer-Human Interaction Jaspis Future improvements concurrent dialogues and multiple users event-based interaction management

TAUCHI – Tampere Unit for Computer-Human Interaction Tampere Unit for Computer Human Interaction Department of Computer and Information Sciences