MUSCLE Multimodal e-team related activity. Technical University of Crete, Speech Processing and Dialog Systems Group. Presenter: Prof. Alex Potamianos

Presentation transcript:

MUSCLE Multimodal e-team related activity
Technical University of Crete, Speech Processing and Dialog Systems Group
Presenter: Prof. Alex Potamianos

Goals
- Develop domain-independent algorithms and tools for rapid development by non-experts of state-of-the-art multi-modal dialogue systems
- Investigate the optimal modality mix (optimal = maximizes UI efficiency and user satisfaction)
- Demonstrate the synergies between modalities and build a state-of-the-art MM-UI module

Multi-Modal User Interface
- Emphasis on synergies between modalities:
  - Value(s) of attributes are displayed graphically
  - Erroneous values can be easily corrected via the GUI
  - Focus (aka context) of speech modality is highlighted
  - Position and value ambiguity are shown (and typically resolved) via the GUI
  - Voice prompts are significantly shorter
  - GUI takes full advantage of intelligence of voice UI
- Three interaction modes implemented: click-to-talk, open-mike and modality selection

GUI examples Button Disabled

GUI Ambiguity Resolution

Click-to-Talk Examples
[Screenshot labels: Click to Talk; Speech Interface Enabled, GUI Disabled; Beginning of Next Turn, GUI Enabled]

Open-Mike Examples
[Screenshot labels: Waiting for input via Speech or GUI (mouse and keyboard); Speech has been detected; Beginning of Next Turn]
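The screenshot labels for the click-to-talk and open-mike modes suggest a simple per-turn rule for which input channels are active. The following is a minimal sketch of that rule, not the presenters' implementation: the names `Mode`, `Inputs`, `start_of_turn` and `on_click_to_talk` are hypothetical, and the assumption that speech input is off at the start of a click-to-talk turn is inferred from the slide labels rather than stated in them.

```python
from dataclasses import dataclass
from enum import Enum, auto


class Mode(Enum):
    CLICK_TO_TALK = auto()
    OPEN_MIKE = auto()


@dataclass(frozen=True)
class Inputs:
    """Which input channels accept user input right now (hypothetical model)."""
    speech: bool
    gui: bool


def start_of_turn(mode: Mode) -> Inputs:
    """Channels active at the beginning of a turn.

    Open-mike: the system waits for input via speech or GUI.
    Click-to-talk: the GUI is enabled; speech is assumed to be off
    until the user clicks the talk button (an inference, not a stated fact).
    """
    if mode is Mode.CLICK_TO_TALK:
        return Inputs(speech=False, gui=True)
    return Inputs(speech=True, gui=True)


def on_click_to_talk() -> Inputs:
    """After the talk button is clicked: speech interface enabled, GUI disabled."""
    return Inputs(speech=True, gui=False)


if __name__ == "__main__":
    print(start_of_turn(Mode.CLICK_TO_TALK))
    print(on_click_to_talk())
    print(start_of_turn(Mode.OPEN_MIKE))
```

The sketch deliberately leaves out what happens to the GUI once speech is detected in open-mike mode, since the slides do not say.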

Modality Tracking Examples
[Screenshot labels: Click To Talk Mode; Open Mike Mode]

Experiments
- 15 naïve non-native users with varying levels of English language knowledge and accent
- Application: form-filling, travel reservation (flight, hotel, car)
- 5 scenarios: one/two/three leg flight, round-trip flight with car, round-trip with hotel
- 5 systems: speech, GUI, click-to-talk, open-mike, modality selection
- 5x5 = 25 runs per user
- Scenarios and systems tested in random order
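The design above (5 scenarios x 5 systems, randomized per user) could be laid out as in the sketch below. This is an illustration under assumptions, not the authors' procedure: the slides do not say whether all 25 scenario/system pairs were shuffled together or randomized in some blocked fashion, so a full shuffle is assumed here. The function name `runs_for_user` and the seeding scheme are hypothetical.

```python
import random
from itertools import product

SCENARIOS = [
    "one-leg flight",
    "two-leg flight",
    "three-leg flight",
    "round-trip flight with car",
    "round-trip with hotel",
]
SYSTEMS = ["speech", "GUI", "click-to-talk", "open-mike", "modality selection"]
NUM_USERS = 15


def runs_for_user(user_id: int, seed: int = 0) -> list[tuple[str, str]]:
    """Return all 25 (scenario, system) pairs in a random order for one user.

    Assumption: every pair is run exactly once per user and the whole
    list is shuffled; the per-user seed only makes the order reproducible.
    """
    rng = random.Random(seed * 1000 + user_id)
    runs = list(product(SCENARIOS, SYSTEMS))
    rng.shuffle(runs)
    return runs


# One schedule per user: 15 users x 25 runs = 375 runs in total.
schedule = {user: runs_for_user(user) for user in range(1, NUM_USERS + 1)}

if __name__ == "__main__":
    for scenario, system in schedule[1][:3]:
        print(f"user 1: {system} on {scenario}")
```

Shuffling the full cross product keeps every user's coverage identical while varying the order, which is one common way to reduce learning effects across systems.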

Results: Objective Metrics

Results: Subjective Metrics

Conclusions
- UI efficiency (task completion, task duration) and subjective metrics:
  - GUI-only is the most efficient mode
  - Speech-only is the least efficient mode
  - No differences in efficiency among the three multi-modal modes
- Repeating experiments on PDA
  - Different ASR recognition rates
  - Different ASR recognition speed