Multimodal Architecture for Integrating Voice and Ink XML Formats Under the guidance of Dr. Charles Tappert By Darshan Desai, Shobhana Misra, Yani Mulyani,

Multimodal Architecture for Integrating Voice and Ink XML Formats Under the guidance of Dr. Charles Tappert By Darshan Desai, Shobhana Misra, Yani Mulyani, Than NyiNyi

Agenda
Introduction of Architecture
System Architecture
Implemented Design Model
Sample Dialogue Design
InkXML Architecture
Tools Used
Conclusion

Introduction of Architecture
The architecture is generic: it supports the development of multimodal applications that handle integration patterns of speech, ink, and touch-tone digits, and that interpret unimodal speech, ink, or touch-tone input as well as combined multimodal input.
The system consists of Ink and Voice SDKs and a multimodal integrator.
The Voice SDK provides the voice processing capabilities.
The Ink SDK processes the information entered through the ink medium.
The multimodal integrator handles disambiguation and errors, and generates the confirmation feedback.
Dialogue design
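The integrator's role described above can be illustrated with a minimal Python sketch. It is not the project's implementation: the `ModalInput` type, the `integrate` function, the confidence scores, and the rejection threshold are all assumptions introduced here for illustration. The sketch picks the most confident hypothesis among the unimodal or combined inputs and produces the confirmation prompt.

```python
from dataclasses import dataclass


@dataclass
class ModalInput:
    modality: str      # "speech", "ink", or "dtmf" (touch-tone digits)
    text: str          # recognizer's best hypothesis
    confidence: float  # 0.0 to 1.0; assumed to come from the recognizer


# Verb used in the confirmation prompt for each modality.
CONFIRM_VERB = {"speech": "say", "ink": "write", "dtmf": "enter"}


def integrate(inputs, reject_below=0.3):
    """Choose one interpretation from unimodal or combined input.

    Returns (text, prompt). text is None when every hypothesis is
    rejected, in which case prompt is an error message.
    """
    usable = [i for i in inputs if i.confidence >= reject_below]
    if not usable:
        return None, "Sorry, I did not understand. Please try again."
    best = max(usable, key=lambda i: i.confidence)
    verb = CONFIRM_VERB.get(best.modality, "say")
    return best.text, f"Did you {verb} {best.text}?"


# Combined input: the ink hypothesis is more confident, so it wins.
text, prompt = integrate([
    ModalInput("speech", "existing account", 0.6),
    ModalInput("ink", "existing account", 0.9),
])
```

A real integrator would also need to align inputs in time and merge partial information across modalities; this sketch only covers selection, rejection, and confirmation.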

System Architecture

Implemented Design Model
[Diagram. Components shown: PSTN; database; VoiceXML browser; InkXML interpreter (Java/C++); ink input device; Cisco router; speech-to-text engine; TTS engine; handwriting recognition engine; voice input/output device.]

Sample Dialogue Design (Banking information application)
System: You can access your existing account or you can open a new account. What would you like to do?
User: Check existing account
System: Did you say existing account?
User: Yes
System: Please enter your account number.
User: one eight one four six five
System: Did you write one eight seven four six five?
User: No
System: Sorry, my mistake. Please enter your account number.
User: one eight one four six five
System: Did you write one eight one four six five?
User: Yes
System: Please speak your four-digit PIN.
User: one two three four
System: Did you say one two three four?
User: Yes
System: Please use the ink to input your full name.
[Control passes to the ink medium. The system waits for the user to input the new text and submit it.]
System: Did you write Haeey Potter?
User: No
System: Sorry, my mistake. Please use the ink to input your full name.
[Control passes to the ink medium. The system waits for the user to input the new text and submit it.]
System: Did you write Harry Potter?
User: Yes
System: Thank you for accessing your account.
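The confirm-and-retry pattern in this dialogue can be sketched in Python as follows. The function name `confirm_field` and the two callables are hypothetical stand-ins for the voice or ink front end, and the attempt limit is an assumption; the slides do not say how many retries the system allows.

```python
def confirm_field(prompt, recognize, confirm, verb="write", max_attempts=3):
    """Ask for a value, read it back, and re-prompt until the user confirms.

    recognize(prompt) returns the recognized text.
    confirm(question) returns True for "Yes" and False for "No".
    Returns the confirmed value, or None after max_attempts failures.
    """
    for attempt in range(max_attempts):
        # After a rejected read-back, apologize before asking again.
        spoken = prompt if attempt == 0 else "Sorry, my mistake. " + prompt
        value = recognize(spoken)
        if confirm(f"Did you {verb} {value}?"):
            return value
    return None


# Scripted replay of the account-number exchange from the sample dialogue:
# the first recognition is wrong and rejected, the second is confirmed.
hypotheses = iter(["one eight seven four six five",
                   "one eight one four six five"])
replies = iter([False, True])
account = confirm_field(
    "Please enter your account number.",
    recognize=lambda spoken: next(hypotheses),
    confirm=lambda question: next(replies),
)
```

Passing the recognizer and the confirmation step as callables keeps the loop independent of modality, so the same routine could serve the spoken PIN (with `verb="say"`) and the handwritten name.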

Sample Dialogue Design (cont.)
System: Choose personal information, checking or savings.
User: personal information
System: Did you say personal information?
User: Yes
System: What would you like to do? Access your information or change your information.
User: Change information
System: Did you say change information?
User: Yes
System: Would you like to change the address or telephone number or exit?
User: Address
System: Did you say address?
User: Yes
System: Please enter your new address by ink.
[Control passes to the ink medium. The system waits for the user to input the new text and submit it. Once the user has submitted the data, control switches back to voice.]
System: Did you write one martine av white plains new york one zero six zero three?
User: Yes
System: Your address has been changed.
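The hand-off of control between voice and ink in this part of the dialogue can be modeled as a list of steps that each name their modality. The following Python sketch is an assumption about one way to structure it; the `STEPS` table, the `run` function, and the reader callables are hypothetical and not taken from the project.

```python
# Each step: (name, modality that holds control, prompt).
# A prompt may refer to earlier answers by step name.
STEPS = [
    ("menu", "voice", "Choose personal information, checking or savings."),
    ("action", "voice",
     "Access your information or change your information."),
    ("field", "voice",
     "Would you like to change the address or telephone number or exit?"),
    ("new_address", "ink", "Please enter your new address by ink."),
    ("confirm", "voice", "Did you write {new_address}?"),
]


def run(steps, read_voice, read_ink):
    """Walk the steps, passing control to the modality each step names.

    Returns the collected answers and the sequence of modalities used.
    """
    answers = {}
    trace = []
    for name, modality, prompt in steps:
        reader = read_ink if modality == "ink" else read_voice
        trace.append(modality)
        answers[name] = reader(prompt.format(**answers))
    return answers, trace


# Scripted user: three spoken menu choices, one ink entry, one spoken "yes".
spoken = iter(["personal information", "change information", "address", "yes"])
answers, trace = run(
    STEPS,
    read_voice=lambda prompt: next(spoken),
    read_ink=lambda prompt: "one martine av white plains new york",
)
```

Here `trace` records that control moves to ink for the address step and returns to voice for the confirmation, which mirrors the bracketed note in the dialogue.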

InkXML
InkXML's primary goal is to bring the full power of web development and content delivery to ink applications.
InkXML enables the exchange of virtual ink among devices such as handhelds, laptops, desktops, and servers.
InkXML will provide the ink component of web-based multimodal applications.
Numerous standards already exist that are closely related to, or could be used to represent, digital ink (e.g., ITU T.150, UNIPEN, and Jot).
InkXML has two kinds of requirements: functional requirements, which enumerate the functions required by ink applications, and pragmatic requirements, which make InkXML usable and efficient for developing ink applications.
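To make the idea of exchanging ink between devices concrete, here is a minimal Python sketch that serializes pen strokes to XML and parses them back. The element names `ink` and `stroke` and the point encoding are hypothetical choices for illustration only; they are not the InkXML schema, which the slides do not show.

```python
import xml.etree.ElementTree as ET


def strokes_to_xml(strokes):
    """Serialize strokes (lists of integer (x, y) points) to an XML string.

    Element names here are illustrative, not the actual InkXML format.
    """
    root = ET.Element("ink")
    for stroke in strokes:
        element = ET.SubElement(root, "stroke")
        element.text = " ".join(f"{x},{y}" for x, y in stroke)
    return ET.tostring(root, encoding="unicode")


def xml_to_strokes(document):
    """Parse the XML produced by strokes_to_xml back into point lists."""
    root = ET.fromstring(document)
    return [
        [tuple(int(v) for v in point.split(","))
         for point in (element.text or "").split()]
        for element in root.findall("stroke")
    ]


# Two strokes captured on one device, sent as text, restored on another.
captured = [[(10, 20), (15, 25), (22, 31)], [(40, 20), (40, 35)]]
document = strokes_to_xml(captured)
restored = xml_to_strokes(document)
```

A real ink format would also carry information such as timing, pressure, and device resolution; this sketch keeps only coordinates to show the round trip.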

InkXML Architecture
[Diagram. Labels as extracted: Application; SDK Library; Ink Log Generator; API; Event Handler; Driver; Pen Hardware.]

Tools Used
Software: VoiceXML gateway (Nuance Voice Server); Tomcat server; Ink SDK (IBM); Windows 2000 Server; Pingtel softphone (for SIP dial-up)
Hardware: Wacom pen tablet; Cisco 2600 router with FXO card; enterprise server; microphone and speakers

Conclusion
The proposed architecture for developing multimodal voice/ink applications for noisy mobile environments combines different input modalities to facilitate the development of robust and friendly multimodal applications with superior error handling. We envision that users will soon employ smart devices, such as wireless phones with integrated pen tablets and more powerful processing capabilities, to take full advantage of the proposed multimodal voice/ink architecture. Such smart devices should be able to perform enhanced media processing locally, such as voice recognition, speech synthesis, and handwriting recognition. Graphic generation capabilities on the user's pen tablet should also enhance the efficiency of multimodal applications and may allow for the development of applications for a broader spectrum of the population, including permanently and temporarily disabled users.

Advisors
Dr. Charles Tappert
Dr. Zouheir Trabelsi
Yi-Min Chee (IBM T.J. Watson)
Dr. Michael Perrone (IBM T.J. Watson)