VoiceXML: A Field Evaluation By: Kristy Bradnum Supervisor: Peter Clayton Presented in partial fulfilment of the CS Honours Project.

Slides:



Advertisements
Similar presentations
(1) VoiceXML Overview, Opportunities & Challenges Hitesh Kr. Seth Chief Technology Evangelist SeraNova, Inc OReilly Conference.
Advertisements

.NET Technology. Introduction Overview of.NET What.NET means for Developers, Users and Businesses Two.NET Research Projects:.NET Generics AsmL.
A Taste of Visual Studio 2005 David Grey. Introduction In this session we will introduce Visual Studio 2005 and its features and examine those features.
INTEGRATION OF VOICE SERVICES IN INTERNET APPLICATIONS By Eduardo Carrillo (lecturer), J. J Samper, J.J. Martínez-Durá Universidad Autónoma de Bucaramanga.
1 Open Source Grammars David Thomson CTO, SpeechPhone (VoiceXML Tools Committee chair)
1 A Test Automation Tool For Java Applets Testing of Web Applications TATJA Program Demonstration Conclusions By Matthew Xuereb.
Automatic Switchboard Operator Luboš Šmídl, Tomáš Valenta Department of Cybernetics Faculty of Applied Sciences University of West Bohemia in Pilsen.
Collaborative Customer Relationship Management (CCRM) User Group June 23 rd, 2004.
Rob Marchand Genesys Telecommunications
Trnsport Test Suite Project Tony Compton, Texas DOT Charles Engelke, Info Tech.
Voice XML Team 1 Matt Ganis, Jonathan Hill, Henry Wong Anne I. Mannette-Wright Team 1 Matt Ganis, Jonathan Hill, Henry Wong Anne I. Mannette-Wright.
Which development tool is right for you? Commercial Tools John Fuentes – Principal Solutions Architect
XISL language XISL= eXtensible Interaction Sheet Language or XISL=eXtensible Interaction Scenario Language.
VoiceXML: A Field Evaluation by: Kristy Bradnum Supervisor: Peter Clayton.
Speech in.NET Sphinx CMU November Presenter casey chesnut brains-N-brawn.com – Web Services – Mobile / Wireless – Speech.
Copyright 2004 Monash University IMS5401 Web-based Systems Development Topic 2: Elements of the Web (g) Interactivity.
Lets Talk 9+ Emulator e-Tech for Tots CS590 - Ashok Sahu.
The State of the Art in VoiceXML Chetan Sharma, MS Graduate Student School of CSIS, Pace University.
Pace VoiceXML Absentee System Paul Visokey, Ping Gallivan, Yani Mulyani, Lisa Jordan, Elaine Li, George Mathew, Qisheng Hong Presenter Name : Paul Visokey.
Template-based framework for building Multi-language VoiceXML application.
Template-based framework for building VoiceXML application Jonathan Law.
Illinois Institute of Technology
Multimodal Architecture for Integrating Voice and Ink XML Formats Under the guidance of Dr. Charles Tappert By Darshan Desai, Shobhana Misra, Yani Mulyani,
1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System Supervisor: Prof Michael Lyu Presented by: Lewis Ng,
VoiceXML Basic COCOMO Calculator By Greg Kutcher.
Find The Better Way Expand Your Voice with VXML May 10 th, 2005.
4.01B Authoring Languages and Web Authoring Software 4.01 Examine webpage development and design.
Installing Windows XP Professional Using Attended Installation Slide 1 of 41Session 2 Ver. 1.0 CompTIA A+ Certification: A Comprehensive Approach for all.
VoiceXML Builder Arturo Ramirez ACS 494 Master’s Graduate Project May 04, 2001.
2 2 CHAPTER Application Software. Competencies 1. Common software features 2. Word processors 3. Spreadsheets 4. Database management systems 5. Presentations.
UWSP Web Speech Research Group Joe Frost Mark Stenerson Professor Dave Gibbs Presentation to AITP Monday, October 17, 2005.
Introduction to Silverlight. Slide 2 What is Silverlight? It’s part of a Microsoft Web platform called Rich Internet Applications (RIA) There is a service.
Conversational Applications Workshop Introduction Jim Larson.
Introduction OF Enterprise Application Development.
ITCS 6010 SALT. Speech Application Language Tags (SALT) Speech interface markup language Extension of HTML and other markup languages Adds speech and.
Creating Speaking Web Pages: The Text-to-Speech Integrated Development Environment (TTS-IDE) David C. Gibbs Department of Mathematics and Computing University.
VoiceXML Brandon Hannasch. Outline What is VoiceXML? Basic Tags Voice Recognition Audio Files Call Flow.
WS-Security: SOAP Message Security Web-enhanced Information Management (WHIM) Justin R. Wang Professor Kaiser.
Integrating VoiceXML with SIP services
1 David Thomson The Search for a Dialog Metalanguage that Makes Everybody Happy David Thomson Chair, VoiceXML Tools Committee, SpeechPhone CTO.
Project By: Brent Elder, Mike Holovka, Hisham Algadaibi.
Archivists' Toolkit - CRADLE Presentation, 10 Feb The Archivists’ Toolkit CRADLE Presentation 10 Feb
Speech Technologies and VoiceXML try Department of Computer Science National Cheng-Chi University.
The Voice-Enabled Web: VoiceXML and Related Standards for Telephone Access to Web Applications 14 Feb Christophe Strobbe K.U.Leuven - ESAT-SCD-DocArch.
Outline Grammar-based speech recognition Statistical language model-based recognition Speech Synthesis Dialog Management Natural Language Processing ©
Spoken Dialog Systems and Voice XML Lecturer: Prof. Esther Levin.
1 Geospatial and Business Intelligence Jean-Sébastien Turcotte Executive VP San Francisco - April 2007 Streamlining web mapping applications.
1 © 2003, Cisco Systems, Inc. All rights reserved. Proprietary and Confidential Unity Connection 2.0(1) Miu Architectural Overview TOI June 10, 2007 Mike.
C OMPUTING E SSENTIALS Timothy J. O’Leary Linda I. O’Leary Presentations by: Fred Bounds.
Introduction of Geoprocessing Lecture 9. Geoprocessing  Geoprocessing is any GIS operation used to manipulate data. A typical geoprocessing operation.
Listener-Control Navigation of VoiceXML. Nuance Speech Analysis 92% of customer service is through phone. 84% of industrialists believe speech better.
4.01B Authoring Languages and Web Authoring Software 4.01 Examine webpage development and design.
1 Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents S. Kawamoto, et al. October 27, 2004.
Introduction to UNIX CS465. What is UNIX? (1) UNIX is an Operating System (OS). An operating system is a control program that allocates the computer's.
Presentation Title 1 1/27/2016 Lucent Technologies - Proprietary Voice Interface On Wireless Applications Protocol A PDA Implementation Sherif Abdou Qiru.
Java Programming: Advanced Topics1 Introduction to Advanced Java Programming Chapter 1.
® IBM Software Group © 2003 IBM Corporation IBM WebSphere Studio V5.1.2: Making Java Development Easier May 2004.
Opportunities in Deploying Open Source Applications Using LumenVox Speech Recognition on Asterisk.
Speech Processing 1 Introduction Waldemar Skoberla phone: fax: WWW:
VoiceXML. Nuance Speech Analysis 92% of customer service is through phone. 84% of industrialists believe speech better than web.
Presented By Sharmin Sirajudeen S7 CS Reg No :
Introduction ITEC 420.
Recent trends in estimation methodologies
Introduction to Advanced Java Programming
CHAPTER 8 Multimedia Authoring Tools
Introduction to Silverlight
SALT & The Microsoft Speech Application SDK
.NET and .NET Core Foot View of .NET Pan Wuming 2017.
An Introduction to Linux
VoiceXML An investigation Author: Mya Anderson
Presentation transcript:

VoiceXML: A Field Evaluation By: Kristy Bradnum Supervisor: Peter Clayton Presented in partial fulfilment of the CS Honours Project

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Overview Objective of Research Background Aims & Motivation Methodology Tools Results Conclusions Questions

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Objective of Research My project in a nutshell: An evaluation of VoiceXML 2.0, using a range of platforms, looking specifically at its maturity as a technology and its status as an industry standard. Objective of Research

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Background Overview of Speech Technology Overview of VoiceXML History Role Background

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Overview of Speech Technology Conversion between spoken word and binary Output Previously: pre-recorded prompts Today: speech synthesis (TTS) Input Previously: DTMF (pressing keys on the phone) Today: speech recognition (ASR) Background >> Overview of Speech Technology

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 History of VoiceXML AT&T Bells PhoneWeb project Lucents Phone Markup Language Motorolas VoxML IBMs SpeechML VoiceXML version 2.0 full W3C recommendation 16 March 2004 Background >> Overview of VoiceXML

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Role of VoiceXML Background >> Overview of VoiceXML

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Motivation 2002 Mya Andersons Field Investigation of VoiceXML 1.0 New technology Unstable Unsuccessful Now VoiceXML 2.0 = W3C standard Nortel: maturity increasing, widely accepted Jackson: already mature Project Aims & Motivation

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Project Aims Investigate these claims Examine: maturity of VoiceXML 2.0 as a technology its status as an industry standard Project Aims & Motivation

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Methodology Approach Tools WebSphere OptimTalk BeVocal Café Analysis Cross-Platform Analysis Methodology

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Approach Iterative approach Set goal Evaluate outcomes Determine next goal ROSS prototype Relevant to Rhodes Product secondary to investigation Methodology >> Approach Inadequate

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Tools 3 approaches [Seth]: Buy Rent Build 3 environments [Beasley et al]: Hosted Simulated Web-based Methodology >> Tools

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Project Tools WebSphere Studio Application Developer with Voice Toolkit OptimTalk BeVocal Café 2.5 Methodology >> Tools

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 WebSphere IBMs WebSphere Studio Application Developer with Voice Toolkit plug-in Buy approach Voice Toolkit also includes other features: CCXML developer NLU model maintenance Call Flow Builder Grammar developer Pronunciation Builder but Version problems Methodology >> Tools >> WebSphere

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 OptimTalk Simple VoiceXML platform Desktop standalone development environment Set of libraries interpret W3C SIF markup languages Tailored towards research Command line application Requirements: microphone and speakers Methodology >> Tools >> OptimTalk

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 BeVocal Cafe Good background Web-based development environment Hosted platform Rent approach Tools: VoiceXML Checker Vocal Scripter Methodology >> Tools >> BeVocal Café

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Analysis Each platform studied separately 15 examples for OptimTalk Basic + Blackjack 10 projects for BeVocal Café Millers 10 Projects to Voice-Enable Your Web Site Methodology >> Analysis

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Cross-Platform Analysis BeVocal Cafés projects in OptimTalk OptimTalks examples in the Café Run amendments through original platform Methodology >> Cross-Platform Analysis

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Results Platform Independence OptimTalk BeVocal Café Grammars Design Considerations Platform Certification Results

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Platform Independence Environments: steep learning curve Extensible tag set Limits platform independence Proprietary extensions Some features added – some left out Example code usually worked Results >> Platform Independence

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 OptimTalk Results >> Platform Independence >> OptimTalk

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Results >> Platform Independence >> OptimTalk OptimTalk

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 OptimTalk Demo Results >> Platform Independence >> OptimTalk

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 OptimTalk Speech recognition component erratic Built-in grammars not supported in OptimTalk type attribute of Boolean = yes / no grammar Number = ? No Phone numbers (from database) Results >> Platform Independence >> OptimTalk

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 BeVocal Café More mature Very confusing error messages Many proprietary extensions Results >> Platform Independence >> BeVocal Café

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004

Grammars Built-in grammars not supported in OptimTalk OptimTalks grammar not supported by BeVocal W3C passes responsibility to SIF Platforms should support ABNF of SRGS Results >> Grammars

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Design Considerations No long menus Pronunciation Different voices for TTS ROSS by LH Michael vs Microsoft Sam Be careful with ASR Ties in with grammar No break in in OptimTalk So lists run together Results >> Design Considerations

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Platform Certification VoiceXML Forums Platform Certification Program Test Suite v test programs To check compliancy with VoiceXML platforms passed (in September) NVP VoxPilot Open Media Platform VoiceGenie Platform Results >> Platform Certification

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Conclusions Learnt a lot about Speech technology Language does seem fairly mature now Fewer extensions More complete as a standard Still not quite stable Conclusions

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Conclusions Give platforms time to catch up Still hurdles in development Especially in South Africa Possibly better for commercial enterprises Not for research But improving all the time Conclusions

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Future work Windows vs Linux comparison Look at W3C Speech Interface Framework as a whole VoiceXML 2.1 is on its way Conclusion >> Future Works

VoiceXML: A Field Evaluation Kristy Bradnum – Computer Science Honours 2004 Questions Questions ???