Listener Controlled Navigation of VoiceXML Documents Gopal Gupta N. Annamalai, H. Reddy Dept. of Computer Science UT Dallas.

Slides:



Advertisements
Similar presentations
Chapter 6 Server-side Programming: Java Servlets
Advertisements

Demonstration Files for the HDL Plug-in for Acrobat The HDL Plug-in for Adobe Acrobat and Acrobat Reader is an extension that adds functionality to PDF.
Cheryl Jelks Trainer/Applications Support Analyst Richland School District One.
CGI & HTML forms CGI Common Gateway Interface  A web server is only a pipe between user-agents  and content – it does not generate content.
JavaScript FaaDoOEngineers.com FaaDoOEngineers.com.
HTML5 ETDs Edward A. Fox, Sung Hee Park, Nicholas Lynberg, Jesse Racer, Phil McElmurray Digital Library Research Laboratory Virginia Tech ETD 2010, June.
A really fairly simple guide to: mobile browser-based application development (part 1) Chris Greenhalgh G54UBI / Chris Greenhalgh
Introduction to MVC Adding a View Page NTPCUG Tom Perkins, Ph.D.
1 CSC 551: Web Programming Spring 2004 client-side programming with JavaScript  scripts vs. programs  JavaScript vs. JScript vs. VBScript  common tasks.
The Web Warrior Guide to Web Design Technologies
Chapter 51 Scripting With JSP Elements JavaServer Pages By Xue Bai.
Chapter 31 Basic Form-Processing Techniques JavaServer Pages By Xue Bai.
XISL language XISL= eXtensible Interaction Sheet Language or XISL=eXtensible Interaction Scenario Language.
Using Schema Matching to Simplify Heterogeneous Data Translation Tova Milo, Sagit Zohar Tel Aviv University.
Writing HTML specifications: A HTML specification for a WWW document must be stored in a plain text file The text file must not contain any control characters.
The State of the Art in VoiceXML Chetan Sharma, MS Graduate Student School of CSIS, Pace University.
Pace VoiceXML Absentee System Paul Visokey, Ping Gallivan, Yani Mulyani, Lisa Jordan, Elaine Li, George Mathew, Qisheng Hong Presenter Name : Paul Visokey.
Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.
XML Views El Hazoui Ilias Supervised by: Dr. Haddouti Advanced XML data management.
DT211/3 Internet Application Development
Query Rewriting for Extracting Data Behind HTML Forms Xueqi Chen, 1 David W. Embley 1 Stephen W. Liddle 2 1 Department of Computer Science 2 Rollins Center.
Week 2 IBS 685. Static Page Architecture The user requests the page by typing a URL in a browser The Browser requests the page from the Web Server The.
1 Computing for Todays Lecture 22 Yumei Huo Fall 2006.
VoiceXML Application Systems Team By: Shawn Ramdass, Saji Abraham & Billy Santamorena.
Tutorial 3: Adding and Formatting Text. 2 Objectives Session 3.1 Type text into a page Copy text from a document and paste it into a page Check for spelling.
AN EXTENSIBLE TRANSCODER FOR HTML TO VOICEXML CONVERSION by Narayanan Annamala Gopal Gupta B. Prabhakaran DEPARTMENT OF COMPUTER SCIENCE THE UNIVERSITY.
VoiceXML Builder Arturo Ramirez ACS 494 Master’s Graduate Project May 04, 2001.
INTRODUCTION TO WEB DATABASE PROGRAMMING
1 CS 3870/CS 5870 Static and Dynamic Web Pages ASP.NET and IIS.
_______________________________________________________________________________________________________________ E-Commerce: Fundamentals and Applications1.
AN EXTENSIBLE TRANSCODER FOR HTML TO VOICEXML CONVERSION by Narayanan Annamalai B.E. Master’s Thesis Advisors: Dr. Gopal Gupta and Dr. B Prabhakaran THE.
Integrating XML with Microsoft SQL Server ©NIITeXtensible Markup Language/Lesson 9/Slide 1 of 31 Objectives In this lesson, you will learn to: * Generate.
Aurora: A Conceptual Model for Web-content Adaptation to Support the Universal Accessibility of Web-based Services Anita W. Huang, Neel Sundaresan Presented.
Conversational Applications Workshop Introduction Jim Larson.
XP 1 CREATING AN XML DOCUMENT. XP 2 INTRODUCING XML XML stands for Extensible Markup Language. A markup language specifies the structure and content of.
ITCS 6010 SALT. Speech Application Language Tags (SALT) Speech interface markup language Extension of HTML and other markup languages Adds speech and.
Publishing a Macromedia Flash Movie – Lesson 131 Publishing a Macromedia Flash Movie Lesson 13.
E-Commerce: Introduction to Web Development 1 Dr. Lawrence West, Management Dept., University of Central Florida Topics What is a Web.
Creating Speaking Web Pages: The Text-to-Speech Integrated Development Environment (TTS-IDE) David C. Gibbs Department of Mathematics and Computing University.
Spoken dialog for e-learning supported by domain ontologies Dario Bianchi, Monica Mordonini and Agostino Poggi Dipartimento di Ingegneria dell’Informazione.
Integrating VoiceXML with SIP services
Murach’s ASP.NET 4.0/VB, C1© 2006, Mike Murach & Associates, Inc.Slide 1.
10/13/2015 ©2006 Scott Miller, University of Victoria 1 Content Serving Static vs. Dynamic Content Web Servers Server Flow Control Rev. 2.0.
Creating Dynamic Web Pages Using PHP and MySQL CS 320.
XSLT Kanda Runapongsa Dept. of Computer Engineering Khon Kaen University.
Extending HTML CPSC 120 Principles of Computer Science April 9, 2012.
Spoken Dialog Systems and Voice XML Lecturer: Prof. Esther Levin.
AN EXTENSIBLE TRANSCODER FOR HTML TO VOICEXML CONVERSION by Narayanan Annamalai B.E. Master’s Thesis Advisors: Dr. Gopal Gupta and Dr. B Prabhakaran THE.
Semantically Processing The Semantic Web Presented by: Kunal Patel Dr. Gopal Gupta UNIVERSITY OF TEXAS AT DALLAS.
Mir Farooq Ali Computer Science, Virginia Tech May 9, 2003 Building Multi-platform User Interfaces using UIML.
DAWN: Dynamic Aural Web Navigation Gopal Gupta, S. Sunder Raman, Mike Nichols, H. Reddy, N. Annamalai Department of Computer Science University of Texas.
Server-side Programming The combination of –HTML –JavaScript –DOM is sometimes referred to as Dynamic HTML (DHTML) Web pages that include scripting are.
AxKit A member of the Apache XML project Ryan Maslyn Kyle Bechtel.
Chapter 29 World Wide Web & Browsing World Wide Web (WWW) is a distributed hypermedia (hypertext & graphics) on-line repository of information that users.
Forms Collecting Data CSS Class 5. Forms Create a form Add text box Add labels Add check boxes and radio buttons Build a drop-down list Group drop-down.
IS-907 Java EE World Wide Web - Overview. World Wide Web - History Tim Berners-Lee, CERN, 1990 Enable researchers to share information: Remote Access.
Listener Controlled Navigation of VoiceXML Documents Gopal Gupta N. Annamalai, H. Reddy Dept. of Computer Science UT Dallas.
Listener-Control Navigation of VoiceXML. Nuance Speech Analysis 92% of customer service is through phone. 84% of industrialists believe speech better.
VoiceXML Version 2.0 Jon Pitcherella. What is it? A W3C standard for specifying interactive voice dialogues. Uses a “voice” browser to interpret documents,
Introduction  “M” “V” “C” stands for “MODEL” “VIEW” “CONTROLLER”. ASP.NET MVC is an architecture to develop ASP.NET web applications in a different manner.
Document Clustering for Natural Language Dialogue-based IR (Google for the Blind) Antoine Raux IR Seminar and Lab Fall 2003 Initial Presentation.
Dr. Abdullah Almutairi Spring PHP is a server scripting language, and a powerful tool for making dynamic and interactive Web pages. PHP is a widely-used,
VoiceXML. Nuance Speech Analysis 92% of customer service is through phone. 84% of industrialists believe speech better than web.
Rendering XML Documents ©NIITeXtensible Markup Language/Lesson 5/Slide 1 of 46 Objectives In this session, you will learn to: * Define rendering * Identify.
Generating XML Data from a Database Eugenia Fernandez IUPUI.
XP Creating Web Pages with Microsoft Office
PHP using MySQL Database for Web Development (part II)
SALT & The Microsoft Speech Application SDK
What is Perl? PERL--Practical Extraction and Report Language
VoiceXML An investigation Author: Mya Anderson
Presentation transcript:

Listener Controlled Navigation of VoiceXML Documents Gopal Gupta N. Annamalai, H. Reddy Dept. of Computer Science UT Dallas

VoiceWEB VoiceXML: The open-standard language for serving voice/audio documents Voice/audio documents can be browsed using a voice browser with speaker & microphone or using the regular phone Voice browser:VoiceXML:: Browser:HTML Voice browser:VoiceXML:: Browser:HTML

VoiceXML (Cont’d) VoiceXML allows scripts/CGIs etc. Can take input from the listener via speech (fill out forms like in HTML) Used extensively for automated call handling. Makes info. accessible over (cell)phones. The next revolution on the WEB.

Systems developed by our LAB Our Lab (ALPS lab, UTD) has developed two systems to automatically convert HTML to Dynamic VoiceXML. They are: A.HTML to VoiceXML Transcoder (in Java, Initial Prototype in Prolog) B.Dynamic VoiceXML Generator (using dynamic SRGS Grammar)

HTML to VoiceXML Transcoder HTML file cannot be converted in a tag-by-tag basis or sentence-by-sentence basis. The structure of the HTML file should be transported to the VoiceXML file. HTML file is parsed and the root node of the input file is obtained. Any HTML file’s root node would be the node Transcoding is done in two phases. i. i.HTML file is parsed into a Document tree ii. ii.Nodes of the tree are converted to VoiceXML using a mapping function

HTML Parsing Sample Example 1 Hello World Input HTML file (htmlRoot = new RootNode()).addNode(new PageNode().addNode(new HeadNode().addNode(new TitleNode().addNode(new StringNode().setHtmlData(“Example1”)) ) //end TitleNode ) //end HeadNode.addNode(new BodyNode().addNode(new H1Node().setAlign(``center’’).addNode(new StringNode().setHtmlData( ``Hello World ‘’)) ) // end H1 Node ) // end Body Node ) //end PageNode

Translation Logic The entire VXML page should have only blocks and forms. HTML form and VoiceXML form - basic difference is submission method and form declaration. Automatic name generation required for VXML forms. Forms are used for collecting inputs from user. Input obtained through more than one type.

Dynamic VoiceXML Generator In the following slides we will introduce the Dynamic VoiceXML Generator (DVG). Static VoiceXML which is output from the Transcoder is fed as input to the DVG. DVG adds dynamic Grammar and direction control elements to the static VoiceXML document to enable anchoring and recalling of marks. The dynamic VoiceXML document allows the user to mark a portion of the document with any user specified name.

Problem with VoiceXML Navigation of the voice document is completely controlled by the page author After each dialog (form) the author has to ask where the listener will like to go next Listener has absolutely no control over navigation. Tedium, Adv. Applications not possible Analogy: Scroll vs a book

Our Solution: Voice Anchors Voice anchors are speech labels that listeners can place on a dialog. Listener can return to that dialog later by uttering that label. Hard to implement this concept, as free- form speech recognition is not possible. Need to incorporate it in the voice browser

System Architecture

Voice Anchors We have developed a number of methods for attaching voice anchors. Most practical method: via spelling The user can state the anchor as a whole word and return to the dialoged labeled Can also have default anchors (turning a scroll into a book). Can also have a no. of default navigation strategies. E.g. skim section headings first

Applications Our system finds its application in the following two scenarios  Result of a database query is a plain VoiceXML document and the listener likes to navigate through it  Mobile User wishes to navigate through a textually rich HTML document while driving