AN EXTENSIBLE TRANSCODER FOR HTML TO VOICEXML CONVERSION by Narayanan Annamalai B.E. Master’s Thesis Advisors: Dr. Gopal Gupta and Dr. B Prabhakaran THE.

Presentation transcript:

AN EXTENSIBLE TRANSCODER FOR HTML TO VOICEXML CONVERSION by Narayanan Annamalai B.E. Master’s Thesis Advisors: Dr. Gopal Gupta and Dr. B Prabhakaran THE UNIVERSITY OF TEXAS AT DALLAS May 2002

The Scenario
- By One billion people will use wireless devices.
- By Half of them will have Internet connectivity.
- Growth far surpasses that of wire-bound Internet users.
- New technology is needed to support the large number of customers who use handheld devices for Internet access.
- The technology should be easy to use and efficient.
- The right choice is speech recognition.

The talk is organized in the following manner:
- Motivation to solve the problem
- Introduction to basic concepts in VoiceXML
- Objectives
- System Model and Assumptions
- Translation Logic
- Extensible Feature
- Conclusion and Future Work

Motivation  Drawback of Existing Web Infrastructure – content  Users of WAP – not satisfied  Not feasible to maintain multiple versions Client WEB SERVER (content in format A) FORMAT TRANSLATOR (Convert A to B) Request B B A B

Related Work
- The visually impaired have used screen readers.
- Frankie James proposed the Auditory HTML Access System (AHA), which used distinct tones.
- Neither of these two systems has an interactive feature.
- Stuart Goose et al. proposed an HTML to VoXML converter. VoXML is the ancestor of VoiceXML.
- Gopal Gupta et al. proposed a denotational-semantics-based approach, which dealt with a subset of tags.

Present Scenario
[Figure: Mobile User, PSTN, Voice Server, INTERNET, Transcoder, WEB SERVER; flows labelled Req., http req., html, VoiceXML, Audio.]

Objectives  Provide means for Visually impaired to access the Web.  Strive to express the structure of HTML pages in Voice form.  Application can be custom made with respect to User’s wish.  Make the transcoder extensible – to accommodate new HTML tags in future

What is VoiceXML?
- VoiceXML is a standard developed by the VoiceXML Forum (AT&T, Motorola, IBM, Lucent).
- It is a markup language used for creating human-computer interfaces over the telephone.
- The user can interact with a VoiceXML page through spoken or DTMF (telephone key press) inputs.
- It plays synthesized speech, produced by TTS (text-to-speech) converters, and audio files.

HTML vs VoiceXML
HTML:
1. Single unit, presented with full efficiency.
2. Displays several inputs at the same time.
3. Input does not need any grammar for validation.
VoiceXML:
1. Consists of forms and blocks alone.
2. Inputs are collected sequentially.
3. Every input needs a grammar for validation.

Assumptions
The input HTML file needs to comply with the following rules:
- Every open tag should have a corresponding close tag.
- The input file should be error free.
- The file should use only the tags that are specified in the HTML standard. Some browsers insert special characters during editing.

System Model
The application is realized in two phases:
I. Parsing Phase
II. Translation Phase
Parsing Phase: The input HTML file is parsed and the HTML node tree is obtained as output. The parser used for this purpose is the Web-Wise Systems HTML parser.
Translation Phase: Each HTML node is converted into the corresponding VoiceXML node.

System Architecture
[Figure: Input Provider, Parser, Transcoder, Internal data sheet, External data sheet, Output VoiceXML file.]

Parsing Phase
- An HTML file cannot be converted on a tag-by-tag or sentence-by-sentence basis.
- The structure of the HTML file should be carried over to the VoiceXML file.
- The HTML file is parsed and the root node of the input file is obtained.
- The root node of any HTML file is the <html> node.

Parsing Example
Input HTML file:
    <html>
      <head><title>Example1</title></head>
      <body><h1 align="center">Hello World</h1></body>
    </html>
Output parse tree:
    (htmlRoot = new RootNode())
      .addNode(new PageNode()
        .addNode(new HeadNode()
          .addNode(new TitleNode()
            .addNode(new StringNode().setHtmlData("Example1"))
          ) // end TitleNode
        ) // end HeadNode
        .addNode(new BodyNode()
          .addNode(new H1Node().setAlign("center")
            .addNode(new StringNode().setHtmlData("Hello World"))
          ) // end H1Node
        ) // end BodyNode
      ); // end PageNode
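The parsing phase can be illustrated with a minimal sketch in Python. It is not the Web-Wise Systems parser used in the thesis: it assumes Python's standard `html.parser` module as a stand-in, and the names `Node`, `TreeBuilder`, `parse_html` and `outline` are hypothetical. Like the transcoder, it assumes every open tag has a matching close tag.

```python
from html.parser import HTMLParser


class Node:
    """One node of the parse tree: an HTML tag, or a text string ("#text")."""

    def __init__(self, tag, attrs=None, text=""):
        self.tag = tag
        self.attrs = dict(attrs or [])
        self.text = text
        self.children = []

    def add_node(self, child):
        self.children.append(child)
        return self  # returning self allows chained calls


class TreeBuilder(HTMLParser):
    """Builds a Node tree; assumes well-formed input with paired tags."""

    def __init__(self):
        super().__init__()
        self.root = Node("root")
        self.stack = [self.root]

    def handle_starttag(self, tag, attrs):
        node = Node(tag, attrs)
        self.stack[-1].add_node(node)
        self.stack.append(node)

    def handle_endtag(self, tag):
        # Only pop when the close tag matches the innermost open tag.
        if len(self.stack) > 1 and self.stack[-1].tag == tag:
            self.stack.pop()

    def handle_data(self, data):
        if data.strip():
            self.stack[-1].add_node(Node("#text", text=data.strip()))


def parse_html(source):
    builder = TreeBuilder()
    builder.feed(source)
    builder.close()
    return builder.root


def outline(node, depth=0):
    """Return the tree as indented lines, one line per node."""
    label = repr(node.text) if node.tag == "#text" else node.tag
    lines = ["  " * depth + label]
    for child in node.children:
        lines.extend(outline(child, depth + 1))
    return lines


EXAMPLE = ('<html><head><title>Example1</title></head>'
           '<body><h1 align="center">Hello World</h1></body></html>')

if __name__ == "__main__":
    print("\n".join(outline(parse_html(EXAMPLE))))
```

Tags without a close tag (such as radio inputs) would stay on the stack in this sketch, which is why the matching-close-tag assumption matters.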

Translating Phase: Issues
Translating phase:
- The node tree is traversed recursively (depth first, from left to right).
- Each HTML node is converted to the appropriate VoiceXML node.
Issues:
- Inputs must be verified before submission, which differs from HTML.
- VoiceXML is highly structured and follows strict conventions. For example, <prompt>It is a beautiful city</prompt> is syntactically right, but it can be a child of only a field or a block.
- A one-to-one conversion is not always possible.
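A minimal sketch of the depth-first, left-to-right traversal, written in Python for illustration. It assumes a simplified tree in which a node is either a text string or a `(tag, children)` tuple, and it wraps every piece of text in a block so that no prompt ends up outside a block or field. The function names, the skipped tags and the form id are hypothetical choices, not the thesis implementation.

```python
from xml.sax.saxutils import escape

# Assumption: content of these tags is never spoken.
SKIPPED_TAGS = {"script", "style"}


def collect_blocks(node, blocks):
    """Visit the tree depth first, left to right, appending one block per text."""
    if isinstance(node, str):
        text = node.strip()
        if text:
            blocks.append("<block><prompt>%s</prompt></block>" % escape(text))
        return
    tag, children = node
    if tag in SKIPPED_TAGS:
        return
    for child in children:
        collect_blocks(child, blocks)


def to_voicexml(tree, form_id="form1"):
    """Return a VoiceXML page with a single form holding the spoken blocks."""
    blocks = []
    collect_blocks(tree, blocks)
    lines = ['<vxml version="2.0" xmlns="http://www.w3.org/2001/vxml">',
             '  <form id="%s">' % form_id]
    lines.extend("    " + block for block in blocks)
    lines.extend(["  </form>", "</vxml>"])
    return "\n".join(lines)


if __name__ == "__main__":
    page = ("html", [("body", [("h1", ["Hello World"]),
                               ("p", ["It is a beautiful city"])])])
    print(to_voicexml(page))
```

Because the walk visits children in document order, the spoken order matches the reading order of the HTML page.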

Translation Logic
- The entire VXML page should have only blocks and forms.
- The basic differences between an HTML form and a VoiceXML form are the submission method and the form declaration.
- Automatic name generation is required for VXML forms.
- Forms are used for collecting inputs from the user.
- Input can be obtained through more than one type of control.

Forms: radio tag
- Radio tags provide choices; the user selects one choice.
- In HTML, radio tags do not have a closing tag.
- When one choice is selected, the others become inactive.
- The challenge is to identify the last 'radio' button of the same type.
Example input HTML section: two radio buttons labelled Male and Female, followed by the text "End of Radio".

Forms: radio tag (contd.)
Output VoiceXML section:
    …… Please select an Entrée, what sex Male Female …….
Parse tree:
    Form node
    Radio: male sex
    Radio: female sex
    h1
    String: 'end of radio'
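One way to find the last radio button of a group is to collect consecutive buttons that share a name. The Python sketch below assumes each radio button has already been reduced to a dictionary with hypothetical keys `name`, `value` and `label`, and it emits one VoiceXML `<field>` with `<option>` children per group. The prompt wording and DTMF numbering are illustrative assumptions.

```python
from itertools import groupby
from xml.sax.saxutils import escape, quoteattr


def radios_to_fields(radios):
    """Group consecutive radio buttons by name; return one <field> per group."""
    fields = []
    for name, group in groupby(radios, key=lambda radio: radio["name"]):
        lines = ["<field name=%s>" % quoteattr(name),
                 "  <prompt>Please select one choice for %s.</prompt>" % escape(name)]
        # Number the options so they can also be chosen by key press.
        for number, radio in enumerate(group, start=1):
            lines.append("  <option dtmf=%s value=%s>%s</option>" % (
                quoteattr(str(number)),
                quoteattr(radio["value"]),
                escape(radio["label"])))
        lines.append("</field>")
        fields.append("\n".join(lines))
    return fields


if __name__ == "__main__":
    buttons = [
        {"name": "sex", "value": "male", "label": "Male"},
        {"name": "sex", "value": "female", "label": "Female"},
    ]
    print("\n".join(radios_to_fields(buttons)))
```

A group ends as soon as a button with a different name appears, or when the list runs out, so no closing tag is needed to detect the last button.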

Form: Text Box
- Text boxes and text areas are used to obtain string inputs from the user.
- There is no sample space for strings (e.g., the name of a person).
- VoiceXML inputs always need a grammar.
- <record> elements are used to solve this problem.
- The user can specify the record time and attributes.
- <submit> needs the list of fields and the URL for submission.
- The inputs should be verified with the user before submission.

Form: text box (contd.)
Sample HTML extract:
    ……. Firstname ……..
Corresponding VoiceXML extract:
    At tone, speak First name:
    I did not hear anything, please try again
    Your input is …….
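The prompts above fit naturally into a VoiceXML `<record>` element. The Python sketch below generates such an element for one text box. The function name, the default duration and the use of `<audio expr>` to play the recording back for verification are assumptions for illustration; the thesis may structure its output differently.

```python
from xml.sax.saxutils import escape, quoteattr


def textbox_to_record(name, label, max_seconds=10):
    """Return a <record> element that captures free-form speech for a text box.

    name: variable name of the input (assumed to be a valid identifier)
    label: text shown next to the text box in the HTML page
    max_seconds: maximum recording time, as supplied by the user
    """
    return "\n".join([
        '<record name=%s beep="true" maxtime=%s>' % (
            quoteattr(name), quoteattr("%ds" % max_seconds)),
        "  <prompt>At tone, speak %s</prompt>" % escape(label),
        "  <noinput>I did not hear anything, please try again</noinput>",
        "  <filled>",
        "    <prompt>Your input is <audio expr=%s/></prompt>" % quoteattr(name),
        "  </filled>",
        "</record>",
    ])


if __name__ == "__main__":
    print(textbox_to_record("firstname", "First name", max_seconds=5))
```

Recording avoids the need for a grammar, since no grammar can list every possible name or free-text answer.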

Links
- In HTML, links are given by the <a> tag in two ways: to a different part of the same document, or to a different document altogether.
- In VXML, links are provided by a corresponding element.
- To internal documents: sub-dialogs are created. A sub-dialog is like a function call.
- To external documents: the target HTML URL is converted to a VoiceXML page, so a VoiceXML URL is provided.

List and Image Tags
- HTML has ordered and unordered lists.
- A list contains text, so it can be read out easily.
- The system recognizes an ordered list and speaks out the numbered items.
- Image tags: the description of the image is read out.
HTML extract: an ordered list with the items First and Second.
Audio output: "Beginning of an ordered list. Item 1 First Item 2 Second Ending of an ordered list"
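The audio output for an ordered list can be produced with a few lines of Python. This sketch reproduces the wording of the example above; the function name and the treatment of unordered lists (items spoken without numbers) are assumptions.

```python
def speak_list(items, ordered=True):
    """Return the sentence spoken for a list of text items."""
    kind = "an ordered" if ordered else "an unordered"
    parts = ["Beginning of %s list." % kind]
    for number, item in enumerate(items, start=1):
        # Only ordered lists announce the item number.
        parts.append("Item %d %s" % (number, item) if ordered else item)
    parts.append("Ending of %s list" % kind)
    return " ".join(parts)


if __name__ == "__main__":
    print(speak_list(["First", "Second"]))
```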

Table
- In HTML, tables are used to present information in tabular form.
- A table contains rows and columns. Rows may contain tables, so nested tables are possible.
- The information is text, so it can be read out.
- Our system maintains table numbers, row numbers and column numbers, and differentiates row and column headings.
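A small Python sketch of how table, row and column information might be spoken. It assumes a flat table given as a list of rows, with the first row holding the column headings. The wording of the sentences and the function name are hypothetical, and nested tables are not handled here.

```python
def speak_table(rows, table_number=1, has_header=True):
    """Return the sentences spoken for one table, cell by cell."""
    sentences = ["Beginning of table %d." % table_number]
    headers = rows[0] if has_header else None
    body = rows[1:] if has_header else rows
    if headers:
        sentences.append("Column headings: " + ", ".join(headers) + ".")
    for row_number, row in enumerate(body, start=1):
        for column_number, cell in enumerate(row, start=1):
            # Use the column heading when there is one, else the column number.
            column = headers[column_number - 1] if headers else (
                "column %d" % column_number)
            sentences.append("Row %d, %s: %s." % (row_number, column, cell))
    sentences.append("End of table %d." % table_number)
    return sentences


if __name__ == "__main__":
    for sentence in speak_table([["City", "State"], ["Dallas", "Texas"]]):
        print(sentence)
```

Announcing the row number and column heading with each cell lets a listener keep track of position, which a sighted reader gets from the layout.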

Frames
- Frames are an integral part of HTML.
- The source HTML contains only links to other HTML pages (each link is a separate frame).
- Limitation of the oral medium: all frames cannot be spoken simultaneously.
- Transitions to the frames are provided using a VoiceXML element.
- The HTML URLs are converted into VoiceXML pages.
- All frame URLs are stored in a separate array and transcoded to VoiceXML recursively.

Text Display Tags
- Tags used for display do not make much sense in VoiceXML.
- The function of some display tags can be spoken out orally.
- ……. and ……. are tags used to speak out the text enclosed between them.
- The content to be spoken can be tailored using the Interface sheet.
- The Interface sheet is also used to add new HTML tags, making the system extensible.

Extensible Feature of Transcoder
A. Input Attributes
    Input duration in seconds for Text-box :
    Input duration in seconds for Text-Area :
    ………….
B. HTML Tags and Corresponding Text spoken
    …………    Starting of text quoted from elsewhere
    …………    Ignore
    …………..
Row A: Input attributes can be supplied by the user.
Row B: The treatment of HTML tags can be altered or ignored. New tags can be added in this section.
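The interface sheet can be thought of as a lookup table from HTML tags to the text spoken for them. The Python sketch below is one possible representation, not the format used in the thesis: the tag names `blockquote`, `font` and `marquee`, the `IGNORE` marker, and both function names are hypothetical, and unknown tags are assumed to be ignored.

```python
IGNORE = None  # marker: the tag is not announced

# Hypothetical default sheet; the tag names are illustrative only.
DEFAULT_SHEET = {
    "blockquote": "Starting of text quoted from elsewhere",
    "font": IGNORE,
}


def merge_sheets(defaults, user_sheet):
    """Return a new sheet in which user entries override or extend the defaults."""
    merged = dict(defaults)
    merged.update(user_sheet)
    return merged


def announcement(tag, sheet):
    """Return the text spoken when the tag opens, or None if it is ignored."""
    return sheet.get(tag.lower())


if __name__ == "__main__":
    # A user adds a new tag and silences an existing one.
    sheet = merge_sheets(DEFAULT_SHEET, {"marquee": "Scrolling text",
                                         "blockquote": IGNORE})
    print(announcement("marquee", sheet))
```

Keeping the mapping in data rather than in code is what makes the transcoder extensible: supporting a new tag means adding one entry to the sheet.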

Conclusion
- Our transcoder is capable of converting any HTML file (version 4.0 or lower) to the corresponding VoiceXML file.
- The prominent features of the transcoder are extensibility and user interactivity.
- HTML to VoiceXML conversion paves the way for anytime, anywhere Internet access for mobile clients.

Future Work
- Our system will strive to remove the restriction that all open tags in the input file should have close tags.
- Try to process applets and scripts that may be present in the input HTML page.
- Analyze the feasibility of implementing our transcoder in proxy servers.