VoiceXML continued Speech reco/speech synthesis recap rps example ( ) Homework: Do VoiceXML examples. Start planning Project 2.

Slides:



Advertisements
Similar presentations
Presented by Erin Palmer. Speech processing is widely used today Can you think of some examples? Phone dialog systems (bank, Amtrak) Computers dictation.
Advertisements

WEB DESIGN TABLES, PAGE LAYOUT AND FORMS. Page Layout Page Layout is an important part of web design Why do you think your page layout is important?
Rob Marchand Genesys Telecommunications
Small Business Resource Power Point Series Web Site Development – A Beginners Guide.
CS 275Tidwell Course NotesPage 110 Chapter 7: Getting Input From Users Designing interactive forms, in which the user is expected to supply information.
Chapter 6 Problem Solving and Algorithm Design. 6-2 Chapter Goals Determine whether a problem is suitable for a computer solution Describe the computer.
Word Lesson 8 Increasing Efficiency Using Word
Use logic to teach the computer how to play a game
Final Project Presentation A Spoken Dialog System using the VoiceXML 2.1 platform to automate the ordering process for a pizza restaurant Scott Simpson.
Substitute FAQs SubFinder Overview. FAQs Do I have to have touch-tone service to use SubFinder? No, but you do need a telephone that can be switched from.
Copyright 2004 Monash University IMS5401 Web-based Systems Development Topic 2: Elements of the Web (g) Interactivity.
The State of the Art in VoiceXML Chetan Sharma, MS Graduate Student School of CSIS, Pace University.
Pace VoiceXML Absentee System Paul Visokey, Ping Gallivan, Yani Mulyani, Lisa Jordan, Elaine Li, George Mathew, Qisheng Hong Presenter Name : Paul Visokey.
1 XML: Document Type Definitions 2 Road Map  Introduction to DTDs  What’s a DTD?  Why are they important?  What will we cover?  Our First DTD 
VoiceXML and Internet Telephony Kundan Singh and Henning Schulzrinne Columbia University Joint work (in progress) with Daniel,
VoiceXML Application Systems Team By: Shawn Ramdass, Saji Abraham & Billy Santamorena.
Part or all of this lesson was adapted from the University of Washington’s “Web Design & Development I” Course materials.
Chapter Objectives Explain Web page multimedia issues
Form Handling, Validation and Functions. Form Handling Forms are a graphical user interfaces (GUIs) that enables the interaction between users and servers.
Chapter 9 Collecting Data with Forms. A form on a web page consists of form objects such as text boxes or radio buttons into which users type information.
Programming Games Computer science big ideas. Computer Science jargon. Show virtual dog Homework: [Catch up: dice game, credit card or other form.] Plan.
Computer Concepts 2014 Chapter 7 The Web and .
Programming games Flash drawing trick(s). Rock paper scissors. Classwork: Complete rock paper scissors.
Reading Data in Web Pages tMyn1 Reading Data in Web Pages A very common application of PHP is to have an HTML form gather information from a website's.
1 CS 3870/CS 5870 Static and Dynamic Web Pages ASP.NET and IIS.
Beyond Usability: Measuring Speech Application Success Silke Witt-Ehsani, PhD VP, VUI Design Center TuVox.
New challenge: cellphones & PDAs Show homework (majors) 2-phase xml/xsl Nokia Mobile Toolkit, Openwave WML, WMLScript, wbmp XHTML-XP Homework: download,
© 2011 Delmar, Cengage Learning Chapter 9 Collecting Data with Forms.
XP New Perspectives on Microsoft Access 2002 Tutorial 51 Microsoft Access 2002 Tutorial 5 – Enhancing a Table’s Design, and Creating Advanced Queries and.
1 Week Three: Using Scripts and Forms in HTML (Week Three) By Dr Fadi Safieddine.
McGraw-Hill/Irwin © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Dynamic Action with Macromedia Dreamweaver MX Barry Sosinsky Valda Hilley.
COMPUTER PROGRAMMING Source: Computing Concepts (the I-series) by Haag, Cummings, and Rhea, McGraw-Hill/Irwin, 2002.
Design Patterns Phil Smith 28 th November Design Patterns There are many ways to produce content via Servlets and JSPs Understanding the good, the.
VoiceXML Brandon Hannasch. Outline What is VoiceXML? Basic Tags Voice Recognition Audio Files Call Flow.
1 Computational Linguistics Ling 200 Spring 2006.
06/10/2015AJAX 1. 2 Introduction All material from AJAX – what is it? Traditional web pages and operation Examples of AJAX use Creating.
Section 17.1 Add an audio file using HTML Create a form using HTML Add text boxes using HTML Add radio buttons and check boxes using HTML Add a pull-down.
Programming Games Show your simple video. More video examples. Audio. Classwork/Homework: Produce more complex video program.
Creating a Sniffer using the Flash Connection Kit and DreamWeaver Greg Shorts CIS 254.
New challenge: telephone Text To Speech & audio Speech recognition VoiceXML Homework: sign up on studio.tellme.com.
AtoCC Compiler Construction Workshop Michael Hielscher Aalborg, Creating an Interpreter and Compiler for a music language ML.
CIS 451: ASP.NET Objects Dr. Ralph D. Westfall January, 2009.
© 2010 Delmar, Cengage Learning Chapter 8 Collecting Data with Forms.
Programming games Flash drawing trick(s). Past due: Your completion of rock-paper-scissors) Classwork: Bouncing ball: read tutorial. Start coding.
WEB BASED DATA TRANSFORMATION USING XML, JAVA Group members: Darius Balarashti & Matt Smith.
 Whether using paper forms or forms on the web, forms are used for gathering information. User enter information into designated areas, or fields. Forms.
Computers and Scientific Thinking David Reed, Creighton University Functions and Libraries 1.
Creating interfaces XML to WML, Push Use actual cell phones Homework: do WML examples, practice with Nokia Simulator.
Creating Databases Local storage. join & split Classwork: show 1 table application. Share designs for oscars application. Adaptive select. Homework: [Catch.
Creating Web Documents: How the Web works Client / Server Protocols Access methods Homework: Complete experiment & report on Discussion Forum.
Creating User Interfaces [Continue presentations as needed] Speech recognition. Speech synthesis Homework: Report on current products. Register on Tellme.
Introduction to Digital Media. What is it? Digital media is what computers use to; Store, transmit, receive and manipulate data Raw data are numbers,
Creating User Interfaces Directed Speech. XML. VoiceXML Classwork/Homework: Sign up to be Voxeo developer. Do tutorials.
Visual Basic Games: Week 4 Recap Parallel structures Initialization Prepare for Memory Scoring Shuffling Homework: when ready, move on to next game/chapter.
Introduction to JavaScript CS101 Introduction to Computing.
David Lawrence 7/8/091Intro. to PHP -- David Lawrence.
Student Pages
Creating User Interfaces Another example. Classwork/homework: work on VoiceXML project.
ASP-2-1 SERVER AND CLIENT SIDE SCRITPING Colorado Technical University IT420 Tim Peterson.
PHP Form Processing * referenced from
Creating interfaces XML & XSL review VoiceXML: grammar Homework: postings, presentation, study guide.
Speech Processing 1 Introduction Waldemar Skoberla phone: fax: WWW:
Creating interfaces Multi-language example Definition of computer information system VoiceXML example Project proposal presentations Homework: Post proposal,
Creating User Interfaces VoiceXML. Examples. Classwork/Homework: Make proposal and start work on your VoiceXML project.
This was written with the assumption that workbooks would be added. Even if these are not introduced until later, the same basic ideas apply Hopefully.
PHP: Further Skills 02 By Trevor Adams. Topics covered Persistence What is it? Why do we need it? Basic Persistence Hidden form fields Query strings Cookies.
Web Design, 5 th Edition 6 Multimedia and Interactivity Elements.
Cookies BIS1523 – Lecture 23.
Creating User Interfaces
Creating User Interfaces
Presentation transcript:

VoiceXML continued Speech reco/speech synthesis recap rps example ( ) Homework: Do VoiceXML examples. Start planning Project 2.

Speech recognition / Speech synthesis Area of research Area of ‘marketing’ / product research –What is the killer ap? Perhaps phones & PDAs. Focus was on dictation You do not need to understand the science since tellme studio does it for us. –Some background always helpful.

Speech recognition concepts Air pressure  diaphragm in phone  electrical signal  (Fourier Transform)  wave pattern matched against sets of canonical patterns (native speaker of English, perhaps male/female & young/old alternatives) generated for the specified grammar (using a segmentation=dividing up of the parts) produce set of probabilities for each option in grammar –Tellme/VoiceXML provides fieldname$.confidence

Fourier Transform (Discrete Fourier Transform -- FFT) Takes data representing a signal And produces numbers representing the combination of sine and cosine waves that make up the signal

Speech recognition Works on the product of the FFT Uses (in most cases) –Segmentation: attempt to break up into pieces, perhaps syllables or words –Grammar: definition of what is to be expected –Probabilities: if first part matched X, then greater probability that then next would match to Y

Speech synthesis aka TTS (text to speech) lexical units (syllabus of words)  phonemes  pre- recorded (wav) files of phonemes This is again a segmentation process: need to divide up the words and then put together so speech sounds 'natural'. –particular phoneme may [need to] sound different in different context. –also need to deal with abbreviations & local accents –Place names (important in travel & weather applications) Special case: detect and use wav file for each name. Older methods were all synthesized –similar distinction between all synthesized and samples of music

Speech synthesis is essentially ‘the computer’ reading ‘out loud’. Easy to do most things More and more difficult to do complete job

VoiceXML elements –for handling less than ideal conditions and –for selection from list (alternative to grammar) –pause

Accepting caller input Application: getting caller name More complex than you would think There is the element

Simple record/playback Hello. Who is calling? Hello. Have a nice day.

Problems Replays in caller’s voice Can’t use outside the form Could save to server: lookup record under VoiceXML elements. –Sample php code

More involved dialogue Hello. Who is calling? Great to hear from you.

Continued: ask for class Which class are you in? <![CDATA[ [ [databases] { } [interfaces both] { } [no none not] { } ] ]]>

Continued: respond for each class Good to hear from you check the next class notes for how to do this This may not be quite what some of you wanted. I don't know why you are calling but it is nice to chat

Suggestion Get the simple one working and then work on the second one Suggested use is for callers to record a greeting –So it makes sense to be in the caller’s voice Experiment with grammar

rock paper scissors Note: a different version is on the tellme site. This is not a great choice (identify a better one) since it doesn't support the illusion that 'the computer' is making the choice at the same time that the player is. … but it does illustrate the features.

rps logic requirements create a random move for 'the computer' –use Math.random and Math.floor prompt and then distinguish caller saying 'rock', 'paper', 'scissors', 'score', 'quit' –use and elements keep score –use JavaScript variables modest amount of error handling – and and count attribute in

VoiceXML features Menu element replaces form element (and its use of field and grammar elements) Each has a count. Used when nomatch or noinput occurs and you don’t want to re-prompt with exact same words. Can distinguish nomatch from noinput Can control timing: amount of time waiting for caller input and time between system utterances. –My version could be improved!

var moves=new Array('rock','paper','scissors'); function randommove() { var r = Math.floor(Math.random()*3); return moves[r]; }

Say rock, paper, scissors, score or quit Please make a choice, rock, paper or scissors, or say score to get the score or quit to quit I guess you're done.

I didn't understand. I didn't hear anything rock scissors paper score quit

Scores are wins. Losses Ties loops back

Computer played Tie You lose. Paper covers rock. You win. Rock breaks scissors

Exercise: write the missing forms. Exercise: improve prompts. (Exercise when you can test it using the phone: improve use of breaks.)

Good bye

testing Go to your tellme account Point your account at the website. For my examples, find out URLs by going to my XML stuff site. Rock-paper-scissors is at

Adding your voice Use tellme studio procedure for recording using the phone. They mail you the file. You should rename file. You upload that file to your website. You will need to keep track of files. They will use a name/number system for files sent. –Do one at a time

Other VoiceXML elements subdialog –to jump to encapsulated dialog OR server-side script (which generates a VoiceXML document) submit –to jump to a new document generated by server-side script data –to access XML content (generated by server-side programs) record –recording caller input for storage on server transfer –transfer to another phone number

Phone as interface: recap Familiar, ubiquitous, friendly (?), potentially hands-free Design challenge same (similar, not graphic design) –focus on function (information exchange) –caller actions can be guided but still potentially quite variable

Homework VoiceXML exploration exercises Begin to plan Project 2. Look at my pages for ideas. Make posting to Forum (indicate team members). –another Web/XML project –VoiceXML –WML and/or XHTML-MP (using Nokia or OpenWave. Can also try with your phone, if your phone is WAP enabled.)