Speech in, speech out. 24 listopad 2006WS0607 – elevator2/15 Nuance server compiled recognition grammar, master language package, licence manager Nuance.

Slides:



Advertisements
Similar presentations
Chapter 8 Improving the User Interface
Advertisements

Programming Logic and Design Fourth Edition, Introductory
Hand Crafting your own program By Eric Davis for CS103.
Session 6 Server-side programming - ASP. An ASP page is an HTML page interspersed with server-side code. The.ASP extension instead of.HTM denotes server-side.
XSL Unit 6 November 2. XSL –eXtensible Stylesheet Language –Basically a stylesheet for XML documents XSL has three parts: –XSLT –XPath –XSL-FO.
Introduction to VXML. What is VXML? Voice Extensible Markup Language Used in telephone-based speech applications voice browsing of the web.
8 November Forms and JavaScript. Types of Inputs Radio Buttons (select one of a list) Checkbox (select as many as wanted) Text inputs (user types text)
Chapter 7 Improving the User Interface
Chapter 11 ASP.NET JavaScript, Third Edition. 2 Objectives Learn about client/server architecture Study server-side scripting Create ASP.NET applications.
© 2001 CUHK Recognition Software Building Block Overview 1. Task specification — What is the function of the system ? 2. Recognizer Design — Vocabulary.
8/6/2015Auto Attendants 1 Smarter Communications.
Recognizes: Magic Memo, show me memo number three. Magic Memo, show memo one. Magic Memo, display memo number two. Magic Memo, display memo.
Form Handling, Validation and Functions. Form Handling Forms are a graphical user interfaces (GUIs) that enables the interaction between users and servers.
BSC Food Distribution 8181 NW 36 Street, Suite 14-D Doral, FL Phone: Fax:
DAT602 Database Application Development Lecture 15 Java Server Pages Part 1.
Introduction to programming in MATLAB MATLAB can be thought of as an super-powerful graphing calculator Remember the TI-83 from calculus? With many more.
VoiceXML Builder Arturo Ramirez ACS 494 Master’s Graduate Project May 04, 2001.
1 Chapter One A First Program Using C#. 2 Objectives Learn about programming tasks Learn object-oriented programming concepts Learn about the C# programming.
A First Program Using C#
Introduction to Python Lecture 1. CS 484 – Artificial Intelligence2 Big Picture Language Features Python is interpreted Not compiled Object-oriented language.
Lecture Note 3: ASP Syntax.  ASP Syntax  ASP Syntax ASP Code is Browser-Independent. You cannot view the ASP source code by selecting "View source"
An Introduction to Textual Programming
JavaScript, Fifth Edition Chapter 1 Introduction to JavaScript.
Fortran 1- Basics Chapters 1-2 in your Fortran book.
Remote Unit Testing Milestone III Alex Riordan Brian Pruitt-Goddard.
8 Shell Programming Mauro Jaskelioff. Introduction Environment variables –How to use and assign them –Your PATH variable Introduction to shell programming.
Creating a Web Site to Gather Data and Conduct Research.
ITCS 6010 SALT. Speech Application Language Tags (SALT) Speech interface markup language Extension of HTML and other markup languages Adds speech and.
Shaping Learning Together Jessica Wu ELT Consultant
VoiceXML: Forms, Menus, Grammars, Form Interpretation Algorithm.
Java Programming, Second Edition Chapter Five Input and Selection.
This slide deck is for LPI Academy instructors to use for lectures for LPI Academy courses. ©Copyright Network Development Group Module 9 Basic Scripting.
(Stream Editor) By: Ross Mills.  Sed is an acronym for stream editor  Instead of altering the original file, sed is used to scan the input file line.
Flow of Control Part 1: Selection
Python Programming Using Variables and input. Objectives We’re learning to build functions and to use inputs and outputs. Outcomes Build a function Use.
22CS 338: Graphical User Interfaces. Dario Salvucci, Drexel University. Lecture 10: Advanced Input.
An Introduction to Programming with C++ Sixth Edition Chapter 7 The Repetition Structure.
Working With Files. Sources of Data Adding data to a spreadsheet can be done in several ways, including: – Type it in piece by piece – Read it from a.
Tutorial 6 Working with Web Forms. 2New Perspectives on HTML, XHTML, and XML, Comprehensive, 3rd Edition Objectives Explore how Web forms interact with.
Program #2 Cell Phone Usage ….Let’s start with understanding the problem!
1 Computer Science of Graphics and Games MONT 105S, Spring 2009 Session 1 Simple Python Programs Using Print, Variables, Input.
1 Data Manipulation (with SQL) HRP223 – 2010 October 13, 2010 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This.
Repetition Structures Repetition Structures allow you to write programs that will repeat program steps multiple times. –Also called Loops –Counter controlled.
Lesson 1 Operating Systems, Part 1. Objectives Describe and list different operating systems Understand file extensions Manage files and folders.
IBC233 Lecture 2 Updated Winter 2008 Agenda Test next Week – Jan 23 ISeries Architecture CL (Control Language) Library Lists Operations Navigator.
Controlling Computer Using Speech Recognition (CCSR) Creative Masters Group Supervisor : Dr: Mounira Taileb.
By Paul Richard and Jim Fitzgerald Chapter 18 – Drawing Management Tools and Utilities.
1 Working with ASP.NET Master Pages By huda AlSuwailem Reference: Mastering Microsoft Visual Web Developer 2005 Express Edition by John Paul Mueller.
1 Day 18 Bash and the.files. 2 The.files ls shows you the files in your directory –Or at least most of them. –Some files are hidden. Try: ls –a –This.
Files Tutor: You will need ….
Student Pages
Tutorial 6 Working with Web Forms. 2New Perspectives on HTML, XHTML, and XML, Comprehensive, 3rd Edition Objectives Explore how Web forms interact with.
Page Layout You can quickly and easily format the entire document to give it a professional and modern look by applying a document theme. A document theme.
Administriva l James will run a hands on tutorial in WEB 130 today at 3:30 and again at 2:00 and 3:30 on Thursday. l Can everyone that wants to attend,
JavaScript Introduction and Background. 2 Web languages Three formal languages HTML JavaScript CSS Three different tasks Document description Client-side.
Introduction to TouchDevelop Lesson 3 – Comments & Lists Created by S. Johnson
1 Agenda  Unit 7: Introduction to Programming Using JavaScript T. Jumana Abu Shmais – AOU - Riyadh.
Introduction to Algorithm. What is Algorithm? an algorithm is any well-defined computational procedure that takes some value, or set of values, as input.
Guide to Operating Systems, 5th Edition
Specifying, Compiling, and Testing Grammars
Data Management Daniel Sewe, Client Support
Introduction to TouchDevelop
Alternate Version of STARTING OUT WITH C++ 4th Edition
For -G7 programing language Teacher / Shamsa Hassan Alhassouni.
Data Groupings: File File: a group of related records
Lecture Notes - Week 2 Lecture-1. Lecture Notes - Week 2 Lecture-1.
Terminal-Based Programs
Chapter 1: Programming Basics, Python History and Program Components
SEEM 4540 Tutorial 4 Basic PHP based on w3Schools
VoiceXML An investigation Author: Mya Anderson
Presentation transcript:

speech in, speech out

24 listopad 2006WS0607 – elevator2/15 Nuance server compiled recognition grammar, master language package, licence manager Nuance client speech-in components

24 listopad 2006WS0607 – elevator3/15 anticipate user’s responses what pieces of information are needed to complete the dialog? in what order will they be requested? one piece of information at a time in particular order (directed dialog), several pieces at once, in any order, and prompt for missing items (mixed initiative)? recognition grammar

24 listopad 2006WS0607 – elevator4/15 syntax Nuance: Grammar Specification Language (GSL) Diamant: Speech Recognition Grammar Format (SRGF) recognition grammar

24 listopad 2006WS0607 – elevator5/15 GSL grammar: doc in a file with.grammar extension; e.g. mygram.grammar (mygram will be the resulting package name) contents:GrammarRuleName GrammarDescription GrammarRuleName: at least one uppercase character GrammarDescription: sequence of words, grammar names, and operators that define a set of recognizable word sequences words (terminals) in lower-case operators: recognition grammar () concat(A B C... Y)A and B and... [ ] disjunction[A B C... Y ]either A or B or... ? optional?YY is optional + positive closure+Yat least one Y * Kleene star*Yzero or more Y

24 listopad 2006WS0607 – elevator6/15 GSL grammar: example expressions [morning afternoon evening] “morning”, “afternoon”, “evening” (good [morning afternoon evening]) “good morning”, “good afternoon”, “good evening” (?good [morning afternoon evening]) “good morning”, “good afternoon”, “good evening”, “morning”, “afternoon”, “evening” (thanks +very much) “thanks very much”, “thanks very very much”,... (thanks *very much) “thanks much”, “thanks very much”, “thanks very very much”,... recognition grammar

24 listopad 2006WS0607 – elevator7/15 example GSL grammar.grammar file.slot_definitions file.GO_FLOOR [ FLOOR:f (?the FLOOR:f floor) (?the FLOOR:f please) (?Filler ?the FLOOR:f floor ?please) ] { } Filler [ (i would like to go to) (i want to go to) (uh) ] FLOOR [ first{return("1")} second{return("2")} third{return("3")} fourth{return("4")} ] recognition grammar floor

24 listopad 2006WS0607 – elevator8/15 another option: SRGF and export as Nuance GSL GrammarTest.bat recognition grammar

24 listopad 2006WS0607 – elevator9/15 compiling the package (compile-package.bat) set PKGHOME = path to your gsl file (w/o extension) nuance-compile %PKGHOME% English.America recognition grammar master recognition package

24 listopad 2006WS0607 – elevator10/15 testing the grammar (text) parse-tool -package path_to_your_model nl-tool –package path_to_your_model –grammar grammar_in_your_model recognition grammar

24 listopad 2006WS0607 – elevator11/15 running Nuance: licence manager: lm.bat recognition server: rs.bat set PKGHOME = path to your compiled model recserver -package %PKGHOME% lm.Addresses=localhost config.... testing the grammar (speech) xapp -package path to your compiled model lm.Addresses=localhost speech recognition

24 listopad 2006WS0607 – elevator12/15 running nuance client edit Diamant config file: Clients.ini NuanceClient.bat (btw, have the licence manager and the server running too... duh!...) Diamant with speech-in

24 listopad 2006WS0607 – elevator13/15 adding speech-in add device as usual activate recognition: output „start” (start command) to nuance client read (speech) input from nuance client into variable as usual access recognition confidence (of type Real) like this: var#confidence Diamant with speech-in

24 listopad 2006WS0607 – elevator14/15 Mary server online at DFKI... Mary client MaryClient.bat speech-out components

24 listopad 2006WS0607 – elevator15/15 Diamant with speech-out adding speech-out add device as usual optionally, set format: {format = } (default plain text) and voice {voice = } in output node, output to Mary client as usual

24 listopad 2006WS0607 – elevator16/15 speech-enabled dialogs recognition tends to be imperfect... if recognition confidence low, then, for example (btw, think: grounding): repeat question ask for confirmation („did you say blah?”) inform user what they can say („you can say blah, bloo, and blee, please try again”) but... don’t let user get stuck in endless clarification dialog either!