The Speech Speech casey chesnut brains-N-brawn.com Madison.NET April 2007.

Slides:



Advertisements
Similar presentations
Facts about Welcome to this video from Ozeki. In this video I will present what makes Ozeki Phone System XE the Worlds best on-site software PBX for Windows.
Advertisements

VoiceXML: A Field Evaluation By: Kristy Bradnum Supervisor: Peter Clayton Presented in partial fulfilment of the CS Honours Project.
VIPI - VIRTUAL PORTAL FOR INTERACTION AND ICT TRAINING FOR PEOPLE WITH DISABILITIES National ViPi Workshop 03/10/2011, Larnaca, Cyprus
Developing Windows ® CE Applications With Visual Basic ® Larry Roof tonked
Natural Language Systems
                      Digital Audio 1.
Rob Marchand Genesys Telecommunications
XISL language XISL= eXtensible Interaction Sheet Language or XISL=eXtensible Interaction Scenario Language.
Sean Powers Florida Institute of Technology ECE 5525 Final: Dr. Veton Kepuska Date: 07 December 2010 Controlling your household appliances through conversation.
Speech in.NET Sphinx CMU November Presenter casey chesnut brains-N-brawn.com – Web Services – Mobile / Wireless – Speech.
Discovering Computers: Chapter 1
Spik v1.0 Voice Commands Execution in a Windows Environment Dekel Abelson Eliran Dahan Instructor: Ari Todtfeld.
Thomas Kisner.  Unified Communications Architect at BNSF Railway  Board Member, DFW Unified Communications User Group ◦ Meets 4 th Thursday of Every.
1 of 6 This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation.
09/09/2005ENEE408G Fall 2005 Multimedia Signal Processing 1 ENEE408G: Capstone Design Project: Multimedia Signal Processing Design Project 1: Digital Speech.
Office 2003 Tablet Enabled. Office 2003 The Biggest Addition Is Ink.
Multimodal Apps: Tablet PC & Speech Development in.NET casey chesnut brains-N-brawn.com Wisconsin.NET June 2005.
Readily Available Technology to Support Writing Tools for Struggling Writers.
Natural Language Processing and Speech Enabled Applications by Pavlovic Nenad.
Tech Ed North America /20/2017 1:33 AM Required Slide
Windows 10. The New Microsoft Operating System to be released July 29 th. It’s not just a PC operating system, it’s a lot more, it includes phones,
Vishwa Ranjan Program Management Microsoft Albert Kooiman Product Management Microsoft Session Code: UNC325.
1 Dragon NaturallySpeaking: Training Agenda. What to Expect Goals: Method / Essential Skills / Getting Help Starting to use speech-recognition software.
Some Voice Enable Component Group member: CHUAH SIONG YANG LIM CHUN HEAN Advisor: Professor MICHEAL Project Purpose: For the developers,
Setup Guide for Win 7 Speech Recognition 6/30/2014 Debbie Hebert, PT, ATP Central AT Services.
Augmented Reality with.NET casey chesnut brains-N-brawn.com Dallas C# SIG January 2008.
VoiceXML Builder Arturo Ramirez ACS 494 Master’s Graduate Project May 04, 2001.
Develop apps for your Living Room using the Media Center SDK casey chesnut brains-N-brawn.com Madison.NET October 2007.
Speaking to Computers Alex Acero Manager, Speech Research Group Microsoft Research Feb 14 th 2003.
Conversational Applications Workshop Introduction Jim Larson.
PrepTalk a Preprocessor for Talking book production Ted van der Togt, Dedicon, Amsterdam.
Rujchai Ung-arunyawee Department of Computer Engineering Khon Kaen University.
Turbulent change drives Communications and the Voice User Interface Bill Meisel President, TMA Associates Editor, Speech Strategy News
ITCS 6010 SALT. Speech Application Language Tags (SALT) Speech interface markup language Extension of HTML and other markup languages Adds speech and.
CapturaTalk4Android Demonstration Abi James
Unit 1_9 Human Computer Interface. Why have an Interface? The user needs to issue instructions Problem diagnosis The Computer needs to tell the user what.
Alignment (horizontal / vertical): in center! 0cm (center) Voxeo VoiceObject Overview.
Accessibility in Education WORKSHOP. Top 3 learning objectives 1.Every classroom has a student who can benefit from accessibility 2.Accessibility features.
E-commerce Lectures Ravi Raman CERC, West Virginia University.
Getting starting with Screencasts Michael Paskevicius This work is licenced under the Creative Commons Attribution-ShareAlike 2.5 South Africa License.
Augmented Reality with.NET casey chesnut brains-N-brawn.com Wisconsin.NET UG November 2007.
Using Speech Recognition Copyright 2006 South-Western/Thomson Learning.
Speech Technologies and VoiceXML try Department of Computer Science National Cheng-Chi University.
COMPUTER PARTS AND COMPONENTS INPUT DEVICES
The Voice-Enabled Web: VoiceXML and Related Standards for Telephone Access to Web Applications 14 Feb Christophe Strobbe K.U.Leuven - ESAT-SCD-DocArch.
Beyond Windows 2000 Telephony Noel Anderson Development Lead Windows Real Time Communications Microsoft Corporation.
Outline Grammar-based speech recognition Statistical language model-based recognition Speech Synthesis Dialog Management Natural Language Processing ©
Microsoft Virtual Academy North Shore.NET User Group Our Sponsors.
Voice User Interface
TOI Unity 5.0 Voice User Interface (VUI). © 2006 Cisco Systems, Inc. All rights reserved.2 Voice User Interface (VUI) TOI Unity 5.0 Jason Swager UCBU.
SkyNET Visualization Team Demo and Architecture Overview.
Introduction to Windows 10 Windsor Senior Computer Users Group October 12, 2015.
Developing an Effective Wireless Middleware Strategy.
Adding Narration to a PP presentation Paul Hopkins MMVI.
ARTIFICIAL INTELLIGENCE FOR SPEECH RECOGNITION. Introduction What is Speech Recognition?  also known as automatic speech recognition or computer speech.
Speech Recognition Created By : Kanjariya Hardik G.
W3C Multimodal Interaction Activities Deborah A. Dahl August 9, 2006.
Using Commonsense Reasoning to Improve Voice Recognition.
Presented By Sharmin Sirajudeen S7 CS Reg No :
Siri Voice controlled Virtual Assistant Haroon Rashid Mithun Bose 18/25/2014.
G. Anushiya Rachel Project Officer
PC Accessibility Features Part 2 of iPad and PC Accessibility Tools Online training: SD# 52 Prince Rupert.
Google translate app demo
Using Speech Recognition for Input: A Powerful and Readily Available Tool Dr. Donna Olsen Instructional Technologist Central Wyoming College
Continuous Automated Chatbot Testing
PhoNET Voice based web access ASWIN.P S3 EC ROLL : 24.
11/23/2018 8:30 AM BRK3037 BRK3037: Dive deep on building apps and services with the Office 365 Communications Platform David Newman Senior Program Manager.
WEBINAR: Robotic Process Automation (RPA) of Dynamics NAV with Rapise
Alexa Programming.
VoiceXML An investigation Author: Mya Anderson
Presentation transcript:

The Speech Speech casey chesnut brains-N-brawn.com Madison.NET April 2007

Powerpoint Page Up Page Down

brains-N-brawn.com Pervasive Computing –Tablet PC (MVP 03) –Compact Framework (MVP 04) –Advanced Web Services (MVP 05) –Media Center (MVP 06) –Speech –Location Based Services –Artificial Intelligence –3D

Outline Speech Overview Vista Speech Recognition SAPI 5.3 / System.Speech Speech Server 2007

Outline : Speech Overview Voice User Interface How does it work? –Synthesis (TTS) –Recognition (SR)

Overview Speech is just another presentation system –Synthesis = Output to user –Recognition = User input Voice User Interface (VUI)

VUI Modes Applications –Multi-modal –Voice-only

VUI Tips Don't replicate the touch-tone-based menu system Restrict options on the main (opening) menu to 4 or fewer Make sure your opening greeting is short Don't design the app solely for the new user Focus on task completion above all What can I say? 006/02/08/ aspx

Speech Synthesis Text to Speech –Dynamic –Prompt database

How Synthesis Works Text parsing –Sentences, numbers, symbols, pauses Natural language processing –Part of speech, tense Phonemes are looked up or sounded out Diphones are appended together Post process audio to add emphasis Play speech audio

How Synthesis Works Demo –/xnaSynth app Article – – (codebase from /ttSpeech)

Speech Recognition Speech to Text –Dictation –Command and Control

How Recognition Works Audio signal is processed Look for signals which might be speech Phonemes are found in audio signals Phonemes are mapped to a dictionary or words –Dictation or grammar-based Apply natural language processing

How Recognition Works Demo –/wavReader app Article – – (codebase from /noReco)

Outline : Vista Speech Recognizer Built-in to Vista’s shell Microphone bar Language support Can be trained to improve accuracy Command-and-control, also Dictation Automagic application support Horrible Office integration UAC problems

Demo Say what you see Show numbers Correct Spell it Mouse grid /vista-speech-recognition-screencast/

High Risk Demo

Hack 65.stm /micBarExtend – tap and talk

Narrator Vista’s screen reader

Outline : SAPI 5.3 / System.Speech Desktop applications –SAPI 5.3 –System.Speech

SAPI 5.3 COM based Native applications Managed apps which need more control

System.Speech Part of.NET 3.0 WPF Managed wrapper built on SAPI 5.3 Simple API Standards support (SSML, SRGS) Language support Vista Speech Recognition integration Does not work in XBAP

System.Speech.Synthesis SpeechSynthesizer SSML PromptBuilder Voices

System.Speech.Synthesis Demo –/speechSamples - /speechSynth

System.Speech.Recognition SpeechRecognizer / SpeechRecognizerEngine SRGS GrammarBuilder Advanced users –Deep-link functionality –Mixed initiative

System.Speech.Recognition Demo –/speechSamples - /speechReco

System.Speech Demo –/micBarExtend –/mceSapiMcpl Article – – – (not updated for Vista yet)

What about Mobile Devices OEMs can add VoiceCommand –VoiceCommand is not accessible to developers WindowsMobile has the SAPI API, but no engines PlatformBuilder is supposed to have engines There are 3 rd party engines for purchase

Outline : Speech Server 2007

Speech Server 2007 Telephony Applications Outgoing calls Speaker Independent

Speech Server 2007 VOIP Language support VoiceXML / SALT Workflow development model Reports Still in beta

Speech Server 2007 Speech Synthesis –Inline –PromptBuilder –SSML –Prompt databases Speech Recognition –Inline –Dynamic Grammar –SRGS –Conversational Grammar Builder –DTMF

VoiceXML Declarative language Article – – –

SALT Yet another declarative language Multimodal support has been dropped Article – – – –

Speech Workflow Speech Sequence Workflow designer Speech activities –Statement –QuestionAnswer Debugging tools

Speech Workflow Demo –/speechTextAdv –/speakerVerify –/mobileRecord Article – brawn.com/speechTextAdv/ brawn.com/speechTextAdv/ – brawn.com/speakerVerify/ brawn.com/speakerVerify/

Where Accessibility Telephony Telematics Home automation Mobile Devices / Tablets Gaming Warehouses …

Possible Future Telematics Service Pack for Office Support Exchange Server 2007 Speech Server 2007 release Rumors that WindowsMobile will get a public API Dictation has room to improve Hope that System.Speech will ultimately work in XBAP

Questions