Multimodal Apps: Tablet PC & Speech Development in .NET. casey chesnut, brains-N-brawn.com. Wisconsin .NET, June 2005.

Source Code The associated source can be found here: –

Seamless Computing Advanced Web Services (MVP05) Compact Framework (MVP04) MapPoint Tablet PC (MVP03) Speech Artificial Intelligence Direct3D Media Center

Questions How many programmers? –Tablet PC –Speech –Media Center

Outline Tablet PC Speech –Speech API (SAPI) –Speech Application SDK (SASDK) –Speech Server Demo –Tablet and Speech –Media Center and Speech

Outline : Tablet PC Development environment How it works Working with Ink Opinion Future

Development Environment: Windows XP Pro (non Tablet edition). Visual Studio .NET 1.1. Tablet PC SDK 1.7 – 4b83-a821-40bc-aa85-c9ee3d6e9699&displaylang=en Recognizer Pack – 184DD-5E B E9F918B&displaylang=en Digitizer Board – Tablet PC

How Ink works: Digitizer collects stroke information. Strokes are broken up into characters / words / drawings. Character / word stroke info is transformed into some feature set. Feature set is run through some sort of pre-trained AI. Output is mapped to a dictionary of words.
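
The feature-set and pre-trained-AI steps above can be illustrated with a toy sketch. It is written in Python rather than .NET so it runs without the Tablet PC SDK, and it is not how the Microsoft recognizer works internally: the direction-histogram feature, the nearest-prototype classifier, and the two "trained" symbols are all assumptions chosen for brevity.

```python
import math

def direction_histogram(stroke, bins=8):
    """Feature set: length-weighted histogram of pen-movement directions."""
    hist = [0.0] * bins
    half_bin = math.pi / bins  # centre the bins on 0, 45, 90, ... degrees
    for (x0, y0), (x1, y1) in zip(stroke, stroke[1:]):
        dx, dy = x1 - x0, y1 - y0
        length = math.hypot(dx, dy)
        if length == 0:
            continue  # repeated point, no direction
        angle = math.atan2(dy, dx) % (2 * math.pi)
        index = int((angle + half_bin) / (2 * math.pi) * bins) % bins
        hist[index] += length
    total = sum(hist)
    return [h / total for h in hist] if total else hist

def classify(stroke, prototypes):
    """Stand-in for the pre-trained model: nearest prototype by Euclidean distance."""
    feats = direction_histogram(stroke)
    return min(prototypes, key=lambda label: math.dist(feats, prototypes[label]))

# Hypothetical "training data": one example stroke per symbol.
horizontal = [(0, 0), (5, 0), (10, 0)]
vertical = [(0, 0), (0, 5), (0, 10)]
prototypes = {
    "-": direction_histogram(horizontal),
    "|": direction_histogram(vertical),
}

# A slightly wobbly horizontal stroke, as a digitizer might report it.
print(classify([(2, 3), (6, 3.5), (12, 3)], prototypes))
```

A real recognizer uses far richer features and a statistical model, but the shape of the pipeline is the same: points in, fixed-length feature vector, model lookup, label out.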

Demo Digitizer collects stroke information Tablet PC Inspector –

Demo Strokes are broken up into characters / words / drawings InkDivider –Tablet PC SDK Sample

Demo: Character / word stroke info is transformed into some feature set. Feature set is run through some sort of pre-trained AI. Demo –/aiTabletOcr Article – / /

Demo: Output is mapped to a dictionary of words. Dictionary Tool – Article – / /
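
One common way to map raw recognizer output onto a dictionary is to pick the dictionary word with the smallest edit distance. The sketch below assumes that approach. The slides do not say which method the Dictionary Tool or the recognizer uses, and the word list is hypothetical.

```python
def edit_distance(a, b):
    """Levenshtein distance via the standard dynamic-programming recurrence."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute ca -> cb
        prev = curr
    return prev[-1]

def snap_to_dictionary(raw, dictionary):
    """Return the dictionary word closest to the raw recognizer output."""
    return min(dictionary, key=lambda word: edit_distance(raw.lower(), word))

# Hypothetical dictionary; "tab1et" mimics an l/1 confusion in raw ink output.
words = ["tablet", "table", "speech", "stylus"]
print(snap_to_dictionary("tab1et", words))
```

Restricting the dictionary to the words an application actually expects is what makes this step effective: the smaller the list, the fewer near-misses compete.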

Working with Ink InkControls InkOverlay –Collection –Recognition RealTimeStylus Ink on the web

Ink Controls InkEdit InkPicture Code from scratch

InkOverlay Collection Recognition Demo apps

RealTimeStylus RealTimeStylusPlugin –Tablet PC SDK Sample

Ink on the Web IE only InkBlogWeb –Tablet PC SDK Sample Article –

Opinion: Green Light –Tablet PC Edition 2005 improved recognition and usability dramatically –Recognizer Pack made development more accessible –Language Support: Chinese (Traditional and Simplified), U.S. English, U.K. English, French, German, Italian, Japanese, Korean, Spanish

Possible Future: VS.NET 2005? Avalon? Will IE7 have tighter integration with ink? Longhorn – baked in. Possibility for training ink recognition

What about Pocket PCs? Handwriting Recognition. Form factors.

Outline : Speech How does it work? –Synthesis (TTS) –Recognition (SR) Development –Speech API (SAPI) –Speech Application SDK (SASDK) –Speech Server (MSS)

How Synthesis Works: Text is converted to phonemes. Phonemes are appended together. Audio is played back. Demo –/ttSpeech app Article –
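
The first two synthesis steps can be sketched as a lexicon lookup followed by concatenation of one audio unit per phoneme. Everything specific here is an assumption made for illustration: the two-word lexicon, the phoneme symbols, and the use of plain sine tones in place of recorded speech units. Playback is left out.

```python
import math

SAMPLE_RATE = 8000  # samples per second

# Hypothetical mini-lexicon: word -> phoneme symbols.
LEXICON = {"hi": ["HH", "AY"], "ink": ["IH", "NG", "K"]}

# Hypothetical unit inventory: each phoneme is stood in for by a tone.
UNIT_PITCH_HZ = {"HH": 200, "AY": 440, "IH": 390, "NG": 250, "K": 150}

def text_to_phonemes(text):
    """Step 1: convert text to phonemes (raises KeyError for unknown words)."""
    phonemes = []
    for word in text.lower().split():
        phonemes.extend(LEXICON[word])
    return phonemes

def unit_samples(phoneme, duration_ms=100):
    """Generate the audio unit for one phoneme as a list of float samples."""
    count = SAMPLE_RATE * duration_ms // 1000
    freq = UNIT_PITCH_HZ[phoneme]
    return [math.sin(2 * math.pi * freq * t / SAMPLE_RATE) for t in range(count)]

def synthesize(text):
    """Step 2: append the units for each phoneme into one sample buffer."""
    audio = []
    for phoneme in text_to_phonemes(text):
        audio.extend(unit_samples(phoneme))
    return audio

audio = synthesize("hi ink")
print(text_to_phonemes("hi ink"), len(audio))
```

A production engine also handles words missing from the lexicon (letter-to-sound rules), prosody, and smoothing at the joins between units, none of which this sketch attempts.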

How Recognition Works: Audio wav is transformed to some meaningful form. Phonemes are found in audio signals. Phonemes are mapped to a dictionary of words. Demo –wavReader app Article –
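
As an illustration of the first recognition step, the sketch below splits a signal into fixed-size frames and computes two classic per-frame features, short-time energy and zero-crossing rate. This is an assumed, simplified front end. The slides do not say which features the wavReader demo or the SAPI engines use, and the test signal is synthetic rather than read from a wav file.

```python
import math

def frames(signal, size, step):
    """Split a list of samples into frames of `size` samples, `step` apart."""
    return [signal[i:i + size] for i in range(0, len(signal) - size + 1, step)]

def energy(frame):
    """Mean squared amplitude: near zero for silence, larger for sound."""
    return sum(s * s for s in frame) / len(frame)

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose sign differs."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

# Synthetic input: 400 samples of silence, then 400 samples of a 500 Hz tone.
rate = 8000
silence = [0.0] * 400
tone = [math.sin(2 * math.pi * 500 * t / rate) for t in range(400)]
signal = silence + tone

feats = [(energy(f), zero_crossing_rate(f)) for f in frames(signal, 200, 200)]
for index, (e, z) in enumerate(feats):
    print(index, round(e, 3), round(z, 3))
```

Feature vectors like these (in practice, spectral features) are what an acoustic model scores against phonemes, and the resulting phoneme sequence is then matched against the word dictionary.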

Speech API (SAPI) Old school COM Windows applications Can do dictation Demo –SAPI app

Opinion Yellow light –It works, but is aging –Has to be trained for dictation –Limited language support Green light for Tablet PCs –Tablet PC has recognition and synthesis engines installed –Some Tablets have microphone arrays built in

Future System.Speech –Simple API –Reflection capabilities –Standards support (SSML, SRGS) –Engines should be improved from all the Speech Server work

What about Pocket PCs? OEMs can add Voice Command. Windows Mobile has SAPI, but no engines. Platform Builder is supposed to have engines. There are 3rd-party engines for purchase.

Speech Application SDK VS.NET 1.1 integration For web based apps –Voice-only telephony –Multimodal browser Demo –Code voice-only from scratch Article –

SASDK Speech Synthesis –Inline –Code behind –Prompt functions –Prompt databases Speech Recognition –Inline –Static Grammar –Dynamic Grammar –DTMF

Speech Server Runs SASDK applications Primarily for Voice-only apps Also for Multimodal PocketPC apps Speech Language Packs –North American Spanish –Canadian French Article –

Deployment

Opinion: Green light for Voice-Only –Great tool support –Cheap hardware –Language support. Red light for Multimodal –Standards battle with VoiceXML –IE Speech Add-Ins are not accessible –Pocket IE Speech Add-In not updated for R2 release, nor does it support Smartphone

Possible Future VS.NET 2005? XAML? Will IE7 have voice browsing built-in? Other browsers to add SALT support? Pocket IE Professional?

Combo Demos Ink and Speech (WinForm) –InkCollection app – Ink and Speech (WebForm) –Video – Remote and Speech (AddIn) – Remote and Speech (HostedHTML) –

Questions