© GyrusLogic, Inc. 2007 AI and Natural Language VUI design Peter Trompetter, VP Global Development GyrusLogic, Inc. Tuesday, August 21 at 1:30 PM - C202.

Slides:



Advertisements
Similar presentations
Tuning Jenny Burr August Discussion Topics What is tuning? What is the process of tuning?
Advertisements

© 2007 Avaya Inc. All rights reserved. Interactive Voice and Video Response Applications Dr. Valentine C. Matula
Using Asterisk to Implement Intelligent Call Center Solutions James Kleckner AMTELCO.
Convergys Confidential and Proprietary Copyright © 2007 Convergys Corporation To Confirm, or Not to Confirm … That is the Question Kristie Goss Convergys.
You be the Expert! Speech & the End-to-end Customer Experience Lizanne Kaiser, Ph.D. Customer Experience Designer Genesys August 20, 2007, 3:00pm, B104.
© 2002 – 2007 Versay Solutions, LLC. All rights reserved. Building Fault Tolerant Voice User Interfaces SpeechTEK 2007 Tuesday, August 21 Track B “Getting.
1 August 9, David Claiborn SLM Tuning: Lessons Learned.
>Christopher F. Passaretti Software Development Manager >SpeechTek 2007 Expand VoiceXML & CCXML Using Web Services.
Annoucements  Next labs 9 and 10 are paired for everyone. So don’t miss the lab.  There is a review session for the quiz on Monday, November 4, at 8:00.
TECHNOLOGY FOR MOBILE ADVERTISING SEARCH & COMMERCE © 2007 Apptera Inc. Optimizing Software Architecture for Voice Search SpeechTek 2007.
1 Profit from usage data analytics: Recent trends in gathering and analyzing IVR usage data Vasudeva Akula, Convergys Corporation 08/08/2006.
Using Your Voic M6 Release 7.1Business Feature Set Default Menu Options Revised August 31, 2007 TR
SpeechTEK August 22, 2007 Better Recognition by manipulation of ASR results Generic concepts for post computation recognizer result components. Emmett.
Managing Complexity: 3rd Generation Speech Applications Roberto Pieraccini August 7, 2006.
Requirements and Design
Managing Speech Projects for Maximum Efficiency Christoph Mosing, Vice President of Professional Services, Envox Worldwide.
Search Engines and Information Retrieval
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
ITCS 6010 Spoken Language Systems: Architecture. Elements of a Spoken Language System Endpointing Feature extraction Recognition Natural language understanding.
Unified. Simplified. Unified Communications Launch 2007.
Natural Language Processing and Speech Enabled Applications by Pavlovic Nenad.
Sepandar Sepehr McMaster University November 2008
Nextgen Bank Reconciliation Resource Bank Reconciliation Menu Financial Management Bank Reconciliation –Import Bank Statements –Reconcile Bank Accounts.
Listening Methods In-Depth IVR and End to End Call Analytics What is the Real Performance of My Voice Self-Service Solution? Can I compare my IVR Design.
Language Recognition… Searching with Precision Santa Clara, CA October 31, 2001 Julian Henkin Vice President, Worldwide Customer Services LexiQuest, Inc.
1 A Practical Rollout & Tuning Strategy Phil Shinn 08/06.
© GyrusLogic, Inc A Conversational System That Reduces Development Cost Luis Valles, Chief Scientist GyrusLogic, Inc. Monday, August 7 at 1:30 PM.
Speech Guidelines 2 of Errors VUIs are error-prone due to speech recognition. Humans aren’t perfect speech recognizers, therefore, machines aren’t.
Should Intelligent Agents Listen and Speak to Us? James A. Larson Larson Technical Services
VoiceXML Builder Arturo Ramirez ACS 494 Master’s Graduate Project May 04, 2001.
AS Level ICT Mrs. Ghazaal. In the past, when a customer wanted to talk to someone in a company they would usually be able to telephone and be put through.
Beyond Usability: Measuring Speech Application Success Silke Witt-Ehsani, PhD VP, VUI Design Center TuVox.
SpeechCycle Confidential Confidential 1 Optimizing Natural Language Interfaces: No Data Like More Data SpeechTEK New York, 2007 Jonathan Bloom & Roberto.
Review of Building Intelligent.NET Applications Stu Egli Andre Inistotov Frenny Saldana Kate Styers Nishant Zinzuwadia MSE 614 February 26, 2008.
Katanosh Morovat.   This concept is a formal approach for identifying the rules that encapsulate the structure, constraint, and control of the operation.
Solutions Summit 2014 Discrepancy Processing & Resolution Terri Sullivan.
Search Engines and Information Retrieval Chapter 1.
Lecture 12: 22/6/1435 Natural language processing Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
1 High Resolution Statistical Natural Language Understanding: Tools, Processes, and Issues. Roberto Pieraccini SpeechCycle
Leveraging Speech Analytics for Customer Satisfaction Presented by: Karl Walder, VP – Solutions Noble Systems Corporation.
© GyrusLogic, Inc. Improved call completion with Natural Language Processing Peter Trompetter Vice President Global Development
ITCS 6010 SALT. Speech Application Language Tags (SALT) Speech interface markup language Extension of HTML and other markup languages Adds speech and.
Super-charge Your Speech Application With Cross- Functional Reporting Techniques Bjorn Austraat Director, Solution Delivery TuVox, Inc. August 9 th, 2006.
Integrating VoiceXML with SIP services
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
Beyond Call Recording: Speech Improves Quality Assurance Larry Mark Chief Technology Officer SER Solutions, Inc.
© Copyright 2007 Arbinet-thexchange, Inc. All Rights Reserved. Voice Peering Steve Heap Chief Technology Officer.
» Jun 9, 2003 Speaker Verification Secure AND Efficient, Deployments in Finance and Banking Jonathan Moav Director of Marketing
Csi315csi315 Client/Server Models. Client/Server Environment LAN or WAN Server Data Berson, Fig 1.4, p.8 clients network.
Theban Stanley Human and Systems Engineering Center for Advanced Vehicular Systems Enhancements to the DARPA Communicator Architecture.
© 2008 SpeechCycle, Inc. More Than Call Steering Managing Dynamic Contextual Complexity Phillip Hunter VP, Application Design/Development SpeechTek NYC.
© RightNow Technologies, Inc. Ask The Experts: Getting the most out of Smart Assistant David Fulton, Product Manager, Web Experience Center Of Excellence,
Outline Grammar-based speech recognition Statistical language model-based recognition Speech Synthesis Dialog Management Natural Language Processing ©
Chapter 3 DECISION SUPPORT SYSTEMS CONCEPTS, METHODOLOGIES, AND TECHNOLOGIES: AN OVERVIEW Study sub-sections: , 3.12(p )
16.0 Spoken Dialogues References: , Chapter 17 of Huang 2. “Conversational Interfaces: Advances and Challenges”, Proceedings of the IEEE,
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
Theban Stanley, Julie Baca, Matt Elliott and Joseph Picone Human and Systems Engineering Center for Advanced Vehicular Systems Mississippi State University.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
Artificial Intelligence: Research and Collaborative Possibilities a presentation by: Dr. Ernest L. McDuffie, Assistant Professor Department of Computer.
Speech Processing 1 Introduction Waldemar Skoberla phone: fax: WWW:
W3C Multimodal Interaction Activities Deborah A. Dahl August 9, 2006.
PREPARED BY MANOJ TALUKDAR MSC 4 TH SEM ROLL-NO 05 GUKC-2012 IN THE GUIDENCE OF DR. SANJIB KR KALITA.
Automating Accounts Payable
OpenWorld 2018 How to Create Chatbots with OMCe
Managing Dialogue Julia Hirschberg CS /28/2018.
David Cyphert CS 2310 – Software Engineering
Smart Business for eGeneration Companies
Huawei CBG AI Challenges
VoiceXML An investigation Author: Mya Anderson
Presentation transcript:

© GyrusLogic, Inc AI and Natural Language VUI design Peter Trompetter, VP Global Development GyrusLogic, Inc. Tuesday, August 21 at 1:30 PM - C202 - “AI & VUI Design” “Just Say It” © GyrusLogic, Inc. 2007

GyrusLogic the Company GyrusLogic, Inc. –Phoenix, AZ, USA –Incorporated in 2001 –Privately funded Technology –25,000+ hours of software development –University projects –3 Patents granted –Fuzzy & Exact dialog processing – - (+1) “Just Say It”

© GyrusLogic, Inc The Right Answer What does the speaker say? What does the speaker mean? What does the speaker want? Unambiguous Understanding in the Dialog Context Reduction of Uncertainty Sprachanalys e Speech Recognition Speech Input Discourse Context Knowledge about Domain of Discourse Grammar Lexical Meaning Acoustic Language Models Word Lists Speech Analysis Speech Under- standing Language Processing

© GyrusLogic, Inc Challenges with Open Ended Dialogs Conversations and human responses can be unpredictable –In many cases, it is impossible to design a dialog flow that anticipates all user’s responses. –Case statements, if-then-else and mix-initiative will not do the trick. Phrases must be well understood –Just picking up keywords doesn’t do the trick. The meaning of the phrase must be captured. Must recognize the difference between a question and a response. Distinguishing between exact questions and vague questions Find the single best response to the question Must keep track of the context of the conversation. –What were we talking about, and is the user now talking about something different? Finish the transaction on what information we need. Keep the development effort to the minimum.

© GyrusLogic, Inc An Artificial Intelligence (AI) Solution Conversations and human responses can be unpredictable –Use AI techniques to build the dialog on the fly, rather than attempting to script the dialog. Phrases must be well understood –Use a combination of semantics and syntax with computational linguistics. Distinguish between a question and a response. –Use multiple engines: answer engine and a response engine. –A broker must recognize which engine to use Distinguish between exact questions and vague questions –Use multiple grammars: user defined grammars and universal grammars that define as a vague question what it is not precisely defined in the user grammar. Find the single best response to the question –Perform computational linguistics for matching the semantics of the question with the semantics of the database repository. –Perform AI search techniques with heuristics for sorting through possible answers

© GyrusLogic, Inc Natural Language Understanding Traditional approaches for Natural Language Understanding (NLU) –Statistical Language Modeling (SLM) –Statistical Semantic Modeling (SSM) –“Say Anything”, “Speak Freely” –Transcription of ,000 utterances –Procedural developments GyrusLogic Platica approach for Natural Language Understanding –Robust Linguistic approach coupled with Artificial Intelligence (no SLM) –Full conversational capabilities (no transcriptions) –Easy industry standard solution (no proprietary scripting) –Interfacing existing “Voice” environments (IVR, ASR, TTS, VoiceXML) –Declarative developments –Significant less effort, with additional operational savings and more application functionality “Just Say It”

© GyrusLogic, Inc Typically Two Types of Requests A Fuzzy request: –They can be spontaneous, arbitrary, ad-hoc, and/or vague –The majority of user queries are fuzzy questions –The best approximation to satisfy the request –User expected accuracy, completeness and an answer the first time –Do I have some bucks in my account? –How much money do I have? –What is the amount of the last deposit? An Exact request: –Specific transactions don’t allow for fuzzy approach, but they can still be spontaneous. (expanded directed dialog) –Transfer $100 from checking to savings –Did check number 123 clear on July 1st.? –I want to travel at 3pm today from Phoenix to Burbank

© GyrusLogic, Inc Platica Server Architecture Universal Grammar & Dictionaries User Transaction Grammar Q & A Sub-dialog System API Web services, dB’s, etc. High Level Dialog and Context Director Execution Interface XML Answer Knowledge Base XML Transaction Rules NLU Directed Parser API N-best input Prompts to play (TTS and/or Audio) Transaction Sub-dialog System Exact Requests Fuzzy Requests

© GyrusLogic, Inc Typical Dialog with GyrusLogic System:How can we help you? User:I would like to transfer funds from my checking to my savings. System:How much money do you want to transfer? User:How much money do I have in my savings account? System: The balance in your savings account is 4000 dollars. I apologize for asking again, how much did you want to transfer? User:Five hundred. System: Ok, let's review this transaction, you want to transfer 500 dollars from your checking to your savings, is this correct? User:No, actually transfer a thousand bucks from savings! System:Ok, let's review this transaction, you want to transfer 1000 dollars from your savings to your checking, is this correct? User:Yes System:How else can we help you? User: What are the last five transactions in my checking? System: The last three transactions in your checking account are…. Note how the user can be spontaneous and say as much as he wants Interruption of dialog with any question, even if it was unrelated User can make a correction on the fly at any point in time The system still handles the question effectively without coding of business rules The system resumes the previous dialog and requests the missing information

© GyrusLogic, Inc XML Transaction Rules Example how much would you like to transfer? from which type of account would you like to transfer? to what type of account would you like to transfer? You want to transfer AMOUNT dollars, from your ORIGIN to your DESTINATION. Is this correct? AMOUNT ORIGIN DESTINATION ACTION_MESSAGE REQUEST is activated when certain information IS provided STATE is activated when a piece of information IS NOT provided. CONFIRMATION is activated when information for all states is provided. OBJECT invokes user’s backend API XML Transaction Rules only apply for Exact Requests

© GyrusLogic, Inc XML Answer Knowledge Base Example ACCOUNT SAVEBALANCE The current balance in your savings account is SAVEBALANCE dollars There are no fees with your existing accounts. There is no debit or credit limit with your existing accounts. OBjECT invokes user’s backend API Retrieves data value and populates the variable in the answer. The system determines which answer to return depending on the context of the user’s question XML Answer Knowledge Base only applies for Fuzzy Requests

© GyrusLogic, Inc No Dialog Flow Development or Maintenance The user application does not need a specification for how the logic of the dialog should look like. GyrusLogic’s inference engines mimic human’s reasoning and build the dialog on the fly, thereby delivering a natural conversation. It is a true conversational AI system and a true AI declarative paradigm. It allows the user to be spontaneous and to interrupt a dialog with questions outside the ongoing dialog. It allows the user to make any correction to a specific transaction, either implicit or explicit.

© GyrusLogic, Inc VoiceXML Mixed Initiative Example  GyrusLogic Platica more spontaneous out of the box!  Automatic dialog interruption and implicit correction  Exponential savings for more complex applications Agent: Thanks for calling Acme Travel Company. How can I help you today? Caller: I'd like to book a flight. Agent: Okay. What is your point of origin, and where are you going? Caller: I wanna fly to Boston, Massachusetts. Agent: You want to fly to Boston, Massachusetts. Where are you flying from? Caller: From San Francisco, California. Agent: Okay, you'll be traveling from San Francisco, California to Boston, Massachusetts. Is that correct? Caller: Yes. 10 times less effort +

© GyrusLogic, Inc GyrusLogic Platica How does GyrusLogic’s Platica help? –Declarative vs. Procedural or Icon based developments. –Implicit and Explicit corrections without additional coding. –Automatic implicit verifications without extra effort. –Spontaneous user interruptions in call flow without additional development effort. –Context and semantics recognition. –Significant savings in time and money with the deployment of Speech related applications. –The same application can be used for Chat, Web and SMS. IVR or

© GyrusLogic, Inc VoiceXML - JSP Example <% if ( request.getParameter("phrase") != null ) { gs.addQuestion(request.getParameter("phrase") ); GyrusResponse gr = cl.ask(gs); session = gr.getNewSession(); } %> No Dialog Flow to develop Send the text phrase we received from the ASR Receive the prompt or audio prompt identifier to play No Exceptions to develop

© GyrusLogic, Inc Source: Voice Information Associates, ASR in Telephony Applications, the World Market > 27% project costs savings Average Speech project deployment costs

© GyrusLogic, Inc Operational savings with Conversational Natural Language Systems Directed Dialog –1:48 avg. call Conversational Dialog –0:57 avg. call Savings –51 seconds per call saved –Avg. 50,000 calls / day –Initial 20% use of conversational dialog –Over 3.1 million minutes saved in initial year

© GyrusLogic, Inc Summary of an AI based GyrusLogic Implementation Fully conversational dialog system, based upon industry standards Flexible answers with enterprise variables in XML knowledge base Powerful implicit and explicit corrections, implicit confirmation No dialog design, minimal grammar development Natural language with context, semantics and meaning understanding Spontaneous user interruptions without additional development effort Improved “recognition rates” by resolving false positives Easy, XML based, implementation of a transactional dialog Automatic parsing for corpus development Back-end database and application support ASR & IVR independent, VoiceXML 2.0 support New language Universal Grammar development can be completed in several days First contact customer resolution for improved customer satisfaction Significant savings in development and total cost of ownership Declarative paradigm, the typical developments will be significantly less error prone Patented context and semantics recognition and parsing technology.

© GyrusLogic, Inc “Just Say It” - (+1) Thank You! questions or demo, visit booth 214