Some Voice Enable Component Group member: CHUAH SIONG YANG 499410001 LIM CHUN HEAN 400415001 Advisor: Professor MICHEAL Project Purpose: For the developers,

Slides:



Advertisements
Similar presentations
GUI Testing. High level System Testing Test only those scenarios and outputs that are observable by the user Event-driven Interactive Two parts to test.
Advertisements

® Copyright 2008 Adobe Systems Incorporated. All rights reserved. ADOBE® ACCESSIBILITY Achieving Accessibility with PDF Greg Pisocky Accessibility Specialist.
Input to the Computer * Input * Keyboard * Pointing Devices
Dedicated Computer Systems
CSE 111: Object Oriented Design. Design “To program is human but to design is divine” (WEH)
By: Soheeb Amin.  A user interface is the connector between man and the technology he is using.  They consist of an input, which is our command to the.
User interface design Designing effective interfaces for software systems Objectives To suggest some general design principles for user interface design.
Designing a User Interface for People with Disabilities u u
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
Chapter 12 Designing Interfaces and Dialogues
 At the end of this class, students are able to  Describe definition of input devices clearly  List out the examples of input devices  Describe.
Prototype & Design Computer Inputs. How to Prototype & Design Computer Inputs Step 1: Review Input Requirements Step 2: Select the GUI Controls Step 3:
Systems Analysis and Design in a Changing World, 6th Edition
Systems Analysis and Design in a Changing World, 6th Edition
Chapter Seven Advanced Shell Programming. 2 Lesson A Developing a Fully Featured Program.
1 “ Speech ” EMPOWERED COMPUTING Greenfield Business Centre, 20 th September, 2006.
Module Designing Computer-based Information Systems
Systems Analysis and Design in a Changing World, 6th Edition
Knowledge Base approach for spoken digit recognition Vijetha Periyavaram.
Real-Time Speech Recognition Subtitling in Education Respeaking 2009 Dr Mike Wald University of Southampton.
Microphone Integration – Can Improve ARS Accuracy? Tom Houy
Chapter 8: Writing Graphical User Interfaces
11.10 Human Computer Interface www. ICT-Teacher.com.
CP SC 881 Spoken Language Systems. 2 of 23 Auditory User Interfaces Welcome to SLS Syllabus Introduction.
CS 4720 Usability and Accessibility CS 4720 – Web & Mobile Systems.
Unit 1_9 Human Computer Interface. Why have an Interface? The user needs to issue instructions Problem diagnosis The Computer needs to tell the user what.
Natural Language Interfaces to Databases Meikiu Lo Gwen Ray October 29, 2003.
Computer Science 101 Database Concepts. Database Collection of related data Models real world “universe” Reflects changes Specific purposes and audience.
1 Nassau Community CollegeProf. Vincent Costa Acknowledgements: An Introduction to Programming Using Visual Basic 2012, All Rights ReservedAn Introduction.
Designing Interface Components. Components Navigation components - the user uses these components to give instructions. Input – Components that are used.
University of Toronto Department of Computer Science © Steve Easterbrook. This presentation is available free for non-commercial use with attribution.
Chapter 2: Basics Operating System Concepts Management (Part 1) F2032 FUNDEMANTAL OF OPERATING SYSTEM.
Voice User Interface
TRANSLATION MEMORY TECHNOLOGY
Software Architecture
© Paradigm Publishing Inc. 4-1 OPERATING SYSTEMS.
INFO 355Week #71 Systems Analysis II User and system interface design INFO 355 Glenn Booker.
A-Level Computing#BristolMet Session Objectives#15 MUST define the term user interface SHOULD describe the characteristics of different UIs and suggest.
Microsoft Assistive Technology Products Brought to you by... Jill Hartman.
E.g.: MS-DOS interface. DIR C: /W /A:D will list all the directories in the root directory of drive C in wide list format. Disadvantage is that commands.
CS221 Algorithm Basics. What is an algorithm? An algorithm is a list of instructions that transform input information into a desired output. Each instruction.
Computer Literacy for IC 3 Unit 2: Using Productivity Software Chapter 3: Formatting and Organizing Paragraphs and Documents © 2010 Pearson Education,
USER INTERFACE DESIGN (UID). Introduction & Overview The interface is the way to communicate with a product Everything we interact with an interface Eg.
BY KALP SHAH Sentence Recognizer. Sphinx4 Sphinx4 is the best and versatile recognition system. Sphinx4 is a speech recognition system which is written.
7th Meeting TYPE and CLICK. Keyboard Keyboard, as a medium of interaction between user and machine. Is a board consisting of the keys to type a sentence.
Reducing uncertainty in speech recognition Controlling mobile devices through voice activated commands Neil Gow, GWXNEI001 Stephen Breyer-Menke, BRYSTE003.
Human Computer Interface INT211
Multi-Modal Dialogue in Personal Navigation Systems Arthur Chan.
Digital Literacy: Computer Basics
Chapter – 8 Software Tools.
Speech Processing 1 Introduction Waldemar Skoberla phone: fax: WWW:
Printed Reports Analysis questions –Who will use the report? –What is the purpose of the report? –When or how often is the report needed? –Where does the.
By: Nicole Cappella. Why I chose Speech Recognition  Always interested me  Dr. Phil Show Manti Teo Girlfriend Hoax  Three separate voice analysts proved.
PREPARED BY MANOJ TALUKDAR MSC 4 TH SEM ROLL-NO 05 GUKC-2012 IN THE GUIDENCE OF DR. SANJIB KR KALITA.
G. Anushiya Rachel Project Officer
Enable Talk Prepared By: Alaa Mayyaleh Shurouq Abu salhiah.
11.10 Human Computer Interface
System Design Ashima Wadhwa.
Artificial Intelligence for Speech Recognition
Unit 2 User Interface Design.
Google translate app demo
Evaluation of a multimodal Virtual Personal Assistant Glória Branco
PhoNET Voice based web access ASWIN.P S3 EC ROLL : 24.
Enable Talk Prepared By: Alaa Mayyaleh Shurouq Abu salhiah.
Command Me Specification
Food Inventory Tracker
Human and Computer Interaction (H.C.I.) &Communication Skills
The Application of Hidden Markov Models in Speech Recognition
Evaluation of a multimodal Virtual Personal Assistant Glória Branco
Presentation transcript:

Some Voice Enable Component Group member: CHUAH SIONG YANG LIM CHUN HEAN Advisor: Professor MICHEAL Project Purpose: For the developers, a more general, kindly user interface for application is important to a product. In the definition of user interface, is the system by which users interact with a machine, and includes hardware(physical) and software(logical) components. Generally, the goal of improvement is to provide a minimal input to achieve the desired output, and also minimizes undesired output. And we are working on the VUI(voice user interface), which makes human interaction with computer possible through a voice/speech platform in order to initiate an automated service or process. A accurate output for users is important, and the VUI must respond quickly, people do not have patient to wait a few seconds for the results, so when we use the VUI at some mobile devices, a lightly VUI is more useful. Because mobile devices mostly have no keyboard, they often use a touch screen keyboard to replace the input method of keyboard, but button-pressing on devices with such small buttons can be tedious and inaccurate, so an easy-to-use, accurate, and reliable VUI would potentially be a major breakthrough in the ease of use. We choose the pocketsphinx, which is a product of CMUSphinx(Carnegie Mellon University's Sphinx), and is a free open source for us to use. Speech Recognition: We use the pocketsphinx to achieve this part, to translate the spoken words into text. The process is also known as “automatic speech recognition(ASR)”, or “speech to text(STT)”. Speech recognition applications include voice user interfaces such as voice dialling, call routing, domotic appliance control, search, simple data entry, preparation of structured documents, speech-to-text processing, and aircraft. The performance of speech recognition systems is usually evaluated in terms of accuracy and speed. Accuracy is usually rated with word error rate (WER), whereas speed is measured with the real time factor. Other measures of accuracy include Single Word Error Rate (SWER) and Command Success Rate (CSR). Both acoustic modeling and language modeling are important parts of modern statistically-based speech recognition algorithms. Hidden Markov models (HMMs) are widely used in many systems. Language modeling is also used in many other natural language processing applications such as document classification or statistical machine translation. Basic concept of Speech: Speech is a complex phenomenon. The naive perception is often that speech is built with words, and each word consists of phones. Speech is a dynamic process without clearly distinguished parts. That means that there are no certain boundaries between units, or between words. Speech to text translation and other applications of speech are never 100% correct. Speech is a continuous audio stream where rather stable states mix with dynamically changed states. Words are understood to be built of phones, but this is certainly not true. The acoustic properties of a waveform corresponding to a phone can vary greatly depending on many factors - phone context, speaker, style of speech and so on. The so called coarticulation makes phones sound very different from their “canonical” representation. Next, since transitions between words are more informative than stable regions, developers often talk about diphones - parts of phones between two consecutive phones. Sometimes developers talk about subphonetic units - different substates of a phone. Our Project: We choose a application of a restaurant’s menu ordering system at Android. Except the interface of touch, also the voice user interface, we can use the voice to order the menu and their amounts. And also a management system is provided to manage the database of menu and the result of the orders. Login interface Login database Ordering system Management system Change menu Add Change Delete Touch user interface Voice user interface Receive Order Check order Order database Menu database References: