Design of a Speech Recognition System to Assist Hearing Impaired Students Richard Kheir 2 and Thomas P. Way Department of Computing Sciences, Villanova.

Slides:

Advertisements

Similar presentations

IBM WebSphere Everyplace Access for Multiplatforms Managing the e-business Customer Experience.

Advertisements

MicroKernel Pattern Presented by Sahibzada Sami ud din Kashif Khurshid.

Speech Recognition There are different kinds of voice or speech “_______" that take the sounds of your voice and match it with words. The engine is software.

EE3P BEng Final Year Project – 1 st meeting SLaTE – Speech and Language Technology in Education Martin Russell

What is the Internet? Internet: The Internet, in simplest terms, is the large group of millions of computers around the world that are all connected to.

Assistive Technology Hearing (deaf or hard of hearing)

UI Standards & Tools Khushroo Shaikh.

Lecturing with Digital Ink Richard Anderson University of Washington.

8 Systems Analysis and Design in a Changing World, Fifth Edition.

11/05/99 1 eBusiness The Software Production View Project Summary.

1 Making the Case for Speech Recognition in the Legal Environment Dragon ® NaturallySpeaking ® Legal from Nuance.

Implementing Unified Messaging Joseph Blanchard Joseph Mancuso S. Paul Petroski.

User Interfaces. User Interface What do we mean by a user interface? The user is the person who is using the computer. A user interface is what he or.

Dragon Naturally Speaking Tutorial What is Dragon Naturally Speaking? Dragon is a dictation software, students can dictate a paper rather than type it.

EIA : “Automated Understanding of Captured Experience” Georgia Institute of Technology, College of Computing Investigators: Irfan Essa, G. Abowd,

1 Dragon NaturallySpeaking: Training Agenda. What to Expect Goals: Method / Essential Skills / Getting Help Starting to use speech-recognition software.

1 Introduction to Web Development. Web Basics The Web consists of computers on the Internet connected to each other in a specific way Used in all levels.

What is Business Intelligence? Business intelligence (BI) –Range of applications, practices, and technologies for the extraction, translation, integration,

Component 4: Introduction to Information and Computer Science Unit 10: Future of Computing Lecture 2 This material was developed by Oregon Health & Science.

Voice Recognition Software : Helping You Teach! A Title V Cooperative Workshop Oct. 3, 2007 Holly Hofmann A Title V Cooperative Workshop Oct. 3, 2007 Holly.

1 “ Speech ” EMPOWERED COMPUTING Greenfield Business Centre, 20 th September, 2006.

QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.

Systems Analysis – Analyzing Requirements.  Analyzing requirement stage identifies user information needs and new systems requirements  IS dev team.

Practical AT session 3 WP4-D4.2. Prepared by: Shams Eldin Mohamed Ahmed Hassan Speech, Text and Braille AT.

Assistive Technology for Students with Auditory Processing Disabilities.

POSITIONING STATEMENT For people who operate shared computers with Genuine Windows XP, the Shared Computer Toolkit is an affordable, integrated, and easy-to-use.

Instant Messaging for the Workplace A pure collaborative communication tool that does not distract users from their normal activities.

Prajaks Jitngernmadan Kanuengnij Kubola Faculty of Informatics, Burapha University IEC 2015 July 22 nd, 2015.

Group Members: Group Members:.  Introduction  Current Scenario  Proposed Solution  Block Diagram  Technical Implementation  Hardware & Software.

Chapter 14 Information System Development

What is the Internet? Internet: The Internet, in simplest terms, is the large group of millions of computers around the world that are all connected to.

Instant Messaging for the Workplace A pure collaborative communication tool that does not distract users from their normal activities.

Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.

Chapter 8 Collecting Data with Forms. Chapter 8 Lessons Introduction 1.Plan and create a form 2.Edit and format a form 3.Work with form objects 4.Test.

Spoken Dialog Systems and Voice XML Lecturer: Prof. Esther Levin.

Intro to Network Design

Technology for Students with Hearing Disabilities Chapter Seven.

Syllabus Management System. The Problem There is need for a management system for syllabi that: Provides a simple and effective user interface Allows.

5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.

Enabling Reuse-Based Software Development of Large-Scale Systems IEEE Transactions on Software Engineering, Volume 31, Issue 6, June 2005 Richard W. Selby,

9 Systems Analysis and Design in a Changing World, Fourth Edition.

A Goal Based Methodology for Developing Domain-Specific Ontological Frameworks Faezeh Ensan, Weichang Du Faculty of Computer Science, University of New.

Why Should I Use Speech Recognition? Kim Larsh, Presenter Mesa Public Schools Mesa,AZ.

Human Computer Interaction CITB 243 Chapter 1 What is HCI

Controlling Computer Using Speech Recognition (CCSR) Creative Masters Group Supervisor : Dr: Mounira Taileb.

1 Title: Introduction to Computer Instructor: I LTAF M EHDI.

Investigation on the Library Robot “Alice” in an enterprise Lin Yuan.

Instructor: Richard Fredrickson. Desktop Support Specialist Diploma program Course: DESK 201.

Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.

HTML5 based Notification System for Updating E-Training Contents Yu-Doo Kim 1 and Il-Young Moon 1 1 Department of Computer Science Engineering, KoreaTech,

Thepul Ginige Lecture-7 Implementation of Information System Thepul Ginige.

Selenium server By, Kartikeya Rastogi Mayur Sapre Mosheca. R

Your Interactive Guide to the Digital World Discovering Computers 2012 Chapter 12 Exploring Information System Development.

DAT602 Database Application Development Lecture 1 Course Structure & Background knowledge.

2.0 PROJECT INITIATION AND PLANNING The initiating and planning are the phase where process or workflow to develop the system will identify and planning.

Notes for Speech Recognition. Speech Recognition Continuous Speech Recognition (CSR) is the software that allows users to speak normally and input data.

PREPARED BY MANOJ TALUKDAR MSC 4 TH SEM ROLL-NO 05 GUKC-2012 IN THE GUIDENCE OF DR. SANJIB KR KALITA.

ARIEL TURNER—ED 505 ASSISTIVE TECHNOLGY. WHAT IS ASSISTIVE TECHNOLOGY?  With the growth of students with disabilities in mainstream classrooms, it is.

INTRODUCTION TO AUDIOLOGY (SPHS 1100) WEEK 7 POWER POINT TOPICS  ASSISTIVE DEVICES FOR HEARING IMPAIRED  AUGMENTING DEVICES  TRANSFORMING DEVICES.

Systems Analysis and Design in a Changing World, Fifth Edition

The Development Process of Web Applications

Automatic Speech Recognition

Computer Aided Software Engineering (CASE)

A presentation on Basics of Speech Recognition Systems

APARTMENT MAINTENANCE SYSTEM

Dr. ElSayed Eissa Hemayed

Building Information Systems

Course: Module: Lesson # & Name Instructional Material 1 of 32 Lesson Delivery Mode: Lesson Duration: Document Name: 1. Professional Diploma in ERP Systems.

Chapter 15: Accounting and Enterprise Software

VoiceXML An investigation Author: Mya Anderson

Presentation transcript:

Design of a Speech Recognition System to Assist Hearing Impaired Students Richard Kheir 2 and Thomas P. Way Department of Computing Sciences, Villanova University Abstract Background Applications Four general application categories for ASR are: Command Recognition Dictation Interactive Voice Response (IVR) Assistive Technologies Motivation System Design Part 1 - DiBS Low recognition rate for domain specific jargon is one of the key weaknesses in ASR. DiBS was developed to solve this problem. Table: Summary of the accuracy results for five scenarios. DescriptionAccuracyRange Usability Untrained75%64%-83%Poor to fair Minimal Training88%78%-93%Sufficient Moderate Training90%81%-96%Good Moderate Training and Customized dictionary91%83%-96%Good Moderate Training, Customized Dictionary and pronunciations94%86%-98%Very good System Design Part 2 - VUST Table. Recognition accuracy for 4 classifications of classroom speech. Classification Words Correct Total Words Percent Recognized Planning % Lecture % Roll-call % Discussion % TOTAL % Contributions & Future Work Contributions Proved to be an affordable and beneficial assistive system Provides an easy to use software Improves Recognition Accuracy Distributed and portable application Future work  Commercial Quality  Post speech profiles and jargon in a central repository  Evaluate other speech engines  Deploy in classrooms SERVER Consists of three major components: the speech recognition software, a dictionary enhancement tool, and a transcription distribution application. Uses an ASR system designed to be affordable, accurate and easy to set up and use. Around one hour of speech training are enough to get good accuracy Training through windows control panel or through the VUST instructor’s Console Simple setup and configuration. User friendly interface Instructor initiates transcription Students connect via web applet Accurate results even without added jargon (table below) We have tested the ASR system with five scenarios: Untrained, some training, moderate training, moderate training and some added jargon using DiBS and moderate training with added jargon and custom pronunciation for the added jargon. Many enhancements took place on specific domains during the following years such as the introduction of the Hidden Markov Model (HMM). At the beginning of the 21st century, commercial speech recognition systems finally became practical and affordable, with many products on the market. The most popular vendors being IBM and Dragon. The quest for automatic speech recognition (ASR) started in 1939 with the introduction of VODER by AT&T. With the now wide availability of ASR software, the technology has become an application area that is emerging in assistive technology. For people who are deaf and hard of hearing, the accessibility and freedom that can be afforded by using a computer to recognize speech is finally beginning to be realized. The design of such a truly usable ASR system requires an understanding of the approaches, user requirements, and available technology. Speech recognition software is maturing, and possesses the potential to provide real-time note taking assistance in the classroom, particularly for deaf and hard of hearing students. This research talks about speech recognition in general, and reports on a practical, portable and readily deployed application that provides a cost-effective, automatic transcription system with the goal of making computer science lectures inclusive of deaf and hard of hearing students. The design of the system is described, some specific technology choices and implementation approaches are discussed, and results of two phases of an in-class evaluation of the system are analyzed. Ideas for student research projects that could extend and enhance the system also are proposed. Nady UHF-3 wireless headset system 3 …click ‘Connect and Start Recognition’ to start VUST server. Run the VUST program and selects a speech profile. 2 1 Connect wireless microphone receiver to computer and wear headset & transmitter. 1 Connect to VUST transcription server URL using web browser. 2 1 Select available connection, and click “Connect”. 3 Transcription is received once the lecture begins. 28 million deaf and hard of hearing individuals in the US (Around 500 million world wide) Limited benefit from hearing aids and cochlear implants as these are most useful in face to face conversations Note takers and sign language interpreters are expensive to hire and provide limited assistance due to the need to paraphrase during a lecture Developing countries provide no assistance Commercial ASR systems are expensive to acquire