Deya Banisakher Megan Biondo. Research group (Summer 2012) Faculty Mentor: Prof. Marjorie Skubic Graduate Mentor: Ms. Tatiana Alexenko Undergraduate Student.

Slides:



Advertisements
Similar presentations
Saya Web Interface Project Edward Rafaelov & Vladimir Postel DEC Advisors: Prof. Shlomi Dolev & Michael Orlov.
Advertisements

Facts about Welcome to this video from Ozeki. In this video I will present what makes Ozeki Phone System XE the Worlds best on-site software PBX for Windows.
Handheld Wireless GPS Tracking Device Senior Design EEL 4914C : UWF, Fall 2006.
Your Interactive Guide to the Digital World Discovering Computers Fundamentals, 2012 Edition.
TURTLEBOT ROBOTIC GUIDE. Project Description Teach a robot to guide a person to a predefined destination. General requirements: 1. Use a Turtlebot as.
UNDERSTANDING JAVA APIS FOR MOBILE DEVICES v0.01.
Discovering Computers: Chapter 1
TC2-Computer Literacy Mr. Sencer February 4, 2010.
SNMP & MIME Rizwan Rehman, CCS, DU. Basic tasks that fall under this category are: What is Network Management? Fault Management Dealing with problems.
SoNDa Sensor Network for Data Explore! 1. SoNDa Sensor Network for Data Explore! KEYWORDS Wireless Sensors Communication 2.
Application Layer. Applications A program or group of programs designed for end users. A program or group of programs designed for end users. Software.
Networking Hardware and Components By: Sean Bell.
Chapter 1: Voilà! Meet the Android. Smartphones –Can browse the Web –Allow you to play games –Use business applications –Check –Play music –Record.
Amarino:a toolkit for the rapid prototyping of mobile ubiquitous computing Bonifaz Kaufmann and Leah Buechley MIT Media Lab High-Low Tech Group Cambridge,
Computer for Health Sciences
SoundSense: Scalable Sound Sensing for People-Centric Application on Mobile Phones Hon Lu, Wei Pan, Nocholas D. lane, Tanzeem Choudhury and Andrew T. Campbell.
Component 4: Introduction to Information and Computer Science Unit 10: Future of Computing Lecture 2 This material was developed by Oregon Health & Science.
Reader Animated Storybooks Animation Process Software Design A Solution The Problem Americans on average watch two hours of television each day. They read.
Xiaoyu Tong and Edith C.-H. Ngai Dept. of Information Technology, Uppsala University, Sweden A UBIQUITOUS PUBLISH/SUBSCRIBE PLATFORM FOR WIRELESS SENSOR.
Assistive Technology Russell Grayson EDUC 504 Summer 2006.
A VERY USEFUL E-LEARNING TOOL FOR TEACHERS, RESEARCHERS, AND STUDENTS.
Semester Programming Project for CS 356
Databases and the Internet. Lecture Objectives Databases and the Internet Characteristics and Benefits of Internet Server-Side vs. Client-Side Special.
Guide to Linux Installation and Administration, 2e1 Chapter 3 Installing Linux.
Chapter 4 System Software.
Ruslan Masinjila Aida Militaru.  Nature of the Problem  Our Solution: The Roaming Security Robot  Functionalities  General System View  System Design.
Leslie Luyt Supervisor: Dr. Karen Bradshaw 2 November 2009.
Component 4: Introduction to Information and Computer Science Unit 10b: Future of Computing.
©G. Millbery 2001Communications and Networked SystemsSlide 1 Purpose of Network Components  Switches A device that controls routing and operation of a.
Chapter 4 – Slide 1 Effective Communication for Colleges, 10 th ed., by Brantley & Miller, 2005© Technology and Electronic Communication.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
Android for Java Developers Denver Java Users Group Jan 11, Mike
Mobile Navigation With SVG Christian Schmitt SVG Open 2005.
Web Programming : Building Internet Applications Chris Bates CSE :
GEOREMINDERS ANDROID APPLICATION BY: ADRIENNE KECK.
IT Introduction to Information Technology CHAPTER 01.
CS 3131 Introduction to Programming in Java Rich Maclin Computer Science Department.
MobileMAN Internal meetingHelsinki, June 8 th 2004 NETikos activity in MobileMAN project Veronica Vanni NETikos S.p.A.
NETWORK HARDWARE AND SOFTWARE MR ROSS UNIT 3 IT APPLICATIONS.
Chapter 6 Data Communications. Network Collection of computers Communicate with one another over transmission line Major types of network topologies What.
Foundation year Lec.3: Computer SoftwareLec.3: Computer Software Lecturer: Dalia Mirghani Year: 2014/2015.
Presented by:Supervisors: Fuad Amira Dr. Raed Qadi Reem SalousDr. Samer Arandi MY PATH KEEPER.
Real-Time Cyber Physical Systems Application on MobilityFirst Winlab Summer Internship 2015 Karthikeyan Ganesan, Wuyang Zhang, Zihong Zheng Shantanu Ghosh,
1 Title: Introduction to Computer Instructor: I LTAF M EHDI.
NETWORKING FUNDAMENTALS. Network+ Guide to Networks, 4e2.
The AlphaEntry Welcome Screen is a customizable, schedulable slideshow. Use it for advertising or photos of your building(s). Voice prompts and text prompts.
PROPOSAL : The Use of Voice Command in Operating Personal Computer By : COLLEGE OF ART & SCIENCE UNIVERSITI UTARA MALAYSIA STIW5023 ADVANCED PROGRAMMING.
© 2013, published by Flat World Knowledge Chapter 10 Understanding Software: A Primer for Managers 10-1.
© Paradigm Publishing, Inc. 4-1 Chapter 4 System Software Chapter 4 System Software.
Collaborator Revolutionizing the way you communicate and understand
Mobile AAC Application for Sentence Creation Team Members: Kevin Greene, Christina Fries, Wei Liao, Hien Huynh, Jiho Kim, Apoorva Dewangan Mentor: James.
Nir Mendel, Yuval Pick & Ilya Roginsky Advisor: Prof. Ronen Brafman
By J Swetha ( ) V V Aishwarya ( ).
1 A Service-based Approach to Developing Android Mobile Internet Device (MID) Applications Hyun Jung La and Soo Dong Kim Department of Computer Science.
THE GEORGE WASHINGTON UNIVERSITY SCHOOL OF ENGINEERING & APPLIED SCIENCE DEPARTMENT OF COMPUTER SCIENCE PRELIMINRY DESIGN TECHLER Sisay Habte CSCI
Project Description MintTrack is a mobile application built for the Android OS that will help keep track of where a user’s money is being spent via expense.
Technician Table Editor Academic advisor : Professor Ehud Gudes Technical advisor : Menny Even Danan Team: Olga Peled Doron Avinoam Ira Zaitsev ADD Presentation.
GSU Indoor Navigation Senior Project Fall Semester 2013 Michael W Tucker.
Chapter 27 Network Management Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Application Sharing Bhavesh Amin Casey Miller Casey Miller Ajay Patel Ajay Patel Bhavesh Thakker Bhavesh Thakker.
Android Mobile Application Development
Fundamentals of Information Systems, Sixth Edition
Technologies in everyday’s life
Enable Talk Prepared By: Alaa Mayyaleh Shurouq Abu salhiah.
Directions: GO THROUGH THE FOLLWING SLIDES. Make sure you have quizlet cards for all the vocabulary. Study the terms.
Enterprise Application Architecture
Enable Talk Prepared By: Alaa Mayyaleh Shurouq Abu salhiah.
Digital Literacy 1.00 Computer Basics
Knowledge Sharing Mechanism in Social Networking for Learning
Presentation transcript:

Deya Banisakher Megan Biondo

Research group (Summer 2012) Faculty Mentor: Prof. Marjorie Skubic Graduate Mentor: Ms. Tatiana Alexenko Undergraduate Student Researchers: Megan Biondo Deya Banisakher

Overview The Group will investigate speech to text tools available for the Android smart phone, test their accuracy, and develop an interface that transmits the recognized text to a mobile robot via wireless networking. If time permits, the students will test the speech interface for sending the robot spatial referencing commands, such as “look for the hat on the table behind the couch”.

Why it matters? Recent studies have shown that one of the top five tasks noted by seniors for assistive robots is help with fetching objects, for example, retrieving missing eyeglasses (Beer et al., 2012), and the preferred form of communication with the robot is a speech interface (Scopelliti et al., 2005).

What We Did… We investigated the use of the built-in speech recognition in Android phones for use in this scenario. We created an Android application and implemented the underlying network and process communication system to support its use. We collected voice recognition transcriptions from old and young people; they spoke into an android device that had a testing application installed which we have developed. We also compared the accuracy of speech recognition on the Android phone for older and younger adults, as well as male and female ones. Integrated a server into ROS for communication with the Android device

Previous Work Skubic et al. have studied spatial language in older and younger populations. In collaboration with Carlson et al. at Notre Dame Dept. of Psychology, they collected speech samples of older and younger adults giving spatial descriptions (Carlson et al, in review). They also created a robot capable of recognizing furniture and processing textual spatial descriptions, in addition to the common robot capabilities such as obstacle avoidance. The robot was made to listen to commands coming from the user through a computer’s keyboard that is wired to the robot itself.

Why Android? Known for high accuracy Freely available in Android-based devices which are being activated at a rate of 1 million devices per day worldwide (Android, 2012). It relies on crowd-sourcing in addition to integration of existing acoustic models. The use of Android devices for this purpose also has technical benefits: The audio processing and transcription is handled by Google’s servers. Android application is easy to install on any Android device. Android devices and the operating systems support a wide range of accessibility features for helping the elderly use the different applications installed. Android devices have built-in microphones, eliminating the need for the user to purchase a headset or other microphone. A speech recognition application allows the user to decide when they want to communicate with the robot, which prevents the robot from reacting to speech directed to other people.

Why Android? (Con’t) It utilizes Java’s API into its development. Android makes it easy and practical for developers to change, switch and supply more resources to their applications by dealing with the XML based resources. XML is a simple language that Android allows developers to use to create and reference to sophisticated screen layouts and other resources such as pictures and videos. Android’s platform and its use of Java’s packages such as java.net, allows developers to use the phones hardware in a matter that is no different than a one in a fully featured computer.

System Components Figure 1. View of overall system communication. User Client Server Wireless Router Internet (Google)

What is ROS? ROS is an open-source, meta-operating system for robots. It provides the services you would expect from an operating system, including hardware abstraction, low-level device control, implementation of commonly-used functionality, message- passing between processes, and package management. It also provides tools and libraries for obtaining, building, writing, and running code across multiple computers. The robot uses ROS (Robot Operating System) which is based around publish-subscribe pattern. The server process inside of ROS publishes the textual transcriptions it receives from the Android device while other processes in the robot (primarily language processing) subscribe to the server’s feed.

What is Inside the Server? ROS TCP Server Data In (from device) Process Data/ Create Message Topic Robot Node Move According to Message/ Sensors Server Side (The Robot) Figure 2. Server communication within ROS.

Application Overview (a) (b) (c) (d)(e) Figure 2. (a) User connects to robot. (b) User chooses to speak into phone. (c) User speaks into phone. (d) Phone displays the possible transcriptions to user. (e) Phone prompts user to send transcription selected to the robot.

Results We tested the accuracy of Android speech recognition for older and younger adults. Accuracy is one way to measure effectiveness of Speech-to-Text. It is calculated by taking the number of correctly transcribed words and dividing by the total number of words spoken. First, recordings were tested. Result: VERY BAD Then live voices were tested. Result: EXCELENT(Relatively!)

Speech Testing Results Original Data From recordings (0 out of 16 perfect) Even with recordings from the younger voices only 13 out of 49 were transcribed correctly

The New Data Younger Adult Voices Older Adult Voices # Trans.AverageStd. Dev.Min.Max.# Perfect% Perfect Men %9.69%66.67%100.00% % Women %14.67%37.50%100.00%840.00% All %12.05%37.50%100.00% % # Trans.AverageStd. Dev.Min.Max.# Perfect% Perfect Men %15.86%42.86%100.00%29.09% Women %16.96%16.67%100.00% % All %16.58%16.67%100.00% %

Accuracy Chart Men 87.65% 86.83% Women Older 82.41% Younger 92.55% 79.25%94.25% 84.66% 90.18% 87.23%

Conclusions The developed Android Application has proved to be effective in sending transcriptions to the server. There has been a significant difference of 10% between older and younger adults’ word accuracy rates with the younger voices leading. The binary comparison between older and younger adults’ transcriptions has also shown that younger voices get transcribed better than older voices. However, Android’s speech recognition proved to be very successful for the overall sample population of older and younger voices.

References 1. Android, Android, the world's most popular mobile platform Robot Operating System (ROS), 3. Beer, J.M., Smarr, C., Chen, T.L., Prakash, A., Mitzner, T.L., Kemp, C.C. & Rogers, W.A The domesticated robot: design guidelines for assisting older adults to age in place. In Proc., ACM/IEEE Intl. Conf. on Human-Robot Interaction, , March, 2012, Boston, MA 4. Scopelliti, M., Giuliani, M., and Fornara, F Robots in a domestic setting: a psychological approach. Universal Access in the Information Society, 4(2): Carlson, L., Skubic, M., Miller, J., Huo, Z., and Alexenko, T. In Review. Investigating Spatial Language Usage in a Robot Fetch Task to Guide Development and Implement of Robot algorithms for Natural Human-Robot Interaction. Topics in Cognitive Science.