Kingston University Creating and Editing Mathematical Content using Natural Language Commands A Progress Report of the TalkMaths Project Eckhard Pflügel.

Presentation transcript:

Creating and Editing Mathematical Content using Natural Language Commands
A Progress Report of the TalkMaths Project
Eckhard Pflügel
Faculty of Science, Engineering and Computing, Kingston University

Overview
Motivation and Objective
Existing Solutions
The Technical Challenges
Our Answer: TalkMaths
Current Progress
Future Steps
Conclusion

Introduction
Our Motivation:
– Not everyone can use a computer with traditional input devices
– Mathematical content is notoriously difficult to access with alternative technologies
Our Objective:
– To carry out research and development leading to a user-friendly tool/system for accessing mathematical content using speech

Speech Recognition Technology
Continuous speech recognition has been around for a while
Recently, it has seen tremendous improvements and media attention (iPhone, Google)
Two different architectures: client-based, server-based
Why would people want to use it?
– Out of convenience?
– Out of necessity!

Desktop-based Speech Recognition Tools
Currently, the most reliable (commercial) tool available is Dragon NaturallySpeaking (DNS)
An alternative is Windows Speech Recognition (free since Vista, much improved in Windows 7)
An open-source (free) solution exists: Sphinx
Other solutions might appear in the near future

Dragon NaturallySpeaking
Popular commercial speech recognition product
Excellent continuous speech recognition rates
Effectively allows for hands-free creation and editing of text
Additional features:
– Commands for typical computer-based tasks
– Supports creation of (application-specific or global) macros

Dragon NaturallySpeaking – Drawbacks
No support for mathematics (all symbols need to be spelled out)
Does not integrate with any mathematical editor
Cannot translate speech input into specific mathematical markup (MathML, LaTeX)
Development of more sophisticated macros requires an (expensive) SDK
Anecdotal evidence of mediocre user support

Standards for Spoken Mathematics
Motivated by different contexts:
– Dictating to other human beings
– Input for computer systems (parsing)
– Audio output for text-to-speech systems
Probably the first documented source: Chang's booklet [1]
Fateman [2,3] gives fairly detailed rules for spoken mathematics
Another approach: [4]
Raman [5] is motivated by text-to-speech (TTS)
Our contribution, based on [2,3]: Wigmore [6]

Existing Tools and Systems
MathTalk
– Collection of Dragon NaturallySpeaking macros
– Only usable as input for specific computational maths interface
– Commercial product (expensive)
– Limited scope
Maths Speak & Write
– Uses Windows speech technology
– Desktop application with GUI
– Supports multi-modal input
– Research project
– Mainly experimental relevance

Related Tool: Speed
– Goal: programming by voice
– Speech plug-in for the Eclipse IDE
– Uses Java interface for DNS
– Speech input with keyboard and mouse
– Exports spoken version of Java
– This includes navigation
– Also a research project
Overall conclusion: none of the existing tools is fit for purpose.

Technical Challenges
Research aspect: Need tools and techniques from natural language processing, compiler construction and HCI
Standards for spoken maths
– Need flexible and powerful grammar
– Have to deal with ambiguity
Parsing Algorithms
– Difficulties: incomplete or incorrect input
Editing Paradigms
– Need novel strategies for speech-driven UIs
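
The grammar, ambiguity and parsing challenges listed on this slide can be illustrated with a small sketch. It is not the TalkMaths grammar or parser: the vocabulary, the class name `SpokenMathParser` and the function `spoken_to_latex` are all hypothetical, chosen only for illustration. A phrase such as "a plus b over c" could mean either a + b/c or (a + b)/c; this sketch resolves the ambiguity with a fixed precedence ("over" binds tighter than "plus") and reports incomplete input as an error instead of guessing.

```python
# Hypothetical vocabulary for illustration; not the TalkMaths grammar.
NUMBERS = {"zero": "0", "one": "1", "two": "2", "three": "3", "four": "4",
           "five": "5", "six": "6", "seven": "7", "eight": "8", "nine": "9"}
ADDITIVE = {"plus": "+", "minus": "-"}


class SpokenMathParser:
    """Recursive-descent parser turning spoken words into a LaTeX string."""

    def __init__(self, utterance):
        self.tokens = utterance.lower().split()
        self.pos = 0

    def peek(self):
        return self.tokens[self.pos] if self.pos < len(self.tokens) else None

    def take(self):
        token = self.peek()
        self.pos += 1
        return token

    def parse(self):
        latex = self.expression()
        if self.peek() is not None:
            raise ValueError(f"unexpected word: {self.peek()!r}")
        return latex

    def expression(self):
        # expression := term (("plus" | "minus") term)*
        latex = self.term()
        while self.peek() in ADDITIVE:
            op = ADDITIVE[self.take()]
            latex = f"{latex} {op} {self.term()}"
        return latex

    def term(self):
        # term := power ("over" power)*   ("over" binds tighter than "plus")
        latex = self.power()
        while self.peek() == "over":
            self.take()
            latex = f"\\frac{{{latex}}}{{{self.power()}}}"
        return latex

    def power(self):
        # power := atom ["squared" | "cubed"]
        latex = self.atom()
        if self.peek() in ("squared", "cubed"):
            exponent = "2" if self.take() == "squared" else "3"
            latex = f"{latex}^{{{exponent}}}"
        return latex

    def atom(self):
        token = self.take()
        if token is None:
            raise ValueError("incomplete input: expected a number or variable")
        if token in NUMBERS:
            return NUMBERS[token]
        if len(token) == 1 and token.isalpha():
            return token
        raise ValueError(f"unknown word: {token!r}")


def spoken_to_latex(utterance):
    return SpokenMathParser(utterance).parse()


print(spoken_to_latex("a plus b over c"))     # a + \frac{b}{c}
print(spoken_to_latex("x squared plus two"))  # x^{2} + 2
```

A real system would need a far richer grammar, a way for the speaker to override the default grouping, and recovery strategies that keep a partial expression instead of rejecting it.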

TalkMaths Background
I use speech recognition for my own work
Frequently needed specialist task: entering/modifying mathematical equations
Could use Equation Editor combined with DNS macros
More problematic if using LaTeX
Idea: write more sophisticated commands
Turned this into a research project

Our Answer: TalkMaths
Web-based User Interface/System
Separate the application from the speech front-end
Devise a special class of speech commands (speech templates)
Insight: spoken mathematics can and should be processed similarly to spoken structured content (e.g. markup languages, programming code)
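
The slides do not define speech templates in detail, so the following is only one possible reading, sketched under that assumption: a spoken command inserts a structural skeleton with open slots, and later phrases fill the slots from left to right, much as a structured editor for markup or code would. The `TEMPLATES` table, the `<?>` slot marker and the `say` function are hypothetical names, not part of TalkMaths.

```python
# Hypothetical slot marker and template table; not taken from TalkMaths.
SLOT = "<?>"
TEMPLATES = {
    "fraction": r"\frac{<?>}{<?>}",
    "square root": r"\sqrt{<?>}",
}


def say(expression, phrase):
    """Apply one spoken phrase to the expression being built.

    A known template name expands into its skeleton; any other phrase is
    inserted literally. Either way it fills the leftmost open slot.
    """
    if SLOT not in expression:
        raise ValueError("no open slot left to fill")
    filler = TEMPLATES.get(phrase, phrase)
    return expression.replace(SLOT, filler, 1)


expression = SLOT
for phrase in ["fraction", "x", "square root", "y"]:
    expression = say(expression, phrase)

print(expression)  # \frac{x}{\sqrt{y}}
```

Because a template can itself be used as a filler, nested structures fall out of the same mechanism, which is the sense in which spoken mathematics resembles other spoken structured content.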

TalkMaths UI

TalkMaths.org Web UI

TalkMaths.org Web UI – Future Version

How can TalkMaths Help with Accessibility of Mathematics?
TalkMaths is aimed at people for whom it is
– Difficult to use keyboard/mouse
– Difficult to decipher equations on screen
TalkMaths can help by
– Letting users speak mathematical input
– Equation rendering: arbitrarily large font sizes
– Voice-activated zooming function
– Editing mathematical expressions by voice
– Planned features: playback of formulae, importing/exporting existing documents, maintaining documents on the server

Progress Report
We are developing a speech front-end prototype using Windows Speech Recognition
Still have issues with the DNS interface
About to release a new version of the parser with more robust error recovery
New website design in progress

Future Work
Extending the range of covered mathematics
Allowing for multi-line equations and multiple/embedded expressions
Potentially allowing sharing of maths input
Improved speech editing (select and say)
Higher robustness of code, ease of use and installation
Improved documentation

Ensuring Continuation
Ph.D. student support (full-time until March 2014, part-time until 2017)
Industrial funding?
EU funding?
Domain/hosting not that expensive

Conclusion
TalkMaths seems to be a novel application
TalkMaths can be very helpful for members of Higher Education with computer access problems
The current prototype will improve significantly over the next few years
Keep up to date:

Acknowledgements
This work would have been impossible without my colleagues and students:
– James Denholm-Price
– Gordon Hunter
– (and others)
Funded by
– EPSRC Doctoral Training Award
– MSOR Mini Project grant

References
1. Lawrence A. Chang. Handbook for spoken mathematics (Larry's speakeasy). Lawrence Livermore National Laboratory, University of California, USA.
2. R. Fateman. How can we speak math? University of California at Berkeley.
3. R. Fateman. 2-D Display of Incomplete Mathematical Expressions. University of California at Berkeley.
4. Cameron Elliott and Jeff A. Bilmes. Computer based mathematics using continuous speech recognition. CHI 2007 Workshop on Striking a C[h]ord: Vocal Interaction in Assistive Technologies, Games and More.
5. T. V. Raman. Audio system for technical readings. Springer Verlag, Berlin.
6. A. Wigmore. Speech-Based Creation and Editing of Mathematical Content. Ph.D. Thesis, Kingston University, U.K.