Customized Spell Corrector


Customized Spell Corrector
Aviad Ashkenazi, Matan Zinger
March 2012

Agenda
- Overview of Dysgraphia
- Short overview of Natural Language Processing
- Using NLP to solve Dysgraphia symptoms
- Dyspeller Application
- Demonstration

Overview – Cognitive Writing Process
Dysgraphia may be caused by “damage” to any of these modules.

Overview – Different Types of Dysgraphia
- Surface Dysgraphia: damage in the lexical flow
  - The sub-lexical flow is used instead
  - Symptoms: replacing homophonic letters, difficulty with irregular words
  - No mistakes appear in univalent words
  - Similar symptoms appear in children without dysgraphia
- Phonological Dysgraphia: damage in the sub-lexical flow
  - Difficulty writing unfamiliar words (which require translating phonemes into graphemes)
  - No mistakes when the lexical flow is used (e.g. for familiar words)
- Peripheral Dysgraphia: damage in the grapheme buffer
  - Word length is one of the most critical factors
  - Symptoms: reordering internal letters, doubling letters, omitting letters

Overview – Natural Language Processing
- Purpose: machine understanding of human-generated text
- Common terminology:
  - Tokenization
  - Lemmatization / Stemming
  - “Stop Words”
  - Part of Speech Tagging
  - Text Search, TF-IDF
  - Levenshtein Distance
    - For spell checking / fuzzy search
    - Ranking by the level of distance
  - Semantic Understanding
- Popular Open-Source Library: Lucene.NET
  - Provides many generic NLP capabilities
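To make the Levenshtein-distance ranking concrete, here is a minimal Python sketch. It is not the presenters' code and does not use Lucene.NET; the function names `levenshtein` and `rank_by_distance` and the sample vocabulary are assumptions chosen for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    and substitutions needed to turn a into b."""
    prev = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def rank_by_distance(word: str, vocabulary: list[str]) -> list[str]:
    """Order candidate words by distance to word (ties broken alphabetically)."""
    return sorted(vocabulary, key=lambda w: (levenshtein(word, w), w))


# Hypothetical vocabulary, for illustration only.
suggestions = rank_by_distance("speling", ["spoiling", "sapling", "spelling"])
```

A regular spell checker would typically offer the first few entries of such a ranked list as suggestions.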

NLP to Solve Dysgraphia Symptoms
- Regular spell checker
  - For which cases will it work well?
  - Is it good enough for Dysgraphia?
- Customized spell checker
  - How will it work?
  - What is required?
  - Isn’t it better?
- Symptoms we chose to handle
  - Homophonic replacement of letters (“Dyscravia”)
  - Doubling letters (Grapheme Buffer Dysgraphia)
  - Changing internal order (Grapheme Buffer Dysgraphia)

The Dyspeller Application
- Classification Module
  - Uses a series of tests (presented as a “game”)
  - Determines the user's “Dysgraphia Profile”: the symptoms common for that user
- Personalized Spell Checker
  - For every misspelled word, we look for the nearest correct word
  - The search uses a “Personalized Dysgraphia Distance” rather than Levenshtein distance
  - The distance between two words is the number of Dysgraphia symptoms, of the kinds typical for this specific user, that must be fixed to turn word B into word A
- Publishing Module
  - The corrected text can be sent via SMS or email to any of the user's contacts
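One possible reading of the “Personalized Dysgraphia Distance” can be sketched in Python as a breadth-first search that counts how many symptom fixes separate two words. This is an assumption-laden illustration, not the Dyspeller implementation: the English voicing pairs in `VOICING_PAIRS`, the three fixer functions, the profile keys, and the `max_steps` cutoff are all hypothetical.

```python
from collections import deque

# Hypothetical voiced/voiceless letter pairs, standing in for the
# homophonic substitutions described on the slides.
VOICING_PAIRS = {"b": "p", "p": "b", "d": "t", "t": "d", "g": "k",
                 "k": "g", "v": "f", "f": "v", "z": "s", "s": "z"}


def fix_phonetic(word):
    """Swap one letter for its voicing counterpart."""
    for i, ch in enumerate(word):
        if ch in VOICING_PAIRS:
            yield word[:i] + VOICING_PAIRS[ch] + word[i + 1:]


def fix_double_letter(word):
    """Collapse one doubled letter into a single letter."""
    for i in range(len(word) - 1):
        if word[i] == word[i + 1]:
            yield word[:i] + word[i + 1:]


def fix_internal_reorder(word):
    """Swap two adjacent internal letters (first and last stay in place)."""
    for i in range(1, len(word) - 2):
        if word[i] != word[i + 1]:
            yield word[:i] + word[i + 1] + word[i] + word[i + 2:]


FIXERS = {"phonetic": fix_phonetic,
          "double": fix_double_letter,
          "reorder": fix_internal_reorder}


def dysgraphia_distance(misspelled, target, profile, max_steps=3):
    """Fewest symptom fixes (restricted to the user's profile) that turn
    misspelled into target, or None if unreachable within max_steps."""
    if misspelled == target:
        return 0
    seen = {misspelled}
    queue = deque([(misspelled, 0)])
    while queue:
        word, steps = queue.popleft()
        if steps == max_steps:
            continue
        for symptom in profile:
            for candidate in FIXERS[symptom](word):
                if candidate == target:
                    return steps + 1
                if candidate not in seen:
                    seen.add(candidate)
                    queue.append((candidate, steps + 1))
    return None
```

Under this sketch, a user whose profile contains only the doubling symptom gets no path from "tesk" to "desk", while a user with the phonetic symptom gets a distance of 1, which is what makes the ranking personal.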

Dyspeller - Design
- Suggestion Processing – Calculating Dysgraphia Distance
- HTTP/GET: Suggestions by symptoms
  - Double Letter Symptom
  - Internal Reorder Symptom
  - Phonetic Replace Symptom
- Valid Words Data Set
- Response: misspelled word -> suggestions list (JSON format)
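The suggestion flow in this design can be sketched roughly as follows, in Python. The slides do not specify the request parameters or the exact JSON schema, so the symptom keys, the `VALID_WORDS` set, the `VOICING` table, and the response shape (the misspelled word mapped to a list of suggestions) are assumptions. The sketch only undoes a single symptom occurrence per candidate and leaves out the HTTP layer.

```python
import json

# Hypothetical stand-in for the valid-words data set.
VALID_WORDS = {"desk", "garden", "letter"}

# Hypothetical voicing pairs used for the phonetic-replace symptom.
VOICING = {"b": "p", "p": "b", "d": "t", "t": "d", "g": "k",
           "k": "g", "v": "f", "f": "v", "z": "s", "s": "z"}


def candidates(word):
    """Yield (symptom, candidate) pairs, each undoing one symptom occurrence."""
    for i in range(len(word)):
        if word[i] in VOICING:
            yield "phonetic_replace", word[:i] + VOICING[word[i]] + word[i + 1:]
        if i + 1 < len(word) and word[i] == word[i + 1]:
            yield "double_letter", word[:i] + word[i + 1:]
        if 1 <= i < len(word) - 2 and word[i] != word[i + 1]:
            yield ("internal_reorder",
                   word[:i] + word[i + 1] + word[i] + word[i + 2:])


def suggest(word, symptoms):
    """Return a JSON string mapping the misspelled word to valid suggestions
    reachable by fixing one of the requested symptoms."""
    found = sorted({cand for symptom, cand in candidates(word)
                    if symptom in symptoms and cand in VALID_WORDS})
    return json.dumps({word: found})
```

A GET handler could call `suggest` with the word and the symptom list taken from the query string and return the resulting string as the response body.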

DEMO

Thank You.

References:
- Gvion, Friedmann, Yachini (2008). Dysgraphia.
- Aviah Gvion, Naama Friedmann (2009). Letter position dysgraphia.
- Aviah Gvion, Naama Friedmann (2010). Dyscravia: Voicing substitution dysgraphia.