A classifier-based approach to preposition and determiner error correction in L2 English Rachele De Felice, Stephen G. Pulman Oxford University Computing.

Slides:

Advertisements

Similar presentations

Specialized models and ranking for coreference resolution Pascal Denis ALPAGE Project Team INRIA Rocquencourt F Le Chesnay, France Jason Baldridge.

Advertisements

Automatic Identification of Cognates, False Friends, and Partial Cognates University of Ottawa, Canada University of Ottawa, Canada.

An Online Microsoft Word Tutorial & Evaluation Begin.

Linear Model Incorporating Feature Ranking for Chinese Documents Readability Gang Sun, Zhiwei Jiang, Qing Gu and Daoxu Chen State Key Laboratory for Novel.

Named Entity Classification Chioma Osondu & Wei Wei.

Playing the Telephone Game: Determining the Hierarchical Structure of Perspective and Speech Expressions Eric Breck and Claire Cardie Department of Computer.

HOO 2012: A Report on the Preposition and Determiner Error Correction Shared Task Robert Dale, Ilya Anisimoff and George Narroway Centre for Language Technology.

A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts 04 10, 2014 Hyun Geun Soo Bo Pang and Lillian Lee (2004)

Recognizing Implicit Discourse Relations in the Penn Discourse Treebank Ziheng Lin, Min-Yen Kan, and Hwee Tou Ng Department of Computer Science National.

Automatic Metaphor Interpretation as a Paraphrasing Task Ekaterina Shutova Computer Lab, University of Cambridge NAACL 2010.

Using Web Queries for Learner Error Detection Michael Gamon, Microsoft Research Claudia Leacock, Butler-Hill Group.

Predicting Text Quality for Scientific Articles AAAI/SIGART-11 Doctoral Consortium Annie Louis : Louis A. and Nenkova A Automatically.

Chapter 1: Introduction to Pattern Recognition

CS Word Sense Disambiguation. 2 Overview A problem for semantic attachment approaches: what happens when a given lexeme has multiple ‘meanings’?

Ensemble Learning: An Introduction

1 Noun Homograph Disambiguation Using Local Context in Large Text Corpora Marti A. Hearst Presented by: Heng Ji Mar. 29, 2004.

On the Correlation between Energy and Pitch Accent in Read English Speech Andrew Rosenberg, Julia Hirschberg Columbia University Interspeech /14/06.

On the Correlation between Energy and Pitch Accent in Read English Speech Andrew Rosenberg Weekly Speech Lab Talk 6/27/06.

Taking the Kitchen Sink Seriously: An Ensemble Approach to Word Sense Disambiguation from Christopher Manning et al.

Boosting Applied to Tagging and PP Attachment By Aviad Barzilai.

Introduction to Bayesian Learning Ata Kaban School of Computer Science University of Birmingham.

Extracting Interest Tags from Twitter User Biographies Ying Ding, Jing Jiang School of Information Systems Singapore Management University AIRS 2014, Kuching,

Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.

Extracting Opinions, Opinion Holders, and Topics Expressed in Online News Media Text Soo-Min Kim and Eduard Hovy USC Information Sciences Institute 4676.

Preposition Usage Errors by English as a Second Language (ESL) learners: “ They ate by* their hands.”  The writer used by instead of with. This work is.

Na-Rae Han (University of Pittsburgh), Joel Tetreault (ETS), Soo-Hwa Lee (Chungdahm Learning, Inc.), Jin-Young Ha (Kangwon University) May , LREC.

Masaryk University, Brno Friday 13 th September Katie Mansfield

Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.

Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.

A Feedback-Augmented Method for Detecting Errors in the Writing of Learners of English Ryo Nagata et al. Hyogo University of Teacher Education ACL 2006.

Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.

Distributional Part-of-Speech Tagging Hinrich Schütze CSLI, Ventura Hall Stanford, CA , USA NLP Applications.

1 Named Entity Recognition based on three different machine learning techniques Zornitsa Kozareva JRC Workshop September 27, 2005.

1 Determining the Hierarchical Structure of Perspective and Speech Expressions Eric Breck and Claire Cardie Cornell University Department of Computer Science.

Improving Subcategorization Acquisition using Word Sense Disambiguation Anna Korhonen and Judith Preiss University of Cambridge, Computer Laboratory 15.

TALC Applying some Developments in Corpus Building Technology to Language Teaching and Learning TALC 2006 Paris.

A Language Independent Method for Question Classification COLING 2004.

1 Determining query types by analysing intonation.

Opinion Holders in Opinion Text from Online Newspapers Youngho Kim, Yuchul Jung and Sung-Hyon Myaeng Reporter: Chia-Ying Lee Advisor: Prof. Hsin-Hsi Chen.

Automated Suggestions for Miscollocations the Fourth Workshop on Innovative Use of NLP for Building Educational Applications Authors:Anne Li-E Liu, David.

Copyright © 2013 by Educational Testing Service. All rights reserved. 14-June-2013 Detecting Missing Hyphens in Learner Text Aoife Cahill *, Martin Chodorow.

Automatic Identification of Pro and Con Reasons in Online Reviews Soo-Min Kim and Eduard Hovy USC Information Sciences Institute Proceedings of the COLING/ACL.

Cho Yiu Catholic Primary School Interrogatives P.4 English.

Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.

Page 1 NAACL-HLT 2010 Los Angeles, CA Training Paradigms for Correcting Errors in Grammar and Usage Alla Rozovskaya and Dan Roth University of Illinois.

Creating Subjective and Objective Sentence Classifier from Unannotated Texts Janyce Wiebe and Ellen Riloff Department of Computer Science University of.

Intelligent Key Prediction by N-grams and Error-correction Rules Kanokwut Thanadkran, Virach Sornlertlamvanich and Tanapong Potipiti Information Research.

Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Case Study Data Analysis of a Non-native English Language Learner.

Detecting Missing Hyphens in Learner Text Aoife Cahill, SusanneWolff, Nitin Madnani Educational Testing Service ACL 2013 Martin Chodorow Hunter College.

On using context for automatic correction of non-word misspellings in student essays Michael Flor Yoko Futagi Educational Testing Service 2012 ACL.

Discovering Relations among Named Entities from Large Corpora Takaaki Hasegawa *, Satoshi Sekine 1, Ralph Grishman 1 ACL 2004 * Cyberspace Laboratories.

Using Wikipedia for Hierarchical Finer Categorization of Named Entities Aasish Pappu Language Technologies Institute Carnegie Mellon University PACLIC.

Correcting Comma Errors in Learner Essays, and Restoring Commas in Newswire Text Ross Israel Indiana University Joel Tetreault Educational Testing Service.

Paper Title Authors names Conference and Year Presented by Your Name Date.

Error Analysis of Two Types of Grammar for the purpose of Automatic Rule Refinement Ariadna Font Llitjós, Katharina Probst, Jaime Carbonell Language Technologies.

Fill in with the correct preposition start start.

A Brief Maximum Entropy Tutorial Presenter: Davidson Date: 2009/02/04 Original Author: Adam Berger, 1996/07/05

Word Sense and Subjectivity (Coling/ACL 2006) Janyce Wiebe Rada Mihalcea University of Pittsburgh University of North Texas Acknowledgements: This slide.

Correcting Misuse of Verb Forms John Lee, Stephanie Seneff Computer Science and Artiﬁcial Intelligence Laboratory, MIT, Cambridge ACL 2008.

Maximum Entropy techniques for exploiting syntactic, semantic and collocational dependencies in Language Modeling Sanjeev Khudanpur, Jun Wu Center for.

Learning Deep Rhetorical Structure for Extractive Speech Summarization ICASSP2010 Justin Jian Zhang and Pascale Fung HKUST Speaker: Hsiao-Tsung Hung.

Error Analysis Session 11. Models for Error Analysis Corder (1967 & 1973) identified a model for error analysis which included three stages: – Data collection.

Short Text Similarity with Word Embedding Date: 2016/03/28 Author: Tom Kenter, Maarten de Rijke Source: CIKM’15 Advisor: Jia-Ling Koh Speaker: Chih-Hsuan.

The University of Illinois System in the CoNLL-2013 Shared Task Alla RozovskayaKai-Wei ChangMark SammonsDan Roth Cognitive Computation Group University.

Tracking parameter optimization

CALL – AN INTRODUCTION • CALL is – a computer-assisted language learning method • It can be contrasted with book-, library-, pen- or cassette-assisted.

Automatic Detection of Causal Relations for Question Answering

Presentation transcript:

A classifier-based approach to preposition and determiner error correction in L2 English Rachele De Felice, Stephen G. Pulman Oxford University Computing Laboratory Coling 2008

Outline  Introduction  Classifier & Features  Corpus  Evaluation  Testing the model  Conclusions

Introduction Prepositions(at, by, for, from, in, of, on, to, and with) Determiners(a, the, and null) I study in Boston but I study at MIT. He is independent of his parents, but dependent on his son. Boys like sport. The boys like sport. she ate an apple. she ate the apple.

Outline  Introduction  Classifier & Features  Corpus  Evaluation  Testing the model  Conclusions

Classifier & Features maximum entropy classifier Classifiers

Classifier & Features Features(determiner) Pick the juiciest apple on the tree.

Classifier & Features Features(preposition) John drove to London.

Classifier & Features Baselines(Prepositions) Always choosing the most frequent option, namely of. Baselines(Determiners) Always choosing the most frequent option, namely null.

Outline  Introduction  Classifier & Features  Corpus  Evaluation  Testing the model  Conclusions

Corpus British National Corpus(BNC) Training Data BNC Testing Data A section of the BNC not used in training, section J.

Outline  Introduction  Classifier & Features  Corpus  Evaluation  Testing the model  Conclusions

Evaluation Prepositions

Evaluation Prepositions

Evaluation Prepositions

Evaluation Prepositions

Evaluation Determiners

Evaluation Determiners

Evaluation Determiners

Outline  Introduction  Classifier & Features  Corpus  Evaluation  Testing the model  Conclusions

Testing the model Corpus Cambridge Learner Corpus (CLC) Training Data Extracting 2523 instances of preposition use from the CLC. (1282 correct, 1241 incorrect)

Testing the model Prepositions

Testing the model System error discussions on Prepositions 1)Ungrammatical 2)Misspelled 3)Annotator's benchmark e.g. I received a beautiful present at my birthday. suggests correction: for annotators: on

Testing the model Determiners Instance typeAccuracy Correct92.2% Incorrect<10%

Testing the model System error discussions on Determiners The Lexical items which are not very frequently seen in the BNC. e.g. I saw it in internet. I booked it on Internet.

Outline  Introduction  Classifier & Features  Corpus  Evaluation  Testing the model  Conclusions

Conclusions Using contextual feature based approach to automatic identification and correction of preposition and determiner errors in L1, which achieve an accuracy of 70.06% and 92.15% respectively. Showing how it can be applied to an error correction task for L2 writing.