Correcting Misuse of Verb Forms John Lee, Stephanie Seneff Computer Science and Artificial Intelligence Laboratory, MIT, Cambridge ACL 2008.


1 Correcting Misuse of Verb Forms John Lee, Stephanie Seneff Computer Science and Artificial Intelligence Laboratory, MIT, Cambridge ACL 2008

2 Outline • Introduction • Background • System • Baselines • Data • Evaluation • Conclusions

3 Introduction

4

5

6 Outline • Introduction • Background • System • Baselines • Data • Evaluation • Conclusions

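7 Background
The goal is to correct confusions among the five English verb forms (base, third person singular, past, -ing participle, -ed participle), as well as the infinitive. These confusions arise from semantic and syntactic errors.
Semantic errors: the chosen form is grammatical but conveys a different meaning. For example, one wants to say “I am prepared for the exam” but writes “I am preparing for the exam”.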

8 Background
Syntactic errors:
Subject-verb agreement: He *have been living there since June.
Auxiliary agreement: He has been *live there since June.
Complementation: He wants *live there.

9 Outline • Introduction • Background • System • Baselines • Data • Evaluation • Conclusions

10 System Step 1: Automatic parsing. Example input: “My father is *work in the laboratory.”

11 System Step 2: Replacing the verb forms

12 System

13 Step 3: N-gram counts as a filter, using the WEB 1T N-GRAM corpus prepared by Google Inc.
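Steps 2 and 3 can be illustrated with a small Python sketch. It is not the authors' implementation: the real system works on parse trees and a web-scale n-gram corpus, whereas this toy version swaps surface forms in a token list and looks up trigram counts in a hand-made dictionary. The names `VERB_FORMS`, `NGRAM_COUNTS`, `candidates`, and `filter_by_ngram` are hypothetical, the counts are invented, and the acceptance rule (keep a candidate only if its count exceeds that of the original) is an assumption.

```python
from typing import Dict, List, Tuple

# Hypothetical toy inflection table: lemma -> distinct surface forms.
VERB_FORMS: Dict[str, List[str]] = {
    "work": ["work", "works", "worked", "working"],
}

# Invented trigram counts standing in for a large n-gram corpus.
NGRAM_COUNTS: Dict[Tuple[str, str, str], int] = {
    ("is", "working", "in"): 1200,
    ("is", "worked", "in"): 40,
    ("is", "work", "in"): 15,
}


def candidates(tokens: List[str], index: int, lemma: str) -> List[List[str]]:
    """Step 2: copies of the sentence with the verb at `index` replaced by each other form."""
    return [
        tokens[:index] + [form] + tokens[index + 1:]
        for form in VERB_FORMS[lemma]
        if form != tokens[index]
    ]


def trigram_count(tokens: List[str], index: int) -> int:
    """Count of the trigram centred on position `index` (0 if unseen)."""
    left = tokens[index - 1] if index > 0 else "<s>"
    right = tokens[index + 1] if index + 1 < len(tokens) else "</s>"
    return NGRAM_COUNTS.get((left, tokens[index], right), 0)


def filter_by_ngram(tokens: List[str], index: int, lemma: str) -> List[List[str]]:
    """Step 3 (assumed rule): keep candidates whose trigram is more frequent than the original's."""
    baseline = trigram_count(tokens, index)
    scored = [(trigram_count(c, index), c) for c in candidates(tokens, index, lemma)]
    kept = [(n, c) for n, c in scored if n > baseline]
    kept.sort(key=lambda pair: pair[0], reverse=True)  # most frequent first
    return [c for _, c in kept]


sentence = "My father is work in the laboratory .".split()
best = filter_by_ngram(sentence, 3, "work")
print(" ".join(best[0]))
```

With these invented counts, the candidate containing "works" is discarded because its trigram is unseen, and the remaining candidates are ranked by frequency.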

14 Outline • Introduction • Background • System • Baselines • Data • Evaluation • Conclusions

15 Baselines
Majority baseline: makes no correction.
Verb-only baseline (used only for auxiliary agreement and complementation): attempts a correction only when the word in question is actually tagged as a verb.

16 Outline • Introduction • Background • System • Baselines • Data • Evaluation • Conclusions

17 Data
Development data: AQUAINT Corpus (English news text).
Evaluation data: JLE (Japanese Learners of English) corpus. 167 of the transcribed interviews are used, totalling 15,637 sentences.
Test set: 477 sentences (3.1%) contain subject-verb agreement errors, and 238 (1.5%) contain auxiliary agreement or complementation errors.

18 Data
Evaluation data: HKUST (Hong Kong University of Science and Technology) corpus, containing a total of 2,556 sentences.

19 Data
Evaluation metrics (pos = positive, neg = negative; an inv pos, or invalid positive, counts toward detection but not toward correction):
Accuracy = (true neg + true pos) / total number of sentences
Recall = true pos / (true pos + false neg + inv pos)
Detection precision = (true pos + inv pos) / (true pos + inv pos + false pos)
Correction precision = true pos / (true pos + false pos + inv pos)
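The four metrics translate directly into code. The sketch below is a plain transcription of the formulas on this slide; the `Counts` container and the example counts are made up for illustration and are not results from the paper. Zero denominators are not handled.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class Counts:
    """Sentence-level outcome counts (hypothetical container, not from the paper)."""
    true_pos: int   # error present, valid correction proposed
    inv_pos: int    # error present and detected, but the correction is invalid
    false_pos: int  # no error present, yet a correction was proposed
    false_neg: int  # error present, no correction proposed
    true_neg: int   # no error present, no correction proposed


def metrics(c: Counts) -> Dict[str, float]:
    """Compute the four metrics exactly as defined on the slide."""
    total = c.true_pos + c.inv_pos + c.false_pos + c.false_neg + c.true_neg
    return {
        "accuracy": (c.true_neg + c.true_pos) / total,
        "recall": c.true_pos / (c.true_pos + c.false_neg + c.inv_pos),
        "detection_precision": (c.true_pos + c.inv_pos)
        / (c.true_pos + c.inv_pos + c.false_pos),
        "correction_precision": c.true_pos
        / (c.true_pos + c.false_pos + c.inv_pos),
    }


# Invented counts for a 1,000-sentence test set.
example = Counts(true_pos=40, inv_pos=10, false_pos=10, false_neg=30, true_neg=910)
result = metrics(example)
for name, value in result.items():
    print(f"{name}: {value:.2%}")
```

Note that detection precision can never be lower than correction precision, since the two share a denominator and the former adds the invalid positives to the numerator.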

20 Outline • Introduction • Background • System • Baselines • Data • Evaluation • Conclusions

21 Evaluation
JLE results for subject-verb agreement:
Corpus  Method     Accuracy  Precision (correction)  Precision (detection)  Recall
JLE     all        98.93%    81.61%                  83.93%                 80.92%
JLE     majority   96.95%    -                       -                      -
JLE results for auxiliary agreement and complementation:
Corpus  Method     Accuracy  Precision (correction)  Precision (detection)  Recall
JLE     all        98.94%    68.00%                  80.67%                 42.86%
JLE     verb-only  98.85%    71.43%                  84.75%                 31.51%
JLE     majority   98.47%    -                       -                      -

22 Evaluation
HKUST results for auxiliary agreement and complementation:
Two native speakers of English were given the edited sentences together with the original input. For each pair, they chose one of four statements: the edited sentence is better, the original is better, both are equally correct, or both are equally incorrect.
Corpus: HKUST; Method: all; Accuracy: not available; Precision: 71.71%; Recall: not available
Kappa: 0.76
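The slide does not say which kappa statistic was used. Assuming it is Cohen's kappa for two raters, agreement over the four answer categories could be computed as in the sketch below. The function name `cohens_kappa`, the category labels, and the ratings are all invented for illustration; they are not the study's data.

```python
from collections import Counter
from typing import Hashable, Sequence


def cohens_kappa(rater_a: Sequence[Hashable], rater_b: Sequence[Hashable]) -> float:
    """Cohen's kappa: (observed agreement - chance agreement) / (1 - chance agreement)."""
    if len(rater_a) != len(rater_b) or len(rater_a) == 0:
        raise ValueError("both raters must label the same non-empty set of items")
    n = len(rater_a)
    # Fraction of items on which the two raters gave the same label.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement from each rater's marginal label distribution.
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    expected = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used one identical label throughout
    return (observed - expected) / (1.0 - expected)


# Invented judgments: E = edited better, O = original better,
# C = both equally correct, I = both equally incorrect.
rater_a = ["E", "E", "E", "O", "C", "I", "E", "C"]
rater_b = ["E", "E", "O", "O", "C", "I", "E", "E"]
kappa = cohens_kappa(rater_a, rater_b)
print(f"kappa = {kappa:.2f}")
```

Kappa discounts the agreement that two raters would reach by chance, which is why it is usually reported instead of the raw agreement rate.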

23 Evaluation

24 Outline • Introduction • Background • System • Baselines • Data • Evaluation • Conclusions

25 Conclusions • This paper proposes a method to correct English verb form errors made by non-native speakers. • It investigates the ways in which verb form errors affect parse trees.

