Building an annotated Corpus David Ling 2018-07-24
Contents Error annotation tool – UIUC 2010 Brat -- 2012 WAMP – NUS 2013 Conclusion – build our own tool
Error annotation tool http://cogcomp.org/page/tools_vi ew/17 Web-based PHP Sentence-based annotation Old version PHP many errors during implementation
BRAT http://brat.nlplab.org Common NLP annotating tool Free available online web-based, cgi Named entity recognition Part-of-speech Not for error annotation
WAMP Building a Large Annotated Corpus of Learner English, NUS 2013 In-house, Web-based Guidelines: select minimum span that is necessary to correct the error Functions: Essay browser Users system Annotation
Conclusions Develop our own tool for data collection TO DO Not difficult Implement and modify others also time consuming and difficult Taylor-made for our setting Compatible to our grammar checker/ assistant in the future TO DO Setup a PHP+ SQL database (working, http://10.244.0.191/annotation/) Annotation interface (nearly done)