Terminology translation accuracy in SMT vs. NMT

Slides:



Advertisements
Similar presentations
© 2000 XTRA Translation Services Is MT technology available today ready to replace human translators?
Advertisements

Rationale for a multilingual corpus for machine translation evaluation Debbie Elliott Anthony Hartley Eric Atwell Corpus Linguistics 2003, Lancaster, England.
The Application of Machine Translation in CADAL Huang Chen, Chen Haiying Zhejiang University Libraries, Hangzhou, China
The Chinese Room: Understanding and Correcting Machine Translation This work has been supported by NSF Grants IIS Solution: The Chinese Room Conclusions.
Spelling Correction for Search Engine Queries Bruno Martins, Mario J. Silva In Proceedings of EsTAL-04, España for Natural Language Processing Presenter:
How to Use a Translation Memory Prof. Reima Al-Jarf King Saud University, Riyadh, Saudi Arabia Homepage:
Rating Evaluation Methods through Correlation presented by Lena Marg, Language Tools MTE 2014, Workshop on Automatic and Manual Metrics for Operational.
Post-Editing – Professional translation service redefined
TAP-ET: TRANSLATION ADEQUACY AND PREFERENCE EVALUATION TOOL Mark Przybocki, Kay Peterson, Sébastien Bronsart May LREC 2008 Marrakech, Morocco.
MT Evaluation: Human Measures and Assessment Methods : Machine Translation Alon Lavie February 23, 2011.
Machine Translation (Level 2) Anna Sågvall Hein GSLT Course, September 2004.
Machine Translation Anna Sågvall Hein Mösg F
UNDERSTANDING BILINGUAL TRANSLATION OF SPECIALIZED TEXTS.
MACHINE TRANSLATION TRANSLATION(5) LECTURE[1-1] Eman Baghlaf.
© 2014 The MITRE Corporation. All rights reserved. Stacey Bailey and Keith Miller On the Value of Machine Translation Adaptation LREC Workshop: Automatic.
(C) 2013 Logrus International Practical Visualization of ITS 2.0 Categories for Real World Localization Process Part of the Multilingual Web-LT Program.
Help communities share knowledge more effectively across the language barrier Automated Community Content Editing PorTal.
Machine Translation Dr. Radhika Mamidi. What is Machine Translation? A sub-field of computational linguistics It investigates the use of computer software.
Translation Studies 8. Research methods in Translation Studies Krisztina Károly, Spring, 2006 Sources: Károly, 2002; Klaudy, 2003.
Bilingual term extraction revisited: Comparing statistical and linguistic methods for a new pair of languages Špela Vintar Faculty of Arts Dept. of Translation.
Evaluation of the Statistical Machine Translation Service for Croatian-English Marija Brkić Department of Informatics, University of Rijeka
Kyoshiro SUGIYAMA, AHC-Lab., NAIST An Investigation of Machine Translation Evaluation Metrics in Cross-lingual Question Answering Kyoshiro Sugiyama, Masahiro.
Translation Technologies Računalne tehnologije za prevo đ enje dr. Špela Vintar Department of Translation Studies Faculty of Arts University of Ljubljana.
1 Technical & Business Writing (ENG-315) Muhammad Bilal Bashir UIIT, Rawalpindi.
Verifies quality of translated projects Measures resource performance Identifies areas for improvement & rework Evaluating linguists with our LANGUAGE.
Machine Translation (Level 2) Anna Sågvall Hein GSLT Course, January 2003.
Lesson 5 Prepared 2/20/11.  Check the spelling in a document  Check a document for grammatical errors  Translate text to and from other languages 
Problem 1: Word Segmentation whatdoesthisreferto.
 Programming - the process of creating computer programs.
SDL Trados Studio 2014 Getting Started. Components of a CAT Tool Translation Memory Terminology Management Alignment – transforming previously translated.
Error Analysis of Two Types of Grammar for the purpose of Automatic Rule Refinement Ariadna Font Llitjós, Katharina Probst, Jaime Carbonell Language Technologies.
Review: Review: Translating without in-domain corpus: Machine translation post-editing with online learning techniques Antonio L. Lagarda, Daniel Ortiz-Martínez,
Theory of Legal Translation Unit 1 Introduction. The theory of legal translation as a linguistic discipline  General theory of translation studies the.
Kadupitiya JCS Dr. Surangika Ranathunga Prof. Gihan Dias Department of Computer Science and Engineering University of Moratuwa Sri Lanka.
Arnar Thor Jensson Koji Iwano Sadaoki Furui Tokyo Institute of Technology Development of a Speech Recognition System For Icelandic Using Machine Translated.
Approaches to Grammar Ed McCorduck English 402--Grammar SUNY Cortland
Is Neural Machine Translation the New State of the Art?
Certification of Reusable Software Artifacts
Centre for Translation Studies FACULTY OF ARTS
Describing Syntax and Semantics
E303 Part II The Context of Language Research
^ Reviewer’s Workbench
A German Corpus for Similarity Detection
An experiment in teaching translation technologies
Monoligual Semantic Text Alignment and its Applications in Machine Translation Alon Lavie March 29, 2012.
Rule Induction for Classification Using
The ACCEPT Project Enabling machine translation for the emerging community content paradigm. Allowing citizens across the EU better access to communities.
Master of Translation An introduction to post-editing
8. Translation resources
Urdu-to-English Stat-XFER system for NIST MT Eval 2008
Joint Training for Pivot-based Neural Machine Translation
Design for Quality Design for Quality and Safety Design Improvement
Category-Based Pseudowords
Technical translation
Part of the Multilingual Web-LT Program
Yuri Pettinicchi Jeny Tony Philip
CS 430: Information Discovery
Barbara Gastel INASP Associate
On the Impact of Various Types of Noise on Neural Machine Translation
The CoNLL-2014 Shared Task on Grammatical Error Correction
English Language and Literature Specification B Food Anthology
About TeMpTations & Masks
Part of the Multilingual Web-LT Program
Statistical vs. Neural Machine Translation: a Comparison of MTH and DeepL at Swiss Post’s Language service Lise Volkart – Pierrette Bouillon – Sabrina.
Programming Languages, Preliminaries, History & Evolution
5. Presenting a scientific work
ENGLISH ACT STRATEGIES Strategy 1: Know the Test
Extracting Why Text Segment from Web Based on Grammar-gram
Neural Machine Translation
TECHNICAL REPORTS WRITING
Presentation transcript:

Terminology translation accuracy in SMT vs. NMT LREC 2018: MLP & MomenT Workshop, 12 May 2018 Špela Vintar, Dept. of Translation Studies, University of Ljubljana spela.vintar@ff.uni-lj.si , http://www.lojze.si/spela

Our aims Compare the quality of Google’s NMT vs. PBMT for English-Slovene and Slovene-English Domain-specific texts: Karstology Corpus Special focus on terminology translation automatic evaluation using an existing termbase human evaluation by domain expert LREC 2018: MLP & MomenT Workshop, 12 May 2018

Why terminology matters Professional translators spend up to 45% of their total working time researching terminology Terminology errors amount to over 70% of errors found in QA Guidelines for post-editors emphasize terminology consistency as one of the main problems of industry-used MT systems LREC 2018: MLP & MomenT Workshop, 12 May 2018 In professional translation environments, terminology research takes up to 45% of the total working time spent on translating a text, and according to a recent study by SDL terminology errors amount to over 70% of all errors found in the Quality Assurance (QA) process. Post-editing guidelines developed by organisations such as TAUS or SDL suggest that post-editors should pay particular attention to the consistency of terminology, because nearly all state-of-the-art MT systems still produce translations on a segment-by-segment basis and thus choose terms according to local contexts instead of entire texts. http://www.sdl.com/download/the-importance-of-terminology-management/71096/ https://www.taus.net/knowledgebase/index.php?title=Category:Post-edit http://www.sdl.com/download/introduction-to-machine-translation-and-postediting-paradigm-shift/58317/

The Karst Corpus & the Karst Termbase 15 abstracts and 5 articles from 2 scientific journals, Acta Geographica and Acta Carstologica Slovenica, fully bilingual Total size: 25,423 English, 18,985 Slovene All texts translated twice using Google’s PBMT and NMT models (via GT API) QUIKK termbase: karst landforms and processes, 81 fully populated concepts Google Translate is a general purpose MT system, so why test it on a domain- specific text? Karstology – at least for English-Slovene – is not as exotic as it may sound Lots of parallel data in both directions In many professional environments, on-the-fly domain adaptation is still not feasible LREC 2018: MLP & MomenT Workshop, 12 May 2018

Evaluation methods Automatic overall MT evaluation document-level BLEU and NIST Automatic evaluation of term translations linguistic pre-processing matching terms & equivalents from the QUIKK termbase Human evaluation of term translations 300 random term occurrences (both systems & both directions) manual evaluation by domain-expert using three categories: Correct: The system uses the right term equivalent, regardless of grammar errors. False: The system does not use the right equivalent. Partially correct multi- word term was considered wrong. Omitted: Original term skipped in translation. LREC 2018: MLP & MomenT Workshop, 12 May 2018

Automatic evaluation English-Slovene Slovene-English PBMT NMT BLEU NIST 18.50 3.59 22.49 3.85 22.53 4.24 25.43 4.35 LREC 2018: MLP & MomenT Workshop, 12 May 2018

Terms and equivalents matching the termbase For each source term found in the original we check whether the translation contains the equivalent Normalisation on both sides LREC 2018: MLP & MomenT Workshop, 12 May 2018 English-Slovene Slovene-English PBMT NMT Terms in original 538 680 Terms in translation 420 431 476 446

Human evaluation of term translations 500 random occurences for each system and language pair were checked by a domain expert Categories: Correct (even if the case and number were wrong) False (even if one part of a multi-word term was correct, or if the system used the correct expression but not for the domain) Omitted LREC 2018: MLP & MomenT Workshop, 12 May 2018 English-Slovene Slovene-English PBMT % NMT Correct 184 61.3 211 70.3 201 67 195 65 False 113 37.7 85 28.3 94 31.3 99 33 Omitted 3 1 4 1.3 5 1.7 6 2

A glance at errors En-Sl PBMT: NMT: untranslated term / term component epigenic aquifer → epigenic vodonosnik solution runnel → raztopina runnel wrong sense spring → vzmet Mlava Spring → Mlava pomlad NMT: out-of-the-blue translations cave diving → jalovo potapljanje coined words ajerno, nekarska, glacijacija LREC 2018: MLP & MomenT Workshop, 12 May 2018

A glance at errors Sl-En PBMT: NMT: untranslated term / term component nepaleokraške kamnine→ nepaleokraške rocks grammatical but non-terminological translation brezstropa jama → roofless cave (denuded cave) udornica → hollow / precipice / collapsed / sinkhole (collapse doline) NMT: out-of-the-blue translations vrtača → crop rotation (sinkhole) zakraselost → naivety (karstification) melioracija → reclamation (melioration) unsuccessful attempts at proper names inconsistencies: udornica → collapse / udder / cliff / collision / burrow / groove LREC 2018: MLP & MomenT Workshop, 12 May 2018

Conclusions Measured with BLEU/NIST, Google’s NMT outperforms PBMT for En-Sl and Sl-En Translations of domain-specific terminology are not significantly improved in NMT On-the-fly domain adaptation may not be available in many end- user environments Need for post-processing methods LREC 2018: MLP & MomenT Workshop, 12 May 2018