Is Neural Machine Translation the New State of the Art?

Slides:



Advertisements
Similar presentations
© 2000 XTRA Translation Services Is MT technology available today ready to replace human translators?
Advertisements

Rationale for a multilingual corpus for machine translation evaluation Debbie Elliott Anthony Hartley Eric Atwell Corpus Linguistics 2003, Lancaster, England.
Yansong Feng and Mirella Lapata
Proceedings of the Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2007) Learning for Semantic Parsing Advisor: Hsin-His.
Rating Evaluation Methods through Correlation presented by Lena Marg, Language Tools MTE 2014, Workshop on Automatic and Manual Metrics for Operational.
MT Evaluation: Human Measures and Assessment Methods : Machine Translation Alon Lavie February 23, 2011.
Improving Machine Translation Quality via Hybrid Systems and Refined Evaluation Methods Andreas Eisele DFKI GmbH and Saarland University Helsinki, November.
Speech and Language Technologies in the Next Generation Localisation CSET Prof. Andy Way, School of Computing, DCU.
Machine Translation Prof. Alexandros Potamianos Dept. of Electrical & Computer Engineering Technical University of Crete, Greece May 2003.
Course Summary LING 575 Fei Xia 03/06/07. Outline Introduction to MT: 1 Major approaches –SMT: 3 –Transfer-based MT: 2 –Hybrid systems: 2 Other topics.
Does Syntactic Knowledge help English- Hindi SMT ? Avinesh. PVS. K. Taraka Rama, Karthik Gali.
Linda Mitchell Evaluating Community Post-Editing - Bridging the Gap between Translation Studies and Social Informatics Linda Mitchell PhD student.
© 2014 The MITRE Corporation. All rights reserved. Stacey Bailey and Keith Miller On the Value of Machine Translation Adaptation LREC Workshop: Automatic.
Search is not only about the Web An Overview on Printed Documents Search and Patent Search Walid Magdy Centre for Next Generation Localisation School of.
Automating Translation in the Localisation Factory An Investigation of Post-Editing Effort Sharon O’Brien Dublin City University.
Machine Translation Dr. Radhika Mamidi. What is Machine Translation? A sub-field of computational linguistics It investigates the use of computer software.
Blending SEO with Localisation Andy Atkins-Krüger.
Statistical Machine Translation Part III – Phrase-based SMT / Decoding Alex Fraser Institute for Natural Language Processing University of Stuttgart
English-Persian SMT Reza Saeedi 1 WTLAB Wednesday, May 25, 2011.
Evaluation of the Statistical Machine Translation Service for Croatian-English Marija Brkić Department of Informatics, University of Rijeka
Can Controlled Language Rules increase the value of MT? Fred Hollowood & Johann Rotourier Symantec Dublin.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
How do Humans Evaluate Machine Translation? Francisco Guzmán, Ahmed Abdelali, Irina Temnikova, Hassan Sajjad, Stephan Vogel.
Automatic Post-editing (pilot) Task Rajen Chatterjee, Matteo Negri and Marco Turchi Fondazione Bruno Kessler [ chatterjee | negri | turchi
The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in.
Chinese Word Segmentation Adaptation for Statistical Machine Translation Hailong Cao, Masao Utiyama and Eiichiro Sumita Language Translation Group NICT&ATR.
LREC 2004, 26 May 2004, Lisbon 1 Multimodal Multilingual Resources in the Subtitling Process S.Piperidis, I.Demiros, P.Prokopidis, P.Vanroose, A. Hoethker,
Haitham Elmarakeby.  Speech recognition
MACHINE TRANSLATION PAPER 1 Daniel Montalvo, Chrysanthia Cheung-Lau, Jonny Wang CS159 Spring 2011.
Pastra and Saggion, EACL 2003 Colouring Summaries BLEU Katerina Pastra and Horacio Saggion Department of Computer Science, Natural Language Processing.
Review: Review: Translating without in-domain corpus: Machine translation post-editing with online learning techniques Antonio L. Lagarda, Daniel Ortiz-Martínez,
English-Hindi Neural machine translation and parallel corpus generation EKANSH GUPTA ROHIT GUPTA.
Arnar Thor Jensson Koji Iwano Sadaoki Furui Tokyo Institute of Technology Development of a Speech Recognition System For Icelandic Using Machine Translated.
A CASE STUDY OF GERMAN INTO ENGLISH BY MACHINE TRANSLATION: MOSES EVALUATED USING MOSES FOR MERE MORTALS. Roger Haycock 
Post-editing: A Research Perspective
New features in DDC applications
Centre for Translation Studies FACULTY OF ARTS
RECENT TRENDS IN SMT By M.Balamurugan, Phd Research Scholar,
Xiaolin Wang Andrew Finch Masao Utiyama Eiichiro Sumita
TYPES OF TRANSLATION.
How to teach translation technologies
Evaluating materials • When we use materials in class, we understand
Language Technologies Institute Carnegie Mellon University
Wu et. al., arXiv - sept 2016 Presenter: Lütfi Kerem Şenel
Monoligual Semantic Text Alignment and its Applications in Machine Translation Alon Lavie March 29, 2012.
Chapter 6. Data Collection in a Wizard-of-Oz Experiment in Reinforcement Learning for Adaptive Dialogue Systems by: Rieser & Lemon. Course: Autonomous.
8. Translation resources
Alexander Fraser CIS, LMU München Machine Translation
KantanNeural™ LQR Experiment
Joint Training for Pivot-based Neural Machine Translation
HOW TO MAKE MACHINE TRANSLATION WORK FOR YOU
Intelligent Information System Lab
Translating and the Computer London, 16 November 2017
Neural Lattice Search for Domain Adaptation in Machine Translation
PROFESSIONAL TRANSLATION IN A PRE-SINGULARITY WORLD
A hybrid intralinguistic subtitling tool Laura Cacheiro Quintas, UPVD
Terminology translation accuracy in SMT vs. NMT
The Story of the Translation Industry in 2022 in ten mini-chapters www
Eiji Aramaki* Sadao Kurohashi* * University of Tokyo
ONEs - OHT NMT Evaluation score
Machine translation markers in post-edited machine translation output
Memory-augmented Chinese-Uyghur Neural Machine Translation
Statistical Machine Translation Papers from COLING 2004
Machine Translation(MT)
Statistical vs. Neural Machine Translation: a Comparison of MTH and DeepL at Swiss Post’s Language service Lise Volkart – Pierrette Bouillon – Sabrina.
How to increase productivity without loss of quality
Idiap Research Institute University of Edinburgh
Neural Machine Translation
Sequence-to-Sequence Models
Meghan Dowling Teresa Lynn Andy Way
Presentation transcript:

Is Neural Machine Translation the New State of the Art? olhs Is Neural Machine Translation the New State of the Art? Sheila Castilho sheila.castilho@adaptcentre.ie Joss Moorkens Federico Gaspari Iacer Calixto John Tinsley Andy Way The ADAPT Centre is funded under the SFI Research Centres Programme (Grant 13/RC/2106) and is co-funded under the European Regional Development Fund.

MT and the hype Use cases Conclusion NMT for E-Commerce Outline MT and the hype Use cases NMT for E-Commerce NMT for Patents NMT for MOOC Conclusion

Great excitement and anticipation each new wave of MT olhs Great excitement and anticipation each new wave of MT NMT: “bridging the gap between human and MT”

The hype

And translators go… olhs

Us vs them “MT will steal translators' jobs” olhs The reaction “MT will steal translators' jobs” “translators will be merely post-editors of MT” “MT is a threat” Us vs them -Same as SMT came along Translators refusing to use MT vs translators being forced to use MT Problems for translations industry, clients, vendors and translators In Argentina, and in other parts of the world, some “futurologists” are stating that in about 5-10 years there will no longer be a need for translators and that parents should not send their children to university to study translation.

(Philipp Koehn, Omniscien Webinar 2017) olhs great discrepancy between the high expectation of what MT should accomplish and what it is actually able to deliver   (Philipp Koehn, Omniscien Webinar 2017)

But is NMT really that good? Use cases different domains different set of language pairs

NMT for E-Commerce Systems (Calixto et al. 2017): English into German olhs NMT for E-Commerce Systems (Calixto et al. 2017): (1) a PBSMT baseline model built with the Moses SMT Toolkit (2) a text-only NMTt model (3) a multi-modal NMT model (NMTm) English into German Data set: 24k parallel product titles + images Validation/test data: 480/444 tuples 18 German native speakers Ranking Translations from the 3 systems + product image Adequacy (Likert scale 1- All of it to 4- None of it) Source + translation + product image Test whether training an NMT system with access to the product images improved the output quality against two text-only the translation of user-generated product listings poses particular challenges: often ungrammatical can be difficult to interpret even by a native speaker of the language scattered keywords and/or phrases glued together typos

NMT for E-Commerce AEM: Adequacy Ranking olhs NMT for E-Commerce AEM: PBSMT outperforms both NMT models (BLEU, METEOR and chrF3) NMTm performs as well as PBSMT (TER) Adequacy NMTm performs as well as PBSMT Ranking PBSMT: 56.3% preferred system NMTm: 24.8% NMTt: 18.8% Likert scale – less is better 1- All of it 2- 3- 4- None of it

olhs NMT for Patents Compare the performance between the mature patent MT engines used in production with novel NMT Systems PBSMT (a combination of elements of phrase-based, syntactic, and rule-driven MT, along with automatic post-editing) NMT (baseline) English into Chinese Data set: ~1M sentence pairs chemical abstracts, ~350K chemical titles, ~12M general patent, and ~2K glossaries. 2 reviewers Ranking Error analysis Punctuation, part of speech, omission, addition, wrong terminology, literal translation, and word form. Titles are on average ~8 tokens Abstracts ~40

NMT for Patents AEM: Ranking olhs NMT for Patents AEM: SMT outperforms NMT for abstracts, NMT outperforms SMT for titles Ranking General: PBSMT 54% - NMT 39% Long sentences: PBSMT 58% - NMT 33% Short sentences: PBSMT 84% - NMT 8% Medium-length sentences: PBSMT 36% - NMT 57% Short: (<10 words) Medium: (11 and 39 words) Long: (>40 words)

NMT for Patents Error analysis SMT: sentence structure 35% (10% NMT) olhs NMT for Patents Error analysis SMT: sentence structure 35% (10% NMT) NMT: 37% omission (8% SMT) % segments with “no errors” SMT 25% NMT 2% Short: (<10 words) Medium: (11 and 39 words) Long: (>40 words)

olhs NMT for MOOCs Decide which system would provide better quality translations for the project domain Systems PBMST (Moses) NMT (baseline) English into German, Greek, Portuguese and Russian Data set: OFD : ~24M (DE), ~31M (EL), ~32 (PT), ~22(RU) In-domain : ~270K(DE), ~140K(EL), ~58K(PT), ~2M(RU) Ranking Post-editing Fluency and Adequacy (1-4 Likert scale) Error analysis: inflectional morphology, word order, omission, addition, and mistranslation

NMT for MOOCs AEM: Fluency and Adequacy NMT outperforms SMT in terms of BLEU and METEOR More PE for SMT Fluency and Adequacy NMT is preferred across all languages for Fluency Adequacy results a bit less consistent

NMT for MOOCs Post-editing Ranking Technical effort improved for DE, but marginally for other languages Temporal effort marginally improved Ranking NMT is preferred across all languages (DE 80%, EL 56%, PT 61% and RU 63%)

NMT results are really promising! But… olhs So… NMT is good, right? NMT results are really promising! But… human evaluations show that results are not yet so clear-cut NMT can sometimes be better but it is not “much better” all the time!

The hype around NMT must be treated cautiously olhs Conclusion Translation industry is eager for improved MT quality in order to minimise costs The hype around NMT must be treated cautiously Overselling a technology that is still in need of more research may cause negativity about MT “us vs them” “MT is a threat to human translators” 5 years to convice translators to use MT again!

Thank you! Questions? Sheila Castilho sheila.castilho@adaptcentre.ie