KantanNeural™ LQR Experiment

Slides:



Advertisements
Similar presentations
Rationale for a multilingual corpus for machine translation evaluation Debbie Elliott Anthony Hartley Eric Atwell Corpus Linguistics 2003, Lancaster, England.
Advertisements

English only VS. L1 support Looking at English language acquisition of native Spanish speaking children.
Help communities share knowledge more effectively across the language barrier Automated Community Content Editing PorTal.
Improving Machine Translation Quality via Hybrid Systems and Refined Evaluation Methods Andreas Eisele DFKI GmbH and Saarland University Helsinki, November.
Figure 1. SAT-9 Reading Percentile Scores from Sample Schools with Mostly Native English-Speaking Students (
Chinese Word Segmentation Method for Domain-Special Machine Translation Su Chen; Zhang Yujie; Guo Zhen; Xu Jin’an Beijing Jiaotong University.
Unsupervised Turkish Morphological Segmentation for Statistical Machine Translation Coskun Mermer and Murat Saraclar Workshop on Machine Translation and.
1 Language Model Adaptation in Machine Translation from Speech Ivan Bulyko, Spyros Matsoukas, Richard Schwartz, Long Nguyen, and John Makhoul.
LEGAL BASIS OF BILINGUAL TEACHING IN POLAND. Bilingual teaching is conducted in Poland since the school year 1991/1992. Originally, bilingual classes.
© 2014 The MITRE Corporation. All rights reserved. Stacey Bailey and Keith Miller On the Value of Machine Translation Adaptation LREC Workshop: Automatic.
A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora Benjamin Arai Computer Science and Engineering Department.
Tapta4IPC: helping translation of IPC definitions Bruno Pouliquen 25 feb 2013, IPC workshop Translation.
Teaching English to Korean Students Understanding Their Particular Problems.
Help communities share knowledge more effectively across the language barrier Automated Community Content Editing PorTal.
Achieving Domain Specificity in SMT without Over Siloing William Lewis, Chris Wendt, David Bullock Microsoft Research Machine Translation.
By: Mitchell Kattengell and Ryan Anderson DecisionTools Suite Overview 100% within excel – supports Excel 2000 and higher. Support parallel processing-
An Integrated Approach for Arabic-English Named Entity Translation Hany Hassan IBM Cairo Technology Development Center Jeffrey Sorensen IBM T.J. Watson.
Advanced Signal Processing 05/06 Reinisch Bernhard Statistical Machine Translation Phrase Based Model.
Kyoshiro SUGIYAMA, AHC-Lab., NAIST An Investigation of Machine Translation Evaluation Metrics in Cross-lingual Question Answering Kyoshiro Sugiyama, Masahiro.
The worldwide language services market 2.33% Revenue from post-edited machine translation 53.5% Consumers using free MT on the web Copyright © 2011 by.
You can Learn a language in sixty hours Improve your employability Challenge yourself.
What you have learned and how you can use it : Grammars and Lexicons Parts I-III.
For Wednesday No reading Homework –Chapter 23, exercise 15 –Process: 1.Create 5 sentences 2.Select a language 3.Translate each sentence into that language.
Chinese Word Segmentation Adaptation for Statistical Machine Translation Hailong Cao, Masao Utiyama and Eiichiro Sumita Language Translation Group NICT&ATR.
Students will be able to: Translate between words and algebra; Evaluate algebraic expressions and use exponents. Objectives 1.1.
Addressing the Rare Word Problem in Neural Machine Translation
Information Transfer through Online Summarizing and Translation Technology Sanja Seljan*, Ksenija Klasnić**, Mara Stojanac*, Barbara Pešorda*, Nives Mikelić.
Capitalization DAY 1 COMPLETE SLIDES 1-8
MACHINE TRANSLATION PAPER 1 Daniel Montalvo, Chrysanthia Cheung-Lau, Jonny Wang CS159 Spring 2011.
Impact of automated translation on mining knowledge from text data , Brno Luděk Svozil.
Statistical Machine Translation Part II: Word Alignments and EM Alex Fraser Institute for Natural Language Processing University of Stuttgart
Review: Review: Translating without in-domain corpus: Machine translation post-editing with online learning techniques Antonio L. Lagarda, Daniel Ortiz-Martínez,
English-Hindi Neural machine translation and parallel corpus generation EKANSH GUPTA ROHIT GUPTA.
Find International Driving Document Translator Online
How to use the Internet to Speak and Write in Italian 1.Write simple sentences in English. 2.Identify the verb 3.Translate the verb correctlyhttp://dictionary.reverso.net/english-
English-Lithuanian-English Lexicon Database Management System for MT Gintaras Barisevicius and Elvinas Cernys Kaunas University of Technology, Department.
Most Professional Translation Services provider in USA.
Acquisition of Character Translation Rules for Supporting SNOMED CT Localizations Jose Antonio Miñarro-Giménez a Johannes Hellrich b Stefan Schulz a a.
Arnar Thor Jensson Koji Iwano Sadaoki Furui Tokyo Institute of Technology Development of a Speech Recognition System For Icelandic Using Machine Translated.
Is Neural Machine Translation the New State of the Art?
A CASE STUDY OF GERMAN INTO ENGLISH BY MACHINE TRANSLATION: MOSES EVALUATED USING MOSES FOR MERE MORTALS. Roger Haycock 
Neural Machine Translation
Centre for Translation Studies FACULTY OF ARTS
Statistical Machine Translation Part II: Word Alignments and EM
RECENT TRENDS IN SMT By M.Balamurugan, Phd Research Scholar,
Wu et. al., arXiv - sept 2016 Presenter: Lütfi Kerem Şenel
The ACCEPT Project Enabling machine translation for the emerging community content paradigm. Allowing citizens across the EU better access to communities.
Neural Machine Translation by Jointly Learning to Align and Translate
Retrospective 2017 & Future Plans
Joint Training for Pivot-based Neural Machine Translation
HOW TO MAKE MACHINE TRANSLATION WORK FOR YOU
Neural Lattice Search for Domain Adaptation in Machine Translation
Statistical Machine Translation Part III – Phrase-based SMT / Decoding
PROFESSIONAL TRANSLATION IN A PRE-SINGULARITY WORLD
Triangular Architecture for Rare Language Translation
Yuri Pettinicchi Jeny Tony Philip
Eiji Aramaki* Sadao Kurohashi* * University of Tokyo
ONEs - OHT NMT Evaluation score
On the Impact of Various Types of Noise on Neural Machine Translation
Memory-augmented Chinese-Uyghur Neural Machine Translation
Machine Translation(MT)
Statistical vs. Neural Machine Translation: a Comparison of MTH and DeepL at Swiss Post’s Language service Lise Volkart – Pierrette Bouillon – Sabrina.
Domain Mixing for Chinese-English Neural Machine Translation
How do Professional Post-Editors Perform?
Improving IBM Word-Alignment Model 1(Robert C. MOORE)
COUNTRIES NATIONALITIES LANGUAGES.
Idiap Research Institute University of Edinburgh
Statistical Machine Translation Part VI – Phrase-based Decoding
Presented By: Sparsh Gupta Anmol Popli Hammad Abdullah Ayyubi
Presentation transcript:

KantanNeural™ LQR Experiment Solving KantanNeural™ LQR Experiment

Evolution of Machine Translation 2016 Neural MT Quality Statistical MT 2002 Rule-Based 1970

Phrase-Based Statistical MT 1970 2002 2016

Neural MT – The emergence of AI 1970 2002 2016

Scientific Rigour Experiment Setup Identical Training Data Sets Identical Test Reference Sets Automated Scores Used: F-Measure, TER, BLEU Native Speaking, Professional Reviewers NMT – KantanNeural™ – GPU Processors SMT – KantanMT – CPU Processors Translation Evaluation – KantanLQR™

Training Corpora Language Arc Parallel Sentences TWC UWC Domain(s) English->German 8,820,562 110,150,238 859,167 Legal/Medical English->Chinese(Simplified) 6,522,064 84,426,931 956,864 Legal/Technical English->Japanese 8,545,366 87,252,129 676,244 English->Italian 2,756,185 35,295,535 765,930 Medical English->Spanish 3,681,332 44,917,538 952,089 Legal

Training : Time SMT NMT Language Arc Time (Hours) English->German 18 92 English->Chinese(Simplified) 6 10 English->Japanese 9 68 English->Italian 8 83 English->Spanish 71

Training : Automated Scores SMT NMT Language Arc F-Measure BLUE TER Time Perplexity English->German 62.00 54.08 54.31 18 62.53 47.53 53.41 3.02 92 English->Chinese(Simplified) 77.16 45.36 46.85 6 71.85 39.39 47.01 2.00 10 English->Japanese 80.04 63.27 43.77 9 69.51 40.55 49.46 1.89 68 English->Italian 69.74 56.98 42.54 8 64.88 42.00 48.73 2.70 83 English->Spanish 71.53 54.78 41.87 69.41 49.24 44.89 2.59 71

Training : Automated Scores SMT NMT Language Arc F-Measure BLUE TER Time Perplexity English->German 62.00 54.08 54.31 18 62.53 47.53 53.41 3.02 92 English->Chinese(Simplified) 77.16 45.36 46.85 6 71.85 39.39 47.01 2.00 10 English->Japanese 80.04 63.27 43.77 9 69.51 40.55 49.46 1.89 68 English->Italian 69.74 56.98 42.54 8 64.88 42.00 48.73 2.70 83 English->Spanish 71.53 54.78 41.87 69.41 49.24 44.89 2.59 71

Training : Automated Scores SMT NMT Language Arc F-Measure BLUE TER Time Perplexity English->German 62.00 54.08 54.31 18 62.53 47.53 53.41 3.02 92 English->Chinese(Simplified) 77.16 45.36 46.85 6 71.85 39.39 47.01 2.00 10 English->Japanese 80.04 63.27 43.77 9 69.51 40.55 49.46 1.89 68 English->Italian 69.74 56.98 42.54 8 64.88 42.00 48.73 2.70 83 English->Spanish 71.53 54.78 41.87 69.41 49.24 44.89 2.59 71

Training : Automated Scores SMT NMT Language Arc F-Measure BLUE TER Time Perplexity English->German 62.00 54.08 54.31 18 62.53 47.53 53.41 3.02 92 English->Chinese(Simplified) 77.16 45.36 46.85 6 71.85 39.39 47.01 2.00 10 English->Japanese 80.04 63.27 43.77 9 69.51 40.55 49.46 1.89 68 English->Italian 69.74 56.98 42.54 8 64.88 42.00 48.73 2.70 83 English->Spanish 71.53 54.78 41.87 69.41 49.24 44.89 2.59 71

Professional Reviewers

Ranking

Ranking

Ranking

Conclusions Comparative study of identical SMT and NMT Engines Commercial Setting Identical Training Scenarios Compared Automated Scores Conducted A/B Testing Analysed Results

Conclusions Translation Quality Automated Scores As determined by native speaking, professional translators In all cases NMT ranked higher than SMT We observed that 48% of lower scoring BLUE NMT translations when ranked higher than their higher BLUE scoring SMT counterparts Automated Scores BLEU: Susceptible to under-estimating of NMT quality

Solving Thank You