Post-editing: how to future-proof your career in translation Paulo Camargo, PhD. Owner, Terminologist BLC - Brazilian Localization.

Slides:



Advertisements
Similar presentations
Translation in the 21 st Century Impacts of MT and social media on language services.
Advertisements

Global Surveying via the Web: Better, Faster, Cheaper... and More Chaotic Presented by L. Allen Slade and Nicholas Mills at the Society for Industrial.
© 2000 XTRA Translation Services Is MT technology available today ready to replace human translators?
PROJECT MANAGER (PM) 1. Lead and deliver web application development project within time and budget 2. Understand multiple discipline and able to communicate.
Catalyst Preview Enda McDonnell Alchemy User Conference London 2012 London Science Museum 31 May 2012.
Machine Translation The Translator s Choice Heidi Düchting Sylke Krämer Johann Roturier.
Chapter 1 Business Driven Technology
Usage of the memoQ web service API by LSP – a case study
How to Use a Translation Memory Prof. Reima Al-Jarf King Saud University, Riyadh, Saudi Arabia Homepage:
Company Presentation EssayWriters.net. Who we are We are a professional online writing company that has been operating on the market for over 10 years.
Quality Standards, Tools and Metrics March 31, 2013 OmarAly & NohaSayed.
Post-Editing – Professional translation service redefined
A Syntactic Translation Memory Vincent Vandeghinste Centre for Computational Linguistics K.U.Leuven
Carlos S. C. Teixeira Intercultural Studies Group Universitat Rovira i Virgili (Tarragona, Spain) Knowledge of provenance.
The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in.
THE TRANSLATION NETWORK Overview  Easily manage your multilingual sites  Synchronize content and manage changes  Translate content on the fly  Use.
The SDL Translation Forum: San Francisco Thursday 8th June 2006 Helping Organizations Upgrade their Memory through CCM Methodology Toby Newman Worldwide.
Computer Assisted Translation CAT Alexander C. Wu Fall 2004.
Web Localization Online Document Translation Source: Fernando Delgado.
An innovative platform to allow translation and indexing of internet sites Localization World
MACHINE TRANSLATION TRANSLATION(5) LECTURE[1-1] Eman Baghlaf.
Funded under the EU ICT Policy Support Programme Automated Solutions for Patent Translation John Tinsley Project PLuTO WIPO Symposium of.
Linda Mitchell Evaluating Community Post-Editing - Bridging the Gap between Translation Studies and Social Informatics Linda Mitchell PhD student.
Sharon Burton Product Manager/Product Evangelist MadCap Software
IT Job Roles Task 20. Software Engineer Job Description Software engineers are responsible for creating and maintaining software of various different.
MACHINE TRANSLATION A precious key to communicate beyond linguistic barriers 1.
1. Learning Outcomes At the end of this lecture, you should be able to: –Define the term “Usability Engineering” –Describe the various steps involved.
Help communities share knowledge more effectively across the language barrier Automated Community Content Editing PorTal.
Translation & Interpretation as a Profession Presented by CHICATA The Chicago Area Translators & Interpreters Association.
Computers Are Your Future Eleventh Edition Chapter 13: Systems Analysis & Design Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall1.
Streamlining the Review Cycle Michael Oettli, nlg GmbH Santa Clara, October 10 th.
By Anthony W. Hill & Course Technology1 Common End User Problems.
Computer-Based Training Methods
The LSPs and Machine Translation: Why Not Treat MT as TM? David Canek, MemSource Technologies Torben Dahl Jensen, Oversætterhuset.
Localizing Prestashop eCommerce Site with Wordfast
Carlos S. C. Teixeira Universitat Rovira i Virgili Knowledge of provenance: How does it affect TM/MT integration? New Research in Translation and Interpreting.
Silke Gutermuth & Silvia Hansen-Schirra University of Mainz Germany Post-editing machine translation – a usability test for professional translation settings.
Can Controlled Language Rules increase the value of MT? Fred Hollowood & Johann Rotourier Symantec Dublin.
Sofia Garcia/Roberto Silva Tutorial Workshop, GrenobleDate: 31/Jan/2007 The work of a professional translator and the translation agency V1.0.
Translingual Europe Panel Discussion Competition of MT Paradigms Where is RBMT? Translingual Europe Prague May 13, 2009 Daniel Grasmick.
Tenaris is the leading global manufacturer and supplier of tubular products and services used in the drilling, completion and production of oil and gas.
Case Study Summary Link Translation entered a partner agreement with Autodesk to provide translation solutions integrating human and machine translation.
Career Profile: Systems Analyst Jenn Sroka. Is a Career as a Systems Analyst right for you? Duties include: Planning, design, installation, and development.
The worldwide language services market 2.33% Revenue from post-edited machine translation 53.5% Consumers using free MT on the web Copyright © 2011 by.
Translation Memory System (TMS)1 Translation Memory Systems Presentation by1 Melina Takanen & Julianna Ekert CAT Prof. Thorsten Trippel University.
The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in.
Machine Translation (Level 2) Anna Sågvall Hein GSLT Course, January 2003.
© Intercultural Studies Group Universitat Rovira i Virgili Translators’ skill sets in a machine-translation age Anthony Pym.
1 Machine Assisted Human Translation (MAHT) (…aka “Translation Memory” or “CAT tool”) …and what it does for the translator…
Link Translation provides training and practical experience on industry- standard Computer Assisted Translation (CAT) tools for our team of linguists.
Keeping up with translation technologies: a call for experimental pedagogies Anthony Pym.
Oman College of Management and Technology Course – MM Topic 7 Production and Distribution of Multimedia Titles CS/MIS Department.
Axis AI Solves Challenges of Complex Data Extraction and Document Classification through Advanced Natural Language Processing and Machine Learning MICROSOFT.
Text INTERNAL February 11, 2011 Problem Solving. INTERNAL Tech Republic’s railway department wants a solution Tech Republic’s railway department.
A Simple English-to-Punjabi Translation System By : Shailendra Singh.
Review: Review: Translating without in-domain corpus: Machine translation post-editing with online learning techniques Antonio L. Lagarda, Daniel Ortiz-Martínez,
Mind the Gap. UK translation company certified to BS EN Size:92nd in the world in 2011, today 75th (turnover) Staff: 64 (Southampton, London, Stockholm)
TRANSLATION & LOCALIZATION SERVICES Certified Provider Providing Local Translations To Our Global Partners.
#APMP2016. Submitting proposals in more than one language: a survival guide Considering language and translation as a key component of your value proposition.
How to teach what you don’t know: a call for pedagogical experiments Anthony Pym.
Human Computer Interaction Lecture 21 User Support
Centre for Translation Studies FACULTY OF ARTS
How to teach translation technologies
An experiment in teaching translation technologies
Human Computer Interaction Lecture 21,22 User Support
Tools of Software Development
The Translation Management System for Global Enterprises
Translation — Industry and Career Information
User’s Perspective Laurie Gerber.
OU BATTLECARD: Business Intelligence
Presentation transcript:

Post-editing: how to future-proof your career in translation Paulo Camargo, PhD. Owner, Terminologist BLC - Brazilian Localization Company Web:

Purpose of this presentation Promote the adoption of Machine Translation (MT) and Post-editing (PE) –How we can work faster, better, and make more money Target-audience: –Novice, experienced, and advanced free- lance translators –Small LSPs and in-house translators

Perspectives: novice translator Introduce PE as a new profession –Background information –Current adoption of PE –PE productivity/compensation Explore availability of PE training –Why a translator need PE training –What are the required skills –PE certifications available: TAUS, and SDL

Perspectives: experienced/advanced Use MT output as translation aid -Research shows MT increases productivity -Translators prefer MT instead of unaided -GT, SDL Cloud, MS Hub, Systran Advanced: combine MT / term. manag. -Term extraction/customization of MT -Generation/PE of MT output ↑ productivity -Replace combined TM/on-line TM servers?

Perspectives: small LSP Large/medium LSPs use MT (> decade) –Small LSPs: need to catch-up How to get started with low budget -MT developments: ↓ need of specialized IT -Key-resource: in-house translator -Terminology management -Customizable MT -Preliminary analysis/PE Guidelines

Definition of post-editing (TAUS) Post-editing: “the correction of machine-generated translation output to ensure it meets a level of quality negotiated in advance between client and post-editor”. “Post-editing seeks the minimum steps required for an acceptable text”

Background information PE reality: driven by advances in MT –Hybrid MT: rule-based / statistic-based –Rule-based: dictionaries, rules; e.g Systran –Statistic-based: training data (TM); e.g GT Pre-editing –Customization: glossary, training data –Preliminary analysis: language rules, client rules, example card

Preliminary analysis (Rico, 2011) After engine customization –Select MT samples –Check term consistency/accuracy –Check for recurrent MT errors Draw guidelines (quality acceptance) –Quality/errors to expect / how to proceed –Language independent/dependent rules –Feedback (glossary update, errors report)

Guidelines for PE (Rico, 2011) Language independent rules –Fix terminology, syntactic, morphology –Fix misspelling, punctuation, omissions –Edit offensive/inappropriate text Language dependent rules –Language specific examples: –Example card: expected errors/how to fix

Custom MT: what to expect (O’Brien, 2002) Custom MT: high-level MT output –Most segments = 85% TM fuzzy –Some better than 100% TM match (review) –Some bad translations: retranslate Translator: critical to MT success Need human assessment: always! “Not only will MTPE not replace the translator but it also will not happen without the translator”

Full MT post-editing (Dillinger, 2004) Goal: human-quality output Most frequent: higher visibility texts Quality expectations: high (TEP) Grammar, syntactic, semantic correct Stylistically appropriate Productivity expected: 4K – 10 K w/day

Current adoption of post-editing Common Sense Advisory report (2012) –Freelance: 21.7% (15.4% plan) –Small LSP: 32.5% (22.6 % plan) –Large LSP: 72.0% (28.0% plan) ** ALC report (2015) –Small LSP: 20.0% (USA), 25.0% (Europe) Lionbridge (Marciano, 2015) –Apply 30% projects (goal 50%), 60M, 2014

Post-editing productivity data Post-editing productivity (O’Brien, 2006) –Equal/higher than editing TM High Fuzzy –Typical: 4K to 10K words/day –Proficiency: 100K w (1 month full-time PE) Other productivity data – Full PE: 5K–8K w/day (DePalma, 2011)

PE compensation: follow TM fuzzy TM fuzzy matches (Guerberof, 2013) – 60-66% of full TR rate for 75%-94% match MT full post-editing –70-50% of rate (Guerberof, 2013) –65-68% of rate (Marciano, 2015) –Smaller companies: prefer to pay per hour

Proposal for PE training (O’Brien, 2002) PE: what do TRs think about? –Dislike for correcting repetitive errors –Fear of losing proficiency (poor MT output) –Dislike for limited freedom of expression Why do TRs need PE training? –≠ skills: 2 source texts –Quality requirements, different error types –Qualified translator ≠ successful post-editor

What skills does a post-editor need? Same as the translator (O’Brien, 2002) -Expert in subject area and target language -Excellent knowledge of source language -Word-processing (WP) skills, tolerance Skills for post-editor only (Rico, 2011) -Adv WP: RegEx, S&R, term. management -Positive attitude towards MT

Proposal for PE course (O’Brien, 2002) Theoretical component –Intro to PE/MT tech / controlled language –Adv. term. Management / text linguistics –Basic programming skills Required background –TRA skills; basic linguistics/term manag –IT skills; intro lang tech; source/target skills

Sources for PE certification TAUS (Transl. Automation User Society) –English >23 lang (European, Arabic, Asian) –Also Spanish > English –Cost: 60 Euro (member), 80 Euro (non) SDL MT PE Certification –Free with SDL Language Cloud MT

Perspectives: experienced/advanced What possibilities can MT offer other than post-editing? Is it worth using MT output as an aid to increase TR productivity? Can MT replace with advantages the use of combined TM/on-line TM servers?

Efficiency of PE for language translation Rigorous, controlled analysis (Spence, 2013) -Hypothesis 1: PE reduces translation time -Hypothesis 2: PE increases quality -Hypothesis 3: MT primes the translator Compared PE vs. unaided translation –Blind experiment: TR did not GT was used –Pre-interview: TR showed strong MT dislike –16 PRO TRA/pair: EN-AR, EN-FR, EN-GE

Results clarify value of post-editing Which one is faster? 69% PE Useful? Yes 56%, No 29%, Unsure 15% Suggestions improved quality (all) MT output primes the translator –PE text (closer MT) ≠ Unaided ≠ Raw MT –Lower the TR experience → closer to MT

Does MT output increase productivity? Example 1: Google Translate –Now a paid service: $20/M characters –Plug-in to SDL Trados/other CAT tools –General statistical MT engine –Not customizable –Confidentiality issues –See app for complete setup procedure

Does MT output increase productivity? Example 2: SDL Cloud MT –Price range: $5 – $75 /month (Expert) –Plug-in to SDL Trados/other CAT tools –Complete confidentiality (nothing is stored) –Pre-trained engines: Travel, IT, Life Sciences, Automotive, Consumer Electronics –Customizable MT: can add own glossaries –Comprehensive analytics (quality analysis)

Does MT output increase productivity? Example 3: Microsoft Translator Hub –Plug-in to SDL Trados/others, secure –First 2M char free; 4M/mon $40 –Fully customizable MT engine Previous translations (> 20K words) Add glossaries Request training / evaluate results Option to “Use Microsoft Models”

How about confidentiality? Consider e.g. Microsoft and Google –Among the largest providers of MT –Among the largest buyers of translation –Control information flow around the globe Confidentiality should not be problem –Google not option → MS Hub/SDL Cloud –Uncomfortable sending data to MS/SDL –Use desktop/server solution: Systran

Changes in MT offer to TRs Common scenario for TR (4 years ago) –One affordable desktop product (Systran) –Macros, RegEx, format conversion –No plug-in for CAT tools (high-end) Current scenario –Software as a service (GT, SDL, MS Hub) –Plug-in for CAT tools is standard –Much lower IT requirements

Can MT replace combined/online TMs? Experienced/advanced translators –Use combined/on-line TM for productivity –Proud users: ↑ 50% prod, see TM as asset –TM is error-prone (consistency, mistranslation) –Need to check term consistency MT improved a lot in last 5 years –TRs trust TM fuzzy > raw MT (Guerberof, 2008) –Mistake MT output for TM output (human?)

MTPE can provide a better result Avoid problems in combined/on-line TM: –Terminology inconsistencies –Mistranslations –Waste time correcting TUs that will never use New approach using MTPE –Extract terms (Systran, rule-based) –Customizable MT (SDL Cloud or MS Hub) –Post-edit fresh MT for ↑ productivity/quality

How small LSPs can get started Scenario: 40% TRs use MT (TAUS) Actual post-editing offer (2015) –PE of our MT engine output (GT?) –Payment: 900 words/hour –Instruction: as readable as possible –No pre-editing: cal, gloss or guidelines Translators were really upset

Need more than just Google Translate Pre-editing: custom., prelim. analysis –Key: in-house translator (O’Brien, 2002) Allocate translator for MTPE activities –Use secure on-line customizable engines –Define suitable projects –Invest in terminology management –Develop PE guidelines –No rate discount initially (learning curve)

MT implementation at BLC (4 years) Smaller projects: K words –Terminology extraction (Systran, rule-based) –Normal TR + ED procedure –Semi-customized: SDL Cloud + Multiterm Larger projects > 50K words –Extract bigger glossary (higher coverage) –Raw MT (Systran, SDL Cloud, MS Hub) –PE + ED (no pre-editing, no discount)

MT implementation at BLC What does MT do for BLC? –Leverage my knowledge: engineering/science –Increase productivity (more/larger projects) –Increase quality (terminology/TM updates) Future developments –MS Hub, SDL Cloud; Systran –Hire new translator (2016) –Develop a PE team/service

Conclusion The combination of machine translation (MT) and post-editing (PE) is a disruptive innovation that can improve translator’s productivity and translation quality, no matter how you plan to use it. Can you afford to ignore it?

References Guerberof, Ana (2008). Productivity and Quality in the Post-editing of Outputs from Translation Memories and Machine Translation. Masters Dissertation. Universitat Rovira i Virgili. O’Brien, Sharon (2006). Eye-tracking and Translation Memory matches. Perspectives: Studies in Translatology 14 (3), Spence, Green et al (2013), The Efficacy of Human Post-Editing for Language Translation, ACM Human Factors in Computing Systems (CHI), Computer Science Department, Stanford University Rico, Celia et al (2011), EDI-TA: Post-editing Methodology for Machine Translation, MultilingualWeb-LT. O’Brien, Sharon (2002), Teaching Post-editing: A Proposal for Course Content. Proceedings of the 6th EAMT Workshop on Teaching Machine Translation. EAMT/BCS, UMIST, Manchester, UK DePalma, Donald (2011), Common Sense Advisory, Trends in Machine Translation. Dillinger, Mike et al (2004), Implementing Machine Translation, LISA Best Practice Guides. TAUS (2014), MT Post-editing Guidelines Marciano, Jay (2015), Personal communication.