30.11.-2.12.1007practical aspects1 Translation Tools Translation Memory Systems Text Concordance Tools Useful Websites.

Slides:



Advertisements
Similar presentations
Getting Ready for a Career in Translation Equipping Yourself with the Tools for Success © Jost Zetzsche internationalwriters.com.
Advertisements

Machine Translation The Translator s Choice Heidi Düchting Sylke Krämer Johann Roturier.
Presenter: Jamie Lucero
Usage of the memoQ web service API by LSP – a case study
How to Use a Translation Memory Prof. Reima Al-Jarf King Saud University, Riyadh, Saudi Arabia Homepage:
Translation Made Easy STAR Group Top 10 Lessons for Translators January 20 th 2006.
Leveraging TM Technology to Improve Translatability & Usability Dr Jody Byrne University of Sheffield.
HIEROGLIFS TRANSLATIONS Feel the power of word!. WHAT CLIENTS EXPECT FROM TRANSLATION AGENCIES AND HOW HIEROGLIFS TRANSLATIONS MEETS THEIR EXPECTATIONS.
A Syntactic Translation Memory Vincent Vandeghinste Centre for Computational Linguistics K.U.Leuven
Computer Assisted Translation CAT Alexander C. Wu
The SDL Translation Forum: San Francisco Thursday 8th June 2006 Helping Organizations Upgrade their Memory through CCM Methodology Toby Newman Worldwide.
Computerized Translation Process (experience of Argonaut' Translation Center) Oleg Vigodsky Gleb Yalchik.
Computer Assisted Translation CAT Alexander C. Wu Fall 2004.
FLUP - Elena Zagar Galvão Faculdade de Letras da Universidade do Porto INFORMÁTICA DE TRADUÇÃO FALL SEMESTER 2008 Lesson 5 Teacher: Elena Zagar Galvão.
Research methods in corpus linguistics Xiaofei Lu.
Quality Management for the Localization of Marketing Texts.
Lecture 04.  DTP  Some features and their configuration  Fields and Filters  Summary.
Stefan Kreckwitz Senior System Engineer across Systems GmbH „Future Web-Based Translation Environments“ Localisation Research Forum 28 September 2007,
McEnery, T., Xiao, R. and Y.Tono Corpus-based language studies. Routledge. Unit A 2. Representativeness, balance and sampling (pp13-21)
Corpus linguistics for translators Amanda Saksida University of Nova Gorica.
 Trends: › usual trio: desktop version, server version, cloud version › cloud version + free editor › industry standards adopted (XLIFF, TMX, TBX)
Working freelance for an international organisation.
Writing your dissertation. Overview Dissertation structure and components Writing Software assistance A look at past dissertations.
Researching language with computers Paul Thompson.
Introduction to XML. XML - Connectivity is Key Need for customized page layout – e.g. filter to display only recent data Downloadable product comparisons.
Can Controlled Language Rules increase the value of MT? Fred Hollowood & Johann Rotourier Symantec Dublin.
Lecture 01 (Tuesday 18 September).  Lecture 01 What is a TM, some tools Getting started (UI, create a TM, open file, translate, edit, preview)  Lecture.
© 2006 Pearson Education 1 Obj: cont 1.3 and 1.4, to become familiar with identifiers and to understand how programming languages work HW: p.51 #1.8 –
Translation Technologies Računalne tehnologije za prevo đ enje dr. Špela Vintar Department of Translation Studies Faculty of Arts University of Ljubljana.
Digital Information and Heritage INFuture Zagreb, Sentence Alignment as the Basis For Translation Memory Database Sanja Seljan Faculty of.
1 University of Palestine Topics In CIS ITBS 3202 Ms. Eman Alajrami 2 nd Semester
Sofia Garcia/Roberto Silva Tutorial Workshop, GrenobleDate: 31/Jan/2007 The work of a professional translator and the translation agency V1.0.
Tracking Language Development with Learner Corpora Xiaofei Lu CALPER 2010 Summer Workshop July 12, 2010.
Gerrit Schutte OHIM 9th of December, 2011 Trademark terminology control.
Introduction to GATE Developer Ian Roberts. University of Sheffield NLP Overview The GATE component model (CREOLE) Documents, annotations and corpora.
Seminar in Applied Corpus Linguistics: Introduction APLNG 597A Xiaofei Lu August 26, 2009.
Open Source CAT Tool Patrícia Azeredo Ivone Ferreira IT for Translation 2009/2010.
Translation Memory System (TMS)1 Translation Memory Systems Presentation by1 Melina Takanen & Julianna Ekert CAT Prof. Thorsten Trippel University.
Transforming Parallel Corpora to Translation Memory Steve Legrand IPN 29th Sept
Computer Literacy for IC 3 Unit 2: Using Productivity Software Chapter 3: Formatting and Organizing Paragraphs and Documents © 2010 Pearson Education,
1 Machine Assisted Human Translation (MAHT) (…aka “Translation Memory” or “CAT tool”) …and what it does for the translator…
Corpus Linguistics in Research Doctorate in Education University of Warwick 6th November 2008.
Approximate sentence matching and its applications in corpus-based research Rafał Jaworski INFuture2015, Zagreb, Croatia.
Colorado State University
Development of an Intelligent Translation Memory MorphoLogic SZAK Publishers Balázs Kis
SDL Trados Studio 2014 Getting Started. Components of a CAT Tool Translation Memory Terminology Management Alignment – transforming previously translated.
Software Localization Dr. Keiran J. Dunne Institute for Applied Linguistics Lecture 5.
Ms.Asma AlOtaibi. By the end of the workshop you will be able to:  To create a project  To create a translation memory  To export a file for external.
SDL Trados Studio 2014 Creating and Managing TMs Alignment Reviewing translations.
Corpus Linguistics MOHAMMAD ALIPOUR ISLAMIC AZAD UNIVERSITY, AHVAZ BRANCH.
Evaluating Translation Memory Software Francie Gow MA Translation, University of Ottawa Translator, Translation Bureau, Government of Canada
Metatexis “the easy way to translate” By: Diana Delgado Ma. Victoria Porro Master en Traduction – TAO ETI – automne 2009.
What is a Corpus? What is not a corpus?  the Web  collection of citations  a text Definition of a corpus “A corpus is a collection of pieces of language.
NLP Midterm Solution #1 bilingual corpora –parallel corpus (document-aligned, sentence-aligned, word-aligned) (4) –comparable corpus (4) Source.
PRIMENJENA LINGVISTIKA I NASTAVA JEZIKA II 3 rd class.
Corpus Linguistics Anca Dinu February, 2017.
Can you trust a TM? Results of an experiment conducted in November 2015 and August 2016 with students and professional translators. Daniela Ford Centre.
Open Source CAT Tool.
Computational and Statistical Methods for Corpus Analysis: Overview
Natural Language Processing (NLP)
Introduction to Corpus Linguistics: Dispersion/concordance plots
Translating and the Computer London, 16 November 2017
Translation Workspace File Filters
Using Translation Memory to Speed up Translation Process
Workshop CAT Technology & Localization
Using GOLD to Tracking L2 Development
LINGUA INGLESE 2A – a.a. 2018/2019 Computer-Aided Translation Technology LESSON 3 prof. ssa Laura Liucci –
Natural Language Processing (NLP)
LINGUA INGLESE 2A – a.a. 2018/2019 Computer-Aided Translation Technology LESSON 4 prof. ssa Laura Liucci –
Natural Language Processing (NLP)
Presentation transcript:

practical aspects1 Translation Tools Translation Memory Systems Text Concordance Tools Useful Websites

practical aspects2 Translation Memory Systems General principles and features Scope of application with legal texts: pros and cons Demonstration of OmegaT Hands-on experience with OmegaT

practical aspects3 Principles and Features I Translate only once, re-use of already translated text:  Internal recurrence Segments, which are occur more than once in the source text and have not been translated before and thus have not been saved in the translation memory (exact matches) match propagation.  External recurrence text segments which have been previously translated in other texts and thus are saved in the translation memory Consistency of text and terminology: identical text will always be translated the same way

practical aspects4 Principles and Features II Keep format information intact that means separating text from layout information  edit text in an editor  use of filters for different file formats  import = replace layout information with placeholders  export = replace placeholders with original layout information

practical aspects5 How does it work?  the text is splitted up in text chunks (=segments) which can be sentences or paragraphs.  each segment is matched to the translation memory exact match fuzzy match match-percentage: value of minimal similarity between a new text segment to translate and a segment found in the memory  segments will be saved in pairs: source language text and target language text  these form translation units (TU) in translation memories

practical aspects6 Segmentation rules Sentence level  Pro: smaller text segments = higher reusability better matching  Con: sentence for sentence translation method, more post- editing needed Paragraph level  Pro: bigger text segments = better translation quality, less post-editing  Con: lower reusability

practical aspects7 Best context for the use of TM Conventionalized text structures Stereotypical formulations Identical macrostructure Frequent updates of a text

practical aspects8 Legal texts types Best suited for translation memory systems are highly standardized legal text types such as:  bylaws (e.g.  contracts, agreements (e.g.  sentences  administrative legal texts

practical aspects9 Commercial Software SDL Trados Translator's Workbench SDLX (SDL)‏ DejaVu (Atril)‏ Transit (Star)‏ MemoQ MultiTrans (Multicorpora)‏ Wordfast (Champollion)‏ Heartsome Translation Suite

practical aspects10 OmegaT OpenSource Translation Memory  requirements  Java Runtime Environment (

practical aspects11 Corpus and Concordance Tools A corpus is a collection of pieces of language that are selected and ordered according to explicit linguistic criteria in order to be used as a sample of the language. (EAGLES, 1996)‏ use of corpora: – Word frequency – Lexical context – Syntactic context – Semantic context – Style

practical aspects12 Legal Corpora Criteria for a legal corpus:  Legal system and language  General legal corpus (e.g. Austrian legal language) = extensive, very large to be representativ  Specific legal corpus (e.g. Austrian legal language for Civil Law) = still large  Specilized legal corpus (e.g. Austrian legal language for succession law) = manageable  Genre specific corpus (e.g. Austrian testaments) = very manageable and suited for translation purposes

practical aspects13 Concordance Tools TextStat2 berlin.de/textstat/software-en.htmlhttp:// berlin.de/textstat/software-en.html AntConc  Literature: Pearson, Jennifer; Bowker, Lynne (2002): Working with specialized language. a practical guide to using corpora. Routledge, London. Bowker, Lynne (2002): Computer-aided Translation Technology. A Practical Introduction. University of Ottawa Press, Ottawa..