Download presentation
Presentation is loading. Please wait.
Published byEdith Jones Modified over 9 years ago
1
Translation memory systems to enhance the quality and productivity of localization of teaching materials Sam Joachim 6 th Workshop Software Engineering Education and Reverse Engineering, Ravda (Nessebar), Bulgaria, 18 – 23 September 2006
2
Last years suggestion for future development of S-Bahn Tool Transformation of our PowerPoint material in an independent XML format (perhaps based on 3 ) ◦ With support for different learning paths ◦ With the possibility of easily exchanging parts of the content (case studies, examples, …) ◦ With different outputs (ppt, pdf, html, textbook …) ◦ For different target groups, perhaps with diverse knowledge levels Building of a kind of repository / pool system for the learning objects and an authoring system for adaptable study packs or (e-)learning material Support the use of ‘Authoring by Aggregation’ in the system 2 Translation Memory Systems
3
Authoring by aggregation: main idea A new learning object is composed of some parts from LO A (a picture), LO B (some text) and 2 new modules N a and N b 3 Translation Memory Systems Learning object A Learning object B New learning object Information Objects / Modules Modul N a Modul N b
4
Some problems to solve Learning object models adapted to ‚Authoring by aggregation‘ XML representation of our slides / material S-Bahn tool problems with respect to localization (quality, efficiency of translation) 4 Translation Memory Systems
5
Strategy Environment with support for„Authoring by aggegation“ ◦ Modularisation / generation of fine-granular (information / learning) objects from our material ◦ Transformation PPT XML ◦ Translation support with a Translation Memory System 5 Translation Memory Systems Learning object models adapted to ‚Authoring by aggregation’ XML representation of our slides S-Bahn tool problems with respect to localization
6
Learning object models adapted to ‚Authoring by aggregation’ & XML representation of our slides / material 6 Translation Memory Systems
7
o n n n e t s w y s e e e y t s s w g Learnativity Content Model (Duval & Hodgins 2003) Shapes, Pictures, Textfields, Diagrams 7 7 Translation Memory Systems Source: E. Duval and W. Hodgins, A LOM research agenda. In Proceedings of the twelfth international conference on World Wide Web, pages 1–9. ACM Press, 2003. Modular Content Hierarchy JCSE Course Section, Topic Slides, Section Grouped elements, single or associated slides
8
Learnativity content model Raw Media Elements are the smallest level in this model: these elements reside at a pure data level. (single sentence or paragraph, illustration, animation, etc.). Information Objects are sets of raw media elements. Such objects could be based on the “information block” model developed by Horn (Horn 1998). Based on a single objective, information objects are then selected and assembled into the third level of Application Specific Objects or learning objects in a more restricted sense. The fourth level refers to Aggregate Assemblies that deal with larger (terminal) objectives. This level corresponds with more conventional lessons or chapters. Lessons or chapters can be assembled into larger collections, like courses and whole curricula. The fifth level refers to these Collections. 8 Production of highly adaptable learning and teaching materials by 'Authoring by Aggregation' 8 Translation Memory Systems
9
Modular content objects explained on a typical slide 9 Translation Memory Systems Information object / Remark for the audience Learning Object / Group of elements Raw media object / textfield
10
Generation of fine-granular objects Use / adaptation of an existing XML teaching material language Automatic generation of ◦ Raw media objects (shapes, textfields) ◦ Information objects (groups of objects, graphics) Semi-automatic by selecting from the automatically generated elements ◦ Learning objects (associated slides,..) ◦ Higher level objects like Aggregate assemblies (topics) Collections (whole JCSE) 10 Translation Memory Systems
11
Example for generated objects 11 Translation Memory Systems Waterfall model
12
Teaching / Learning Object or Material Repository Schematic of a transformation process: knowledge grid.doc.pdf.ppt eLesson...... Essence What is the essence? Text? Pictures? Style? Tool Tool 2 Automated. Some AI? Half-baked essence Tool 3 Final document Interactivity. Some NI XML? Moodle Ahyco Something else...... Tool 4 Automatic generation of “Raw media objects“ Raw data and media elements in XML format Information objects (groups of objects, graphics) Automatic grouping of connected objects Objects in some Teaching Material Language (LMML / ) Interactiv „Authoring by Aggregation“ process uses fragments/modules to generate new material
13
Two ways for representing ppt in xml to preserve our ppt layout 13 Translation Memory Systems embedding original ppt code in container elements in the xml-language binary ppt representation Analysis and Definition build a transformation from the elements from ppt to the elements in the xml-language and vice versa (easier modification of extisting material)
14
Translation Memory Systems 14 Translation Memory Systems
15
15 ◦ Word by word translation ◦ Repeated translation of phrases ◦ Difficult use of the feature for the content slides / slides with reappearing content S-Bahn Tool problems Possible solution: ◦ The use of a translation memory system. Translation Memory Systems
16
TM-Systeme: Introduction two main types of translation support ◦ Machine Translation (fully automatic) Try to translate autonomous, in most cases nonsatisfying ◦ Machine Assisted Human Translation (computer-aided) Mainly with Translation Memory Systems 16 Translation Memory Systems
17
Background - History Who: Target group - professional translators Where: professional translation agencies / specialized companies What: Big translation projects with lots of different media and/or documents and dicument versions ◦ Documents to translate: Technical / project / product documentation Texts with technical /natural science background Software in different language variants and during different program versions Including GUI elements, user / product documentation 17 Translation Memory Systems
18
TM System Is a data base Records sentences or word groups (segments) with the corresponding translation During translation, the TM-System searches the already translated segments for similarities with the actual segment The translator can easily use already translated segments 18 Translation Memory Systems
19
TM System (cont.) Every sentence / segment will be checked during translating of other texts If it is already in the data base, it is possible to adopt the translation as it stands ◦ No segment /sentence should be translatet two times. (Similar to the S-Bahn Tool content slides feature, but much more generic in practice.) Highly effective TM Systems ease routine jobs of the translator. He/she can concentrate more on the creative task of translating. 19 Translation Memory Systems
20
Fuzzy searches In realworld texts, a sentence or seqment is very seldom exactly repeated. The most TM systems search not only for exact matches but also for matches with a certain similarity, the so called Fuzzy Matches Fuzzy Matches: only marginal differences, e.g. numbers, names, additional words… Of course, the translator has to do a manual adaptation when taking over a fuzzy match Example from JCSE Topic 3 Slide 26: Already known:Part of the phase ‚Analysis and Design‘, in which the basic use cases of the systems will be detected: use case diagrams New: Part of the phase ‚Analysis and Design‘, in which the basic classes of the problem will be detected: class diagrams 20 Translation Memory Systems
21
Existing TM Systems Trados ◦ http://www.trados.com http://www.trados.com SDLX ◦ http://www.sdlint.com http://www.sdlint.com Transit ◦ http://www.star-ag.ch/products/transit http://www.star-ag.ch/products/transit DéjàVu ◦ http://www.atril.com http://www.atril.com OmegaT ◦ http://www.omegat.org/ http://www.omegat.org/ Transolution ◦ http://transolution.python-hosting.com/ http://transolution.python-hosting.com/ 21 Translation Memory Systems
22
Parts of the TM System 22 Translation Memory Systems Translation Memory DB Terminology / Glossary DB Alignment Tool Inserts already translated texts GUI Dictionary functions and sometimes glossary Collects and provide technical terms during translation Can easily changed and expanded Quick online search with sentences / segments and their translations
23
Parts of the TM System Translation Memory ◦ Data base with sentences / segments and their translations Terminology data base ◦ Dictionary functions and sometimes glossary ◦ Collects and provide technical terms during translation ◦ Can easily changed and expanded, Quick online search like online dictionarys Alignment Tool ◦ Already translated texts an be inserted into the data base 23 Bildliche Darstellung 3 Objekte Translation Memory Systems
24
Alignment Tool: Example GPL 24 Translation Memory Systems
25
Typical user interface Two windows for source and target language Fuzzy index with matches Terminology dictionary 25 Translation Memory Systems
26
TM System: Transolution 26 Text to translate Suggested translation (from fuzzy match) to be adapted by the translator Fuzzy match: explanation of the suggested translation Display of the context of the text to translate Translation Memory Systems
27
27 TM System: OmegaT Text to translate embedded in its context Suggested translation (from fuzzy match) and Glossary Translation Memory Systems
28
TM System: Transit 28 Text to translate Suggested translation (from fuzzy match) to be adapted by the translator Fuzzy match: explanation of the suggested translation Terminology Translation Memory Systems
29
Terminology/ Glossary Seach: Functions Translation: Fuzzy match: (45%) Specification of the structure of the software (50%) specification of components and their relations Spezifikation der Struktur der SoftwaSpezifikation der Komponenten und ihrer Beziehungen Terminology: Specification of components;relation Other languages Macedonian: Специфицирање на структурата на софтверот (софтверска архитектура), специфицирање на компонентите и нивните односи Romanian: Specificarea structurii SW (arhitectura software ), specificarea componentelor şi a relaţiilor între ele Serbian Cyrillic: Спецификација структуре софтвера (софтверска архитектура), спецификација компоненти и нјихових веза Possible User Interface
30
Use of a TM system for localization of teaching materials Applicabe especially for PPT materials because ◦ High number of reappearing text segments ◦ Not too much complete sentences, more phrases, segments Higher support for translators higher productivity Not restricted to PPT, open for any other document format Possibility for a TM system web service for Software Engineering materials Nessesary adaption: not only two languages but many 30 Translation Memory Systems
31
Thank You. Space for Questions
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.