Translating Database Content with www.crossmediasolutions.de.

Slides:



Advertisements
Similar presentations
EndNote Web Reference Management Software (module 5.1)
Advertisements

EndNote Web Reference Management Software (module 5)
Corporate Imaging Kathy Mumford Session 2 Room C Tennessees BEST 2005 Tech Tips User Conference.
Reliable Interoperation between Open Office & MS office by UOML Alex Wang Chair/OASIS UOML TC Chairman / Sursen Co.
Usage of the memoQ web service API by LSP – a case study
How to Use a Translation Memory Prof. Reima Al-Jarf King Saud University, Riyadh, Saudi Arabia Homepage:
KompoZer. This is what KompoZer will look like with a blank document open. As you can see, there are a lot of icons for beginning users. But don't be.
Mine Action Information Center
MEPO Training MEPO Database Access Training Presentation Copyright 2011 Rodger B. Fluke, MPA.
Web publishing training Introduction to the Tera text Web Content Management System (CMS) Learning Objective: Basic knowledge and skills required to publish.
Single Search By Rakphao Theppan, librarian Searching Online Resources.
SOFTWARE PRESENTATION ODMS (OPEN SOURCE DOCUMENT MANAGEMENT SYSTEM)
Business Process Modeling in Microsoft Visio® Interfacing’s BPMN Modeler: Overview.
Russell Taylor Lecturer in Computing & Business Studies.
An innovative platform to allow translation and indexing of internet sites Localization World
What is so good about Archie and RevMan 5
 Definition of HTML Definition of HTML  Tags in HTML Tags in HTML  Creation of HTML document Creation of HTML document  Structure of HTML Structure.
LEARN THE QUICK AND EASY WAY! VISUAL QUICKSTART GUIDE HTML and CSS 8th Edition Chapter 21: Publishing Your Pages on the Web.
Software development. Chapter 7 – Application distribution and sales.
Microsoft SharePoint 2010 Upgrade Preview FSU SharePoint Users Group Presents: Thursday, December 1 st, 2011.
Tutorial Introduction Fidelity NTSConnect is an innovative Web-based software solution designed for use by customers of Fidelity National Title Insurance.
Web Content Management Systems. Lecture Contents Web Content Management Systems Non-technical users manage content Workflow management system Different.
Chapter 1 Database Systems. Good decisions require good information derived from raw facts Data is managed most efficiently when stored in a database.
Scottish Legal Aid Board Content Management using OpenCms Martin Spinks CTO Navyblue Tuesday, March 16, 2010.
Education Supported by Content Management Systems Milena Stanković, Milan Rajković, Ivan Petković, Petar Rajković Faculty of Electronic Engineering, Niš.
GuideLines of the Eurisol-DS web-site. α-version web site: Word-art with font: (TT) Arial Black Italic font: (TT) Niamey regular.
CompuBase Data for CRM / PRM Integration How compuBase fits to an existing CRM / PRM system? Last review 25/03/2007.
Support.ebsco.com EBSCOhost Basic Searching for Academic Libraries Tutorial.
OFC304 Excel 2003 Overview: XML Support Joseph Chirilov Program Manager.
CS621 : Seminar-2008 DEEP WEB Shubhangi Agrawal ( )‏ Jayalekshmy S. Nair ( )‏
Tutorial 1: Getting Started with Adobe Dreamweaver CS4.
Calculation BIM Curriculum 07. Topics  Calculation with BIM  List Types  Output.
XP New Perspectives on Browser and Basics Tutorial 1 1 Browser and Basics Tutorial 1.
Multi-agent Research Tool (MART) A proposal for MSE project Madhukar Kumar.
FishBase Summary Page about Salmo salar in the standard Language of FishBase (English) ENBI-WP-11: Multilingual Access to European Biodiversity Sites through.
New Features in Release 9.2 (July 27, 2009). 2 Release 9.2 New Features Updated Shopping Experience Home/Shop page Shop at the top search New Hosted Supplier.
Business Software What is database software? p. 145 Allows you to create, access, and manage data Add, change, delete, sort, and retrieve data Next.
G053 Lecture 12 Introduction To HTML Mr C Johnston ICT Teacher
HTML PROJECT #1 Project 1 Introduction to HTML HTML Project 1: Introduction to HTML 2 Vocabulary Internet service provider (ISP) A company that has a.
Unit 3 Day 6 FOCS – Web Design. Journal Unit #3 Entry #4 Write the source code that would make the following display on a rendered page: Shopping List.
Coping with Babel How to Localize XML. Designing for Localization Document design can seriously impact the costs of translation and localization. Remember.
Training Guide for Inzalo SOP Users. This guide has been prepared to demonstrate the use of the Inzalo Intranet based SOP applications. The scope of this.
EDI communication system
TRANSLATION MEMORY TECHNOLOGY
TECHONOLOGY experts INDUSTRY Some of our clients Link Translation’s extensive experience includes translation for some of the world's largest and leading.
EndNote. What is EndNote? EndNote is referencing software that enables you to create a database of references from your readings.
Content Management Systems. CMS DEFINED: A system of collective procedures used to manage various work flows in a collaborative environment. A system.
PASSOLO ® Makes Your Software Ready for the Global Market Localisation Standards The Tools Developer’s Perspective.
Xml:tm XML Based Text Memory Using XML technology to reduce the cost of translating XML documents 27 June 2005.
Xml:tm XML Text Memory Using XML technology to reduce the cost of translating XML documents.
Content Management System Copyright © 2005 Liferay, LLC All Rights Reserved. No material may be reproduced electronically or in print without written permission.
Database Management Systems (DBMS)
Microsoft ® Official Course Structuring and Publishing Websites for All Users Microsoft SharePoint 2013 SharePoint Practice.
Web Information Retrieval Prof. Alessandro Agostini 1 Context in Web Search Steve Lawrence Speaker: Antonella Delmestri IEEE Data Engineering Bulletin.
Sensible Content. Sensible Content Management. © 2006 ZiaContent, Inc. All rights reserved. Automated Publishing, CMS and DAM Rita Warren ~ ZiaContent,
Development of an Intelligent Translation Memory MorphoLogic SZAK Publishers Balázs Kis
The World Wide Web. What is the worldwide web? The content of the worldwide web is held on individual pages which are gathered together to form websites.
2004/051 >> Supply Chain Solutions That Deliver Users.
Thank you for looking into Policy Manager Two for your Head Start program Teresa K. Wickstrom Senior Associate Center for Community Futures
ICAD3218A Create User Documentation.  Before starting to create any user documentation ask ‘What is the documentation going to be used for?’.  When.
01 OPTIONS 01 OPTIONS Lorem Ipsum In play mode, click the image in the tab to the left. This will load the slide for the option clicked. Be sure to edit.
Microsoft Virtual Academy Jamie McAllister | SharePoint MVP & Solution Architect Rob Latino | Program Manager in Office 365 Support.
Conceptual Overview For Understanding the New Paradigm Provided by: Web Services Section.
Internet Made Easy! Make sure all your information is always up to date and instantly available to all your clients.
Project 1 Introduction to HTML.
Submitted By: Usha MIT-876-2K11 M.Tech(3rd Sem) Information Technology
GDSS – Digital Signature
Continuous Client Side Localization
DITA Translation Management Challenges in Japan
Microsoft Office Illustrated Fundamentals
Presentation transcript:

Translating Database Content with

Who We Are… CMS – Cross Media Solutions  is an independent software vendor from Würzburg, Germany.  Founded in 2006 from elements of the printing company Stürtz GmbH,  CMS has decades of experience in providing database solutions for publishers of technical and scientific documentation and industrial catalogs  Technology partners  My name is Steven Calkins.

What will be discussed… Translating Database Content with KITE  Introduction 1. Why is standardized database translation important? 2. Definition of terms  KITE: 1. Features 2. Advantages of XLIFF and KITE 3. Integration in workflow  Conclusion 1. Why is this issue urgent? 2. Decisions to be made

Why is standardized database translation important? Estimates put all the content stored in databases on the world wide web at more than 500 times the information available on the pages which can be found by popular search engines: Database “Dynamic Searchable Databases represent the Deep Web. The Deep Web is made up of hundreds of thousands of publicly accessible databases and is approximately 500 times bigger than the surface Web. At each Deep Web database, you enter a query and get back a Web page created dynamically (on-the-spot) specific to your search. These dynamic Web pages are not linked since they didn’t exist before your query and cease to exist after being sent to you — the search engines can't see nor find them.” * Using the Deep Web as a model of knowledge distribution we can infer that the vast majority of available knowledge is found in databases. Company information is generally stored in a database, typically an electronic resource planning (ERP) software. So if something needs localization, most likely the original source is found somewhere in a database. Search engines index only the surface web Database Surface Web Deep Web Database queries are not indexed * CMS – Cross Media Solutions GmbH is not responsible for the content of any links external to its own website.

Why is standardized database translation important?  Regulations have been increasing in recent years:  USA: Sarbanes-Oxley Act (finance documents).  EU: EU Directive 2006/42/EG (manuals, in 2009).  The people served by translations have greatly varying needs:  Designers & technicians need more “technical” documentation than normal product users do.

Can translation be accomplished with standard procedures?  For low-complexity, small volumes it can be done  With increasing volume and complexity, automated procedures become necessary  Human translators are absolutely necessary, but Translation Memory systems can help translators 1.translate consistently with proper vocabulary, 2.avoid translating the same terms twice.

Definition of terms TermMeaning TMSTranslation Memory Systems They work by looking up seg- ments in a database containing large numbers of previously translated segments with their translation. They provide matches that fit exactly or are similar to the term being trans- lated and display them as suggestions to the translator. SegmentsUnits of sensible translation, usually sentences. Database contents are well-suited for processing with TMS because the segmentation process can be virtually eliminated. * TMXTranslation Memory eXchange Format OSCAR standard (Open Standards for Container/Content Allowing Reuse, a committee of LISA -- the Localization Industry Standards Association) XLIFFXML Localization Interchange File Format OASIS standard KITEThe TMS of Cross Media Solutions implementing the TMX and XLIFF standards. Fully XLIFF compliant. * Still, the segmentation must not be lost in the database–XLIFF–database roundtrip process.

Features of  Localization of database contents  Translation memory functionality  Termbase functionality (TBX)  Scalable, suitable for very large XLIFF files  XLIFF 1.2 and TMX 1.4 support  for Mac OS X and Windows  Single-user and site licenses  Can be integrated in an outsourced workflow

 Translation memory functionality  Faster, more consistent and non-redundant translation  Vocabulary is project-oriented  It is not proprietary, it is open to all  Permits independent developments  Enables tight integration with extant systems Advantages of XLIFF and TMS

All the advantages of XLIFF and TMS, plus:  Preserves database content assignment  Contextual information available  Scalable, suitable for very large XLIFF files  XLIFF 1.2 and TMX 1.4 support  for Mac OS X and Windows  Low cost: Single-user and site licenses Advantages of

Views of

Start by opening a translation memory

Lucene-indexed folder containing translation memory. Details here.

Information about the translation memory may be inspected below

The translation memory may be manipulated in the TM menu:  Open a TM  Save the segments in the TM  Close the TM  Create a new TM  Import a TM from a TMX file  Export the TM to a TMX file

Once the proper TM is open, an XLIFF file may be edited.

Various meta-information is displayed about the selected XLIFF file before opening it. Note the language pair: German -> English

On the left side a tree view is displayed. This shows the content structure of the XLIFF file. Important: properly handles groups. Not every XLIFF system does this recursively. To our knowledge, only KITE is fully XLIFF compliant.

Other features: Approved translation pairs are colored green for easy identification Other features: Context information is displayed, notes can be inserted

Segments: Segments (translation pairs) can be edited in the Segment menu:  Approved  Not approved  Accept only 100% TM matches  Accept best match  Accept source (for terms not to be translated like proper names, technical terms, etc)

Explorer-like tree view can show group hierarchy to any depth

Matches for the translation units are suggested by the Translation Memory. Double-click transfers the suggestion to the target unit(s).

The button “Accept source text” transfers, e.g., technical terms from the source text without translating them. The button “Accept 100% match” transfers only 100% matches to any selected segments The button “Accept best match” also transfers the best TM suggestion to the segments

The translation memory is searchable: The search text is typed here. The quality here. The results are given here.

For a live demonstration, please visit CMS at Stand E10

How can be integrated into a workflow? The catalog data is found in a Product Information Management (PIM) system and an Electronic Resource Planning (ERP) system. Typically, the PIM system can extract information from the ERP and publish this data enriched with marketing texts online and in print. Where does KITE fit in?

How can be integrated into a workflow? Let assume the following task: a large online and print catalog is to be translated from German into English. The work is to be shared among outside translators and an internal translation department. 1.The internal department first exports the Translation Memory (TM) and XLIFF segments from the database. 2.All 100% matches are found in KITE so that no redundant translation is done. All available terms are now correctly assigned. 3.The XLIFF is re-imported into the database. 4.Now the XLIFF and TM are sent to an external translator, who receives only those segments that need translation.

How can save money? ERP Product Information Management XLIFF + TM export XLIFF + TMX XLIFF import Localized online and printed documentation generation, user manuals, catalogs XLIFF + TMX External translation service Internal translation dept.

Why is this issue urgent? The following cites the new EU directive 2006/42/EG. This directive will become effective : Instructions All machinery must be accompanied by instructions in the official Community language or languages of the Member State in which it is placed on the market and/or put into service… General principles for the drafting of instructions (b) Where no "Original instructions" exist in the official language(s) of the country where the machinery is to be used, a translation into that/those language(s) must be provided by the manufacturer or his authorised representative or by the person bringing the machinery into the language area in question. … (d) In the case of machinery intended for use by non-professional operators, the wording and layout of the instructions for use must take into account the level of general education and acumen that can reasonably be expected from such operators.

What decisions are necessary?  Does my company have large quantities of database information?  Will we be required to translate large sections of the database for technical documentation?  Should we use document-based translation?  What is the source of our company documentation?  Could automation help us? We would be glad to speak with you further on these issues at our stand (E10) or contact:

Thank you for your attention