Tyler Snow Brigham Young University Translation Research Group.

Slides:



Advertisements
Similar presentations
1 Metadata Tools for JISC Digitisation Projects of still images and text Ed Fay BOPCRIS, Hartley Library University of Southampton.
Advertisements

DIGIDOC A web based tool to Manage Documents. System Overview DigiDoc is a web-based customizable, integrated solution for Business Process Management.
Using the Self Service BMC Helpdesk
CHAPTER OBJECTIVE: NORMALIZATION THE SNOWFLAKE SCHEMA.
Effective management Accurate tracking Easier automation.
Register Laulima Workshop for Instructors Solutions to help you engage your students through Laulima.
Josep Bonet Heras & Olaf-Michael Stefanov On behalf of Alan Melby and Tomas Carrasco Benitez May 2013, Nairobi, Kenya.
Digital Preservation - Its all about the metadata right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel Saturday, August.
Cross-Jurisdictional Immunization Data Exchange Project Updated 4/29/14.
Analyzing Systems Using Data Dictionaries Systems Analysis and Design, 7e Kendall & Kendall 8 © 2008 Pearson Prentice Hall.
S.R.F.E.R.S. State, Regional, and Federal Enterprise Retrieval System Inter-Agency & Inter-State Integration Using GJXML.
Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.
Russell Taylor Lecturer in Computing & Business Studies.
Information Extraction from Documents for Automating Softwre Testing by Patricia Lutsky Presented by Ramiro Lopez.
Comparing M2T & M2M Complementary Approaches © 2008 INRIA, University of York & SINTEF Comparing M2T & M2M Complementary Approaches Hugo Bruneliere,
Introducing HTML & XHTML:. Goals  Understand hyperlinking  Understand how tags are formed and used.  Understand HTML as a markup language  Understand.
(C) 2013 Logrus International Practical Visualization of ITS 2.0 Categories for Real World Localization Process Part of the Multilingual Web-LT Program.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
The chapter will address the following questions:
XML, DITA and Content Repurposing By France Baril.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Project Proposal: Academic Job Market and Application Tracker Website Project designed by: Cengiz Gunay Client: Cengiz Gunay Audience: PhD candidates and.
1.Database plan 2.Information systems plan 3.Technology plan 4.Business strategy plan 5.Enterprise analysis Which of the following serves as a road map.
SYSTEM ANALYSIS AND DESIGN
San José, CA – September, 2004 Localizing with XLIFF and ICU Markus Scherer Raghuram (Ram) Viswanadha IBM San.
AICT5 – eProject Project Planning for ICT. Process Centre receives Scenario Group Work Scenario on website in October Assessment Window Individual Work.
HOLMES 2 System Administration Through HOLMES 2 System Administration, an incident can be configured such that access is restricted to: – specified users.
Publish Your Work BIM Curriculum 04. Topics  External Collaboration  Sharing the BIM model  Sharing Documents  Sharing the 3D model  Reviewing 
BPMN By Hosein Bitaraf Software Engineering. Business Process Model and Notation (BPMN) is a graphical representation for specifying business processes.
An XMPP (Extensible Message and Presence Protocol) based implementation for NHIN Direct 1.
Profiling Metadata Specifications David Massart, EUN Budapest, Hungary – Nov. 2, 2009.
Introduction to XML. XML - Connectivity is Key Need for customized page layout – e.g. filter to display only recent data Downloadable product comparisons.
The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in.
Architecture for a Database System
Copyright © 2004 by The Web Services Interoperability Organization (WS-I). All Rights Reserved 1 Interoperability: Ensuring the Success of Web Services.
Promoting Web Services Interoperability Across Platforms, Applications and Programming Languages Basic Profile 1.0 August 12, 2003 Copyright © 2003 by.
Ontology-Driven Automatic Entity Disambiguation in Unstructured Text Jed Hassell.
Copyrighted material John Tullis 10/17/2015 page 1 04/15/00 XML Part 3 John Tullis DePaul Instructor
Tuesday, November 12, 2002 LRC 2002 Conference XLIFF An XML standard for localisation Tony Jewtushenko – Oracle Peter Reynolds – Bowne Global Solutions.
EDI communication system
Information Systems & Databases 2.2) Organisation methods.
The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in.
The eXtensible Markup Language (XML). Presentation Outline Part 1: The basics of creating an XML document Part 2: Developing constraints for a well formed.
CMPS 435 F08 These slides are designed to accompany Web Engineering: A Practitioner’s Approach (McGraw-Hill 2008) by Roger Pressman and David Lowe, copyright.
PASSOLO ® Makes Your Software Ready for the Global Market Localisation Standards The Tools Developer’s Perspective.
Standards for Technology in Automotive Retail STAR Update Michelle Vidanes STAR XML Data Architect April 30 th, 2008.
Project Database Handler The Project Database Handler dbCCP4i is a brokering application that mediates interactions between the project database and an.
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
Benjamin Post Cole Kelleher.  Availability  Data must maintain a specified level of availability to the users  Performance  Database requests must.
ITS 2.0 in XLIFF 2 FEISGILTT Dublin June 2014 Yves Savourel ENLASO Corporation This presentation was made possible by.
 Shopping Basket  Stages to maintain shopping basket in framework  Viewing Shopping Basket.
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
The Claromentis Digital Workplace An Introduction
 XML derives its strength from a variety of supporting technologies.  Structure and data types: When using XML to exchange data among clients, partners,
Accurate  Consistent  Compliant Contact: i4i the structured content company the structured content company.
Academic Cooperation: Terminology Research for IATE.
A report by Olaf-Michael Stefanov to the JIAMCATT community
Project Management: Messages
Information Delivery Manuals: Functional Parts
Active Data Management in Space 20m DG
Chapter 2 Database Environment.
Part of the Multilingual Web-LT Program
Teaching slides Chapter 8.
Open Archive Initiative
Module P4 Identify Data Products and Views So Their Requirements and Attributes Can Be Controlled Learning Objectives: Understand the value of data. Understand.
Metadata The metadata contains
The ultimate in data organization
AICT5 – eProject Project Planning for ICT
Use Cases Simple Machine Translation (using Rainbow)
Reportnet 3.0 Database Feasibility Study – Approach
Presentation transcript:

Tyler Snow Brigham Young University Translation Research Group

Abstract: There are many evolving technology standards in the translation/localization industry, such as XLIFF and the Internationalization Tag Set (ITS). The Linport project aims to define a standard container format for translation projects and tasks within projects. Linport, XLIFF, and ITS are complementary This presentation presents the need for a container standard in addition to XLIFF and ITS and describes the status and likely future of Linport

Container Formats Being able to send a message does not guarantee communication. Language is a barrier A similar communication breakdown can occur between translation systems Many translation file formats! Good technological communication saves time and money Containers as a possible solution?

Shipping containers

“Shipping” in the translation industry Similar to the previous example: Translation projects between different TEnT tools cannot be easily opened in another Best case scenario: similar file types without association Worst case scenario: different file types

Linport Language Interoperability Portfolio An open, comprehensive, interoperable container solution for all translation processes Currently under development Carries the details and various files pertaining to each task of a translation project. Can contain a variety of proprietary formats

How does Linport contain files? Standardized directory structure format. Portinfo metadata Payload EN Docplus (unique name equal to those in other groupLangs) support Doc (one translation file)

STS Structured Translation Specifications 21 translation parameters for translation project definition Considers: Source, Target, Environment, Stakeholder Relationships Defines metadata (localization) at the project and task level in Linport

History of Linport 2011 LISA –Container format and STS 2011 EC DGT -MED IN! TIPP Other groups soon joined in.

Relationship to other formats Other specifications address part of the industry need. Linport works at a different level of abstraction than most specifications mainly at the project data level Enhances capability of already capable file formats Organizes these formats to promote efficiency Two of these formats are: XLIFF ITS

XLIFF Very important localization related standard. Used in storing, translating and reexporting content Any two tools with XLIFF import/export can easily transfer translation data Linport adds: Ability to package multiple XLIFF files as a project Include related support files to those XLIFF files Metadata including STS is added and passed along the translation process with the XLIFF file All this will maintaining the relationship information between XLIFF and support files across TEnT systems

Continued XLIFF helps Linport! Novice Linport users might pass along PDFs as source texts When XLIFF is used as the source text, Linport interoperability is enhanced at the task level Thus it is hoped that Linport users will incorporate XLIFF at the very heart of their portfolios Perhaps a “strict” Linport version will require XLIFF. Taken together these points demonstrate that an ideal ecosystem would support both XLIFF and Linport, with both formats handling their respective strengths in tandem to help deliver process resources and automation (Linport) and low-level content interoperability (XLIFF)

ITS Simplest example of ITS = the translate category Localization notes, which provide guidance for how content should be treated in localization processes The ability to mark and identify terminology The ability to identify text directionality in formats that do not natively support it Information about the language of text. A mechanism to specify whether XML elements within text should interrupt text flow or be seen as part of the surrounding text

Continued Information on the domain (subject field) of text The results of automated text analysis processes (such as named entity extraction) The ability to tag text (locale filter) for translation into specific locales (or to exclude it from translation for specific locales) A mechanism to identify the provenance (source) of text, e.g., to state that particular content was translated with a particular system A mechanism to point to external resources that contain the translatable text associated with a particular piece of content, such as a graphic, that cannot be translated directly

Continued The ability to indicate that particular elements in a file format should contain the translation of particular other elements (target pointer) A mechanism to add ID values to content that lacks them to support translation requirements Indications as whether or not to preserve spaces in content A set of metadata on localization quality issues and localization quality ratings

Continued Information on the internal confidence of machine translation systems (MT confidence) that can be used to flag content for further attention A mechanism to specify the allowed characters in a given piece of content (e.g., to state that a translation should not allow certain forbidden characters) A way to indicate the maximum storage size for content, such as database entries, that must conform to size limitations

Linport and ITS This List indicates some overlap between Linport and ITS functionality The Difference is: abstraction Once again they work together Example: Linport might specify that the domain of an entire project is “legal” but ITS 2.0 can specify that a short section within a document in fact belongs to another domain: A company report needs to be translated BUT (e.g., there is a lengthy quotation from a company’s earnings report [domain = financial] embedded in a legal brief). That the company does not wish to be in the final translation

Ideal scenario ITS

Linport split tool

Quality Assessment Metrics QTLP: (see an emerging translation quality assessment metric format that is customizable for different projects or documents. QTLP or any other metric format can be contained in a Linport portfolio. STS and QTLP recognize that translation quality is not a one size fits all proposition. Translation parameters and quality measurements must be adapted to meet particular needs.

Linport, QTLP, and STS Because all of these are included in Linport and passed along the translation chain, it helps to: Set project expectations Provide relevant ways of ensuring that expectations are met

Advantages of Linport Ease of Implementation -Time It is free! –Money (at least for now)

Current and future development Portfolio data model needs to be refined and formalized with a schema and validation system Submitted to a standards organization Online portfolio builder Splitter Merger

How to get involved Join the Linport community to participate in monthly conference calls and view ongoing progress reports and website posts at: as well as the GALA Linport community group at: Project data!!!!! Send to: or

continued Contribute testing apps developed for Linport Develop your own Linport applications Introduce Linport into your company’s translation workflow as an early adopter

The Linport community is open and thankful for any support you and your company are willing to provide Thank you! Slides and full paper will be available at