9th Open Forum on Metadata Registries: Harmonization of Terminology, Ontology and Metadata, 20th – 22nd March, 2006, Kobe, Japan. TBX TermBase Exchange Format.

Presentation transcript:

9th Open Forum on Metadata Registries: Harmonization of Terminology, Ontology and Metadata, 20th – 22nd March, 2006, Kobe, Japan
TBX TermBase Exchange Format
Alan Melby, Marc Carmen, Gerhard Budin
Brigham Young University, University of Vienna

9th Open Forum for Metadata Registry, Kobe, 2006
Outline
• (A) Introduction to TBX
• (B) TBX with Other Standards
• (C) MLTMS (An example of TBX)
• (D) Other Standards
• (E) Tasks for TBX

A: Introduction to TBX
• TBX® stands for TermBase eXchange
• TBX is a Terminological Markup Framework (TMF) markup language
  – TMF is an ISO standard (16642)
• TBX is consistent with ISO (MARTIF)
• TBX is maintained by OSCAR
• The TBX specification is free
• But why should I care about TBX?

Who Should Care about TBX?
• If you don't care about terminological consistency in your translation/localization work, then you have no reason to care about TBX
• If you only need a simple bilingual list of terms (source term and target term) with no additional information, then you don't need TBX; just use a two-column spreadsheet for your list

On the other hand …
• If you do care about terminological consistency and you maintain one or more terminology databases (termbases), then you should be interested in TBX, unless you want your termbase to be locked into the terminology management software you are currently using.
• Portability of complex terminological data is the key benefit of TBX

What does TBX look like?
• A TBX file is an XML document
• A TBX file consists of:
  – A header that describes the file
  – A set of entries, one per concept in the termbase
  – For each concept, a set of terms, grouped by language, that designate the concept
• A terminological concept entry (termEntry)
  – Can be multilingual
  – Can be monolingual

Example of a TBX file
[+ ref to DTD/schema]
[ global info ]
[concept entries]
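The element tags of this example did not survive in the transcript. The following is a minimal sketch of the header-plus-entries layout described above, not the original slide content. The element names martif, text and body, the type attribute, and the entry ids are assumptions based on the MARTIF-style layout; martifHeader and termEntry are named elsewhere in this presentation. Python's standard xml.etree.ElementTree is used only to confirm that the skeleton is well-formed and to locate its parts.

```python
import xml.etree.ElementTree as ET

# Hypothetical TBX skeleton. Element names other than martifHeader and
# termEntry are assumptions, not recovered from the original slide.
TBX_SKELETON = """\
<martif type="TBX" xml:lang="en">
  <martifHeader>
    <!-- global info: describes the file -->
  </martifHeader>
  <text>
    <body>
      <!-- concept entries: one termEntry per concept -->
      <termEntry id="c1"/>
      <termEntry id="c2"/>
    </body>
  </text>
</martif>
"""

root = ET.fromstring(TBX_SKELETON)
header = root.find("martifHeader")               # the header that describes the file
entries = root.findall("./text/body/termEntry")  # the set of concept entries

print(root.tag, header is not None, [e.get("id") for e in entries])
```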

TBX Header
from Budin Kobe 2006
SYSTEM "TBXDCSv05c.xml"

Food Sample: Dollop of Cream & Frog Legs

TBX Body
[concept: a dollop of cream]
[concept: frog legs]

Concept Entry for “Dollop of cream”
Recipe Ingredient
s81
a spoonful of cream
a dollop of cream
sahnehaube

Concept Entry for “Frog Legs”
Restaurant Menu Item
s79
the thigh of a frog leg (the foot has been removed)
frog legs
cuisses de grenouilles
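Only the text values of the two concept entries survive here, so the sketch below shows one plausible encoding of the “Dollop of cream” entry and how its parts can be read back. The descrip element, its type values, the tig/term nesting, the entry id, and the language codes en and de are all assumptions made for illustration. The surviving value s81 is left out because its role in the original markup is not recoverable.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes xml:lang under the predefined XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# Hypothetical markup; only the text values come from the presentation.
DOLLOP_ENTRY = """\
<termEntry id="c1">
  <descrip type="subjectField">Recipe Ingredient</descrip>
  <descrip type="definition">a spoonful of cream</descrip>
  <langSet xml:lang="en">
    <tig><term>a dollop of cream</term></tig>
  </langSet>
  <langSet xml:lang="de">
    <tig><term>sahnehaube</term></tig>
  </langSet>
</termEntry>
"""


def read_entry(entry_xml):
    """Return the concept-level descriptions and the terms grouped by language."""
    entry = ET.fromstring(entry_xml)
    descrips = {d.get("type"): d.text for d in entry.findall("descrip")}
    terms = {}
    for lang_set in entry.findall("langSet"):
        lang = lang_set.get(XML_LANG)
        terms[lang] = [t.text for t in lang_set.iter("term")]
    return {
        "subjectField": descrips.get("subjectField"),
        "definition": descrips.get("definition"),
        "terms": terms,
    }


print(read_entry(DOLLOP_ENTRY))
```

The subject field and definition sit at concept level, while each term sits inside the language section it belongs to, which is what lets one entry be monolingual or multilingual.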

B: TBX with Other Standards
• (1) TBX and ISO 16642 (TMF)
• (2) TBX and ISO 12620 (Data Categories)
• (3) TBX and SKOS

1: TBX and ISO 16642
• TBX is a TML (Terminological Markup Language) of TMF (ISO 16642) (see Annex B)
• TBX maps to the TMF meta-model
  – A TBX file is a TDC (terminological data collection)
  – martifHeader provides GI (global information)
  – termEntry: TE (terminological entry)
  – langSet: LS (language section)
  – tig/ntig: TS (term section)
• A TMF DCS (Data Category Selection) in TBX is in XCS (eXtensible Constraint Specification) format
• TBX uses ISO for its XML style
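The element-to-level mapping listed on this slide can be written down as a small lookup table. The sketch below walks a TBX document and labels each mapped element with its TMF level. The mapping itself is taken from the slide; the sample document, including the root element name martif and the text/body wrappers, is an assumption for illustration.

```python
import xml.etree.ElementTree as ET

# TBX element -> TMF meta-model level, as listed on the slide.
TBX_TO_TMF = {
    "martifHeader": "GI",  # global information
    "termEntry": "TE",     # terminological entry
    "langSet": "LS",       # language section
    "tig": "TS",           # term section
    "ntig": "TS",          # term section (nested variant)
}

# Hypothetical sample; the wrapper element names are assumptions.
SAMPLE = """\
<martif>
  <martifHeader/>
  <text><body>
    <termEntry>
      <langSet><tig><term>frog legs</term></tig></langSet>
    </termEntry>
  </body></text>
</martif>
"""


def tmf_outline(tbx_xml):
    """List (element name, TMF level) pairs in document order."""
    root = ET.fromstring(tbx_xml)
    outline = [(root.tag, "TDC")]  # the file as a whole is the TDC
    for elem in root.iter():
        level = TBX_TO_TMF.get(elem.tag)
        if level is not None:
            outline.append((elem.tag, level))
    return outline


for tag, level in tmf_outline(SAMPLE):
    print(f"{level:>3}  {tag}")
```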

TMF Metamodel
Terminological Data Collection (TDC)
  Global Information (GI)
  Complementary Information (CI)
  Terminological (Concept) Entry/Entries (TE)
    Language Section(s) (LS)
      Term Section(s) (TS)
        Term Component Section(s) (TCS)

TMF and lexical resources
• In general, a terminological resource is organized into concept entries, each of which includes one or more terms designating a particular concept
• In general, a lexical resource is organized into lexical entries, each of which includes one or more senses of a particular lexical item (a word or phrase)
• A concept entry containing multiple terms can be split into multiple lexical entries, one per term, and multiple lexical entries associated with the same concept can be combined into one concept entry
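The split and combine operations described in the last bullet can be sketched with plain dictionaries. The record layout (id, terms, lemma, lang, senses) and the concept id c2 are hypothetical and chosen only for illustration; neither TBX nor TMF prescribes them.

```python
from collections import defaultdict


def split_concept_entry(concept):
    """Turn one concept entry into one lexical entry per term.

    Each lexical entry keeps the concept id as its single sense.
    """
    return [
        {"lemma": term, "lang": lang, "senses": [concept["id"]]}
        for lang, terms in concept["terms"].items()
        for term in terms
    ]


def merge_lexical_entries(lexical_entries):
    """Combine lexical entries that share a concept into concept entries."""
    concepts = defaultdict(lambda: defaultdict(list))
    for entry in lexical_entries:
        for concept_id in entry["senses"]:
            concepts[concept_id][entry["lang"]].append(entry["lemma"])
    return {cid: dict(by_lang) for cid, by_lang in concepts.items()}


# Hypothetical concept entry using the terms from the "Frog Legs" slide.
frog_legs = {
    "id": "c2",
    "terms": {"en": ["frog legs"], "fr": ["cuisses de grenouilles"]},
}

lexical = split_concept_entry(frog_legs)
print(lexical)
print(merge_lexical_entries(lexical))
```

Splitting and then merging returns the original grouping of terms, which is the sense in which the two organizations carry the same information.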

2: TBX and ISO 12620
• All data categories in the default TBX DCS are taken from ISO 12620

3: TBX and SKOS
• A typical concept entry will contain a subject field to specify the domain of the concept.
• However, the subject field is typically some kind of hierarchy that is flattened into a string within TBX.
• SKOS makes it possible to represent the subject field hierarchy as a hierarchy and then create a link within TBX.

Simple Knowledge Organization System (SKOS)
• “SKOS is an area of work developing specifications and standards to support the use of knowledge organisation systems (KOS) such as thesauri, classification schemes, subject heading lists, taxonomies, other types of controlled vocabulary, and perhaps also terminologies and glossaries, within the framework of the Semantic Web.” (Accessed on 3/17/06)

Sample SKOS
Food
Recipe Ingredient
Restaurant Menu Item
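To illustrate the difference between a flattened subject field and a hierarchy, the sketch below stores the sample concepts as broader-concept links and derives the flattened string from them on demand. It assumes that Recipe Ingredient and Restaurant Menu Item are both narrower concepts of Food; the original SKOS markup is not preserved here, so the links and the " > " separator are assumptions.

```python
# Hypothetical skos:broader links (narrower concept -> broader concept).
BROADER = {
    "Recipe Ingredient": "Food",
    "Restaurant Menu Item": "Food",
}


def flattened_subject_field(concept, broader=BROADER, sep=" > "):
    """Walk up the broader links and join the path into one string."""
    path = [concept]
    while path[-1] in broader:
        parent = broader[path[-1]]
        if parent in path:  # guard against accidental cycles
            raise ValueError(f"cycle in broader links at {parent!r}")
        path.append(parent)
    return sep.join(reversed(path))


def narrower_concepts(concept, broader=BROADER):
    """Return the direct narrower concepts, which a flat string cannot give."""
    return sorted(child for child, parent in broader.items() if parent == concept)


print(flattened_subject_field("Recipe Ingredient"))
print(narrower_concepts("Food"))
```

With the hierarchy kept separately, a TBX entry only needs to carry a link to one concept, and both the flattened form and the narrower concepts can be computed from the links.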

Visual Representation of SKOS
[Diagram labels: Food; Recipe Ingredient; Restaurant Menu Item; Appetizer; Entree; Salad; Soup; Grocery Store Item; Homemade Item]

GEvTerm Initiative
• The information previously used dealing with food has been taken from FooNaVar, a project of the GEvTerm Initiative.
• The GEvTerm Initiative is a terminological database that has committed to being fully TBX and SKOS compliant
• Another example of TBX in use is...

C: Multilingual Thesaurus for Medieval Studies (MLTMS)
• “Imagine the ability to search across web-resources using your native modern european language and find appropriate primary and secondary sources in Latin, French, Italian, German, Spanish, English, etc., based upon the meaning rather than the form of the search term. Imagine having a tool that would enable you to search for a concept and be able to construct the forms it has taken historically as well as the ability to link outward for both evidence and argument. Imagine a tool that would enable you to study the slippage of concept which is beyond naming. Imagine having a tool that can deconstruct ontological orders asking for different kinds of readings.” (Accessed on 3/17/06)

Why did MLTMS use TBX?
• integration of terminological data from multiple sources;
• querying multiple termbases through a single user interface by passing data through a common intermediate format on a batch or dynamic basis;
• placing data on an FTP site for download by interested parties;
• peer review by colleagues of tentative entries
(Accessed on 3/17/06)

MLTMS Sample
personnel
personne qui accomplit un travail copie ou d'écriture
copiste (entryTerm)
écrivain (synonym)
scribe (entryTerm)

MLTMS Sample (Rendered with XSLT)

TBX → HTML
• The last few slides have provided an example of rendering HTML from a TBX file. Here is a brief diagram of the process.
TBX → (processed by) XSLT → (results in) HTML
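The presentation's pipeline uses an XSLT stylesheet, which is not reproduced in this transcript. As a stand-in, the sketch below performs the same kind of TBX-to-HTML rendering in plain Python, since the Python standard library has no XSLT processor. It is not the authors' stylesheet. The termNote element, its type="termType" attribute, and the language code are assumptions; the terms and their types are taken from the MLTMS sample.

```python
import html
import xml.etree.ElementTree as ET

# Hypothetical markup for the MLTMS sample; only the text values are original.
MLTMS_ENTRY = """\
<termEntry>
  <langSet xml:lang="fr">
    <tig><term>copiste</term><termNote type="termType">entryTerm</termNote></tig>
    <tig><term>écrivain</term><termNote type="termType">synonym</termNote></tig>
    <tig><term>scribe</term><termNote type="termType">entryTerm</termNote></tig>
  </langSet>
</termEntry>
"""


def render_entry_html(entry_xml):
    """Render each term section as an HTML list item: 'term (term type)'."""
    entry = ET.fromstring(entry_xml)
    items = []
    for tig in entry.iter("tig"):
        term = tig.findtext("term", default="")
        kind = tig.findtext("termNote[@type='termType']", default="")
        items.append(f"<li>{html.escape(term)} ({html.escape(kind)})</li>")
    return "<ul>" + "".join(items) + "</ul>"


print(render_entry_html(MLTMS_ENTRY))
```

An XSLT stylesheet would express the same traversal declaratively, with a template matching each term section and emitting the list item.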

D: Other Standards
• ISO and XCS, which defines a flavor of TBX, both provide a list of data element types

E: Tasks for TBX
• Encourage translation technology vendors to implement TBX
• Revise the specification
• Compare ISO to XCS

For More Information
• TBX webpage: or
• Alan K. Melby, Translation Research Group, Brigham Young University, Department of Linguistics, Provo, Utah, USA