Using Semantic Mapping to Manage Heterogeneity in XLIFF Interoperability by Dave Lewis, Rob Brennan, Alan Meehan, Declan O’Sullivan CNGL Centre for Global.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Digital Repositories – Linked Open Data – the possible Role of D4Science Workshop, December 2010, FAO use cases A tool to create Linked Data providers.
UKOLN, University of Bath
From content standards to RDF Gordon Dunsire Presented at AKM 15, Porec, 2011.
The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in.
XML Technology in E-Commerce
Semantic Web Introduction
Data Intensive Techniques to Boost the Real-time Performance of Global Agricultural Data Infrastructures SEMAGROW U SING A POWDER T RIPLE S TORE FOR BOOSTING.
CSCI 572 Project Presentation Mohsen Taheriyan Semantic Search on FOAF profiles.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
LINKED DATA COMS E6125 Prof. Gail Kaiser Presented By : Mandar Mohe ( msm2181 )
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
RDF: Building Block for the Semantic Web Jim Ellenberger UCCS CS5260 Spring 2011.
Module 2b: Modeling Information Objects and Relationships IMT530: Organization of Information Resources Winter, 2007 Michael Crandall.
Cloud based linked data platform for Structural Engineering Experiment Xiaohui Zhang
UKOLUG - July Metadata for the Web RDF and the Dublin Core Andy Powell UKOLN, University of Bath UKOLN.
What Can Do for You! Fabian Christ
Information Integration Intelligence with TopBraid Suite SemTech, San Jose, Holger Knublauch
Rutherford Appleton Laboratory SKOS Ecoterm 2006 Alistair Miles CCLRC Rutherford Appleton Laboratory Semantic Web Best Practices and Deployment.
Entity Recognition via Querying DBpedia ElShaimaa Ali.
Logics for Data and Knowledge Representation
The MMI Tools Carlos Rueda Monterey Bay Aquarium Research Institute OOS Semantic Interoperability Workshop Marine Metadata Interoperability Project Boulder,
The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in.
Linked data the next network?. The Web of documents is for people The Web of data is for computers The Web of documents is difficult for computers to.
© Copyright 2008 STI INNSBRUCK NLP Interchange Format José M. García.
 Yingjie Hu, PhD student  Space and Time Knowledge Organization Lab  Department of Geography, UCSB  Summer intern, APL  Sathya Prasad  Lead and.
MultilingualWeb – Language Technology A New W3C Working Group Felix Sasaki, David Filip, David Lewis.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Resource Description Framework (RDF) Course: Electronic Document Team member: Ding Feng Ding Wei Wang Ling Date:
Semantic Web Programming in Python an Introduction Biju B Jaganath G.
A Systemic Approach for Effective Semantic Access to Cultural Content Ilianna Kollia, Vassilis Tzouvaras, Nasos Drosopoulos and George Stamou Presenter:
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
© Copyright 2013 STI INNSBRUCK “How to put an annotation in HTML?” Ioannis Stavrakantonakis.
Linked Data: Emblematic applications on Legacy Data in Libraries.
Semantic Enhancement: Key to Massive and Heterogeneous Data Pools Violeta Damjanovic, Thomas Kurz, Rupert Westenthaler, Wernher Behrendt, Andreas Gruber,
FEISGILTT Dublin 2014 Yves Savourel ENLASO Corporation QuEst Integration in Okapi This presentation was made possible by This project is sponsored by the.
SPINNING THE SEMANTIC WEB APPLICATIONS FOR THE MODERN ERA LIBRARIES
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Machine Translate Post Edit Quality Check Extract Content I18N Text Analysis Curate Corpora Workflow Analysis Segment Identify Terms Translate Provenance.
THE BIBFRAME EDITOR AND THE LC PILOT Module 3 – Unit 1 The Semantic Web and Linked Data : a Recap of the Key Concepts Library of Congress BIBFRAME Pilot.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
KAnOE: Research Centre for Knowledge Analytics and Ontological Engineering Managing Semantic Data NACLIN-2014, 10 Dec 2014 Dr. Kavi Mahesh Dean of Research,
SICoP Presentation A story about communication Michael Lang BEARevelytix April 25, 2007.
Paloma Marín Arraiza 17 th International Conference on Grey Literature 1 st and 2 nd December 2015, Amsterdam (Netherlands) SCIENTIFIC AUDIOVISUAL MATERIALS.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Semantic Search - Potential and Opportunities. © 2014 SAPIENT CORPORATION | CONFIDENTIAL 2 Search – Where we were!
© Copyright 2015 STI INNSBRUCK PlanetData D2.7 Recommendations for contextual data publishing Ioan Toma.
Chapter 04 Semantic Web Application Architecture 23 November 2015 A Team 오혜성, 조형헌, 권윤, 신동준, 이인용.
Linked Open Data for European Earth Observation Products Carlo Matteo Scalzo CTO, Epistematica epistematica.
RDA and Linked Data Gordon Dunsire Presented at Cita BNE - RDA and Linked Data, 15 April 2016, Madrid, Spain.
Linked Open Data Dataset from Related Documents Petya Osenova and Kiril Simov IICT-BAS LDL-2016, LREC, Portoroz.
RDA and Linked Data Gordon Dunsire Presented at Selmathon 1, 9 May 2016, Stockholm, Sweden.
Dmitry Mouromtsev, Aleksei Romanov, Dmitry Volchek and Fedor Kozlov Laboratory ITMO University, St. Petersburg, Russia “Metadata Extraction from.
RDA and linked data Gordon Dunsire Presented to Code4Lib Ottawa, MacOdrum Library, Carleton University, Ottawa, 27 April 2016.
Session: Towards systematically curating and integrating
^ Reviewer’s Workbench
Cloud based linked data platform for Structural Engineering Experiment
Semantic testing in oneM2M
Middleware independent Information Service
Dave Lewis W3C MultilingualWeb - Language Technology Working Group
Building the Localization Web
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
Grid Computing 7700 Fall 2005 Lecture 18: Semantic Grid
ITS Workbench Two Problems, One Open Standards Based Solution
LOD reference architecture
RDA cataloguing and linked data
Linked Data Reuse in the Language Services Industry
Linked Data 101 Things, URIs, RDF, Triples, Turtle, Ontologies, Vocabularies and SPARQL Linked Data is our Implementation choice for FAIR.
Presentation transcript:

Using Semantic Mapping to Manage Heterogeneity in XLIFF Interoperability by Dave Lewis, Rob Brennan, Alan Meehan, Declan O’Sullivan CNGL Centre for Global Intelligent Content at Trinity College Dublin

Outline Localization industry – interoperability issues Linked Data representation of localization content Still has interoperability issues Language Technology retraining workflow - use case Our mapping representation Evaluation Conclusions

Localization Industry Document Store Extract & Segment Named Entity Recognition Identify terms and translation Prioritise PE based on QE Post edit Machine Translate HTML source Annotated XLIFF source Src XLIFF + glossary Src/Tgt XLIFF Prioritised XLIFF PE‘d XLIFF XLIFF source Translation Workflow

Linked Data Representation – L3 Data Document Store Triple Store Extract & Segment Named Entity Recognition Identify terms and translation Prioritise PE based on QE Post edit Machine Translate HTML source Annotated XLIFF source Src XLIFF + glossary Src/Tgt XLIFF Prioritised XLIFF PE‘d XLIFF L3 data XLIFF source Translation Workflow XSLT Mapper

LT Retraining Workflow Document Store Triple Store Extract & Segment Named Entity Recognition Identify terms and translation Prioritise PE based on QE Post edit Machine Translate HTML source Annotated XLIFF source Src XLIFF + glossary Src/Tgt XLIFF Prioritised XLIFF PE‘d XLIFF L3 data (GLOBIC) New training data Train & deploy MT Tool (GLOBIC unaware) Analyse and select Retrain? XLIFF source Retraining Workflow Translation Workflow Mapping (GLOBIC to ITS) L3 data (ITS) XSLT Mapper

Architecture Diagram of the Process Triple Store Application SPARQL processor SPIN API 1.Application search for resources in the Triple Store 2.None in application’s vocabulary, search for mappings 3.If mappings exist, then retrieve the SPIN representation 4.Convert the SPIN representation to SPARQL syntax via a call to the SPIN API 5.Execute the SPARQL query via the SPARQL processor 6.Consume the newly created data

Mapping Requirements 1.A mapping entity must be expressed as RDF, with a unique URI, allowing it to be published as Linked Data 2.The executable statement must be a SPARQL query 3.The executable statement must be expressed as RDF and linked to a mapping entity 4.A mapping entity is to be modeled with associated meta-data

Meta-data and SPIN Meta-data properties from the GLOBIC and W3C PROV vocabularies: gic:wasCreatedBy, gic:mapDescription, prov:generatedAtTime, prov:wasRevisionOf SPIN vocabulary to express SPARQL queries as RDF: SELECT ?subject ?predicate ?object WHERE { ?subject ?predicate ?object } [] a sp:Select ; sp:templates ([ sp:object _:b1 ; sp:predicate _:b2 ; sp:subject _:b3 ]); sp:where ([ sp:object _:b1 ; sp:predicate _:b2 ; sp:subject _:b3 ]). _:b3 sp:varName “subject”^^xsd:string. _:b2 sp:varName “predicate"^^xsd:string. _:b1 sp:varName “object"^^xsd:string. SPARQL Query SPIN Representation

Mapping Representation Example ex:globic_to_its_mtScore_map_1_1 a gic:Mapping ; gic:hasRepresentation ex:globic_to_its_mtScore_sp_2 ; gic:wasCreatedBy ex:person_1 ; prov:generatedAtTime “ ”^^xsd:date ; gic:mapDescription “Used to map MT confidence data from GLOBIC to ITS vobabulary” ; gic:version “1.1”^^xsd:float ; prov:wasRevisionOf ex:globic_to_its_mtScore_map_1. ex:globic_to_its_mtScore_sp_2 a sp:Construct ; sp:templates ([ sp:object _:b1 ; sp:predicate itsrdf:mtConfidence ; sp:subject _:b2 ]) ; sp:where ([ sp:object _:b1 ; sp:predicate gic:qualityAssessment ; sp:subject _:b2 ]). _:b2 sp:varName "s"^^xsd:string. _:b1 sp:varName "val"^^xsd:string. Mapping Entity + Meta-dataSPIN Representation of SPARQL Query

Evaluation Two initial experiments: 1.Test the mapping capabilities of SPARQL construct queries R2R Framework – 70* test mappings Reproduced R2R Evaluation R2R test mappings as SPARQL construct queries Compared results – SPARQL construct queries as expressive as R2R Framework 2.Test the expressiveness of SPIN vocabulary with regard to expressing SPARQL construct queries as RDF Carried out using online SPIN RDF Converter and TopBraid composer Input the SPARQL construct queries from first evaluation SPIN could represent all queries in RDF Suitable vocabulary to use

Conclusions Mapping representation to increase interoperability within heterogeneous workflows All aspects of mapping representation published as Linked Data Discovery of the mappings through SPARQL queries - ultimately executed through SPARQL processor Evaluation – Capabilities of SPARQL construct queries and expressiveness of SPIN Not just relevant to localization workflows, useful in other Linked Data scenarios

Thank You Questions?