Www.sti-innsbruck.at © Copyright 2013 STI INNSBRUCK www.sti-innsbruck.at “How to put an annotation in HTML?” Ioannis Stavrakantonakis.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

BAH DAML Tools XML To DAML Query Relevance Assessor DAML XSLT Adapter.
Semantically Grounded Briefings Bob Balzer, Neil Goldman, Marcelo Tallis Teknowledge
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
1 UIM with DAML-S Service Description Team Members: Jean-Yves Ouellet Kevin Lam Yun Xu.
The Semantic Web. The Web Today Designed for Human to read Cannot express meaning Architecture: URL –Decentralized: Link structure Language: html.
© Copyright 2012 STI INNSBRUCK Apache Stanbol.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
RDF: Building Block for the Semantic Web Jim Ellenberger UCCS CS5260 Spring 2011.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
Overview of Search Engines
(C) 2013 Logrus International Practical Visualization of ITS 2.0 Categories for Real World Localization Process Part of the Multilingual Web-LT Program.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
CIMI / FHIR and Shape Expressions. Local DB … …
What Can Do for You! Fabian Christ
Sheet 1XML Technology in E-Commerce 2001Lecture 6 XML Technology in E-Commerce Lecture 6 XPointer, XSLT.
Semantic Web outlook and trends May The Past 24 Odd Years 1984 Lenat’s Cyc vision 1989 TBL’s Web vision 1991 DARPA Knowledge Sharing Effort 1996.
Multilingual Issues in the Representation of International Bibliographic Standards for the Semantic Web Gordon Dunsire Independent Consultant; Chair of.
9 th Open Forum on Metadata Registries Harmonization of Terminology, Ontology and Metadata 20th – 22nd March, 2006, Kobe Japan. XMDR Prototype Day: 21.
Practical RDF Chapter 1. RDF: An Introduction
The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
XML BIS4430 – unit 10. XML Origins Extensible Markup Language (XML) 1998 Inspired by Standard Generalized Markup Language (SGML) and HTML. SGML defines.
Spoken dialog for e-learning supported by domain ontologies Dario Bianchi, Monica Mordonini and Agostino Poggi Dipartimento di Ingegneria dell’Informazione.
The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in.
© Copyright 2008 STI INNSBRUCK NLP Interchange Format José M. García.
Scalable Metadata Definition Frameworks Raymond Plante NCSA/NVO Toward an International Virtual Observatory How do we encourage a smooth evolution of metadata.
Ontology-Driven Automatic Entity Disambiguation in Unstructured Text Jed Hassell.
MultilingualWeb – Language Technology A New W3C Working Group Felix Sasaki, David Filip, David Lewis.
UKOLN is supported by: Approaches to Metadata Quality Marieke Guy QA Focus A centre of expertise in digital information management
Ontologies and Lexical Semantic Networks, Their Editing and Browsing Pavel Smrž and Martin Povolný Faculty of Informatics,
(C) 2014 Logrus International Visualizing ITS 2.0 Categories for the localization process.
Jan 9, 2004 Symposium on Best Practice LSA, Boston, MA 1 Comparability of language data and analysis Using an ontology for linguistics Scott Farrar, U.
The MultilingualWeb-LT Working Group receives funding by the European Commission (project name LT-Web) through the Seventh Framework Programme (FP7) in.
XML Extras Outline 1 - XML in 10 Points 2 - XML Family of Technologies 3 - XML is Modular 4 - RDF and Semantic Web 5- XML Example: UK GovTalk Group’s Schema.
Using Semantic Mapping to Manage Heterogeneity in XLIFF Interoperability by Dave Lewis, Rob Brennan, Alan Meehan, Declan O’Sullivan CNGL Centre for Global.
Xml:tm XML Text Memory Using XML technology to reduce the cost of translating XML documents.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Working with Ontologies Introduction to DOGMA and related research.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Metadata : an overview XML and Educational Metadata, SBU, London, 10 July 2001 Pete Johnston UKOLN, University of Bath Bath, BA2 7AY UKOLN is supported.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Internationalization Tag Set (ITS) Version 1.0 The Internationalization Tag Set (ITS) is a set of XML elements and attributes that supports the internationalization.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
© Copyright 2015 STI INNSBRUCK PlanetData D2.7 Recommendations for contextual data publishing Ioan Toma.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Subject: Internationalization of AJAX applications using ITS and XML, Best practices and application. Doctoral Program in Technology and Software Engenieering.
26/02/ WSMO – UDDI Semantics Review Taxonomies and Value Sets Discussion Paper Max Voskob – February 2004 UDDI Spec TC V4 Requirements.
1 Introduction to XML Babak Esfandiari. 2 What is XML? introduced by W3C in 98 Stands for eXtensible Markup Language it is more general than HTML, but.
Kynn Bartlett 11 April 2001 STC San Diego The HTML Writers Guild Copyright © 2001 XML, XHTML, XSLT, and other X-named specifications.
A report by Olaf-Michael Stefanov to the JIAMCATT community
Dr. Barry Norton, Development Manager, ResearchSpace*
Linked Data Web that can be processed by machines
Statistical Machine Translation
Semantic Web Technologies
Dave Lewis W3C MultilingualWeb - Language Technology Working Group
Text Analytics in ITS 2.0: Annotation of Named Entities
Part of the Multilingual Web-LT Program
PREMIS Tools and Services
How to publish in a format that enhances literature-based discovery?
Part of the Multilingual Web-LT Program
CSE591: Data Mining by H. Liu
Linked Data Reuse in the Language Services Industry
Text Analytics in ITS 2.0: Annotation of Named Entities
Presentation transcript:

© Copyright 2013 STI INNSBRUCK “How to put an annotation in HTML?” Ioannis Stavrakantonakis

Outline 2 Research question ITS 2.0 NIF What about Microdata? Demo References

Research question 3 We want to annotate Springfield with an URI to make sure that the computer understands we mean the Springfield in Massachusetts. HTML: It is well known, that Springfield has mild summers and short, but hard winters. HTML with annotation (something like that): It is well known, that Springfield has mild summers and short, but hard winters. We don't want to add whole triples, but just annotate the HTML and say "this element refers to the following URI". From: Denny Vrandečić Sent: Wednesday, April 24, :59 PM To: semantic-web at W3C Subject: How to put an annotation in HTML?

ITS International Tag Set (ITS) [2] –enhances the foundation to integrate automated processing of human language into core Web technologies; –focuses on HTML, XML-based formats in general, and can leverage processing based on the XML Localization Interchange File Format (XLIFF), as well as the Natural Language Processing Interchange Format (NIF); –is a technology to add metadata to Web content, for the benefit of localization, language technologies, and internationalization (see more in [5] regarding localization (l10n) and internationalization (i18n))

ITS Potential Users of ITS [2]: –Schema developers starting a schema from the ground up (proposals for attribute and element names to be included in their new schema) –Schema developers working with an existing schema (should check whether their schemas support the markup proposed in this specification, and, where appropriate, add the markup proposed here to their schema) –Vendors of content-related tools (e.g. tools for authoring, translation, etc.) –Content producers (may be used by them to mark up specific bits of content) –Machine Translation Systems –Text Analytics (automatically generated metadata for improving localization, data integration or knowledge management workflows) –Localization Workflow Managers

ITS The Text Analysis use case: This data category is used to annotate content with lexical or conceptual information for the purpose of contextual disambiguation. 3 pieces of annotation: –Confidence: The confidence of the agent (that produced the annotation) in its own computation – XSD double data type (e.g. 0.63) –Entity type: The type of entity, or concept class of the text analysis target – IRI (e.g. [8]) –Entity identifier: A unique identifier for the text analysis target – IRI or String (e.g. or the identifier for “Capital” from Wordnet [9])

ITS Rendered HTML: HTML with ITS metadata: Welcome to Innsbruck in Austria !

ITS Conversion to NIF [2]: –Convert XML or HTML documents that contain ITS metadata to the RDF-based format based on NIF. The conversion results in RDF. –The conversion algorithm to generate NIF consists of seven steps. The output of the algorithm uses the ITS RDF ontology [7]. –The conversion to NIF is a possible basis for a natural language processing (NLP) application that creates, for example, named entity annotations. –To integrate the RDF annotations into the original input document is given in [6] (NIF2ITS).

NLP Interchange Format (NIF) 9 NIF is an RDF/OWL-based format that aims to achieve interoperability between Natural Language Processing (NLP) tools, language resources and annotations. NIF will soon be a normative part of the ITS 2.0 NIF and its community project NLP2RDF serve as an umbrella project liaising with other community of practices, especially: –LOD2 FP7 EU projectLOD2 FP7 EU project –MultilingualWeb-LT Working GroupMultilingualWeb-LT Working Group –Best Practices for Multilingual Linked Open Data Community GroupBest Practices for Multilingual Linked Open Data Community Group –Ontology-Lexica Community GroupOntology-Lexica Community Group –Named Entity Recognition and Disambiguation (NERD)Named Entity Recognition and Disambiguation (NERD) –Ontologies of Linguistic Annotation (OLiA)Ontologies of Linguistic Annotation (OLiA) University of Leipzig

How is it different to Microdata annotations? 10 What is the latitude and longitude of the Empire State Building ? Empire State Building What is the latitude and longitude of the Empire State Building ? Microdata + schema.org ITS2.0 + dbpedia resource

How is it different to Microdata annotations? 11 What is the latitude and longitude of the Empire State Building ? Semantics of ITS2.0 annotations: Specify entity identifiers (IRIs) for the presented information item. Semantics of Microdata annotations: Specify the type of information that is presented. Microdata ITS2.0

Hands-on / Demo 12 HTML with ITS metadata Transformation of HTML with ITS metadata to NIF Notes: Based on the XSLT files shared by the W3C Working Group member Felix Sasaki [4] The Java internal XSLTC processor fails to compile the XSLTs. Use Saxon 9 HE.

References [1] W3C semantic web list thread: web/2013Apr/0218.html web/2013Apr/0218.html [2] ITS 2.0 W3C working draft: [3] NIF Core Ontology: [4] Felix Sasaki ITS 2.0 extractor (github): [5] W3C, Localization vs. Internationalization: [6] W3C, Conversion NIF2ITS: [7] W3C, ITS 2.0 / RDF Ontology: [8] Named Entity Recognition and Disambiguation (NERD): [9] WordNet Search 3.1: 13