University of Rome Tor Vergata VOCBENCH 2.0 A Collaborative Environment for SKOS/SKOS-XL Management: scalability and (inter)operatibility challenges Armando.

Slides:



Advertisements
Similar presentations
Conversion owl to SKOS and Further development of WB V.2 Prepared by - Ahsan, Armando, Sachit and Prashanta.
Advertisements

OMV Ontology Metadata Vocabulary April 10, 2008 Peter Haase.
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Digital Repositories – Linked Open Data – the possible Role of D4Science Workshop, December 2010, FAO use cases A tool to create Linked Data providers.
Y. Jaques Yves Jaques ICIS Requirements Gathering, June 2008, Rome NeOn Lifecycle Support for Networked Ontologies.
1/ 26 AGROVOC and the OWL Web Ontology Language: the Agriculture Ontology Service - Concept Server OWL model NKOS workshop Alicante,
From Ontology Design to Deployment Semantic Application Development with TopBraid Holger Knublauch
Near East Plant Protection Network for Regional Cooperation & Knowledge Sharing Food and Agriculture Organization of the United Nations An Overview on.
KEOD 2013 – 20 th September 2013 A Comprehensive Framework for Semantic Annotation of Web Content Manuel Fiorelli 1, Maria Teresa Pazienza 2, Armando Stellato.
Semantic Web Thanks to folks at LAIT lab Sources include :
SEVENPRO – STREP KEG seminar, Prague, 8/November/2007 © SEVENPRO Consortium SEVENPRO – Semantic Virtual Engineering Environment for Product.
VocBench 1. Is a web-based working environment for managing the AGROVOC Concept Server Facilitate the collaborative editing of multilingual terminology.
Semantic Web Tools Vagan Terziyan Department of Mathematical Information Technology, University of Jyvaskyla ;
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
SKOS and Other W3C Vocabulary Related Activities Gail Hodge Information International Assoc. NKOS Workshop Denver, CO June 10, 2005.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
AgriDrupal - a “suite of solutions” for agricultural information management and dissemination, built on the Drupal CMS; - the community of practice around.
1 Semantic Data Management Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies.
Audumbar Chormale Advisor: Dr. Anupam Joshi M.S. Thesis Defense
Agricultural Biotechnology Network for Regional Collaboration and Knowledge Sharing Food and Agriculture Organization of the United Nations An Overview.
Triple Stores.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 18 Slide 1 Software Reuse 2.
Danielle Baldwin, ITS Web Services CMS Administrator Application Overview and Joomla 1.5 RC 1 Highlights.
Networking Session: Global Information Structures for Science & Cultural Heritage - The Interoperability Challenge «INTEROPERABILITY FROM THE CULTURAL.
1/ 27 The Agriculture Ontology Service Initiative APAN Conference 20 July 2006 Singapore.
Information Integration Intelligence with TopBraid Suite SemTech, San Jose, Holger Knublauch
4th project meeting 27-29/05/2013, Budapest, Hungary FP 7-INFRASTRUCTURES programme agINFRA agINFRA A data infrastructure for agriculture.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness and Peter Fox CSCI Week 9, October 27, 2008.
A J Miles Rutherford Appleton Laboratory SKOS Standards and Best Practises for USING Knowledge Organisation Systems ON THE Semantic Web NKOS workshop ECDL.
Digital Enterprise Research Institute HADA – An Access Controlled Application for Publishing and Discovering Linked Government Data Owen Sacco.
1 Foundations V: Infrastructure and Architecture, Middleware Deborah McGuinness TA Weijing Chen Semantic eScience Week 10, November 7, 2011.
FI-CORE Data Context Media Management Chapter Release 4.1 & Sprint Review.
The Agricultural Ontology Service (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Library and Documentation Systems.
© DATAMAT S.p.A. – Giuseppe Avellino, Stefano Beco, Barbara Cantalupo, Andrea Cavallini A Semantic Workflow Authoring Tool for Programming Grids.
Future Learning Landscapes Yvan Peter – Université Lille 1 Serge Garlatti – Telecom Bretagne.
1 Geospatial and Business Intelligence Jean-Sébastien Turcotte Executive VP San Francisco - April 2007 Streamlining web mapping applications.
1 © 1999 Microsoft Corp.. Microsoft Repository Phil Bernstein Microsoft Corp.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Technical Update 2008 Sandy Payette, Executive Director Eddie Shin, Senior Developer April 3, 2008 Open Repositories 2008, Fedora User Group.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
A Collaborative Framework for Managing and Publishing KOS Armando Stellato, Ahsan Morshed, Gudrun Johannsen, Yves Jaques, Caterina Caracciolo, Sachit Rajbhandari,
-KHUSHBOO BAGHADIYA.  Introduction  System Description  iCAT in use  Evolution of the system  Evolution of modeling  Evolution of features  Evolution.
GEMET GEneral Multilingual Environmental Thesaurus leading the way to federated terminologies Stefan Jensen, Head of information services group with input.
Working with Ontologies Introduction to DOGMA and related research.
Architecture for an Ontology and Web Service Modelling Studio Michael Felderer & Holger Lausen DERI Innsbruck Frankfurt,
Triple Stores. What is a triple store? A specialized database for RDF triples Can ingest RDF in a variety of formats Supports a query language – SPARQL.
DSpace System Architecture 11 July 2002 DSpace System Architecture.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Jens Hartmann York Sure Raphael Volz Rudi Studer The OntoWeb Portal.
UNEP Terminology Workshop - Geneva, April 15, Environmental Terminology & Thesaurus Workshop UN Environment Programme Regional Office of Europe.
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
Supporting Collaborative Ontology Development in Protégé International Semantic Web Conference 2008 Tania Tudorache, Natalya F. Noy, Mark A. Musen Stanford.
STAR, STELLAR and SKOS Ceri Binding, Phil Carlisle, Keith May, Doug Tudhope, Andreas Vlachidis University of Glamorgan and English Heritage.
GACS: Towards a common concept scheme for information in agriculture International Conference on Big Data and Knowledge Discovery Bangalore, March 9-11,
The AstroGrid-D Information Service Stellaris A central grid component to store, manage and transform metadata - and connect to the VO!
The Palantir Platform… …Changes in 2.3
The Holmes Platform and Applications
The GACS Project Semantics for Open Data
Triple Stores.
Phil Bernstein Microsoft Corp.
NKOS workshop Alicante, 2006
Triple Stores.
Lecture 1: Multi-tier Architecture Overview
LOD reference architecture
BUILDING A DIGITAL REPOSITORY FOR LEARNING RESOURCES
Triple Stores.
Taxonomy of public services
Taxonomy of public services
SDMX IT Tools SDMX Registry
Presentation transcript:

University of Rome Tor Vergata VOCBENCH 2.0 A Collaborative Environment for SKOS/SKOS-XL Management: scalability and (inter)operatibility challenges Armando Stellato ART Group, Dept of Enterprise Engineering, University of Rome Tor Vergata, Via del Politecnico 1, Rome, Italy Food and Agricultural Organization of the United Nations (FAO), Viale delle Terme di Caracalla, Rome, Italy LDBC2014 Fourth TUC Meeting Amsterdam, 3 rd Apr 2014

University of Rome Tor Vergata Why was it built? AGROVOC (big agriculture vocabulary developed by FAO) –In 2004: > concepts in up to 22 languages –A global group of terminologists. –No existing standard for thesauri –No existing tool that met FAO’s needs 03/04/2014 LDBC2014, Amsterdam, April

University of Rome Tor Vergata V1 – Google Web Toolkit Lucene Protégé API OWLART API MySQL Custom OWL model for modeling thesauri Business logic MySQL Protégé 3.4 OWLART API GWT / Presentation 03/04/2014 LDBC2014, Amsterdam, April

University of Rome Tor Vergata V1 Problems Couldn’t support other triple stores (mostly glued to Protégé API) Custom OWL model for thesauri modeling No support for emerging standards, e.g. SKOS No import Complicated export (pretty tight to Agrovoc Model) No support for alignments –AGROVOC aligned to a dozen other vocabularies No SPARQL support 03/04/2014 LDBC2014, Amsterdam, April

University of Rome Tor Vergata 03/04/2014 LDBC2014, Amsterdam, April Objectives for VB2.0… A completely rebuilt backing framework for the service and data layers, based on an already existing open source project: Semantic Turkey 1 –Based on OSGi Open Services Gateway –Open Connectibility to most notable RDF middleware and triple storing technologies (Sesame2, Jena, Allegrograph…) –Native support for SKOS and SKOSXL over RDF (no more conversions from internal legacy models), other than OWL VB1.0 User Interface remains mostly unchanged in the first release of VB

University of Rome Tor Vergata Three layered extensible architecture Presentation Layer –GWT (Google Web Toolkit) Vocbench User Interface ( Mozilla apps in the original framework ) Services Layer –Enables communication between the client (Vocbench UI) and the ontology persistence layer. –HTTP based Services accessed through the Ajax paradigm –OSGi Extensible Servicing System Persistence Layer –Access to ontological knowledge. –Based on dedicated ontology API, which can be implemented through use of different technologies. Vocbench 2.0 (and ST) Architecture 6

University of Rome Tor Vergata …and here we are!! 03/04/2014 LDBC2014, Amsterdam, April

University of Rome Tor Vergata VB “desktop version”: Semantic Turkey for Firefox

University of Rome Tor Vergata Why should I buy it? Collaborative Management –Validation&Publication Workflow (propose, validate, publish, revise, deprecate…) –Fine grained user management both users and functionalities may be associated in groups Functionalities (or groups of) may be assigned to different users (or groups of) –Full editing history (not only concepts, but most of the actions can be subject to validation too) –RSS Feeds –Fine-grained metadata and editorial notes: SKOS-XL and reified definitions allow for timestamped status and rich editorial notes Multilinguality –Strong support for multi-lingual thesauri management –Application itself is also multilingual (currently support for english, dutch, spanish, more languages coming) Native RDF support –Support for different triple stores –Possibilty to SPARQL query/update through a dedicated interface with syntax completion/highlight –SKOS-XL management If preferred, SKOS-core export through available conversion tools Large scale thesauri management –Scalability limited only by the underlying triple store Extensibility –OSGi connectable services And, last but not the least: Free and Open Source! ( 03/04/2014 LDBC2014, Amsterdam, April

University of Rome Tor Vergata V2.0 Partners Here is a list of partners adopting, or concretely considering the adoption of, VB2.0: FAO (Agrovoc, Biotech, Land and Water, FAO Topics, etc..) EU Documentation Office (EUROVOC) Italian Senate (Teseo) European Environment Agency (GEMET) Harvard (UAT: Unified Astronomy Thesaurus) EC Parliament Library INRA (Infrastructure nationale AnaEE France, in the context of AnaEE project)Infrastructure nationale AnaEE FranceAnaEE CABI UNCCD: United Nations Convention to Combat Desertification (…) Scottish Government (gov metadata) …and others more 03/04/2014 LDBC2014, Amsterdam, April

University of Rome Tor Vergata VocBench Evolution This is a non-exhaustive list of features added along the various versions. Only the major news are reported here VB2.1 (coming in these days) A completely rebuilt installation mechanism for an headache-free installation experience! –Self-installing DB, with auto-updating scripts –Wizard-driven system configuration, with import/export of configuration profiles SPARQL module: query/update content directly through the SPARQL query language for RDF; syntax completion & highlight Multi scheme management: now concepts can be shared among different schemes RSS feeds for all editing actions VB2.0 A Completely re-engineered RDF backend, based on RDF Management platform Semantic Turkey –Support for different triple stores –Extension mechanism based on OSGi Multi scheme management. Several skos:ConceptSchemes can be developed for the same dataset, providing different views on the data Statistics module: a module providing resuming information about the loaded data. Export module: for exporting all or part of the content of a project according to several existing RDF serialization standards Load data module: for loading bulk data serialized in some RDF serialization standard Ontology Import Management (Administration-->Ontologies): to owl:import ontologies to be used as property vocabularies for the modeled thesauri New tabs under the concept view for covering extensively the SKOSXL standard (note, notations) 03/04/2014 LDBC2014, Amsterdam, April

University of Rome Tor Vergata INTEROPERABILITY ISSUES Obstacles towards interoperable components… 03/04/2014 LDBC2014, Amsterdam, April

University of Rome Tor Vergata Dataset Management Operations which are strongly bound to the specific technology being adopted Dataset creation/deletion Indexing 03/04/2014 LDBC2014, Amsterdam, April

University of Rome Tor Vergata SKOS Management SKOS / SKOS-XL Management. –Currently VB works with the more expressive SKOS-XL and exports data to SKOS –Question for the future: native SKOS core support? –For sure, in VB2.2 a lot of data lifting/fixing utilities Multi scheme management –SKOS is maybe «too» permissive on the way information may be organized, this results in non-clear semantics –Not easy to deal with multiple schemes, compromise between powerful operations and simple management 03/04/2014 LDBC2014, Amsterdam, April

University of Rome Tor Vergata Scalability and Extensibility 1 Extensions –We are rebuilding the extension mechanism of Semantic Turkey, the RDF services engine behind VB –Dependency injection and Aspect programming  agile direct-to-businesslogic development of new functionalities and plugins Multi-project Management –Traditional plugin-based approach with local models/repositories is good for toy datasets or ontologies. –In a multi-user environment, working on large repositories, we cannot allow users to freely load huge amount of data –VB2.(>2): projects and their data can be accessed through other projects Read/write access and locks can be specified for each project Definition of an ACL (access control list) 03/04/2014 LDBC2014, Amsterdam, April These and other related improvements, in the context of developing «Ontology Alignment over bigdata» extensions, are funded by the SemaGrow project, Seventh Framework Programme (FP7) of the European Commision (FP7-ICT a Intelligent Information Management) under Grant Agreement No

University of Rome Tor VergataReasoning «Inferred triples» are not something easily manageable. No standard for storing/accessing them Each triple store adopts its own way: –an «inferred triples graph (which graph? No standard!) –default graph with «inferred» switch (such as in sesame), but how to get only them? In sesame, even with an empty null context, they ended up being mixed with other triples, such as the sum of all named graphs 03/04/2014 LDBC2014, Amsterdam, April

University of Rome Tor Vergata The hatred "default graph mantra" Accessing the default graph –The SPARQL UPDATE specs ( tell that: «a Graph Store contains one (unnamed) slot holding a default graph and zero or more named slots holding named graphs. Operations MAY specify graphs to be modified, or they MAY rely on a default graph for that operation. Unless overridden (for instance, by the SPARQL protocol), the unnamed graph for the store will be the default graph for any operations on that store. Depending on implementation, the unnamed graph MAY refer to a separate graph, a graph describing the named graphs, a representation of a union of other graphs, etc” –So, the default graph is not standardized. If we interpret it as a separate graph (e.g. Sesame2 null context)… …there is no way, in a SPARQL QUERY, to access it! Very long story: – – – – – – And probably many more… Default graph mantra: never use the default graph! (other versions: never use the default graph for anything else than metadata) –So…it’s like…my finger hurts…cut it! This really hinders interoperability 03/04/2014 LDBC2014, Amsterdam, April

University of Rome Tor Vergata Shareability of non-open source software What we would expect from triple store producers –Sometimes their products require dedicated/customized clients (mostly compliant with notable APIs, though with some underlying differences, or offering more functionalities) –Sometimes they have light versions of their triple stores completely bundled into a compact solution –Surely understandable not to have them open-source … –…please make them maximally: Accessible (Maven?) Redistributeable with tools 03/04/2014 LDBC2014, Amsterdam, April

University of Rome Tor Vergata References & Links Vocbench home: Fao VB user community: user and developer groups: Publications: Armando Stellato, Ahsan Morshed, Gudrun Johannsen, Yves Jaques, Caterina Caracciolo, Sachit Rajbhandari, Imma Subirats and Johannes Keizer A Collaborative Framework for Managing and Publishing KOS, The 10th European Networked Knowledge Organisation Systems (NKOS) Workshop, Berlin, Germany, September, 2011 Semantic Turkey home: user and developer groups: Publications: Maria Teresa Pazienza, Noemi Scarpato, Armando Stellato and Andrea Turbati Semantic Turkey: A Browser-Integrated Environment for Knowledge Acquisition and Management, Semantic Web Journal, 3, 2, 2012 Manuel Fiorelli, Maria Teresa Pazienza and Armando Stellato Semantic Turkey goes SKOS: Managing Knowledge Organization Systems, ISemantics 2012 Graz, Austria, 5-7 September, /04/2014 LDBC2014, Amsterdam, April

University of Rome Tor Vergata 03/04/2014 LDBC2014, Amsterdam, April