Integrating Data for Archaeology

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

1 Ontolog OOR Use Case Review Todd Schneider 1 April 2010 (v 1.2)
Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
Status on the Mapping of Metadata Standards
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
Alexandria Digital Library Project Integration of Knowledge Organization Systems into Digital Library Architectures Linda Hill, Olha Buchel, Greg Janée.
An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
Connect. Communicate. Collaborate Click to edit Master title style MODULE 1: perfSONAR TECHNICAL OVERVIEW.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Advanced Topics COMP163: Database Management Systems University of the Pacific December 9, 2008.
Joint Information Systems Committee Supporting Higher and Further Education Development of an Information Environment for UK Learning and Teaching NOF-Digitise.
Demonstration of repositories Fedora (Flexible Extensible Digital Object Repository Architecture) Marie Lagerwall MIDESS Partners Meeting February 9, 2007.
Cloud based linked data platform for Structural Engineering Experiment Xiaohui Zhang
1 Semantic Data Management Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
ISO/TC211 Geographic Information/Geomatics Implementing ISO Metadata David Danko Work Item 15—Project Leader
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
4th project meeting 27-29/05/2013, Budapest, Hungary FP 7-INFRASTRUCTURES programme agINFRA agINFRA A data infrastructure for agriculture.
Information Extraction with Linked Life Data 19/04/2011.
December 15, 2011 Use of Semantic Adapter in caCIS Architecture.
DASISH Metadata Catalogue Binyam Gebrekidan Gebre, Stephanie Roth, Olof Olsson, Catharina Wasner, Matej Durco, Bartholemeus Worcslav, Przemyslaw Lenkiewicz,
Integrating Live Plant Images with Other Types of Biodiversity Records Steve Baskauf Vanderbilt Dept. of Biological Sciences
Themes Architecture Content Metadata Interoperability Standards Knowledge Organisation Systems Use and Users Legal and Economic Issues The Future.
19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick.
Alexandria Digital Earth ProtoType DIGITAL LIBRARIES AND ENVIRONMENTAL INFORMATION Terence R. Smith Alexandria Digital Library Project.
A Systemic Approach for Effective Semantic Access to Cultural Content Ilianna Kollia, Vassilis Tzouvaras, Nasos Drosopoulos and George Stamou Presenter:
10/24/09CK The Open Ontology Repository Initiative: Requirements and Research Challenges Ken Baclawski Todd Schneider.
The Archaeotools project, faceted classification and natural language processing in an archaeological context. University of York, April 2008.
Technical Update 2008 Sandy Payette, Executive Director Eddie Shin, Senior Developer April 3, 2008 Open Repositories 2008, Fedora User Group.
Search Interoperability, OAI, and Metadata Sarah Shreeves University of Illinois at Urbana-Champaign Basics and Beyond Grainger Engineering Library April.
Mercury – A Service Oriented Web-based system for finding and retrieving Biogeochemical, Ecological and other land- based data National Aeronautics and.
EConnect WP1 & semantic issues VU members –Guus Schreiber, Antoine Isaac, Jacco van Ossenbruggen, Jan Wielemaker.
The Mint Mapping tool The MoRe aggregator Vassilis Tzouvaras, Dimitris Gavrilis National Technical University of Athens Digital Curation Unit - IMIS, Athena.
Access to distributed resources Lorcan Dempsey VP, Research Research Library Directors Conference OCLC Institute Post-Conference, "Building the Global.
1 Ontolog OOR-BioPortal Comparative Analysis Todd Schneider 15 October 2009.
Toward a framework for statistical data integration Ba-Lam Do, Peb Ruswono Aryan, Tuan-Dat Trinh, Peter Wetz, Elmar Kiesling, A Min Tjoa Linked Data Lab,
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
Brian Matthews, euroCRIS, 18/09/03 CRIS architecture to support an ERA Brian Matthews.
PARTHENOS-project.eu EOSC market demand for art, humanties and cultural heritage Amsterdam– EGI Conference– 7/4/2016 Franco Niccolucci Scientific Coordinator,
ARIADNE is funded by the European Commission's Seventh Framework Programme Interoperability Holly Wright.
XML 1. Chapter 8 © 2013 Pearson Education, Inc. Publishing as Prentice Hall SAMPLE XML SCHEMA (XSD) 2 Schema is a record definition, analogous to the.
ΕΚΤ Access to Knowledge ΕΚΤ Access to Knowledge R&D Statistics Information System: An Interoperability Tail between CERIF and SDMX Dimitris Karaiskos Dimitrios.
The Earth System Curator Metadata Infrastructure for Climate Modeling Rocky Dunlap Georgia Tech.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Overview of the Semantic Web Ralph R. Swick World Wide Web Consortium (W3C) 17 October 2009.
Enhancements to Galaxy for delivering on NIH Commons
Linked Open Data Approaches within the ARIADNE project
Do MORe with your data LoCloud Final Conference 5th February 2016
Fusion Tables.
Cloud based linked data platform for Structural Engineering Experiment
MongoDB Er. Shiva K. Shrestha ME Computer, NCIT
Middleware independent Information Service
Flexible Extensible Digital Object Repository Architecture
Flexible Extensible Digital Object Repository Architecture
An Architecture for Complex Objects and their Relationships
Overview of the Semantic Web Ralph R
Outline Pursue Interoperability: Digital Libraries
The Re3gistry software and the INSPIRE Registry
Application of Dublin Core and XML/RDF standards in the KIKERES
EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal
PREMIS Tools and Services
2. An overview of SDMX (What is SDMX? Part I)
2. An overview of SDMX (What is SDMX? Part I)
Session 2: Metadata and Catalogues
LOD reference architecture
SDMX in the S-DWH Layered Architecture
Robert Dattore and Steven Worley
Presentation transcript:

Integrating Data for Archaeology Dimitris Gavrilis, Eleni Afiontzi, Johan Fihn, Olof Olsson, Achille Felicetti, Franco Nicollucci, Sebastian Cuy

Introduction Traditional projects in Archaeology focused on aggregating data into one single format / system Provide users with a unified interface Improve search and retrieval Improve retrieval semantics through specialized metadata schemas ARIADNE goes one step further : data integration Try to model the domain information (ARIADNE Catalog Data Model) Use a curation aware aggregator to enrich information using the above model Improve user experience through more substantial and powerful queries

Innovation Why hasn’t anyone done this before ? Complexity Performance Domain knowledge Standard aggregation systems / architectures are insufficient.  ARIADNE Infrastructure

ARIADNE Infrastructure Flexibility Ingest diverse and heterogeneous data XML, RDF, Excel, CSV, … Handle each datastream independently and according to it’s requirements Adapting aggregation, validation, enrichment workflows Add new curation services easily and on demand

ARIADNE Infrastructure Complexity De-couple services complexity through a micro-service oriented architecture Use loosely connecting services in a highly scalable environment. Performance Scalable technologies

ARIADNE Infrastructure Domain knowledge Integrate the domain model (ACDM) into the infrastructure Make extensive use of domain thesauri (e.g. AAT) and label every resource accordingly Create specialized micro-services for curating content according to the domain needs

Data Integration Overall Architecture Repository MORe RDF Store (RDF) Validation Integration Experiments Cleaning RDF Store (CRM) Excel Sheet ARIADNE Portal Enrichment Elastic Search ARIADNE Registry Integration Archive

Use of RDF Every resource is assigned a unique and persistent identifier that is resolved through a URI Every resource has an RDF representation according to the ACDM schema

Data Curation Use of curation micro-services for enriching content Geo-normalization (identify, extract and normalize places and coordinates) Geo-coding (e.g. Geo-names) Thesauri mappings (map native subject terms to a common thesauri : AAT) Temporal normalization (identify, extract and normalize dates) Gazetteers (e.g. DAI Gazetteer) Historical & Ancient place names identification (Pelagios & Pleiades) Temporal information mappings (Perio.do)

Data Integration Data Integration is based on a 3+1 dimensions Subject Space Time Resource type

Identify & Link together Resource Types Model individual information resource types (e.g. collections, bibliographic reports, databases, datasets, etc). Identify each resources type during ingestion Link / group different resource types E.g. put all related heterogeneous resource types (reports, datasets,…) under the same collections

Thematic integration ARIADNE uses the AAT thesaurus to semantically label ALL aggregated information. AAT terms act as a glue and when combined with spatial and temporal information can produce great results Semantic expansion of terms is extensively being used in order to improve retrieval. Expansion of multi-lingual terms facilitates cross-language search without requiring automatic translation.

Spatial & Temporal All resources with spatial information Are assigned WGS84 projected coordinates All resources with temporal information Are normalized according the ACDM dates (that takes into account periods, period names and supports ISO date format).

Subject Terms Curation Lifecycle *nativeSubjects AAT Native Subjects mappings Vocabulary Mapping Tool MORe Provider Native Repository *nativeSubjects *providedSubjects Excel Sheet XML Files Registry *nativeSubjects *providedSubjects ACDM / Subjects (JSON) **providedSubjects **derivedSubjects **broaderGenericSubjects *nativeSubjects ARIADNE Portal Elastic Search *mono-lingual (prefLabel only) ** multi-lingual (prefLabel & altLabel)