The PLANETS-Ontology in the context of the PLANETS-Testbed and the XCL-Software.

Slides:



Advertisements
Similar presentations
Max Kaiser: PLANETS Testbed
Advertisements

PC in TB Manfred Thaller PLANETS TB meeting, DenHaag, Sept 28th. '06.
XCEL / XCDL Tools Jan Schnasse PLANETS: Den Haag,
PC/4 Manfred Thaller PLANETS TB meeting, DenHaag, Sept 29th. '06.
Curating Research: problems and policy Dale Peters Scientific Technical Manager DRIVER II.
Characterisation Adrian Brown The National Archives, UK.
Introduction to Planets Hans Hofman Nationaal Archief Netherlands Prague, 17 October 2008.
Preservation as a Process of a Repository David Tarrant University of Southampton (UK) Preserv Repository Preservation and Interoperability.org.uk.
ETD‘s as pilot materials for long-term preservation efforts in kopal 9th ETD Conference 2006, Quebec Dr. Thomas Wollschläger, German National Library (GNL)
Mark Evans, Tessella Digital Preservation Boot Camp – PASIG meeting, Washington DC, 22 nd May 2013 PREMIS Practical Strategies For Preservation Metadata.
Automatic Evaluation of Migration Quality in Distributed Networks of Converters Miguel Ferreira Supervisors Ana Alice Baptista.
Preservation and Long-term access through Networked Services Adam Farquhar, The British Library iPres2006 Cornell University, October 2006.
1 Using Scalable and Secure Web Technologies to Design Global Format Registry Muluwork Geremew, Sangchul Song and Joseph JaJa Institute for Advanced Computer.
© 2010 Microsoft Corporation. All rights reserved. Quality Assurance: Towards Tools for Characterizing and Comparing Digital Documents Natasa Milic-Frayling.
Internet Resources Discovery (IRD) IBM DB2 Digital Library Thanks to Zvika Michnik and Avital Greenberg.
Columbia University Dept of Computer Science Center for Research on Info Access University of So. Calif Information Sciences Institute (ISI)
A Framework for Distributed Preservation Workflows Rainer Schmidt AIT Austrian Institute of Technology iPres 2009, Oct. 5, San.
P reservation and L ong-term A ccess through N ETworked S ervices.
® IBM Software Group © IBM Corporation IBM Information Server Deliver – Federation Server.
Different approaches to digital preservation Hilde van Wijngaarden Digital Preservation Officer Koninklijke Bibliotheek/ National Library of the Netherlands.
Digital preservation Hydra Europe, LSE 24 April 2015 Anders Conrad.
INTRODUCTION TO WEB DATABASE PROGRAMMING
Chinese-European Workshop on Digital Preservation, Beijing July 14 – Network of Expertise in Digital Preservation 1 Trusted Digital Repositories,
Integrating Digital Curation in a Digital Library curriculum: the International Master DILL case study Anna Maria Tammaro University of Parma Florence,
Robert Sharpe, Tessella PRELIDA Workshop 2013 ENSURE Linked Data Registry.
1 Introduction to Modeling Languages Striving for Engineering Precision in Information Systems Jim Carpenter Bureau of Labor Statistics, and President,
A Generic Software Framework for building Hybrid Ontology-Backed Models for Driving Applications Colin Puleston, James Cunningham, Alan Rector Bio-Health.
Environmental Terminology Research in China HE Keqing, HE Yangfan, WANG Chong State Key Lab. Of Software Engineering
1 DELOS Network of Excellence on Digital Libraries with a focus on the Preservation Cluster Andreas Rauber Vienna University of Technology
Sustainable models for digital preservation Adam Farquhar The British Library Sustainability Models for Digital Preservation, Brussels, Nov, 2007.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
Nicholas LoulloudesMarch 3 rd, 2009 g-Eclipse Testing and Benchmarking Grid Infrastructures using the g-Eclipse Framework Nicholas Loulloudes On behalf.
Entity Framework Overview. Entity Framework A set of technologies in ADO.NET that support the development of data-oriented software applications A component.
Caring and Sharing Collaboration in Digital Curation outside North America Ross Harvey Simmons College, Boston Curation Matters: 17 June 2010.
The XCL Languages Digital Preservation – The Planets Way Dresden, April 23 rd 2010 Manfred Thaller, Universität zu Köln.
EXtensible Characterisation Languages (XCL) Manfred Thaller, (University at Cologne) DPP meeting, Glasgow, Nov. 23 rd 2006.
Moving from a locally-developed data model to a standard conceptual model Jenn Riley Metadata Librarian Indiana University Digital Library Program.
Selected Topics in Software Engineering - Distributed Software Development.
METADATA WORKSHOP Conclusions Keith Jeffery Peter Wittenburg.
Implementor’s Panel: BL’s eJournal Archiving solution using METS, MODS and PREMIS Markus Enders, British Library DC2008, Berlin.
Semantic Web - an introduction By Daniel Wu (danielwujr)
Metadata and Documentation Iain Wallace Performing Arts Data Service.
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
1 Digital Preservation Testbed Database Preservation Issues Remco Verdegem Bern, 9 April 2003.
Microsoft Research Faculty Summit Natasa Milic-Frayling & Vijay Rajagopalan Microsoft Corporation.
Tsinghua University Library Yang Zhao & Airong Jiang Tsinghua University Library, Beijing China 4 June, 2004 Electronic Thesis and Dissertation System.
SCAPE Rainer Schmidt SCAPE Training Event September 16 th – 17 th, 2013 The British Library Building Scalable Environments Technologies and SCAPE Platform.
A Fedora 3 to 4 Migration Case Study for UNSW Australia Library Fedora 4 Training Workshop, eResearch Australasia 2015, Brisbane UNSW Library Arif Shaon,
Carcanet Case Study Fran Baker, John Rylands University Library University of Manchester SPRUCE event 19 January 2012.
A Fedora 3 to 4 Migration Case Study for UNSW Australia Library Fedora 4 Training Workshop, eResearch Australasia 2015, Brisbane UNSW Library Arif Shaon,
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
XCL-Tools in relation to Significant characteristics in Planets Manfred Thaller Universität zu* Köln *University at not of Cologne.
THE SEMANTIC WEB By Conrad Williams. Contents  What is the Semantic Web?  Technologies  XML  RDF  OWL  Implementations  Social Networking  Scholarly.
Technical Awareness on Analysis of Headers.
Digital File Formats By Ali Aslam. JPEG JPEG Stands for Joint Photographic Experts Group. JPEG uses a lossy compression routine. Lossy compression means.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
1 « Luxembourg, 18 April 2007 « Virtual Library of Official Statistics « Dissemination Working Group.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
Digital Stewardship Lee Dotson Digital Initiatives Librarian University of Central Florida John C. Hitt Library Presentation available at
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Large Scale Semantic Data Integration and Analytics through Cloud: A Case Study in Bioinformatics Tat Thang Parallel and Distributed Computing Centre,
Dependency Management
An Introduction to Tessella and The Safety Deposit Box Platform
Architecture Components
Flexible Extensible Digital Object Repository Architecture
Flexible Extensible Digital Object Repository Architecture
ADO.NET Entity Framework Marcus Tillett
Digital Preservation Planning:
Semantic Markup for Semantic Web Tools:
Presentation transcript:

The PLANETS-Ontology in the context of the PLANETS-Testbed and the XCL-Software

PLANETS (Introduction) Preservation and Long-term access via Networked Services

PLANETS (Introduction) Preservation and Long-term access via Networked Services Funded by the EC

PLANETS (Introduction) Preservation and Long-term access via Networked Services Funded by the EC 4 year project until June 2010

PLANETS – PARTNERS I  The British Library  National Library, Netherlands  Austrian National Library  State and University Library, Denmark  Royal Library, Denmark  National Archives, UK  Swiss Federal Archives  National Archives, Netherlands

PLANETS – PARTNERS II  Tessella Plc  IBM Netherlands  Microsoft Research, Cambridge  Austrian Institute of Technology  Hatii at University of Glasgow  University of Freiburg  Technical University of Vienna  University at Cologne

The PLANETS project-structure

-Service-oriented architecture that offers several preservation-tools wrapped as Java-based services - provides the basic infrastructure, used by all PLANETS-subprojects

The PLANETS project-structure Creating high quality preservation plans to meet the requirements of memory institutions and the kind of their digital resources

The PLANETS project-structure Tools & Strategies: - migration und emulation - Universal Virtual Computer - Investigation of existing Solutions and additional development of new Software (context aware objects)

The PLANETS project-structure - Inspection of different long-term preservation strategies through experiments with different tools. - Design, implementation und execution of these self-developed or open-source tools to messure the reliability and accuracy of these tools on digital resources.

The PLANETS project-structure XCL-Framework for characterisation and comparison of files Pronom: file-format identification

Introduction to the XCL-Tools Problem 1: Memory Institutions (Libraries, Archives, Museums) store a lot of digital material (e.g. images, text-files, s, databases, audio-video-material). How do they find out, that their data has been damaged during a conversion?

Introduction to the XCL-Tools Problem 1 - bit-detorriation during conversion: before:after:

Introduction to the XCL-Tools Problem 2: What if a file-format becomes obsolete? (e.g. *.doc)  Example-Strategy: file-format migration

Introduction to the XCL-Tools Problem 2: What happened here? before: after:

Introduction to the XCL-Tools Two XML-based languages: XCEL (Extensible Characterisation Extraction Language)

Introduction to the XCL-Tools Two XML-based languages: XCEL (Extensible Characterisation Extraction Language) XCDL (Extensible Characterisation Definition Language)

Introduction to the XCL-Tools Two Tools: Extractor uses XCEL-files to extract information from files and produces XCDL-files for each extracted file

Introduction to the XCL-Tools Two Tools: Extractor uses XCEL-files to extract information from files and produces XCDL-files for each extracted file Comparator compares two or more XCDL-files with different statistical methods to express the equality of these XCDL files  both available as command-line-tools and wrapped as services in the IF-subproject

Introduction to the XCL-Tools The XCL-tools know „normdata“

Properties in the XCL-Tools

The XCL-Ontology Migrator tiff png Extractor tiff XCEL png XCEL Comparator png XCDL tiff XCDL 93% XCL Ontology

The XCL-Ontology Lists all available property-names for file-formats, the extractor can process

The XCL-Ontology Lists all available property-names for file-formats, the extractor can process Maps these terms to a normalised naming, which is then used in the XDL-languages

The XCL-Ontology Lists all available property-names for file-formats, the extractor can process Maps these terms to a normalised naming, which is then used in the XDL-languages Defines datatypes and units for these properties

Ontologies – what? Definition: “formal, shared conceptualization of a particular domain of interest” (T. Gruber, “ Translation Approach to Portable Ontology Specifications”, Knowledge Acquisition 5, No. 2, (1993). )

Ontologies – why? Advantages: XML-based (rdf-owl)

Ontologies – why? Advantages: XML-based (rdf-owl) machine-readable

Ontologies – why? Advantages: XML-based (rdf-owl) machine-readable Clear relationships between different kinds of „things“

Ontologies – why? Advantages: XML-based (rdf-owl) machine-readable Clear relationships between different kinds of „things“ Enable inference

Ontologies – why? Advantages: XML-based (rdf-owl) machine-readable Clear relationships between different kinds of „things“ Enable inference are extensible

What concepts do we need? Example A colour map is called ‘PLTE’ in PNG Is the same as ‘lookup’ in pdf

What concepts do we need? Example A colour map is called ‘PLTE’ in PNG Is the same as ‘lookup’ in pdf ‘PLTE’ is modelled as an instance of PNG_Properties ‘lookup’ is modelled as an instance of PDF_Properties

What concepts do we need? In OWL terminology: Classes (= abstract) Individuals (=concrete instances of these) Object-Properties (relationships between individuals of different classes)

And what about datatypes and units? Abstract concepts as Classes: datatypes and units Instances as Individuals (cm, inch / int, char)

How does it look like?

<!--  <rdfs:comment rdf:datatype="&xsd;string" >A colour map for palette colour images. Usually for RGB coloured pictures. Defined as array of hex-colour-values, appearing in the order left to right and top to bottom id25

How does it look like? <rdfs:comment xml:lang="en" >Chunk that containes the Palette for truecolour-PNGs. Use is optional. If the Image containes also an alpha-channel there should also be a bKGD Chunk. Required for colour_type3-PNGs

How is it used? As a SPARQL-based webservice to produce XML- Schema-files for each format – all adhering to the same (XCL-) naming convention

How is it used? In the current version of the PLANETS-Testbed

Next Steps „PLANETS-wide“ Ontology Include more properties (non-extractable or observational) from all PLANETS-Subprojects and external projects 1.Preservation Planning / PLATO 2.PLANETS Testbed (benchmark-goals) 3.INSPECT Unify and map these to each other...

Next Steps „PLANETS-wide“ Ontology Include more properties (non-extractable or observational)

Next Steps „PLANETS-wide“ Ontology Display the „observational“ Properties as list in the testbed-webservice Let the user add values...work in progress (until May 2010)

Summary Characterisation tool for different file-formats = Extractor (XCDL-output) Evaluation tool = Comparator (compare XCDLs) The XCL Ontology (soon extended) Embedded into the PLANETS Interoperability Framework and the PLANETS Testbed

Thank you Find out more: