Interoperability of enhanced publications: the DRIVER Tech Watch Report SUETR Interoperability Workshop Tues, Dec 9 th, 2008 Karen Van Godtsenhoven, UGENT,

Slides:



Advertisements
Similar presentations
Adding OAI-ORE Support to Repository Platforms Alexey Maslov, Adam Mikeal, Scott Phillips, John Leggett, Mark McFarland Texas Digital Library TCDL09.
Advertisements

Enhanced Publications Presentation for ODaF Europe 2009 Thomas Place 2 April 2009.
DRIVER Building a worldwide scientific data repository infrastructure in support of scholarly communication 1 JISC/CNI Conference, Belfast, July.
DRIVER Institutional repositories and CRIS systems – the role of DRIVERs infrastructure, concepts and organisation 1 Nordbib Workshop 2008 Dale Peters,
DRIVER Long Term Preservation for Enhanced Publications in the DRIVER Infrastructure 1 WePreserve Workshop, October 2008 Dale Peters, Scientific Technical.
UKOLN is supported by: JISC Information Environment update Repositories and Preservation Programme meeting, October 24-25, 2006 Rachel Heery UKOLN
Andy Powell, Eduserv Foundation July 2006 Repository Roadmap – technical issues.
Andy Powell, Eduserv Foundation Feb 2007 The Dublin Core Abstract Model – a packaging standard?
Access to Knowledge; New roles for universities and libraries Leo Waaijers Disciple of Eve eIFL Seminar OPEN ACCESS: EXPLORING SCHOLARLY COMMUNICATION.
International Conference on Dublin Core and Metadata Applications DC-Scholar, 24 th September /10/2014 Scholarly Works Application.
A centre of expertise in digital information management UKOLN is supported by: The Dublin Core Application Profile for Scholarly Works.
Interoperability Aspects in Europeana Antoine Isaac Workshop on Research Metadata in Context 7./8. September 2010, Nijmegen.
Developing a Metadata Exchange Format for Mathematical Literature David Ruddy Project Euclid Cornell University Library DML 2010 Paris 7 July 2010.
Update on the SWORD Protocol & Future Directions.
LBSC 670 Information Organization. Today The web and automated information services Data, Ontologies and Web-services Protégé work time.
Software Recommendations CM Jones, JE Brace, PL Cave & DR Puplett VIF workshop 22 nd April
Repositories: Disruptive Technology or Disrupted Technology? Sandy Payette, Executive Director DORSDL Workshop at ECDL 2008 September 2008.
JOINING UP GOVERNMENTS EUROPEAN COMMISSION ADMS-enabled exploration of GS1 Dox 20 February 2013.
Depositing e-material to The National Library of Sweden.
Object Re-Use and Exchange Mellon Retreat, Nassau Inn, Princeton, NJ, March Herbert Van de Sompel, Carl Lagoze The OAI Object Re-Use & Exchange.
UKOLN is supported by: OAI-ORE a perspective on compound information objects ( Defining Image Access.
A centre of expertise in digital information management UKOLN is supported by: Eprints Application Profile UK Repositories Search Project.
UKOLN is supported by: A non-technical introduction to: OAI-ORE ( Defining Image Access project meeting.
Dspace – Digital Repository Dawn Petherick, University Web Services Team Manager Information Services, University of Birmingham MIDESS Dissemination.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
© 2006 DCMI DC-2006 – International Conference on Dublin Core and Metadata Applications 3-6 October 2006 Thomas Baker Dublin Core Metadata Initiative.
Tutorial 8 Sharing, Integrating and Analyzing Data
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
UKOLUG - July Metadata for the Web RDF and the Dublin Core Andy Powell UKOLN, University of Bath UKOLN.
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales
Addressing Metadata in the MPEG-21 and PDF-A ISO Standards NISO Workshop: Metadata on the Cutting Edge May 2004 William G. LeFurgy U.S. Library of Congress.
Semantic Web Technologies ufiekg-20-2 | data, schemas & applications | lecture 21 original presentation by: Dr Rob Stephens
Practical RDF Chapter 1. RDF: An Introduction
Using IESR Ann Apps MIMAS, The University of Manchester, UK.
5-7 November 2014 DR Workflow Practical Digital Content Management from Digital Libraries & Archives Perspective.
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
MPEG-21 : Overview MUMT 611 Doug Van Nort. Introduction Rather than audiovisual content, purpose is set of standards to deliver multimedia in secure environment.
SWAP FOR DUMMIES. Scholarly Works Application Profile a Dublin Core Application Profile for describing scholarly works (eprints) held in institutional.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
Fedora Content Models for the National Science Digital Library Data Repository Fedora User’s Group Meeting Copenhagen, September 28, 2005 Carl Lagoze Cornell.
Meta Tagging / Metadata Lindsay Berard Assisted by: Li Li.
Lifecycle Metadata for Digital Objects (INF 389K) September 18, 2006 The Big Metadata Picture, Web Access, and the W3C Context.
In Dublin’s fair city, where the metadata are so pretty… John Roberts Archives New Zealand.
"How much?": Aggregating usage data from Repositories in the UK Jo Lambert, Ross Macintyre, Paul Needham, Jo Alcock OR2015.
A bad case of content reuse Validator Website to Validate License Violations Validator – Only requires the URI of the site to check for a license violation.
Scientific Data and Electronic Publishing Renze Brandsma, Head, Digital Production Centre University of Amsterdam Maarten Hoogerwerf, Project Manager,
Van de Sompel, Herbert Los Alamos National Laboratory – Research Library OAI-PMH for Resource Harvesting.
Accessing a national digital library: an architecture for the UK DNER Andy Powell ELAG 2001, Prague 7 June 2001 UKOLN, University of Bath
Lifecycle Metadata for Digital Objects November 1, 2004 Descriptive Metadata: “Modeling the World”
Breakout session OAI The future of scholarly communication: Enhanced Publications Saskia Woutersen University of Amsterdam.
Open Archive Initiative – Protocol for metadata Harvesting (OAI-PMH) Surinder Kumar Technical Director NIC, New Delhi
Metadata and Technology/Architecture Working Groups DLF Aquifer Project DLF Fall Forum Providence, RI November 14, 2008.
OAI Object Reuse & Exchange: Atom Serialization Nordbib Workshop, September , Stockholm, Sweden OAI-ORE: Atom Serialization The ORE Editors are:
Fedora Content Modeling for Improved Services for Research Databases Open Repositories 2009 Mikael Karstensen Elbæk Alfred Heller Gert Schmeltz Pedersen.
A centre of expertise in digital information management Content Packaging for Complex Objects Technical Workshop: Introduction.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
A centre of expertise in digital information management UKOLN is supported by: Functional Requirements Eprints Application Profile Working.
Open Archive Forum Rachel Heery UKOLN, University of Bath UKOLN is funded by Resource: The Council for Museums, Archives.
UKOLN is supported by: Content packaging and MPEG-21 DID Andy Powell, UKOLN, University of Bath JISC Joint Programmes Meeting, July.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
The NSDL, OAI and Your Metadata Core Infrastructure Metadata Repository (“union catalog”) Naomi Dushay Cornell University.
Metadata & Repositories Jackie Knowles RSP Support Officer.
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Jenn Riley Metadata Librarian Digital Library Program
Outline Pursue Interoperability: Digital Libraries
PREMIS Tools and Services
Jenn Riley Metadata Librarian Digital Library Program
Presentation transcript:

Interoperability of enhanced publications: the DRIVER Tech Watch Report SUETR Interoperability Workshop Tues, Dec 9 th, 2008 Karen Van Godtsenhoven, UGENT,

Context: DRIVER-II Technology Watch DRIVER-II project: create EU repository infrastructure, create services on top, deliver software (D-NET), streamline developments (DRIVER guidelines, validator), support repository managers (helpdesk and mentor service) and raise awareness (Open access) DRIVER-II focuses on services and demonstrators for enhanced publications Ep’s: can contain many (all kinds of) data formats, but within DRIVER-II, basis: textual element Dicovery workpackage: create object model for ep’s, demonstrator, Technology Watch report – Long Term Preservation, GRID computing, CRIS systems and

Interoperability of enhanced publications (Russell, Vanderfeesten, Hochstenbach, Van Godtsenhoven) Interoperability in DRIVER context: exchange and dissemination of ep’s as complex, compound objects Interoperability chapter focuses on five types of structural metadata (the relationship of the files within the objects), NOT on ingest or descriptive metadata (eg SWORD, Bag-it) For every type, a theoretical description and applications (case studies) are given, as well as an evaluation in the light of DRIVER.

Five formats for dissemination of ep’s 1. Envelope models or packaging formats: METS, MPEG 21-DIDL, LOM/IMS-CP, ODF, OOXML, Overlays, maps, feeds: RDF, SWAP, POWDER, ORE 3. Embedding formats: RDFa, Microformats 4. New/Old publishing formats: ODF, OOXML, CML, XHTML 5. Web services: OKI (SOA), Gdata (ROA)

1. Envelopes These formats provide access to the metadata, structural data, identifiers, and binary streams of publications all in one package (= envelope). 1. MPEG 21-DIDL in DARE context 2. METS 3. IMS – CP 4. ODF packages 5. OOXML/ Package convention 6. Open e-book package

Envelopes, II Comparison: table with all features Outcome: All Package formats are useful for representing an Enhanced Publication as a Dissemination Information Package. Most of these results are gained through the ability to create different relationships among the different parts. This gives DRIVER the opportunity to harvest enhanced publications packaged in different formats used by different user communities. On an aggregated level, where all sources are harvested, it is possible to create relational maps between all sub-parts of the enhanced publications.

2. Overlays, Maps, Feeds These formats provide an overlay on top of an existing network of internet resources. They tend to group references to resources, identify them and describe the content, structure and relations of all parts 1. SWAP 2. ORE 3. POWDER

2.1 SWAP A Dublin Core Profile for describing scholarly works Designed to offer solution to range of interoperability issues that arise when using simple DC Supports provision of richer & more consistent metadata Plus, eg version control, identification of full text Based on FRBR; uses DCMI Abstract Model/description sets Hierarchical model could be suitable for DRIVER enhanced publications

However… Despite much enthusiasm and support for the SWAP concept, no ‘proper’ implementation… Requires commitment/resources to implement (people too busy trying to do the basics…) Repository software developers need to implement first (currently have export plug-ins only) Too complex? (FRBR…) SWAP-Lite needed?! DRIVER as aggregator – wait and see if uptake happens...

SWAP case studies (partial implementation) University of Warwick –configured ePrints themselves to take SWAP records (not an easy task) –some problems encountered viewed in the community as being caused by SWAP (not the case eg Refs) CLADDIER project –used for citations – selected small no. SWAP fields –limited application

2.2 OAI- ORE Version 1.0 just released The collection of resources that make up a scholarly publication is called an Aggregation, consisting of Aggregated Resources. In order to instantiate, describe and identify Aggregations, OAI-ORE defines Resource Maps which provide also information about the context in which an Aggregation was defined. OAI-ORE suggests many published models for ORE documents using Atom, RDF/XML, OAI-PMH, and RDFa. Case studies include SCOPE, TheoREM project, experiments at Urbana-Champaign, ORE serialization of objects based on Fedora model, functional ORE DRIVER and ORE need to exchange views and ensure interoperability since ORE is a major player in the repository world.

ORE: an Aggregation containing three Aggregated Resources described by a Resource Map

2.3. POWDER POWDER, or Protocol for Web Description Resources, W3C working draft : description of a group of resources through the publication of machine-readable metadata documents. Groups of resources (=aggregations) can be described as a whole by enumerating the individual items, or matching URI’s against descriptions of the URI’s schemes used Use case: Trustmarks and verification (Online safety) POWDER allows to write about many resources at once. (vs ORE: inverse scenario, in ORE, one looks at an aggregation and wants to know the resources & their properties in the aggregation. In POWDER, you want to know to which aggregation a resource belongs and learn about it through aggregation. Hence POWDER is able to describe multiple things at once)

3. Embedding [ ] Existing resources are ‘beautified’/enriched by adding semantic annotations. Hence, the PDF link is embedded in splash page with special code highlighting its location. Microformats community (W3C): widely used (Yahoo, Flickr) Case study: Zotero (Urbana-Champaign), unAPI (clipboard like content copy across sites and browsers) Microformats could enable collecting references whilst working with Driver. Easy to harvest from, machine-readable html annotations

4. New Publishing formats ODF (26300:2006) versus OOXML (ISO :2008) –file format ISO standards for saving & exchanging office documents (alternative to proprietary formats eg doc, ppt) –open up access to structured content which can be reused by other services eg DRIVER –controversy surrounding development of OOXML eg Microsoft chose not to support existing ODF standard CML – disciplinary application of chemical structures Plus: disciplinary xml types, structured and crawlable data

5. Web services Web services: DRIVER needs to add API’s (in addition to OAI-PMH) on top of digital repositories to answer questions from agents on the content of your collections. Very large world, web services: two main approaches: ROA and SOA (DRIVER combines them: ROA external and SOA internal) Case studies: Gdata (ROA), Open Knowlegde Initiative (SOA) Outcome: widely used in research as well as e-learning community (OKI). DRIVER should follow up on developments and try to stay interoperable (Gdata).

Credits Karen Van Godtsenhoven University of Ghent Mikael Karstens Elbaek, DTU Gert Schmeltz Pedersen, DTU Barbara Sierman, KB Maurice Vanderfeesten, SURF Rosemary Russell, UKOLN

DRIVER II Project Helpdesk: infrastructures.euhttp://helpdesk.driver.research- infrastructures.eu Mentor service: support.eu/forms/contactsform.php?la=enhttp:// support.eu/forms/contactsform.php?la=en Supported by European Commission Available for re-use -