Integrating metadata schema registries with digital preservation systems to support interoperability Michael Day UKOLN, University.

Slides:



Advertisements
Similar presentations
Data Publishing Service Indiana University Stacy Kowalczyk April 9, 2010.
Advertisements

Open Archives Forum Workshop University of Bath, September 2003 Workshop summing up Rachel Heery UKOLN, University of Bath
Open Archives Forum Workshop University of Bath, September 2003 Welcome Rachel Heery UKOLN, University of Bath
A Tour of the OAIS Reference Model Brian Lavoie Research Scientist Office of Research OCLC Museum Computer Network Annual Conference September 2002.
The Reference Model for an Open Archival Information System (OAIS) Michael Day Digital Curation Centre UKOLN, University of Bath
Preservation Metadata Michael Day Digital Curation Centre UKOLN, University of Bath
Metadata for preservation: the Cedars perspective
Why metadata matters for libraries... Rachel Heery UKOLN: The UK Office for Library and Information Networking, University of Bath
Issues and approaches to preservation metadata Michael Day UKOLN: UK Office for Library and Information Networking University of Bath
Preservation Metadata Initiatives: Practicality, Sustainability, and Interoperability Michael Day UKOLN, University of Bath ERPANET Training.
Collection-level description & collection management: tool for the trade or information trade-off? Collection Description Focus Workshop 4 Newcastle, 8.
The OAIS Reference Model: current implementations Michael Day, UKOLN, University of Bath Chinese-European Workshop.
The PREMIS Data Dictionary Michael Day Digital Curation Centre UKOLN, University of Bath JORUM, JISC and DCC.
Joint Information Systems Committee Digital Library Services BL/JISC Workshop Rachel Bruce JISC Programme Director The Digital Library and its Services,
The metadata challenge for libraries: a view from Europe Michael Day UKOLN: The UK Office for Library and Information Networking, University of Bath
An overview of collection-level metadata Applications of Metadata BCS Electronic Publishing Specialist Group, Ismaili Centre, London, 29 May 2002 Pete.
Metadata and the description of digital images Michael Day UKOLN, University of Bath International Digital Image Symposium London,
Preservation Metadata Initiatives: Practicality, Sustainability, and Interoperability Michael Day UKOLN, University of Bath ERPANET Training.
Digital disaster: are you prepared?, University College London, 23 June 2000 Michael Day, UKOLN Overview UKOLN is funded by Resource:
Image metadata: interoperability and exchange
UKOLN is supported by: The JISC Information Environment Metadata Schema Registry (IEMSR): Update DC-2006, Manzanillo, Mexico October 3-6, 2006 Rachel Heery.
UKOLN is supported by: JISC Information Environment update Repositories and Preservation Programme meeting, October 24-25, 2006 Rachel Heery UKOLN
Andy Powell, Eduserv Foundation July 2006 Repository Roadmap – technical issues.
Collection-level description & the Information Landscape: users evaluate strategies for resource discovery Collection Description Focus Workshop 5 Cambridge,
February Harvesting RDF metadata Building digital library portals with harvested metadata workshop EU-DL All Projects concertation meeting DELOS.
A centre of expertise in data curation and preservation DigCCur2007 Symposium, Chapel Hill, N.C., April 18-20, 2007 Co-operation for digital preservation.
Collections and services in the information environment JISC Collection/Service Description Workshop, London, 11 July 2002 Pete Johnston UKOLN, University.
Collection description & Collection Description Focus JISC/DNER Moving Image & Sound Cluster Steering Group meeting, HEFCE Office, London, 24 September.
Digital preservation Michael Day UKOLN, University of Bath, UK University of the West of England, MSc in Information.
The Dublin Core Collection Description Application Profile (DC CD AP) Pete Johnston, UKOLN, University of Bath Chair, DC Collection Description Working.
Multi-purpose metadata for collections: Creating reusable CLDs Collection Description Focus Workshop 2 Aston Business School, Birmingham 8 February 2002.
Towards consensus on collection-level description Collection Description Focus Briefing Day 1 British Library, St Pancras, London 22 October 2001 Bridget.
An introduction to collections and collection-level description Collection-Level Description & NOF-digitise projects NOF-digitise programme seminar, London,
Thinking collectively : approaching collection-level description Collection Description Focus Workshop 1 Staff House, UMIST, Manchester 1 November 2001.
October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
Metadata : Setting the Scene or a Basic Introduction Wendy Duff University of Toronto, Faculty of Information Studies.
Metadata for preservation Michael Day, UKOLN, University of Bath Chinese-European Workshop on Digital Preservation,
Documenting to preserve your data: metadata in support of digital preservation Michael Day, UKOLN, University of Bath
Seminar: OAIS Model application in digital preservation projects Michael Day, Digital Curation Centre UKOLN, University of Bath.
Metadata in support of digital preservation Michael Day, UKOLN, University of Bath Beginners Guide to Metadata:
How to build your own Dark Archive (in your spare time) Priscilla Caplan FCLA.
A centre of expertise in digital information management The MEG Metadata Schemas Registry Pete Johnston, Research Officer (Interoperability),
OAIS Open Archival Information System. “Content creators, systems developers, custodians, and future users are all potential stakeholders in the preservation.
File format registries - a global infrastructure for local persistence Andreas Aschenbrenner, ERPANET.
METADATA STANDARDS Andrew Wilson Project Manager Digital Preservation Project.
Digital preservation Michael Day UKOLN, University of Bath, UK University of Bristol, MSc in Library and Information.
Digital Preservation: Current Thinking Anne Gilliland-Swetland Department of Information Studies.
Digital preservation Michael Day UKOLN, University of Bath, UK University of Bristol, MSc in Library and Information.
A Quick Introduction to Metadata Michael Day UKOLN: The UK Office for Library and Information Networking, University of Bath
Metadata Schema Registries in the Partially Semantic Web: the CORES experience Rachel Heery, Pete Johnston, UKOLN, University of.
Integrating metadata schema registries with digital preservation systems to support interoperability Michael Day UKOLN, University of Bath, UK
Metadata for digital preservation: a review of recent developments Michael Day UKOLN, University of Bath ECDL2001, 5th European Conference.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
M-1 INGEST OVERVIEW Don Sawyer National Space Science Data Center NASA/GSFC October 13, 1999.
The OAIS Reference Model Michael Day, Digital Curation Centre UKOLN, University of Bath Reference Models meeting,
Preservation metadata and the Cedars project Michael Day UKOLN: UK Office for Library and Information Networking University of Bath
Preservation Metadata Initiatives: Status and Direction Brian Lavoie Senior Research Scientist Office of Research OCLC Archiving Web Resources Canberra.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
The OAIS Reference Model and Trustworthy Repositories Josh Lubell Manufacturing Engineering Laboratory NIST
Institutional Repositories July 2007 DIGITAL CURATION creating, managing and preserving digital objects Dr D Peters DISA Digital Innovation South.
Cedars work on metadata Michael Day UKOLN, University of Bath Cedars Workshop Manchester, February 2002.
Collection-level description: from theory to practice Minerva project meeting Paris, 24 January 2003 Pete Johnston UKOLN, University of Bath Bath, BA2.
UKOLN Metadata Projects: Subject Gateways and Preservation Metadata Michael Day UKOLN, University of Bath Electronic Resources: Definition,
An overview of the Reference Model for an Open Archival Information System (OAIS) Michael Day, Digital Curation Centre UKOLN, University.
Metadata Schema Registries: background and context MEG Registry Workshop, Bath, 21 January 2003 Rachel Heery UKOLN, University of Bath Bath, BA2 7AY UKOLN.
Metadata Issues in Long-term Management of Data and Metadata
OAIS Producer (archive) Consumer Management
Building A Repository for Digital Objects
Metadata for preservation
Presentation transcript:

Integrating metadata schema registries with digital preservation systems to support interoperability Michael Day UKOLN, University of Bath, UK 2003 Dublin Core Conference, Seattle, Washington, USA 28 September - 2 October 2003

DC-2003, Seattle, USA, 29 September 2003 Presentation outline –Preservation metadata Purpose Standards –Concepts of interoperability Metadata capture and inheritance Object exchange –Metadata schema registries Definitions Application to digital preservation systems –Concluding thoughts

DC-2003, Seattle, USA, 29 September 2003 Preservation metadata (1) All digital preservation strategies depend - to some extent - on the creation, capture and maintenance of metadata –"Preserving the right metadata is key to preserving digital objects" (ERPANET Briefing Paper, 2003) Defined as: –The various types data that will allow the re- creation and interpretation of the structure and content of digital data over time (Ludäsher, Marciano & Moore, 2001)

DC-2003, Seattle, USA, 29 September 2003 Preservation metadata (2) Metadata fulfil various roles, e.g.: –"… to find, manage, control, understand or preserve … information over time" (Cunningham, 2000) –Descriptive information; technical information about formats and structure; information about provenance and context; administrative information, e.g. for rights management –Current schemas either very complex or only provide a basic framework (sometimes both!) –Perception that different strategies and objects will need different metadata

DC-2003, Seattle, USA, 29 September 2003 Preservation metadata - standards –Developed from many different perspectives: Digital libraries: –METS, NISO Z39.87 (to support digitisation initiatives) –OCLC/RLG Framework, Cedars, NEDLIB, NLA, NLNZ –OAIS influence has been greatest in this area Records management and archival description: –Pittsburgh BAC, RKMS, NAA, VERS, PRO, EAD, etc. –Also standards not specifically developed for preservation, but with some overlap: Multimedia –MPEG-7, SMPTE, etc Rights management: –, MPEG-21, etc.

DC-2003, Seattle, USA, 29 September 2003 The OAIS model –Reference Model for an Open Archival Information System (OAIS) –ISO 14721:2003 –Established a common framework of terms and concepts –Influential on the design of some schemas »e.g., OCLC/RLG Metadata Framework –Identified basic functions: »Ingest, Data Management, Archival Storage, Administration, Access, Preservation Planning

DC-2003, Seattle, USA, 29 September 2003 OAIS functional model Administration Ingest Archival Storage Access Data Management Descriptive info. PRODUCERPRODUCER CONSUMERCONSUMER MANAGEMENT queries result sets Descriptive info. Preservation Planning orders OAIS Functional Entities (Figure 4-1) SIP DIP AIP

DC-2003, Seattle, USA, 29 September 2003 OAIS information objects Information Object (basic concept) –Data Object (bit-stream) –Representation Information (permits “the full interpretation of Data Object into meaningful information”) Information Object Classes –Content Information –Preservation Description Information (PDI) –Packaging Information –Descriptive Information

DC-2003, Seattle, USA, 29 September 2003 OAIS information packages Information package: –Container that encapsulates Content Information and PDI –Packages for submission (SIP), archival storage (AIP) and dissemination (DIP) »AIP = “... a concise way of referring to a set of information that has, in principle, all of the qualities needed for permanent, or indefinite, Long Term Preservation of a designated Information Object” –PDI = other information (metadata) “which will allow the understanding of the Content Information over an indefinite period of time” »Reference, Provenance, Context, Fixity

DC-2003, Seattle, USA, 29 September 2003 Draft categorisation (1) PracticalConceptual PRO NEDLIB DCMI METS RKMS PITT VERS NLNZ NLA CEDARS OCLC/RLG MPEG-7 Z39.87 OAIS

DC-2003, Seattle, USA, 29 September 2003 Draft categorisation (2) Earliest schemas were largely conceptual in nature: –e.g. Pittsburgh BAC model, Cedars outline specification, OCLC/RLG WG I Gradually moving towards a more practical focus: –e.g., VERS, NLNZ, METS, PREMIS WG –Convergence on XML (DTDs and Schemas) But there is an urgent need for all this practical experience to be shared –e.g., published schemas, advice on implementation, etc.

DC-2003, Seattle, USA, 29 September 2003 Implementation –We need to prove the practical value of metadata frameworks and 'outline specifications' –It can be difficult for implementers to use these as a guide to the design of real systems? –We need to move from the conceptual to the practical, need to move beyond proof-of- concept –Positive signs: METS/NISO Z39.87 OCLC/RLG PREMIS WG looking at implementation strategies for preservation metadata

DC-2003, Seattle, USA, 29 September 2003 Sustainability (1) Balance risks with costs: –There is a perception that metadata creation and maintenance will be expensive –But costs associated with data recovery are not trivial –Need to balance the risks of data loss with the cost of creating metadata »Cost/benefit analysis »Robust selection criteria »Co-operation between repositories »Re-use of existing metadata

DC-2003, Seattle, USA, 29 September 2003 Sustainability (2) Avoid imposing unnecessary costs: –Avoid large schemas (?) –Need to identify the right metadata - 'core metadata' (?)

DC-2003, Seattle, USA, 29 September 2003 Interoperability (1) Heterogeneity –The need to cope with a wide (and growing) range of preservation metadata standards, object types, formats, etc. –No realistic prospect of a single standard –Repositories will need to manage a range of metadata standards, at least within ingest and access functions

DC-2003, Seattle, USA, 29 September 2003 Interoperability (2) Metadata creation and capture –Created by humans or captured automatically? –Some metadata already exists, e.g.: »Embedded within objects »In separate databases »Generated by particular processes –Need for this metadata to be captured at creation, ingest, migration, and at other appropriate points in object life-cycle

DC-2003, Seattle, USA, 29 September 2003 Interoperability (3) Benefits of interoperability: –To support the capture or inheritance of metadata, e.g. on ingest –To support the management of multiple formats and metadata schema within a digital preservation system »Current metadata specifications not entirely clear on how this should be done –To support the exchange of information packages outside the repository, e.g. by converting to standard 'exchange formats' »Networks of 'trusted repositories'

DC-2003, Seattle, USA, 29 September 2003 Registries (1) Registries of metadata schemas may offer one way to deal with this problem Parallel concept of format registries –There is "… a pressing need to establish reliable, sustained repositories of file format specifications, documentation, and related software" (Lawrence, et al., 2000) –DSpace 'bitstream format registry' –Typed Object Model (TOM) project –Digital Library Federation, et al. recently proposed a Global digital format registry

DC-2003, Seattle, USA, 29 September 2003 Registries (2) Metadata schema registries: –"… formal systems that can disclose authoritative information about the semantics and structure of the data elements that are included within a particular metadata scheme" (Heery, et al., 2000) –Existing registries include the XML.org Registry and Repository (OASIS), and metadata registries set up by DCMI and SMPTE

DC-2003, Seattle, USA, 29 September 2003 Registry functions (1) Provides support for the ingest process –Support conversion and metadata capture tools May also provide support for the access function –The export of Dissemination Information Packages –The exchange of information packages (AIPs?) with other repositories; conversion to exchange standards Can link metadata where there are multiple instances within the system May help to manage schema evolution

DC-2003, Seattle, USA, 29 September 2003 Registry functions (2) Administration Ingest Archival Storage Access Data Management Descriptive info. PRODUCERPRODUCER CONSUMERCONSUMER MANAGEMENT queries result sets Descriptive info. Preservation Planning orders OAIS Functional Entities (Figure 4-1) SIP DIP AIP

DC-2003, Seattle, USA, 29 September 2003 Registry functions (2) Administration Ingest Archival Storage Access Data Management Descriptive info. PRODUCERPRODUCER CONSUMERCONSUMER MANAGEMENT queries result sets Descriptive info. Preservation Planning orders OAIS Functional Entities (Figure 4-1) SIP DIP AIP Registry Schemas

DC-2003, Seattle, USA, 29 September 2003 Registry functions (2) Administration Ingest Archival Storage Access Data Management Descriptive info. PRODUCERPRODUCER CONSUMERCONSUMER MANAGEMENT queries result sets Descriptive info. Preservation Planning orders OAIS Functional Entities (Figure 4-1) SIP DIP AIP Registry Schemas

DC-2003, Seattle, USA, 29 September 2003 Registries - organisational issues Registries are part of infrastructure Distributed vs. centralised approaches: –Concept of 'shared services' –Experimental distributed registries are based on Resource Description Framework (RDF) »CORES Registry »Encourage re-use of metadata (the 'application profile' concept) »Are other technologies more suitable? Who should be responsible for them?

DC-2003, Seattle, USA, 29 September 2003 Summing up Interoperability supports the reuse of metadata and the exchange of information objects There is a possible role for metadata registries to help manage these (and other) processes - but the concept needs extensive scoping and evaluation Registries are NOT a panacea

DC-2003, Seattle, USA, 29 September 2003 Acknowledgements UKOLN is funded by Resource: the Council for Museums, Archives and Libraries, the Joint Information Systems Committee (JISC) of the UK higher and further education funding councils, as well as by project funding from the JISC and the European Union. UKOLN also receives support from the University of Bath, where it is based.