Medusa at the University of Illinois

Slides:



Advertisements
Similar presentations
Curating Research: problems and policy Dale Peters Scientific Technical Manager DRIVER II.
Advertisements

1 Metadata Tools for JISC Digitisation Projects of still images and text Ed Fay BOPCRIS, Hartley Library University of Southampton.
PREMIS: To Be or Not To Be in My METS The Preservation Journey at the University of Connecticut Libraries ALA Annual 2013 ALCTS PARS Intellectual Access.
METS: An Introduction Structuring Digital Content.
Stefania Bergamasco, Cecilia Colasanti An integrated approach to turn statistics into knowledge combining data warehouse, controlled vocabularies and advanced.
Interoperability and Preservation with the Hub and Spoke (HandS) Matt Cordial, Tom Habing, Bill Ingram, Robert Manaster University of Illinois Urbana-Champaign.
CHAPTER 7 Roderick Dickson Kelli Grubb Tracyann Pryce Shakita White.
Interoperability and Preservation with the Hub and Spoke (HandS) Tom Habing, Bill Ingram, Robert Manaster University of Illinois Urbana-Champaign
METS Dr. Heike Neuroth EMANI – Project Meeting February 14 th - 16 th, 2002 Springer-Verlag Heidelberg Göttingen State and University Library (SUB)
Funded by: © AHDS Sherpa DP – a Technical Architecture for a Disaggregated Preservation Service Mark Hedges Arts and Humanities Data Service King’s College.
R.Jantz, August 31, Two-day forum on PREMIS Preservation Metadata and the Trusted Digital Repositories August 31, September 1 National Library of.
3. Technical and administrative metadata standards Metadata Standards and Applications.
PREMIS What is PREMIS? – Preservation Metadata Implementation Strategies When is PREMIS use? – PREMIS is used for “repository design, evaluation, and archived.
US GPO AIP Independence Test CS 496A – Senior Design Team members: Antonio Castillo, Johnny Ng, Aram Weintraub, Tin-Shuk Wong Faculty advisor: Dr. Russ.
DigiTool METS Profile DigiTool Version 3.0. DigiTool METS Profile 2 What is METS? A Digital Library Federation initiative built upon the work of MOA2.
US GPO AIP Independence Test CS 496A – Senior Design Fall 2010 Team members: Antonio Castillo, Johnny Ng, Aram Weintraub, Tin-Shuk Wong.
PREMIS What is PREMIS? o Preservation Metadata Implementation Strategies When is PREMIS use? o PREMIS is used for “repository design, evaluation, and archived.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
AIP Archival Information Package – Defines how digital objects and its associated metadata are packaged using XML based files. METS (binding file) MODS.
1 Workshop on Metadata Interoperability for Electronic Records Management November 15, 2001 Archives II, College Park, MD.
US GPO AIP Independence Test CS 496A – Senior Design Team members: Antonio Castillo, Johnny Ng, Aram Weintraub, Tin-Shuk Wong Faculty advisor: Dr. Russ.
Metadata standards, tools and processes for audio preservation at the British Library: An overview of new systems for audio description, preservation and.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
MEDIN Data Guidelines. Data Guidelines Documents with tables and Excel versions of tables which are organised on a thematic basis which consider the actual.
US GPO AIP Independence Test CS 496A – Senior Design Team members: Antonio Castillo, Johnny Ng, Aram Weintraub, Tin-Shuk Wong Faculty advisor: Dr. Russ.
SOFTWARE ENGINEERING BIT-8 APRIL, 16,2008 Introduction to UML.
Controlled Vocabularies (Term Lists). Controlled Vocabs Literally - A list of terms to choose from Aim is to promote the use of common vocabularies so.
IIPC GA, Stanford, US - WARCApril 28 th 2015Slide 1 WARC as Package Format for all Preserved Digital Material by Eld Zierau The Royal Library of Denmark.
PREMIS Implementation at The Royal Library of Denmark by Eld Zierau.
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
ECHO DEPository Project: Highlight on tools & emerging issues The ECHO DEPository Project is a 3-year digital preservation research and development project.
Data Management David Nathan & Peter Austin & Robert Munro.
HUB AND SPOKE TOOL SUITE PREMIS Implementation Fair – 7 October 2009 Bill Ingram Visiting Research Programmer University of Illinois at Urbana-Champaign.
Implementor’s Panel: BL’s eJournal Archiving solution using METS, MODS and PREMIS Markus Enders, British Library DC2008, Berlin.
Implementation of PREMIS in METS Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair San.
Roy Tennant California Digital Library escholarship.cdlib.org/rtennant/presentations/2003cil/ Achieving Together What None Can Do Alone: Interoperability.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
Habing1 Integrating PREMIS and METS PREMIS Tutorial Implementers’ Panel June 21, 2007, 9:00-5:30 Library of Congress, Jefferson Building, Whittall.
PREMIS Implementation Fair – SF 2009 PREMIS use in Rosetta Yair Brama – Ex Libris.
1 Digital Preservation Testbed Database Preservation Issues Remco Verdegem Bern, 9 April 2003.
Digital library infrastructure -- systems Repositories for storing digital resources protect, manage, deliver, and preserve digital resources over time.
PREMIS Implementation Fair, San Francisco, CA October 7, Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.
IMPLEMENTATION ISSUES. How PREMIS can be used  For systems in development as a basis for metadata definition  For existing repositories as a checklist.
PREMIS at the British Library Markus Enders, The British Library PREMIS Implementation Fair, San Fransisco, CA 07 October 2009.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Digital Preservation Panel Medusa at the University of Illinois at Urbana-Champaign: A Digital Preservation Service Based on PREMIS Kyle Rimkus, Preservation.
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
Memory Masters Preserving Digitized Histories— for today, for tomorrow, and for the future This project is made possible by a grant from the federal Institute.
Repository-specific Spoke Scripts Content Repository JSR-170/283 Content Repository for Java Technology API Normalized H&S METS Files METS Import/ExportMETS.
Developing a Dark Archive for OJS Journals Yu-Hung Lin, Metadata Librarian for Continuing Resources, Scholarship and Data Rutgers University 1 10/7/2015.
Applying preservation metadata to repositories The British Library, 21 January 2008 Led by Steve Hitchcock With Bill Hubbard, Gareth Johnson.
XML Databases Presented By: Pardeep MT15042 Anurag Goel MT15006.
Management Information Systems by Prof. Park Kyung-Hye Chapter 7 (8th Week) Databases and Data Warehouses 07.
Data Modeling Using the Entity- Relationship (ER) Model
Databases: What they are and how they work
What’s New in Colectica 5.3 Part 1
Chapter 2 Database Environment Pearson Education © 2009.
What is a Database and Why Use One?
Integrating PREMIS and METS
File Systems and Databases
Database Environment Transparencies
METS, MODS and PREMIS, Oh My! (and a little MIX and other schema too)
PREMIS Tools and Services
METS, MODS and PREMIS, Oh My! (and a little MIX and other schema too)
MANAGING DATA RESOURCES
Metadata in Digital Preservation: Setting the Scene
Message Queuing.
Database Design Hacettepe University
Palestinian Central Bureau of Statistics
Presentation transcript:

Medusa at the University of Illinois A Digital Preservation Repository Built Upon PREMIS Kyle Rimkus Preservation Librarian University of Illinois Urbana-Champaign rimkus@uiuc.edu presented October 2, 2012 iPres 2012, Toronto

National Digital Infrastructure and Information Preservation(NDIIPP) Program grants Phase I : 2004-2007 Phase II: 2007-1010 Background of dp at UIUC… “Hub and Spoke” (HandS tool suite): http://dli.grainger.uiuc.edu/echodep/hands/index.html

HandS METS Profile http://www.loc.gov/standards/mets/profiles/00000015.html This approach was distinguished by several key factors:   the reliance on PREMIS for digital preservation metadata the reliance on MODS for descriptive metadata the packaging of PREMIS, MODS, and other associated metadata and file information in the METS format, using a METS profile designed specifically for this project, to describe the relationships

Medusa is Born The central idea of our PREMIS implementation is that it is platform and infrastructure independent. The PREMIS records that describe our digital objects do so in such as way that the system in which they are currently managed is of little consequence – the emphasis being placed not on the software, but the objects in it and the records that describe them.

PREMIS in Medusa The central concept here is that of the self-describing, encapsulated object. That is, every digital asset stewarded in Medusa – whether a content or metadata file – is assigned a unique ID and an associated PREMIS file which tells the story of that item. We dislike the practice common to many repository platforms where, for example, digital content files live in one place, such as a file server, and metadata lives in a database. In such an infrastructure, there is an inherent risk to the long-term viability of digital objects, as their constituent parts are split up across a variety of systems subject to their own specific risk factors. We also like this because we are not leaving any metadata of importance in a database or other external application; we make sure to store all digital preservation metadata and relation metadata in our PREMIS files.

PREMIS “relationship” vs METS “structMap” Intellectual entities Rights Objects Agents Events …we do this without METS. We found from our Hub and Spoke experience that there is considerable overlap between many of the fields available in METS and PREMIS – you have to choose whether to place file information such as file type and file size, among other things, in one, the other, or both. When you come down to it, in fact, and this is perhaps a gross oversimplification – but for the sake of our discussion, let’s say that the one thing you have in METS that you do not necessarily have in PREMIS is the one required METS tag of “structMamp” or “structure map” for indicating the structure of METS packages and the relationships between metadata and items. As anyone who has worked with METS knows, there can be considerable overhead involved in expressing such relationships in consistently valid XML, especially if you end up going down the path of generating vast METS files for complex objects.

PREMIS Controlled Vocabularies Relationship Types and Subtypes: …taking a linked-data inspired approach to the use of the PREMIS “relationship” tag – preferring the flexibility it offers to construct a web of arbitrary relationships between digital assets with a rather simple structure rather than the more baroque, strictly hierarchical alternative offered by METS records.

A PREMIS snippet <relationship> <relationshipType>BASIC_IMAGE_ASSET</relationshipType> <relationshipSubType>PARENT</relationshipSubType> <relatedObjectIdentification> <relatedObjectIdentifierType>LOCAL</relatedObjectIdentifierType> <relatedObjectIdentifierValue>MEDUSA:4052dc68-7c0f-420b-9d07-840c79768ae9-2</relatedObjectIdentifierValue> </relatedObjectIdentification> </relationship> By “linked data inspired,” and we’ve also talked about this being inspired by object oriented programming, I mean the following. The “relationship” tag, by allowing a “relationshipType” and a “relationshipSubType” to refine it, the PREMIS standard offers a considerable amount of flexibility.

PREMIS Controlled Vocabularies What we end up with, in effect, is the ability to define any type of relationship we want between any single asset and any number of others; and have similar flexibility with defining Events, Agents, and Rights specific to assets in our digital preservation environment. Currently, our technical team is creating these terms as they go along, and often have a very loose connection to controlled vocabularies.

A PREMIS Archival Information Package …system is still under development.

…sample of some actual XML generated by one of our test packages.

Questions? Public documentation coming soon (before 2013) at: http://medusa.library.illinois.edu https://wiki.cites.uiuc.edu/wiki/display/ LibraryDigitalPreservation/Home