“SIPS, DIPS and Trips: How we will know if we've collected enough, or the right, metadata?” George Blood Audio, LP Safe Sound Archive Intellectual.

Slides:



Advertisements
Similar presentations
When Private Becomes Public:Legal Issues Society of American Archivist Annual Meeting, 2005 Building a Rights Framework for a Digital Preservation Repository.
Advertisements

Long-Term Preservation. Technical Approaches to Long-Term Preservation the challenge is to interpret formats a similar development: sound carriers From.
PREMIS: To Be or Not To Be in My METS The Preservation Journey at the University of Connecticut Libraries ALA Annual 2013 ALCTS PARS Intellectual Access.
Museums and Digital Repositories October, The punch line… In the digital realm, museums: * are very much like libraries * tend to share the same.
To Enhance Teaching and Learning. Images Documents Maps Audio Files Film Footage Images Documents Maps Audio Files Film Footage Realia Print Hardcopy.
Documenting the Resource Malcolm Polfreman
Digital Preservation - Its all about the metadata right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel Saturday, August.
From Analog to Digital: Changes in Preservation Gregor Trinkaus-Randall Digital Commonwealth Conference Worcester, MA March 25, 2010.
Oral History, METS and Fedora: Building a Standards-Compliant Audio Preservation Infrastructure.
Mark Evans, Tessella Digital Preservation Boot Camp – PASIG meeting, Washington DC, 22 nd May 2013 PREMIS Practical Strategies For Preservation Metadata.
Merrilee Proffitt e(X)literature / Digital Cultures Project April 2003 News from the Digital Library The Metadata Encoding and Transmission Standard; the.
Z39.87 at DCAPS Oya Rieger & Danielle Mericle Digital Media Group, DCAPS May 2005 CUL Metadata Forum.
THE RUTGERS WORKFLOW MANAGEMENT SYSTEM Mary Beth Weber Cataloging and Metadata Services Rutgers University Libraries August 3, 2007.
Internet Resources Discovery (IRD) IBM DB2 Digital Library Thanks to Zvika Michnik and Avital Greenberg.
1 CS 502: Computing Methods for Digital Libraries Lecture 27 Preservation.
1 Copyright and Intellectual Property Design Issues by Jeremy Rowe
Web archiving at the NLA ‘ Archiving the music web’ Music Council of Australia Annual Assembly 28 September 2009 Paul Koerbin Manager Digital Archiving.
RDA – The SLQ experience Karen Stone 12 May 2014.
“Would You Like to Play a Game?” :: Megan Winget :: University of Texas at Austin A Review of Challenges and Current Practice in Game-Related Collections.
Different approaches to digital preservation Hilde van Wijngaarden Digital Preservation Officer Koninklijke Bibliotheek/ National Library of the Netherlands.
Publishing Digital Content to a LOR Publishing Digital Content to a LOR 1.
ORGANIZING AND STRUCTURING DATA FOR DIGITAL PROJECTS Suzanne Huffman Digital Resources Librarian Simpson Library.
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
DATA CURATION & PRESERVATION CSG Fall Meeting, Princeton Mairéad Martin Penn State September, 2012.
How to build your own Dark Archive (in your spare time) Priscilla Caplan FCLA.
Amos Kujenga ADLSN Training Coordinator Addis Ababa, Ethiopia 5 – 7 November 2014 Introduction To Digital Libraries and Repositories.
The Legislative Library of Ontario’s Ontario Documents Repository Road to Partnership.
1 Digital Archives - Past, Present & Future Issues Anne Van Camp Manager, Member Initiatives The Research Libraries Group Digital Archives Directions (DADs)
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
PREMIS Rathachai Chawuthai Information Management CSIM / AIT.
Lifecycle Metadata for Digital Objects September 11, 2002 Major archival and digital library metadata schemes.
Digitization An Introduction to Digitization Projects and to Using the Montana Memory Project.
Building Bridges for Sharing Media. ✪ Five Colleges ✪ Best Practices for Sharing Media Best Practices Copyright Resources Collection Development Shared.
Electronic Scriptorium, Ltd. AIIM Minnesota Chapter Metadata and Taxonomy Presentation Copyright Electronic Scriptorium, Ltd. All rights reserved, 1991.
International Seminary on Digitisation: Experience and Technology 11 th May 2004 | National Library | Lisbon – Portugal DIGITAL ARCHIVE OF PORTUGUESE ART.
The Canadian Information Network for Research in the Social Sciences and Humanities Tim Au Yeung and Mary Westell Libraries.
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
Metadata and Documentation Iain Wallace Performing Arts Data Service.
Discovery Metadata for Special Collections Concepts, Considerations, Choices William E. Moen School of Library and Information Sciences Texas Center for.
Introduction to metadata
ETD2006 Preserving ETDs With D.A.I.T.S.S. FLORIDA CENTER FOR LIBRARY AUTOMATION FC LA PAPER AUTHORS: Chuck Thomas Priscilla.
Metadata for digital preservation: a review of recent developments Michael Day UKOLN, University of Bath ECDL2001, 5th European Conference.
Selene Dalecky March 20, 2007 FDsys: GPO’s Digital Content System.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
The Importance of Standards in Digital Preservation Tina Norris Kayla Payne Jennifer
Digitization & Digital Preservation
Preservation metadata and the Cedars project Michael Day UKOLN: UK Office for Library and Information Networking University of Bath
No Longer Under Our Control? The Nature and Role of Standards in the 21 st Century Library William E. Moen School of Library and Information Sciences Texas.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
Open Access & Institutional Repositories, Accra June 2007 Metadata and e-preservation Dr D Peters DISA: Digital Innovation South Africa.
Institutional Repositories July 2007 DIGITAL CURATION creating, managing and preserving digital objects Dr D Peters DISA Digital Innovation South.
Cedars work on metadata Michael Day UKOLN, University of Bath Cedars Workshop Manchester, February 2002.
Santi Thompson - Metadata Coordinator Annie Wu - Head, Metadata and Bibliographic Services 2013 TCDL Conference Austin, TX.
How to Implement an Institutional Repository: Part IV A NASIG 2006 Pre-Conference May 4, 2006 Policy Issues.
Data Wrangling: Developing Local Best Practice for Born Digital Metadata Tracy Popp, Digital Preservation Coordinator Ayla Stein, Metadata Librarian University.
Data Management and Digital Preservation Carly Dearborn, MSIS Digital Preservation & Electronic Records Archivist
2/26/2004 Dan Swaney 1 Preservation Metadata and the OAIS Information Model A Metadata Framework to Support the Preservation of Digital Objects A review.
CENTRAL/WESTERN MASSACHUSETTS AUTOMATED RESOURCE SHARING Digitization GOALS & THEIR LOGISTICS Michael J. Bennett Digital Initiatives Librarian C/WMARS,
Data Stewardship Lifecycle A framework for data service professionals Protectors of data.
Practical Aspects of Preservation Peter Simpson Development Officer Arts and Humanities Data Service.
Digital Asset Management at Michigan Tech
FLORIDA CENTER FOR LIBRARY AUTOMATION
Building A Repository for Digital Objects
Bentley Project Reel Digitization Bentley Historical Library t
How to Implement an Institutional Repository: Part IV
Users and Digital Collections
Metadata to fit your needs... How much is too much?
The Bentley Digital Media Library
Bentley Audio Digitization
Presentation transcript:

“SIPS, DIPS and Trips: How we will know if we've collected enough, or the right, metadata?” George Blood Audio, LP Safe Sound Archive Intellectual Access to Preservation Metadata Interest Group American Library Association June 2010

Definition by ALA PARS Digital Preservation: “Digital preservation combines policies, strategies and actions to ensure access to reformatted and born digital content regardless of the challenges of media failure and technological change. The goal of digital preservation is the accurate rendering of authenticated content over time.”

In the words of Grace Hopper.. “It's easier to ask forgiveness than it is to get permission” “A ship in a harbor is safe, but that is not what a ship is built for” “From then on, when anything went wrong with a computer, we said it had bugs in it” “You manage things; you lead people”

"The great thing about standards is that there are so many to choose from."

Standards are like toothbrushes. Everyone agrees they're desirable…

but nobody wants to use someone else's. Standards are like toothbrushes. Everyone agrees they're desirable…

Why are we collecting all this metadata? To provide for discovery To manage the files To provide provenance To provide authenticity Etc.

Metadata = Cataloging and Description How much is enough? Is it possible to have too much? Why do we need more than we did before? –Are we moving the goal posts? –To what extent are our neuroses about digital preservation a reflection of our failures in analog preservation? –Is more metadata less product? By doing “better” for one object are we preserving less overall? Has anyone asked the users what they need?

Organizing metadata “Standards” Toothbrushes

What is a standard? How widely adopted? If everyone is doing something... is that good enough to be a “standard”? Does a standard have to be perfect? Does one size fit all? If there’s a standard and no one uses it, what’s it matter? What are the implications if there’s a standard and it is “locally modified”? If you make your own “standard”, in what ways does this enhance or inhibit preservation and long-term access? –Aren’t we taught to avoid proprietary solutions? Why not for metadata?

SIPS: The State of the Art

Oberlin metadata

NYPL - LPA metadata

UMichigan RFI

SI AAA Metadata

SI AAA Second Project

Sample Rate: Bit Depth: 24 Duration: 0:42:19 INFO Name: Hess, Thomas B. "The Breakthrough of Abstract Expressionism." INFO Artist: INFO Date: INFO Archival Location: Smithsonian Institution Libraries, Hirshhorn Museum Library INFO Copyright: Material may be protected by copyright. Restrictions may apply. BEXT Description: Hess, Thomas B. "The Breakthrough of Abstract Expressionism." Lecture at NGA, : 0001, File Identifier; HMSG0001A-B, Tape Identifier BEXT Originator: Hirshhorn Museum Library BEXT Originator Reference: BEXT Origination Date: BEXT Time Reference: 0 BEXT Version: 1 BEXT Coding History: A=ANALOG,M=stereo,T=Nakamichi_Dragon; 09095; TDK_C90 A=PCM,F=96000,W=24,M=stereo,T=PrismSound; ADA-8XR; A/D A=PCM,F=96000,W=24,M=dual-mono,T=MetricHalo; ULN-2; DIO A=PCM,F=96000,W=24,M=stereo,T=SoX14.1; DAE Sample Rate: Bit Depth: 24 Duration: 0:56:32 INFO Name: INFO Artist: INFO Date: INFO Archival Location: INFO Copyright: BEXT Description: Oral history interview with Tony Rosenthal, 1968 May 10-June 29.; Tony; Sevim; 1968 May 10-June 29 BEXT Originator: Smithsonian Institution BEXT Originator Reference: Archives of American Art BEXT Origination Date: BEXT Time Reference: 0 BEXT Version: 1 BEXT Coding History: A=ANALOG,M=mono,T=Revox_A700; 13652; Audiotape_1251 A=PCM,F=96000,W=24,M=mono,T=PrismSound; ADA-8XR; A/D A=PCM,F=96000,W=24,M=mono,T=MetricHalo; ULN-2; DIO A=PCM,F=96000,W=24,M=mono,T=SoX14.1; DAE SI Hirshhorn and SI AAA

CUL METS

How will any of this provide for discovery, management, provenance, etc? It all has to be done manually. It is just as much work to create software tools to read the metadata as to make it. It costs more to do the metadata work on some projects than the digitization. What will be the cost to reformat the metadata when the digital file is migrated?

Except MY Metadata Open Source! Open Standards!! Interoperability!!!

DIPs: Let’s get religion

A return to basics When does a record end and context begin? When does the archive end and the research begin? What is the (end) goal of metadata? What is the end (goal) of metadata?

Ernie Ingles “Long term preservation of information has plagued mankind since we first etched images into stone tablets. And in many ways it’s been downhill every since.” “We should think of preservation with a 500 year time horizon.”

Quakerism 101

Keep It Simple, Stupid K.I.S.S. Keep It Stupid Simple

Pareto’s Principle 80% of effect comes from 20% of the causes –“80% of your revenue comes from 20% of your clients” –“80% of a project can be completed with 20% of your time” –“80% of total circulation comes from 20% of the books” –“80% of knowledge can be acquired with 20% of the information”

Short Record Dublin Core MARC

Jun June 23, 2010 Etc.

Date field conversion, Date to number, On Mac, PC, FMP, Different Version

Sample Rate: Bit Depth: 24 Duration: 0:42:19 INFO Name: Hess, Thomas B. "The Breakthrough of Abstract Expressionism." INFO Artist: INFO Date: INFO Archival Location: Smithsonian Institution Libraries, Hirshhorn Museum Library INFO Copyright: Material may be protected by copyright. Restrictions may apply. BEXT Description: Hess, Thomas B. "The Breakthrough of Abstract Expressionism." Lecture at NGA, : 0001, File Identifier; HMSG0001A-B, Tape Identifier BEXT Originator: Hirshhorn Museum Library BEXT Originator Reference: BEXT Origination Date: BEXT Time Reference: 0 BEXT Version: 1 BEXT Coding History: A=ANALOG,M=stereo,T=Nakamichi_Dragon; 09095; TDK_C90 A=PCM,F=96000,W=24,M=stereo,T=PrismSound; ADA-8XR; A/D A=PCM,F=96000,W=24,M=dual-mono,T=MetricHalo; ULN-2; DIO A=PCM,F=96000,W=24,M=stereo,T=SoX14.1; DAE

1.Achieve consensus on a standard 2.K.I.S.S. 3.Expose more complexity only as needed

Layer 1: Required Layer 2: Recommended Layer 3: Optional Conformance to Standards within the model

How much is enough? How much is being left behind? - 80% of information is available in 20% of the data - 80% isn’t good enough If we apply Pareto to the remaining information, the Next 20% of effort yields 80% of the remaining Information. 80% of 20% is 16% First 80% plus the next 16% is 96% of total information.

Layer 1: Required Layer 2: Recommended Layer 3: Optional Conformance to Standards within the model Layer 1: Consensus Layer 2: Structured Variety Layer 3: Whoopie!

ALA Definition of Digital Preservation Layer 1: Short, clear, quick Layer 2: Most useful in most circumstances Layer 3: Everything to everybody Parallel to Definition of Digital Preservation

Challenge to the Group: (a la Definition of Digital Preservation) - Convene a Task Force - Develop standards for DIPs - Present version 0.9 (draft) at this Interest Group - at ALA MidWinter 2011