The Library of Congress Audio-Visual Prototyping Project Carl Fleischhauer Office of Strategic Initiatives, Library of Congress Sound Savings.

Slides:



Advertisements
Similar presentations
E-Content Service Group Virtual Meeting Digital Preservation: How to Get Started.
Advertisements

Strategic issues for digital projects... …or, what are we doing here?
Pulling it all together… with thanks to Sheila Anderson.
Long-Term Preservation. Technical Approaches to Long-Term Preservation the challenge is to interpret formats a similar development: sound carriers From.
October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
METS: An Introduction Structuring Digital Content.
DRS 2 one in a series of periodic updates Harvard University Library Andrea Goethals October 21, 2009 DRS = Digital Repository Service.
Digital Content Solutions Digital content management technology has transformed the way to manage content and knowledge, in this knowledge era. Research.
Audio Technical Metadata Importance of metadata to digital audio preservation Brief history of audio preservation standards Overview Kinds of Audio Metadata.
Motivation Application driven -- VoD, Information on Demand (WWW), education, telemedicine, videoconference, videophone Storage capacity Large capacity.
3.02C Multimedia Fair Uses Guidelines and Elements
Comp 1001: IT & Architecture - Joe Carthy 1 Review Floating point numbers are represented in scientific notation In binary: ± m x 2 exp There are different.
1 CS 502: Computing Methods for Digital Libraries Lecture 9 Conversion to Digital Formats Anne Kenney, Cornell University Library.
Multimedia for the Web: Creating Digital Excitement Multimedia Element -- Graphics.
Merrilee Proffitt e(X)literature / Digital Cultures Project April 2003 News from the Digital Library The Metadata Encoding and Transmission Standard; the.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
1 CS 502: Computing Methods for Digital Libraries Lecture 27 Preservation.
Image Formation and Digital Video
The British Library’s METS Experience The Cost of METS Carl Wilson
EMu and Archives NA EMu Users Conference – Oct Slide 1 EMu and Archives Experiences from the Canada Science and Technology Museum Corporation.
Metadata standards, tools and processes for audio preservation at the British Library: An overview of new systems for audio description, preservation and.
Skill Area 212 Introduction to Multimedia Internet and MultiMedia for SC 2.
Digital audio. In digital audio, the purpose of binary numbers is to express the values of samples that represent analog sound. (contrasted to MIDI binary.
Ingest and Dissemination with DAITSS Presented by Randy Fischer, Programmer, Florida Center for Library Automation, University of Florida DigCCurr2007.
Case History: Library of Congress Audio-Visual Prototyping Project METS Opening Day October 27, 2003 Carl Fleischhauer Office of Strategic Initiatives.
PeDALS Persistent Digital Archives & Library System Richard Pearce-Moses Deputy Director for Technology & Information Resources Arizona State Library,
Chapter 2, Exploring the Digital Domain
City of Seattle Office of the City Clerk Open Government = Access Challenges and Opportunities with Digital Records.
Case History: Library of Congress Audio-Visual Prototyping Project METS Opening Day (2003), Revised For the CUL Metadata Working Group July 22, 2004 Carl.
CSCI-235 Micro-Computers in Science Hardware Part II.
Digitisation of Archival and Manuscript Materials in Libraries Presentation by Martin Bradley.
WORKFLOWS AND OTHER CONSIDERATIONS FOR DIGITIZATION  Steve Bingo  Processing Archivist Washington State University Libraries  Alex Merrill  Assistant.
Digitization of the Federal Depository Library Program Judith C. Russell Superintendent of Documents & Managing Director, Information Dissemination “Electronic.
1 A journey of a thousand miles begins with a single step. Chinese Proverb.
Allison Schein.  Adobe Audition (  Recommended program, metadata creation and manipulation is easy and complete.
Implementing an Integrated Digital Asset Management System: FEDORA and OAIS in Context Paul Bevan DAMS Implementation Manager
Cataloging Sound Recordings with RDA
How to build your own Dark Archive (in your spare time) Priscilla Caplan FCLA.
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
Digital Reformatting and File Management Public Library Partnerships Project Sheila A. McAlister Director, Digital Library of Georgia and Sandra McIntyre.
Digitizing Photographs For Sustainable Heritage Workshop, June 12-15, 2014 By Steven Bingo Project Archivist, Washington State University.
DAITSS: Dark Archive in the Sunshine State Priscilla Caplan, Florida Center for Library Automation DCC Workshop on Long-term Curation within Digital Repositories.
Introduction to Interactive Media 03: The Nature of Digital Media.
Core Issues in Digital Preservation: Audio and Video Jacob Nadal, Preservation Officer UCLA Library.
Digital Preservation: Current Thinking Anne Gilliland-Swetland Department of Information Studies.
Introduction to metadata
Digital Recording. Digital recording is different from analog in that it doesn’t operate in a continuous way; it breaks a continuously varying waveform.
Best Practices for Digital Imaging and Metadata Roy Tennant The Library, University of California, Berkeley
ETD2006 Preserving ETDs With D.A.I.T.S.S. FLORIDA CENTER FOR LIBRARY AUTOMATION FC LA PAPER AUTHORS: Chuck Thomas Priscilla.
VITAL at the National Library of Wales Glen Robson
Selene Dalecky March 20, 2007 FDsys: GPO’s Digital Content System.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
Marwan Al-Namari 1 Digital Representations. Bits and Bytes Devices can only be in one of two states 0 or 1, yes or no, on or off, … Bit: a unit of data.
Media Types Information Systems can contain the following types of media: Sound, graphics, video & text.
CSCI-100 Introduction to Computing Hardware Part II.
Digital Collections Forum Doug Moncur AIATSIS September 2004.
Chapter 1 Background 1. In this lecture, you will find answers to these questions Computers store and transmit information using digital data. What exactly.
Author(s): Paul Conway, License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution.
COMP135/COMP535 Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman Chapter 2 Lecture 2 – Digital Representations.
Introduction to Interactive Media Interactive Media Raw Materials: Digital Data.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Preserving Digital Collections
Ingest and Dissemination with DAITSS
Digital Stewardship Curriculum
FLORIDA CENTER FOR LIBRARY AUTOMATION
DAITSS: Dark Archive in the Sunshine State
Multimedia: Digitised Sound Data
Bentley Project Reel Digitization Bentley Historical Library t
Metadata to fit your needs... How much is too much?
Robin Dale RLG OAIS Functionality Robin Dale RLG
Presentation transcript:

The Library of Congress Audio-Visual Prototyping Project Carl Fleischhauer Office of Strategic Initiatives, Library of Congress Sound Savings Conference University of Texas, Austin July 25, 2003 This slide show: lcweb.loc.gov/rr/mopic/avprot/SoundSavings03.ppt

National Audio-Visual Conservation Center New Library of Congress facility for the Motion Picture, Broadcasting, and Recorded Sound Division (M/B/RS) Facility funded by the Packard Humanities Institute Will be in Culpeper, Virginia, 70 miles from Washington Planned to go operational 2005

Audio-Visual Prototyping Project Collections from the M/B/RS division and the American Folklife Center at LC Emphasis: reformatting endangered materials, especially magnetic tapes and instantaneous discs Current work: audio Future activities: video, copyright MP3s, content from web sites Prototyping period:

Motive 1: Alternative Preservation Approach Shortcomings of conventional practice: reformatting onto analog magnetic tape 1 Short life expectancy 2 Generation loss with each copy 3 Cessation of manufacture of analog tape and tape recorders

Motive 1: Alternative Preservation Approach Desire to work in the digital realm Emerging issues –Deterioration of tangible born-digital, e.g., CD-Rs acquired by LC from music composer copyrights –Emerging issue: preserving intangible born-digital content, e.g., MP3s from copyright and other acquisitions

Motive 2: Provide Access Limited access, since most items protected by copyright or require consideration of folk performer prerogatives LC researchers on Capitol Hill, collections in Culpeper Possible future authorized remote research sites

Illustration : sample preserved item We want to reproduce the artifact as a whole This example is a Marine Corps recording from the South Pacific in WW II –Audio from a disc copy of an Amertape Recording Film original (film-with-grooves) –Images depict the film container and the disc label

Initial display of navigation tree and thumbnails

Close-up display of image & file-level metadata

Preservation Concept Content takes the form of information packages aka digital objects Information packages consist of data (e.g., audio and image files, ) and metadata

Preservation Concept Not a CD or DVD approach Packages managed in digital repository Repository is server and storage-system based Paradox: –Content at any given moment depends upon systems and media –Content must be system and media independent

Four issues 1. Selecting the target format for reformatting 2. Quality of the reformatted copy 3. Shaping the object/package and the importance of metadata 4. Longevity in “media-less” environment

Issue 1 Selecting the format

Selecting the format Disclosure –are specifications and tools available? Adoption –is the format already in wide use? Transparency –is encoding open to analysis with basic tools?

Selecting the format Self-documentation –does object include metadata that explains how to render or understand context? Fidelity –support for high resolution audio Sound field –support for stereo and/or surround sound

Audio formats Audio masters –Bitstream: PCM sampling, uncompressed –File format: WAVE (higher res) –One-bit-deep formats (e.g., SONY DSD) of interest but “ahead of the game” for us Service files –WAVE (lower res) and MP3

Image formats Image Masters –Bitstream: Uncompressed bitmapped –File format: TIFF S ervice copies –JPEGs

Issue 2 Quality of the Reformatted Copy

Key Parameters Sampling frequency –Render the waveform as “dots” –More dots contribute to greater accuracy, capable of rendering high frequency sounds –Expressed as kilocycles per second or kiloherz –Compare to spatial resolution for images –Higher “pixels or dots per inch” contribute to better clarity

Key Parameters Word length, bit depth –Greater bit depth means greater precision in locating the sample in terms of amplitude –Greater bit depth means greater capacity to represent dynamic range –Expressed as bits per sample –Compare to tonal resolution (color) for images –Higher “bits per pixel” mean more accurate color

Staff discussion of parameters... Consensus on word length –Everyone is sold that 24 bit is better than 16 –Based on listening, objective measurement possible –“Extra data will protect you when the original has wide or varying dynamics, or if an operator makes a mistake.” –Compare to imaging and a downstream benefit Master image at 12 or 16 bits per channel Manipulate for aesthetic effect, save at 8 bits No gaps in your histogram

Staff discussion of parameters... Less consensus on sampling frequency –Some of us thought this was the relevant question: “What is the range of frequencies we might expect in this item?” 78 rpm disc from the acoustic era –8-10 kilocycles per second, or less –Rule of thumb: digitally sample at 2x frequency –Will 25 kilocycles per second suffice? Folk music collector with a Nagra in 1970s –14-18 kilocycles per second –Will 44 or 48 kilocycles suffice?

Staff discussion of parameters... Engineers advocated sampling frequencies of 96 or even 192 kHz Discussion tended to look at practical production issues and possible downstream options Objective measurement is not relevant to some of these factors

Very high resolution desired because: –“There may be hard-to-hear harmonics that you won’t want to lose.” –“Copies with less noise and less distortion can more successfully be restored in a post-process.” –“In the future we’ll have better enhancement tools and post-processing, so save as much raw information as you can.” –“What if you need extra data to support certain types of resource discovery?” Staff discussion of parameters...

Inherent fidelity of the original items not decisive. Informal A-B listening comparisons were helpful but not conclusive. Proposal to carry out empirical comparison of restoration actions applied to a high-res and a medium-res master.

Audio resolution for prototyping project Result of preceding discussion: the engineers work at the upper limit of the tools they have Reformatted content –Audio masters 96 kHz/24 bit mono or stereo (some at 48/24) –Service files 44.1 kHz/16 bit WAVE 256 kbps MP3 (if stereo)

Image resolution for prototyping project Reformatted content –Borrow approach from other digitization projects –Image Masters lines/pixels per inch 24 bit color –S ervice copies Same-size JPEGs

Two Sidebars

Sidebar on practices Professional equipment –For example, professional analog-to-digital converters Some details –Masters as flat transfers, avoid/minimize cleanup –Copy mono discs with stereo cartridge, hope for future process to “find the best groove wall”

Sidebar on practices Professional workers –Supervise and perform expert work Work requires knowledge and skills with antique formats and new digital technology

Sidebar on practices Some ideas for the future –Include apprentice workers in work team –Sort originals by “transfer efficiency” category –Use expert systems to help monitor transfers, spot anomalies –For some categories, copy two or three items at once Inspired by –PRESTO project in Europe ( –I mage-based recovery from discs (

Sidebar on objective measurement Imaging: targets Audio: test tones Outputs from targets/tones measure the performance of equipment They do not measure actual “content” images or sounds directly.

Sidebar on objective measurement Tools and practices not mature, even for imaging Need performance measures for digital systems –You can’t believe your scanner when it says 300 ppi Measure what actually comes through the system –Imaging example: use modulation transfer function (MTF) as a yardstick for delivered spatial resolution –Pass-fail point not yet established for image reformatting projects

Sidebar on objective measurement Tentative use of standard ITU test sequences known as CCITT 0.33 –28-second series of tones to test satellite broadcast transmissions, mono and stereo –Recordings of the tones can be used to determine the frequency response, distortion, and signal-to-noise ratio produced in a given recording system –Pass-fail point not yet established for sound reformatting projects

Issue 3 Shaping the information package and the importance of metadata

Information package Complex entity with multiple parts Data and Metadata Data in this context means the audio, video, or image bitstreams Metadata includes –Descriptive –Administrative –Structural

Descriptive metadata in the AV project For object as a whole –Often copy of descriptive data in LC central catalog –MODS XML schema Optional additional descriptive metadata for individual parts of object –Song titles, artists for disc sides or cuts –Names of writers in manuscript file folder –MODS “related items”

Administrative metadata in the AV project Persistent identifier, “ownership” info Documentation of reformatting today and digital migration tomorrow About the source and actions taken to prepare items for digitization, e.g., clean, bake About the digitizing process Rights data or at least categorization of objects for management of access

Structural metadata in the AV project Relationships between parts of objects Example: long-playing record album –Box, front –Three discs, two sides each (audio segments) –Disc label (images) –Booklet, cover and 28 pages (images)

Illustration: three-lp-disc boxed set with booklet

Encoding the metadata AV project is using the emerging Metadata Encoding and Transmission Standard (METS)

METS XML output (partial) displayed in Internet Explorer

Added metadata for long-term preservation To support long term content management Examples: –“Fixity” info, e.g., checksums to monitor file changes –Pointers to documentation for file formats –Pointers to documentation of the hardware/software environment required to render files No practice yet in AV prototyping project See RLG-OCLC preservation metadata report –

Overall anxiety... Are we trying to capture too much metadata? Tools to automate the creation of metadata, especially administrative metadata, are critical

Issue 4 Longevity in a media-less environment

Future LC repository Intersection of the AV project and Culpeper center with LC-wide digital planning (NDIIPP) LC repository design will be in terms of the NASA Open Archival Information System (OAIS) reference model

PRODUCERSPRODUCERS ADMINISTRATION DATA MANAGEMENT ARCHIVAL STORAGE INGEST ACCESS CONSUMERSCONSUMERS PRESERVATION PLANNING Reference Model for an Open Archival Information System (OAIS) SIP: Submission information package

PRODUCERSPRODUCERS ADMINISTRATION DATA MANAGEMENT ARCHIVAL STORAGE INGEST ACCESS CONSUMERSCONSUMERS PRESERVATION PLANNING Reference Model for an Open Archival Information System (OAIS) AIP: Archival information package

PRODUCERSPRODUCERS ADMINISTRATION DATA MANAGEMENT ARCHIVAL STORAGE INGEST ACCESS CONSUMERSCONSUMERS PRESERVATION PLANNING Reference Model for an Open Archival Information System (OAIS) DIP: Dissemination information package

PRODUCERSPRODUCERS ADMINISTRATION DATA MANAGEMENT ARCHIVAL STORAGE INGEST ACCESS CONSUMERSCONSUMERS PRESERVATION PLANNING Reference Model for an Open Archival Information System (OAIS) Current plan: The Culpeper facility will produce and submit packages to LC’s future digital repository.

While we wait for the OAIS- compliant repository... Continue to use UNIX-filesystem based storage Orderly file storage, masters segregated from service copies METS metadata stored for now as individual XML files Virtual information packages are “ready to submit” METS also supports end-user display

What about smaller archives and libraries? The digital approach to content preservation depends on significant computer infrastructure Will we have a few consortial repositories to serve many smaller archives? Who and how would such arrangements be made?

What about smaller archives and libraries? Holding action? For audio, make multiple CD-Rs or DVD-Rs? Write to data tape? LC is challenged to give good advice today

Web Sites LC audio-visual prototyping project – LC enterprise-wide digital preservation planning – Metadata Encoding and Transmission Standard (METS) –

Thank you...