Automated Archiving of DVD Content Esteva, Vega, Nieto, Scott, Gunnels, Kumar, Lamphear, Henriksen, Lee, Martin TCDL 2013.

Slides:



Advertisements
Similar presentations
E-Content Service Group Virtual Meeting Digital Preservation: How to Get Started.
Advertisements

1 Metadata Tools for JISC Digitisation Projects of still images and text Ed Fay BOPCRIS, Hartley Library University of Southampton.
Long-Term Preservation. Technical Approaches to Long-Term Preservation the challenge is to interpret formats a similar development: sound carriers From.
OCLC Digital Archive Overview Judith Cobb LIPA Meeting July 2006.
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
SOFTWARE PRESENTATION ODMS (OPEN SOURCE DOCUMENT MANAGEMENT SYSTEM)
Digital Video Archiving. ViArchive Overview ViArchive provides user friendly solutions for… – uploading video clips with metadata (searchable file info.
The Academy of Motion Picture Arts & Sciences Building an Interim Digital Preservation System Nancy Silver Digital Archival Program Manager Science and.
Chronopolis: Preserving Our Digital Heritage David Minor UC San Diego San Diego Supercomputer Center.
© 2010 Microsoft Corporation. All rights reserved. Quality Assurance: Towards Tools for Characterizing and Comparing Digital Documents Natasa Milic-Frayling.
Internet Resources Discovery (IRD) IBM DB2 Digital Library Thanks to Zvika Michnik and Avital Greenberg.
Active Data Curation in Libraries: Issues and Challenges ASEE ELD Presentation June 27, 2011 William H. Mischo & Mary C. Schlembach.
Server Roles and Features.NET Framework 3.51.NET Framework 4.5 IIS Web Server IIS Default Document IIS Directory Browsing IIS HTTP Errors.
Incompatible or Interoperable? A METS bridge for a small gap between two digital preservation software packages Lucas Mak Metadata & CatalogLibrarian
Talend 5.4 Architecture Adam Pemble Talend Professional Services.
JMU Online Video Collection Background and Technical Processes for the VIVA Multimedia Collections Task Force Jeff Clark, Grover Saunders James Madison.
Putting it all together for Digital Assets Jon Morley Beck Locey.
Microsoft Share Point 2007 Lela Castaneda. Microsoft Office SharePoint Designer 2007 top 10 benefits 1)Be more productive with next-generation Microsoft.
Metadata standards, tools and processes for audio preservation at the British Library: An overview of new systems for audio description, preservation and.
The Digital Motion Picture Archive Framework Project © 2008 AMPAS Academy of Motion Picture Arts and Sciences Science and Technology Council Nancy Silver,
WebArchiv Czech Web Archive IIPC 2007, Paris.
®® Microsoft Windows 7 for Power Users Tutorial 5 Comparing Windows 7 File Systems.
Ingest and Dissemination with DAITSS Presented by Randy Fischer, Programmer, Florida Center for Library Automation, University of Florida DigCCurr2007.
Case History: Library of Congress Audio-Visual Prototyping Project METS Opening Day October 27, 2003 Carl Fleischhauer Office of Strategic Initiatives.
“Filling the digital preservation gap” an update from the Jisc Research Data Spring project at York and Hull Jenny Mitcham Digital Archivist Borthwick.
MSS Technologies and the AIIM Grand Canyon Chapter present: Electronic Document Management System Needs Analysis.
Persistent Digital Archives and Library System (PeDALS) SC Department of Archives and History.
1 A journey of a thousand miles begins with a single step. Chinese Proverb.
DM_PPT_NP_v01 SESIP_0715_AJ HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann Gerd Heber, John Readey, Joel Plutchak The HDF Group HDF.
CHAPTER FOUR COMPUTER SOFTWARE.
Kent Norsworthy, LLILAS and Benson Latin American Collection Jade Diaz, University of Texas Libraries2012 Texas Conference for Digital Libraries Kent Norsworthy,
OCLC Online Computer Library Center Kathy Kie December 2007 OCLC Cataloging & Metadata Services an introduction.
Sound Directions Digital Preservation and Access for Global Audio Heritage Indiana University Harvard University.
PREMIS Rathachai Chawuthai Information Management CSIM / AIT.
Implementor’s Panel: BL’s eJournal Archiving solution using METS, MODS and PREMIS Markus Enders, British Library DC2008, Berlin.
Digital Preservation: Current Thinking Anne Gilliland-Swetland Department of Information Studies.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Management of Distributed Data Reagan W. Moore.
Habing1 Integrating PREMIS and METS PREMIS Tutorial Implementers’ Panel June 21, 2007, 9:00-5:30 Library of Congress, Jefferson Building, Whittall.
Persistent Digital Archives and Library System (PeDALS)
VITAL at the National Library of Wales Glen Robson
OAIS: From Requirements to Reality at OCLC FLICC / CENDI Symposium, Dec Pam Kircher Product Manager, Digital Archive OCLC Digital & Preservation.
Selene Dalecky March 20, 2007 FDsys: GPO’s Digital Content System.
Carcanet Case Study Fran Baker, John Rylands University Library University of Manchester SPRUCE event 19 January 2012.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Preserving Electronic Mailing Lists as Scholarly Resources: The H-Net Archives Lisa M. Schmidt
| nectar.org.au NECTAR TRAINING Module 9 Backing up & Packing up.
A Technical Overview Bill Branan DuraCloud Technical Lead.
Implementing PREMIS in DigiTool Michael Kaplan ALA 2007 Update.
DAITSS and the Florida Digital Archive Priscilla Caplan Florida Center for Library Automation iPRES 2006.
Collection Management Systems
File and File Systems Compiled by IITG Team Need to be reorganized and reworded.
Where are my files? Discoveries in establishing a digital archive workflow Sally McDonald Archivist/Librarian Western History/Genealogy, Denver Public.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Chang, Wen-Hsi Division Director National Archives Administration, 2011/3/18/16:15-17: TELDAP International Conference.
© 2012 IBM Corporation IBM Linear Tape File System (LTFS) Overview and Demo.
A strategic view of document and digital object management for the University of the Witwatersrand, Johannesburg Prof Derek W. Keats Deputy Vice Chancellor.
Ingest and Dissemination with DAITSS
APTrust and Georgetown University Library
Fedora at UT Libraries Melanie Cofield, Metadata Coordinator
Building A Repository for Digital Objects
DAITSS and the Florida Digital Archive
An Introduction to Tessella and The Safety Deposit Box Platform
Digital Asset Management Part 15: Summary
Bentley Project Reel Digitization Bentley Historical Library t
Better than it was Finding what works for processing born-digital archives at the Bentley Historical Library Mike Shallcross U-M Bentley Historical Library.
Automated Archiving of DVD Content
Integrating PREMIS and METS
Managing Your Files.
Library Technology Conference: Building Exhibits
Data Management Components for a Research Data Archive
Presentation transcript:

Automated Archiving of DVD Content Esteva, Vega, Nieto, Scott, Gunnels, Kumar, Lamphear, Henriksen, Lee, Martin TCDL 2013

Motivation and perspective Find a file based preservation solution for video art in DVD media Create a SIP including files to fulfill museum functions Availability of high performance distributed storage Next generation display systems

Considerations Study the role of the DVD in the works technical history Options for conversion Usage of the DVD in the museum

Considerations UT Research Storage Infrastructure and Services Example of next generation display systems

Workflow requirements Only for DVD-based works Transcoding should not involve further compression Fixity to establish authenticity of disk image and production quality/preservation master Quality metric to assess the quality of the transcoding Metadata and provenance documentation Easy to implement SIP contents should fulfill preservation and access functions

Preservation roadmap Best practices in video and digital preservation – Few references for DVD preservation ISO disk images as identical copy of the DVD – Preservation master – Base for metadata extraction and further conversions – Not playable High quality access file (also preservation file) – Matroska container and ffv1 codec Quality metric – Structural Similarity Index (SSIM) Documentation – metadata and processing provenance

Workflow step by step Metadata extraction ISO disk image Checksum Transcoding Checksum Conversion to other access files Quality metric Visual evaluation

Quality Control Full reference quality metric Structural Similarity Index Aim for result of 1 Algorithm available in different programming languages Modeled closely to human perception Visual inspection of the Matroska file

Software choices Informed by testing Open source, command line tools Support technical documentation

Testing and Implementing the Automated Workflow at UT Libraries

Audiovisual Digitization Unit DVD Archiving – Preservation Reformatting – Digital Preservation Example DVD archiving projects – Benson Latin American Collection – Fine Arts Library Streaming Video Project UT Libraries and Audiovisual Digitization

DVD Archiving Requirements ISO image of original Streaming MP4 DVD access copy 2 checksums Descriptive metadata Source metadata Technical metadata

Current Workflow Create decrypted ISO disc image using DVDShrink Quality control of ISO file Move and delete file from local workstation to remote server Update Project Record with transfer notes and status Compile ISO file, MP4 file, and all XML files into a single directory folder Run BagIt script to contribute verifyvalid data, checksum data, manifest and tagmanifest data, as well as well baginfo data. Systems confederates XML into single UTVideoMD XML document Compile descriptive metadata as MODS XML from catalog MARC record using MARCedit Compile source metadata as XML from Digitization record using Oxygen Compile technical metadata as XML from ISO and MP4 using MediaInfo Move all XML documents to remote server Create streaming MP4 from main ISO media stream using Handbrake Quality control of MP4 file Move and delete file from local workstation to remote server Create physical derivatives from ISO Quality control of physical derivatives Label physical derivatives Send physical derivatives and originals to cataloging or return to branch and denote return in Project Record SIP BagIt Copy

Partnering with TACC TACC approached UT Libraries to expand testing and implementation of automated workflow Benefits of automation for the Libraries: – Streamline and unify current workflow – Less staff-intensive – Reduce opportunities for error

Testing and Results Test DVDs – 18 single-stream, unencrypted discs – 3 multi-stream, encrypted discs Workstation – Mac Pro running OS Mountain Lion Results – Encryption errors – MP4 streaming compatibility – Saving files to host directory – SSIM and Python Library limitations – Checksums – Enthusiasm for the next phase!

UT Libraries Modifications Addition of drag and drop functionality – DVD – Project Folder Addition of web-optimized MP4 Elimination of Matroska file

Future Modifications Decryption 2 Checksums SSIM quality measure Expand metadata collection UTVideoMD Integrate BagIt

Manual Workflow Create decrypted ISO disc image using DVDShrink Quality control of ISO file Move and delete file from local workstation to remote server Update Project Record with transfer notes and status Compile ISO file, MP4 file, and all XML files into a single directory folder Run BagIt script to contribute verifyvalid data, checksum data, manifest and tagmanifest data, as well as well baginfo data. Systems confederates XML into single UTVideoMD XML document Compile descriptive metadata as MODS XML from catalog MARC record using MARCedit Compile source metadata as XML from Digitization record using Oxygen Compile technical metadata as XML from ISO and MP4 using MediaInfo Move all XML documents to remote server Create streaming MP4 from main ISO media stream using Handbrake Quality control of MP4 file Move and delete file from local workstation to remote server Create physical derivatives from ISO Quality control of physical derivatives Label physical derivatives Send physical derivatives and originals to cataloging or return to branch and denote return in Project Record SIP BagIt Copy

SIP BagIt Automated Workflow

Thank You Maria Esteva and Karla Vega Texas Advanced Computing Center Vandy Henriksen, Jennifer Lee, Wendy Martin University of Texas Libraries Sue Ellen Jeffers and Meredith Sutton Blanton Museum of Art Bethany Scott Charlotte Mecklenburg Library Kertana Kumar University of Texas College of Natural Sciences Summer Gunnels University of Texas Cockrell School of Engineering