The Data Capacitor and Digital Libraries at IU Jon Dunn Associate Director for Technology IU Digital Library Program February 22, 2006.

Slides:



Advertisements
Similar presentations
Digital Music and Audio Projects at Indiana University Jon Dunn Digital Library Program Indiana University August 16, 2001.
Advertisements

Beyond the Google Book: the Future of the Digital Library Cory Snavely Library IT Core Services manager University of Michigan April 20, 2010.
Data Publishing Service Indiana University Stacy Kowalczyk April 9, 2010.
E-Content Service Group Virtual Meeting Digital Preservation: How to Get Started.
An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
Digital Collections: Storage and Access Jon Dunn Assistant Director for Technology IU Digital Library Program
The UM Libraries’ Frost Concert Archive Documenting the Performance History of the University of Miami Frost School of Music Amy Strickland University.
Funded by: © AHDS Sherpa DP – a Technical Architecture for a Disaggregated Preservation Service Mark Hedges Arts and Humanities Data Service King’s College.
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
Digitizing and Delivering Audio and Video Jon Dunn Jenn Riley IU Digital Library Program Digital Library Brown Bag Series February 11, 2005.
Technology Bootcamp January 18, 2014 Large-Scale Digital Libraries Digitization Process Krystyna K. Matusiak, Ph.D. Assistant Professor Library & Information.
National Archives of Australia Digital Preservation Update
HydraDAM2 NEH funding January 2015 – December 2017 Builds on existing WGBH HydraDAM Open source Hydra ‘head’ Preservation-oriented digital asset management.
Variations On Video project update DLF Fall Forum 2010 Jon Dunn, Indiana University Claire Stewart, Northwestern University November 2, 2010.
Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales
Building a Fedora Architecture to Support Diverse Collections Jon Dunn Ryan Scherle Digital Library Program Indiana University.
EVIA Digital Archive Project Jon W. Dunn and William G. Cowan Digital Library Program Indiana University IU Digital Library Brown Bag Series November 19,
Columbia Digital Preservation Planning & Implementation Status Report, August 2010.
WORKFLOWS AND OTHER CONSIDERATIONS FOR DIGITIZATION  Steve Bingo  Processing Archivist Washington State University Libraries  Alex Merrill  Assistant.
ALI Digital Library Workshop Creating Digital Content: Digitization Jenn Riley Digital Media Specialist IU Digital Library Program
DLP Forum | March 31, 2003 Conversion Issues: Digitization Jenn Riley Digital Media Specialist IU Digital Library Program.
Reexamining Digital Library Infrastructure at IU Jon Dunn, Ryan Scherle, Eric Peters Indiana University Digital Library Program IU Digital Library Brown.
Digital Preservation in the IU Libraries Jon Dunn Stacy Kowalczyk IU Digital Library Brown Bag Series September 17, 2008.
Recordkeeping for Good Governance Toolkit Digital Recordkeeping Guidance Funafuti, Tuvalu – June 2013.
11-15 April 2011 Mauritius Institute of Health S.S.Pillai
Core Issues in Digital Preservation: Storage and Maintenance Jacob Nadal, Preservation Officer UCLA Library.
Sound Directions Digital Preservation and Access for Global Audio Heritage Indiana University Harvard University.
Challenges of Digital Media Preservation Karen Cariani, Director Media Library and Archives Dave MacCarn, Chief Technologist.
OAIS Open Archival Information System. “Content creators, systems developers, custodians, and future users are all potential stakeholders in the preservation.
Fedora Content Models for the National Science Digital Library Data Repository Fedora User’s Group Meeting Copenhagen, September 28, 2005 Carl Lagoze Cornell.
OCLC Online Computer Library Center Digital Preservation with OCLC Digitization Standards: Issues & Updates Taylor Surface, OCLC.
Digitizing Photographs For Sustainable Heritage Workshop, June 12-15, 2014 By Steven Bingo Project Archivist, Washington State University.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
Libraries, Archives, and Digital Preservation: The Reality of What We Must Do Leslie Johnston Acting Director, National Digital Information Infrastructure.
Music Digital Libraries Jon Dunn IU Digital Library Program April 9, 2002.
EVIA Digital Archive New Tools William G. Cowan Mike Durbin Digital Library Program EVIA Digital Archive DLP Brown Bag 20 September 2006.
The FCLA Digital Archive Joint Meeting of CSUL Committees, 2005.
The Canadian Information Network for Research in the Social Sciences and Humanities Tim Au Yeung and Mary Westell Libraries.
Digital Preservation: Current Thinking Anne Gilliland-Swetland Department of Information Studies.
Digital preservation activities at the NLW Sally McInnes 18 September 2009.
Video Annotation for Omeka William G. Cowan Associate Director, Digital Libraries System Development.
Institute Repositories and Digital Preservation : Assessing Current Practices at Research Library Rathachai Chawuthai Information.
Metadata for digital preservation: a review of recent developments Michael Day UKOLN, University of Bath ECDL2001, 5th European Conference.
Technical Update 2008 Sandy Payette, Executive Director Eddie Shin, Senior Developer April 3, 2008 Open Repositories 2008, Fedora User Group.
VITAL at the National Library of Wales Glen Robson
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
Improving Description through Collaboration: The Ethnomusicological Video for Instruction & Analysis Digital Archive Music Library Association, February.
April 14, 2005MIT Libraries Visiting Committee Libraries Strategic Plan Theme III Work to shape the future MacKenzie Smith Associate Director for Technology.
Storage Why is storage an issue? Space requirements Persistence Accessibility Needs depend on purpose of storage Capture/encoding Access/delivery Preservation.
Sharing Digital Scores: Will the Open Archives Initiative Protocol for Metadata Harvesting Provide the Key? Constance Mayer, Harvard University Peter Munstedt,
PRESERVATION IN A DIGITAL WORLD Presented By: Darrell Garwood Imaging Lab Manager Library and Archives Division Kansas State Historical Society
Al Cornish, Systems Librarian Washington State University Libraries Preserving Access to Multimedia Collections.
Portico’s “d-collections” preservation service Stephanie Orphan Positive trends in sustainability? Emerging approaches to archiving commercial databases.
Digital Library Storage Strategies Robert Cartolano, Director Library Information Technology Office November 14, 2008.
Institutional Repositories July 2007 DIGITAL CURATION creating, managing and preserving digital objects Dr D Peters DISA Digital Innovation South.
Digital Production Pipeline
Chang, Wen-Hsi Division Director National Archives Administration, 2011/3/18/16:15-17: TELDAP International Conference.
Libraries in the digital age Collection & preservation for generational access part two The LOCKSS Program.
Digital Preservation What, Why, and How? Dan Albertson’s Digital Libraries Class April 13, 2016 Jody DeRidder Head, Metadata & Digital Services University.
EVIA Digital Archive Technical Overview EVIA Digital Archive DLP Brown Bag: 7 December 2005.
Building A Repository for Digital Objects
Digital Archiving & Preservation : How to compare and contrast
Avalon's Role in the Digital Collections Ecosystem
Joseph JaJa, Mike Smorul, and Sangchul Song
Bentley Project Reel Digitization Bentley Historical Library t
Jon Dunn, Indiana University Marcel LaFlamme, Rice University
Jon W. Dunn Assistant Dean for Library Technologies
Bentley Audio Digitization
Presentation transcript:

The Data Capacitor and Digital Libraries at IU Jon Dunn Associate Director for Technology IU Digital Library Program February 22, 2006

Scientific Data vs. Traditional Digital Library Content Scientific instruments creating data  Digitization creating digital files Post-processing of data  Quality control and derivative creation Archiving of data  Archiving Metadata  Metadata

Steps in a Digital Library Ingestion Workflow Transfer  Digitization Quality control Processing Derivative creation Archiving

Digital Imaging Workflow TIFF File JPEG files for web delivery Archival Storage System/ Preservation Repository Picture Scan Generate derivatives Archive QC

Audio Digitization Workflow WAV File MP3 files for web delivery Archival Storage System/ Preservation Repository LP Digitize Generate derivatives Archive QC

Digital Library Infrastructure Reengineering Project Began August 2005 Funded for two years Implement a coherent technical infrastructure for digital libraries at IU Content storage – metadata storage – preservation – delivery – interoperability Using Fedora digital repository as a basis 

OAIS: Open Archival Information System Conceptual framework for an archival system dedicated to preserving and maintaining access to digital information over the long term Origins in space science community Discusses interactions that producers, consumers, and managers have with a repository Basis for much current thinking on repositories in digital library community  OCLC/RLG Trusted Digital Repositories: Attributes and Responsibilities  RLG/NARA Audit Checklist for Certifying Digital Repositories

OAIS Reference Model

Digital Library Infrastructure: Three Kinds of Storage Needed Working storage  For quality control, post processing, derivative generation  Fast, convenient, secure network access from wide variety of platforms  Temporary Archival storage  Secure, long-term storage  OK if access is slow  Preservation integrity checking  Migration to handle media and file format obsolescence Delivery storage  Fast, Web accessible, secure (in some cases)

Digital Library Infrastructure: Three Kinds of Storage Needed Working storage  DLP servers?  Data capacitor? Archival storage  Massive Data Storage Service – MDSS? Delivery storage  DLP servers

Some Numbers…

Local sheet music scanning 20MB/page x 20 pages/hour x 8 hours/day = 3.2 GB/day Time to process: about 3 hours Not a problem

But… Vendor Outsourced Digitization Items shipped to vendor for digitization Images returned on CD-R, DVD-R, or portable hard drive Hypothetical sheet music example: 5000 titles x 6 pages/title x 20MB/page = 600 GB at once  Time to process: days

Phase One P45 Camera Back 39 megapixel digital back for Hasselblad medium format camera 39,052,992 pixels x 48 bits/pixel = 224MB/image 10 images/hour x 8 hours/day = 18 GB/day 30 images/hour x 8 hours/day = 54 GB/day

Video = Really Scary Uncompressed 4:4:4 video  720 x 486 resolution = 349,920 pixels per frame  349,920 pixels x 10 bits/sample x 3 samples/pixel = 10,497,600 bits per frame  10,497,600 bits/frame X frames/second = 314,613,072 bits per second  314,613,072 bps x 3600 seconds = ~ GB/hour  For 1920x1080 HDTV, more like 840 GB/hour

EVIA Digital Archive Ethnomusicological Video for Instruction and Analysis Video digitized at University of Michigan Concentrated period of production – couple of months 12 scholars contribute 10 hours/video each 12 scholars x 10 hours x 142 GB/hour = 17 terabytes

More Information… Data Capacitor  Steve Simms  Jon Dunn 