Download presentation
Presentation is loading. Please wait.
Published byBelinda Powell Modified over 9 years ago
1
File storage is a lot like a basement closet... Image courtesy of Teemo, Master of Clowning Image courtesy of Life Magazine What happens when it's time to move???
2
Identification A digital object may belong in MANY potential virtual collections… … but it originated from ONE SINGLE ANALOG collection. Provenance trumps all! Slavery African Americans Sheet Music Tombigbee River Southern History … and more “Gum Tree Canoe,” Published by G.P. Reed (Boston: 1847). Wade Hall collection of Southern History and Culture, Hoole Special Collections, University of Alabama Libraries.
3
Hierarchical classes approach Holder ID: u0003 Collection ID: 0000023 Item ID: 0000007 Sequence ID: 0005 Archival File: u0003_0000023_0000007_0005.tif “Holder” and format type (example: Hoole Special Collections Manuscripts) Collection within those holdings (example: Gorgas Family Papers) Numbered item within that collection Sequence of delivery for pages (and subpages) Concatenated meaningful segments in filename; Same segments mirrored in file structure.
4
u0003_0001980_0000001 is the first digitized item in the MSS 1980 collection HOLDER ID COLLECTION ID
5
Create Clarity: Information at the Applicable Level Holder/Format Collection 1 Collection 2 Collection 3 Documentation Metadata Collection 2 Item 1 Item 2 Item 3 DocumentationMetadata
6
Dropping the Technical Metadata in… where it belongs Makes METS creation a Piece of Cake! (and redundant!) Is simply: u0003/ 0000002/ 0001/ The directory for u0003_0000002_0001.tif
7
storage area Simple, Clear Hierarchical Organization: Holder ID Collection ID Item ID Sequence ID
8
Bringing Content Up to the Level Of the WEB!!! Greater Usability and Access == Longer Life Images … ImageMagick: http://www.imagemagick.org (it’s free!) Protected archive area u0003 0000023 0000007 0005 u0003_0000023_0000007_0005.tif 0000007 0005 Thumb, mid-, and large-size derivatives Web accessible area Audio … LAME: http://lame.sourceforge.net OCR … TESSERACT: http://code.google.com/p/tesseract-ocr/
9
http://acumen.lib.ua.edu ACCESS! Via Acumen (also free!) XML schema agnostic No ingest No metadata modifications All content easily accessible Open to search engines
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.