Presentation is loading. Please wait.

Presentation is loading. Please wait.

File storage is a lot like a basement closet... Image courtesy of Teemo, Master of Clowning Image courtesy of Life Magazine What happens when it's time.

Similar presentations


Presentation on theme: "File storage is a lot like a basement closet... Image courtesy of Teemo, Master of Clowning Image courtesy of Life Magazine What happens when it's time."— Presentation transcript:

1 File storage is a lot like a basement closet... Image courtesy of Teemo, Master of Clowning Image courtesy of Life Magazine What happens when it's time to move???

2 Identification A digital object may belong in MANY potential virtual collections… … but it originated from ONE SINGLE ANALOG collection. Provenance trumps all!  Slavery  African Americans  Sheet Music  Tombigbee River  Southern History  … and more “Gum Tree Canoe,” Published by G.P. Reed (Boston: 1847). Wade Hall collection of Southern History and Culture, Hoole Special Collections, University of Alabama Libraries.

3 Hierarchical classes approach Holder ID: u0003 Collection ID: 0000023 Item ID: 0000007 Sequence ID: 0005 Archival File: u0003_0000023_0000007_0005.tif  “Holder” and format type (example: Hoole Special Collections Manuscripts)  Collection within those holdings (example: Gorgas Family Papers)  Numbered item within that collection  Sequence of delivery for pages (and subpages) Concatenated meaningful segments in filename; Same segments mirrored in file structure.

4 u0003_0001980_0000001 is the first digitized item in the MSS 1980 collection HOLDER ID COLLECTION ID

5 Create Clarity: Information at the Applicable Level Holder/Format Collection 1 Collection 2 Collection 3 Documentation Metadata Collection 2 Item 1 Item 2 Item 3 DocumentationMetadata

6 Dropping the Technical Metadata in… where it belongs Makes METS creation a Piece of Cake! (and redundant!)‏ Is simply: u0003/ 0000002/ 0001/ The directory for u0003_0000002_0001.tif

7 storage area Simple, Clear Hierarchical Organization:  Holder ID  Collection ID  Item ID  Sequence ID

8 Bringing Content Up to the Level Of the WEB!!! Greater Usability and Access == Longer Life Images … ImageMagick: http://www.imagemagick.org (it’s free!) Protected archive area u0003 0000023 0000007 0005 u0003_0000023_0000007_0005.tif 0000007 0005 Thumb, mid-, and large-size derivatives Web accessible area Audio … LAME: http://lame.sourceforge.net OCR … TESSERACT: http://code.google.com/p/tesseract-ocr/

9 http://acumen.lib.ua.edu ACCESS! Via Acumen (also free!)  XML schema agnostic  No ingest  No metadata modifications  All content easily accessible  Open to search engines


Download ppt "File storage is a lot like a basement closet... Image courtesy of Teemo, Master of Clowning Image courtesy of Life Magazine What happens when it's time."

Similar presentations


Ads by Google