Presentation is loading. Please wait.

Presentation is loading. Please wait.

Eric Luhrs Digital Initiatives Librarian Special Collections & College Archives MetaDB Development at Lafayette College Haruki Yamaguchi.

Similar presentations


Presentation on theme: "Eric Luhrs Digital Initiatives Librarian Special Collections & College Archives MetaDB Development at Lafayette College Haruki Yamaguchi."— Presentation transcript:

1 Eric Luhrs Digital Initiatives Librarian Special Collections & College Archives luhrse@lafayette.edu MetaDB Development at Lafayette College Haruki Yamaguchi Class of 2011 Department of Computer Science yamaguchm@lafayette.edu CS320: December 1, 2010

2 About this talk 1.WHY: digital collection management – The shift from analog to digital – Preserving our digital heritage – Fast-moving field 2.WHAT: usage overview – Digitization workflow – Why this is important – Brief version history 3.HOW: design overview – Development environment – Application design – Database interaction 4.END: wrap-up – Demo (time permitting) Talk Outline

3 About this talk Ever-Expanding Digital Collections The Lafayette 140 years online, ~43,000 searchable pages http://digital.lafayette.edu/collections/newspaper Lafayette Digital Repository Open Access to faculty & College publications http://dspace.lafayette.edu East Asia Image Collection ~3,000 images from books, photographs, negatives, slides http://digital.lafayette.edu/collections/eastasia

4 About this talk Analog versus digital – Can easily find 1000-year-old book – Where is the page we scanned last week? – How to manage digital material in the future? – Are we headed toward a digital dark age? – Organization, storage, and retrieval of information is a big field trying to keep up with fast-changing technology Preserving Our Digital Heritage

5 About this talk 1.Standardization is first step toward preservation – Automation prevents human error – Allows us isolate specific content types – Ubiquitous formats aid standards aid migration 2.Strengthens digital collection building efforts – Allows me to work faster and smarter – Subject exports create stronger collections MetaDB: Return on Investment

6 File Input High Resolution Master Images Metadata Input Descriptive Administrative Technical Collection Output CSV & TSV data Derivative Images Workflow Managment Digital Asset Management System (CONTENTdm, Dspace, Drupal)

7 Descriptive MD (subject specialist) Title Description Subjects […] Administrative MD (librarian) Collection Publisher Access Rights […] Technical MD (automated) File Format File Size Checksum […] MetaDB Allows Us to Automate & Distribute Collection of Metadata Asset Management System File Input High Resolution Master Images Metadata Input Descriptive Administrative Technical Collection Output CSV & TSV data Derivative Images

8 MetaDB Allows Us to Automate & Distribute Collection of Metadata Asset Management System File Input High Resolution Master Images Metadata Input Descriptive Administrative Technical Collection Output CSV & TSV data Derivative Images CSV, TSV data Dublin Core metadata standard Common file format outputs Derivative Images Created from multiple image formats Custom image sizes Pan/zoom interface Banding/branding

9 Completed MetaDB Collection TSV JPG CONTENTdm

10 Better Digital Collections Increased Visibility New Acquisition Greater Knowledge Improve Workflow Why this Work is Important

11 MetaDB Version History Version 0  Microsoft Access Database shared over local Novell network Version 1  MySQL database with simple web-based HTML interface Version 2  MySQL database with PHP / YUI JavaScript interface Version 3  Postgres database with Java / jQuery JavaScript interface

12 Major Features in Latest Version Version 3.1  Table view editing  Controlled vocabularies  Drag/drop field ordering  Technical metadata extraction  Automatic derivative creation  Image banding/branding  Web-based user management  Vastly improved user interface

13 Development Environment MetaDB0 (Production) MetaDB1 (Development) svn.lafayette.edu New releases Feedback Bugs Test Patches/Features

14 Application Design MetaDB Service API AJAX Database Servlets Images ImageMagick Back-End Front-End

15 Update Database Data Whitelist Cross- checking Authentication Log Feedback

16 Retrieve from Database Data Authenticate Request Gather data Wrap in objects Unpack into JSON Log Project: cpw-nofuko Item: 1 Type: Descriptive Metadata Session ID:AHJ7HA… Concurrency Check

17 Populate User Interface Data Widgets Templates

18 Facts & Figures Development : January 2009 - Present Size: 110+ Java classes, ~30,000 lines of code Database: ~120,000+ rows of data Images: ~200GB disk space Subversion: Revision 3864

19 Eric LuhrsHaruki Yamaguchi luhrse@lafayette.edu yamagucm@lafayette.edu http://metadb.lafayette.edu What does this mean for you?


Download ppt "Eric Luhrs Digital Initiatives Librarian Special Collections & College Archives MetaDB Development at Lafayette College Haruki Yamaguchi."

Similar presentations


Ads by Google