Digital Repository Service (DRS) Harvard University Library OIS presented by: Wendy Gogel & Andrea Goethals
Today’s Agenda April 26, 2010 How DRS began Building the collections Where DRS is now Where DRS is going
How DRS began
The formative years November 1997: Library Digital Initiative (LDI) Proposal “…create the first-generation technical infrastructure to support storage of and access to digital library materials.” July 1998: LDI was approved and funded December 1998: planning for DRS began
October 2000 launch Digital Repository Service (DRS) provides a set of professionally managed services to ensure the usability of securely stored digital objects over time. is both a preservation and an access repository
DRS is … Technical infrastructure Deposit tools Delivery services Management tools Storage system People Technical expertise and advice Content and system monitoring and management Preservation planning and activities User support and guidance Content
Building the collections
Content Programs & Projects LDI Internal Challenge Grant Program Harvard Art Museum inventory project Open Collections Program Google Books project Web Archiving ongoing
Content Digitizing Facilities Harvard College Library Imaging Services HCL Fine Arts Library Digital Imaging Lab Harvard Art Museum Digital Imaging and Visual Resources Harvard College Library Audio Preservation Services Peabody Museum of Archaeology and Ethnology
Metadata
Audio Matins for Sunday after the Elevation of the Holy Cross Matins for Sunday after the Elevation of the Holy Cross Laura Boulton ( ) Collection of Byzantine and Orthodox Musics Archive of World Music One of a series of Byzantine hymns and liturgies recorded in a monastery on Patmos, Logbook (Part I, p. 1-10)
Where DRS is now
DRS by the numbers 109 TB of content 356 TB total (counting all copies) 15 M files Includes compressed archives - in reality closer to 707 M files 857,000 compressed Google books containing 676 M files 7,300 compressed web harvests containing 17.5 M web files
Format distribution: file count
Format distribution: file size
Content Growth
DRS Architecture
DRS third-party components Open source software Castor (XML to Java mapping) XX XML Validator Java Swing (U/I toolset) iText PDF creator JHOVE Apache Lucene, Struts, Tomcat JQUERY (javascript tools) Apache Log4j (logging) Giffy (Tiff-to-Gif Converter) XML tools (Xerces, Xalan, JaxB, JDOM) Linux COTS software Luratech Image Server Real Media Helix Streaming Server Oracle database SUN Solaris SUN SAM/QFS Storage Archive Manager Common OIS software Access Management Service Name Resolution Server Open Source Berkeley DB
Where DRS is going
DRS 2 Why? 1. To better support digital preservation planning & activities 2. To better support operational & collection management needs of DRS depositors, collection managers, library administrators & repository staff
DRS 2 Process Phases of work DRS 2.1, 2.2, 2.3, etc. Themed phases DRS 2.1: “Object Security and Integrity” DRS 2.2: “Management and Monitoring” DRS 2.3: “Delivery and Dissemination” Includes support for new formats DRS 2.1: PDFs, opaque objects DRS 2.2: more audio formats (MP3, MP4/AAC) DRS 2.3: drawings, dissemination
DRS 2 Deliverables New backend New deposit tools New management tools New dissemination tools Support for new formats
DRS 2 Timing July 2010, (for testing): New backend New deposit environment (5 object types) August 2011, release: New backend New deposit environment (16 object types) New management interface Enhanced rights metadata Enhanced audio support Migrated content
DRS 2 Timing Spring 2012, release: Dissemination Support for drawing formats
Questions?