A Preservation Repository in Prose Being a Story of the DRS Past, Present and Future By Andrea Goethals, Wendy Gogel In Cambridge, Massachusetts 2009.

Slides:



Advertisements
Similar presentations
Digital Music and Audio Projects at Indiana University Jon Dunn Digital Library Program Indiana University August 16, 2001.
Advertisements

The Future of Scholarship in the Digital Age: The Role of Institutional Repositories Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
Digital Preservation A Matter of Trust. Context * As of March 5, 2011.
October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
DSpace: the MIT Libraries Institutional Repository MacKenzie Smith, MIT EDUCAUSE 2003, November 5 th Copyright MacKenzie Smith, This work is the.
DRS 2 Metadata Migration June 25, Agenda Introduction Preliminary results - content analysis Metadata options Next steps Questions.
DRS 2 one in a series of periodic updates Harvard University Library Andrea Goethals October 21, 2009 DRS = Digital Repository Service.
Harvard University Library Digital Initiative Internal Challenge Grant Program NERCOMP E-learning Conference March 20, 2001 Worcester, Massachusetts Wendy.
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
Providing Online Access to the HKUST University Archives: EAD to INNOPAC Sintra Tsang and K.T. Lam The Hong Kong University of Science and Technology 7th.
May 2, 2006 Virtual Collections Or, catalog building without the rocket science.
MIT’s DSpace A good fit for ETDs Margret Branschofsky Keith Glavash MIT LIBRARIES.
Yale VITAL/FEDORA Repository IAC Update 30 Oct 2006 Audrey Novak, Head IS&P, ILTS.
Automatic Evaluation of Migration Quality in Distributed Networks of Converters Miguel Ferreira Supervisors Ana Alice Baptista.
The KnowledgeBank: Powered by DSpace Laura Tull Systems Librarian Ohio State University Libraries WiLSWorld July 27, 2004.
School of something FACULTY OF OTHER University Library The Library’s Digital Repository or Whatever happened to MIDESS? Michael Emly Jonathan Ainsworth.
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
Developing PANDORA Mark Corbould Director, IT Business Systems.
I:\Share\Bestuursinligting\OUDITfinaal\Portfolio\Statistics\BI UPSpace An institutional repository for the University of.
OU Digital Library development project Liz Mallett – Project Manager James Alexander – Project Developer 25 January 2012.
Shared October 13, 2010 Shelf Michael Roy, Dean of Library and Information Services, Middlebury College A Networked Image Platform Jeremy Stynes, Head.
Sai Deng, Metadata Catalog Librarian, Wichita State University Libraries Tse-Min Wang, Graduate Student in CS, Wichita State University Digital Imaging.
Harvard’s Digital Repository Service (DRS) Architecture Harvard University Library (HUL) Andrea Goethals, Randy Stern December 10, 2009.
The New DRS (DRS 2) Introduction. What is DRS? Digital repository for preservation and access –Maintains integrity of deposited content –Preserves content.
Digital Repository Service (DRS) Harvard University Library OIS presented by: Wendy Gogel & Andrea Goethals.
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
ArcGIS Workflow Manager An Introduction
Chinese-European Workshop on Digital Preservation, Beijing July 14 – Network of Expertise in Digital Preservation 1 Trusted Digital Repositories,
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Mid-Michigan Digital Practitioners, March 14, 2014 The National Digital Stewardship Alliance Agenda Mid-Michigan Digital Practitioners Meeting Abigail.
1. 2 introductions Nicholas Fischio Development Manager Kelvin Smith Library of Case Western Reserve University Benjamin Bykowski Tech Lead and Senior.
DSpace: Introduction and Starting an Institutional Repository
An Overview of MPEG-21 Cory McKay. Introduction Built on top of MPEG-4 and MPEG-7 standards Much more than just an audiovisual standard Meant to be a.
Update on UDFR (Unified Digital Format Registry) NDIIPP Meeting June 25, 2009 Andrea Goethals.
From Concept to Reality: An overview of the University of Wisconsin Digital Collections Melissa Mclimans.
ESRI User Conference, August 8, 2006 Long-term archiving of geospatial data: the NGDA project Julie Sweetkind-Singer John Banning Stanford University.
Organizational Relationships and Shaping the Digital Resource July 21, 2010 Johanna Bauman, Senior Production Manager, ARTstor.
Searching Sheet Music: IN Harmony Final Report Stacy Kowalczyk Digital Library Program Brownbag Spring Series February 13, 2008.
Migrating Repository Metadata & Users: The Harvard DRS 2 Project Andrea Goethals, Harvard Library IS&T Archiving 2014, May
Why Archiving and Preserving GIS Data Is Important Maps tell a compelling story of change over time. They document movement, progress, and change to the.
GPO’s Federal Digital System August 17, 2010 U.S. Government Printing Office.
File format registries - a global infrastructure for local persistence Andreas Aschenbrenner, ERPANET.
DRS 2 Orientation Harvard University Library September 30, 2010 DRS = Digital Repository Service.
HathiTrust’s Past, Present and Future. Short- and Long-term Functional Objectives Short-term Page turner mechanism (and Mobile!) Branding (overall initiative;
AILLA:The Archive of the Indigenous Languages of Latin America Heidi Johnson The University of Texas at Austin Latin American Digital Library Initiative,
The Canadian Information Network for Research in the Social Sciences and Humanities Tim Au Yeung and Mary Westell Libraries.
Introduction to metadata
Implementing an Institutional Repository: Part III 16 th North Carolina Serials Conference March 29, 2007 Resource Issues.
DRS 2 Project (2008 – Present!) Andrea Goethals, Harvard Library Digital Preservation Management Workshop, MIT June 13, 2013.
Rights Metadata in DRS Basic Rights Functions in: – Batch Builder – EAS – DRS Web Admin.
The New DRS Introduction. What is DRS? Digital repository for preservation and access – Maintains integrity of deposited content – Preserves content for.
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Research Data Management At the Smithsonian Using Sidora CNI December 10, 2013.
Institutional Repositories: the DSpace Experience Ann J. Wolpert Director of Libraries Massachusetts Institute of Technology.
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
A Project of the University Libraries Ball State University Libraries A destination for research, learning, and friends.
Managing live digital content with DuraSpace services Bill Branan PASIG Spring 2015.
A Beginner’s Guide to Preserving Digital Resources in Historic Environment Records Catherine Hardman and Kieron Niven Archaeology Data Service.
Making the Case for Curation: The Practical Experiment of DSpace Managing Digital Assets February 5-6, 2005 Charleston, SC Ann J. Wolpert, Director of.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
CENTRAL/WESTERN MASSACHUSETTS AUTOMATED RESOURCE SHARING Digitization GOALS & THEIR LOGISTICS Michael J. Bennett Digital Initiatives Librarian C/WMARS,
Archiving CAD in Archaeology: Ingest to Dissemination (or The ADS experience to date) Kieron Niven Archaeology Data Service, University of York, UK.
Breeda Herlihy, IR Manager, UCC Library. UCC selected DSpace in 2008 Software selection group Staff from Library IT, Computer Centre, Special Collections,
Discover ScholarSphere A repository service collaboration between the University Libraries and ITS.
Building Digital Archives Mark Phillips Cathy Hartman June 6, 2008.
7th Annual Hong Kong Innovative Users Group Meeting
GISELA & CHAIN Workshop Digital Cultural Heritage Network
An Open Archival Repository System for UT Austin
GISELA & CHAIN Workshop Digital Cultural Heritage Network
Presentation transcript:

A Preservation Repository in Prose Being a Story of the DRS Past, Present and Future By Andrea Goethals, Wendy Gogel In Cambridge, Massachusetts 2009

Today’s Agenda DRS 1: Being a Story of the Past A Transition: Being a Story of the Present DRS 2 and You!: Being a Story of the Future Questions?

DRS 1: Being a Story of the Past

The Formative years - LDI November 1997 Proposal for the Library Digital Initiative “…create the first-generation technical infrastructure to support storage of and access to digital library materials.” In July 1998, LDI was approved and funded In December 1998, planning for DRS began

Digital Repository Service (DRS) provides a set of professionally managed services to ensure the usability of securely stored digital objects over time. is both a preservation and an access repository includes the bundled delivery services October 2000 Launch

LDI Grant projects 49 Grants were awarded Digitizing analog collections Images Text Audio Music scores Born Digital Biomedical images Geospatial data Web sites Online cataloging projects

Digitizing Facilities June 1999 Harvard College Library Imaging Services HCL Fine Arts Library Digital Imaging Lab (FAL DIL) Harvard Art Museum Digital Imaging and Visual Resources (DIVR) Harvard College Library Audio Preservation Services (HCL APS) Peabody Museum of Archaeology and Ethnology

The first Deposit and the first object was deposited on October 23, 2000…

w/ Metadata Administrative Stewardship, contacts (e.g., HCL Harvard-Yenching Library, Ray Lum, etc.) Billing account (e.g., 33-digit account number) Access flag (e.g., open to the public, restricted to the Harvard community, no access) Technical Physical characteristics (e.g. for images, x and y resolution, MD5 signature, pixel width and height, compression, bit sample rate, etc.) Production methods (e.g. for images, Scitex; Leaf Volare; Leaf Colorshop 5.x )

The first Book was deposited on June 29, 2001

The first Audio was deposited on January 28, 2003 Matins for Sunday after the Elevation of the Holy CrossMatins for Sunday after the Elevation of the Holy Cross Laura Boulton ( ) Collection of Byzantine and Orthodox Musics Archive of World Music One of a series of Byzantine hymns and liturgies recorded in a monastery on Patmos, Logbook (Part I, p. 1-10)

The first georeferenced map was deposited on January 14, 2005 Barnstable, Massachusetts 15 Minute Digital Raster Graphic From an 1893 Historic USGS map reprinted in 1907

Systems and Services 1985 HOLLIS –our OPAC VIA Visual Information Access– union catalog OASIS Online Archival Search Information System – union catalog OLIVIA – image cataloging tool

Systems and Services DRS Digital Repository Service – preservation and access repository NRS Name Resolution Service – to resolve persistent identifiers AMS Access Management Service – to provide access controls IDS Image Delivery Service PDS Page Delivery Service FTS Full-text Search Service NRS Web Admin Policy Web Admin

Systems and Services DRS Web Admin – staff interface to DRS PDS Maint Harvard Geospatial Library – union catalog TED TEmplated Database – collection building tool SDS Streaming Delivery Service – for audio delivery ADS Asynchronous Delivery – for large files Cross-catalog search – for federated searching

Systems and Services Dynamic IDS – for zoom and pan features w/ JP2 DMART- Audio deposit tool RList – Course reserves tool Virtual Collections Batch Builder Google data loading WAX

A Transition: Being a Story of the Present

2008: new DRS storage system New servers, new storage arrays, new tape library, new storage software Increased storage capacity Less complex - DRS loader doesn’t need to know the details of storage system anymore Higher availability for deliverable content Copies stored in 3 different geographic locations 3 “low use” copies, 4 “high use” copies

Cumulative file count per format type

Annual file size per harvard unit (gb) HCL Art Museums

Cumulative non-Google file size per use (gb) April 2009: 45,742 GB

Cumulative file size (gb) April 2009: 105,652 GB

DIY --

2008: new program, new position HUL takes next step in its commitment to digital preservation and establishes: 1.Digital Preservation and Repository Manager Position March 2008 Andrea Goethals 2.Digital Preservation Program June 2008 Established within OIS

2008/9 priorities of new digital preservation program 1.Define additional infrastructure requirements to support digital preservation DRS enhancements Global digital format registry (GDFR) 2.Identify and analyze new formats for the DRS to support PDF, , audio, architectural drawings, etc. 3.Establish communication network with the 2 communities we inhabit Broader digital preservation community Harvard community

Avenues of communication Broader digital preservation community Conferences and meetings Collaborative projects conversations, blogs, newsgroups Harvard community Committees (ULC, CCCC, DMCC, DCSWG, etc.) Digital project librarians Ad-hoc focus groups, meetings and with stakeholders (depositors, curators and collection managers) Customer surveys

These communities inform our thinking about: Concepts and terms Metadata Data models Content Recommended & supported formats Best practices Preservation planning and actions Storage, management and monitoring Certifications Registries Tools and services

DRS customer survey 2008 August - September 2008 Users of DRS tools or services To evaluate the level of satisfaction with DRS tools, services, and websites To understand any unmet needs

Survey findings Question 1: What word or phrase best describes the DRS? In general the DRS is valued for its preservation services and perceived as stable, secure and trusted.

Other key findings of survey DRS Customers want: Support for more formats Guidance on preservation formats and content creation Better search and editing management tools Delivery services that use common or popular third-party applications

Trends in DRS customer needs 1.Problem of abundance 2.Remote creators 3.Diversity of formats

1. Problem of abundance DRS owners and depositors: Are increasingly overwhelmed by the amount of digital content to preserve Can’t fully process the material they want to deposit into the DRS Can’t go through a deposit process that is time-consuming

2. Remote creators Increasingly DRS owners and depositors are acquiring content they did not create DRS staff can not influence the formats or technical properties of this content during creation

3. Diversity of formats DRS owners and depositors increasingly need to preserve formats and genres that aren’t currently supported by the DRS CAD formatsSpreadsheet formats 3D visualization formatsPresentation formats Additional audio formatsDatabases Video formatsLocally archived websites Executable file formatsRaw survey data Word processing formatsRaw camera files

Implications of these trends The DRS needs to: accept and preserve minimally-processed content provide a time-efficient deposit process support a broad range of formats and genres And: can’t rely on the content being in “preservable” formats prior to deposit into the DRS

DRS 2 and You!: Being a Story of the Future

DRS 2 changes Why? 1.To better support digital preservation 2.To better support needs of DRS depositors, curators and collection managers

DRS 2 changes 1.New conceptual foundation Objects Content models 2.User improvements Support for opaque objects Support for new file formats Deposit, management & delivery tools Guidance & user community 3.A new approach to metadata 4.Increased preservation planning and activities

Objects Currently only a file level in the DRS All management has to be done at the individual file level Objects are aggregations of files Page-turned object Still image object More intuitive unit for management, reporting and searching Example: How many Page-turned objects do I have in the DRS?

Content models Types of objects Example: audio content model

Support for opaque objects A special content model Allows files in any format The digital equivalent of buying time at HD Content can be minimally processed Must be intended for long-term preservation The content could be fully processed by depositors but not supported yet by DRS Will receive some preservation services Will be on a path to fuller DRS preservation

Support for new file formats PDF Audio MP3, MP4/AAC Drawings AutoCAD Adobe Illustrator Video What’s next?

Deposit, management & delivery tools Enhanced Batch Builder Integrated with File Information Tool Set (FITS) Enhanced DRS Web Admin Better searching Richer management and reporting Ability to perform batch updates File Delivery Service (FDS) Created for PDF delivery Delivers a file to user’s web browser

Future of

Guidance & user community New website for digital preservation Formats central Content models DRS practices HUL digital preservation projects Emerging standards and best practices Tools, services, registries Resources & Experts

A new approach to metadata Moving towards community-standard schemas PREMIS, MODS, MIX, textMD, etc. Metadata files on the file system alongside content files “object descriptors” Preservation, rights, descriptive metadata More reliance on embedded metadata Automatic extraction at deposit time by FITS Third party delivery applications are becoming aware of file-embedded metadata

Increased preservation planning and activities More granular format identification Sub-file characterization Preservation plans per content model Digital first aid (content & metadata) “Localization,” migrations, normalizations Technology watch Virus checking

DRS 2 process Phases of work DRS 2.1, 2.2, 2.3, etc. Themed phases DRS 2.1: “Object Security and Integrity” DRS 2.2: “Management and Monitoring” Includes support for new formats DRS 2.1: PDFs, opaque objects DRS 2.2: more audio formats (MP3, MP4/AAC)

Questions?

Image credits Future ghost Marley’s ghost _marley27s_ghost.jpghttp://cueballcol.files.wordpress.com/2007/12/435px-a_christmas_carol_- _marley27s_ghost.jpg Ghost of the past Ghost-Of-Marley,-From-Dickens-A-Christmas-Carol.jpghttps:// Ghost-Of-Marley,-From-Dickens-A-Christmas-Carol.jpg Ignorance and want Weight of wikipedia pghttp://images.theage.com.au/ftage/ffximage/2008/05/26/300_wikipedia1.j pg Lots of people Ghost of the future Mr. Magoo re_small.jpghttp:// re_small.jpg