DRS 2 Orientation Harvard University Library September 30, 2010 DRS = Digital Repository Service.

Presentation transcript:

Agenda
1. DRS 2
   1. Concepts (Andrea)
   2. New metadata (Robin)
   3. Overall schedule (Andrea)
2. BatchBuilder 2 demo (Vitaly)
3. Testing instructions (Vitaly)
4. Questions & comments

DRS 2 Concepts

DRS 1: everything’s a file
• METS XML file
• TIFF image file
• JPEG image file
• JP2 image file
• JPEG image file
• JP2 image file
• Text file
• ZIP file
• PDF document file

File level is not a meaningful level for curatorial uses…
• Which DRS files make up my digital manuscript?
(Diagram label: HOLLIS number)

(Diagram sequence: the DRS 1 file list repeated across several slides, with a "DRS file ID =" label, narrowing to a METS XML file, a TIFF image file and a JP2 image file labeled "page 1" and "page 2".)

Objects
• Aggregations of files that together represent a coherent unit of content
  - All the files that make up a single digital book
  - All the master and use copies representing a single photograph
• Useful for management, reporting and searching
  - “How many PDS document objects do I have in the DRS?”
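As a rough illustration of the object idea, the sketch below groups file records into objects and then counts objects (rather than files) per content model, which is the kind of question the slide poses. The record fields, object identifiers and sample file names are all hypothetical and are not taken from the DRS.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class FileRecord:
    """Hypothetical file record; field names are illustrative only."""
    file_name: str
    object_id: str      # the object this file belongs to
    content_model: str  # e.g. "PDS document" or "Still image"


def group_into_objects(files):
    """Aggregate file records by the object they belong to."""
    objects = defaultdict(list)
    for record in files:
        objects[record.object_id].append(record)
    return dict(objects)


def count_objects_by_model(objects):
    """Count objects, not files, per content model."""
    return Counter(members[0].content_model for members in objects.values())


# Made-up sample data: one digital book and one photograph.
files = [
    FileRecord("book1_mets.xml", "book-1", "PDS document"),
    FileRecord("book1_page1.jp2", "book-1", "PDS document"),
    FileRecord("book1_page2.jp2", "book-1", "PDS document"),
    FileRecord("photo1_master.tif", "photo-1", "Still image"),
    FileRecord("photo1_use.jpg", "photo-1", "Still image"),
]

objects = group_into_objects(files)
counts = count_objects_by_model(objects)
print(counts["PDS document"])  # number of PDS document objects, not files
```

Counting at the object level is what makes the answer meaningful to a curator: five files here collapse into two units of content.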

Objects
• New hook for metadata
  - Administrative categories (projects, exhibits, collections, etc.)
  - Descriptive metadata, catalog records
(Diagram labels: Object; Hollis #; Digital Medieval Manuscripts at Houghton Library; Moralia in Job: manuscript)

Content models
• Object types
• Define
  - valid file formats and relationships
  - known delivery and rendering applications
  - associated assessments and preservation plans
• Enforce conformity - we know what we have in the DRS and can monitor & preserve it
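One way to picture "enforce conformity" is a check that rejects files whose format is not valid for the object's content model. The sketch below is only an assumption about how such a check could look: the mapping from content model to file extensions is hypothetical, and a real content model would also cover relationships, delivery applications and preservation plans.

```python
from pathlib import PurePath

# Hypothetical mapping from content model to allowed file extensions.
# None means "files in any format" (the Opaque content model).
CONTENT_MODELS = {
    "Still image": {".tif", ".tiff", ".jpg", ".jpeg", ".jp2"},
    "PDS document": {".jp2", ".txt"},
    "Document": {".pdf"},
    "Text": {".txt", ".xml"},
    "Opaque": None,
}


def nonconforming_files(content_model, file_names):
    """Return the files whose extension is not allowed by the content model."""
    allowed = CONTENT_MODELS[content_model]
    if allowed is None:
        return []
    return [
        name for name in file_names
        if PurePath(name).suffix.lower() not in allowed
    ]


# A Document object may hold a PDF but not a spreadsheet.
problems = nonconforming_files("Document", ["report.pdf", "notes.xls"])
print(problems)
```

Rejecting nonconforming files at deposit time is what lets the repository know exactly which formats it holds.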

DRS 2.1 content models – deposit & delivery
1. Still image
   Image objects, delivered by IDS
2. PDS document
   Page-turned documents, delivered by PDS
3. Document
   Initially just PDF files, delivered by FDS
4. Opaque
   Files in any format
5. Text
   Text, XML, etc. delivered by FDS

Still image CM – print
• TIFF archival master
• Several derivative JPEG deliverables
• Derivative JPEG thumbnail
Pope Joan Series: Illustration from Philippus Bergomensis, De Claribus Mulieribus. Ferrara, Rossi
Harvard Art Museum/Fogg Museum, Gift of Philip Hofer

PDS document CM - book
Zoeller, Karl William. Merchandising the plumbing business. Chicago : Domestic Engineering Co., c1921. Baker Library.
• JP2 archival master / deliverable images per page
• Plain text files per page
• …

Document CM - report
Intergovernmental Panel on Climate Change (IPCC) WG1 Fourth Assessment Report, Environmental Science and Public Policy Archives, Harvard College Library
• PDF deliverable

Opaque content model
• The contents of Judge Trager’s hard drive, Harvard Law School Library
  - WordPerfect files, text files, PDF documents, etc.
  - Plus documentation about the collection

Text CM – methodology
• Plain text file
Processing methodology for Intergovernmental Panel on Climate Change (IPCC) documents, HCL Imaging Services.

New metadata

Object descriptors
• A METS metadata file per object on the file system alongside content files
  - Descriptive, administrative, preservation, technical and structural metadata
  - Describes the object, all its files and bitstreams and related significant events
  - Gives the metadata the same secure storage as the content files
• Self-contained, portable objects

The move to standards
• PREMIS -- for key preservation metadata, including
  - Events that affect content
  - Relationships that are not implicit
• MODS -- for descriptive metadata
• Form-specific schemas for technical metadata, including
  - MIX for images
  - textMD for text
  - DocumentMD for PDF and other document formats
  - More to come…
• Supplemented by local administrative schemas

New local metadata
• adminCategory
• adminFlag
• captions, phase 2
  - Behavior, default, unit name, description for objects
• content model identification
• DRS URI
• isFirstGenerationInDrs
  - Closest to original capture
• isPreferredDeliverableSource

Changes to local metadata
• OwnerSuppliedName
  - Required for objects, optional for files
• Role
  - Repeatable for both objects and files
• Processing
  - Instead of “purpose”; repeatable
• Quality
  - Optional
• Methodology
  - Now for objects and files of all types

Tracking changes
• DRS 2 will keep track of
  - Changes that affect content
  - Troubleshooting content errors
  - Key administrative metadata
• Three types:
  - Events
  - Administrative flags
  - “Versioned” metadata elements
• Not tracking every metadata change

Events
• Object
  - creation
  - deletion / recovery from deletion
  - ingest
  - merge
• File
  - addition
  - deletion / recovery from deletion
  - integrity check confirmation
  - replacement
  - virus check confirmation

Other tracking
Metadata where changes will be tracked:
• Access Flag
• Administrative Flag
• Billing Code
• Owner Code
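The selective tracking described in these slides (history kept for a few key elements, not for every metadata change) could be modeled roughly as follows. This is a minimal sketch under stated assumptions: the class, the key names for the four tracked elements and the sample values are hypothetical and do not reflect how DRS 2 stores history.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone

# Hypothetical keys standing in for the four tracked elements:
# Access Flag, Administrative Flag, Billing Code, Owner Code.
TRACKED_FIELDS = {"accessFlag", "adminFlag", "billingCode", "ownerCode"}


@dataclass
class TrackedMetadata:
    """Keeps current values for all fields and a history for tracked ones."""
    values: dict = field(default_factory=dict)
    history: list = field(default_factory=list)

    def set(self, name, new_value):
        old_value = self.values.get(name)
        self.values[name] = new_value
        # Record a history entry only for tracked fields that actually changed.
        if name in TRACKED_FIELDS and old_value != new_value:
            timestamp = datetime.now(timezone.utc)
            self.history.append((timestamp, name, old_value, new_value))


record = TrackedMetadata()
record.set("ownerCode", "EXAMPLE.OWNER")   # tracked: recorded in history
record.set("caption", "Title page")        # untracked: value stored only
record.set("ownerCode", "EXAMPLE.OWNER")   # unchanged: nothing recorded
print(len(record.history))
```

Limiting history to a short list of fields keeps the record of changes small while still covering the administrative metadata that matters for accountability.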

What’s inside a descriptor?
• Descriptive Metadata
  - MODS
• Administrative Metadata
  - For the object:
    PREMIS (including relationships)
    DRS administrative metadata
  - For each file:
    PREMIS (including relationships)
    Format-specific metadata
    DRS administrative metadata
  - PREMIS Events
• Inventory of Files
• Structure Map
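To make the descriptor layout more concrete, the sketch below builds a bare METS skeleton with the four parts named above: descriptive metadata (MODS), administrative metadata, an inventory of files and a structure map. It uses only standard METS and MODS element names with Python's standard library; the identifiers, the empty administrative section and the one-division-per-file structure are assumptions for illustration, not the actual DRS 2 descriptor profile.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
MODS_NS = "http://www.loc.gov/mods/v3"
XLINK_NS = "http://www.w3.org/1999/xlink"
ET.register_namespace("mets", METS_NS)
ET.register_namespace("mods", MODS_NS)
ET.register_namespace("xlink", XLINK_NS)


def mets(tag):
    """Qualify a tag name with the METS namespace."""
    return f"{{{METS_NS}}}{tag}"


def mods(tag):
    """Qualify a tag name with the MODS namespace."""
    return f"{{{MODS_NS}}}{tag}"


def build_descriptor(title, file_names):
    """Build a skeleton descriptor; IDs and layout are illustrative only."""
    root = ET.Element(mets("mets"))

    # Descriptive metadata: a MODS record wrapped in a dmdSec.
    dmd = ET.SubElement(root, mets("dmdSec"), ID="dmd-1")
    wrap = ET.SubElement(dmd, mets("mdWrap"), MDTYPE="MODS")
    xml_data = ET.SubElement(wrap, mets("xmlData"))
    record = ET.SubElement(xml_data, mods("mods"))
    title_info = ET.SubElement(record, mods("titleInfo"))
    ET.SubElement(title_info, mods("title")).text = title

    # Administrative metadata: left empty here; PREMIS and local
    # administrative metadata would be wrapped inside this section.
    ET.SubElement(root, mets("amdSec"), ID="amd-1")

    # Inventory of files and structure map, one division per file.
    file_grp = ET.SubElement(ET.SubElement(root, mets("fileSec")), mets("fileGrp"))
    top_div = ET.SubElement(ET.SubElement(root, mets("structMap")), mets("div"))
    for order, name in enumerate(file_names, start=1):
        file_id = f"file-{order}"
        file_el = ET.SubElement(file_grp, mets("file"), ID=file_id)
        ET.SubElement(
            file_el, mets("FLocat"),
            {"LOCTYPE": "URL", f"{{{XLINK_NS}}}href": name},
        )
        page = ET.SubElement(top_div, mets("div"), ORDER=str(order))
        ET.SubElement(page, mets("fptr"), FILEID=file_id)
    return root


descriptor = build_descriptor("Example book", ["page1.jp2", "page2.jp2"])
print(ET.tostring(descriptor, encoding="unicode"))
```

Because the descriptor names every file and carries the metadata itself, an object can be copied out of the repository and still be understood on its own.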

Overall schedule

• Available now: first release of BatchBuilder 2 for depositor training and testing
  - Supports 5 content models
• Fall 2010 – Summer 2011
  - BatchBuilder 2 enhancements & bug fixes
  - Web Admin 2 development and testing
• ~September 2011: BatchBuilder 2 and Web Admin 2 in production

BatchBuilder 2

• Will build batches of objects rather than batches of files
• Will automatically determine most technical metadata (using FITS)
• Will automatically create all object descriptors (using OTS)

BatchBuilder 1 | BatchBuilder 2
Expects files and creates batches of files. | Expects objects and creates batches of objects.
Can use an existing PDS METS file for PDS objects. | Can import a structmap from an “old-style” PDS METS file to create a PDS Document descriptor.
Uses batch genres. | Uses DRS Content Models.
Uses a supplied HOLLIS ID to import contents of a HOLLIS record to a PDS METS Label. | Uses a supplied HOLLIS ID to import contents of a HOLLIS record into the MODS section of the object descriptor.
Batch level and directory level metadata entered in Batch Template panel. | Object level and directory level metadata entered in Object Template.
Project level metadata is entered in Administrative Properties panel. | Project level metadata is entered in Deposit Settings panel.
No depositor authorization – anyone with access to the ftp dropbox can load batches. | Depositor authorization – only depositors with permission to load into a particular owner code can load batches into that owner code.
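The depositor authorization rule in the last row can be sketched as a simple permission lookup. The depositor names, owner codes and function names below are hypothetical; the slides do not describe how BatchBuilder 2 or the DRS actually stores or checks permissions.

```python
# Hypothetical permission table: depositor -> owner codes they may load into.
PERMISSIONS = {
    "depositor_a": {"OWNER.ONE", "OWNER.TWO"},
    "depositor_b": {"OWNER.TWO"},
}


def can_load(depositor, owner_code, permissions=PERMISSIONS):
    """True only if the depositor has permission for this owner code."""
    return owner_code in permissions.get(depositor, set())


def load_batch(depositor, owner_code, batch_name):
    """Refuse the batch unless the depositor is authorized for the owner code."""
    if not can_load(depositor, owner_code):
        raise PermissionError(
            f"{depositor} may not load {batch_name} into {owner_code}"
        )
    return f"{batch_name} accepted for {owner_code}"


print(load_batch("depositor_a", "OWNER.ONE", "batch-001"))
```

The contrast with BatchBuilder 1 is that access to the dropbox alone is no longer enough; the check is tied to the owner code being loaded into.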

Testing instructions

Questions & Comments