Presentation is loading. Please wait.

Presentation is loading. Please wait.

Building a Digital Archives for the City of Vancouver Glenn Dingwall 14 September, 2011.

Similar presentations


Presentation on theme: "Building a Digital Archives for the City of Vancouver Glenn Dingwall 14 September, 2011."— Presentation transcript:

1 Building a Digital Archives for the City of Vancouver Glenn Dingwall glenn.dingwall@vancouver.ca 14 September, 2011

2 Project Context 2004-2006VanRIMS Classification Project 2008-2009VanDOCS ERDMS Project 2009-2010Olympic Legacy Project

3 Project Phases I - Proof of Concept (2008-2009) Public records Controlled creation environment II – Prototype (2009-2010) Private records Uncontrolled creation environment

4 Initial Assumptions Use OAIS (Open Archival Information System Reference Model) as a starting point Progressively add to requirements, drawing from: –General Preservation Standards InterPARES RLG/OCLC Trusted Digital Repositories (TDR) –Task specific E.g., PREMIS metadata –Institution specific requirements

5 CoV Digital Archives: Producers and Consumers

6 Digital Preservation: The Business Case Technology obsolescence Technology incompatibility Long-term access and useability

7 Alternatives – What’s out there already? Already many free/open source tools available: Repository DSpace FEDORA Greenstone Ingest Tools JHOVE DROID XENA Access Archivist’s Toolkit ICA AtoM Each only does a small part in the preservation chain, no start-to-finish single solution

8 So, what can we do with the existing tools? Can we piece all of the various components together to come up with a complete Digital Preservation system? Constraints: Use open source tools wherever possible Lightweight system architecture Architecturally independent components

9 What is OAIS? OAIS (=Open Archival Information System) ISO 14721:2003 Is a high level reference model Defacto standard for discussing digital preservation concepts at this level Important concepts include –Information Model –Functional Entities –Mandatory Responsibilities

10 OAIS Information Model Information Packages contain: –Content (records) –PDI = Preservation Description Information (metadata) –Packaging Information Three types of Information Packages: SIP = Submission Information Package (what we get) AIP = Archival Information Package (what we preserve) DIP = Dissemination Information Package (what we provide)

11 Information Package Model

12 OAIS Responsibilities Accept submissions from Producer Establish control over material Implement long-term preservation policies Determine who the users are (“designated Community”) Ensure preserved information is understandable to users Provide access

13 OAIS Functional Entities Establishes the main functional components of the system Defines the relationships of the components to each other in terms of the information that passes between them

14 OAIS Functional Entities

15 City of Vancouver Archives Implementation

16 Archivematica

17 Archivematica Pipeline

18

19

20 Ingest Workflow Summary

21 Micro-services Create SIP backupCharacterize and extract metadata Scan for viruses in submission documentation Verify SIP complianceSet file permissions Characterize and extract metadata in submission documentation Assign file UUIDs and checksumsAppraise SIP for preservationNormalize submission documentation Verify metadata directory checksums Scan for removed files post appraise SIP for preservationRemove files without PREMIS Remove thumbs.db filesCreate DIP directoryVerify PREMIS checksums Create Dublin Core templateNormalizeCompile METS Set file permissions Add Dublin Core to METS Appraise SIP for submissionApprove normalizationCopy METS to DIP directory Scan for removed files post appraise SIP for submissionCheck for submission documentationGenerate DIP Place in quarantine Move Submission Documentation into objects directorySet file permissions Remove from quarantine Assign file UUIDs and checksums to submission documentationPrepare AIP Extract packagesExtract packages in submission documentationUpload DIP Sanitize file and directory names Sanitize file and directory names in submission documentationStore AIP Scan for viruses

22 Media typeFile formats Preservation format(s)Access format(s)Normalization tool Audio AC3, AIFF, MP3, WAV, WMAWAVE (LPCM)MP3FFmpeg EmailPST MBOX readpst Office Open XML DOCX, PPTX, XLSXOriginal formatPDF for PPTXOpenOffice Plain textTXT Original format None Portable Document FormatPDF PDF/APDFGhostscript Presentation filesPPT ODFPDFOpenOffice Raster images BMP, GIF, JPG, JP2*, PCT, PNG*, PSD, TIFF, TGA Uncompressed TIFFJPEGImageMagick Raw camera files/Digital Negative format** 3FR, ARW, CR2, CRW, DCR, DNG, ERF, KDC, MRW, NEF, ORF, PEF, RAF, RAW, X3FOriginal formatJPEGImageMagick/UFRaw SpreadsheetsXLS ODFOriginal formatOpenOffice Vector images AI, EPS, SVGSVGPDFInkscape Video AVI, FLV, MOV, MPEG-1, MPEG- 2, MPEG-4, SWF, WMVMPEG-2MPGFFmpeg Word processing files DOC, WPD, RTFODFPDFOpenOffice Media Type Preservation Plans

23 GIS Preservation Questions Appropriate formats Acceptable losses during migration/normalization Availability of normalization software Availability of viewing software Necessary metadata

24

25 Archivematica Collaborators Artefactual Systems Inc. City of Vancouver Archives International Monetary Fund University of British Columbia Library Rockefeller Archive Centre

26 Documentation Wikis Vancouver Digital Archives Project http://artefactual.com/wiki/index.php?title=V ancouver_Digital_Archiveshttp://artefactual.com/wiki/index.php?title=V ancouver_Digital_Archives Archivematica http://archivematica.org/wiki Qubit (ICA-AtoM) http://qubit-toolkit.org/wiki


Download ppt "Building a Digital Archives for the City of Vancouver Glenn Dingwall 14 September, 2011."

Similar presentations


Ads by Google