Download presentation
Presentation is loading. Please wait.
Published byRandolph Watts Modified over 9 years ago
1
Building a Digital Archives for the City of Vancouver Glenn Dingwall glenn.dingwall@vancouver.ca 14 September, 2011
2
Project Context 2004-2006VanRIMS Classification Project 2008-2009VanDOCS ERDMS Project 2009-2010Olympic Legacy Project
3
Project Phases I - Proof of Concept (2008-2009) Public records Controlled creation environment II – Prototype (2009-2010) Private records Uncontrolled creation environment
4
Initial Assumptions Use OAIS (Open Archival Information System Reference Model) as a starting point Progressively add to requirements, drawing from: –General Preservation Standards InterPARES RLG/OCLC Trusted Digital Repositories (TDR) –Task specific E.g., PREMIS metadata –Institution specific requirements
5
CoV Digital Archives: Producers and Consumers
6
Digital Preservation: The Business Case Technology obsolescence Technology incompatibility Long-term access and useability
7
Alternatives – What’s out there already? Already many free/open source tools available: Repository DSpace FEDORA Greenstone Ingest Tools JHOVE DROID XENA Access Archivist’s Toolkit ICA AtoM Each only does a small part in the preservation chain, no start-to-finish single solution
8
So, what can we do with the existing tools? Can we piece all of the various components together to come up with a complete Digital Preservation system? Constraints: Use open source tools wherever possible Lightweight system architecture Architecturally independent components
9
What is OAIS? OAIS (=Open Archival Information System) ISO 14721:2003 Is a high level reference model Defacto standard for discussing digital preservation concepts at this level Important concepts include –Information Model –Functional Entities –Mandatory Responsibilities
10
OAIS Information Model Information Packages contain: –Content (records) –PDI = Preservation Description Information (metadata) –Packaging Information Three types of Information Packages: SIP = Submission Information Package (what we get) AIP = Archival Information Package (what we preserve) DIP = Dissemination Information Package (what we provide)
11
Information Package Model
12
OAIS Responsibilities Accept submissions from Producer Establish control over material Implement long-term preservation policies Determine who the users are (“designated Community”) Ensure preserved information is understandable to users Provide access
13
OAIS Functional Entities Establishes the main functional components of the system Defines the relationships of the components to each other in terms of the information that passes between them
14
OAIS Functional Entities
15
City of Vancouver Archives Implementation
16
Archivematica
17
Archivematica Pipeline
20
Ingest Workflow Summary
21
Micro-services Create SIP backupCharacterize and extract metadata Scan for viruses in submission documentation Verify SIP complianceSet file permissions Characterize and extract metadata in submission documentation Assign file UUIDs and checksumsAppraise SIP for preservationNormalize submission documentation Verify metadata directory checksums Scan for removed files post appraise SIP for preservationRemove files without PREMIS Remove thumbs.db filesCreate DIP directoryVerify PREMIS checksums Create Dublin Core templateNormalizeCompile METS Set file permissions Add Dublin Core to METS Appraise SIP for submissionApprove normalizationCopy METS to DIP directory Scan for removed files post appraise SIP for submissionCheck for submission documentationGenerate DIP Place in quarantine Move Submission Documentation into objects directorySet file permissions Remove from quarantine Assign file UUIDs and checksums to submission documentationPrepare AIP Extract packagesExtract packages in submission documentationUpload DIP Sanitize file and directory names Sanitize file and directory names in submission documentationStore AIP Scan for viruses
22
Media typeFile formats Preservation format(s)Access format(s)Normalization tool Audio AC3, AIFF, MP3, WAV, WMAWAVE (LPCM)MP3FFmpeg EmailPST MBOX readpst Office Open XML DOCX, PPTX, XLSXOriginal formatPDF for PPTXOpenOffice Plain textTXT Original format None Portable Document FormatPDF PDF/APDFGhostscript Presentation filesPPT ODFPDFOpenOffice Raster images BMP, GIF, JPG, JP2*, PCT, PNG*, PSD, TIFF, TGA Uncompressed TIFFJPEGImageMagick Raw camera files/Digital Negative format** 3FR, ARW, CR2, CRW, DCR, DNG, ERF, KDC, MRW, NEF, ORF, PEF, RAF, RAW, X3FOriginal formatJPEGImageMagick/UFRaw SpreadsheetsXLS ODFOriginal formatOpenOffice Vector images AI, EPS, SVGSVGPDFInkscape Video AVI, FLV, MOV, MPEG-1, MPEG- 2, MPEG-4, SWF, WMVMPEG-2MPGFFmpeg Word processing files DOC, WPD, RTFODFPDFOpenOffice Media Type Preservation Plans
23
GIS Preservation Questions Appropriate formats Acceptable losses during migration/normalization Availability of normalization software Availability of viewing software Necessary metadata
25
Archivematica Collaborators Artefactual Systems Inc. City of Vancouver Archives International Monetary Fund University of British Columbia Library Rockefeller Archive Centre
26
Documentation Wikis Vancouver Digital Archives Project http://artefactual.com/wiki/index.php?title=V ancouver_Digital_Archiveshttp://artefactual.com/wiki/index.php?title=V ancouver_Digital_Archives Archivematica http://archivematica.org/wiki Qubit (ICA-AtoM) http://qubit-toolkit.org/wiki
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.