□ archiving in context □ principles & processes □ examples DocLing 2016 David Nathan Archiving.

Slides:



Advertisements
Similar presentations
LSA Archiving Tutorial January 2005 Archives, linguists, and language speakers.
Advertisements

E-Content Service Group Virtual Meeting Digital Preservation: How to Get Started.
Pulling it all together… with thanks to Sheila Anderson.
Long-Term Preservation. Technical Approaches to Long-Term Preservation the challenge is to interpret formats a similar development: sound carriers From.
Digital Preservation Steps 1 & 2: Identify & Select.
Near East Plant Protection Network for Regional Cooperation & Knowledge Sharing Food and Agriculture Organization of the United Nations An Overview on.
Interoperability Scenarios All Working Groups Meeting May, Rome, Italy.
Lyndon Ormond-Parker Centre for Health and Society Centre for Cultural Materials Conservation The University of Melbourne.
Documenting the Resource Malcolm Polfreman
AIATSIS The Australian Institute of Aboriginal and Torres Strait Islander Studies Library Services in the Indigenous Context, UTS, Sydney, 27 November.
A centre of expertise in data curation and preservation MIS Seminar :: University of Edinburgh :: 2 October 2006 Funded by: This work is licensed under.
CNRIS CNRIS 2.0 Challenges for a new generation of Research Information Systems.
Common Use Cases for Preservation Metadata Deborah Woodyard-Robinson Digital Preservation Consultant Long-term Repositories:
1 Planning And Electronic Records Issues For Electronically Enhanced Courses Jeremy Rowe Nancy Tribbensee
Depositing and Disseminating Digital Resources Alan Morrison Collections Manager AHDS Subject Centre for Literature, Linguistics and Languages.
The Subject Librarian's Role in Building Digital Collections: Where Information Management and Subject Expertise Meet Ruth Vondracek Oregon State University.
1 The Australian Partnership for Sustainable Repositories Margaret Henty Digital Futures Industry Briefing November 8, 2006.
Guidelines for Accessible Information Roger Blamire, Isabelle Turmaine, Marcella Turner-Cmuchal.
Part of the Arts and Humanities Data Service and the UK Data Archive. Funded by the Joint Information Systems Committee and the Arts and Humanities Research.
NOBLE Digital Library. How does it work? The NOBLE Digital Library uses the DSpace platform. Image files and metadata are imported into DSpace using.
Rethinking language documentation & support for the 21st century David Nathan Endangered Languages Archive SOAS University of London.
In 1993 Simon Fowler defined income generation by archives as ‘those activities organised by archival staff with the aim of raising.
Agricultural Biotechnology Network for Regional Collaboration and Knowledge Sharing Food and Agriculture Organization of the United Nations An Overview.
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
Prof. Pravakar Rath Head, Department of Library and Information Science Mizoram University, Aizawl ; NACLIN
David Giaretta Associate Director (Development) Funders: DCC Development Digital Curation Centre a centre of expertise in data curation and preservation.
Chinese-European Workshop on Digital Preservation, Beijing July 14 – Network of Expertise in Digital Preservation 1 Trusted Digital Repositories,
Ingest and Dissemination with DAITSS Presented by Randy Fischer, Programmer, Florida Center for Library Automation, University of Florida DigCCurr2007.
Integrating Digital Curation in a Digital Library curriculum: the International Master DILL case study Anna Maria Tammaro University of Parma Florence,
WORKFLOWS AND OTHER CONSIDERATIONS FOR DIGITIZATION  Steve Bingo  Processing Archivist Washington State University Libraries  Alex Merrill  Assistant.
STIM Sloan-Stanford Network for the History of Technology.
24 March 2010Atlanta, Georgia Passing it on: Notes on digital initiative sustainability Marty Kurth HBCU Library Alliance – Cornell University Library.
AILLA:The Archive of the Indigenous Languages of Latin America Heidi Johnson / The University of Texas at Austin.
A CIDOC CRM – compatible metadata model for digital preservation
Towards a European network for digital preservation Ideas for a proposal Mariella Guercio, University of Urbino.
International Seminary on Digitisation: Experience and Technology Lisbon, 11th May 2004 Minerva &MinervaPLUS Benefits for Cultural Institutions and Industries.
AILLA:The Archive of the Indigenous Languages of Latin America Heidi Johnson The University of Texas at Austin Latin American Digital Library Initiative,
The FCLA Digital Archive Joint Meeting of CSUL Committees, 2005.
4 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved. Computer Software Chapter 4.
Digital preservation activities at the NLW Sally McInnes 18 September 2009.
Archival Workshop on Ingest, Identification, and Certification Standards Certification (Best Practices) Checklist Does the archive have a written plan.
1 LingDy February 14, 2012 TUFS, Tokyo David Nathan Endangered Languages Archive Hans Rausing Endangered Languages Project SOAS, University of London Data.
Sorina Stanca Director Cluj County Library, Romania 1.
Symposium on Global Scientific Data Infrastructures Panel Two: Stakeholder Communities in the DWF Ann Wolpert, Massachusetts Institute of Technology Board.
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
Digital Preservation across the technologies, strategies, open standards & interoperability aspects including the legal issues Pratik Shrivastava Scientist.
CALIMERA: Co-ordination Action Cultural Applications: Local Institutions Mediating Electronic Resource Access.
Millman—Nov 04—1 An Update on Digital Libraries David Millman Director of Research & Development Academic Information Systems Columbia University
National Library of the Czech Republic Integration of digital materials into EDL Adolf Knoll National Library of the Czech Republic Helsinki CENL Workshop.
Digital Collections Forum Doug Moncur AIATSIS September 2004.
The Importance of Standards in Digital Preservation Tina Norris Kayla Payne Jennifer
Serenate1 The librarian’s view Raf Dekeyser K.U.Leuven.
Oman College of Management and Technology Course – MM Topic 7 Production and Distribution of Multimedia Titles CS/MIS Department.
Institutional data curation implementation 1st African Digital Curation Conference 12 February 2008.
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
A Project of the University Libraries Ball State University Libraries A destination for research, learning, and friends.
1 February 2012 ILCAA, TUFS, Tokyo program David Nathan and Peter Austin Hans Rausing Endangered Languages Project SOAS, University of London Language.
Collection Description considerations in the nof-digitise programme Sarah Mitchell Programme Manager New Opportunities Fund.
A Shared Commitment to Digital Preservation and Access.
Fedora Commons Overview and Background Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
Library Council October 15, 2015 DIGITAL ARCHIVES UPDATE Creighton Barrett Digital Archivist
Grant Writing 2012 Grant Writing for Digital Projects September 2012 IODE Project Office IODE Project Office Oostende, Belgium Oostende, Belgium.
Joint Meeting of CSUL Committees,
Ingest and Dissemination with DAITSS
Summit 2017 Breakout Group 2: Data Management (DM)
Karen Dennison Collections Development Manager
Richard Waller NOF Technical Advisor UKOLN is supported by:
Bird of Feather Session
Robin Dale RLG OAIS Functionality Robin Dale RLG
Presentation transcript:

□ archiving in context □ principles & processes □ examples DocLing 2016 David Nathan Archiving

Archiving in context

Where does archiving fit in?  “traditionally”: archives museums galleries libraries education/research institutions libraries, archives, museums and galleries are “memory institutions”

Archiving skill inputs Sources speakers/performers authors historical and “legacy” providers Recordists audio and video experts data collectors/annotators/analysts Curators content/area specialists cataloguers Data managers data scientists Co-ordinators managers governance Technical practitioners IT, media & communications IT systems & software cataloguing, storage, preservation & access systems IT practitioners programmers, installers THE ARCHIVE

A definition of archiving :  a commitment by an organization to:  appraise the value of a resource  preserve the resource  make known the existence of the resource  enable access to the resource (or its ‘content’)  a commitment by an organization to:  appraise the value of a resource  preserve the resource  make known the existence of the resource  enable access to the resource (or its ‘content’)  a commitment by an organization to:  appraise the value of a resource  preserve the resource  make known the existence of the resource  enable access to the resource (or its ‘content’)  a commitment by an organization to:  appraise the value of a resource  preserve the resource  make known the existence of the resource  enable access to the resource (or its ‘content’)  a commitment by an organization to:  appraise the value of a resource  preserve the resource  make known the existence of the resource  enable access to the resource (or its ‘content’)  a commitment by an organization to:  appraise the value of a resource  preserve the resource  make known the existence of the resource  enable access to the resource (or its ‘content’)  a commitment by an organization to:  appraise the value of a resource  preserve the resource  make known the existence of the resource  enable access to the resource (or its ‘content’)

Archiving principles & processes

Archiving Acquisition & curation Storage & preservation Access & usage The virtuous loop we hope to achieve through serving community and through community participation

Archiving Acquisition & curation Storage & preservation Access & usage

Acquisition & curation creation evaluation & selection collaboration with providers & users description rights & protocol sharing & exhibiting promotion foster creation advice rights metadata completeness formats agreements work with providers community curation provenance content usages languages change history research collect & record implement good practice requirements seek resources reach & help users funding & sustainability audiences curation rights outcomes goals ▫ policies ▫ resourcing ▫ management ▫ documentation security ▫ usability ▫ organisation/technology changes ▫ evaluation & reporting

Archiving Acquisition & curation Storage & preservation Access & usage

goals ▫ policies ▫ resourcing ▫ management ▫ documentation security ▫ usability ▫ organisation/technology changes ▫ evaluation & reporting Storage & preservation analogue (things) A→DA→D digital catalogue storage integrity certification packing environment players carrier formats players digital formats identifiers file formats metadata formats migration filenames usability functions number of users media provider hardware locations copy/backup management integrity check migration

Archiving Acquisition & curation Storage & preservation Access & usage

catalogue relationships protocols delivery management → acquisition users usability accuracy completeness functions archive ↔ users, providers providers ↔ users communication negotiation share & exchange community stakeholding research formulation implementation manage responses user capabilities user needs access methods monitoring record keeping communications statistics & reports costs, business model... acquiring from users goals ▫ policies ▫ resourcing ▫ management ▫ documentation security ▫ usability ▫ organisation/technology changes ▫ evaluation & reporting

Managing data and preparing for archiving

Software to help manage data and prepare for archiving  checking file names, sizes, folder structures etc (Treesize, Everything)TreesizeEverything  changing or standardizing formats (especially of media files) Handbrake (video), Audacity (audio), XnView or paint.net (images), MS or Libre Office and Notepad++ (text) HandbrakeAudacityXnViewpaint.netLibre OfficeNotepad++  creating and managing metadata  spreadsheets and databases  SIL’s SayMoreSayMore  TLA’s ArbilArbil  Miromaa Miromaa

File formats  audio  WAV  (what if original is not WAV??)  resolution: 16 bit, 44.1KHz, stereo or better  video  changing frequently  MP4/MPEG4 or MTS/H264/AVCH  aspect, resolution: depends on project  get advice from achive before depositing

File formats  images  TIFF **OR** original from device  resolution: archive quality is 300dpi or better

File formats  text  best is plain text  PDF/A often acceptable, but may pose problems  if MS-Word or ODF, check with archive  structured data (spreadsheets, databases  original format should be supplied  provide a preservable derivative as well (eg csv, PDF/A)  common linguistic software (ELAN, Transcriber, Toolbox, Praat etc)  their file formats are generally preservable

Can I still use MS Word?  most archives no longer accept MS Word files  but Word is still useful  quicker to type up  useful tables, functions, macros etc  solutions  think “text only”  tables as spreadsheets (are they bad too?)  (advanced) complex materials formatted as styles, then export as marked up  PDF/A – but not a perfect solution

Standards  we have already mentioned some standards – UTF-8, WAV etc  there are other relevant standards, eg  ISO (language/dialect names)  metadata systems – OLAC, CMDI, METS/MODS and others  you can also establish project-local standards, eg  to handle special characters (eg \e = schwa)  data field names  document them! – for your usage and for correspondence to wider standards

Approaches to small scale archive storage

Approaches to small scale archive storage/backup  work with a large institution that can support/sponsor your storage/backup needs  partner with a number of similar centres to achieve critical mass of materials and resources, set up replication or data centre  set up local storage/backup using creative “appropriate technology” approach (e.g. using NAS unit and offsite replication (HD, SSD, tape, or cloud)  use a commercial (cloud) provider (also hybrid version – “cloud gateway”)

Examples

Archive examples – Aboriginal languages/protocol emphasis  (Aboriginal and Torres Strait Islander Data Archive) – research data related to Indigenous Australia emphasis on return of Indigenous knowledge; can assist communities with repatriation, hosting and distribution  an archive based on Mukurtu CMS emphasis on culturally appropriate and controlled access and usage (see also  (Endangered Languages Archive) - international language documentation archive with 20 Australia deposits ( emphasis on protocol-based and negotiated access to recordings and annotations

Archive examples – Aboriginal languages  (AIATSIS) - merged archive and library catalogues to “Mura” largest archive but limited operationally  (Paradisec) – Pacific and regional but much Australian content emphasis on digitization  (Living Archive of Aboriginal Languages) community-created literature gathered and “rescued” after the end of support for bilingual education emphasis on easy to use but powerful interface

Archive examples – records institutions  (State Records Office WA) - demographic, school and other records  missionary correspondence, records, registers archives  (Flint collection, UQ library) emphasis on providing awareness of (audio and written) materials

In development or not publicly available  created by Pitjatjantjatjara Council, to repatriate digital versions of cultural.community materials and to manage access to them (see also ) emphasis on usability by remote communities and detailed control of access  projects/our_story_version_2_project Community Stories, a version of Ara Irititja, enabling communities to establish a digital collections by creating, adding and repatriating content related to their own culture and history projects/our_story_version_2_project