ADASS 24-26 Sept 2007 1 Trusted Data Repositories David Giaretta STFC and Director of CASPAR and Associate Director UK Digital Curation Centre.

Slides:



Advertisements
Similar presentations
Criteria for the trustworthiness of data centres Jens Klump Helmholtz Centre Potsdam German Research Centre for Geosciences (GFZ) DataCite Summer Meeting.
Advertisements

Platter Planning Tool For Trusted Electronic Repositories
The PREMIS Data Dictionary Michael Day Digital Curation Centre UKOLN, University of Bath JORUM, JISC and DCC.
Long-Term Preservation. Technical Approaches to Long-Term Preservation the challenge is to interpret formats a similar development: sound carriers From.
May 16, 2012EDMC Workshop in College Park MDDan Kowal Trusted Digital Repositories: A New Audit Standard A Follow-on to the OAIS Dan Kowal, Data Administrator,
Digital Preservation and Trusted Digital Repositories Priscilla Caplan Florida Center for Library Automation ALA 2005 Chicago IL.
International Audit and Certification of Digital Repositories PV 2009 David Giaretta.
DigCCurr 2007: What digital curators do and what they need to know The CASPAR view on: What digital curators do and what they need to know : Research Perspectives.
Trusted Digital Archives. Experiences from the Landesarchiv Baden-Württemberg, nestor and DIN Dr. Christian Keitel Johannesburg, 27/2/2013.
Project Overview APA Conference 2012 ESA/ESRIN (Frascati), 6-7 November 2012 D. Giaretta (APA)
An Introduction June 17, 2013 Open Archival Information System (OAIS)
Digital Preservation - Its all about the metadata right? “Metadata and Digital Preservation: How Much Do We Really Need?” SAA 2014 Panel Saturday, August.
SCIDIP-ES Components Oct ,Brussels. Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation.
Certification of Trustworthy Digital Repositories Arnold Rots Harvard-Smithsonian CfA.
Repository audit and risk profiles: trust through transparency
By Eileen Clegg Digital Preservation at Columbia in the Old Days (2009)
TRAC / TDR ICPSR Trustworthy Digital Repositories.
ISO Process for Audit and Certification of Digital Repositories Partnerships in Innovation II: From Vision to Reality and Beyond STANDARDS AND POLICIES.
Future Access to the Scientific and Cultural Heritage – A shared Responsibility Birte Christensen-Dalsgaard State and University Library.
Co-funded by the European Union under FP7-ICT Alliance Permanent Access to the Records of Science in Europe Network Co-ordinated by aparsen.eu #APARSEN.
Long-term Archive Service Requirements draft-ietf-ltans-reqs-00.txt.
Who is doing a good job in digital preservation? Audit and Certification of Digital Repositories: ISO and the European Framework.
An Overview of Selected ISO Standards Applicable to Digital Archives Science Archives in the 21st Century 25 April 2007 Donald Sawyer - NASA/GSFC/NSSDC.
David Giaretta Associate Director (Development) Funders: DCC Development Digital Curation Centre a centre of expertise in data curation and preservation.
Chinese-European Workshop on Digital Preservation, Beijing July 14 – Network of Expertise in Digital Preservation 1 Trusted Digital Repositories,
Ingest and Dissemination with DAITSS Presented by Randy Fischer, Programmer, Florida Center for Library Automation, University of Florida DigCCurr2007.
Ensuring Enduring Access: A Forum on Digital Preservation, July 21, 2009.
Science Archives in the 21st Century 25/26 April Towards an International standard for Audit and Certification of Digital Repositories David Giaretta.
Repository Requirements and Assessment August 1, 2013 Data Curation Course.
Data Archiving and Networked Services DANS is an institute of KNAW en NWO Trusted Digital Archives and the Data Seal of Approval Peter Doorn Data Archiving.
BNSC Report Fall 2007 David Giaretta. CASPAR Consortium Integrated project Total spend 16MEuro.
OAIS Open Archival Information System. “Content creators, systems developers, custodians, and future users are all potential stakeholders in the preservation.
Reference Model for an Open Archival Information System (OAIS) ESIP Summer Meeting John Garrett – ADNET Systems at NASA/GSFC ESIP Summer Meeting.
OAIS in the Library Environment Managing and Preserving Electronic Resources FLICC/CENDI Washington DC, December 11,2001 Anne Van Camp RLG, Member Initiatives.
DigCCurr Professional Institute: Curation Practices for the Digital Object Lifecycle Digital Curation Program Development Nancy Y McGovern Research Assistant.
Data Preservation Creating trustworthy archives. Digital Preservation does not happen by accident  To preserve digital information, we need to take careful,
DAITSS: Dark Archive in the Sunshine State Priscilla Caplan, Florida Center for Library Automation DCC Workshop on Long-term Curation within Digital Repositories.
Libraries, Archives, and Digital Preservation: The Reality of What We Must Do Leslie Johnston Acting Director, National Digital Information Infrastructure.
The FCLA Digital Archive Joint Meeting of CSUL Committees, 2005.
Digital Preservation: Current Thinking Anne Gilliland-Swetland Department of Information Studies.
Digital preservation activities at the NLW Sally McInnes 18 September 2009.
Archival Workshop on Ingest, Identification, and Certification Standards Certification (Best Practices) Checklist Does the archive have a written plan.
Metadata for digital preservation: a review of recent developments Michael Day UKOLN, University of Bath ECDL2001, 5th European Conference.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
April 12, 2005 WHAT DOES IT MEAN TO BE AN ARCHIVES? Trusted Digital Repository Model Original Presentation by Bruce Ambacher Extended by Don Sawyer 12.
NDSR Boston webinar: Digital Preservation Introduction Presenter: Nancy Y McGovern October 2015.
OAIS Based Certification David Giaretta ERPANET WORKSHOP Antwerpen April 2004.
Preserving Electronic Mailing Lists as Scholarly Resources: The H-Net Archives Lisa M. Schmidt
The OAIS Reference Model Michael Day, Digital Curation Centre UKOLN, University of Bath Reference Models meeting,
Preservation metadata and the Cedars project Michael Day UKOLN: UK Office for Library and Information Networking University of Bath
ARIADNE is funded by the European Commission's Seventh Framework Programme Archiving and Repositories Holly Wright.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
Institutional Repositories July 2007 DIGITAL CURATION creating, managing and preserving digital objects Dr D Peters DISA Digital Innovation South.
SEDAC Long-Term Archive Development Robert R. Downs Socioeconomic Data and Applications Center Center for International Earth Science Information Network.
An overview of the Reference Model for an Open Archival Information System (OAIS) Michael Day, Digital Curation Centre UKOLN, University.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
Data Management and Digital Preservation Carly Dearborn, MSIS Digital Preservation & Electronic Records Archivist
DP Knowhow: Introduction to Audit and Certification in ISO APARSEN-EGI Community Workshop on Managing, Computing and Preserving Big Data for Research.
Co-funded by the European Union under FP7-ICT Alliance Permanent Access to the Records of Science in Europe Network aparsen.eu #APARSEN Options.
DP Knowhow: Open Archival Information Systems (OAIS) in ISO APA/C-DAC International Conference on Digital Preservation and the Development of Trusted.
Trusted Repository Systems Overview
Hannes Kulovits, Andreas Rauber Vienna University of Technology
D33.1B PEER REVIEW OF DIGITAL REPOSITORIES
DAITSS: Dark Archive in the Sunshine State
Trustworthiness of Preservation Systems
FAIR Data Management, Trustworthy Digital Repositories and Business Continuity / Disaster Preparedness
Certifying Preservation Actions - TRAC and related initiatives
Digital Preservation and Trusted Digital Repositories
Robin Dale RLG OAIS Functionality Robin Dale RLG
Presentation transcript:

ADASS Sept Trusted Data Repositories David Giaretta STFC and Director of CASPAR and Associate Director UK Digital Curation Centre

ADASS Sept The need for Trustable Repositories Task Force on Archiving of Digital Information (1996) declared, – “a critical component of digital archiving infrastructure is the existence of a sufficient number of trusted organizations capable of storing, migrating, and providing access to digital collections.” –“a process of certification for digital archives is needed to create an overall climate of trust about the prospects of preserving digital information.” A recurring request in many subsequent studies and workshops

ADASS Sept Digital Preservation… Easy to do… …as long as you can provide money forever Easy to test claims about repositories… …as long as you live a long time Reference Model for Open Archival Information System (OAIS) provides an approach

ADASS Sept Key OAIS Concepts Claiming “This is being preserved” is untestable –Essentially meaningless How can we make it testable? –Claim to be able to continue to“do something” with it Understand/use –Need Representation Information Still meaningless… –Things are too interrelated Representation Information potentially unlimited –Designated Community Many other concepts identified –Checklist – not just blanket term of “metadata”

ADASS Sept Information is the important thing What information? –Documents…… –Data……. Original bits? Look and feel? Behaviour? Performance? Explicit/ Implicit/ Tacit Information : Any type of knowledge that can be exchanged. In an exchange, it is represented by data. Long Term is long enough to be concerned with the impacts of changing technologies, including support for new media and data formats, or with a changing user community. Long Term may extend indefinitely. Ensure that the information to be preserved is Independently Understandable to (and usable by) the Designated Community.

ADASS Sept Representation Information The Information Model is key Recursion ends at KNOWLEDGEBASE of the DESIGNATED COMMUNITY (this knowledge will change over time and region) Information Object Representation Information 1+ interpreted using 1+ Data Object interpreted using Physical Object Digital Object Bit Sequence 1+

ADASS Sept FITS FILE FITS STANDARD PDF STANDARD FITS JAVA s/w JAVA VM PDF s/w FITS DICTIONARY SPECIFICATION UNICODE SPECIFICATION XML SPECIFICATION

ADASS Sept Changes in hardware and software Change of hardware operating system compilers/libraries/drivers Affecting ability to run the Data Management System the specialised data processors the format/auxiliary data converters.

ADASS Sept Changes in environment Change of software licence/copyright affecting ability to read data Evolution and merging of organisations affecting access rights Dependencies on external info –e.g. DTD, Schema

ADASS Sept End of the archive Need to hand things on –“the bits” –and much else besides Risk losing: –linkage of files to experimental programs –ability to operate specialised programs –ability to link data file to system files e.g. log files

ADASS Sept Changes in copyright ownership and legal restrictions on materials supporting knowledge base A core requirement for the preservation of the knowledge extract from and usability of the data set are copyrighted or otherwise restricted materials which include: –Journal Articles –Bibliographies –Books (standard texts) Copyright restriction and ability to deal copyright issues is major reason for much of the material of this type of material not being added to the archive.

ADASS Sept Retirement of key personnel affecting knowledge base Lose ability to –understand linkages between archive holdings –locate key support texts –maintain Ingest process –maintain other links which have not been formally recorded

ADASS Sept Authenticity Traditionally archivists have been extremely concerned about authenticity –Central to any “archivist” discussion about preservation InterPARES project: –“When we refer to an electronic record, we consider it essentially intact and uncorrupted if the message that it is meant to communicate in order to achieve its purpose is unaltered.” Document, business process oriented Does this apply to data? Maintaining authenticity –Technical aspects –Social aspects Perhaps not so important for science data in the short term but will become increasingly important over time hash codes etc do I trust him/her? What is “the” message intent of data?

ADASS Sept Trusted Digital Repositories Invited group, hosted by Research Library Group (RLG) Concerned with organisational and financial issues Trusted Digital Repositories: Attributes and Responsibilities (TDR) –

ADASS Sept RLG/NARA Working Group International group of individuals selected by RLG/NARA Representatives of TDR document OAIS standard Various types of archive Combine –TDR – financial, organisational infrastructure –OAIS – technical issues Produced “Trustworthy Repositories Audit & Certification: Criteria and Checklist” (TRAC)

ADASS Sept Outline of TRAC (1): Organisational Infrastructure A1. Governance and organizational viability A2. Organizational structure and staffing A3. Procedural accountability and policy framework A4. Financial sustainability A5. Contracts, licenses, and liabilities Organizational infrastructure includes but is not restricted to these elements: Governance Organizational structure Mandate or purpose Scope Roles and responsibilities Policy framework Funding system Financial issues, including assets Contracts, licenses, and liabilities Transparency

ADASS Sept Outline of TRAC (2): Digital Object Management B1: Ingest: acquisition of content: –The initial phase of ingest that addresses acquisition of digital content. B2: Ingest: creation of the archivable package: –The final phase of ingest that places the acquired digital content into the forms, often referred to as Archival Information Packages (AIPs), used by the repository for long-term preservation. B3: Preservation planning –Current, sound, and documented preservation strategies along with mechanisms to keep them up to date in the face of changing technical environments. B4: Archival storage & preservation/maintenance of AIPs –Minimal conditions for performing long-term preservation of AIPs. B5: Information management –Minimal-level metadata to allow digital objects to be located and managed within the system. B6: Access management –The repository’s ability to produce and disseminate accurate, authentic versions of the digital objects.

ADASS Sept Outline of TRAC (3): Technologies, Technical Infrastructure, & Security C1: General system infrastructure requirements. C2: Appropriate technologies, building on the system infrastructure requirements, with additional criteria specifying the use technologies and strategies appropriate to the repository’s designated community(ies). C3: Security–from IT systems, such as servers, firewalls, or routers to fire protection systems and flood detection to systems that involve actions by people

ADASS Sept Critique of TRAC Closed process –Single review of draft document Many changes based on unpublished “test audits” Underplays “understandability” –Important for data –Assumed not to be important for “documents” Simple list – –Do ALL boxes have to be ticked? –What does a “tick” mean anyway? Link to other standards –ISO 17799/27001 for security (overlap with TRAC section C) –ISO 9000 – say what you do and do what you say –but impractical to demand multiple independent audits

ADASS Sept ISO process status New group set up with the primary aim of producing an ISO standard –Repository Audit and Certification (RAC) OPEN process –Wiki open to all –Mailing list open to all –Virtual meetings normally every week –See Into ISO via CCSDS – same route as OAIS –Some organisational/procedural changes in CCSDS Currently a Birds of a Feather (BoF) group –To demonstrate adequate support for the work Subsequently should become a Working Group Documents agreed by the WG will then be reviewed by CCSDS and more broadly via international ISO review process

ADASS Sept Current status Reviewing and comparing –TRAC –NESTOR –DCC documents Do we need another ISO standard? –Could we could simply add to existing standards e.g. ISO –The view is that ISO CANNOT be modified adequately It’s view of Information is too limited Started drafting a straw man document –Taking TRAC and add concepts from other docs

ADASS Sept Key Issues How to get from a checklist to an international accreditation/ certification system? Evidence – short term Evidence – long term –The real crunch! Quantification –The marking system Levels of audit? –External review –Internal maturity

ADASS Sept The Market Transparency Trustable? –certified by whom? –to what level? –what evidence? –for what Designated Community relevant/sensible? What cost?

ADASS Sept Links RAC group Wiki: – TRAC document – Digital Curation Centre – CASPAR project –EU project on digital preservation – Science, Culture and Arts data Infrastructure, tools and detailed case studies – what does one need to actually “understand” the data? –