Digital | Curation | Centre Digital Curation Centre www.dcc.ac.uk Peter Burnhill, Michael Day, David Giaretta, Liz Lyon, Robin Rice, Bridget Robinson and.

Slides:



Advertisements
Similar presentations
The Economic and Social Data Service (ESDS) Kevin Schürer ESDS/UKDA ESDS Awareness Day 5 December 2003.
Advertisements

Digital | Curation | Centre Continuing Access to Research Data: The New Digital Curation Centre Peter Burnhill Director (Phase One) Funded by:
Philip LordDigital Archiving Consultancy Alison Macdonald Digital Archiving Consultancy Liz LyonDigital Curation Centre David GiarettaDigital Curation.
A centre of expertise in data curation and preservation DCC/NeSC eScience Workshop, June 2008 Working in partnership with the eScience community This work.
The PREMIS Data Dictionary Michael Day Digital Curation Centre UKOLN, University of Bath JORUM, JISC and DCC.
A centre of expertise in data curation and preservation EAOLUG :: RSC :: Cambridge23 May 2006 Funded by: This work is licensed under the Creative Commons.
Joint Information Systems Committee Digital Library Services BL/JISC Workshop Rachel Bruce JISC Programme Director The Digital Library and its Services,
A centre of expertise in data curation and preservation DigCCur2007 Symposium, Chapel Hill, N.C., April 18-20, 2007 Co-operation for digital preservation.
A centre of expertise in data curation and preservation Preserving Digital ArchivesLUCAS March 2006 Funded by: This work is licensed under the Creative.
A centre of expertise in data curation and preservation UKOLN Open ForumIWMW June 2006 Funded by: This work is licensed under the Creative Commons.
A centre of expertise in data curation and preservation CETIS MDR SIG::28 June 2006::University of Bath Funded by: This work is licensed under the Creative.
An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
Supporting further and higher education Supporting Digital Preservation and Asset Management in Institutions eSPIDA event University of Glasgow 11 February.
INFSO-RI Enabling Grids for E-sciencE Grid & Data Preservation Boon Low System Development, EGEE Training National.
SCIDIP-ES Components Oct ,Brussels. Basic Preservation Strategies Often stated as: “Emulate or Migrate” OAIS concepts change these to: Add Representation.
Supporting further and higher education Christopher Pressler Head of Arts Collections University of London Library Archiving – UK Perspective.
Supporting education and research E-learning tools, standards and systems Sarah Porter Head of Development, JISC.
Common Use Cases for Preservation Metadata Deborah Woodyard-Robinson Digital Preservation Consultant Long-term Repositories:
Depositing and Disseminating Digital Resources Alan Morrison Collections Manager AHDS Subject Centre for Literature, Linguistics and Languages.
E-IRG Open Workshop on e-Infrastructures 4-5 Oct 2006 CASPAR Project Digital Preservation and Digital interoperability.
Co-funded by the European Union under FP7-ICT Alliance Permanent Access to the Records of Science in Europe Network Co-ordinated by aparsen.eu #APARSEN.
Digital | Curation | Centre The UK Digital Curation Centre Michael Day UKOLN, University of Bath (with thanks to Peter Burnhill, Chris Rusbridge, et al.)
© HATII, University of Glasgow Introduction to the UK ’ s Digital Curation Centre Prof Seamus Ross Visiting Fellow at Oxford Internet Institute ,
Diana Laurillard Head, e-Learning Strategy Unit Overview of e-learning: aims and priorities.
David Giaretta Associate Director (Development) Funders: DCC Development Digital Curation Centre a centre of expertise in data curation and preservation.
Ingest and Dissemination with DAITSS Presented by Randy Fischer, Programmer, Florida Center for Library Automation, University of Florida DigCCurr2007.
Integrating Digital Curation in a Digital Library curriculum: the International Master DILL case study Anna Maria Tammaro University of Parma Florence,
Science Archives in the 21st Century 25/26 April Towards an International standard for Audit and Certification of Digital Repositories David Giaretta.
Data Archiving and Networked Services DANS is an institute of KNAW en NWO Trusted Digital Archives and the Data Seal of Approval Peter Doorn Data Archiving.
Research Data Management Services Katherine McNeill Social Sciences Librarians Boot Camp June 1, 2012.
Supporting further and higher education The UK FAIR Programme: OAI in context Chris Awre OAI3, CERN, February 2004.
Towards a European network for digital preservation Ideas for a proposal Mariella Guercio, University of Urbino.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Peter Burnhill Director (Phase One) Funders: Aims & Organisation Digital Curation Centre a centre of expertise in data curation and preservation.
Digital Preservation MetaArchive Cooperative.  9:00-9:45 - Session 1: Digital Preservation Overview  9:45-11:00 - Session 2: Policy & Planning Overview.
HATHITRUST A Shared Digital Repository The HathiTrust Print Monograph Archive Planning Task Force Print Archive Network Forum ALA 2015 Annual Meeting June.
Seamus Ross Director, HATII & ERPANET Associate Director of DCC Services Funders: Service Definition & Delivery Digital Curation Centre a centre of expertise.
SEEK Welcome Malcolm Atkinson Director 12 th May 2004.
Archival Workshop on Ingest, Identification, and Certification Standards Certification (Best Practices) Checklist Does the archive have a written plan.
UK LOCKSS Alliance: Investigation into Private LOCKSS Networks Adam Rusbridge EDINA, University of Edinburgh.
Metadata for digital preservation: a review of recent developments Michael Day UKOLN, University of Bath ECDL2001, 5th European Conference.
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
Funded by: © AHDS Preservation in Institutional Repositories Preliminary conclusions of the SHERPA DP project Gareth Knight Digital Preservation Officer.
Digital Preservation across the technologies, strategies, open standards & interoperability aspects including the legal issues Pratik Shrivastava Scientist.
April 12, 2005 WHAT DOES IT MEAN TO BE AN ARCHIVES? Trusted Digital Repository Model Original Presentation by Bruce Ambacher Extended by Don Sawyer 12.
Edinburgh e-Science MSc Bob Mann Institute for Astronomy & NeSC University of Edinburgh.
UKOLN is supported by: Introduction to UKOLN Dr Liz Lyon, Director UKOLN, University of Bath, UK Grand Challenge Meeting, June a centre.
PREMIS Data Dictionary and the Future of Preservation Metadata Brian Lavoie Research Scientist OCLC Research Society of American Archivists.
From ePrints to eSPIDA: Digital Preservation at the University of Glasgow William J Nixon, Service Development DAEDALUS, University of Glasgow DPC: Digital.
OAIS Based Certification David Giaretta ERPANET WORKSHOP Antwerpen April 2004.
Dr Liz Lyon Associate Director, Outreach Funders: Engaging the Users: the Outreach & Community Support Programme Digital Curation Centre a centre of expertise.
M-1 INGEST OVERVIEW Don Sawyer National Space Science Data Center NASA/GSFC October 13, 1999.
JISC/CNI Conference Edinburgh, 26th June 2002 Challenges of Digital Preservation – do we have a road map? Maggie Jones.
The OAIS Reference Model Michael Day, Digital Curation Centre UKOLN, University of Bath Reference Models meeting,
Preservation metadata and the Cedars project Michael Day UKOLN: UK Office for Library and Information Networking University of Bath
Toward a common data and command representation for quantum chemistry Malcolm Atkinson Director 5 th April 2004.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
Institutional Repositories July 2007 DIGITAL CURATION creating, managing and preserving digital objects Dr D Peters DISA Digital Innovation South.
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
Cedars work on metadata Michael Day UKOLN, University of Bath Cedars Workshop Manchester, February 2002.
Long-term preservation and access: the UK context Michael Day, UKOLN, University of Bath RCUK Workshop on Publication.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI strategy and Grand Vision Ludek Matyska EGI Council Chair EGI InSPIRE.
New Opportunities Fund Preservation Workshop March 15th 2002 Maggie Jones Cedars Project Manager.
Data Management and Digital Preservation Carly Dearborn, MSIS Digital Preservation & Electronic Records Archivist
Fedora Commons Overview and Background Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
Practical Aspects of Preservation Peter Simpson Development Officer Arts and Humanities Data Service.
CESSDA SaW Training on Trust, Identifying Demand & Networking
Ingest and Dissemination with DAITSS
Digital Curation Centre research agenda
Digital Curation Activities at the University of Glasgow
Presentation transcript:

Digital | Curation | Centre Digital Curation Centre Peter Burnhill, Michael Day, David Giaretta, Liz Lyon, Robin Rice, Bridget Robinson and Seamus Ross Funded by:

Digital | Curation | Centre 2 Session Overview 1. Introduction & Briefing 2. Towards a Technical Model of Digital Curation: our R&D 3. Planning Delivery of Services & the Associates Network

Digital | Curation | Centre 3 1. Introduction & Briefing Background story on the DCC ‘So who’s that new kid on the block?’ What is digital curation anyway? –‘adding value’ & ‘ensuring longevity’ Aims & objectives for the DCC –‘improving the quality of what is done’ Our planning & our progress –timelines & deliverables How does this relate to the JISC Programme?

Digital | Curation | Centre 4 Background to the DCC (1) Two parallel policy concerns 1. Neglect of digital heritage, especially given investment in digitsation programmes JISC Continuing Access and Digital Preservation Strategy, –eLib Programme, eLib3, Circular 5/97: Digital Preservation Digital Preservation Coalition formed in Differing data sharing practices in eScience, especially given huge data volumes Links between eScience Programme and JISC Report commissioned by JISC Cttee for Support of Research (Lord & Macdonald, May 2003) –twin drivers: Digital Preservation & Continuing Access (e-Science) –identified need for national digital curation centre

Digital | Curation | Centre 5 Interpretation of JISC policy JISC plays 3 roles 1.promotes, supports & develop management & preservation of institutional and community digital materials for UK benefit 2.partner to Research Council/AHRB & other national/international bodies 3.as organization, appropriate grant conditions for JISC-funded creation of digital resources; good practice for JISC created/managed materials “escalating scale and complexity of digital resources to be curated and the subsequent urgency of developing a critical mass of expertise, shared services and tools, for long-term digital preservation … require a step change in investment and approaches. –“Over the next three years a greater emphasis on development of production services and tools … needed to build on previous research studies and projects.” “Digital preservation remains a challenging area in which techniques, costs, and skills are still in development: advocacy, dissemination and training, to embed preservation needs as appropriate in JISC funding programmes.”

Digital | Curation | Centre 6 Interpreting the implementation plan Risk assessment studies, eg ePrints –Calls to implement studies’ recommendations for services and integration of preservation activity & standards into repositories funded by JISC. Series of community calls to support records management and digital preservation in institutions - cf FOI compliance. Establish Digital Curation Centre to: Provide central focus of skilled staff & research links to wider network of development activity, researchers, & services Develop set of central services, standards, and tools for a range of distributed digital data centres & preservation services, across the Information Environment & Research Grid. JISC Partnership funding, –eg Web-archiving study: jointly funded by JCIE and Wellcome Trust » Digital Preservation Coalition as an independent entity with JISC membership and sector activity supported by JISC. National preservation of e-journals, through RLN/RSLG

Digital | Curation | Centre 7 Back to the DCC Background (2) JISC Circular 6/03, initially issued June 2003 –Call postponed, revised & re-issued with more significant research component –Joint funding: JISC and e-Science Core Programme –£750K pa (outreach, services & development) £250K pa (research) –Unlikely that any single organisation could do what’s expected –Expressions of Interest & Full Proposals from Consortia –Final selection made in December 2003 –Negotiations & clarification in January 2004

Digital | Curation | Centre 8 Designation of DCC Task entrusted to Consortium of four institutional partners –Universities of Edinburgh (lead), Glasgow & Bath together with CCLRC (Rutherford Appleton and Daresbury Laboratories) –brought together through the National eScience Centre jointly managed by Universities of Edinburgh & Glasgow Two 3-year awards made: –JISC funding started on 1st March 2004 –EPSRC grant-funded starts on 1st September 2004 Phase One set-up –some ‘early deliverables’ of website & helpdesk –preparation for full operation & launch of services in October –planning formal opening for early November 2004

Digital | Curation | Centre 9 Responsibilities across the DCC Them with titles … –Peter Burnhill, Director (Phase One) with Robin Rice, Phase One Project Co-ordinator (from EDINA & Data Library, University of Edinburgh) –Peter Buneman Research Director (& PI on EPSRC grant) Professor of Informatics, University of Edinburgh –Liz Lyon, Associate Director (Community Support & Outreach) Director of UKOLN, University of Bath –Seamus Ross, Associate Director (Service Definition & Delivery) Director of HATII [ERPANET], University of Glasgow –David Giaretta, Associate Director (Development) Head of Astronomical Software & Services, CCLRC Two significant & well known ‘Ex Portfolio’ names –Malcolm Atkinson, Director, NeSC –Chris Rusbridge, Director, Information Services, UofGlasgow

functional management & collaboration Industry research collaborators standards bodies testbeds & tools communities of practice: users community support & outreach research development co-ordination service definition & delivery management & admin support curation organisations eg DPC Collaborative Associates Network of Data Organisations

Digital | Curation | Centre 11 What is this digital curation anyway? The term Digital Curation is a new invention. Digital Data Curation Task Force - Report of Strategy Discussion Day (2002) –citing Tony Hey citing use by Dr John Taylor, Director General of the Research Councils, to distinguish the actions involved in caring for digital data beyond its original use, from digital preservation. The concept’s reach extends beyond libraries. – The e-Science Curation Report (2003) proposed the following distinctions: –Curation : managing & promoting the use of data from point of creation, to ensure fit- for-contemporary-purpose, available for discovery & re-use. For dynamic datasets this may mean continuous enrichment or updating to keep it fit for purpose. Higher levels of curation will involve maintaining links with annotation & with other published materials. –Archiving : curation activity which ensures that data are properly selected, stored, can be accessed logical and physical integrity is maintained over time, including security and authenticity. –Preservation : activity within archiving in which specific items of data are maintained over time so that they can still be accessed and understood through changes in technology.

Digital | Curation | Centre 12 digital curation:... digital objects and data, over their life-cycle, for current & future generations of use... = f(data curation & digital preservation) data curation [when high current/ongoing interest] –actions needed to maintain and utilise digital data & research results over entire life-cycle –data creation & management; adding value; generating new sources of information & knowledge, for use digital preservation [for longevity;fall off in interest] –long-run technological/legal accessibility & usability –storage, maintenance & accessibility of information content in digital material over the long-term, for use OAIS concept of designated community Digital curation redefined...

Digital | Curation | Centre 13 Data curation in action Astronomy Integrating and analysing distributed data (AstroGrid) publishing multi-TB sky surveys (SuperCOSMOS & WFCAM) interoperability standards (IVO Alliance) BioInformatics data publishing: generic tools for XML export (EBI Biomart) annotation tools for massive data sets (Pubmed, VOTable) archiving tools for dynamic data sets (biological DBs) Environmental sciences spatio-temporal annotation (OS Mastermap/ Mouse Atlas) Document management Repository certification (RLG Task Force)

Digital | Curation | Centre 14 Digital preservation approaches Migration & Refreshment Emulation & Encapsulation Digital Archaeology & Rescue Document Format Specification Robin Rice & Najla Semple,

Digital | Curation | Centre 15 Communities of Practice: Social Sciences (IASSIST) History of sharing – economical in terms of both data collector and respondent Data about humans – problems of confidentiality confronted early on Mixed blessing of agreed proprietary formats (OSIRIS, SPSS, etc.) allows migration ‘Future-proofing’ - 30 years of data advocacy! –Tradition of data archiving & data citation –Building new data standards out of common experience data archivists, & data librarians: the new digital curators?

Digital | Curation | Centre 16 Unifying Themes for D C C ‘data as evidence’ –for one or more designated communities ‘archival responsibility’ –at one or more institutional levels –with institutional policies & individuals’ competence engage/discover communities of practice, to invoke/provoke good practices –appraisal & retention/disposal –logical & physical integrity: authenticity/security research problems in productive research domains –eg Informatics, Law School

Digital | Curation | Centre 17 Aims & Objectives for the DCC ‘quality improvement in data curation & digital preservation’ –Initial focus: data as evidence for scholarly conclusions –Wider remit: worlds of scholarly communication & eLearning twin aims:excellence in research & excellence in service need to bridge across communities: –universities & research institutes –scientific data tradition & document tradition –multi-sectoral, international

Digital | Curation | Centre 18 We are all curators now... The term “curation” builds on our understanding of the word “curator” –who keeps something for the public good, value of which often needs to be brought out by the curator. 1. this open context implies more support for explicit policies with regard to data sharing, and it has major implications for structuring and tools. 2. the digital curator as ‘store-keeper’ closely linked to promoting new science, looking forward to identify new ways to serve present and future researchers. digital curator should take an active role in promoting and adding value to holdings – manage the value of collection –adding links and annotation to provide context –recording provenance of changes made

Digital | Curation | Centre 19 Planning & Progress We must plan for the Long, with our 2020 Vision - 15yrs –we have large territory, and large expectation multi-disciplinary, multi data type, multi tradition/profession national and international, but also local and hidden from view a lot is going on –how to ensure that we do something sensible with the ££’s and the trust we have been given? –who/what should we plan to affect/effect? policy-makers; ‘responsible curators’; (researchers?) how do we wish to be judged, and when? collaboration & win-win-win scenarios

Digital | Curation | Centre 20 focii of attention in set-up phase Users: client, peer and policy communities –outreach & community support; service definition/delivery; development co-ordination; research agenda –user requirements analysis: Leona Carpenter (Focus Groups) Consortium: ‘organisation’ from partner participation –roles; commitment; norming/performing; operational communication; consortium agreement (IPR) Employers: institutional settings –re-deployment/appointments; accommodation; commitment/reporting -> Project Plan, as living document

Digital | Curation | Centre 21 weekly AccessGrid/telecon; two face2face meetings –defining programme of deliverables; re-deploying & recruiting staff; planning appointment of full time director in time for Launch early ‘deliverables’: – with links, presentations & progress updates for contacts & offers of collaboration project plan submitted to JISC, late May 2004 defining R & D programme & services for delivery eg curation architecture; repository of tools & technical information engaging curators in existing community of practice Phase One Progress, March -

Digital | Curation | Centre Towards a Technical Model of Digital Curation: our R&D David Giaretta Funded by:

Digital | Curation | Centre 23 What can we rely on in the Long Term The bits - BIT PRESERVATION Paper documents that people can read –ISO standards The information we collect – either in the far future DCC or its successor Some kind of remote access Some kind of computers People?

Digital | Curation | Centre 24 Preservation “vs” Current Use There are already very many architectures to support immediate use of information –Including JISC architecture –Aim to support these Therefore chose to be guided by –long-term preservation aspects –to promote this we should emphasise “interoperability” and “automated use” as far as possible. –based initially on OAIS Reference Model – but add other ideas later –bear e-Science in mind

Digital | Curation | Centre 25 OAIS Reference Model – Functional Model

Digital | Curation | Centre 26 OAIS – Preservation Planning - key aspects Representation Net Designated Communities & Knowledge Base

Digital | Curation | Centre 27 Representation Net

Digital | Curation | Centre 28 Preservation Issues Given a file or a stream of bits how does one know what Representation Information is needed (this question applies to Representation Information itself as well as to the digital objects we are primarily interested in preserving and using); how does one know, for example, if this thing is in FITS format? Someone may simply “know” what it is and how to deal with it i.e. the bits are within the Knowledge Base One may be able to recognise the format by looking for various types of patterns. One may feed the bits into all available interpreters to see which accept the data as valid Other means…. The only safe way: have an associated label which points to the appropriate Representation Information –Note this does not exclude the other methods e.g. for data rescue

Digital | Curation | Centre 29 High Level View Example of use of Representation Information Labelling

Digital | Curation | Centre 30 Implications A label must be attached to each piece of digital object as a necessary (but not sufficient) condition for long-term preservation –logical attachment or packaging TBD by the DCC. The label should at least identify Representation Information. For long-term preservation this label must therefore be a DCC persistent identifier. –allow some normalisation In order for the Representation Information to be persistent then it should either be held with the data object itself or be part of a central repository – part of the DCC. Thus the DCC needs a DCC Representation Information Repository. This repository would include –a Format Repository (covering structural information) *automated use would be supported by use of formal description languages such as EAST (ISO 15889, ) or DFDL ( –a Semantic Repository with, for example, Data Dictionaries and Ontologies –Software Repository – with appropriate emulation capabilities Each piece of digital RI is also a digital object – which is understood either by the users’ Knowledge Base OR by further Representation Information. Therefore each piece of RI also has a label pointing to further RI.

Digital | Curation | Centre 31 Designated Community Techniques must be created for –defining a Knowledge Base –linking a Knowledge Base to a Designated Community –linking Representation Information to a Knowledge Base if possible

Digital | Curation | Centre 32 Representation Information (1) Structure – including Formats –Distinguish formats which are used mainly for rendering – to be followed by human inspection, and formats used for automated processing Implications: –Representation Information Repository should define selected file formats using EAST and DFDL –Definitions should include scientific objects and humanities objects

Digital | Curation | Centre 33 Representation Information (2) Semantics –Hard problem start with Data Dictionaries –Implications:  the Representation Information Repository should include Data Dictionaries, followed by more general semantics

Digital | Curation | Centre 34 Representation Information (3)  Time Dependent Information –Many, perhaps most, datasets change over time and the state at each particular moment in time may be important. It may be useful to break the issue into separate parts. at each moment in time we could, in principle, take a snapshot and store it. That snapshot has its associated Representation Net. efficient storage of a series of snapshots may lead one to store differences or include time tags in the data (see for example P.Buneman, S. Khanna, and Wang-Chiew Tan. On the Propagation of Deletions and Annotations through Views. Proc.21st ACM Sym. on Principles of Database Systems.). –Additional Representation Information would be needed which describes how to get to a particular time's snapshot from the efficiently encoded version. –Also applies to ANNOTATION – who said what and when did they say it –Implications: These are area of active research within the consortium and the DCC should be able to provide –advice and well tested tools for certain forms of efficient encoding of time dependent information –advice on annotation –identifiers and Representation, perhaps in the form of software, for the associated encodings

Digital | Curation | Centre 35 Representation Information (4) Actions and Processes (Behaviour?) –Some information has, as an integral part of its content, an implicit or explicit process associated with it – this could be argued to be a type of semantics, however it is probably sufficiently different to need special classification. An examples of this is a database or other time dependent or reactive system such as a Neural Net. –Emulations – Universal Virtual Computer (UVC) –Implications: Support Software emulation via a UVC (possibly based on JVM) Support time dependent or reactive systems

Digital | Curation | Centre 36 Persistent Ids Implications: –Use of existing, or creation of new, infrastructure (standards, protocols, servers etc) for persistent IDs with adequate flexibility and longevity as part of the succession planning, agreement would be needed with appropriate organisation to act as backup and inheritor of DCC data.

Digital | Curation | Centre 37 Archival Information Package

Digital | Curation | Centre 38 Preservation Description Info

Digital | Curation | Centre 39 AIP implications – PDI define standard Preservation Metadata – based initially on OCLC work – including Michael Day’s work and also CCLRC work etc define adequate Packaging technique – almost certainly XML based define recommended tools and procedures for creating Fixity Information such as checksums and digests, together with associated Representation Information investigate authentication systems

Digital | Curation | Centre 40 Audit and Certification Implications: –facilitate production of standard(s) on which a certification program can be based –work to establish accreditation and certification body in preparation for offering audit and certification services –audit, certification and accreditation are potential sources of long term funding for the DCC –software certification will require testbeds and testing procedures. Hardware and software systems will need to be purchased, hired or borrowed. The DCC associates would be useful partners. We might expect Commercial software to be offered to us by the manufacturer for testing Testing commercial software could be fee based.

Digital | Curation | Centre 41 Implications for Research Research needed on Representation Information (Structure and Semantics) e.g. –Investigate fundamental limitations of bit-level descriptions and existing tools. –Contribute to DFDL definition –Investigate capabilities needed to describe rendered format (including Word, PDF etc) Data Virtualisation – define Science objects and “Humanities” objects Research is needed to: –Support Software emulation via a UVC (possibly based on JVM) –Support time dependent or reactive systems Research is needed to provide a solid basis on which we can develop persistent IDs with adequate flexibility and longevity Research is needed to allow the DCC to: –define standard Preservation Metadata – based initially on OCLC work –define adequate Packaging technique – almost certainly XML based –investigate authentication systems with a view to preparing recommendations for users and consider offering, for example, a (fee-based) key storage service. A rigorous theoretical basis must be put in place from which we can create techniques for: –defining a Knowledge Base –linking a Knowledge Base to a Designated Community –linking Representation Information to a Knowledge Base if possible

Digital | Curation | Centre 42 Curation Manual Put in place quickly using international experts Updates annually Build to “curation encyclopaedia”

Digital | Curation | Centre 43 Document format specification They borrowed from records management tradition - institutions to create documents in standard or open formats, which are easier to preserve. Much easier to do in a strict records management environment with a published policy of retention schedules and a clear knowledge of internally produced records. Stipulating a specific file format is harder in a research environment where a wide range of digital materials are produced and have to be preserved. The move to DDI DTD in social science data world may be seen as an example of this preservation technique.

Digital | Curation | Centre 44 Services & Development Turns Research into ‘Products for Research’ that our communities can use with confidence –tracking and testing tools and standards that are correct, usable, reliable, well documented e.g. for ingest, repository management, data exchange, ontologies working with tool developers wherever possible developing testbeds & interworking with other testbeds –aim to gain leverage formats working with other projects worldwide using generic tools and techniques –to develop strategies for emerging digital formats –Metadata standards long-term viability of metadata Registries underpin, to provide basis of Advisory Service

Digital | Curation | Centre Scientist Research Process Secondary (derived) data Tertiary data for publication Primary publication Secondary publication Tertiary publication Peer Review Pre-prints & e-Prints Publication archives Library - Peers - Public - Industry Publication Process Primary data Web Content Patent data Research Process Level 1 curation © Philip Lord, 2003

Digital | Curation | Centre Scientist Research Process Secondary (derived) data Tertiary data for publication Primary publication Secondary publication Tertiary publication Peer Review e-Prints Publication archives Library - Peers - Public - Industry Publication Process Primary data Web Content Patent data Research Process Research based on data Metadata Archivist © Philip Lord, 2003 Level 2 curation Archived data

Digital | Curation | Centre Scientist Research Process Secondary (derived) data Tertiary data for publication Primary publication Secondary publication Tertiary publication Peer Review e-Prints Publication archives Library - Peers - Public - Industry Publication Process Primary data Web Content Patent data Research Process Research based on data Metadata Curation Curator Curation Process Data repositories © Philip Lord, 2003 Level 3 curation Archived data

Digital | Curation | Centre 48 Faith in the medium ?

Digital | Curation | Centre 49 Faith in the technology