A centre of expertise in data curation and preservation CILIPs Branch/Group Day :: 27 September 2006 :: Dundee Funded by: This work is licensed under the.

Presentation transcript:

a centre of expertise in data curation and preservation
CILIPs Branch/Group Day :: 27 September 2006 :: Dundee
Funded by:
This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 2.5 UK: Scotland License. To view a copy of this license, visit sa/2.5/scotland/ ; or, (b) send a letter to Creative Commons, 543 Howard Street, 5th Floor, San Francisco, California, 94105, USA.
From Digital Creation to Digital Curation
Managing Digital Cultural Heritage Resources
Maureen Pennock
Digital Curation Centre, UKOLN, University of Bath

Today's Talk
- Introductions
- The UK Digital Curation Centre
- Curation and the digital life-cycle
- Issues in developing and managing digital collections
- Helpful projects and initiatives
- Discussion

The UK Digital Curation Centre

Digital Curation
- Digital curation, broadly interpreted, is about maintaining and adding value to a trusted body of digital information for current and future use
- The active management and appraisal of data over the entire life-cycle

The DCC
- Launched in 2004
- Established to help solve the extensive challenges of digital preservation and curation, and to provide research, advice and support services to UK institutions
- Consortium project with 4 main partners
- 4 main teams distributed across the 4 UK locations
- Funded by JISC & the e-Science Core Programme

Organisation to Engage & Collaborate
[Diagram. Labels: Industry; research collaborators; standards bodies; testbeds & tools; communities of practice: users; community support & outreach; research; development; co-ordination; service definition & delivery; management & admin support; Collaborative Associates Network of Data Organisations; curation organisations, e.g. DPC]

DCC Outreach
- Raising Awareness and Dissemination
  - Website ( )
  - International Journal of Digital Curation
  - Annual International Conference
- Understanding Users and their Needs
  - Requirements gathering
  - Associates Network
  - DCC Forum

DCC Services
- Information Services
  - Community-developed Digital Curation Manual
  - Briefing Papers & FAQs
  - Technology Watch, Standards Watch, Legal Watch
  - Case Studies
  - Best Practice Checklists
- Advisory Services
  - Events: information days, workshops, training
  - Helpdesk
- Audit and Certification Services

DCC Research
- Annotation in Databases
- Data archiving
- Socio-economic and legal issues
- Metadata extraction and curation
- Ontologies and data dictionaries
- Provenance and databases
- Data transformation, integration and publishing
- Supporting technologies
- Networks of trusted digital repositories
- Organisational and cultural challenges to digital curation

DCC Development
- The DCC Approach to Digital Curation (white paper) sets out the path for development activities:
  - Monitoring international standards
  - Creating testbeds for digital curation tools
  - Development of recommendations for tools and methods for generating Representation Information
  - Development of a Representation Information Registry (DCC RIR)

Digital Curation and the Life-Cycle

Why a life-cycle approach?
- Curation is a life-cycle approach to the management and preservation of digital objects, necessary because:
  - Digital materials are fragile & susceptible to change from technological advances throughout their life-cycle
  - Each stage can impact on subsequent stages
  - Traditional management processes may need adapting for digital materials with different requirements
- The life-cycle approach enables continuity and provenance despite technological and organisational contextual change
- It maximises investments and potential

Life-Cycle model
- The digital object life-cycle model differs slightly depending on the context (e.g. libraries/archives/museums)
- This generic model addresses libraries

From Creation to Curation
- A life-cycle approach facilitates continuity and control over the different stages
- Each stage can impact on the following one:
  - Creation impacts on many stages, as the way a resource is created affects the way it can be curated and its sustainability
  - Creation is problematic in a digital heritage context, as you may not have control over the way resources are created

Issues in Developing and Managing Digital Collections

The Digital Library: Discuss
- What exactly is a digital library?
  - A library accessible over the internet? (but to what extent?)
  - A library with (only?) digital holdings?
  - A cutting-edge institution that maximises IT potential? (can be achieved multifariously)
  - An added-value service?
- Professional disparity over the definition (especially the difference between this and a digital archive)
- More than just a search engine and an access mechanism: more than just the Internet!

Potential digital library resources
- Digitised
  - Maps and Posters
  - Photographs
  - Original texts: books, manuscripts, newspapers, journals
  - Audio-visual material
  - Microfilm
- Born Digital
  - Maps and Posters
  - Photographs
  - E-Publications
  - Audio-visual material
  - Websites (which will invariably contain multimedia objects)
  - Cataloguing data?

Issues
- Range across the life-cycle
- Involve different stakeholders at each stage
- Communication essential
- Issue areas: Technical, Preservation, Organisational, Legal, Financial, Cultural

Technical issues (1)
- Harvesting & Accession
- Storage: which model to implement?
- Metadata: what metadata are needed?
- Security: protection from unauthorised or malicious access
- User access: what tools are needed?

Technical issues (2)
- Preservation
- Objects highly environmentally dependent
- Software/hardware changes many times during the lifetime of the records (every five years?)
- Content may be altered if action is undertaken
- Content will become inaccessible if action is not taken
- Preservation strategies & tools
- Fragility of storage media
- Media obsolescence
- File deterioration
- Hardware & software obsolescence
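One common way to notice file deterioration is a periodic fixity check: record a checksum for each stored object, then recompute and compare it later. The slides do not prescribe a method, so the following is only a minimal sketch under that assumption; the function names `record_fixity` and `changed_files` are hypothetical, and SHA-256 is an illustrative choice of digest.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 65536) -> str:
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def record_fixity(paths):
    """Build a manifest mapping each file name to its current digest (hypothetical helper)."""
    return {str(p): sha256_of(Path(p)) for p in paths}


def changed_files(manifest):
    """List files that are missing or whose digest no longer matches the manifest."""
    return sorted(
        name
        for name, expected in manifest.items()
        if not Path(name).exists() or sha256_of(Path(name)) != expected
    )
```

A check like this only detects change; it says nothing about whether the file format is still readable, which is the separate obsolescence problem listed above.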

Organisational and Cultural issues
- Organisational and cultural infrastructure not usually geared towards digital longevity
- Digital cultural heritage resources are often primarily recognised as resources for the here and now
- Here-and-now access practices ≠ longevity!
- Preservation issues not recognised/regarded
- Staffing: expansion of duties or new staff?
- Need for senior managerial support, e.g. policy, finances…

Financial issues
- Not just a one-off digitising or collecting cost
- Preservation activity can require ongoing financial commitment
- Who will pay, now and in the future?
- What are the cost benefits?
- Where's the business model?
- Will access be payment-restricted?

Legal issues
- Meeting legal obligations: data protection, copyright, database right…
- Who is responsible?
- Copyright particularly relevant, as copying can be a vital act in preservation and access
- Impact of DRM on copying abilities
- A new definition of copying needed?

Addressing the issues
- Follow progress in national initiatives
- Collaborate & communicate
- Engage the consumer
- Success requires commitment:
  - At a policy level (integrated)
  - At a managerial level (support/backing)
  - At a staffing level (actions/activities)

Strategy (1)
- A written policy and strategy to support activities and help secure resources
- Take a life-cycle approach to support curation and preservation planning
- If creating resources, provide good practice guidance for sustainability (e.g. when digitising or accepting digitised resources)
- Assess collection/selection criteria: are they still valid? Do they need expanding?
- Identify possible resources
  - Digital resources can complement & enhance physical ones
  - Be aware of externally produced digital resources (e.g. websites); check other heritage collections before gathering!

Strategy (2)
- Identify legal restraints in collection/management/access
- Can value be added to resources during acquisition?
- Store objects in a secure environment
- Plan for preservation activities to maintain access to authentic resources over time and avoid incurring extra costs
- Determine access and user requirements
- Implement integrated approach to collection accessibility
- Adapt and learn from national and other leading activities

Helpful projects and initiatives for preservation and accessibility

National Library of Scotland
- Developed several digital and web-accessible themed collections:
  - Propaganda: A weapon of war (posters/images)
  - Maps
  - First Scottish books
  - Robert Louis Stevenson (letters, sketches, photos)
  - Muriel Spark – the story
  - Churchill: The evidence (contains school resources)
- Trusted Digital Repository
- Part of the UK Web Archiving Consortium (UKWAC)
  - Selection and collection criteria for Scottish web sites
  - Archiving the UK General Election 2005

UK WAC
- UK Web Archiving Consortium (6 members): British Library, National Library of Scotland, National Library of Wales, The National Archives, Wellcome Library, JISC
- Collects Web content selectively
- Uses modified PANDAS collection/harvesting software developed by the National Library of Australia
- Underlying harvesting program is currently HTTrack
- Permission is sought from site owners in advance
- Persistent Identifier URLs
- Single partner assumes responsibility for each site
- Central repository of metadata
- The collections are publicly accessible
- Website:

Internet Archive
- Non-profit organisation, based in the U.S.
- Wants to offer permanent access to digital online materials of all types
- Founded in 1996, has been collecting since then … much content donated by Alexa Internet
- Collects sites by crawling and harvesting web sites
- Sites can 'opt out' by way of a robots.txt file on the web server
- Most content is freely available to the public, e.g. through the Wayback Machine
- Interface issues: only the URL indicates that the page is archived
- Website:
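The robots.txt opt-out can be illustrated with Python's standard `urllib.robotparser` module. This is a sketch only: the sample robots.txt content, the `example.org` URLs, the helper name `may_archive`, and the crawler name `ia_archiver` are assumptions made for the example, not details taken from the slides.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: one named crawler is excluded entirely,
# every other crawler is only kept out of /private/.
ROBOTS_TXT = """\
User-agent: ia_archiver
Disallow: /

User-agent: *
Disallow: /private/
"""


def may_archive(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given crawler is allowed to fetch the URL."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    print(may_archive(ROBOTS_TXT, "ia_archiver", "http://example.org/index.html"))
    print(may_archive(ROBOTS_TXT, "othercrawler", "http://example.org/index.html"))
```

Note the contrast with the UKWAC approach above: an opt-out file lets harvesting proceed by default, whereas UKWAC seeks permission from site owners in advance.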

IIPC (1)
- International Internet Preservation Consortium
- Builds co-operation between the Internet Archive and national and research libraries
- Co-ordinated by the Bibliothèque nationale de France
- The British Library is the only current UK member; other national library partners include the Library of Congress, the Library and Archives Canada and the national libraries of Australia, Denmark, Finland, Iceland, Italy, Norway and Sweden
- Reflects those with current experience of Web archiving
- Both working groups and tool development
- Phase II will enable new partners to join the consortium
- Website:

IIPC (2)*
- Phase I: developing the IIPC toolkit
- Standards and tools for supporting:
  - Acquisition: archival quality crawler (Heritrix); portable database extraction and migration tool for database-driven deep web sites (DeepARC)
  - Managing collections: analytical and prioritization tools for automatically focusing harvesting; curation tools to provide a non-technical interface for selecting, monitoring and verifying archived web sites
  - Collection storage and maintenance: tools for manipulating formats; a standardised storage format (WARC); standards for metadata
  - Access and finding aids: browse interfaces (WERA) and search facilities (NutchWAX)
* Michael Day, IWMW 2006

LOCKSS (1)
- Lots of Copies Keeps Stuff Safe (LOCKSS)
- "An easy and inexpensive way to collect, store, preserve, and provide access to their own, local copy of authorised content they purchase" (LOCKSS website)
- E-Journal collection and preservation system
- Open Source Software
- Runs on standard desktop hardware
- Requires very little technical administration

LOCKSS (2)
- Trial and pilot projects underway
- DCC support available through helpdesk and dedicated Advisory post
- Current trial suitable only for certain titles (due to licensing arrangements with publishers)
- Private networks can be developed:
  - Requires technical development
  - Minimum of six machines necessary to achieve desired redundancy
  - Suitable for, e.g., online course material
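The idea behind keeping lots of copies can be sketched as a comparison of digests across machines, with the majority deciding which version is taken to be intact. This is a deliberately simplified illustration and not the actual LOCKSS polling protocol; the function name `majority_digest`, the site labels, and the strict-majority rule are all assumptions made for the example.

```python
import hashlib
from collections import Counter
from typing import Dict, List, Tuple


def majority_digest(copies: Dict[str, bytes]) -> Tuple[str, List[str]]:
    """Return the digest held by a strict majority of copies and the sites that disagree.

    Raises ValueError when no digest is held by more than half of the copies.
    """
    digests = {site: hashlib.sha256(data).hexdigest() for site, data in copies.items()}
    winner, votes = Counter(digests.values()).most_common(1)[0]
    if votes * 2 <= len(copies):
        raise ValueError("no majority among copies")
    dissenters = sorted(site for site, d in digests.items() if d != winner)
    return winner, dissenters


if __name__ == "__main__":
    article = b"Volume 1, Issue 1"
    # Six hypothetical machines, one of which holds a damaged copy.
    copies = {f"site-{n}": article for n in range(1, 7)}
    copies["site-4"] = b"Volume 1, Issue 1 (damaged)"
    digest, damaged = majority_digest(copies)
    print(damaged)
```

With six copies, a single damaged machine is easily outvoted, which gives some intuition for why a minimum number of machines is needed before the redundancy is useful.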

Further resources
- National Library of Scotland
- National Library of Wales
- British Library
- DCC website
- UKOLN website
- SLAINTE website
- Digital Archives Regional Pilot (DARP) project
- Building and Sustaining Digital Collections, Abbey Smith

Thank You & Discussion
Maureen Pennock
Join the DCC Associates Network (it's free!)