1 Collection Development and Web Publications at the British Library John Tuck Head of British Collections Digital Memory, Session 2, Tallinn 24 th November.

Slides:



Advertisements
Similar presentations
Recent developments in digital archiving and preservation Jan Fullerton Director General National Library of Australia.
Advertisements

August 2005IFLA - CDNL1 The International Internet Preservation Consortium (IIPC)
Opening access and closing the risk: delivering the mandate for e-theses deposit 10 th International Symposium on Electronic Theses and Dissertations Uppsala.
A survey of Web preservation initiatives Michael Day UKOLN, University of Bath 7 th European Conference on Research and Advanced Technology.
JISC/BL Workshop Digital Libraries and their services March 6, 2006 Richard Boulderstone Director eStrategy, The British Library.
A centre of expertise in data curation and preservation London :: ARK Group Workshop: Archiving the Web :: 28 Sept 2006 Funded by: This work is licensed.
A centre of expertise in data curation and preservation SoA Annual Conference::York::August 2008 Funded by: This work is licensed under the Creative Commons.
Libraries for Future Generations Martha Anderson Director National Digital Information Infrastructure and Preservation Program The Library of Congress.
Near East Plant Protection Network for Regional Cooperation & Knowledge Sharing Food and Agriculture Organization of the United Nations An Overview on.
Providing collections, tools and services for digital humanities A national library perspective Clément Oury Head of Digital Legal Deposit Bibliothèque.
Bibliothèque nationale de France Tallinn, BnF update: production and development priorities in 2015.
HORIZON 2020 The EU Framework Programme for Research and Innovation Europe in a changing world – inclusive, innovative and reflective Societies Albert.
14 mai 2007Evolution of Scientific Publications, Colloque de l'Académie des sciences1 Preservation of electronic publications mission Catherine Lupovici.
BUILDING DIGITAL WEB ARCHIVES FOR FUTURE SCHOLARS Jani Stenvall
Supporting education and research E-learning tools, standards and systems Sarah Porter Head of Development, JISC.
DAEDALUS: Facing the Challenges of eTheses at Glasgow William J Nixon Project Manager: Service Development (DAEDALUS) ETD Berlin, May 2003.
APSR Forum on Long-Term Repositories National Library of Australia, 31 August – 1 September, Trust and the Web: Can the audit criteria apply to.
Web archiving at the NLA ‘ Archiving the music web’ Music Council of Australia Annual Assembly 28 September 2009 Paul Koerbin Manager Digital Archiving.
1 Archiving and Preserving the Web Kristine Hanna Internet Archive April 2006.
Integration of Regulatory Impact Assessment into the decision making process in the Czech Republic Aleš Pecka Department of Regulatory Reform and Public.
1 The British Library: Legal Deposit Libraries Act 2003 and Extension of Legal Deposit to Non-Print John Tuck Head of British Collections Digital memory,
Elizabeth Newbold and Samantha Tillett GL8 New Orleans, December 2006
1 WEB ARCHIVING IN THE BRITISH LIBRARY John Tuck Head of British Collections February 2004.
NATIONAL MEMORY AND DIGITAL DELIVERY PROGRESS WITH LEGAL DEPOSIT OF ELECTRONIC PUBLICATIONS IN THE UNITED KINGDOM Graeme Forbes National Library of Scotland.
Supporting further and higher education Digital Preservation: Legal Issues Chinese National Academy of Sciences July04 Neil Beagrie, BL/JISC Partnership.
The capture and preservation of websites at the National Library of New Zealand Gillian Lee Alexander Turnbull Library.
Bibliography in the Digital Age - IFLA Satellite Meeting Warsaw, 9 August Online materials published in Austria collecting, archiving and metadata.
Joanne Archer University of Maryland Kate Odell Archive-It Abbie Grotke Library of Congress Tessa Fallon Columbia University Creating and Maintaining Web.
1 Archiving and Preserving the Web Dan Avery Kristine Hanna Merrilee Proffitt Internet Archive RLG April 2006.
Chinese-European Workshop on Digital Preservation, Beijing July 14 – Network of Expertise in Digital Preservation 1 Trusted Digital Repositories,
How to Face the Challenges of Web Archiving? The experiences of a small library on the edge. Chloe Martin, Internet Memory Catherine Ryan, National Library.
Creating Access to Europe’s Television Heritage Prof. Dr. Sonja de Leeuw (project-coordinator, Utrecht University) Johan Oomen MA (technical director,
The Digital Object Management Programme (DOM) Richard Masters, Programme Manager PRESERV Partners Meeting 18 th November
Web Capture team Office of strategic initiatives February 27, 2006 Selecting Content from the Web: Challenges and Experiences of the Library of Congress.
Ymchwil Research Ymchwil Research RESAW Ioan Isaac-Richards Ingest Processes Manager Head of Web Archiving
Challenges & opportunities in the preservation of (digital) information: the case of European research libraries Museo de las Ciencias Teatro de UNIVERSUM.
UK LOCKSS Alliance: Content Development Adam Rusbridge EDINA, University of Edinburgh 10 th May 2011.
Wellcome Library & JISC Web Archiving Project Presented by Michael Day, UKOLN, University of Bath [Author of the Web Archiving feasibility study] Digital.
CNI Fall Task Force, December 2007 International Internet Preservation Consortium Abbie Grotke IIPC Communications Officer Library of Congress & George.
The Legislative Library of Ontario’s Ontario Documents Repository Road to Partnership.
Preserving Digital Culture: Tools & Strategies for Building Web Archives : Tools and Strategies for Building Web Archives Internet Librarian 2009 Tracy.
IFAP Special Event: Information and Knowledge for All, Emerging Trends and Challenges Information Preservation 4000 Years of Traditions Challenged by Digital.
Web Archiving at the National Library of Australia Russell Latham Senior Web Archivist, National Library of Australia.
The Library of Congress Martha Anderson Program Officer, NDIIPP Office of Strategic Initiatives Library of Congress April 2005 LC Perspective : Preservation.
UKOLN is supported by: Iniciativas de preservación de la Web: una visión actual Michael Day Digital Curation Centre, UKOLN, University of Bath, UK
European Commission on Preservation and Access Preservation of digital heritage Yola de Lusenet Lisbon, November
Formalizing Project EMANI Ithaca, July 26 th, 2002.
Collaboration Between Publishers and The British Library UKSG – Spring 2003 Natalie Ceeney Director of Operations and Services The British Library.
The KB e-Depot long-term preservation of scientific publications in practice Marcel Ras, National library of The Netherlands.
Digital Preservation across the technologies, strategies, open standards & interoperability aspects including the legal issues Pratik Shrivastava Scientist.
April 12, 2005 WHAT DOES IT MEAN TO BE AN ARCHIVES? Trusted Digital Repository Model Original Presentation by Bruce Ambacher Extended by Don Sawyer 12.
NDSR Boston webinar: Digital Preservation Introduction Presenter: Nancy Y McGovern October 2015.
1 BCS, Oxfordshire, 19 February, 2004 WEB ARCHIVING issues and challenges Deborah Woodyard Digital Preservation Coordinator.
Preservation metadata and the Cedars project Michael Day UKOLN: UK Office for Library and Information Networking University of Bath
Libraries of Course: integrating library content and services into the e-learning environment. Brian Flaherty Digital Services Manager University of Auckland.
New Opportunities Fund Preservation Workshop March 15th 2002 Maggie Jones Cedars Project Manager.
INSME – International Network for SMEs Tailored Services 1 st Meeting of the INSME Promoting Committee Milan, 7 th – 8 th July 2003.
Chang, Wen-Hsi Division Director National Archives Administration, 2011/3/18/16:15-17: TELDAP International Conference.
UNDERSTANDING INFORMATION MANAGEMENT (IM) WITHIN THE FEDERAL GOVERNMENT.
Reforms in the Albanian Public Procurement System 7 th Regional Public Procurement Forum Tbilisi, Georgia May 16-19, 2011 PUBLIC PROCUREMENT AGENCY 1.
Archiving the UK Web Helen Hockx-Yu Web Archiving Programme Manager British Library 13 October 2008.
Towards a European Shared Environmental Information System in Support of Environmental Policies: INSPIRE: an Inspired revolution for a knowledge-based.
IPR and the EThOS Project 28 th October 2008 Dr. Susan Copeland Senior Information Adviser (Research)
Use cases for BnF broad crawls Annick Lorthios. 2 Step by step, the first in-house broad crawl The 2010 broad crawl has been performed in-house at the.
Auditing Sustainable Development Goals
Creating The Oregon State Electronic Documents Repository
László Drótos – Márton Németh National Széchényi Library Department of Electronic Library Services Web archiving Planning a new pilot project.
Legal Deposit & UK Publishing
Scanning the environment: The global perspective on the integration of non-traditional data sources, administrative data and geospatial information Sub-regional.
Standard for Teachers’ Professional Development July 2016
Presentation transcript:

1 Collection Development and Web Publications at the British Library John Tuck Head of British Collections Digital Memory, Session 2, Tallinn 24 th November 2005

2 British Library Web Archiving Programme Three strands to the Programme an underpinning collection development policy UK collaborative approach through the UK Web Archiving Consortium (UKWAC) International collaboration through the International Internet Preservation Consortium (IIPC)

3 British Library Web Archiving Programme: Why? Short life-span Looking ahead to extension of legal deposit to non-print Pilot project Domain.uk as proof of concept

4 British Library Web Archiving Programme: Resources Team is divided across Scholarship and Collections and IT directorates Web Archiving Programme Manager Curator, Web Archiving whose responsibility includes definition of the collection development policy Other posts focussing on technical aspects/developments and on permissions, rights clearance and administration

5 Collection Development Policy Web Archiving High Level Collection Development Policy Given the huge scale and dynamic nature of the web (estimated at approx 5 million UK-based web sites) the British Library does not consider it practicable nor affordable to aim at truly comprehensive coverage of the UK web presence. The Library’s strategy is based on: a) taking a complete snapshot of the entire UK web presence at regular intervals (possibly annually or twice a year) b) achieving a more intensive and selective harvesting of a limited number and well-defined range of sites, building up over time to perhaps 10,000. These would be sites judged to be of research value now and in the future, reflecting the national and cultural heritage, and including a number of sites which are exemplars of web innovation. Also included is an events-based, thematic collection strand

6 Web Archiving Collection Development Progress Through its Curator, Web Archiving, the British Library has defined a more detailed development policy statement for UK web sites (See Framework of curators within the British Library to assist the Curator, web archiving. Work also carried out with partners, e.g. within the UKWAC consortium Longer-term aim is to consider web-sites as just another format to collect within an overall collection development policy

7 UK Web Archiving Consortium (UKWAC) Officially launched in June 2004 Comprises six institutions: British Library (lead partner), Joint Information Systems Committee, National Library of Scotland, National Library of Wales, the National Archives, and the Wellcome Trust Two-year pilot project with aims of putting in place common framework, common approaches to rights- cleared web archiving, and to put in place an archive of websites (see To date has archived over 700 sites. British Library input has been 700 instances of 282 sites The first successful selective archive of UK web space which imposes no charge for including material or for access. Based on the National Library of Australia’s web archiving application `PANDORA’

8 UKWAC: Permissions-based approach (1) From outset it has been the intention to seek explicit rights clearance from website owners, pending secondary legislation for the deposit of UK websites Common licence/template devised by UKWAC Sites only mounted once explicit permission has been agreed Some exceptions in case of events-based collection, e.g. Asian tsunami, UK general election 2005, and London bombings, July 2005: notice and takedown policy put in place

9 UKWAC: Permissions-based approach (2) British Library has sent out more than 1,500 permission requests; has received only approximately 400 positive replies. 25% success rate. Very few outright rejections (10) but many queries (200) and no replies Not sustainable: impact both on collection size but also collection balance Secondary legislation through the Legal Deposit Libraries Act will address this. May be the case that web sites will be brought up the agenda with a swifter schedule for implementation than originally thought

10 International Internet Preservation Consortium (IIPC) Website: Mission:  To acquire, preserve and make accessible knowledge and information from the Internet for future generations everywhere, promoting global exchange and international relations Goals:  To enable the collection of a rich body of Internet content from around the world to be preserved in a way that it can be archived, secured and accessed over time  To foster the development and use of common tools, techniques and standards that enable the creation of international archives  To encourage and support national libraries everywhere to address Internet archiving and preservation

11 Working with IIPC Aim of IIPC is to put in place a range of tools and common standards for those tasked with web archiving We see IIPC and developing tools, standards as the means of achieving a whole domain crawl of the UK Recently took part in a smart crawler project and procurement with the Bibliotheque nationale de France to put in place a prototype to enable large scale web archiving; automatically locating content, frequency of capture and thematic linking. Complexities of the technology have led to a new approach now to involve British Library, BnF and Library of Congress National Library of New Zealand, Library of Congress and British Library also to work on improved curator tools to facilitate interface and work of curators dealing with websites

12 e-Content: Future Collection Development for other e-Formats Offline digital publications  The British Library will seek to collect offline resources (e.g. CD-ROMs, Disks, DVDs [not films]) comprehensively to the level of approximately 80% - 90% of estimated published output. Collection will be within a scope generally defined as appropriate for current research or research in the future

13 e-Content: Future Collection Development for e-Formats Online e-journals  The British Library will seek to collect e-journals with a UK imprint comprehensively to the level of approximately 80% of published output, and within a scope generally defined as appropriate for current research or for research in the future. The 20% of material not collected will reflect out of scope material considered to be of non-research level together with a small element of inevitable non compliance

14 e-Content: Future Collection Development for e-Formats Online e-books  The same collection criteria as for e-journals apply to e-books but we believe that the build-up to 80% will be slower than for e-journals as to a large degree e-books currently replicate printed materials and very few are at research level. E-books are not prioritised by the legal deposit libraries in the UK as an area of early Regulation under the Legal Deposit Libraries Act 2003

15 e-Content: Future Collection Development for e-Formats Databases  In the case of databases, many may not be defined as publications under the Act and thus would not be eligible for legal deposit. For formally published databases, the British Library will seek to acquire comprehensively and within the same scope and proportions as for e-journals. Note is taken, however, of the dynamic and ephemeral nature of databases and the technical challenges they will present. From the perspective of the national published archive, databases can probably only meaningfully be collected on a snapshot or last edition basis. At present online databases are being accorded a lowish priority for the Library from the perspective of both voluntary and statutory deposit. Many are more likely to be relevant to the web archiving programme

16 Voluntary Deposit of Electronic Publications: Practice Handheld (CD-ROMs etc): declining in number; delivered to our Legal Deposit Office; processed as other physical materials. Fully catalogued and accessible in reading rooms On-line materials received through voluntary deposit: new set of procedures, workflows put in place; clear collection development policy defined enabling selection; multitude of file extensions; on-line material stored as s in first instance, then burned on DVD for storage (using Ex Libraries Digitool) Long-term objective is incorporation in Digital Object Management Programme, as part of overall digital preservation strategy