1 WEB ARCHIVING IN THE BRITISH LIBRARY John Tuck Head of British Collections February 2004.

Presentation transcript:

1 WEB ARCHIVING IN THE BRITISH LIBRARY John Tuck Head of British Collections February 2004

2 BRITISH LIBRARY: CONTEXT
- Created by British Library Act
- National Library of the United Kingdom.
- Origins from
- One of world’s greatest research libraries.
- 160 million collection items.

3 BRITISH LIBRARY: COLLECTION DEVELOPMENT
- Building as completely as possible the UK national published archive: current and retrospective gap filling; print and electronic.
- Collecting research-level English-language material published world-wide in the humanities, social sciences, STM.
- Buying foreign-language material selectively.
- Material acquired through: legal deposit, voluntary deposit from publishers, purchase, donation, exchange.

4 LEGISLATION
- Legal Deposit Libraries Act 2003: enabling legislation.
- VDEP: Voluntary Deposit of Electronic Publications.

5 DOMAIN.UK
Six-month experiment to:
- select and capture 100 UK web-sites,
- audit change, loss, links, etc.,
- determine next steps.

6 DOMAIN.UK: Why?
- Short-lived nature/changing content of many web-sites.
- Loss of information.
- Increasing reference to web-sites in research/scholarship.

7 DOMAIN.UK: Voluntary/Rights Cleared Approach
- Voluntary.
- Requiring explicit agreement of website publishers to take part in pilot.
- No public access.

8 DOMAIN.UK: Selection
- Websites of historical or cultural significance.
- Cross-section of Dewey Decimal Classification.

9 DOMAIN.UK: Process
- selected sites for approval and to check whether already archived.
- Measure sites for links, size, change, etc.
- Frequency of visits: every three weeks or more in some cases.
- Supported by those sites approached.
- Report recommended scaling up.
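
The "measure sites for links, size, change" step can be illustrated with a minimal Python sketch. This is not the tooling used in the Domain.uk pilot, which the slides do not describe; the names `LinkCounter`, `measure_snapshot` and `has_changed` are hypothetical, and treating any byte-level difference as "change" is an assumption made for simplicity.

```python
import hashlib
from html.parser import HTMLParser


class LinkCounter(HTMLParser):
    """Collects the href value of every <a> tag in an HTML document."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def measure_snapshot(html: str) -> dict:
    """Return size, link count and a content digest for one captured page."""
    parser = LinkCounter()
    parser.feed(html)
    data = html.encode("utf-8")
    return {
        "size_bytes": len(data),
        "link_count": len(parser.links),
        # Assumption: a differing SHA-256 digest is taken to mean "changed".
        "digest": hashlib.sha256(data).hexdigest(),
    }


def has_changed(earlier: dict, later: dict) -> bool:
    """Compare two snapshots of the same page taken on different visits."""
    return earlier["digest"] != later["digest"]


# Two hypothetical captures of the same page, a few weeks apart.
first = measure_snapshot('<html><body><a href="/a">A</a></body></html>')
second = measure_snapshot(
    '<html><body><a href="/a">A</a><a href="/b">B</a></body></html>'
)
```

Comparing `first` and `second` with `has_changed` flags the page as changed and shows the link count rising from one to two. A real audit would also need to ignore trivial differences such as timestamps or rotating banners.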

10 BRITISH LIBRARY WEB ARCHIVING PROGRAMME
- Building on Domain.uk.
- BL to play leading role in collecting UK web presence in partnership with other institutions nationally and internationally.
- Selective approach.

11 BRITISH LIBRARY WEB ARCHIVING PROGRAMME contd.
- Co-ordinate a snapshot of entire UK web presence at occasional intervals.
- Achieve more regular capture of limited and well-defined range of sites.
- Sites judged to be research-level, whether in terms of stated intentions of sites themselves or of potential to be primary resources for research.

12 WEB ARCHIVING PROGRAMME
- Comprises a series of complementary projects and activities.
- Operates entirely on a voluntary, rights-cleared basis pending secondary legal deposit legislation.
- Aims to embed web archiving within the BL's overall collection development policy.
- Aims to provide the infrastructure to collect, preserve and make accessible web-site material alongside material in other formats.

13 WEB ARCHIVING PROGRAMME STRANDS
Four main strands:
- Definition of collection development policy.
- UK Web Archiving Consortium.
- International Internet Preservation Consortium.
- Internet Archive: incunabula of the internet.

14 COLLECTION DEVELOPMENT
- Appointment of Curator, Web Archiving.
- Extension of policy defined for Domain.uk.
- Sites of national, historical and cultural significance.
- Research level now/in the future.

15 UK WEB ARCHIVING CONSORTIUM
- Two-year project.
- Six partners: BL (lead); National Library of Scotland, National Library of Wales, National Archives, Joint Information Systems Committee, Wellcome Library.
- Plan to use PANDAS software developed by National Library of Australia.
- Rights to use individual sites to be cleared with rights-holders.

16 UK WEB ARCHIVING CONSORTIUM contd.
- Procurement exercise in progress to recruit supplier to host service.
- Intention to let contract in April 2004 and to be operational in summer
- Sites to be made accessible to users.
- Each partner to collect up to 500 sites per year, i.e. 6,000 during project.
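
The 6,000 figure follows from the numbers on the consortium slides: six partners, up to 500 sites each per year, over a two-year project. A short Python check, with hypothetical variable and function names, makes the arithmetic explicit:

```python
# Figures taken from the consortium slides; names are illustrative only.
PARTNERS = 6                       # BL plus five partner institutions
SITES_PER_PARTNER_PER_YEAR = 500   # stated upper limit per partner
PROJECT_YEARS = 2                  # "Two-year project"


def project_site_ceiling(partners: int, per_year: int, years: int) -> int:
    """Maximum number of sites collected if every partner reaches its limit."""
    return partners * per_year * years


total = project_site_ceiling(PARTNERS, SITES_PER_PARTNER_PER_YEAR, PROJECT_YEARS)
```

The result is a ceiling, since each partner collects "up to" 500 sites per year.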

17 INTERNATIONAL INTERNET PRESERVATION CONSORTIUM
- Project involving national libraries.
- Led by Bibliothèque nationale de France.
- Also includes BL, Library of Congress, Library and Archives of Canada, Nordic countries, Italy, Australia, Internet Archive.

18 INTERNATIONAL INTERNET PRESERVATION CONSORTIUM contd.
- Aims to develop automated web-crawler mechanism.
- Open-source tools to search web at regular intervals matching agreed collection development policies.
- Working groups in: access tools, content management, deep web, framework, metrics and test-beds, researcher requirements.
- Developmental at this stage.

19 INTERNET ARCHIVE
- Collecting and saving sites since
- Wayback Machine.
- Legal, technical and procurement issues.

20 SOME CHALLENGES
- Defining UK.
- Rapid technology change.
- Third party rights (not always subject to UK law).
- Libel/defamation issues.
- Software issues / which platform?
- Validity of a snapshot.

21 SOME CHALLENGES contd.
- Formats for archiving.
- Metadata standards.
- Archiving ‘look and feel’.
- Authenticity.