Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal.

Slides:



Advertisements
Similar presentations
E-resources Collection Management Anna Grigson E-resources Manager.
Advertisements

Harvesting and archiving the Web Nordunet2000, Juha Hakala Helsinki University Library.
E-Content Hosting Platform Offered by Blackwells, powered by ebrary powered by ebrary &
UCL LIBRARY SERVICES Enhance the impact of your research with UCL Eprints Suzanne Tonkin Bartlett Library – Site Librarian UCL Eprints Project Officer.
NIH Public Access Compliance Cleveland Health Sciences Library Case Western Reserve University Kathleen C. Blazar.
Welcome to the New Library Website. The website has been revised to offer: Quick catalogue access to diverse resources and archives in the library Awareness.
Digital Library Services by Kodak i Center. Kodak i Centre - Sino Data Kodak i Centre Imaging expert Sino Data Library expert Bibliographic record creation.
Bibliothèque nationale de France Tallinn, BnF update: production and development priorities in 2015.
1 L U N D U N I V E R S I T Y a home grown, bespoke institutional Federated Search tool JIBS Conference at The John Rylands University Library,
BUILDING DIGITAL WEB ARCHIVES FOR FUTURE SCHOLARS Jani Stenvall
Chapter 2. Slide 1 CULTURAL SUBJECT GATEWAYS CULTURAL SUBJECT GATEWAYS Subject Gateways  Started as links of lists  Continued as Web directories  Culminated.
Digitisation projects and preserving digital documents in Hungary Current trends in digitisation DELOS, Turin, 3-4. febr István Moldován Hungary,
Bibliothèque de l’Université LavalFaculté des études supérieures Guy Teasdale Access 2003 Vancouver - October 4, 2003.
1 Archiving Workflow between a Local Repository and the National Library Archive Experiences from the DiVA Project Eva Müller, Peter Hansson, Uwe Klosa,
Jesper Klein The Swedish Library of Talking Books and Braille The Swedish talking book model
Technical Tips and Tricks for User Support Mike Gardner
WISER Social Sciences: Education Kate Williams and Judy Reading January 2007.
1 CS 502: Computing Methods for Digital Libraries Lecture 27 Preservation.
1 Minerva The Web Preservation Project. 2 Team Members Library of Congress Roger Adkins Cassy Ammen Allene Hayes Melissa Levine Diane Kresh Jane Mandelbaum.
Consortia Portal for Sharing Resources of Russian Libraries Alexander Plemnek, Natalia Sokolova St. Petersburg State Polytechnic University, St. Petersburg,
Developing PANDORA Mark Corbould Director, IT Business Systems.
Web Programming Language Dr. Ken Cosh Week 1 (Introduction)
Recent approaches to capture web content, which Heritrix can’t harvest  Capturing Social Media  Screen filming of Rich Media  Project: Event crawl of.
The capture and preservation of websites at the National Library of New Zealand Gillian Lee Alexander Turnbull Library.
1 Archive-It Training University of Maryland July 12, 2007.
Session 7 Selection of Online Resources and Options for Providing Access.
Danish Legal Deposit on the Internet: Current Solutions and Approaches for the Future ECDL, September 2001 by Birgit N. Henriksen Head of Digitization.
1 Introduction to Web Development. Web Basics The Web consists of computers on the Internet connected to each other in a specific way Used in all levels.
Danish Legal Deposit Experiences & the Need for Adjustments by Birgit N. Henriksen Head of Digitization and Web Department The Royal Library, Denmark.
Introduction to EndNote Web Margaret Forrest Academic Liaison Librarian.
Bibliography in the Digital Age - IFLA Satellite Meeting Warsaw, 9 August Online materials published in Austria collecting, archiving and metadata.
Svein Arne Brygfjeld National Library of Norway Nordic Web Archive.
From the National Strategy towards the evaluation of the National project "Croatian cultural heritage" Dunja Seiter-Šverko Head of Department for the Digitisation.
UPSpace An institutional research repository for the University of Pretoria Presented by Ina Smith to the School of Public Management and Administration.
1 Library Services. 2 Benefits of using the Library To find resources for your assignments and identify areas of interest To produce extra good papers.
Copyright © Allyn & Bacon 2008 POWER PRACTICE Chapter 7 The Internet and the World Wide Web START This multimedia product and its contents are protected.
DIGAR as the way and possibility to re-use the publications of public sector National Library of Estonia Kairi Felt Chief Specialist of E-Collections
THE SCIENCETHE SEARCHTHE SOLUTION New Publishing Paradigms and their impact on a not-for- profit organisation Shaun Hobbs Database.
Adobe FLASH What & Why? Where & When? Is Flash dead? What about HTML5?
Cataloguing Electronic resources Prepared by the Cataloguing Team at Charles Sturt University.
Depth customization of DSpace: Best practices and techniques of institutional repository at IIT Kanpur, India By S. K. Vijaianand V. D. Shrivastava Gaurav.
1 CS 502: Computing Methods for Digital Libraries Lecture 28 Current work in preservation.
1 Wawasan Open Library Library Orientation 21 January 2007.
Estonian Web and Bibliographic Control Janne Andresoo.
The Legislative Library of Ontario’s Ontario Documents Repository Road to Partnership.
ERIKA Eesti Ressursid Internetis Kataloogimine ja Arhiveerimine Estonian Resources in Internet, Indexing and Archiving.
The DiVA System: Current Status and Ongoing Development Uwe Klosa Electronic Publishing Centre, Uppsala University, Sweden Eva Müller.
NCSU Libraries Kristin Antelman NCSU Libraries June 24, 2006.
From here to perpetuity: challenges (and a few confessions) in preserving web-based AV content ASRA Conference 2011 Paul Koerbin Manager Web Archiving.
1 October 16, 2015 Iowa County Land Record Information System (CLRIS)
Digital Archiving in the Hungarian Széchényi Library The story and the plans of the Hungarian Electronic Library Rome, 21. Oct István Moldován OSZK,
Netarkivet RESAW seminar, Dec 2-3, 2013 Day 1. Who are we today □Birgit N. Henriksen, head of digital preservation, KB □Bjarne Andersen, head of digital.
Open access & visibility Management Digital Preservation ORA: Purposes.
Class 02 – 03 Feb 2014 Setup Where do we begin? Know your content Discovering your target user.
European Commission on Preservation and Access Preservation of digital heritage Yola de Lusenet Lisbon, November
National policy of the preservation of digital cultural heritage Estonian Legal Deposit Act and web resources Ülle Talihärm Head of Collection Development.
Examples for Open Access Scholar Electronic Repository by New Bulgarian University IP LibCMASS Sofia 2011 Contract № 2011-ERA-IP-7 Sofia, September,
Resource Description and Access (RDA) information session Deirdre Kiorgaard Australian Committee on Cataloguing Representative to the Joint Steering Committee.
Collecting History: Profiles in Science Alexa T. McCray National Library of Medicine Bethesda, MD Stanford University August 21, 1999.
Government Documents Made Easy? Or Just Easier?. 90% since % since 2000 Retrospective conversion projects Retrospective conversion projects Search.
Website Design, Development and Maintenance ONLY TAKE DOWN NOTES ON INDICATED SLIDES.
Libraries of Course: integrating library content and services into the e-learning environment. Brian Flaherty Digital Services Manager University of Auckland.
Electronic Theses and Dissertations: The bepress Approach Ben Hermalin Interim Dean, Haas School of Business, UC Berkeley & Co-Founder, bepress.
Strategies for archiving the Danish web space Bjarne Andersen Head of Digital Resources State and University Library, Aarhus
Digitalcommons.unl.edu Archiving Department Records.
Web Programming Language
Impact of the Alternative e-Publishing Model: From Open Access Resources & Self-Publishing toward Librarian’s New Challenges 溫達茂 飛資得資訊 中華民國九十三年十一月.
Workshop on Web Archiving
Library Web Portals: Reinventing Libraries for the Future
Introducing… Welcome to this introduction to Wiley Online Library.
Presentation transcript:

Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal Library, Denmark

Presentation outline Experiences with legal deposit of web materials in DK since 1998 Period with new projects, A new strategy in the future?

Denmark 5 million people in Northern Europe

The Danish Legal Deposit Law 1697: The first legal deposit law in Denmark 1902: All printed materials to be deposited 1997: All published works to be deposited

The law from 1997 covers any work published in Denmark regardless of medium “work”: a delimited quantity of information which must be considered a final and independent unit “published”: when … copies of the work have been placed on sale or otherwise distributed to the public

Types of Net Publications Static publications included (only periodically updated) monographs periodicals Dynamic publications excluded (continuously updated) Databases Homepages

Notification Who the person in charge of the technical completion of the digital copy How by filling out a form at the Danish legal deposit website: When as soon as the net publication is placed on the web. The Royal Library must then download it within three months

DC Registration Form - Monographs

Download - workflow The staff at the Danish Department : determine whether a publication is covered by the law if yes, download all files belonging to the work check downloaded work catalogue and classify the work in the OPAC (only periodicals) transfer work to archival server (server mirrored every night to State and University Library, Århus)

Cataloguing – Indexing Danish Bibliographic Centre makes MARC records of the part included in the National Bibliography Searches in OPAC supplemented or replaced with: access by searching directly in data provided by the publisher full text search in the archived material through a ‘web index’ – the same way you use the material when it is online on the net

Access to archived web material Theory - Restricted Access One PC in each legal deposit library placed in a reading room – free for all No possibility of making electronic copies from the archive, only paper print-outs Practice - No Access

Statistics January 2002 Subdomains within top level domain.dk: # subdomains in Denmark: 352,000 # subdomains in archive: ~ 1,000 Volume: # net publications : 10,522 # files :693,309 # Gbytes: 23 Content: 1/3 monographs, 2/3 periodical issues 2/3 public publishers, 1/3 private publishers

Staff resources Man YearsPaid hours/ publication Comment s 19982,312,75 System being developed and set up 19991,91,2 Downloading, cataloguing and classifying all publications ,30,6 Downloading, cataloguing and classifying all periodicals

MimeType Statistics – 2001 % of collected files Selective collection, Denmark Bulk collection, Sweden TEXT/ HTML 59,3 %55,6 % Image (GIF, JPEG, PNG) 37,9 %40,0 % PDF1,7 %1,0 % Other formats 1,1 %3,4 %

Problems related to harvesting Errors or inconsistencies in the published files Java applets and java scripts – no solution at the moment Data protected with username/password logins is covered by the law but more difficult to download Can't harvest more sophisticated formats Can't harvest interactive processes

Summary Selective web archiving based on notification covered by law and practiced since 1998 Only static publications Doesn’t get everything covered by the law Doesn’t get a representative part of the net Doesn’t get the most advanced part Labour intensive Cataloguing partly replaced by alternatives Very restricted access

Web Archiving Conference, CPH June 2001 Focus: User Expectations for web archiving Input from scholars & scientists: Archive the dynamic part of the web Focus on archiving the content the context the evidence of use Archivists: Use different archiving approaches Find new methods for archiving interactive material Budgets for making snapshots and making selective collections are comparable

Why harvesting ? Possible to get a representative part of the net Private and public publishers Material about Danes as well as material that interests the Danes Get new trends as soon as they appear The easiest way to get (all) updated versions quick and easy Accumulative harvesting of news and media

Why not only harvesting? Programmes and plug-ins difficult to handle Harvesting is not always possible (e.g. streamed and web casted material, flash applications, chat …) Harvesting may not give a useful result technical problems (java, interactive sites like net art, games, auctions …) personalised sites services (search engines, route planners, home banking, e-commerce…)

Birte Christensen-Dalsgaard, SB: Archive Experience, not Data

Three Danish web archiving initiatives Legal deposit based on selective approach since 1998, Nordic Web Archive (Nordic project , access to web archives) netarchive.dk (Danish project, multiple archiving strategies, )

netarchive.dk Project testing different archival approaches and the subsequent usability of the archived material for research Project partners: State and University Library, Aarhus Centre for Internet Research, Aarhus University The Royal Library, Copenhagen Economic support from the Danish Electronic Research Library (DEF) Period: August 2001 – July 2002 Case: Danish municipal elections November 2001

netarchive.dk Interactivity StaticDynamic Real time dialog Published, static Signal lifetime Different archival approaches Chatter botsChatWeb conference Report Web forms Searching OPAC Net auctions Net art

Accumulative Snapshot netarchive.dk Interactivity Static Dynamic Real time dialog Published, static Signal lifetime Process Different archival approaches

netarchive.dk - Experiences Experiences with event based harvesting New materials: web sites, discussion groups, portals and chat Hard to find the relevant, new URL’s during the event Experiences with contracts/agreements Only 3 of 44 contracts have been signed Knowledge of agreements do not spread out sufficiently in a top-down organisation Agreements must cover how to harvest (Technical issues) how to give access to harvested (Copyright issues) Experiences with different harvesters Browsers more robust to errors on sites than harvesters, and they interpret programme objects like java scripts

netarchive.dk Process rather than data Make a film of the process ‘container’ with known preservation strategy Accept loss of all functionality ‘Filming’ through a browser Catch chronological series of displayed WebPages Tools to take into consideration: Business intelligence tools Tools used in usability laboratories

New Strategy Proposal Archiving dynamic material must be legal Selective approach replaced/supplemented with bulk collections done by robot harvesting Retain possibility for delivery ‘Filming’ parts of the net Access less restrictive

END