Presentation is loading. Please wait.

Presentation is loading. Please wait.

Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal.

Similar presentations


Presentation on theme: "Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal."— Presentation transcript:

1 Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal Library, Denmark bnh@kb.dk

2 Presentation outline Experiences with legal deposit of web materials in DK since 1998 Period with new projects, 2000- 2002 A new strategy in the future?

3 Denmark 5 million people in Northern Europe

4 The Danish Legal Deposit Law 1697: The first legal deposit law in Denmark 1902: All printed materials to be deposited 1997: All published works to be deposited

5 The law from 1997 covers any work published in Denmark regardless of medium “work”: a delimited quantity of information which must be considered a final and independent unit “published”: when … copies of the work have been placed on sale or otherwise distributed to the public

6 Types of Net Publications Static publications included (only periodically updated) monographs periodicals Dynamic publications excluded (continuously updated) Databases Homepages

7 Notification Who the person in charge of the technical completion of the digital copy How by filling out a form at the Danish legal deposit website: http://www.pligtaflevering.dk http://www.pligtaflevering.dk When as soon as the net publication is placed on the web. The Royal Library must then download it within three months

8 DC Registration Form - Monographs

9 Download - workflow The staff at the Danish Department : determine whether a publication is covered by the law if yes, download all files belonging to the work check downloaded work catalogue and classify the work in the OPAC (only periodicals) transfer work to archival server (server mirrored every night to State and University Library, Århus)

10 Cataloguing – Indexing Danish Bibliographic Centre makes MARC records of the part included in the National Bibliography Searches in OPAC supplemented or replaced with: access by searching directly in data provided by the publisher full text search in the archived material through a ‘web index’ – the same way you use the material when it is online on the net

11 Access to archived web material Theory - Restricted Access One PC in each legal deposit library placed in a reading room – free for all No possibility of making electronic copies from the archive, only paper print-outs Practice - No Access

12 Statistics January 2002 Subdomains within top level domain.dk: # subdomains in Denmark: 352,000 # subdomains in archive: ~ 1,000 Volume: # net publications : 10,522 # files :693,309 # Gbytes: 23 Content: 1/3 monographs, 2/3 periodical issues 2/3 public publishers, 1/3 private publishers

13 Staff resources Man YearsPaid hours/ publication Comment s 19982,312,75 System being developed and set up 19991,91,2 Downloading, cataloguing and classifying all publications 2000-1,30,6 Downloading, cataloguing and classifying all periodicals

14 MimeType Statistics – 2001 % of collected files Selective collection, Denmark Bulk collection, Sweden TEXT/ HTML 59,3 %55,6 % Image (GIF, JPEG, PNG) 37,9 %40,0 % PDF1,7 %1,0 % Other formats 1,1 %3,4 %

15 Problems related to harvesting Errors or inconsistencies in the published files Java applets and java scripts – no solution at the moment Data protected with username/password logins is covered by the law but more difficult to download Can't harvest more sophisticated formats Can't harvest interactive processes

16 Summary Selective web archiving based on notification covered by law and practiced since 1998 Only static publications Doesn’t get everything covered by the law Doesn’t get a representative part of the net Doesn’t get the most advanced part Labour intensive Cataloguing partly replaced by alternatives Very restricted access

17 Web Archiving Conference, CPH June 2001 Focus: User Expectations for web archiving Input from scholars & scientists: Archive the dynamic part of the web Focus on archiving the content the context the evidence of use Archivists: Use different archiving approaches Find new methods for archiving interactive material Budgets for making snapshots and making selective collections are comparable

18 Why harvesting ? Possible to get a representative part of the net Private and public publishers Material about Danes as well as material that interests the Danes Get new trends as soon as they appear The easiest way to get (all) updated versions quick and easy Accumulative harvesting of news and media

19 Why not only harvesting? Programmes and plug-ins difficult to handle Harvesting is not always possible (e.g. streamed and web casted material, flash applications, chat …) Harvesting may not give a useful result technical problems (java, interactive sites like net art, games, auctions …) personalised sites services (search engines, route planners, home banking, e-commerce…)

20 Birte Christensen-Dalsgaard, SB: Archive Experience, not Data

21 Three Danish web archiving initiatives Legal deposit based on selective approach since 1998, http://www.pligtaflevering.dk http://www.pligtaflevering.dk Nordic Web Archive (Nordic project 2000-2002, access to web archives) http://nwa.nb.no http://nwa.nb.no netarchive.dk (Danish project, multiple archiving strategies, 2001-2002) http://netarchive.dk http://netarchive.dk

22 netarchive.dk Project testing different archival approaches and the subsequent usability of the archived material for research Project partners: State and University Library, Aarhus Centre for Internet Research, Aarhus University The Royal Library, Copenhagen Economic support from the Danish Electronic Research Library (DEF) Period: August 2001 – July 2002 Case: Danish municipal elections November 2001

23 netarchive.dk Interactivity StaticDynamic Real time dialog Published, static Signal lifetime Different archival approaches Chatter botsChatWeb conference Report Web forms Searching OPAC Net auctions Net art

24 Accumulative Snapshot netarchive.dk Interactivity Static Dynamic Real time dialog Published, static Signal lifetime Process Different archival approaches

25 netarchive.dk - Experiences Experiences with event based harvesting New materials: web sites, discussion groups, portals and chat Hard to find the relevant, new URL’s during the event Experiences with contracts/agreements Only 3 of 44 contracts have been signed Knowledge of agreements do not spread out sufficiently in a top-down organisation Agreements must cover how to harvest (Technical issues) how to give access to harvested (Copyright issues) Experiences with different harvesters Browsers more robust to errors on sites than harvesters, and they interpret programme objects like java scripts

26 netarchive.dk Process rather than data Make a film of the process ‘container’ with known preservation strategy Accept loss of all functionality ‘Filming’ through a browser Catch chronological series of displayed WebPages Tools to take into consideration: Business intelligence tools Tools used in usability laboratories

27 New Strategy Proposal Archiving dynamic material must be legal Selective approach replaced/supplemented with bulk collections done by robot harvesting Retain possibility for delivery ‘Filming’ parts of the net Access less restrictive

28 END


Download ppt "Danish Legal Deposit on the Internet National Diet Library, Tokyo, January 2002 by Birgit N. Henriksen Head of Digitization and Web Department The Royal."

Similar presentations


Ads by Google