1 CS 502: Computing Methods for Digital Libraries Lecture 28 Current work in preservation.

Presentation transcript:

1 CS 502: Computing Methods for Digital Libraries Lecture 28 Current work in preservation

2 Administration
Review class: Tuesday, 12:20. Room to be announced on the web site under "Notices". Format: questions (by you) and answers (by me).
Laptops: Return before the examination. Bring the receipt to the examination.
Examination:
  Part 1: 5 questions, 1.5-hour time limit
  Part 2: nomad experiment questionnaire, no time limit

3 Education and research
Digital libraries are in a state of flux:
  Much of this class has described material that is still experimental.
  Cornell people and our colleagues are actively involved in many aspects.
This class:
  Recent activities in preservation of materials on the web
  Some of my recent work

4 Some light reading
William Y. Arms, "Preservation of scientific serials: three current examples." Journal of Electronic Publishing, 5(2), December
William Y. Arms, "Economic models for open-access publishing." iMP, March

5 Preservation of serials
September Workshop chaired by Deanna Marcum, Don Waters, Cliff Lynch
Issues in preserving online journals for 100 years
Invited paper by William Arms: "Preservation of Scientific Serials: Three Current Examples"
  ACM Digital Library
  Internet RFC Series
  D-Lib Magazine
Motivated by the realization that early preservation work may be tackling the wrong problem

6 Publisher's role in preservation
Life cycle of an electronic publication:
  1. Active management by publisher
  2. Long-term preservation by another organization
Overall observations:
  The length of #1 may be very short or hundreds of years.
  The most vulnerable time is the transition between #1 and #2.
  Preservation discussions have emphasized #2 (e.g., 5 level model).

7 ACM Digital Library
Organizational: ACM is a stable organization that considers the Digital Library one of its principal assets.
Rights: ACM either owns copyright or has full preservation rights.
Technical: Complex: relational database (schema), SGML (DTD), rendering software, private metadata system. Strong computing department.
Replication: No independent mirrors.

8 Internet RFC Series
Organizational: Complex relationship between the Internet Society (ISOC), the Internet Engineering Task Force (IETF) and the RFC editor. Currently actively managed, but no long-term commitment. Secretariat & RFC editor -- income from meetings & grants.
Rights: ISOC and IETF have very broad rights.
Technical: Simple: text only (a few PostScript).
Replication: Several independent mirrors.

9 D-Lib Magazine
Organizational: Published by CNRI, reliant on grants.
Rights: Authors own rights in articles. CNRI owns rights in other materials.
Technical: Simple: uses basic web technology. Used for experiments in DOIs, XML metadata, etc.
Replication: Several independent mirrors.

10 Approaches to preservation of the web
Partnership with publishers: publishers and libraries as partners
Selective collection of open access web: librarianship in a new domain
Bulk collection of open access web: automatic librarianship

11 Partnerships with publishers
Library of Congress and UMI: US theses and dissertations
American Physical Society and Cornell University: journals in physics
Elsevier Science: policy statement on archiving

12 Approaches to preservation of the web
Partnership with publishers: publishers and libraries as partners
Selective collection of open access web: librarianship in a new domain
Bulk collection of open access web: automatic librarianship
Cornell and Library of Congress

13 Selective preservation
Selection of web sites
Example: National Library of Australia
  national importance
  multiple versions (print and online)
  authority and research value

14 Selection of web sites
Pragmatic considerations:
  technical complexity -- not all standards are good
  frequency of making copies
COST
Librarianship in a new domain

15 Catalogs and indexes
Example: CORC
  simple standard using Dublin Core
  tools for creating records
COST
Librarianship in a new domain
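As a rough illustration of what a simple Dublin Core record for a web resource might look like, here is a minimal Python sketch. It is not CORC's actual record format or tooling: the `record` wrapper element, the `make_dc_record` helper, and the identifier URL are assumptions made for the example. Only the Dublin Core element namespace and the element names are taken from the standard.

```python
import xml.etree.ElementTree as ET

# Namespace of the 15 simple Dublin Core elements.
DC_NS = "http://purl.org/dc/elements/1.1/"


def make_dc_record(fields):
    """Build a flat record from a dict of element name -> list of values.

    Hypothetical helper for illustration; the <record> wrapper is an
    assumption, not part of Dublin Core itself.
    """
    ET.register_namespace("dc", DC_NS)
    record = ET.Element("record")
    for name, values in fields.items():
        for value in values:
            element = ET.SubElement(record, f"{{{DC_NS}}}{name}")
            element.text = value
    return record


example = make_dc_record({
    "title": ["D-Lib Magazine"],
    "publisher": ["CNRI"],
    "type": ["Text"],
    "identifier": ["http://example.org/dlib"],  # hypothetical URL
})
xml_text = ET.tostring(example, encoding="unicode")
print(xml_text)
```

Every element is optional and repeatable, which is what keeps such records cheap to create compared with full library cataloging.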

16 Bulk collection: automatic librarianship
Volumes of information are too great for human selection, indexing and management.
Examples:
  Kulturarw3 -- National Library of Sweden
  Internet Archive -- Brewster Kahle
Automatic methods are used to collect, organize and provide access.

17 Automatic librarianship
Collection
Example: Internet Archive
  Collecting open access web since 1996
  Complete sweep of web approximately once a month
  HTML pages only
  14 terabytes of data (soon all online)
  access for researchers using Unix tools
  7 people
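The idea of an automatic sweep that collects HTML pages by following links can be sketched in a few lines of Python. This is only an illustration of breadth-first link following, not the Internet Archive's crawler: the `sweep` function, the `fetch` callback, and the in-memory `TOY_WEB` are all hypothetical, and a real collector would also need politeness delays, robots.txt handling, and persistent storage.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collect the href values of <a> tags in one HTML page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def sweep(seed, fetch):
    """Breadth-first sweep from a seed URL.

    `fetch` maps a URL to its HTML text, or None if the page is
    unavailable. Returns a dict of URL -> HTML for every page reached.
    """
    seen = {seed}
    queue = deque([seed])
    archive = {}
    while queue:
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            continue  # dead link: nothing to store
        archive[url] = html
        parser = LinkExtractor()
        parser.feed(html)
        for href in parser.links:
            target = urljoin(url, href)  # resolve relative links
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return archive


# Toy stand-in for the web, so the sketch runs without network access.
TOY_WEB = {
    "http://example.org/": '<a href="a.html">A</a> <a href="/b.html">B</a>',
    "http://example.org/a.html": '<a href="/">home</a>',
    "http://example.org/b.html": '<a href="missing.html">gone</a>',
}
archive = sweep("http://example.org/", TOY_WEB.get)
print(sorted(archive))
```

The `seen` set is what keeps the sweep from looping when pages link back to each other, as the home page and `a.html` do here.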

18 Automatic librarianship
Indexing
Examples:
  ResearchIndex
  Google

19 Legal issues
The legal position of archives that download open access materials is unclear.
Preservation is in the national interest.
See the discussion in The Digital Dilemma (National Academy of Sciences, 1999).
The crucial factor is the economic impact on copyright owners.
The Library of Congress has no special position except via copyright deposit.
The U.S. Copyright Office has offered to help clarify the position.

20 Current activities
Selection: guidelines and prototypes
  Library of Congress working group
  Political web sites
Tools
  Web site mirroring
  Web site profiler (M.Eng. project)
Copyright
  Ad hoc working group (Deanna Marcum, Bill Arms)

21 CS 502 Computing Methods for Digital Libraries THE END