Born-digital AES and CES publications: Archiving and Preserving the New Stuff
Linda Eells & Leslie Delserone


The Problem
“…digital information isn’t going to be easy to find…at a stable address in a stable form unless it is held by libraries – and yet, libraries do not hold most of the digital information…important to scholarship. It is out there in the wild, on the Web, not collected or preserved.”
- Unsworth and Yu 2003
“The average lifespan of a Web page today is 100 days. This is no way to run a culture.”
- Brewster Kahle, Director and Co-Founder, Internet Archive

Born-digital Extension Publications: Future?

The Problem: LINK ROT
The percentage of inactive Internet references increased from 3.8% at 3 months to 13% at 27 months after publication
Inactive Internet references:
  .com addresses: 46% lost after 27 months
  .edu: 30%
  other: 20%
  .gov: 10%
  .org: 5%
- Dellavalle et al.
46% of all citations to Web-located sources could not be accessed
HTTP 404 (Page not found) message (61.5%) was the greatest cause of missing citations
Collectively, the missing citations accounted for 22.0% of all citations
- Sellitto 2005
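A reference list can be checked for link rot in the same spirit as the studies cited above. The following is a minimal sketch, not the method used by Dellavalle et al. or Sellitto: the function names (`fetch_status`, `top_level_domain`, `link_rot_report`) are hypothetical, and a reference is treated as inactive whenever the server does not return a 2xx or 3xx status, which is an assumption.

```python
from collections import Counter
from urllib.error import HTTPError
from urllib.parse import urlparse
from urllib.request import Request, urlopen


def fetch_status(url, timeout=10.0):
    """Return the HTTP status code for url, or 0 if no response was received."""
    request = Request(url, method="HEAD")
    try:
        with urlopen(request, timeout=timeout) as response:
            return response.status
    except HTTPError as err:
        # The server answered, but with an error status such as 404.
        return err.code
    except (OSError, ValueError):
        # DNS failure, refused connection, timeout, or malformed URL.
        return 0


def top_level_domain(url):
    """Return the final label of the host name (e.g. '.edu'), or 'other'."""
    host = urlparse(url).hostname or ""
    return "." + host.rsplit(".", 1)[-1] if "." in host else "other"


def link_rot_report(urls, fetch=fetch_status):
    """Return the percentage of inactive references per top-level domain.

    Assumption: a reference counts as inactive unless the status is 2xx or 3xx.
    """
    checked = Counter()
    inactive = Counter()
    for url in urls:
        tld = top_level_domain(url)
        checked[tld] += 1
        if not 200 <= fetch(url) < 400:
            inactive[tld] += 1
    return {tld: 100.0 * inactive[tld] / checked[tld] for tld in checked}
```

Passing a different `fetch` function makes it possible to try the report on recorded status codes without touching the network; running it on the same list at intervals would give a rough picture of how quickly references go inactive.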

SILOS Current content

More SILOS: Historical (reborn digital) content
National Preservation Program for Agricultural Literature
  A National Endowment for the Humanities supported project of USAIN (United States Agricultural Information Network)
  Preserve and provide access to agricultural literature published prior to 1950
  Twenty-seven states in first five phases
  MN: 350 titles, ~3,000 volumes
NAL/Land-grant Universities Microfilming Project
  Early 1980s: microfilmed older Minnesota Agricultural Experiment Station, Agricultural Extension Service, and some academic department publications, including both monographs and serials
Cornell: >1,900 books, 6 journals, >850,000 pages

Key Concepts
Phased approach
Scalable
Compliant with national/international standards
Persistent long-term access
Secure
Openly accessible
Collaborative content development
Sustainable deposition/description

Born-digital Extension Publications Project
In partnership with, and with strong (and critical) support from, University [of Minnesota] Cooperative Extension, the Agricultural Experiment Station, and the College of Agriculture, Food, and Natural Resource Sciences:
Collaboratively establish workflows, create policies, determine appropriate standards, and develop the technical infrastructure for a repository of born (and eventually reborn) digital agricultural resources that may be readily scaled to involve other national and international partners (e.g., NAL, eXtension, USAIN, FAO, AgNIC).

Research Proposal: Context
Evidence to support a born-digital pilot project at a national level

Research Proposal: Question
What are the best practices for conversion of documents that are available in both print and obsolete digital formats?

Research Proposal: Approach
Sample of extension publications, available both in print and digitally
Conversion from born-digital format or print to archival digital format (pdf)
File-to-file conversion vs scan+OCR
Description and deposit into the University Digital Conservancy (UDC)

Reality check
Recession…
“unallotments”…
Short-term, EFY funds

Current Pilot Project: Conversion Sample
Minnesota Extension Service publications:
  Bulletins
  Fact Sheets
  Miscellaneous Publications
Total of 245 documents

Current Pilot Project: Methodology
File-to-file conversion
Obsolete or non-archival quality publishing formats (e.g., InDesign, Quark, PageMaker) to pdf
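Before converting, it can help to inventory how many source files sit in each of the publishing formats named above. The sketch below is illustrative only and is not the pilot project's documented workflow: the extension-to-format mapping and the names `LEGACY_FORMATS`, `classify`, and `inventory` are assumptions.

```python
from collections import Counter
from pathlib import Path

# Assumed mapping from file extension to publishing format (illustrative,
# not a list taken from the project).
LEGACY_FORMATS = {
    ".indd": "InDesign",
    ".qxd": "Quark",
    ".qxp": "Quark",
    ".pmd": "PageMaker",
    ".p65": "PageMaker",
    ".pm6": "PageMaker",
}


def classify(path):
    """Return the publishing format implied by the file extension, or 'other'."""
    return LEGACY_FORMATS.get(Path(path).suffix.lower(), "other")


def inventory(paths):
    """Count files per publishing format."""
    return Counter(classify(p) for p in paths)
```

For a directory of source files, `inventory(Path("some_directory").rglob("*"))` would give the counts per format; the directory name here is a placeholder.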

Current Pilot Project: Preliminary Analyses
Evaluate file-to-file conversion based on time, expense, error rates, and ease of workflow
Assess time costs versus benefits associated with the application of NAL-T terms to this type of content
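The evaluation criteria above (time, expense, error rates) could be tracked per file and summarized per conversion method. This is a hedged sketch under assumptions: the `ConversionRecord` fields and the `summarize` function are hypothetical, and no figures from the pilot project are reproduced here.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class ConversionRecord:
    """One converted publication (all fields are assumed for illustration)."""
    title: str
    method: str      # e.g. "file-to-file" or "scan+OCR"
    minutes: float   # staff time spent converting and proofing
    cost: float      # expense attributed to this file
    errors: int      # errors found when proofing against print


def summarize(records, method):
    """Summarize time, expense, and error rate for one conversion method."""
    chosen = [r for r in records if r.method == method]
    if not chosen:
        raise ValueError(f"no records for method {method!r}")
    return {
        "files": len(chosen),
        "mean_minutes": mean(r.minutes for r in chosen),
        "total_cost": sum(r.cost for r in chosen),
        # Share of files in which proofing found at least one error.
        "share_with_errors": sum(1 for r in chosen if r.errors > 0) / len(chosen),
    }
```

Calling `summarize` once per method puts the two approaches side by side on the same measures; ease of workflow is qualitative and is left out of the sketch.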

Current Pilot Project: Staff
Conversion and proofing work: Library professional, two students with backgrounds in graphic design
Description and UDC deposit: Library professional

Results, Current Pilot Project
Fully converted, proofed, described & uploaded to UDC: 136
Converted files awaiting proofing: 6
Files in process:
Files with unresolvable issues: 21
Files without print for comparison: 52

Conclusions, Current Pilot Project
If print is available, file-to-file conversion is not the most efficient choice
Minimal metadata application (NAL-T, abstract) was workable
Preliminary estimate of permanently lost publications
Evidence for greater attention to collecting print

Next Steps
Scan + OCR
HTML capture?
Establish workflow from Extension Service to University Libraries to UDC
Establish selection criteria in collaboration with Extension Service staff
Still a silo…

References
Cornell Historical Literature for Agriculture. Available at (18 April 2008).
Dellavalle, Robert P. et al. Going, Going, Gone: Lost Internet References. Science 302 (5646): 787.
Eells, L. Born-Digital Agricultural Resources: Archives and Issues. Quarterly Bulletin of IAALD, 52(3/4).
Gwinn, Nancy. A National Preservation Program for Agricultural Literature. U.S. Agricultural Information Network.
Heatley, R. Plan to Develop a Digital Information Infrastructure to Manage Land Grant Information. Available at (5 February 2008).
Sellitto, Carmine. The impact of impermanent Web-located citations: A study of 123 scholarly conference publications. Journal of the American Society for Information Science and Technology 56(7):
Unsworth, John and Pauline Yu. 2003. Not-so-Modest Proposals: What do we want our system of scholarly communication to look like in 2010? CIC Summit on Scholarly Communication, Chicago, December 2, Available at (18 April 2008).
USAIN Task Force. Making the Case for a Next-Generation Digital Information System to Ensure America’s Leadership in Agricultural Sciences in the 21st Century. Available at (1 May 2008).
Weiss, Rick. Electronic Archivists Are Playing Catch-Up in Trying to Keep Documents From Landing in History's Dustbin. Washington Post, November 24, 2003, A08.