A Partnership Born of Urgency and Civic Responsibility Preserving Access to Government Websites Through the CyberCemetery Starr Hoffman Librarian for Digital.

Slides:



Advertisements
Similar presentations
Digital Initiatives at the University of North Texas Libraries Cathy Nelson Hartman University of North Texas Libraries Texas Conference on Digital Libraries.
Advertisements

Business Development Suit Presented by Thomas Mathews.
U.S. Government Printing Office Packaging and Metadata PREMIS Implementers Panel Library of Congress June 13, 2007.
1 What is the Internet Archive We are a Digital Library Mission Statement: Universal access to human knowledge Founded in 1996 by Brewster Kahle in San.
Copyright © 2014 Pearson Education, Inc. Publishing as Prentice Hall
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
IAEA International Atomic Energy Agency United Nations Library and Information Network for Knowledge Sharing (UN-LINKS) September 2013, Geneva.
Providing Access to Wisconsin State Government Documents By Abby Swanton, Librarian Dept. of Public Instruction, Reference and Loan Library Minnesota Capitol.
IAEA International Atomic Energy Agency ICSTI 2013 Annual Members’ Meeting March 2013.
The Division of Labor on a Campus Hosting Open Journal Systems and Open Conference Systems.
Publishing Partnerships: Federal Depository Library Program Requirements and Opportunities for Cooperation Federal Customer Agency Presentation April 2015.
Technical Tips and Tricks for User Support Mike Gardner
TC2-Computer Literacy Mr. Sencer February 4, 2010.
Your online classroom. Powerhouse Campus o Custom Class dashboards o Links with Moodle, Studywiz, Bb, ClickView & all web apps o Links your school library.
Static and Dynamic Websites Static and Dynamic Website Design Presented by: Shawn Cohan, President All Squared Web Design, LLC
1 Archiving and Preserving the Web Kristine Hanna Internet Archive April 2006.
1 of 7 This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation.
1 The Vietnam Center and Archive Stephen Maxner, Ph.D.
Online Resources From Oxford University Press This presentation gives a brief description of Oxford Journals. It tells you: what the journals are; how.
1 Archive-It Training University of Maryland July 12, 2007.
Web Content Management at GCN.com The Gilbane Conference: Content Technologies for Government Alec Dann SVP of Internet Publishing PostNewsweek Tech Media.
Building Library Web Site Using Drupal
Energy and the Environment Research Resources Locating Government Publications and other material Donna Burton Documents Librarian Documents Librarianx6635.
© 2008 The McGraw-Hill Companies, Inc. All rights reserved. M I C R O S O F T ® Preparing for Electronic Distribution Lesson 14.
Lecturer: Ghadah Aldehim
1 Archiving and Preserving the Web Dan Avery Kristine Hanna Merrilee Proffitt Internet Archive RLG April 2006.
Multimedia and the Web Chapter Overview  This chapter covers:  What Web-based multimedia is  how it is used today  advantages and disadvantages.
City of Seattle Office of the City Clerk Open Government = Access Challenges and Opportunities with Digital Records.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Collaborative Approach to Open Access: Experience from Bioline International Leslie Chan Associate Director Bioline International University of Toronto.
1. 2 introductions Nicholas Fischio Development Manager Kelvin Smith Library of Case Western Reserve University Benjamin Bykowski Tech Lead and Senior.
U.S. Government Printing Office FDsys Update Spring Depository Library Council April 16, 2007.
Support.ebsco.com Basic Searching for K-12 School Libraries Tutorial.
1.Getting Started 2.Modifying Design 3.Page 4.News 5.Events 6.Photo Gallery 7.Newsletter Index Training 15 th Mar., 2011.
The Library of Congress Martha Anderson Program Officer, NDIIPP Office of Strategic Initiatives Library of Congress April 2005 LC Perspective : Preservation.
Open access & visibility Management Digital Preservation ORA: Purposes.
The Internet 8th Edition Tutorial 4 Searching the Web.
AILLA:The Archive of the Indigenous Languages of Latin America Heidi Johnson The University of Texas at Austin Latin American Digital Library Initiative,
Tech Competencies for the 21st-Century Librarian Starr Hoffman Librarian for Digital Collections University of North Texas Libraries San Antonio Public.
CyberCemetery Preserving At-Risk Government Web Content.
ALA Institutional Repository Update ALA Archives at the University of Illinois Urbana-Champaign Chris Prom Cara Bertram Denise Rayman.
The “How” of Wikis Starr Hoffman Librarian for Digital Collections University of North Texas Libraries Five Weeks to a Social Library:
GPO POLICIES AND PLANS FOR SPATIAL INFORMATION DISTRIBUTION GPO POLICIES AND PLANS FOR SPATIAL INFORMATION DISTRIBUTION Judy Russell Superintendent of.
Susan Lyons, Rutgers University Law School Library, Moderator George Barnum, United States Government Printing Office Cathy Nelson Hartman, University.
1 SERD Project Director’s Conference CRIS OVERVIEW Education Component Current Research Information System March 30, 2005 Dr. Irma A. Lawrence National.
Encouraging An Informed Citizenry: Locating and Using Congressional Research Service Reports Starr Hoffman Librarian for Digital Collections University.
IBM Lotus Software © 2006 IBM Corporation IBM Lotus Notes Domino Blog Template Steve Castledine.
GPO’s Future Digital System (FDsys) November 2, 2006 LS&CM CENDI Presentation.
Microsoft Office 2008 for Mac – Illustrated Unit D: Getting Started with Safari.
Discovering Computers Fundamentals, 2010 Edition Living in a Digital World.
COM: 111 Introduction to Computer Applications Department of Information & Communication Technology Panayiotis Christodoulou.
Date of Presentation Name of Presenter Insert image _________ Toolkit.
Access to Government Documents in the Digital Age: Should we be worried?
Where are my files? Discoveries in establishing a digital archive workflow Sally McDonald Archivist/Librarian Western History/Genealogy, Denver Public.
Grant Writing for Digital Projects September 2012 IODE Project Office IODE Project Office Oostende, Belgium Oostende, Belgium Sustainability and.
introductionwhyexamples What is a Web site? A web site is: a presentation tool; a way to communicate; a learning tool; a teaching tool; a marketing important.
Fab25 User Training Cerium Labs LabCollector - LIMS Lynette Ballast.
RSC Learning Resources Conference 8 th November 2012, Manchester Andrew Bevan (EDINA)
Government Printing Office Future Digital System (FDsys) Special Library Association Open Access and Public Access: New Models for Information Access June.
Financial Management of ECE Programs.  Go to “Tools”  Click on “Personal Information” to edit your personal information (including address) or.
Federal Regulations Federal regulations are the third primary source of American law discussed. Proposed regulations and final regulations are published.
2008 DOT GOV HARVEST PRESERVING ACCESS UNIVERSITY OF NORTH TEXAS LIBRARIES Cathy N. Hartman Mark E. Phillips FDLC Oct 21, 2008.
Archiving & Preserving Digital Content
Building A Repository for Digital Objects
Latin American Government Documents Archive, LAGDA
PUBLIC SCHOOL LAW Part 15: Primary Legal Sources-Administrative Law
Slides prepared by Sarah Benis Scheier-Dolberg
Introduction To Building a Web Site
Presentation transcript:

A Partnership Born of Urgency and Civic Responsibility Preserving Access to Government Websites Through the CyberCemetery Starr Hoffman Librarian for Digital Collections University of North Texas Libraries 22 April AGA Regional Professional Development Conference

Presentation Overview Intro: What is the CyberCemetery? Purpose: Why create a CyberCemetery? Development Archiving Process Technical Details Users by Country Types of Content Using the CyberCemetery Other Resources Conclusion

What is the CyberCemetery?

online archive of websites from U.S. government agencies or commissions that are no longer operating

What is the CyberCemetery? online archive of websites from U.S. government agencies or commissions that are no longer operating maintained by the University of North Texas Libraries freely accessible world-wide

CyberCemetery vs. Dot Gov Harvest  Partners: UNT, GPO, NARA  archive of websites from U.S. government agencies or commissions that are no longer operating  “dead” websites (no longer hosted or maintained by the government)  currently live & useable  purpose:  to preserve “dead” government websites and provide permanent public access  Partners: LC, IA, UNT, others  archive of government website “snapshots” from key time periods (i.e., before/after an administration change)  will include snapshots of many still-live websites  archived, but not currently “live”  purpose:  to preserve a record of government web presence during specific time periods and administrations  to track changes in government websites over time present present

Why Create the CyberCemetery? At-Risk Information: ◦ 1990’s: U.S. government information moved online ◦ much of it born-digital ◦ often edited or removed without warning Federal Depository Library Program (FDLP)  mission:  to provide free, permanent public access to government information  online information complicates this mission  administered by the U.S. Government Printing Office (GPO)  UNT = federal depository library

1995 ◦ report from Government Printing Office (GPO):  need to preserve electronic government publications 1997 ◦ UNT & GPO discuss a partnership ◦ UNT archives ACIR website  (Advisory Commission on Intergovernmental Relations) Development

Development 1999 ◦ UNT/GPO partnership expanded  permanent public access  multiple government websites  government agency or commission which is no longer operating  (and/or has issued a final report) ◦ Collection named “CyberCemetery”  websites from “dead” government agencies and commissions

Development 2006 ◦ UNT/GPO partnership expanded  U.S. National Archives and Records Administration (NARA)

Archiving Process Identify at-risk government agencies and commissions ◦ read/listen to the news ◦ online queries targeting keywords (i.e., “final report”) ◦ read government-related websites and blogs ◦ referrals from other librarians ◦ contacted by GPO ◦ contacted directly by the agency/commission

Archiving Process Evaluate the website ◦ official government website ◦ agency or commission must:  be closing  issued a final report  other indication that the website is at-risk

Archiving Process Evaluate the website (continued)  Questions for website administrator: 1. What operating system was used to host this website? 2. What webserver software was used for the hosting of this website? 3. Are server side includes (ssi) used in this website? 4. Was this website static html or a dynamic site? 1. If dynamic, what scripting languages were used for this website (php, perl, python)? 2. Was a database used for this website? 1. If so, what database was used for this website? 2. What methods were used to connect to the database? 5. Is there streaming media associated with this website? 6. Are there proprietary content types used in this website? 7. Are there any comments you would like to add?

Archiving Process Harvest the website Past method: HTTrack   user interface:  UNT’s Digital Collections website Current method: Heritrix   ARC files  website in a single file: 100 – 600MB  user interface:  Internet Archive’s Wayback Machine

Archiving Process Harvesting alternative: Donated content directly receive files from agency or commission ◦ Why donated content?  If content cannot be accessed by harvesting  flash video, large amounts of media ◦ Why not donated content?  Content could be altered  Harvesting = exact copy of online published content

Archiving Process Link Checking ◦ Automated:  Xenu Link Checker   compare reports of original and archived sites ◦ Manual:  manually navigate original and archived sites

Archiving Process Archive Preparation (previous method) ◦ add text “Archive”  8 point, Times New Roman font  added to top/center of each page ◦ manually disable contact links  “mail to” links  submit-able forms (Heritrix makes these preparations unnecessary)

Archiving Process Load to UNT Server ◦ Upload archived website ◦ Add navigation ◦ Notify GPO (or agency/commission) that archived version is live

Technical Details Equipment ◦ Four servers (three as backup) ◦ Four node fail-over clustered configuration ◦ SAN volume ◦ 27.2GB of content on 40GB server Environment ◦ Library basement ◦ 38 ◦ Fahrenheit (3 ◦ Celsius) ◦ 50% humidity

Technical Details Backup ◦ full backups to magnetic tape ◦ performed each weekend ◦ shipped to offsite storage company  Iron Mountain 

Where Are Our Users?

Types of Content web files (HTML, XML) text documents (.txt,.pdf,.doc) spreadsheets & statistical information (.xls) presentations (.ppt) media files: ◦ images & photographs (.jpg,.gif,.png, tiff) ◦ audio (.mp3) ◦ video (.wm,.mov,.rp)

Using the CyberCemetery

Navigating browse by: ◦ title ◦ date of expiration ◦ government branch

Navigating main search box ◦ all CyberCemetery content at once ◦ National Partnership for Reinventing Government ◦ Office of Technology Assessment ◦ 9/11 Commission

Other Resources Congressional Research Reports ◦ research specialists at Library of Congress ◦ topics relevant to pending legislation ◦ high-quality, non-biased information ◦ created for members of Congress ◦ not typically publically available ◦ +10,000 reports available

Other Resources UNT Digital Library ◦ digitizing our “legacy” collection of government documents  “A-Z Digitization Project” ◦ FCC Record  (FCC Report = future project) ◦ U.S. Agricultural Experiment Station Record ◦ OTA documents ◦ ACIR documents

Other Resources get updates via RSS example feed: ◦ feed://digital.library.unt.edu/explore/collections/ATOZ/feed/

Ask us! ◦ phone: (940) , main desk ◦ Government Documents Dept. Service Desk Hours

Conclusion permanent public access archived government information freely, globally available partnership: ◦ University of North Texas Libraries ◦ U.S. Government Printing Office ◦ National Archives and Records Administration

Contact Information download this presentation: ◦ Starr Hoffman Librarian for Digital Collections Government Documents Department University of North Texas Libraries