Challenges in Web Archiving UNT Perspective NDIIPP – July 21, 2010.

Slides:



Advertisements
Similar presentations
K-12 Web Archiving Project Archive-It Partner Meeting November 4, 2009.
Advertisements

Digital Initiatives at the University of North Texas Libraries Cathy Nelson Hartman University of North Texas Libraries Texas Conference on Digital Libraries.
© 2008 EBSCO Information Services SUSHI, COUNTER and ERM Systems An Update on Usage Standards Ressources électroniques dans les bibliothèques électroniques.
Metadata for Digital Content at the Library of Congress Jane Mandelbaum Information Technology Services Library of Congress May 2009.
Panel: What Changes With Digital? Web Archiving ARL Forum 2009 Tracy Seneca – California Digital Library.
Strategic Action Group 3: Scholarly Research & Communication (SAG3) Orientation Webinar June 24, 2013 University of California Libraries Systemwide Advisory.
An Arizona Model for Capturing and Describing Documents on the Web Richard Pearce-Moses Director of Digital Government Information Arizona State Library,
Strategic Action Group 3: Collection Licensing Subgroup (CLS) Orientation Webinar June 25, 2013 University of California Libraries Systemwide Advisory.
1 Archiving and Preserving the Web Kristine Hanna Internet Archive July 2008.
The FDLP Web Archive Dory Bower Archive-It Partner Meeting November 18, 2014.
1 Archiving and Preserving the Web Kristine Hanna Internet Archive April 2006.
National Digital Information Infrastructure and Preservation Program (NDIIPP) Building a Network of Preservation Partners CNI Spring Task Force Meeting.
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
1 Archiving and Preserving the Web Dan Avery Kristine Hanna Merrilee Proffitt Internet Archive RLG April 2006.
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
A Partnership Born of Urgency and Civic Responsibility Preserving Access to Government Websites Through the CyberCemetery Starr Hoffman Librarian for Digital.
E NSURING THAT THE LIBRARY DEVELOPMENT COMMUNITY THRIVES AT THE NETWORK LEVEL Terry Reese Gray Family Chair for Innovative Library Services
Web Capture team Office of strategic initiatives February 27, 2006 Selecting Content from the Web: Challenges and Experiences of the Library of Congress.
Impact of E-Resources 2004 North Carolina Serials Conference Aisha Harvey Collection Development & Reference Perkins Library, Duke University 4/16/2004.
The Web Archiving Service Tracy Seneca California Digital Library California Digital LibraryNew York UniversityUniversity of North Texas National Digital.
Copyright © 2008, Open Geospatial Consortium, Inc., All Rights Reserved. NDIIPP Partnership Update: North Carolina and Multi-state Demonstration Projects.
Open Access Symposium 2015 Open Access, the Law, and Public Information Mary Alice Baish UNT Dallas College of Law May 19, 2015 National Plan for Access.
The ECHO DEPository Project A project of the University of Illinois at Urbana-Champaign and OCLC in partnership with the Library of Congress ALA Annual.
ECHO DEPository Project: Highlight on tools & emerging issues The ECHO DEPository Project is a 3-year digital preservation research and development project.
Web Archiving Challenges: Collaborative Collection Building.
Aarhus. BnF main topics – 2013 – crawling side Keep crawling –Broad and focused crawls –Limit of 100 Tb Crawl of password protected content –“Press project”:
Office of Strategic Initiatives All Hands Meeting-March 2010 Challenges in Web Archiving: Library of Congress Edition Abbie Grotke, Web Archiving Team.
1 Archive-It: Archiving and Preserving Born Digital Content NDIIPP June 2009 Molly Bragg Partner Specialist Internet Archive.
K-12 Web Archiving Project NDIIPP Partner Meeting July 10, 2008.
Group. “Your partner in developing future Lifelong Learners” UROWNE UNIVERSITY LIBRARY.
Can we be doing more? Beth Tillinghast University of Hawaii at Manoa October 19, 2011 Archive-It Partner Meeting ACCESS TO OUR ARCHIVED WEBSITE COLLECTIONS.
Preserving Digital Culture: Tools & Strategies for Building Web Archives : Tools and Strategies for Building Web Archives Internet Librarian 2009 Tracy.
Web Archiving at the National Library of Australia Russell Latham Senior Web Archivist, National Library of Australia.
The Library of Congress Martha Anderson Program Officer, NDIIPP Office of Strategic Initiatives Library of Congress April 2005 LC Perspective : Preservation.
National Digital Information Infrastructure and Preservation Program (NDIIPP) CNI Project Briefing December 5, 2005.
November 2004 NDIIPP: Future Directions and Relevance to Other Countries Beth Dulabahn Office of Strategic Initiatives Library of Congress November 7,
Martin Halbert UNT Dean of Libraries MetaArchive President Monday, April 11, 2011 Newspaper Archive Summit University of Missouri Columbia, MO.
Safeguarding the Freedom of Information: Digital Archive Initiatives in the United States Federal Government Michael Paul Huff Information Resource Officer.
1 E- Learning and Writing Skills IGGU 1101 Libraries Dr. Sana’a Wafa Al-Sayegh.
Imaging Pittsburgh: Creating a Shared Gateway to Digital Image Collections of the Pittsburgh Region IMLS 2002 National Leadership Grant Library & Museum.
CyberCemetery Preserving At-Risk Government Web Content.
1 Turning Challenges into Opportunities: How Law Libraries Can Capture and Preserve Government Web Resources AALL Annual Meeting July 17, 2007 Cathy Nelson.
Discovery Tools for Health Libraries  11 th September 2015 WorldCat Discovery Services Simon Day Product Manager.
GPO POLICIES AND PLANS FOR SPATIAL INFORMATION DISTRIBUTION GPO POLICIES AND PLANS FOR SPATIAL INFORMATION DISTRIBUTION Judy Russell Superintendent of.
The Web-at-Risk NDIIPP Sponsored Project Partners include: California Digital Library – project lead University of North Texas New York University California.
Web Archiving Service Public Access Release Date: July
Encouraging An Informed Citizenry: Locating and Using Congressional Research Service Reports Starr Hoffman Librarian for Digital Collections University.
Preservation Program Digital Preservation Program Digital Preservation Services: Extending tools to meet campus needs Patricia Cruse, Director, Digital.
Navigating the ‘information jungle’ a Research Safari Leonie McIlvenny.
Orientation Webinar July 10, 2013 University of California Libraries Systemwide Advisory Structure.
Travis Clayton Senior Consultant Microsoft Enterprise Services.
The Disappearing Data Problem Steve Morris Head of Digital Library Initiatives North Carolina State University Libraries.
Current Landscape in Federal Government Printing and Publishing: Pros & Cons of Digitization and the Impact on Users.
Building Collections on the Web BCWeb. What’s BCWeb ? BCWeb was developped entirely by the BnF for the content curators to replace its old selection tools.
Library of Congress Partnerships for Managing Geospatial Data North Carolina Geographic Information Coordinating Council Raleigh, NC November 7, 2007 William.
LOCKSS at Georgia Tech Patricia E. Kenly April 2007.
Access to Government Documents in the Digital Age: Should we be worried?
Preserving the End of a Digital Era Kate Kosturski December 16, 2008.
Geospatial Data Appraisal NDIIPP Meeting Presented by Brett Abrams, Archivist June, 2012.
A Shared Commitment to Digital Preservation and Access.
Web Archiving Workshop Mark Phillips Texas Conference on Digital Libraries June 4, 2008.
2008 DOT GOV HARVEST PRESERVING ACCESS UNIVERSITY OF NORTH TEXAS LIBRARIES Cathy N. Hartman Mark E. Phillips FDLC Oct 21, 2008.
Digitization Workflows From the Digital Projects Unit University of North Texas Libraries Mark E. Phillips Jeremy D. Moore February 12, 2009.
Building A Repository for Digital Objects
Library of Congress Resources for Federal Records Managers
Curate, Archive, Manage, Preserve
Web Documents Digital Archive Pilot Project at GPO
Preserving Our Collective Digital History
CNI Project Briefing December 5, 2005
Update on Digitization Projects at UNT
Presentation transcript:

Challenges in Web Archiving UNT Perspective NDIIPP – July 21, 2010

Broad Challenges  Make Web archives more usable for libraries  Tools for collection builders  Bringing the Web archive to the collection builders  Build digital library collections from Web content  Identification of key content within archive  Move beyond the needle in haystack approach of selection  Counting and reporting  Understanding how Web archives should fit into traditional library metrics July 21, 2010NDIIPP

EOT Archiving Project  Who  Library of Congress, the GPO, the Internet Archive (IA), the University of North Texas (UNT) Libraries, and the California Digital Library (CDL)  What  Snapshot of the federal government’s public Web presence  When  Before & after the 2009 change in administrations  How  Nomination Tool: Websites  Website Harvests: IA, UNT, & CDL  Harvest Consolidation: Library of Congress July 21, 2010NDIIPP

EOTCD Project  EOT Archive Classification  Objective: Classify materials in accord with the Superintendent of Documents (SuDocs) Classification Numbering System  Outcome: Enable librarians to utilize existing selection practices to identify materials in the EOT Archive  Web Archive Metrics  Objective: Identify a set of metrics for materials in Web archives  Outcome: Enable characterization of materials in Web archives in units of measurement more familiar to libraries and their administrations July 21, 2010NDIIPP

Research Questions July 21, 2010NDIIPP 1. How effective is the organization of large-scale unstructured Web archives using a pre-defined classification system, the SuDocs classification numbering system, as evaluated by government information librarians? 2. What measurable units for the materials in Web archives best support management acquisition decisions in libraries?

Next Steps  Create a process for understanding Web archives  Large historical archives  Enable collection librarians to make decisions involving archive content  Workflows for moving content into other areas of the library July 21, 2010NDIIPP