HATHI TRUST A Shared Digital Repository Building A Future By Preserving Our Past The Preservation Infrastructure of HathiTrust Digital Library Jeremy York.

Slides:



Advertisements
Similar presentations
HathiTrust Unless otherwise noted, these slides and their contents are licensed under a Creative Commons Attribution Unported License.
Advertisements

Beyond the Google Book: the Future of the Digital Library Cory Snavely Library IT Core Services manager University of Michigan April 20, 2010.
HathiTrust Digital Library
HATHI TRUST A Shared Digital Repository HathiTrust Digital Library Is There A Past In Your Future? Princeton University February 2010.
KAT HAGEDORN HATHITRUST SPECIAL PROJECTS COORDINATOR UNIVERSITY OF MICHIGAN LIBRARIES OCTOBER 9, 2009 Seamless Sharing: NYU, HathiTrust, ReCAP and the.
HathiTrust: Building the Universal Collection John Wilkin 18 May 2009.
This Library Never Forgets Preservation, Cooperation, and the Making of HathiTrust Digital Library Jeremy York Project Librarian HathiTrust Digital Library.
HathiTrust: A Big Idea with Bold Plans
HATHI TRUST A Shared Digital Repository HathiTrust Open Webinar Jeremy York Project Librarian, HathiTrust May 3 and 5, 2011.
HATHI TRUST A Shared Digital Repository HathiTrust Overview Julie Bobay, Heather Christenson, and John Wilkin April 12, 2011.
Building the Universal Library: The Promise and Challenges of HathiTrust John Wilkin 2 April 2009.
HathiTrust Sharing a Federal Print Repository: Issues and Opportunities May 25, 2011 Heather Christenson.
HATHI TRUST A Shared Digital Repository Digital Preservation, HathiTrust, and the Reimagination of the Library Landscape Jeremy York Iceland August 5,
HATHI TRUST A Shared Digital Repository HathiTrust How We Can Make A Difference Jeremy York Yale University November 3, 2010.
What is HathiTrust and How Can it Make a Difference? Sourcing and Scaling brought to the collective collection.
HATHI TRUST A Shared Digital Repository HathiTrust 101 John Wilkin and Jeremy York August 27, 2010.
What is HathiTrust and Why is it relevant to research libraries? Sourcing and Scaling brought to the collective collection.
Building the Universal Library: Introducing HathiTrust Patricia A. Steele Indiana University Libraries John Price Wilkin University of Michigan Libraries.
HATHI TRUST A Shared Digital Repository Delivering Data For New Generations of Research Strategies and Challenges Jeremy York NISO/BISG Forum ALA 2010.
HATHI TRUST A Shared Digital Repository HathiTrust, Collections, and Collaboration COLD 2011 Spring Meeting Jeremy York May 20, 2011.
HATHITRUST A Shared Digital Repository HathiTrust Outside-In University of Michigan Law School June 14, 2011 Jeremy York HathiTrust Project Librarian.
KAT HAGEDORN HATHITRUST SPECIAL PROJECTS COORDINATOR UNIVERSITY OF MICHIGAN LIBRARIES OCTOBER 9, 2009 Seamless Sharing: NYU, HathiTrust, ReCAP and the.
Digital Preservation A Matter of Trust. Context * As of March 5, 2011.
HathiTrust Digital Library: Enrich Your Research and Scholarship Doreen Bradley Chris Powell University Library May 2011.
HathiTrust and the Ecology of Shared Collections Paul N. Courant 21 May 2009.
HATHITRUST A Shared Digital Repository We’re Preserving the Past, What About the Present? NISO Webinar: Ensuring the Preservation of E-Books May 23, 2012.
What’s Next for HathiTrust?. We’re Growing Up! Partnership Arizona State University Baylor University Boston University California Digital Library Columbia.
HATHITRUST A Shared Digital Repository HathiTrust current work, challenges, and opportunities for public libraries Creating a Blueprint for a National.
HATHITRUST A Shared Digital Repository HathiTrust as a Model for Preservation and Access Jeremy York Media Preservation Conference April 17, 2013.
HATHI TRUST A Shared Digital Repository Digital Repositories for Preservation and Access Digital Directions 2013 Jeremy York July 22, 2013 Unless otherwise.
HATHITRUST A Shared Digital Repository Bibliographic Metadata and HathiTrust ALCTS CaMMS Catalog Management Interest Group Meeting American Library Association.
HATHITRUST A Shared Digital Repository HathiTrust METS and PREMIS October 25, 2011 Jeremy York Project Librarian, HathiTrust.
HATHITRUST A Shared Digital Repository HathiTrust on the Move A Growing Partnership Taking Stock and Looking Ahead National Library of Medecine October.
HATHITRUST A Shared Digital Repository A Preservation Infrastructure Built to Last: Preservation, Community, and HathiTrust UNESCO Memory of the World.
HATHITRUST A Shared Digital Repository Big Collections in an Era of Big Copyright: Practical Strategies for Making the Most of Digitized Heritage Jeremy.
HATHITRUST A Shared Digital Repository Why Digitize? or The Limits of Preservation 2014 TEI/DHCS Plenary Session Evanston, IL Mike Furlough Executive Director,
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
HATHI TRUST A Shared Digital Repository Columbia University and HathiTrust Collaboration at a new level.
HATHITRUST A Shared Digital Repository HathiTrust Past, Present, and Future A Brief Introduction.
HATHITRUST A Shared Digital Repository More, Better, Together: HathiTrust Accomplishments and Aspirations The Researcher of Tomorrow Universidad Complutense.
HathiTrust – How To By Dr. Rob McGeachin 20 th Annual AgNIC Meeting May 7, 2015.
HATHITRUST A Shared Digital Repository HathiTrust: Putting Research in Context HTRC UnCamp September 10, 2012 John Wilkin, Executive Director, HathiTrust.
HATHITRUST A Shared Digital Repository HathiTrust Infrastructure and Information Organization November 7, 2011 Jeremy York Project Librarian, HathiTrust.
HathiTrust Digital Library. Overview ›Began in 2008 ›Large scale digital preservation repository ›Partnership of major research libraries ›Focus on both.
1 A journey of a thousand miles begins with a single step. Chinese Proverb.
HATHITRUST A Shared Digital Repository HathiTrust: Key Concepts and Issues in Managing the Digital Archive ICPSR Summer Workshop “Curating and Managing.
HATHITRUST A Shared Digital Repository HathiTrust and TRAC DigitalPreservation 2012 July 25, 2012 Jeremy York, Project Librarian, HathiTrust.
H ATHI T RUST HTTP :// WWW. HATHITRUST. ORG Large-Scale Digital Initiatives and their potential impact on the Maine Shared Collections Strategy Colby College.
HathiTrust’s Past, Present and Future. Short- and Long-term Functional Objectives Short-term Page turner mechanism (and Mobile!) Branding (overall initiative;
Author(s): Jeremy York, 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Noncommercial–Share.
ILS Futures. Background Changes 95/96 to 06/07 –Stacks circ= 179,996 to 160,970 ILS’s are no longer the center of the library universe. To most users,
HATHITRUST A Shared Digital Repository HathiTrust and the Future of Research Libraries American Antiquarian Society March 31, 2012 Jeremy York, Project.
HATHITRUST A Shared Digital Repository Your Library, Now Online! Putting HathiTrust in the Context of Traditional (and New) Library Services MCLS Webinar.
HATHI TRUST A Shared Digital Repository Use of PREMIS for Internet Archive AIPs September 22, 2010.
How to Implement an Institutional Repository: Part II A NASIG 2006 Pre-Conference May 4, 2006 Technical Issues.
DAITSS: Dark Archive in the Sunshine State Priscilla Caplan Florida Center for Library Automation (FCLA)
HATHITRUST A Shared Digital Repository Institution Uses of HathiTrust Jeremy York University of Maine May 24, 2013.
HathiTrust: Collaboration in Building the Universal Collection John Wilkin 1 October 2009.
DAITSS and the Florida Digital Archive Priscilla Caplan Florida Center for Library Automation iPRES 2006.
ARIADNE is funded by the European Commission's Seventh Framework Programme Archiving and Repositories Holly Wright.
HathiTrust: Possibilities Metadata Working Group Cornell University Library March 21, 2014.
HATHITRUST A Shared Digital Repository HathiTrust Large Digital Libraries: Beyond Google Books Modern Language Association January 5, 2012 Jeremy York,
Barbara Preece ICOLC, April Mark Sandler Center for Library Initiatives Chicago Illinois Indiana Iowa Michigan Michigan State Minnesota Northwestern.
HathiTrust: A valuable and visionary Partnership.
HathiTrust Digital Library Interface and Services
DAITSS and the Florida Digital Archive
Building the Universal Library: Introducing HathiTrust
Implementing an Institutional Repository: Part II
Implementing an Institutional Repository: Part II
How to Implement an Institutional Repository: Part II
Presentation transcript:

HATHI TRUST A Shared Digital Repository Building A Future By Preserving Our Past The Preservation Infrastructure of HathiTrust Digital Library Jeremy York IFLA 2010 August 15, 2010

Current Partners – Columbia University – New York Public Library – University of California system – CIC (Committee on Institutional Cooperation) – University of Virginia – Yale University University of Chicago University of Illinois Indiana University University of Iowa University of Michigan Michigan State University University of Minnesota Northwestern University Ohio State University Pennsylvania State University Purdue University University of Wisconsin-Madison

Mission To contribute to the common good by collecting, organizing, preserving, communicating, and sharing the record of human knowledge

Universal Library Common Goal Single Entity, Many Partners HathiTrust

Goals Comprehensive collection Preservation…with Access Shared strategies – Collection management, development – Preservation – Copyright – Efficient user services Openness

Content Distribution 6,549,680 – Total volumes 1,300,896 – Public Domain 3,798,116 Book titles 153,311 Serial titles * As of August 13, 2010

Language Distribution (1) * As of August 13, 2010

Language Distribution (2) The next 40 languages make up ~13% of total * As of August 13, 2010

Dates * As of August 13, 2010

Content Growth

Repository Philosophy/Design OAIS/TRAC Consistency Standardization Simplicity (in design, not function) Practicality Sustainability

Content Largely uniform in technical characteristics 4 formats – ITU G4 TIFF – JPEG2000 – JPEG – Unicode (with and without coordinates)

Object Package images bib data bib data Source METS text HT METS Zip

Metadata Details and specifications at repository level – Object specifications / Validation criteria – Page-tagging Variations at object level – Files missing – Non-valid files – Incorrect file checksums

Bibliographic Data – Must be present prior to content ingest – MARCXML, as complete as possible Content – Pre-ingest – Ingest Ingest

Ingest (2) Pre- ingest SIP Backend servers GROOVE Validation METS creation Package creation Package creation Handle creation Handle creation - Evaluation - Determination of standards - Modification / Transformation - Ensure conformance - Barcode - Fixity - Consistency - Well-formedness - Prepare archival package Bibliographic data Content

Archival Storage Reliability – ensure integrity Redundancy – in single and multiple sites Scalability – including ease of management Accessibility – for repository processes and services Platform-independence – for data/object management

Media & Architecture Michigan Indiana Tape Backup Archival Storage Isilon Systems Load balancing and failover Ingest at Michigan, replicated to Indiana Replacement on 3-4 year cycle

Architecture & Management images bib data bib data Source METS text HT METS../uc1/pairtree_root/b3/54/34/86/b b zip b mets.xml Example ids: wu mdp uc2.ark:/1390/t miua.aaj

Data Management Rights Determination Rights Database Bibliographic Management System Copyright Review Management System - Inventory - Loading and updating records - Duplicate detection and collation - Solr indexes behind VuFind catalog - Source of information for Access services - Rights determination (automated and support for manual review)

Rights Database System of precedence 9 attributes 11 reason codes Bibliographic (automatic) Manual 1.Conformance with formalities 2.Contractual agreements 3.Access control overrides Manual 1.Conformance with formalities 2.Contractual agreements 3.Access control overrides

Access Rights Database Michigan Indiana Data Management Archival Storage Tab-delimited Metadata files Collection Builder Index Rights Determin ation Bibliographic Management Full text Index VuFind Index Bibliographic Catalog Bibliographic API OAI sets Full text Search application PageTurner Data API Collection Builder

Content Access Rights Database Michigan Indiana Data Management Archival Storage Tab-delimited Metadata files Collection Builder Index Rights Determin ation Bibliographic Management Full text Index VuFind Index Bibliographic Catalog Bibliographic API OAI sets Full text Search application PageTurner Data API Collection Builder

Search and Aggregation Access Rights Database Michigan Indiana Data Management Archival Storage Tab-delimited Metadata files Collection Builder Index Rights Determin ation Bibliographic Management Full text Index VuFind Index Bibliographic Catalog Bibliographic API OAI sets Full text Search application PageTurner Data API Collection Builder

Metadata Access Rights Database Michigan Indiana Data Management Archival Storage Tab-delimited Metadata files Collection Builder Index Rights Determin ation Bibliographic Management Full text Index VuFind Index Bibliographic Catalog Bibliographic API OAI sets Full text Search application PageTurner Data API Collection Builder

Thank you!