This Library Never Forgets Preservation, Cooperation, and the Making of HathiTrust Digital Library Jeremy York Project Librarian HathiTrust Digital Library.

Slides:



Advertisements
Similar presentations
HathiTrust Unless otherwise noted, these slides and their contents are licensed under a Creative Commons Attribution Unported License.
Advertisements

Beyond the Google Book: the Future of the Digital Library Cory Snavely Library IT Core Services manager University of Michigan April 20, 2010.
HathiTrust Digital Library
HATHI TRUST A Shared Digital Repository Building A Future By Preserving Our Past The Preservation Infrastructure of HathiTrust Digital Library Jeremy York.
HATHI TRUST A Shared Digital Repository HathiTrust Digital Library Is There A Past In Your Future? Princeton University February 2010.
KAT HAGEDORN HATHITRUST SPECIAL PROJECTS COORDINATOR UNIVERSITY OF MICHIGAN LIBRARIES OCTOBER 9, 2009 Seamless Sharing: NYU, HathiTrust, ReCAP and the.
HathiTrust: Building the Universal Collection John Wilkin 18 May 2009.
HATHI TRUST A Shared Digital Repository Unpacking HathiTrusts New Cost Model Jeremy York Project Librarian, HathiTrust SUNY July 15, 2011.
HathiTrust: A Big Idea with Bold Plans
HATHI TRUST A Shared Digital Repository HathiTrust Open Webinar Jeremy York Project Librarian, HathiTrust May 3 and 5, 2011.
HATHI TRUST A Shared Digital Repository HathiTrust Overview Julie Bobay, Heather Christenson, and John Wilkin April 12, 2011.
Building the Universal Library: The Promise and Challenges of HathiTrust John Wilkin 2 April 2009.
HathiTrust Sharing a Federal Print Repository: Issues and Opportunities May 25, 2011 Heather Christenson.
HATHI TRUST A Shared Digital Repository Digital Preservation, HathiTrust, and the Reimagination of the Library Landscape Jeremy York Iceland August 5,
HATHI TRUST A Shared Digital Repository HathiTrust How We Can Make A Difference Jeremy York Yale University November 3, 2010.
What is HathiTrust and How Can it Make a Difference? Sourcing and Scaling brought to the collective collection.
HATHI TRUST A Shared Digital Repository HathiTrust 101 John Wilkin and Jeremy York August 27, 2010.
What is HathiTrust and Why is it relevant to research libraries? Sourcing and Scaling brought to the collective collection.
How HathiTrust Serves the UC Community Users Council May 21, 2012 Heather Christenson, California Digital Library.
Building the Universal Library: Introducing HathiTrust Patricia A. Steele Indiana University Libraries John Price Wilkin University of Michigan Libraries.
HATHI TRUST A Shared Digital Repository HathiTrust, Collections, and Collaboration COLD 2011 Spring Meeting Jeremy York May 20, 2011.
HATHITRUST A Shared Digital Repository HathiTrust Outside-In University of Michigan Law School June 14, 2011 Jeremy York HathiTrust Project Librarian.
KAT HAGEDORN HATHITRUST SPECIAL PROJECTS COORDINATOR UNIVERSITY OF MICHIGAN LIBRARIES OCTOBER 9, 2009 Seamless Sharing: NYU, HathiTrust, ReCAP and the.
E-Content Service Group Virtual Meeting Digital Preservation: How to Get Started.
An update on Google Book search digitization at the University of Michigan … the agreement and plans for work between Google and the.
Digital Preservation A Matter of Trust. Context * As of March 5, 2011.
HathiTrust Digital Library: Enrich Your Research and Scholarship Doreen Bradley Chris Powell University Library May 2011.
HathiTrust and the Ecology of Shared Collections Paul N. Courant 21 May 2009.
HATHITRUST A Shared Digital Repository We’re Preserving the Past, What About the Present? NISO Webinar: Ensuring the Preservation of E-Books May 23, 2012.
What’s Next for HathiTrust?. We’re Growing Up! Partnership Arizona State University Baylor University Boston University California Digital Library Columbia.
HATHITRUST A Shared Digital Repository HathiTrust current work, challenges, and opportunities for public libraries Creating a Blueprint for a National.
HATHITRUST A Shared Digital Repository HathiTrust as a Model for Preservation and Access Jeremy York Media Preservation Conference April 17, 2013.
HATHITRUST A Shared Digital Repository Bibliographic Metadata and HathiTrust ALCTS CaMMS Catalog Management Interest Group Meeting American Library Association.
HATHITRUST A Shared Digital Repository HathiTrust METS and PREMIS October 25, 2011 Jeremy York Project Librarian, HathiTrust.
HathiTrust Constitutional Convention Session #2: Report on 3-year review and Q & A Ed Van Gemert, Chair, Strategic Advisory Board Patricia Cruse, Member,
HATHITRUST A Shared Digital Repository HathiTrust on the Move A Growing Partnership Taking Stock and Looking Ahead National Library of Medecine October.
HATHITRUST A Shared Digital Repository HathiTrust: A Second Life for Library Collections Jeremy York Exploring Humanities Cyberinfrastructure April 30,
HATHITRUST A Shared Digital Repository HathiTrust: The Collection and Its Uses NEFLIN Webinar - November 7, 2013 Jeremy York, Assistant Director, HathiTrust.
HATHITRUST A Shared Digital Repository A Preservation Infrastructure Built to Last: Preservation, Community, and HathiTrust UNESCO Memory of the World.
HATHITRUST A Shared Digital Repository How Can Digital Collections Support Shared Print Initiatives? The HathiTrust Print Monograph Archive Planning Task.
HATHITRUST A Shared Digital Repository HathiTrust Overview: Partnership and Services Jeremy York Wesleyan University Web Presentation February 18, 2014.
HATHITRUST A Shared Digital Repository Why Digitize? or The Limits of Preservation 2014 TEI/DHCS Plenary Session Evanston, IL Mike Furlough Executive Director,
HATHI TRUST A Shared Digital Repository Columbia University and HathiTrust Collaboration at a new level.
HATHITRUST A Shared Digital Repository HathiTrust Past, Present, and Future A Brief Introduction.
HATHITRUST A Shared Digital Repository More, Better, Together: HathiTrust Accomplishments and Aspirations The Researcher of Tomorrow Universidad Complutense.
HathiTrust – How To By Dr. Rob McGeachin 20 th Annual AgNIC Meeting May 7, 2015.
HATHITRUST A Shared Digital Repository HathiTrust: Putting Research in Context HTRC UnCamp September 10, 2012 John Wilkin, Executive Director, HathiTrust.
HATHITRUST A Shared Digital Repository Collaborating Globally, Planning Locally HathiTrust and New Opportunities in Collection Management GWLA/UNM: Emerging.
HATHITRUST A Shared Digital Repository HathiTrust Infrastructure and Information Organization November 7, 2011 Jeremy York Project Librarian, HathiTrust.
HathiTrust Digital Library. Overview ›Began in 2008 ›Large scale digital preservation repository ›Partnership of major research libraries ›Focus on both.
Rapidly Developing Mass Digitization and the Future of the University Library James Michalko Vice President, OCLC Research Keio University 6 October 2010.
HATHITRUST A Shared Digital Repository HathiTrust: Key Concepts and Issues in Managing the Digital Archive ICPSR Summer Workshop “Curating and Managing.
Breana McCracken University of Illinois at Urbana-Champaign HathiTrust and Copyright Future Implications - Strong precedent for libraries to continue to.
HATHITRUST A Shared Digital Repository HathiTrust and TRAC DigitalPreservation 2012 July 25, 2012 Jeremy York, Project Librarian, HathiTrust.
Challenges and Opportunities for Academic Libraries Collaborative Imperatives to Support Collections, Digital Initiatives, and New Services for a Changing.
Author(s): Jeremy York, 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution–Noncommercial–Share.
E-books and E-Journals in US University Libraries: Current Status and Future Prospects James Michalko Vice President, OCLC Research Symposium Keio University.
Web Archiving Service (WAS) Rosalie Lack Data Curation for Practitioners 2012 Workshop.
HATHITRUST A Shared Digital Repository HathiTrust and the Future of Research Libraries American Antiquarian Society March 31, 2012 Jeremy York, Project.
HATHITRUST A Shared Digital Repository Your Library, Now Online! Putting HathiTrust in the Context of Traditional (and New) Library Services MCLS Webinar.
HATHI TRUST A Shared Digital Repository Use of PREMIS for Internet Archive AIPs September 22, 2010.
HATHITRUST A Shared Digital Repository Institution Uses of HathiTrust Jeremy York University of Maine May 24, 2013.
HathiTrust: Collaboration in Building the Universal Collection John Wilkin 1 October 2009.
HathiTrust: Possibilities Metadata Working Group Cornell University Library March 21, 2014.
HATHITRUST A Shared Digital Repository HathiTrust Large Digital Libraries: Beyond Google Books Modern Language Association January 5, 2012 Jeremy York,
Barbara Preece ICOLC, April Mark Sandler Center for Library Initiatives Chicago Illinois Indiana Iowa Michigan Michigan State Minnesota Northwestern.
HathiTrust: A valuable and visionary Partnership.
Web Archiving Service (WAS) Rosalie Lack Data Curation for Practitioners 2012 Workshop.
HathiTrust Digital Library Interface and Services
Building the Universal Library: Introducing HathiTrust
Presentation transcript:

This Library Never Forgets Preservation, Cooperation, and the Making of HathiTrust Digital Library Jeremy York Project Librarian HathiTrust Digital Library Archiving 2009 May 5-8, 2009

What is HathiTrust?

California Digital Library Indiana University Michigan State University Northwestern University The Ohio State University Penn State University Purdue University UC Berkeley UC Davis UC Irvine UCLA UC Merced UC Riverside UC San Diego UC San Francisco UC Santa Barbara UC Santa Cruz The University of Chicago University of Illinois University of Illinois at Chicago The University of Iowa University of Michigan University of Minnesota University of Wisconsin- Madison University of Virginia

Current Holdings As of May 5 2,823,385 volumes 448,413 in the public domain (~16%)

How it came to be

University of Michigan Large Scale Production Environments – JSTOR – Making of America – PEAK – Humanities Text Initiative

Committee on Institutional Cooperation Long history of successful cooperation Voluntary partnership Build strengths of all for benefit of all

University of California System-wide planning Shared storage, cataloging Standards – preservation and access

University of Virginia Electronic Text Center 1992 Focus on the scholar Innovation and Research User-centered orientation

Origins - Chronology UM in 2004 …U of M shall have the right to use the U of M Digital Copy, in whole or in part at U of M's sole discretion, as part of services offered in cooperation with partner research libraries such as the institutions in the Digital Library Federation…

Origins - Chronology 2007 CIC/Google Agreement Shared Digital Repository 2008 University of California and University of Virginia join Launched October, 2008

Goals and Aspirations How we are doing

Partnership Grow Voluntary/Flexible Stable

Governance Model Executive Committee Strategic Advisory Board

Executive Committee Paul Courant, University Librarian and Dean of Libraries, University of Michigan John King, Vice Provost for Academic Information, University of Michigan Patricia Steele, Dean of Libraries, Indiana University Brad Wheeler, Chief Information Officer, Indiana University Paula Kaufman, University Librarian and Dean of Libraries, University of Illinois at Champaign-Urbana Laine Farley, Executive Director, California Digital Library Brian Schottlaender, University Librarian, University of California, San Diego Libraries John Wilkin, Executive Director of HathiTrust and Associate University Library, Library Information Technology, University of Michigan

Strategic Advisory Board Guiding hand of HathiTrust At least 4 members from the CIC, 3 members from the University of California

Strategic Advisory Board – Ed Van Gemert (Chair), Director of Libraries, University of Wisconsin-Madison – John Butler, Associate University Librarian for Information Technology, University of Minnesota – Patricia Cruse, Director, Preservation, California Digital Library – Robin Dale, Associate University Librarian for Collections and Library Information Systems, University of California, Santa Cruz – R. Bruce Miller, University Librarian, University of California, Merced – Sarah Pritchard, University Librarian, Northwestern University – Paul Soderdahl, Director, Library Information Technology, University of Iowa – John Wilkin, Executive Director, HathiTrust (ex officio)

Partnership/Cost Model HathiTrust Funded for initial 5-year period ( ) Base funding from member institutions 3-year review Constitutional Convention – Members by September 2010 – Contribute content by March 2011

How much does it cost? Infrastructure

Costs Estimate content over 5 years Calculate proportional cost Calculate average per-year cost < $0.15 per volume One-time fee (25% of yearly cost)

Repository and Content Sustainable curation of library content Community Building Support content beyond books and journals Grow

Sustainable Curation fund repository with base funds from member institutions two active storage sites with backup Based on standards and best practices for Archival repositories – OAIS – METS/PREMIS – Ingest Validation (GROOVE) – Periodic fixity checks using MD5 Rights Database

Sustainable curation of library content OAIS Reference Model GRIN Internal Data Loading GRIN Internal Data Loading Google [OCA] In-house Conversion Google [OCA] In-house Conversion MARC record extensions (Aleph) Rights DB MARC record extensions (Aleph) Rights DB Page Turner HathiTrust API OAI GeoIP DB CNRI Handles [Solr] Page Turner HathiTrust API OAI GeoIP DB CNRI Handles [Solr] METS/PREMIS object TIFF G4/JPEG2000 OCR MD5 checksums METS/PREMIS object TIFF G4/JPEG2000 OCR MD5 checksums METS object PNG OCR PDF METS object PNG OCR PDF Isilon Site Replication TSM MD5 checksum validation Isilon Site Replication TSM MD5 checksum validation GROOVE (JHOVE) GROOVE (JHOVE)

Community Building Shared Collection Development – Unified core collection – Certification of volumes

Support content beyond books and journals Born-digital Native XML Encoded Text

Grow

Services Catalog Page Turner Bibliographies and Saved Collections Users with Print Disabilities Computational Research (sample datasets) Ability to build applications with Library content Large scale Search

Upcoming Plans Expand partnership Begin work on shared collection development and de-duplication Complete Data API Create Development Sandbox Configure for Computational Research Worldcat Local Catalog Prepare for TRAC

Thank you very much!