HATHITRUST A Shared Digital Repository A Preservation Infrastructure Built to Last: Preservation, Community, and HathiTrust UNESCO Memory of the World.

HathiTrust Unless otherwise noted, these slides and their contents are licensed under a Creative Commons Attribution Unported License.
Presentation transcript:

HATHITRUST A Shared Digital Repository A Preservation Infrastructure Built to Last: Preservation, Community, and HathiTrust UNESCO Memory of the World September 26, 2012 Jeremy York, Project Librarian, HathiTrust

Partnership: Arizona State University, Baylor University, Boston College, Boston University, California Digital Library, Columbia University, Cornell University, Dartmouth College, Duke University, Emory University, Florida State University, Getty Research Institute, Harvard University Library, Indiana University, Johns Hopkins University, Kansas State University, Lafayette College, Library of Congress, Massachusetts Institute of Technology, McGill University, Michigan State University, New York Public Library, New York University, North Carolina Central University, North Carolina State University, Northwestern University, The Ohio State University, The Pennsylvania State University, Princeton University, Purdue University, Stanford University, Texas A&M University, Universidad Complutense de Madrid, University of Arizona, University of Calgary, University of California (Berkeley, Davis, Irvine, Los Angeles, Merced, Riverside, San Diego, San Francisco, Santa Barbara, Santa Cruz), The University of Chicago, University of Connecticut, University of Delaware, University of Florida, University of Illinois, University of Illinois at Chicago, The University of Iowa, University of Maryland, University of Miami, University of Michigan, University of Minnesota, University of Missouri, University of Nebraska-Lincoln, The University of North Carolina at Chapel Hill, University of Notre Dame, University of Pennsylvania, University of Pittsburgh, University of Utah, University of Virginia, University of Washington, University of Wisconsin-Madison, Utah State University, Virginia Polytechnic University, Washington University, Yale University Library

Digital Repository Launched 2008 Initial focus on digitized book and journal content – 10.5 million total volumes – 5.5 million book titles – 270,000 serial titles – 3.2 million public domain (~30%)

Setting

Outline Community Overarching Considerations Technological Infrastructure, Social System

Outline Community – Open Archival Information Systems (OAIS) – Trustworthy Repository Audit and Certification (TRAC) Overarching Considerations Technological Infrastructure, Social System

Community Production Management/Stakeholders Consumption/Users

Outline Community Overarching Considerations – Scale – Preservation and Access – Openness Technological Infrastructure, Social System

Scale Mission – To contribute to the common good by collecting, organizing, preserving, communicating, and sharing the record of human knowledge Strategy – “Co-owned and managed”

Preservation and Access “Light” archive benefits – Access to materials – Checks on integrity – Best chance for content to be used and valued, preserved

Openness Reliable and comprehensive archive of materials converted from print…co-owned Improve access …to meet the needs of the co-owning institutions Ensure the long-term preservation of content Coordinate shared storage strategies “public good” …sustaining the historical record Simultaneously …centralized …open

Outline Community Overarching Considerations Technological Infrastructure, Social System – Infrastructure overview – Preservation strategies

Preservation Strategies Information integrity – Content – Fixity – Reference – Provenance – Context

Content (1) Selection of content for digitization and preservation – Partner institutions, Collections Committee, Govdocs – Collective decision-making Types of materials, content formats – Books and journals – 3 formats: ITU G4 TIFF, JP2, Unicode

Content (2) Adherence enforced through rigorous validation Types and numbers of formats are important to the degree that they satisfy community concerns – Open formats, meet community standards – Widely supported on a number of platforms – Confidence in preservation and migration

Fixity Concern of content being changed or corrupted without notice Strategies – Verification of checksums on ingest – Periodic re-calculation of checksums in repository and comparison with pre-ingest – Data integrity mechanisms in storage itself
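The fixity strategies above (verify checksums on ingest, then periodically recompute and compare) can be sketched roughly as follows. This is a minimal illustration, not HathiTrust's actual tooling: the function names, the manifest layout (relative path mapped to a recorded hex digest), and the default choice of SHA-256 are all assumptions made for the example.

```python
import hashlib
from pathlib import Path


def file_checksum(path, algorithm="sha256", chunk_size=1 << 20):
    """Return the hex digest of a file, reading it in chunks."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def audit_fixity(base_dir, manifest, algorithm="sha256"):
    """Compare stored files against recorded checksums.

    `manifest` is a hypothetical mapping of relative path -> hex digest
    recorded before ingest. Returns the relative paths that are missing
    or whose current checksum no longer matches the recorded one.
    """
    failures = []
    for relative_path, expected in manifest.items():
        target = Path(base_dir) / relative_path
        if not target.is_file():
            failures.append(relative_path)
        elif file_checksum(target, algorithm) != expected.lower():
            failures.append(relative_path)
    return sorted(failures)
```

The same `audit_fixity` call could serve both the ingest check and the periodic re-check, since in each case the comparison is against the digest recorded before ingest. An empty result means every listed file is present and unchanged.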

Fixity (2) Authenticity and integrity – Duranti (1995), Lynch (2000) Automated checks for random or accidental corruption Security and Trust for integrity of overall environment

Reference “For an object to maintain its integrity, its wholeness and singularity, one must be able to locate it definitively and reliably over time among other objects” Strategies – Identification of objects – Structure of repository – Embedding of identifiers – Permanent URLs – Version dates

Identification Identifier of object prior to ingest; Namespace – Namespace indicates digitization source and identifier scheme – Examples: uc1.b (Google-digitized), uc2.ark:/13960/t (Internet Archive-digitized)
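As a rough illustration of the namespace-plus-identifier scheme described above, the sketch below splits a full identifier at its first period. The function and type names are invented for this example, and the sample identifiers in the usage note are hypothetical placeholders, not real volumes.

```python
from typing import NamedTuple


class VolumeId(NamedTuple):
    namespace: str  # indicates digitization source and identifier scheme
    local_id: str   # identifier the object carried prior to ingest


def split_volume_id(volume_id: str) -> VolumeId:
    """Split 'namespace.local_id' at the first period only.

    Later periods are kept, since the pre-ingest identifier may
    itself contain periods.
    """
    namespace, separator, local_id = volume_id.partition(".")
    if not separator or not namespace or not local_id:
        raise ValueError(f"expected 'namespace.identifier', got {volume_id!r}")
    return VolumeId(namespace, local_id)
```

For example, `split_volume_id("demo1.b0000001")` yields namespace `demo1` and local identifier `b0000001`, while `split_volume_id("demo2.ark:/99999/x0.example")` keeps the whole ARK string, including its second period, as the local identifier.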

Reference (2) Identification of objects Structure of repository – ../uc1/pairtree_root/b3/54/34/86/b Embedding of identifiers Permanent URLs Version dates
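The repository path above follows the Pairtree convention of mapping an identifier to nested two-character directories. The sketch below shows one way such a path could be derived, following the character-cleaning steps of the Pairtree specification as understood here. The function names and the `demo` namespace are hypothetical, and ending the path in a directory named after the cleaned identifier is an assumption of this example rather than something the slide states.

```python
# Characters the Pairtree cleaning step hex-encodes as ^hh (assumed from the spec).
_HEX_ENCODED = set('"*+,<=>?\\^|')
# Single-character substitutions applied after hex-encoding.
_SUBSTITUTIONS = str.maketrans({"/": "=", ":": "+", ".": ","})


def clean_identifier(identifier: str) -> str:
    """Make an identifier safe to use as file system path segments."""
    cleaned = []
    for byte in identifier.encode("utf-8"):
        char = chr(byte)
        if byte < 0x21 or byte > 0x7E or char in _HEX_ENCODED:
            cleaned.append(f"^{byte:02x}")  # e.g. a space becomes ^20
        else:
            cleaned.append(char)
    return "".join(cleaned).translate(_SUBSTITUTIONS)


def pairtree_path(namespace: str, local_id: str) -> str:
    """Build namespace/pairtree_root/<two-char segments>/<cleaned id>."""
    cleaned = clean_identifier(local_id)
    pairs = [cleaned[i:i + 2] for i in range(0, len(cleaned), 2)]
    return "/".join([namespace, "pairtree_root", *pairs, cleaned])
```

With a hypothetical identifier, `pairtree_path("demo", "x1234567")` gives `demo/pairtree_root/x1/23/45/67/x1234567`. Because the path is computed from the identifier alone, an object can be located without consulting a separate index.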

Provenance Chain of custody – Authenticity – Document uses by custodians Strategies – Original source – Agent of digitization – Administrative metadata (provenance and preservation)

Provenance (2) Reliability – A record is regarded as reliable when its form is complete, that is, when it possesses all the elements that are required by the socio-juridical system in which the record is created for it to be able to generate consequences recognized by the system itself.

Context “the ways in which [digital information objects] interact with elements in the wider digital environment” – Technical (Hardware and software dependencies) – Linkages between objects – Communication medium

Context (2) Relation to print Discovery and use

Conclusion