University of California Mass Digitization Projects Update Users Council Annual Meeting May 8, 2008 Heather Christenson, Mass Digitization Project Mgr,

Slides:



Advertisements
Similar presentations
Beyond the Google Book: the Future of the Digital Library Cory Snavely Library IT Core Services manager University of Michigan April 20, 2010.
Advertisements

What is HathiTrust and How Can it Make a Difference? Sourcing and Scaling brought to the collective collection.
How HathiTrust Serves the UC Community Users Council May 21, 2012 Heather Christenson, California Digital Library.
Pulling it all together… with thanks to Sheila Anderson.
Next-Generation UC Libraries; Next-Generation UC Librarians Ginny Steel, UCSC.
UC’s Electronic Resources Management System Users Council, May 11, 2007 Heather Christenson.
Session Overview Endnote Databases Summon EBooks Institutional Repository.
The Google Books Settlement: A Partner Library Perspective Ivy Anderson California Digital Library Library Journal Virtual E-Book.
An introduction to the work of the Scottish Archive Network Internet access to the written history of Scotland.
Massively Digitizing UC Collections Ivy Anderson Director, Collections California Digital Library May 2009.
Re-envisioning (and Re-purposing) Collections: Mass Digitization, Google, and the HathiTrust Ivy Anderson CDL CDL Users Council Meeting April 10, 2009.
Features and Uses of a Multilingual Full-Text Electronic Theses and Dissertations (ETDs) System Yin Zhang Kent State University Kyiho Lee, Bumjong You.
Constructing the Memories Creating a Digital Collection Linda J. White, Digital Project Coordinator.
The Million Book Project: Removing Obstacles to Use, Satisfaction, & Success Denise Troll Covey Principal Librarian for Special Projects – Carnegie Mellon.
The Open Content Alliance Project Liz Bell & Charley Pennell.
1 Archiving and Preserving the Web Kristine Hanna Internet Archive April 2006.
 an easy-to-use interface for deposit and update  access via persistent URLs  tools for long-term management  permanent storage Merritt is a new cost-effective.
UC’s Systemwide Library Planning Some background & current information.
Massively Digitizing UC Library Collections Google, Microsoft, and More Learning in Retirement Libraries – The Intersection of Tradition and Innovation.
Searching and Accessing the Cultural Heritage in a Digital World Yoram Elkaim International Conference on Intellectual Property & Cultural Heritage in.
Metadata Guidelines for Disclosing Shared Print Commitments Lizanne Payne Shared Print Consultant ALA Midwinter 2013.
HathiTrust – How To By Dr. Rob McGeachin 20 th Annual AgNIC Meeting May 7, 2015.
UC Libraries and the Implications of Mass Digitization Robin L. Chandler User’s Council May 11, 2007.
HATHITRUST A Shared Digital Repository HathiTrust: Putting Research in Context HTRC UnCamp September 10, 2012 John Wilkin, Executive Director, HathiTrust.
Web-based workflow software to support book digitization and dissemination The Mounting Books project books.northwestern.edu Open Repositories 2009 Meeting,
Isabel Silver and Laurie Taylor IMLS Library Publishing Services Workshop May 5, 2011 UF Smathers Libraries Publishing Services.
Overview of the Google Books digitized from the University of Michigan Library collection: its impact to Korean Studies scholars -- Yunah Sung, University.
WORKFLOWS AND OTHER CONSIDERATIONS FOR DIGITIZATION  Steve Bingo  Processing Archivist Washington State University Libraries  Alex Merrill  Assistant.
. Do not distribute. 2 Online Content (Billions of items indexed) Offline Content (Billions of items still un-indexed) Google’s.
How Research Libraries Became E-knowledge Networks Peter X. Zhou 周欣平 University of California, Berkeley University of California, Berkeley October 6, 2009.
HathiTrust Digital Library. Overview ›Began in 2008 ›Large scale digital preservation repository ›Partnership of major research libraries ›Focus on both.
UC Libraries Systemwide Collaborations Review of Initiatives Financial Implications Ginny Steel SLASIAC Meeting May 7, 2012.
Cataloging and Metadata at the University Library.
UC3 Standards and Best Practices for Datasets and Other Supplemental Journal Article Materials UC3 Stephen Abrams Patricia Cruse John Kunze.
Google Books, UMI and Other Intriguing Trends in Digital Publishing Joe Wible Hopkins Marine Station of Stanford University October 9, 2006.
SUNY Digital Repository: An Overview. Topics Repository History/Background Content Types Collections Discovering Content Needs/Gaps Demos Additional Resources.
The New Digital World and the Transformation of Information and Libraries Patricia L. Thibodeau Associate Dean Library Services & Archives Oct. 26, 2011.
Preserving Digital Collections for Future Scholarship Oya Y. Rieger Cornell University
Technology Choices for the JSTOR Online Archive Presented by Chang Feng Department of Computer Engineering and Computer Science, University of Missouri-Columbia,
Next Generation Technical Services Rethinking Library Technical Services for the University of California R Bruce Miller.
Digital Special Collections Users Council Annual Meeting May 9, 2008.
Breana McCracken University of Illinois at Urbana-Champaign HathiTrust and Copyright Future Implications - Strong precedent for libraries to continue to.
Jonathan Band Jonathan Band PLLC Library Association Concerns with the Google Book Settlement.
The Hindi word for ‘elephant’ ITC Friday, January 22, 2010.
Digitizing Aloha: Using Information Technology to Preserve and Present the History and Culture of Hawai'i Bob Schwarzwalder Assistant University Librarian,
HathiTrust’s Past, Present and Future. Short- and Long-term Functional Objectives Short-term Page turner mechanism (and Mobile!) Branding (overall initiative;
Bibliographic Services Users Council, May 9, 2008 Patti Martin Director, Bibliographic Services.
Digitization Costs & Funding Digital Library Workshop Oct. 2, 2003.
Web Archiving Service (WAS) Rosalie Lack Data Curation for Practitioners 2012 Workshop.
Research Data Services from the ASU Libraries Mary Whelan GIS Data Manager.
From small beginnings: Developing collection level description Mapping the Information Landscape Showcase day British Library Conference Centre, London,25.
Building an Infrastructure for Digital Humanities: Issues and Considerations Peter Zhou 周欣平 University of California, Berkeley October 8, 2009.
The Oxford-Google Digitization Project* Michael Popham Oxford Digital Library * Rules of commercial confidentiality apply to this presentation!
Mass Digitization Projects Celebration and Challenges Presented to the 2 nd ICUDL Alexandria, Egypt by Dr. Gloriana St. Clair Carnegie Mellon University.
CDL’s Metasearch Infrastructure ICOLC, Boston April 13, 2005 Laine Farley, Director Digital Library Services.
Digital Library of the Caribbean Project Planning Phone:
HathiTrust: Collaboration in Building the Universal Collection John Wilkin 1 October 2009.
Million Book Project in U. S. and India International Conference on The Future of the Book April 22, 2003 Gloriana St. Clair Carnegie Mellon University.
The Future of Scholarly Communication & the Role of Libraries Roy Tennant eScholarship, The California Digital Library.
HATHITRUST A Shared Digital Repository HathiTrust Large Digital Libraries: Beyond Google Books Modern Language Association January 5, 2012 Jeremy York,
Effectively Conducting Research on the Internet Library Research Skills Seminar.
O PEN A CCESS TO O UR H ERITAGE The Gateway to Oklahoma History Cross Timbers Library Conference – August 16, 2013 Sarah Lynn Fisher University of North.
Dspace at AUS | American University of Sharjah | DSpace at AUS AMICAL Conference 6 April 2012.
Million Book Project: Vision Becoming Reality Gabrielle Michalek, Carnegie Mellon Presentation to Carnegie Mellon Qatar Library November 9 & 10, 2005.
Picking up the Pieces: A Retro ETD Project at Utah State University Richard W. Clement Dean of Libraries Utah State University ETD 2013 University of Hong.
Pre-Course Assignment
Re-envisioning (and Re-purposing) Collections:
Internet Archive & OPENLIBRARY.ORG
Copyright Policy & Education Officer
University of Pittsburgh Library System (ULS)
Presentation transcript:

University of California Mass Digitization Projects Update Users Council Annual Meeting May 8, 2008 Heather Christenson, Mass Digitization Project Mgr, California Digital Library

Mass Digitization at UC Overview of current projects, locations, roles of participants Description of process What have we digitized and where you can find it A few thoughts on book discovery, scholarly use & what the future holds

Three Projects, One Goal Goal: Mass digitization of UC Libraries’ book collections Google In-copyright and out-of-copyright works Available via Google search engine and Google Book Search Microsoft Out-of-copyright works only Available via Microsoft Live Search Books Open Content Alliance Out-of-copyright works only Available (via the Internet Archive website) to any and all search engines Library and grant-funded

Why Are They Doing It? Google’s vision: To put all the world’s information online Google and Microsoft: To gain marketshare and competitive advantage for their search (and online advertising) services It’s all about Search OCA: To put the world’s information online, for free, forever It’s all about the public good

Why Are We Doing It? Create ability for anyone to discover & access books anywhere, anytime, (essentially) for free New kinds of scholarship To preserve and protect our collections To explore new collection & access models

Participant Roles UC Libraries supply & curate books and bibliographic metadata supply onsite scanning facilities when appropriate preserve digital files created Third-parties (Google, Microsoft) provide funding for book scanning digitization –scanning, post-processing

Microsoft/OCA Production scanning began April 2006 Books from all UC Libraries Internet Archive: Digitization Agent Projected scope 100 K books per year Pick-list driven: limit to public domain Scanning Centers (30 scanners “scribes”) Location: UC at SRLF, Internet Archive

Google Production scanning began October 2006 Scanning books from NRLF Projected Scope 2.5 million books during 6 year period Bulk pulling: public domain /in-copyright Scanning location Books transported to offsite Google digitization facility Expansions to UC campus libraries, 2008 UCSC & UCSD are sending books

Participating UC locations Microsoft/OCA Northern Regional Library Facility (NRLF) Southern Regional Library Facility (SRLF) UC Berkeley, Bancroft Library UCLA Google Northern Regional Library Facility (NRLF) + UC Berkeley Systems UC Santa Cruz UC San Diego

CDL’s role, on behalf of UC Liaison with partners Planning & coordination Funding Stewardship of digital content New services

Campuses Provide the Books

Reasons books might get rejected (images)

Costs to the UC Libraries Staffing (2-5 FTE at each of 6 locations) Physical space & facilities Scanning centers (where scanning machines are housed), book processing, queue storage (book trucks) Costs to run campus systems CDL servers for inventory database, digital preservation

Digital files Images OCR - Text OCR - Page coordinates Metadata

What books are being digitized? American history Humanities Science Cookbooks Children’s books East Asian & Pacific Rim collections

Where can you find UC books? Google Book Search: Microsoft Live Search Books: e=books Internet Archive: alifornia_libraries Melvyl:

Full-text access: copyright status is a factor Public domain, pre-1923 “orphan works,” present

Book Discovery Book Discovery in a Mass Digitized Environment Christenson.pdf Christenson.pdf What are the strengths and weaknesses of leading book discovery interfaces? What is the best user experience for book discovery tasks?

Wish list for book discovery Improved results ranking and recommendations Ability to both browse/winnow and search across full text Ability to find & display multi-volume works in a meaningful way

Scholarly use studies CLIR: “When Mass Digitization Reaches Critical Mass: Scholar’s Evaluation and Analysis of Major Digitization Projects: Mellon Funded Study OCLC/RLG: Explore user expectations for scholarly use of the outputs of mass digitization

Questions? Heather Christenson, CDL Mass Digitization Project Manager For more information: