Massively Digitizing UC Library Collections Google, Microsoft, and More Learning in Retirement Libraries – The Intersection of Tradition and Innovation.

Slides:



Advertisements
Similar presentations
Beyond the Google Book: the Future of the Digital Library Cory Snavely Library IT Core Services manager University of Michigan April 20, 2010.
Advertisements

1 Scholarly Publishing Initiatives in ARL Libraries: a Penn State Perspective Nancy L. Eaton, Dean University Libraries and Scholarly Communications The.
Next-Generation UC Libraries; Next-Generation UC Librarians Ginny Steel, UCSC.
Catherine H. Candee Director, Publishing and Strategic Initiatives California Digital Library Scholarly Publishing at University of California ———— An.
The Oxford-Google Digitization Project* Michael Popham Oxford Digital Library * Rules of commercial confidentiality apply to this presentation!
The Google Books Settlement: A Partner Library Perspective Ivy Anderson California Digital Library Library Journal Virtual E-Book.
Digital Repositories Team Informational Session University Libraries
Massively Digitizing UC Collections Ivy Anderson Director, Collections California Digital Library May 2009.
Re-envisioning (and Re-purposing) Collections: Mass Digitization, Google, and the HathiTrust Ivy Anderson CDL CDL Users Council Meeting April 10, 2009.
Information Literacy in a Rapidly Evolving Educational and Research Environment The Hybrid Research Environment Prof. Monica Berger June 8, 2005.
Building Institutional Repository Communities Through Collaborative Strategies An exploration of collaboration in the context.
Institutional Repositories Tools for scholarship Mary Westell University of Calgary AMTEC Conference May 26, 2005.
Online resources in TCD Library:
1 Archiving and Preserving the Web Kristine Hanna Internet Archive April 2006.
 an easy-to-use interface for deposit and update  access via persistent URLs  tools for long-term management  permanent storage Merritt is a new cost-effective.
UC’s Systemwide Library Planning Some background & current information.
Carol Hixson Dean, Nelson Poynter Memorial Library and Alex Brice Associate Professor, College of Education Promote and Publish Your Work A Presentation.
Eleanor Yuen Asian Library University of British Columbia October 20 th, 2010 The Digitization of Asian Materials at UBC: A Model for National and International.
Partnership agreement between Complutense University and Google Books Manuela Palafox Parejo Servicio Edición Digital y Web Biblioteca de la Universidad.
Ebooks: digitizing our print collections Sian Meikle University of Toronto Libraries.
The impacts of google digitization projects on libraries
UC Libraries and the Implications of Mass Digitization Robin L. Chandler User’s Council May 11, 2007.
Alternative Models of Scholarly Communication: The "Toddler Years" for Open Access Journals and Institutional Repositories Greg Tananbaum President The.
OCLC Online Computer Library Center Strategic Partnerships: An International View 30 October 2003.
The DSpace Course Module – An introduction to DSpace.
How Research Libraries Became E-knowledge Networks Peter X. Zhou 周欣平 University of California, Berkeley University of California, Berkeley October 6, 2009.
HathiTrust Digital Library. Overview ›Began in 2008 ›Large scale digital preservation repository ›Partnership of major research libraries ›Focus on both.
Open Journal Systems Project or The UB Libraries as Publisher: Information and Roles for Liaison Librarians Charles Lyons Library Liaison Summit December.
Libraries as Partners in Research: the UC Curation Center’s Tools and Services UC3 Team University of California Curation Center California Digital Library.
Cataloging and Metadata at the University Library.
The Western Waters Digital Library: Building a Resource Through Multi- State Collaboration and Technology Dawn Paschal Assistant Dean, Digital Library.
UC3 Standards and Best Practices for Datasets and Other Supplemental Journal Article Materials UC3 Stephen Abrams Patricia Cruse John Kunze.
Live Search Books University of Toronto – Scholar’s Portal Forum 2007 January 2007.
Google Books, UMI and Other Intriguing Trends in Digital Publishing Joe Wible Hopkins Marine Station of Stanford University October 9, 2006.
The New Digital World and the Transformation of Information and Libraries Patricia L. Thibodeau Associate Dean Library Services & Archives Oct. 26, 2011.
Preserving Digital Collections for Future Scholarship Oya Y. Rieger Cornell University
Next Generation Technical Services Rethinking Library Technical Services for the University of California R Bruce Miller.
Breana McCracken University of Illinois at Urbana-Champaign HathiTrust and Copyright Future Implications - Strong precedent for libraries to continue to.
University of California Mass Digitization Projects Update Users Council Annual Meeting May 8, 2008 Heather Christenson, Mass Digitization Project Mgr,
Creating Change in Scholarly Communications Heather Joseph Executive Director, SPARC September 21, 2009 TCAL, Austin, TX.
Digitizing Aloha: Using Information Technology to Preserve and Present the History and Culture of Hawai'i Bob Schwarzwalder Assistant University Librarian,
HathiTrust’s Past, Present and Future. Short- and Long-term Functional Objectives Short-term Page turner mechanism (and Mobile!) Branding (overall initiative;
Radical Change by Traditional Means: Deep Resource Sharing by the University of California Libraries Presentation to the UK Serials Group Conference 2004.
OCLC Programs & Research Prospecting in the library data mines Brian Lavoie Consulting Research Scientist OCLC Programs & Research Annual Partners Meeting.
I NVESTIGATING Ashley Butler, Rebecka Embry, Jo Lammert INF385S Digital Libraries February 17, 2011.
UC Libraries Leslie Schick Associate Dean, UC Libraries
Implementing an Institutional Repository: Part III 16 th North Carolina Serials Conference March 29, 2007 Resource Issues.
HATHITRUST A Shared Digital Repository HathiTrust and the Future of Research Libraries American Antiquarian Society March 31, 2012 Jeremy York, Project.
Getting Your Publications to the Masses: Using W&L’s Institutional Repository to Enhance Scholarly Communication Elizabeth Anne Teaff, MLIS August 31,
INTELLECTUAL RIGHTS AND HISTORIC CORPORA Mark Sandler University of Michigan ICOLC, March, 2003.
Building an Infrastructure for Digital Humanities: Issues and Considerations Peter Zhou 周欣平 University of California, Berkeley October 8, 2009.
1 The Oxford-Google mass-digitisation project: How, why and what? An EDUCAUSE Webcast by Reg Carr (University of Oxford) 15 June 2005.
The Oxford-Google Digitization Project* Michael Popham Oxford Digital Library * Rules of commercial confidentiality apply to this presentation!
Mass Digitization Projects Celebration and Challenges Presented to the 2 nd ICUDL Alexandria, Egypt by Dr. Gloriana St. Clair Carnegie Mellon University.
CDL’s Metasearch Infrastructure ICOLC, Boston April 13, 2005 Laine Farley, Director Digital Library Services.
Make or Buy the Big Historical Collections? Your friendly facilitators: Ivy Anderson, CDL Warren Holder, OCUL.
Carnegie Mellon University’s Million Book Project (MBP) Laurel Foundation – August 27, 2002.
HathiTrust: Collaboration in Building the Universal Collection John Wilkin 1 October 2009.
The Future of Scholarly Communication & the Role of Libraries Roy Tennant eScholarship, The California Digital Library.
OCLC Online Computer Library Center Scott Wasinger OCLC NetLibrary September 4, 2007 Going Global with eBooks.
Marilyn Billings Scholarly Communication Librarian University of Massachusetts, Amherst.
Future Directions for Scholarly Publishing at the University of California Catherine H. Candee Director, Publishing and Strategic Initiatives Office of.
Leveraging the Expertise of our Staff and the Information Resources We Manage MIT Libraries Visiting Committee April 13, 2005.
CENTRAL/WESTERN MASSACHUSETTS AUTOMATED RESOURCE SHARING Digitization GOALS & THEIR LOGISTICS Michael J. Bennett Digital Initiatives Librarian C/WMARS,
The New Now: Institutional Repositories and Academia Institutional Repository USM April 17, 2015 Marilyn Billings Scholarly Communication Librarian.
Fresno State Digital Repository
Mass Digitization of Books and the Potential for Universal Access
Re-envisioning (and Re-purposing) Collections:
Copyright Policy & Education Officer
Users and Digital Collections
Presentation transcript:

Massively Digitizing UC Library Collections Google, Microsoft, and More Learning in Retirement Libraries – The Intersection of Tradition and Innovation April 10, 2008 Ivy Anderson & Heather Christenson

California Digital Library Two Complementary Roles Facilitate library collaboration across the ten campuses of the UC system (e.g. shared collection development) Distinctive services emphasizing digital stewardship, innovation in scholarly publishing, and open-access digital collections Three Audiences UC libraries Broader UC community External constituencies and the general public Five Programs Collection Development and Management (Licensed Content, Shared Print Collections, Mass Digitization) Bibliographic Services (Melvyl Catalog, SFX) Preservation (Digital Preservation Repository, Web Archiving) Digital Special Collections (Calisphere, Online Archive of California) Publishing Services (eScholarship Repository, eScholarship Editions, collaboration with UC Press) “11 th University Library” founded 1997 Part of UC Office of the President

Digitization of Library Collections Special Collections Manuscripts, archival collections, photographs, etc. CDL / UC Libraries Online Archive of California Calisphere Berkeley, University of California, Bancroft Library, UCB 150, f. 252v

Digitization of Library Collections Specialized Texts and Corpora Making of America -10,000 texts in 10 years CDL eScholarship Editions

Digitization of Library Collections Commercial Partnerships EEBO: 100,000 important early English texts Licensed access via ProQuest Satans stratagems, copy from UCLA Library

…and Along Came Google Google Library Project 2005: The ‘Google Five:’ Harvard, Oxford, New York Public Library, Stanford, University of Michigan 2008: 20 library partners in 5 countries Google Publisher Partner Program

…and the Open Content Alliance October 2005 Founders: Internet Archive, University of California, U of Toronto… Large-scale digitization of out-of- copyright works only A project of the Internet Archive

…and Microsoft Out-of-Copyright Works Only

UC Involvement October 2005 August 2006 March 2007 Founding Member of Open Content Alliance UC Joins Google Library Project Microsoft Digitization Agreement

So: Three Projects, One Goal Goal: Mass digitization of library book collections Google In-copyright and out-of-copyright works Available via Google search engine and Google Book Search Microsoft Out-of-copyright works only Available via Microsoft Live Search Open Content Alliance Out-of-copyright works only Available (via the Internet Archive website) to any and all search engines Library and grant-funded

Why Are They Doing It? Google’s vision: To put all the world’s information online Google and Microsoft: To gain marketshare and competitive advantage for their search (and online advertising) services It’s all about Search OCA: To put the world’s information online, for free, forever It’s all about the public good

Why Are We Doing It? To enhance student and faculty research To put our collections where our users are – in Google! Mass digitization of these materials enhances access. It can make people aware of books they may not have discovered otherwise and lead them, through an internet search, back to our libraries To support deeper textual analysis and research. Scholars can trace the evolution of ideas and perform other sophisticated textual analysis when the full text is indexed and searchable by computer, opening scholarship in new ways. To fulfill our public service mission Many books of enduring general interest – including classic works of literature and more unique items such as early histories of the settlement of California and the West - can now be read by anyone, anywhere, anytime To preserve and protect our collections In earthquake and fire-prone California, digitizing books in our collections may also help protect the university from catastrophic loss should disaster someday strike our libraries

Microsoft/OCA Project Contributors Northern Regional Library Facility (NRLF) Southern Regional Library Facility (SRLF) UC Berkeley, Bancroft Library UCLA

Google Project Contributors Northern Regional Library Facility (NRLF) + UC Berkeley Systems UC Santa Cruz UC San Diego

CDL’s role, on behalf of UC Liaison with partners Planning & coordination Funding Stewardship of digital content New services

Campuses Provide the Books

The Book Digitization Process A world of barcodes, logistics, loading docks, packing materials, and scanning machines!

Reasons books might get rejected (images)

Costs to the UC Libraries Staffing (2-5 FTE at each of 5 locations) Physical space & facilities Scanning centers (where scanning machines are housed), book processing, queue storage (book trucks) Costs to run campus systems CDL servers for inventory database, digital preservation

Digital files Images OCR - Text OCR - Page coordinates Metadata

What sort of books are being digitized? American history Humanities Science Cookbooks Children’s books East Asian & Pacific Rim collections

Where can you access the books? Google Book Search: Microsoft Live Search Books: e=books Internet Archive: alifornia_libraries Test version of UC Union catalog:

Copyright status is a factor Out of copyright, pre-1923 “orphan works,” present

At the frontier…

What’s ahead Digital preservation –storage, storage, storage Copyright determination Print on demand

New modes of access & critical mass of digital books will transform scholarship Full text search - new form of book discovery Beyond search – text mining, computationally assisted research Machines can interact with massive amounts of texts, and provide new structures

Questions? Heather Christenson, CDL Mass Digitization Project Manager Ivy Anderson, CDL Director of Collections For more information: sdig/