Progress towards the Digital Future Dr. Gloriana St. Clair Dean of University Libraries Digital Libraries Colloquium February 22, 2006.

Purpose
- Introduce the NIH Public Access Policy as a major first step toward solving dysfunction in scholarly communications
- Compare the NSF-funded Million Book Project with Google Print and the Open Content Alliance
- Foster an interchange about these developments relevant to education in Qatar

Scholarly Communications and NIH
- Essence of the problem
- Promise of open access
- NIH Public Access Policy proposal
- Current situation

Essence of the Problem Provost Mark Kamlet at Carnegie Mellon’s Open Access Forum (2004)

Essence of the Problem
Figure: Serial & Monograph Costs, North American research libraries (date range and source not preserved)

Essence of the Problem: Mismatched Motives

            Author                        Publisher
Goal        Wide distribution of work     Increase revenue
Reward      Reputation, P&T, grants       Maximized profit
Strategy    Publish                       Control access & price

What is “Open Access”?
- Immediate free availability on the public Internet
- Research literature that scholars produce without expectation of payment (e.g., journal articles)
- Recognizes that the value of research increases with use
- Exploits economics of Internet
- An access model, not a business model

What Open Access Can Achieve
- Expand information usage and application.
- Remove barriers that make content scarce.
- Weaken the position of publishers that use their monopoly position to support excessively high prices.
- Focus economic return on value addition (rather than content control).
- Eliminate systemic inefficiencies by unbundling functions.
- Introduce price competition.
- Benefits outweigh dislocations.

NIH Public Access Policy Proposal
Elias Zerhouni’s memo urging authors of NIH-funded research to submit final manuscripts to PubMed Central

Why Pennsylvania Mattered
- NIH Public Access Proposal began as a rider to NIH appropriations bill (ATA)
- Arlen Specter is the chair of the Appropriations Committee
- Rick Johnson called me because I have a certain reputation, having left Journal of Academic Librarianship when Elsevier purchased it
- Carnegie Mellon provost Mark Kamlet is a medical economist who understands the dysfunction and favors change

Why Pennsylvania Mattered
- Carnegie Mellon lobbyist, Maureen McFalls, has a daughter in library school and was a great advocate
- Carnegie Mellon Libraries asked for letters from PALCI, PALINET, Pennsylvania Library Association, etc.
- ATA and SPARC arranged for Carnegie Mellon, Pitt, Penn, and Penn State provosts to send a joint letter
- Provost Kamlet made telephone calls and visits on short notice

Why Pennsylvania Mattered Provosts’ joint letter supporting expanded access to NIH-funded research

ATA

NIH Policy Fulfills its Promise
- Accessible electronic information is more desirable than print
- Author goals of wide distribution and reputation are met
- Publishers are not put out of business but the public is better served
- Health domain has huge popular interest

NIH Policy Problem
- Slow uptake
Nature, New England Journal of Medicine

Google in the News, January 2006
Founder Sergey Brin said he believed Google is "doing the right thing" with their work in China: “We ultimately made a difficult decision, but we felt that by participating there, and making our services more available, even if not to the 100 percent that we ideally would like, that it will be better for Chinese Web users, because ultimately they would get more information, though not quite all of it.” 1

Advent of Google Print (late 2004)
“This is the day the world changes.” John Wilkin, University of Michigan 3
“… commercialize the great research libraries with a handshake, suddenly and epochally.” Rory Litwin, in Library Juice 2

A Closer Look at the Players …
- Compare the NSF-funded Million Book Project with Google Print and the Open Content Alliance (OCA)
- Project the impact of partnership: Million Book Project and OCA

Digital v. Paper
Rory Litwin, “On Google’s Monetization of Libraries” 4
1. Privacy [cookies]
2. Introduction of commercial bias
3. Questions about democratization and equity of access
4. Disintermediation issues
5. Decontextualization of knowledge
6. Closing of the information commons

In our rapidly changing world, lifelong learning and access to books have become essential to employment, health, peace, and prosperity. Greater public access to information is consistent with the goals of education and democracy. Million Book Project Dr. Raj Reddy

MBP Vision
To create online access to all published works …
Searchable, browsable and navigable by humans and machines …
- Free-to-read
- Instantly available
- In any language
- At any literacy level
- Anywhere in the world

Google Print Vision
To organize the world’s information and make it useful and web-accessible.
“How many users will find, and then buy, books they never could have discovered any other way? How many out-of-print and backlist titles will find new and renewed sales life? How many future authors will make a living through their words solely because the Internet has made it so much easier for a scattered audience to find them?” 5

OCA Vision 6
To collect all published information, and make it accessible to everyone, no matter where they are in the world
- Access to information is a key ingredient to education and an open society
- We have the necessary technologies, and the will for an open society … Will we make it happen?

MBP Partners
- The National Science Foundation has awarded the Million Book Project four grants for equipment and planning
- Partners include government and academic institutions in India, China, and Egypt; academic libraries in the U.S.; OCLC; and the United Nations Food and Agriculture Organization
- Newest partner is the Open Content Alliance

Google Print Leader & Partners
- Google, Inc.
- U. Michigan
- Stanford University
- Harvard University
- U. Oxford
- New York Public Library

OCA Leaders
- Brewster Kahle, director and co-founder of the Internet Archive
- I can do this

OCA Partners
- Adobe Systems Incorporated
- Biodiversity Heritage Library, a cooperative project of:
  - American Museum of Natural History
  - Harvard U. Botany Libraries
  - Harvard U. Library of the Museum of Comparative Zoology
  - Missouri Botanical Garden
  - Natural History Museum, London
  - The New York Botanical Garden
  - Royal Botanic Gardens, Kew
  - Smithsonian Institution Libraries
- Columbia University
- Emory University
- European Archive
- HP Labs
- Johns Hopkins University Libraries
- McMaster University
- Memorial University of Newfoundland
- Million Book Project
- Missouri Botanical Garden
- MSN
- National Archives (United Kingdom)
- O'Reilly Media
- Prelinger Archives
- Research Libraries Group (RLG)
- Rice University
- Smithsonian Institution Libraries
- University of British Columbia
- University of California
- University of Ottawa
- University of Pittsburgh
- University of Toronto
- University of Virginia
- York University

MBP Collections
- Books for College Libraries (best books)
- University presses and scholarly societies (with copyright permission)
- U.N.’s Food and Agriculture Organization content
- National Agriculture Library
- Academic libraries with agriculture collections
Janet McCue, Cornell University, will coordinate the agriculture collections

Google Print Collections
- Stanford – 40,000-volume pilot
- Harvard – 40,000-volume pilot from a 15-million volume collection
- U. Michigan – virtually the entire collection; add seven million to search engine; Michigan to “receive and own a high quality digital copy” and to provide access 7
- New York Public Library – a subset of a 20-million volume collection
Selection criteria = in public domain (1923), interesting, not too fragile

OCA Collections
Will seed the archive with partners’ collections (below). Will scan U.S. collections in situ at 10¢ per page.
- European Archive
- Internet Archive
- National Archives (UK)
- O'Reilly Media
- Prelinger Archives
- University of California
- University of Toronto

MBP Research Initiatives
- Machine translation
- Massive distributed database
- Storage formats
- Use of digital libraries
- Distribution and sustainability
- Security
- Search engines
- Image processing
- Optical Character Recognition (OCR)
- Language processing
- Copyright laws

Research: Arabic OCR
R&D in Arabic OCR
“Million Book Project at Bibliotheca Alexandrina,” by Youssef Eldakar et al. Journal of the Zhejiang University SCIENCE: Special Proceedings Issue of the 1st Int’l Conference on Universal Digital Library (ICUDL November 2005): p

Research: Indian and Chinese OCR and Language Translation

OCA Research Initiatives
- … In discussions with major publishers and the organizations that represent them in order to explore legal, sustainable business models through which more copyrighted content can be made widely available. … OCA looks forward to continued dialogue with publishers in order to explore and build solutions that benefit the entire community of Internet users. 8
- Exploring and/or creating inexpensive digitization techniques 9

Worries
- Copyright, Copyright, Copyright
- Printing
- [Good News] Working with Publishers

Copyright, Copyright, Copyright … Copyright is the biggest reality that we all face.

MBP Copyright Strategy
- Focusing on available out-of-copyright materials (government documents, pre-1923 collections, etc., as well as indigenous cultural treasures …)
Incised palm leaves to be digitized, Saraswathi Mahal Library, India

Google Print Copyright Strategy 10
- For books in copyright, a Google Print search displays “snippet[s] of text”
  - A ‘snippet’ is defined as three lines
  - Search returns three snippets per book and indicates how many times search terms appear
  - Search also returns bibliographic data about the book, and information on where to buy the book or find it at a local library

Google Print Strategy Adjustments “Google said yesterday that it would temporarily halt its program to make searchable, digital copies of the vast contents of three university libraries to give publishers and other copyright holders the chance to opt out of having their protected works copied.” 11

OCA Copyright Strategy The OCA is committed to respecting the copyrights of content owners. … OCA contributors must secure the permission of all concerned copyright holders prior to submitting materials to the OCA for digitization or inclusion in the archive. 12

MBP Working with Publishers
- Focusing with increasing success on gaining permission from university presses and scholarly societies to digitize books and provide access to searchable full-text.
- The MBP approach is to request permission for a range of years, for example, everything published prior to [year missing]. A publisher can specify the cut-off year or, alternatively, specify the list of titles for which they grant non-exclusive permission to digitize in the MBP.

Google Print Working with Publishers “We’ve already had great success working with publishers directly to add their works to our index through our Publisher Program, and when we add books with publisher permission, we can offer more information and a much richer user experience.” 13

OCA Working with Publishers 14
- OCA has been in discussions with major publishers and the organizations that represent them in order to explore legal, sustainable business models through which more copyrighted content can be made widely available. O'Reilly Media is one commercial publisher that has already agreed to make certain content available to the OCA.
- OCA looks forward to continued dialogue with publishers in order to explore and build solutions that benefit the entire community of Internet users.

2005 MBP Partners’ Meeting Partners met in Hangzhou, China

Status of Collection Digitization
- India    200,000 volumes
- China    400,000 volumes
- Egypt     20,000 volumes
Total      620,000 volumes

Big New Ideas: Reading Online
- “Will people read on screens in the future?” 15
  - People often print long online documents (i.e., users still find hard copy easier to use)
  - Hardware improvement?
  - Navigation improvement?
  - Building on previous user behavior to help guide new users from page to page online?

Big New Ideas: Synthetic Documents Bypass current copyright restrictions with machine-made synthetic documents that transmit intellectual content. 16

Big New Ideas: Big Finish 17
- Finish the Million Book Project by the next Partners Meeting (fall 2006)
- Congregate all the metadata in one place

Conclusions
- Future of libraries is digital
- Technical progress is substantial
- Intellectual property laws continue to be a major barrier
- Social reactions deserve more research by librarians

Q & A Digital future and education in Qatar … What lies ahead?

Thank You Gloriana St. Clair And thanks to my generous collaborators: Mark Kamlet, Provost, Carnegie Mellon Rick Johnson, Executive Director, SPARC If you would like an electronic copy of this talk, contact Cindy Carroll,

Endnotes
1. Brin, Sergey. Quoted by David Kirkpatrick, “Google Founder Defends China Portal.” Fortune (January 25, 2006). Available:
2. Litwin, Rory. “On Google’s Monetization of Libraries.” Library Juice 7, 26 (December 17, 2004). Available:
3. Wilkin, John. Quoted in “Google to Scan Books from Major Libraries.” MSNBC Tech News & Reviews (December 14, 2004). Available:
4. Litwin.

Endnotes
5. Schmidt, Eric. “Books of Revelation.” The Wall Street Journal (October 18, 2005). Available: Mr. Schmidt is Google CEO.
6. Kahle, Brewster. “Towards Universal Access to all Knowledge: Internet Archive.” Journal of the Zhejiang University SCIENCE: Special Proceedings Issue of the 1st International Conference on Universal Digital Library (ICUDL November 2005).
7. University of Michigan (Nancy Connell). “Google/U-M Project Opens the Way to Universal Access to Information.” University of Michigan News Service (December 14, 2004). Available:

Endnotes
8. Open Content Alliance. “Open Content Alliance FAQ.” Available:
9. Kahle.
10. University of Michigan. “Google/U-M Project Questions and Answers.” The University Record Online (January 7, 2005). Available:
11. Wyatt, Edward. “Google Library Database is Delayed.” New York Times (August 13, 2005). Available: C708DDDA10894DD404482

Endnotes
12. “Open Content Alliance FAQ.”
13. Smith, Adam. “Discovering Hard-to-find Books.” Available: books.html. Mr. Smith is Senior Business Product Manager for Google Print.
14. “Open Content Alliance FAQ.”
15. Lesk, Michael. “The Qualitative Advantages of Information: Bigger is Better.” Journal of the Zhejiang University SCIENCE: Special Proceedings Issue of the 1st International Conference on Universal Digital Library (ICUDL November 2005): p

Endnotes
16. Shamos, Michael I. “Machines as Readers: A Solution to the Copyright Problem,” Journal of the Zhejiang University SCIENCE: Special Proceedings Issue of the 1st International Conference on Universal Digital Library (ICUDL November 2005): p
17. Proposed by N. Balakrishnan, Supercomputer Education & Research Center, Indian Institute of Science (ICUDL, November 2005).