OCLC Programs & Research Prospecting in the library data mines Brian Lavoie Consulting Research Scientist OCLC Programs & Research Annual Partners Meeting.


Presentation transcript:

OCLC Programs & Research Prospecting in the library data mines Brian Lavoie Consulting Research Scientist OCLC Programs & Research Annual Partners Meeting Washington, DC June 4, 2007

Making data work harder
- Data is an asset
  - Informs planning and decision-making
  - Drives new forms of services
- Libraries have many data assets
  - Bibliographic, holdings, usage, reference inquiries, …
- Opportunities to collect data increase in network spaces …
  - Web site traffic, click-through patterns, e-usage, …
- Make data work harder
  - Use library data in innovative ways to create value

Data mining & OCLC Research
- Networks of collaboration and coordination
  - Decisions taken in “system-wide context”
  - Focus on resources of “system”
  - Mass digitization, cooperative print storage, shared discovery environments, …
- As library networks develop and expand, opportunities arise to create value through:
  - Collective action
  - Aligning local collections with system-wide environment
- Data is context
- Research area focused on data mining activities
  - Aggregate collections
  - “System-wide collection” (as represented in WorldCat)

Managing the collective collection
- Mass digitization
- “Last copies”
- Long tail

Mass digitization
Google Book Search (aka Google Print for Libraries)
Aggregate collection of digitized print books (combined holdings of Harvard, Michigan, Oxford, NYPL, and Stanford)
Data-mining to provide empirical context to inform community-wide dialog

“Rareness is common”
System-wide print book collection: ~32 million print books
[Chart: share of print books by number of holdings. Recoverable values: 37% held by 1; 5% held by > 100. The remaining categories did not survive extraction.]
Data-mining to better understand nature of the “collective collection”
Identify rare & unique materials in system-wide collection (“last copies”)
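The "held by 1" and "held by > 100" shares on this slide can be illustrated with a minimal sketch. It assumes a mapping from record identifier to system-wide holdings count; the function name, the record identifiers, and the sample figures are hypothetical and are not drawn from WorldCat or from the presentation.

```python
def summarize_holdings(holdings_by_record):
    """Summarize how widely each record is held across the system.

    holdings_by_record: dict mapping a record id to its holdings count
    (hypothetical input shape, assumed for illustration only).
    """
    total = len(holdings_by_record)
    # Records with exactly one holding are candidate "last copies".
    held_by_one = sorted(r for r, h in holdings_by_record.items() if h == 1)
    # Records with more than 100 holdings are the widely held group.
    held_by_over_100 = sum(1 for h in holdings_by_record.values() if h > 100)
    return {
        "share_held_by_1": len(held_by_one) / total if total else 0.0,
        "share_held_by_over_100": held_by_over_100 / total if total else 0.0,
        "last_copy_candidates": held_by_one,
    }


# Made-up sample data, not real holdings figures.
sample = {"rec-a": 1, "rec-b": 1, "rec-c": 3, "rec-d": 250, "rec-e": 12}
summary = summarize_holdings(sample)
```

On real data the same summary would be computed over the full set of print book records; the cut-offs of 1 and 100 simply mirror the two categories that are legible on the slide.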

The Library Long Tail (using holdings as measure of popularity)
[Chart axes: Number of Holdings; Items ranked by system-wide popularity]
HEAD: Top 10% of WorldCat records (ranked by holdings) account for 80% of total WorldCat holdings
HEAD: Small proportion of items account for lion’s share of collecting activity
LONG TAIL: Bottom 90% of WorldCat records (ranked by holdings) account for 20% of total WorldCat holdings
LONG TAIL: Everything else spread out across Long Tail of diffuse collecting activity
Data-mining to inform strategies/policies aimed at optimizing system-wide supply & demand for library materials
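The head/tail split described on this slide amounts to ranking records by holdings and measuring what share of all holdings the top fraction accounts for. The sketch below shows one way to compute that; the function name and the ten sample counts are hypothetical, with the sample chosen only so that the top 10% of records carries 80% of holdings.

```python
def head_share(holdings_counts, head_fraction=0.10):
    """Share of total holdings accounted for by the top-ranked records.

    holdings_counts: iterable of per-record holdings counts.
    head_fraction: fraction of records (ranked by holdings) treated as the head.
    """
    ranked = sorted(holdings_counts, reverse=True)
    total = sum(ranked)
    if not ranked or total == 0:
        return 0.0
    # Always include at least one record in the head.
    head_size = max(1, int(len(ranked) * head_fraction))
    return sum(ranked[:head_size]) / total


# Made-up counts for ten records; not real WorldCat data.
sample_counts = [800, 50, 40, 30, 25, 20, 15, 10, 6, 4]
share = head_share(sample_counts)
tail_share = 1.0 - share
```

Varying `head_fraction` traces out the concentration curve, which is one way to compare how steep the long tail is across different aggregations.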

Others …
- Registry of Copyright Evidence
- New York Art Museum study

Shared print storage
- Use library data to inform decision-making:
  - Data about library assets (bibliographic)
  - Data about choices involving these assets (holdings, circ., ILL)
  - System-wide aggregation (larger aggregation = richer context)
- Shared print storage decision-making:
  - Data about assets (local inventories of print materials)
  - Data about system-wide availability (holdings)
  - Data about usage (local & system-wide)
- Role of Research:
  - Data collection
  - Data-mining analysis in support of project needs
  - Inform community dialog on shared print storage issues
  - Analyze “collective collection” in shared print context
  - Support development of effective print storage strategies
  - Standardize analysis to maximize applicability/re-use
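As a rough illustration of how the three kinds of data named above (local inventory, system-wide holdings, usage) might be combined, the following sketch sorts items into two groups. Everything in it is an assumption made for illustration: the `PrintItem` fields, the function name, and the thresholds are hypothetical, and the presentation does not specify any decision rule.

```python
from dataclasses import dataclass


@dataclass
class PrintItem:
    record_id: str           # identifier from the local inventory
    system_holdings: int     # system-wide availability (holdings count)
    local_circulations: int  # local usage over some review period


def classify_for_storage(items, scarce_max=1, low_use_max=0):
    """Split items into scarce titles and low-use storage candidates.

    scarce_max and low_use_max are hypothetical thresholds, not values
    taken from the presentation.
    """
    scarce, storage_candidates = [], []
    for item in items:
        if item.system_holdings <= scarce_max:
            # Few copies system-wide: flag for retention review.
            scarce.append(item.record_id)
        elif item.local_circulations <= low_use_max:
            # Available elsewhere and little local use: storage candidate.
            storage_candidates.append(item.record_id)
    return scarce, storage_candidates


# Made-up inventory for illustration.
inventory = [
    PrintItem("rec-1", system_holdings=1, local_circulations=0),
    PrintItem("rec-2", system_holdings=40, local_circulations=0),
    PrintItem("rec-3", system_holdings=300, local_circulations=7),
]
scarce, candidates = classify_for_storage(inventory)
```

Keeping the thresholds as parameters reflects the slide's point about standardizing analysis for re-use: the same routine could be rerun under different policies or for different partner collections.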