Data Mining the Largest Library Database in the World Roy Tennant OCLC Research Leveraging WorldCat.

Presentation transcript:

Data Mining the Largest Library Database in the World Roy Tennant OCLC Research Leveraging WorldCat

EUROPE, MIDDLE EAST & AFRICA REGIONAL COUNCIL Worldcat.org/identities/ Algorithmically constructed from WorldCat records

Viaf.org A union database of authority records

The Responsible Party Thom Hickey Chief Scientist OCLC Research

290+ million records

Language Coverage, 30 June

Total: 274 million
German: 36.5 million
French: 25.5 million
Spanish: 11.3 million
Italian: 4.7 million
Dutch: 4.3 million
Russian: 3.6 million
Latin: 3.5 million

Worldcat.org/identities/

(J.K. Rowling) (Diana Gabaldon) (Galileo)

Viaf.org

VIAF Participants

“Super” Authority File

Our Cataloging Future “Moving from cataloging to catalinking” Eric Miller, Zepheira

Some Lessons
Widespread collaboration is essential
Normalizing the data is essential
Normalizing the data is complicated
Everything is interrelated:
– You can’t bring names together if titles don’t match
– You can’t bring titles together if names don’t match
Batch mode processing still rules (but we’re getting better and faster at it)
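The interdependence of names and titles noted in these lessons can be illustrated with a minimal sketch. This is not OCLC's or VIAF's actual matching logic, which the slides do not describe. The record layout and the helper names `normalize`, `name_key`, and `same_identity` are hypothetical, and the normalization rules (case folding, stripping diacritics and punctuation) are assumptions chosen for illustration.

```python
import re
import unicodedata


def normalize(text: str) -> str:
    """Fold case, strip diacritics and punctuation, collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", text)
    no_marks = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    no_punct = re.sub(r"[^\w\s]", " ", no_marks.casefold())
    return " ".join(no_punct.split())


def name_key(name: str) -> str:
    """Order-insensitive key so 'Rowling, J. K.' and 'J.K. Rowling' agree."""
    return " ".join(sorted(normalize(name).split()))


def same_identity(rec_a: dict, rec_b: dict) -> bool:
    """Treat two records as one identity only if the names agree AND
    at least one normalized title is shared (hypothetical rule)."""
    if name_key(rec_a["name"]) != name_key(rec_b["name"]):
        return False
    titles_a = {normalize(t) for t in rec_a["titles"]}
    titles_b = {normalize(t) for t in rec_b["titles"]}
    return bool(titles_a & titles_b)


# Illustrative records (invented for this sketch, not taken from WorldCat).
a = {"name": "Rowling, J. K.",
     "titles": ["Harry Potter and the Philosopher's Stone"]}
b = {"name": "J.K. Rowling",
     "titles": ["Harry Potter and the philosopher’s stone."]}
c = {"name": "J. K. Rowling",
     "titles": ["A Completely Different Book"]}

print(same_identity(a, b))  # names and a title agree after normalization
print(same_identity(a, c))  # names agree, but no shared title
```

Even in this toy form, the match fails whenever either side is left unnormalized, which is one way to read the point that names and titles cannot be brought together independently of each other.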

Conclusions
Data mining isn’t just useful, it’s essential
Extracting data from MARC that is useful in other contexts is possible, but will require sophisticated processing
Only very large organizations (e.g., OCLC, national libraries) have the data and resources to do this work
Thankfully, we are doing it, but there is much more to be done

Roy Tennant