Jennifer Bowen, University of Rochester CLA Preconference, Shaping Tomorrow’s Metadata with RDA June 2, 2010, Edmonton, Alberta The eXtensible Catalog Preparing Library Metadata for New Environments and Standards 1
Agenda –About the eXtensible Catalog (XC) –Library metadata issues that XC can address –Overview of XC Software –XC Metadata Services Toolkit (MST) and Services –XC’s User Interface –Status of XC Development 2
What does XC have to do with RDA? 3
What is XC? eXtensible Catalog is open source, user-centered, next generation software for libraries. Four XC software components: used together, provide an end-to-end discovery system to connect library users with resources or components can be used independently to address a particular need. 4
XC Project Partners, Supporters University of Rochester The Andrew W. Mellon Foundation Consortium of Academic and Research Libraries in Illinois (CARLI) University of Notre Dame Rochester Institute of Technology Kyushu University working with NTT-Data University of North Carolina at Charlotte Serials Solutions University at Buffalo Cornell University Yale University Ohio State University Nylink
XC - Worldwide
XCO Meeting Participants OCLC Library of Congress OhioLINK VUFind (Villanova) LYRASIS OLE (Penn, Duke, NC State) Rice University RDA development (IMT) 7
Current Metadata Situation Metadata experts disagree about how to/whether to make changes Making major changes to library metadata and systems is expensive Legacy MARC metadata is holding us back 8
Pressing Metadata Questions –Do we need a replacement for the MARC formats? Who will design it? Maintain it? Implement it? –How do we migrate legacy metadata into a new format/schema? –What will be the impact of RDA? –How can we make library metadata function independently on the web, outside of library systems? 9
Regarding Metadata… How do we even get started? 10
XC Software: A Place to Start –Opportunities to make the best possible re- use of library metadata –An architecture for moving and managing metadata –A low-risk way to try new things while also maintaining the status quo 11
Metadata goals for XC –Enable harvesting and aggregating metadata from multiple repositories –Define a FRBR-based metadata schema to support XC’s user-interface functionality –Design a new type of staff client for processing large batches of metadata through an orchestrated set of services 12
XC Metadata Opportunities “FRBRization” RDA in a non-MARC Environment Linked Data (Semantic Web) 13
XC Software Overview User Interface Faceted, FRBRized, customizable search interface Built on a web application framework (Drupal) Metadata Tools Automated processing of large batches of metadata Connectivity Tools Between XC and an ILS Harvest and synchronize metadata with OAI-PMH Circulation and authentication with NCIP 14
MARCXML DC XC Software Components User Interface for searching and browsing Library Website (on Drupal) Integrated Library System Repository XC Drupal Toolkit Tools for automated processing of large batches of metadata XC Metadata Services Toolkit Tools for connectivity between XC and an ILS XC Circ. Status/Req. Authentication XC NCIP Toolkit 15 XC OAI Toolkit
User Interface for searching and browsing Tools for automated processing of large batches of metadata Tools for connectivity between XC and an ILS MARCXML DC Library Website (on Drupal) Integrated Library System Repository XC Drupal Toolkit XC Metadata Services Toolkit Circ. Status/Req. Authentication XC NCIP Toolkit XC OAI Toolkit Metadata Services Toolkit XC 16 Library Website (on Drupal) Integrated Library System Repository XC Drupal Toolkit Circ. Status/Req. Authentication XC NCIP Toolkit XC OAI Toolkit
Metadata Services Toolkit –Services clean up/normalize inconsistent metadata –Aggregate metadata from multiple repositories –Converts MARC/Dublin Core metadata into XC Schema metadata for use in XC User Interface –MST is not a metadata editor (but can eventually incorporate one) 17
XC Metadata Services –Normalization –Transformation –Aggregation –Authority Control 18
Normalization Service –Metadata stays in same schema –Corrects frequent errors –Prepares metadata for use in other applications –Now available: MARCXML Normalization –In development: Dublin Core Normalization –Services for any XML data possible! 19
Transformation Service –Automates the transformation of metadata from one schema to another –Any XSLT schema crosswalk can be incorporated into the Transformation Service –Creates XC Schema records for use within a complete XC System –Now available: MARCXML to XC Schema –In development: DC to XC Schema 20
Creating XC Schema Records –Transformation Service parses flat MARC records into linked FRBR-based records (work, expression, etc.) –One input record results in several output records –Maintains relationships between input records 21
MARC to XC Schema Transformation XC Work XC Expression XC Manifestation XC Holdings MARC Bibliographic MARC Holdings
“Uplinks” maintain relationships XC Work XC Expression XC Manifestation XC Holdings MARC Bibliographic MARC Holdings 004 “uplink” “Work Expressed” “Expression Manifested” “Manifestation Held”
More About the XC Schema –Flexible, extensible –Optimized to enable XC functionality –Not intended for general use –Can be as complex as it needs to be –Will give us experience manipulating separate records that represent individual FRBR levels
What’s in the XC Schema? – Dublin Core terms – RDA elements – RDA roles – MARC vocabularies – XC elements Newly-defined 25
Metadata “shopping” – Dublin Core terms – RDA elements – RDA roles – MARC vocabularies – XC elements Newly-defined 26
XC Schema Elements: RDA –22 RDA elements –11 RDA role designators –Retain granularity in current MARC data: – Frequency – Numbering of Serials – Coordinates of Cartographic Content – Plate number (music) 27
Aggregation Service –Aggregates records that represent the same resource (manifestation-level identifiers) –Manages relationships between records (FRBR entities, etc.) –Enables automated synchronization of updates for records at each FRBR level –Sets stage for future “non-MARC” RDA implementation 28
Authority Control Service –MARCXML, Dublin Core –Match headings against MARCXML authority resource file –Populate records with authority control identifier –Tool for debugging probable, unlikely matches –Prototype developed by RIT students –Development planned for
User Interface for searching and browsing Tools for automated processing of large batches of metadata Tools for connectivity between XC and an ILS MARCXML DC Library Website (on Drupal) Integrated Library System Repository XC Drupal Toolkit XC Metadata Services Toolkit Circ. Status/Req. Authentication XC NCIP Toolkit XC OAI Toolkit Metadata Services Toolkit Functionality XC 30 Library Website (on Drupal) Integrated Library System Repository XC Drupal Toolkit Circ. Status/Req. Authentication XC NCIP Toolkit XC OAI Toolkit
Metadata Services Toolkit Tasks – Add Repositories – Schedule Harvests – Add Metadata Services – Add Processing Rules – Browse Records – View Logs – Set up Users/Groups 31 Get metadata Verify results/ Troubleshooting Tell MST what to do with metadata Admin functions
MST – Add Repositories 32 To tell the MST about a repository, give it a name of your choice and enter the URL. The MST will automatically do a “handshake” with it and provide “Success” or “Error” messages for each step. When successful, the MST reports on available formats and sets in the remote database.
MST – Schedule Harvests 33 Functions: Add Repositories Schedule Harvests Add Metadata Services Add Processing Rules Browse Records View Logs Set up Users/Groups
MST – Add Metadata Services 34 Functions: Add Repositories Schedule Harvests Add Metadata Services Add Processing Rules Browse Records View Logs Set up Users/Groups
MST - Add Processing Rules 35 Functions: Add Repositories Schedule Harvests Add Metadata Services Add Processing Rules Browse Records View Logs Set up Users/Groups
MST - Add Processing Rules 36
MST - Browse Records 37 “Browse Records” includes full-text search and faceted browse Examples of Facets: Repository Service Schema Errors Examples of Facets: Repository Service Schema Errors
MST - Browse Records 38 Library staff use “Browse Records” to verify that services are functioning properly and to debug any issues. The MST preserves the original record and new records produced by each metadata service. Navigation links take you to predecessor and successor records. The MST preserves the original record and new records produced by each metadata service. Navigation links take you to predecessor and successor records.
MST - Browse Records 39 Navigation to full record display
MST - Browse Records 40 Errors Each service can register error messages and attach errors to specific records. Errors are facets in the MST. Errors Each service can register error messages and attach errors to specific records. Errors are facets in the MST.
MST – Full Record Display 41 Full Record Display: MARC Holding Record Administrative metadata managed by the MST XML viewer (supports any XML schema) Predecessor and Successor links Full Record Display: MARC Holding Record Administrative metadata managed by the MST XML viewer (supports any XML schema) Predecessor and Successor links
MARCXML DC XC Software Components User Interface for searching and browsing Library Website (on Drupal) Integrated Library System Repository XC Drupal Toolkit Tools for automated processing of large batches of metadata XC Metadata Services Toolkit Tools for connectivity between XC and an ILS XC Circ. Status/Req. Authentication XC NCIP Toolkit 42 XC OAI Toolkit
User Interface for searching and browsing Tools for automated processing of large batches of metadata Tools for connectivity between XC and an ILS Drupal Toolkit (User Interface) 43 Integrated Library System MARCXML DC Library Website (on Drupal) Repository XC Drupal Toolkit XC Metadata Services Toolkit XC Circ. Status/Req. Authentication XC NCIP Toolkit XC OAI Toolkit Integrated Library System MARCXML DC Repository XC Metadata Services Toolkit XC Circ. Status/Req. Authentication XC NCIP Toolkit XC OAI Toolkit
XC User Interface Features –Extensive Customization of – Themes – Facet definitions – Displays – Indexes –Takes advantage of XC Schema, metadata granularity, FRBRized metadata structure 44
Kyushu University Library – Drupal UI 45
Kyushu - Search results in Japanese Reasons why these items are shown Query : America Japan Translated : Faceted navigation 46
Faceted browsing of search results 47
Customizable Facet Definitions 48
Edit Authors/Contributors Facet 49
Edit Format Facets 50
Customizable Result Lists 51
Customizable Full Record Display 52
Custom “Browse” pages –Configured using menus, no programming –Customize a page for particular users, collections –Pre-set a “limit” based upon metadata values, e.g. DVDs and Videos Electronic Journals/Databases Juvenile Materials Search a specific location 53
Find DVDs and Videos 54
Find DVDs and Videos – Genre Browse 55
XC’s Contribution More Control over Metadata More Options for Customizing the User Interface 56
XC Metadata Opportunities “FRBRization” RDA in a non-MARC Environment Linked Data (Semantic Web) 57
Moving toward Linked Data –Semantic Web –URIs for all vocabularies, metadata elements –Enables library metadata to be used/understood outside of specific “silos” and outside of record structures –Linked data is understandable by web applications as well as by humans –XC Schema, MST provide a way to get started 58
Metadata Opportunities 59 What if we’ve gotten this ALL WRONG?
XC Enables Experimentation –Harvests a copy of metadata from an ILS or repository –Very low risk of data corruption –XC propagates changes throughout the system –Web interfaces for trouble-shooting –“Sandbox” for experimentation 60
Status of XC Development XC software available for free download via Ongoing XC development includes: –testing with a range of data –performance work –code refinement –documentation 61
eXtensible Catalog Organization, LLC – New funding/governance structure –Maintains and improve XC software –Provides guidance and direction for XC –Fosters a community of XC users/developers Become an XCO sponsor! 62
XC Metadata Opportunities 63 Redesigning from the Inside Out
Jennifer Bowen