An Update from the PCC URIs in MARC Task Group

Slides:



Advertisements
Similar presentations
LEVERAGING THE DEEPER GRAPH (VIA QUERIES OR PATTERNS) STEVEN FOLSOM PAOLO CICCARESE LD4L USE CASE 4.
Advertisements

RDF AND LINKED DATA Jenn Riley Head, Carolina Digital Library and Archives The University of North Carolina at Chapel Hill.
Module 6: Preparing for RDA... Library of Congress RDA Preconference for MLA/DLA May 4, 2011.
Module 6: Preparing for RDA... Library of Congress RDA Seminar, University of Florence, May 29-June 2, 2011.
Or… Don’t expect a lot of URIs in records, yet. (in 7 minutes, 53 seconds) Steven Folsom, Cornell University LD4L Workshop, February 23, 2015 URIs you.
RDA AND AUTHORITY CONTROL Name: Hester Marais Job Title: Authority Describer Tel: Your institution's logo.
1 Primo Product Working Group IGeLU Agenda for Today PWG Business Primo PWG members and roles Work of the group in 2009/10 Enhancements management.
LC Training for RDA: Resource Description & Access Module 6: Authorities II Part 4: FRBR Group One and NARs Cooperative and Instructional Programs Division.
RDA and Linked Data Steve Henry University of Maryland March 2, 2013.
1 On the Record Report of the Library of Congress Working Group on the Future of Bibliographic Control Diane Boehr Head of Cataloging, NLM
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
Society of American Archivists Research Forum 18 August 2015 A Deep Dive into the Archival MARC Records in WorldCat (and ArchiveGrid) Jackie Dooley Program.
OCLC Online Computer Library Center Kathy Kie December 2007 OCLC Cataloging & Metadata Services an introduction.
Module 6: Preparing for RDA... LC RDA for Georgia Cataloging Summit Aug. 9-10, 2011.
Implementation scenarios, encoding structures and display Rob Walls Director Database Services Libraries Australia.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
All the Reasons to be a Fan of PCC's Strategic Directions Shifting from Authorities to People, Places, Events, Awards… Steven Folsom | Metadata.
Module 6: Preparing for RDA... LC RDA for NASIG - June 1, 2011.
RDA DAY 1 – part 2 web version 1. 2 When you catalog a “book” in hand: You are working with a FRBR Group 1 Item The bibliographic record you create will.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Introduction to the Semantic Web and Linked Data
THE BIBFRAME EDITOR AND THE LC PILOT Module 3 – Unit 1 The Semantic Web and Linked Data : a Recap of the Key Concepts Library of Congress BIBFRAME Pilot.
11 Ana Lupe Cristán Policy and Standards Division Library of Congress June 2012 Name Authorities - What’s New?
RDA Updates Since the Creation of LC RDA Training Material (May 2012) Annual Meeting of Council on East Asian Libraries March 19, 2013 Jessalyn Zoom Library.
Future of Cataloguing: how RDA positions us for the future for RDA Workshop June, 2010.
Ex Libris, LOD and BIBFRAME
BIBFRAME Update Session  Library of Congress pilot and development  Beacher Wiggins – Pilot project  Sally McCallum – Vocabulary development  A supplier’s.
Sally McCallum Library of Congress
Combine_and_stir (Aleph data + RDF + Python + other things) IGeLU 2015 Developer’s Day Budapest, Hungary Laura Akerman.
The world’s libraries. Connected. RDA & OCLC Glenn Patton Director, WorldCat Quality Management.
CNI Spring 2016 Membership Meeting San Antonio TX Linked Data Implementations— Who, What and Why? Karen Smith-Yoshimura OCLC Research.
OpCo MEETING May 5,, 2016 KATE HARCOURT Vision, Mission, and Strategic Directions Update.
PCC Standing Committee on Standards An Overview. Overview  Our charge  Our membership  Our activities  A highlighted activity.
Setting the stage: linked data concepts Moving-Away-From-MARC-a-thon.
AN ARCHETYPE FOR INFORMATION ORGANIZATION AND CLASSIFICATION OCLC WorldCat.
Information organization Week 2 Lecture notes INF 380E: Perspectives on Information Spring 2015 Karen Wickett UT School of Information.
RSC Strategy Gordon Dunsire, Chair, RDA Steering Committee
Putting Linked Data at the Service of Libraries
Ready...Set...URIs...Actionable!
LINKED DATA Telling the Library’s Story through
Repository Software - Standards
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
A Lightweight Structured Data Implementation Using JSON-LD and Schema
Authority Control and Alma: A look at change from many Angles
Metadata Editor Introduction
Welcome It’s our #MedLitBlitz @Mark2Cure.
ALA Practical Linked Data With Open Source
Jenn Riley Metadata Librarian Digital Library Program
BIBFRAME at the Library of Congress
Module 6: Preparing for RDA ...
Getting started With Linked Data.
Applications of IFLA Namespaces
METADATA SERVICES’ NEW CAPACITIES
Cataloging Tips and Tricks
Appellations, Authorities, and Access
FRAD: Functional Requirements for Authority Data
MARC21 changes to accommodate RDA
Enhancing Student Learning and Retention with Community Partnerships
Cataloging the Internet
Key Considerations and Decisions Before Consortia Implementation
CSU Millennium to Alma migration
PREMIS Tools and Services
Introduction to Metadata
school self-evaluation and improvement toolkit
Recording the Attributes of Series MARC21 in NACO RDA Series Authority Records Welcome back, everyone. In this module, we are going to continue talking.
Onboarding Webinar 13 April 2019 Presented by and.
FRBR and FRAD as Implemented in RDA
Jenn Riley Metadata Librarian Digital Library Program
Cataloging Archival Collections: Grouping Collections to Aid Retrieval
Introduction to BIBFRAME
Presentation transcript:

An Update from the PCC URIs in MARC Task Group ELUNA | OKC | 2016 Talk Description: In recent years the MARC format has been extended to provide for identifiers for a wide range of entities represented in bibliographic and authority records, but thus far these new elements have not consistently been populated. Moreover, the current practice is generally to supply an alphanumeric string rather than a URI (a key requirement for the effective dissemination of data on the Semantic Web) even though URIs are available in many cases. This session will be an update from the Program for Cooperative Cataloging (PCC) URIs in MARC Task Group which has been working since the Fall of 2015 to define best practices for capturing URIs in MARC. These best practices have the potential to enable for easier management of our data in our local systems, new use cases in our discovery environments, and ultimately result in more interconnected data when we convert our legacy data to linked data. Image credit: http://www.whorange.net/ Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library

Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library Talk Outline Role of URIs and the Semantic Web Brief History of “Linking” Identifiers in MARC Why the PCC Cares PCC URIs in MARC Task Group Activities Use cases for URIs in PNX Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library

Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library In linked data a URI identifies the thing. The thing can have labels (even preferred labels) and relationships to other things, but none of these traits *identify* the thing. So how do we get there? Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library

Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library URIs Continued The URI stands-in or symbolizes the actual thing. We can say that thing has traits. In this case the URI on the left stands in for me. I am a person as defined by the FOAF ontology. I have a name, “Steven Folsom”. I know another person, Violeta who is identified by a different URI. Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library

Records/Authorities and the Things They Describe Libraries have Records/Authorities about things The web has webpages about things When we talk about linked data we mean URIs that identify the things themselves, with associated data. We need information resources (websites and authorities) to be separate entities from the things themselves A Person and Record/Authority/Website About a Person have different traits, e.g. Creators/Creation dates (You can do the math.) Size (height and weight vs. bytes) Suppressed means something completely different when talking about a person (or class of persons) vs. a record RDA and 3XX’s go a long way toward generating RWOs. The problem is that the RWO data in these authorities don’t have a separate URI for the thing they describe, making them hidden in the record. A big and meaningful change would be to give the persons in these records their own URI and relate them to the authority. LOC is currently underway with introducing separate URIs for the Authority and the Things they are about. Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library

How did we come to this point in history? Turns out this was 1994, but you get the point. Towards a New Beginning in Cooperative Cataloging, History, Progress, and Future of Cooperative Cataloging Council by Library of Congress. (https://babel.hathitrust.org/cgi/pt?id=mdp.39015034240682;view=1up;seq=35). Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library

Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library History of URIs an the $0 http://www.loc.gov/marc/marbi/2009/2009-dp01-1.html http://www.loc.gov/marc/marbi/2009/2009-dp06-1.html http://www.loc.gov/marc/marbi/2010/2010-dp02.html http://www.loc.gov/marc/marbi/2010/2010-dp03.html http://www.loc.gov/marc/marbi/2010/2010-06.html http://www.loc.gov/standards/sourcelist/standard-identifier.html In 2009-2010, there was a flurry of MARBI Discussion Papers about the use of URIs. After some back and forth, including considering using the $1 for URIs and keeping the $0 for control numbers, the MARBI group decided to have both control numbers and URIs in the $0. Consider this foreshadowing… Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library

Led to Current Specification for URIs 100 1#$aBach, Johann Sebastian,$d1685-1750 $0(uri)http://id.loc.gov/authorities/names/n79021425 _________________________________________ 600 17$aBach, Johann Sebastian,$d1685-1750 $0(uri)http://id.loc.gov/authorities/names/n79021425$2naf 024 7#$ahttp://id.loc.gov/authorities/names/n79021425$2naf Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library

Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library Related Work http://kcoyle.blogspot.com/2013/07/linked-data-first-steps-catch-21.html Local Library Practice/Experimentation GW Stanford Princeton Cornell OCLC FAST $0’s Univ. of Mannheim & Amsterdam Karen Coyle in 2013 was calling for libraries to double down on using URIs, and to make local decisions outside of the MARC spec to allow URIs to be more useful upon conversion to RDF. Many libraries around that same time began adding URIs to their MARC records experimentally in a more concerted effort. George Washington under Jackie Shieh began adding them as a part of their RDA reclamation work. More on this in a second. Stanford began working with vendors like Backstage to have URIs added to local fields in the MARC because (my understanding) of the complications of how OCLC does or doesn’t treat them. Princeton and Cornell began experimenting as well. Sentiment for $0s seemed to shift with the addition of FAST to the OCLC database. For each FAST heading in a bib there is now a $0 linking it to the FAST entity. On the Primo front, Univ. of Mannheim & Amsterdam began enhancing PNX data to connect with RDF in the Dutch National Library Repository. (http://www.slideshare.net/lukask/the-drawbridge-to-knowledge) Image credit: www:papercitymag.com Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library

Jackie Shieh and George Washington University’s RDA records.

Cornell’s use of VIVO URIs for Cornell Faculty, in this case to control advisors in records for theses, derived from the Graduate School Database. The names aren’t using LCNAF, but instead as they appear in the Grad database. Opportunistically both the grad database and VIVO have faculty share faculty id’s so the script that creates the MARC records can look up URIs in VIVO for faculty.

May 2015 Call for best practices posted to the PCC list (white paper: http://bit.ly/1BMQ0ed) Some of the same functions that URIs serve in Linked Data (to identify, collate, and disambiguate) are functions they could serve in MARC right now. Adding URIs to MARC records would also make the transition to linked data less bumpy; MARC to linked data converters would require less reconciliation if we were adding URIs to records now. Image credit: https://www.pinterest.com/eamesoffice/dogs-and-eames-chairs/

PCC Strategic Direction 3 “Existing methods of library authority control are based on constructing unique authorized access points as text strings (literals). This string-based approach works somewhat well in the closed environment of a traditional library catalog, but not in an open environment where data are shared and linked, and so require unique identifiers. The web presents both a challenge and an opportunity for libraries, which are now in a position to take advantage of authorities created outside of the library world, and also to contribute library authority data for use by other communities.” The PCC Strategic plan anticipates the need to move away from using unique text strings to as we share our data on the wider web. Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library

PCC URIs Task Group Activities MAC Discussion Papers (Submitted this cycle) Removal of (uri) Adding $0 to select fields where currently not permitted. Bibliographic Format: Country of Producing Entity (257) Bib and Authority Format: Associated Language (377) URIs for $4 (proposal from British Library in consultation with the PCC Task Group on URIs) Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library

[Tenative/Hopeful] Blockbuster Summer Releases Formulating and Obtaining URIs: A Guide to Commonly Used Vocabularies and Reference Sources RWO Discussion Paper Hoping to share with the community later this year two documents Formulating and Obtaining URIs: A Guide to Commonly Used Vocabularies and Reference Sources will: Outline for the commonly used vocabularies and data sources in MARC how their related systems construct URIs Where in the interface you can go to find them Link to the RWO discussion paper RWO Discussion Paper will outline the differences between URIs for Authorities, Things, and Related Websites. Link to the Formulating and Obtaining URIs: A Guide to Commonly Used Vocabularies and Reference Sources document. Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library

Unanswered Questions in MARC Repeatability, and how to perform heading maintenance if $0’s are repeated? Consideration being given to accommodating various cataloger realities Access to Authority Records Single URI per $a, other URIs linked to from Authority/Thing No access to related Authority Records Repeated $0s in Bib per $a Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library

Unanswered Questions in MARC How to capture URIs in order to provide the clearest semantics? (URIs for things vs. URIs for authorities about things) Currently only one subfield ($0 or $w) to capture URIs in BIBs per field; current MARC semantics is “record control number”. Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library

*Draft* solution to the unanswered questions Bib Format 100 1# $a[Name] $d[date] $0[AuthorityURI]$1[ThingURI] $2[Source of $a] Authority Format 024 [New Second Indicator for Authority or Thing] $a [URI] The PCC URIs in MARC Task Group hasn’t come to an agreement for solutions to the outstanding questions, but here is a sketch of what we might propose. Part of the reason to propose a talk like this is to get real time peer review of your work. Goals of this proposal: We can continue to use authorities for as long as they are useful. We can allow catalogers to repeat $0 and $1 in bib’s because the $2 can be used to designate which Data Source to use to maintain the heading in MARC. We can note URIs for Authorities, distinct from URIs for Things so that we can convert our data to RDF more meaningfully. Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library

<ex:ThisPresentation> dcterms:creator <http://orcid <ex:ThisPresentation> dcterms:creator <http://orcid.org/0000-0003-3427-5769> . <http://orcid.org/0000-0003-3427-5769> , owl:sameAs <http://vivo.cornell.edu/individual/sf433> , <http://viaf.org/viaf/316560733> , <http://worldcat.org/entity/person/id/2630057950> . <http://id.loc.gov/authorities/names/no2015079947> mads:identifiesRWO <http://orcid.org/0000-0003-3427-5769> .

Ex Libris 2016 Linked Data Collaboration Program Current discussions on what we can do in Primo once the URIs are in PNX What to display (VIAF, id.loc,gov, MESH, etc.)? What to index from various data sources (VIAF, id.loc,gov, MESH, etc.)? Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library

Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library A Chew Chiat Naun (from Cornell) quote from a Back Stage Metadata Matters event. [A Chew Chiat Naun quote from a Back Stage Metadata Matters event.] Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library

So if barcodes don’t excite you, maybe dogs and chairs. So if Barcodes don’t excite you… Maybe Dogs and Chair can better characterize how I hope the library community reacts to the use of URIs in MARC. Like dogs, hopefully chance encounters with URIs will provide positive feelings because of the promise of they provide. Eventually like chairs, URIs will be ubiquitous and well designed, where form meets function. Image credit: https://www.instagram.com/p/2giK7lOZa8/ Steven Folsom | http://orcid.org/0000-0003-3427-5769 | Harvard Library