MARCXTM: Topic Maps Modeling of MARC Bibliographic Information

Slides:



Advertisements
Similar presentations
Bibliographic Framework Initiative Approach for MARC Data as Linked Data Sally McCallum Library of Congress.
Advertisements

Standards showcase: MODS, METS, MARCXML ALA Annual 2006 Rebecca Guenther and Jackie Radebaugh Network Development and MARC Standards Office Library of.
Lis512 lecture 4 the MARC format structure, leader, directory.
An Introduction to MODS: The Metadata Object Description Schema Tech Talk By Daniel Gelaw Alemneh October 17, 2007 October 17, 2007.
MARC Machine Readable Cataloging & MARC family
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
StatCat Building a Statistical Data Finder ssrs.yale.edu/statcat Steven Citron-Pousty Ann Green Julie Linden Yale University.
MODS What is MODS: When is MODS use:
WMES3103 : INFORMATION RETRIEVAL
© Tefko Saracevic, Rutgers University1 metadata considerations for digital libraries.
RDF Kitty Turner. Current Situation there is hardly any metadata on the Web search engine sites do the equivalent of going through a library, reading.
Introduction to Databases Transparencies
Introduction to MARC Cataloguing Part 2 Presenters: Irma Sauvola: Part 1 Dan Smith: Part 2.
Batch-conversion of Non-standard Multiscript Records by XSLT Lucas Mak Metadata and Catalog Librarian Michigan State University Catalog Management Interest.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
By Carrie Moran. To examine the Metadata Object Description Schema (MODS) metadata scheme to determine its utility based on structure, interoperability.
XML & Library Applications ELAG 2001 Poul Henrik Jørgensen, Danish Bibliographic Centre,
Metadata Standards and Applications 5. Applying Metadata Standards: Application Profiles.
Jennifer Bowen, University of Rochester ALA Midwinter Conference January 22, 2012, Dallas, TX The eXtensible Catalog (XC): Transitioning to a Post-MARC.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
Object and component “wiring” standards This presentation reviews the features of software component wiring and the emerging world of XML-based standards.
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
Highlights from recent MARC changes Sally McCallum Library of Congress.
Developing Databases and Selecting an Appropriate Library System.
1 Metadata Standards Catherine Lai MUMT-611 MIR January 27, 2005.
Local Holdings Maintenance: The Basics. Agenda Defining Local Holdings Accessing Connexion Searching in Connexion Understanding an LHR Deriving LHR’s.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
Resource Description and Access Deirdre Kiorgaard Australian Committee on Cataloguing Representative to the Joint Steering Committee for the Development.
Lifecycle Metadata for Digital Objects November 1, 2004 Descriptive Metadata: “Modeling the World”
ESDI Workshop on Conceptual Schema Languages and Tools
If not DC, then MODS? A look at the Metadata Object Description Schema Cheryl Walters Kayla Willey ULA Annual Conference St. George, Utah May 17, 2006.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Future of Cataloguing: how RDA positions us for the future for RDA Workshop June, 2010.
COMMON COMMUNICATION FORMAT (CCF). Dr.S. Surdarshan Rao Professor Dept. of Library & Information Science Osmania University Hyderbad
Sally McCallum Library of Congress
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
Collection Management Systems
Presenter: Tito Wawire US Embassy, Library of Congress.
A RCHIVAL COLLECTIONS IN A D IGITAL W ORLD Cheryl Walters Nov. 6, 2008.
An information retrieval system may include 3 categories of information:  Factual  Bibliographical  Institutional  Exchange and sharing of these categories.
A centre of expertise in digital information management UKOLN is supported by: Metadata – what, why and how Ann Chapman.
Some basic concepts Week 1 Lecture notes INF 384C: Organizing Information Spring 2016 Karen Wickett UT School of Information.
XML Databases Presented By: Pardeep MT15042 Anurag Goel MT15006.
Chinese MARC (Taiwan) and its bibliographic database Ching-Chen Anthony Mao (Fu Jen Catholic University) Ching-fen Frances Hsu (National Central Library)
1 Metadata: an overview Alan Hopkinson ILRS Middlesex University.
Engineering, 7th edition. Chapter 8 Slide 1 System models.
Chapter 2 Object-Oriented Paradigm Overview
Lecture 12 Why metadata? CS 502 Computing Methods for Digital Libraries Cornell University – Computer Science Herbert Van de Sompel
From the old to the new… Towards better resource discoverability
Professional development training on cataloging at the University Wisconsin-Madison Memorial Library, USA 14th October -24th October, 2016 Aigerim Shurshenova.
XML QUESTIONS AND ANSWERS
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Catherine Lai MUMT-611 MIR January 27, 2005
DATA MODELS.
Alphabet Soup: Choosing Among DC, QDC, MARC, MARCXML, and MODS
Introduction to Metadata
Abstract descriptions of systems whose requirements are being analysed
Markup Languages Gilok Choi 9/17/2018
Lifecycle Metadata for Digital Objects
Cataloging Tips and Tricks
System models October 5, 2005.
MARC: Beyond the Basics 11/24/2018 (C) 2006, Tom Kaun.
PREMIS Tools and Services
Accommodating local cataloguing traditions in a global context
An Introduction to Software Architecture
Some Options for Non-MARC Descriptive Metadata
Presentation transcript:

MARCXTM: Topic Maps Modeling of MARC Bibliographic Information 2005.10.07 Hyun-Sil Lee, Yang-Seung Jeon, Sung-Kook Han Semantic Web Services Research Group Won Kwang University, Korea

Agenda Description of Bibliographic Information MARC21 MARCXML MODS Topic Maps Modeling of MARC21 Requirements of MARC Modeling UML Model MARCXTM Implementations Conclusions 2019-01-15

Overview: MARC MARC: Machine-Readable Cataloging standards used for the representation of bibliographic and related information for books and other library materials in machine-readable form and their communication to and from other computers. All MARC Standards conform to: ISO 2709:1996 Information and documentation - Format for Information Exchange. MARC was originally designed in the late 1960’s to aid in the transfer of bibliographic data onto magnetic tape, and also to replace the printed catalog cards with electronic forms. There are a number of implementation of MARC, including USMARC used in US, CAN/MARC used in Canada, and UKMARC used in Britain. After discussions and minor changes to USMARC and CAN/MARC, MARC21 was evolved to harmonize both formats and to cover diverse types of resources including digital materials and Internet resources. MARC accommodates extensive data elements describing all forms of materials susceptible to bibliographic description, as well as related information. 2019-01-15

Family of MARC Formats Bibliographic Authorities Holdings a carrier for bibliographic information about printed and manuscript textual materials, computer files, maps, music, serials, visual materials and mixed materials. Authorities a carrier for information concerning the authorized forms of names, [titles,]subjects, and subject subdivisions to be used in constructing access points in MARC records, the forms of these names, subjects and subdivisions that should be used as references to the authorized form, and the relationships among these forms Holdings a carrier for holdings information for three types of bibliographic items: single-part; multipart; serial and may include: copy-specific information; information peculiar to the holding institution; information needed for local processing, maintenance or preservation; version information. Classification a carrier for information about classification numbers and the captions associated with them that are formulated according to a specified authoritative classification scheme Community Information a carrier for descriptions of non-bibliographic resources that fulfil the information needs of a community. 2019-01-15

Supporting Documentation of MARC MARC 21 Specification for Record Structure, Character Sets, and Exchange Media Character sets MARC-8 (8-bit encoding) UCS/UNICODE UTF-8 (8/16 bit encoding) Repertoire of 15,000+ characters Latin; Cyrillic; Hebrew; Arabic; CJK Code lists Countries, Geographical; Languages; Sources; Relators 2019-01-15

MARC Record Format Leader the first 24 characters of the record defining parameters for processing the record data elements that contain coded values and are identified by relative character position Directory directory entries that contain the tag used in variable fields, starting location, and length of each field within the record constructed by computer from the bibliographic record, and can be reconstructed in the same way if any of the cataloging information is altered Variable Field Control 00X fields in the MARC 21 formats are variable control fields. either a single data element or a series of fixed-length data elements identified by relative character position Data Indicators: The first two characters which interpret or supplement the data found in the field. Subfield codes: Two characters that precede each data element within a field that requires separate manipulation 2019-01-15

MARC Record Format: Example Leader Directory Variable Control Field Data 2019-01-15

MARC Record Format: Example Sign Post 2019-01-15

Formalization of MARC <MARC21Record>::=<Leader><Directory><VariableField> <Directory>::=<DirectoryElement>* <DirectoryElement>::=<Tag><Length><Position> <VariableField>::=<ControlField><DataField> * <ControlField>::=<ControlNumber><ControlFieldElement> <DataField>::=<Tag><Indicator><SubField> * <Indicator>::=<FirstIndicator><SecondIndicator> <SubField>::=<SubFieldCode><SubFieldValue> 2019-01-15

Problems with MARC Lack of expandability due to rigorous record formats, since it was originally intended for the production of printed catalogue cards in 1960s Difficulties in representing bibliographic relationships Ambiguities in describing MARC records Incompatibilities between other MARC formats since the various library systems have invented their own non-standard peculiarities in order to handle local bibliographic materials Weaknesses in describing bibliographic attributes of digitized resources 2019-01-15

Character Set Conversion MARCXML MARC21 (2709)Records MARC21 (XML) Records Tagging Transformations Character Set Conversion Dublin Core Records MODS Records Other XML Formats HTML Output MARC Validation 2019-01-15

MARCXML MARCXML: a framework for working with MARC data in a XML environment Design Considerations and Features Simple and Flexible MARC XML Schema for representing a complete MARC record in XML Supports all MARC encoded data regardless of format Lossless Conversion of MARC to XML Roundtrip ability from XML back to MARC Data Presentation and Data Conversion Extensibility A component-oriented, extensible architecture allowing users to plug and play different software pieces to build custom solutions 2019-01-15

MARCXML: Example 2019-01-15

MODS MODS: Metadata Objects Description Schema Features XML-based descriptive metadata standard that includes a subset of data elements derived from MARC21 Features MODS is intended to complement other metadata formats. MODS provides a richer bibliographic element set than Dublin Core. MODS has a high level of compatibility with MARC records because it inherits the semantics of the equivalent data elements in the MARC21 bibliographic format. In MODS some elements that appear in various fields in MARC have been repackaged into one. So MODS can define 19 upper metadata elements. MODS takes advantage of the XML environment. It uses language-based tags rather than the numeric tags traditional to MARC. MODS also has flexible linking mechanisms by providing for all the top-level elements with attributes such as xlink and ID. MODS accommodates special requirements for digital resources. 2019-01-15

MODS: Example 2019-01-15

Topic Maps Modeling of MARC 21 Requirements for MARC Modeling A model should be able to support the full set of data elements in MARC21 to achieve seamless compatibility with MARC formats. This is a practical requirement in order to embrace the current circumstances even though it is awkward. It should have the same expressive power as metadata. This implies that the model should be realized with semantic descriptors to be used in an XML environment instead of obsolete alphanumeric codes. The use of attributes should be minimized to maintain consistency and increase readability. It should be able to maintain the structure of MARC record format A model does not intend to develop bibliographic metadata system based on MARC. A model can be handled without expertise in MARC to achieve the usability of the model. A model should be simple and lightweight for system implementation and harmonization with other models. 2019-01-15

UML diagram of MARC Modeling DataField TagCode:String DataFieldName:String Repeatability:{NR, R} Description:String FirstIndicator SecondIndicator SubField IndicatorItem IndicatorCode:{Integer, ‘#’} IndicatorName:String SubFieldItem SubFieldCode:String SubFieldValue:String SubFieldName:String 1 1…* 0…* 2019-01-15

MARCXTM Implementation Librarians/Users MARCXTM for MARC Specification MARC Records XTM Representation of MARC Records 2019-01-15

XTM Realization of MARC Specification DataField: <association> of data item, indicators, and subfield codes <association id="data100"> <instanceOf> <topicRef xlink:href="#DataField"/> </instanceOf> <member> <roleSpec> <topicRef xlink:href="#Field"/> </roleSpec> <topicRef xlink:href="#Field100"/> </member> <topicRef xlink:href="#FirstIndicator"/> <topicRef xlink:href="#TypeOfPersonalNameEntryElement"/> <topicRef xlink:href="#SecondIndicator"/> <topicRef xlink:href="#Undefined"/> <topicRef xlink:href="#SubField"/> <topicRef xlink:href="#a100"/> <topicRef xlink:href="#b100"/> <topicRef xlink:href="#c100"/> <topicRef xlink:href="#d100"/> ……………………………………. <topicRef xlink:href="#q100"/> <topicRef xlink:href="#t100"/> <topicRef xlink:href="#u100"/> <topicRef xlink:href="#four100"/> <topicRef xlink:href="#six100"/> <topicRef xlink:href="#eight100"/> </association> 2019-01-15

XTM Realization of MARC Specification Hiding the real data value by topic abstraction <topic id="TypeOfPersonalNameEntryElement"> <baseName> <baseNameString> Type of personal name entry element </baseNameString> </baseName> <occurrence> <instanceOf> <topicRef xlink:href="#Forename"/> </instanceOf> <resourceData> 0 </resourceData> </occurrence> <instanceOf> <topicRef xlink:href="#Surname"/> </instanceOf> <resourceData> 1 </resourceData> <instanceOf> <topicRef xlink:href="#FamilyName"/> </instanceOf> <resourceData> 3 </resourceData> </topic> 2019-01-15

MARCXTM for MARC Specification 2019-01-15

XTM Realization of MARC Records Complex to maintain MARC structure due to its idiosyncratic dependency between indicators and subfield code Difficult to realize the seamless compatible with MARC records Repeatability of subfield elements are individually defined in MARC specification. XTM supports for MARC modeling XTM does not provide multiple instances for <occurrence>. Difficult to define record schema with <association>. 2019-01-15

XTM Realization of MARC Records 2019-01-15

MARCXTM for MARC Records 2019-01-15

Conclusions MARCXTM: Topic Maps-based implementation of MARC 21 MARCXTM for MARC Specification MARCXTM for MARC Records Application of Topic Maps paradigm to bibliographic information system Seamless compatible with MARC 21 expressive power as metadata XTM is inappropriate to represent MARC format due to its idiosyncratic structure and dependency between data elements. Metadata models similar to Dubline Core or MODS are necessary for XTM modeling of MARC. FRBR (Functional Requirements for Bibliographic Records) framework is an attractive model for XTM modeling of bibliographic information system. 2019-01-15

Thank you!!!