Repository Software - Standards Marc Goovaerts, Hasselt University Library OceanTeacher Academy Training Course Development and Management of e-Repositories 8 – 12 April 2013
Overview Metadata standards Authority Control Exchange standards Ontologies – vocabularies Unique identifiers Exchange standards Z39-50 – SRU/SRW OAI – OAI/ORE SWORD
1. Metadata standards
Metadata The word "metadata" means "data about data". Metadata articulates a context for objects of interest -- "resources" such as MP3 files, library books, or satellite images -- in the form of "resource descriptions". As a tradition, resource description dates back to the earliest archives and library catalogs. The modern "metadata" field that gave rise to Dublin Core and other recent standards emerged with the Web revolution of the mid-1990s.
Library standards MARC – MARC21 – MARC XML MODS: Developed by Library of Congress and MARC-related http://www.loc.gov/standards/mods/ Ex. http://iodeweb1.vliz.be/odin-oai/request?verb=GetRecord&metadataPrefix=mods&identifier=oai:iodeweb1.vliz.be:1834/1784 XML-based
Dublin Core Basic description: 15 fields Refinement: Extensions: See http://dublincore.org/documents/dces/ Refinement: Qualified Dublin Core DC Terms: http://dublincore.org/documents/dcmi-terms/ Extensions: Agris AP: Developed by FAO to refine the DC for their agriculture database AGRIS. http://aims.fao.org/metadata/sets/agris-application-profile VOA3R AP: http://ieru.org/voa3r//wiki/index.php?title=VOA3R_Metadata_Application_Profile Problem: granularity
2. Authority Control
Ontologies What? Examples: Description over the web: Ontology is a way to model knowledge in order to enhance sharing (interoperability) Ontology = A declarative model of a domain (concepts + their attributes + relationships between them) Examples: ASFA - http://www4.fao.org/asfa/asfa.htm Agrovoc - http://aims.fao.org/standards/agrovoc/functionalities/search Description over the web: XML – RDF – OWL/SKOS - SPARQL
Unique Identifiers ISSN - ISBN DOI Handle Digital Object Identifier for electronic object: developed by publishers Handle Digital Unique identifier for publications in repositories. Author ID - Unique researcher identifier a transparent method of linking research activities and outputs to these identifiers Web of Science – Scopus ORCID – http://www.orcid.org OceanExpert – http://www.oceanexpert.org
Unique Identifiers: URI Linked Open Data: e.g. OceanExpert linked data describes a method of publishing structured data so that it can be interlinked and become more useful. It builds upon standard Web technologies such as HTTP and URIs, but rather than using them to serve web pages for human readers, it extends them to share information in a way that can be read automatically by computers - http://en.wikipedia.org/wiki/Linked_data Linked Open Vocabularies - http://lov.okfn.org/dataset/lov/ Oceanographic vocabulary: NERC (http://www.bodc.ac.uk/products/web_services/vocab/)
3. Exchange standards
Federated search Client–server protocol for searching and retrieving information from remote computer databases Z39-50: http://en.wikipedia.org/wiki/Z39.50 SRU: http://www.loc.gov/standards/sru/ Both developed by the Library of Congress
Repositories OAI-PMH (Open Archives Initiative Protocol for Metadata Harvesting) For metadata exchange (DC, MODS, METS) Standardization of data exchange mostly used by institutional repositories http://www.openarchives.org/ OAI-ORE (Open Archives Initiative Object Reuse &Exchange) defines standards for the description and exchange of aggregations of Web resources. Used to exchange the full document SWORD Used for alternative submission for repositories Ex. SwordShare – Android App to submit photographs in repositories (http://blog.stuartlewis.com/tag/swordshare/) OA tegenover OAIster