Images Application Profile meeting 29th October 2007, London Julie Allinson Digital Library Manager Library & Archives, University of York SWAP a Dublin.

Slides:



Advertisements
Similar presentations
Pete Johnston, Eduserv Foundation 16 April 2007 Update on work of Joint DCMI/IEEE LTSC Task.
Advertisements

A centre of expertise in digital information management The OAI Protocol for Metadata Harvesting Andy Powell UKOLN,
Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
A centre of expertise in digital information management UKOLN is supported by: British Academy e-Resources Policy Review: UKOLN Report.
Andy Powell, Eduserv Foundation July 2006 Repository Roadmap – technical issues.
Andy Powell, Eduserv Foundation Feb 2007 The Dublin Core Abstract Model – a packaging standard?
Repositories thru the looking glass Andy Powell Eduserv Foundation
A centre of expertise in digital information management UKOLN is supported by: The Dublin Core Application Profile for Scholarly Works.
JISC CETIS Metadata and Digital Repository SIG meeting, Manchester 16 April 2007 A Dublin Core Application Profile for Scholarly Works (eprints) ‏ Julie.
A centre of expertise in digital information management UKOLN is supported by: XML and the DCMI Abstract Model DC Architecture WG Meeting,
International Conference on Dublin Core and Metadata Applications DC-Scholar, 24 th September /10/2014 Scholarly Works Application.
A centre of expertise in digital information management UKOLN is supported by: The Dublin Core Application Profile for Scholarly Works.
Eprints Application Profile
Open Repositories 2007 Eprints Application Profile The Eprints Application Profile: a FRBR approach to modelling repository metadata Julie Allinson, UKOLN,
A centre of expertise in digital information management UKOLN is.
A centre of expertise in digital information management UKOLN is.
Developing a Metadata Exchange Format for Mathematical Literature David Ruddy Project Euclid Cornell University Library DML 2010 Paris 7 July 2010.
An Introduction to MODS: The Metadata Object Description Schema Tech Talk By Daniel Gelaw Alemneh October 17, 2007 October 17, 2007.
The JISC IE Metadata Schema Registry Pete Johnston UKOLN, University of Bath JISC Joint Programmes Meeting Brighton, 6-7 July 2004
UKOLN is supported by: OAI-ORE a perspective on compound information objects ( Defining Image Access.
UKOLN is supported by: Repositories and the wider context Exchange of Experience on Institutional/Digital Repositories 3 November 2006, Liverpool Julie.
A centre of expertise in digital information management UKOLN is supported by: Eprints Application Profile UK Repositories Search Project.
Eprints Special Session DC-2006, Mexico Wednesday Oct 4, Julie Allinson (UKOLN, University of Bath) and Andy Powell (Eduserv Foundation)
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
1 CS 502: Computing Methods for Digital Libraries Lecture 17 Descriptive Metadata: Dublin Core.
Metadata : Concentrating on the data, not on the scheme Imma Subirats FAO of the United Nations Marcia Zeng Kent State University euroCRIS Meeting Bologna.
Europeana: Europe's Digital Library, Museum and Archive Ashley Carter and Dana Sagona.
UKOLUG - July Metadata for the Web RDF and the Dublin Core Andy Powell UKOLN, University of Bath UKOLN.
Metadata Standards and Applications 5. Applying Metadata Standards: Application Profiles.
Cornell CS Bibliographic Concepts CS 502 – Carl Lagoze – Cornell University Acks to H. Van de Sompel.
Update on the VERSIONS Project for SHERPA-LEAP SHERPA Liaison Meeting UCL, 29 March 2006.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
Andy Powell, Eduserv Foundation June 2006 Eprints Application Profile.
Using IESR Ann Apps MIMAS, The University of Manchester, UK.
‘The Universal Catalogue’ a cultural sector viewpoint David Dawson Senior Policy Adviser (Digital Futures) Museums, Libraries and archives Council.
SWAP FOR DUMMIES. Scholarly Works Application Profile a Dublin Core Application Profile for describing scholarly works (eprints) held in institutional.
The Evolution of Mathematical Communication 1 Using Metadata for the Interlinking of Digitized Mathematics Thomas Fischer State and University.
The DNER - a national digital library Andy Powell ZIG Meeting, York October 2001 UKOLN, University of Bath UKOLN is funded by Resource:
The JISC IE Metadata Schema Registry and IEEE LOM Application Profiles Pete Johnston UKOLN, University of Bath CETIS Metadata & Digital Repositories SIG,
The Future of Cataloging Codes and Systems: IME ICC, FRBR, and RDA by Dr. Barbara B. Tillett Chief, Cataloging Policy & Support Office Library of Congress.
A centre of expertise in digital information management UKOLN is supported by: FRBR and Metadata Application Profiles Peter Cliff, Research.
1 herbert van de sompel CS 502 Computing Methods for Digital Libraries Cornell University – Computer Science Herbert Van de Sompel
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
Accessing a national digital library: an architecture for the UK DNER Andy Powell ELAG 2001, Prague 7 June 2001 UKOLN, University of Bath
Evidence from Metadata INST 734 Doug Oard Module 8.
RDA: Benefits and opportunities Gordon Dunsire Centre for Digital Library Research University of Strathclyde, Glasgow Presented at the CIG Standards Forum,
1 Dublin Core & DCMI – an introduction Some slides are from DCMI Training Resources at:
JISC Information Environment Service Registry (IESR) Ann Apps MIMAS, The University of Manchester, UK.
AGROVOC Thesaurus. 1980s: developed as multilingual structured thesaurus for agricultural terminology (“rice”) : parallel effort to express thesaurus.
A centre of expertise in digital information management UKOLN is supported by: Metadata for the People’s Network Discovery Service PNDS.
Intellectual Works and their Manifestations Representation of Information Objects IR Systems & Information objects Spring January, 2006 Bharat.
FRBR: Cataloging’s New Frontier Emily Dust Nimsakont Nebraska Library Commission NCompass Live December 15, 2010 Photo credit:
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
From FRBR to FRBR OO through CIDOC CRM… A Common Ontology for Cultural Heritage Information Patrick Le Bœuf, National Library of France International Symposium.
Pete Johnston, Eduserv Foundation 16 April 2007 An Introduction to the DCMI Abstract Model JISC.
IESR Metadata Ann Apps MIMAS, The University of Manchester, UK.
A centre of expertise in digital information management UKOLN is supported by: Functional Requirements Eprints Application Profile Working.
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
Surveying the landscape: collection-level description & resource discovery JISC/NSF DLI Projects meeting, Edinburgh, 24 June 2002 Pete Johnston UKOLN,
DC Architecture WG meeting Wednesday Seminar Room: 5205 (2nd Floor)
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
A centre of expertise in digital information management UKOLN is supported by: IEMSR, the Information Environment & Metadata Application.
Metadata & Repositories Jackie Knowles RSP Support Officer.
RDA and Linked Data Gordon Dunsire Presented at Selmathon 1, 9 May 2016, Stockholm, Sweden.
Attributes and Values Describing Entities. Metadata At the most basic level, metadata is just another term for description, or information about an entity.
Session 3 Metadata & Workflow
RDA work plan: current and future activities
From FRBR to FRBROO through CIDOC CRM…
Attributes and Values Describing Entities.
Attributes and Values Describing Entities.
Presentation transcript:

Images Application Profile meeting 29th October 2007, London Julie Allinson Digital Library Manager Library & Archives, University of York SWAP a Dublin Core application profile for describing scholarly works Produced forby the Eduserv Foundation and UKOLN

the order of things background, scope and functional requirements … the model, application profile and vocabularies current situation

where are we coming from? background and scope

background and scope overall aim: to offer a solution to issues with using simple DC for interoperability to provide a richer metadata profile for the Intute repository search project development summer 2006 funded (and scoped) by JISC co-ordinated by Andy Powell and Julie Allinson; with Pete Johnston and others scope Dublin Core properties as far as possible, plus other necessary elements identifiers for the eprint and full-text(s); and for related resources support subject access solutions (without mandating any) additional properties to fulfil search/browse requirements bibliographic citations and references citing other works

terminology eprints, research papers and scholarly works are used synonymously for a ''scientific or scholarly research text'‘ (as defined by the Budapest Open Access Initiative e.g. a peer-reviewed journal article, a preprint, a working paper, a thesis, a book chapter, a report, etc. the application profile is known as the eprints application profile by the DCMI community but it’s often called the scholarly works application profile (SWAP) in the UK repositories community (to demonstrate its software independence!)

defining the problem what’s wrong with simple DC?

multiple titles, what language? normalised form? person or org? full-text or metadata? is it a uri? of what? modification? publication? is this a MIME type? local keyword or controlled scheme? what did they contribute? is this an RFC 3066 value? what relationship? is this a uri? what does this tell me? is this a citation? or something else?

what do we need metadata to do? functional requirements

functional requirements for describing scholarly works a richer metadata set consistent, good quality metadata unambiguous method of identifying full-text(s) distinguish open access materials from restricted support browse based on controlled vocabularies make use of OpenURL link servers and support citation analysis identify the research funder and project code identify the repository or other service making available the copy say when a copy of a scholarly work will be made available better search and browse options consider version identification and finding the most appropriate copy of a version support for added-value services the requirements demanded a more complex model …

what and why? the model, application profile and vocabularies

model : what’s that? it’s an entity-relationship model it says what ‘things’ we want to describe the set of entities and the key relationships between those entities several models already exist, e.g. FRBR (Functional Requirements for Bibliographic Records) CIDOC CRM for cultural heritage information Common European Research Information Format (CERIF) FRBR provides the basis for our model it’s a model for the entities that bibliographic records are intended to describe and the relationships between them it’s working in a similar space to our modelling of scholarly works and it could have wider applicability

FRBR and eprints entities there are 4 key FRBR entities: Work, Expression, Manifestation and Copy A work is a distinct intellectual or artistic creation. A work is an abstract entity An expression is the intellectual or artistic realization of a work A manifestation is the physical embodiment of an expression of a work. An item is a single exemplar of a manifestation. The entity defined as item is a concrete entity. FRBR also defines additional entities - 'Person', 'Corporate body', 'Concept', 'Object', 'Event' and 'Place‘ and the relationships between entities We use ‘Agent’ to describe a Person or Organisation We use Scholarly Work to distinguish our refinement. We use ‘Copy’ as a more appropriate entity for digital information

the model in pictures ScholarlyWork Expression 0..∞ isExpressedAs Manifestation isManifestedAs 0..∞ Copy isAvailableAs 0..∞ isPublishedBy 0..∞ isEditedBy 0..∞isCreatedBy 0..∞ isFundedBy isSupervisedBy AffiliatedInstitution Agent

from model to where? the model defines the entities and relationships each entity and its relationships are described using an agreed set of attributes / properties this is where the model ends it doesn’t tell us where to get those properties from, what vocabularies to use, how to construct our descriptions, or how to encode all of this

Dublin Core Abstract Model using Dublin Core was in-scope from the beginning the DCMI Abstract Model (DCAM) guides us on what our descriptions ‘look like’ it provides the notion of ‘description sets’ i.e. groups of related ‘descriptions’ where each ‘description’ is about an instance of one of the entities in the model and each description contains statements about each attribute using property-value pairs

application profile relationships and attributes are captured as metadata properties in the application profile contains recommendations, cataloguing/usage guidelines and examples little is mandatory (identifier and title) structured according to the entities in the model re-uses properties from existing schemes dc, dcterms, foaf, MARC relators introduces new ‘eprint’ properties supported by various value vocabularies

example properties ScholarlyWork: title (dc) subject (dc) abstract (dcterms) affiliated institution (new) identifier (dc) funder (marc) grant number (new) has adaptation (new) ScholarlyWork: title (dc) subject (dc) abstract (dcterms) affiliated institution (new) identifier (dc) funder (marc) grant number (new) has adaptation (new) Agent: name (foaf) type of agent (new) date of birth (foaf) mailbox (foaf) homepage (foaf) identifier (dc) Agent: name (foaf) type of agent (new) date of birth (foaf) mailbox (foaf) homepage (foaf) identifier (dc) Expression: title (dc) date available (dcterms) status (new) version number (new) language (dc) genre / type (dc) copyright holder (new) bibliographic citation (dc) identifier (dc) has version (new) has translation (new) Expression: title (dc) date available (dcterms) status (new) version number (new) language (dc) genre / type (dc) copyright holder (new) bibliographic citation (dc) identifier (dc) has version (new) has translation (new) Manifestation: format (dc) date modified (dcterms) Manifestation: format (dc) date modified (dcterms) Copy: date available (dcterms) access rights (dcterms) licence (dcterms) identifier (dc) Copy: date available (dcterms) access rights (dcterms) licence (dcterms) identifier (dc)

enough with the theory what does this actually mean for repositories?

revisiting the functional requirements the model and application profile mean we can support this … a richer metadata set consistent, good quality metadata unambiguous method of identifying full-text(s) distinguish open access materials from restricted support browse based on controlled vocabularies make use of OpenURL link servers and support citation analysis identify the research funder and project code identify the repository or other service making available the copy say when a copy of a scholarly work will be date available better search and browse options consider version identification and finding the most appropriate copy of a version support for added-value services

an example ‘Signed metadata’ - a paper (the eprint as scholarly work) scholarly work (work) version (expression) format (manifestation) copy (item) pdfdoc institutional repository copy pdfhtml publisher’s repository copy institutional repository copy published proceedings print copy author’s web site copy Version of Record (English) Author’s Original 1.0 … Author’s Original 1.1 Version of Record (Spanish) no digital copy available (metadata only) restricted access

thoughts on the approach … this approach is guided by the functional requirements identified and the primary use case of richer, more functional, metadata it makes it possible to group together descriptions and therefore to rationalise ‘traditional’ and ‘modern’ citations traditional citations tend to be made between eprint ‘expressions’ hypertext links tend to be made between eprint ‘copies’ (or ‘items’ in FRBR terms) a complex underlying model may be manifest in relatively simple metadata and/or end-user interfaces the application profile is for metadata exchange, it is not a blueprint for local metadata (but it can help) existing eprint systems may well capture this level of detail – but use of simple DC stops them exposing it to others!

what about interoperability? xml format and schema allows eprint description sets to be encoded, shared over oai-pmh, searched using SRU/W etc. for this exchange to happen we need deployment by developers deployment by repositories consumption and use by services dumb-down we still need to be able to create simple DC descriptions we have guidelines for dumbing-down to separate simple DC descriptions of the ScholarlyWork and each Copy simple DC about the ScholarlyWork corresponds to previous guidance simple DC about each Copy useful for getting to full-text, e.g. by Google

conclusion? community acceptance and implementation are ongoing … led by UKOLN more application profiles funded by JISC following a similar approach … Dublin Core Scholarly Communications Community