The <indecs> Data Dictionary

Slides:



Advertisements
Similar presentations
Presented to the ALCTS FRBR Interest Group, ALA Annual, 24 June 2011
Advertisements

OMV Ontology Metadata Vocabulary April 10, 2008 Peter Haase.
Ontology Assessment – Proposed Framework and Methodology.
DOI update on progress Norman Paskin DOIs and journal publishing Ed Pentz, CrossRef DOIs and book publishing David Sidman, CDI [DOIs and non-English language.
Doi> DOI Standardisation DOI Tools and Technologies.
IDF Patent Policy & Core DOI Specification. DOI Encyclopedia to DOI Core DOI Handbook has been the DOI encyclopedia – main developments of the DOI framework.
DOI update on progress Norman Paskin DOIs and journal publishing Ed Pentz, CrossRef DOIs and book publishing David Sidman, CDI [DOIs and non-English language.
Doi> DOI and MPEG RDD Standard IDF Strategy meeting Bologna 2005.
Registration agencies: DOI deployment doi>. POLICIES Any form of identifier NUMBERING DESCRIPTION framework: DOI can describe any form of intellectual.
CISCIS CIS - The Common Information System Keith Hill International DOI Foundation 7th May, 1998.
doi> Digital Object Identifier: overview
Registration agencies: DOI deployment doi>. POLICIES Any form of identifier NUMBERING DESCRIPTION framework: DOI can describe any form of intellectual.
Resource description and access for the digital world Gordon Dunsire Centre for Digital Library Research University of Strathclyde Scotland.
IDF Open Meeting 2008: Resource Access for a Digital World International DOI Foundation Brussels, June
Enabling Access By Permission Standards for rights expression within the ONIX family Brian Green.
Workshop on the DOI System DOI SYSTEM: DATA MODEL International DOI Foundation.
IDF open meeting 2007 doi>. Eight possible innovations doi> Innovative uses of the DOI System.
Godfrey Rust, Ontologyx © Rightscom 2004Presentation to DOI Members meeting June 2004 Godfrey Rust, Ontologyx DOI Metadata consultant Metadata interoperability.
Doi> DOI – new applications panel IDF Annual Members meeting Bologna 2005.
Copyright © 2006 Data Access Technologies, Inc. Open Source eGovernment Reference Architecture Approach to Semantic Interoperability Cory Casanave, President.
Metadata vocabularies and ontologies Dr. Manjula Patel Technical Research and Development
UKOLN, University of Bath
Vocabulary Mapping Framework Tertius Ltd / International DOI Foundation Norman Paskin London Nov Next Steps.
DOI System: overview Norman Paskin International DOI Foundation.
UKOLN is supported by: Bridget Robinson and Ann Chapman From analytical model to implementation and beyond CD Focus Schema Forum, CBI Conference Centre.
Effective management Accurate tracking Easier automation.
Key to the management of intellectual property in digital media BISG/NISO The Changing Standards Landscape Washington DC, June Norman Paskin IDENTIFY.
The model: some key issues Godfrey Rust, Data Definitions Technical Coordinator, project MODELS workshop,
Mark Evans, Tessella Digital Preservation Boot Camp – PASIG meeting, Washington DC, 22 nd May 2013 PREMIS Practical Strategies For Preservation Metadata.
The Digital Object Identifier: A Tool for E-Commerce and Rights Management doi> Glen Secor 26 Nov 01.
Metadata : Setting the Scene or a Basic Introduction Wendy Duff University of Toronto, Faculty of Information Studies.
CORDRA Philip V.W. Dodds March The “Problem Space” The SCORM framework specifies how to develop and deploy content objects that can be shared and.
Domain Modelling the upper levels of the eframework Yvonne Howard Hilary Dexter David Millard Learning Societies LabDistributed Learning, University of.
System Analysis Overview Document functional requirements by creating models Two concepts help identify functional requirements in the traditional approach.
Metadata and identifiers for e- journals Copenhagen Juha Hakala Helsinki University Library
Doi> Norman Paskin, International DOI Foundation Digital Object Identifier.
Digital Object Identifier Charles Ellis: Chairman, International DOI Foundation Norman Paskin: Director, International DOI Foundation Steve Stone: Director,
Harmonising without Harm: towards an object-oriented formulation of FRBR aligned on the CIDOC CRM ontology Maja Žumer (University of Ljubljana) & Patrick.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
Piero Attanasio mEDRA: the European DOI agency The DOI as a tool for interoperability between private and public sector Athens, 14 January.
1 Chuck Koscher, CrossRef New Developments Relating to Linking Metadata Metadata Practices on the Cutting Edge May 20, 2004 Chuck Koscher Technology Director,
Profiling Metadata Specifications David Massart, EUN Budapest, Hungary – Nov. 2, 2009.
MPEG-21 : Overview MUMT 611 Doug Van Nort. Introduction Rather than audiovisual content, purpose is set of standards to deliver multimedia in secure environment.
Key to the management of intellectual property in digital media Europe-China Conference on Intellectual Property in Digital Media Shanghai Oct
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
This material was developed by Duke University, funded by the Department of Health and Human Services, Office of the National Coordinator for Health Information.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Domain Modeling In FREMA David Millard Yvonne Howard Hugh Davis Gary Wills Lester Gilbert Learning Societies Lab University of Southampton, UK.
Lifecycle Metadata for Digital Objects (INF 389K) September 18, 2006 The Big Metadata Picture, Web Access, and the W3C Context.
Towards a semantic web Philip Hider. This talk  The Semantic Web vision  Scenarios  Standards  Semantic Web & RDA.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
Lifecycle Metadata for Digital Objects November 1, 2004 Descriptive Metadata: “Modeling the World”
Categorization Recommendations for Implementing the E-Gov Act of 2002 Richard Huffine U.S. Environmental Protection Agency Co-chair, Categorization Working.
RELATORS, ROLES AND DATA… … similarities and differences.
OWL Representing Information Using the Web Ontology Language.
Intellectual Works and their Manifestations Representation of Information Objects IR Systems & Information objects Spring January, 2006 Bharat.
Digital Object Identifier doi> Norman Paskin The International DOI Foundation W3C DRM workshop January 22/
RDA: history and background Ann Huthwaite Library Resource Services Manager, QUT ACOC Seminar, Sydney, 24 October 2008.
Introduction: Databases and Database Systems Lecture # 1 June 19,2012 National University of Computer and Emerging Sciences.
Statistical process model Workshop in Ukraine October 2015 Karin Blix Quality coordinator
Attributes and Values Describing Entities. Metadata At the most basic level, metadata is just another term for description, or information about an entity.
Norman Paskin International DOI Foundation
NAMING AND MEANING Norman Paskin
Digital Object Identifier
Metadata for research outputs management
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
Data Model.
Accommodating local cataloguing traditions in a global context
Metadata in Digital Preservation: Setting the Scene
Presentation transcript:

The <indecs> Data Dictionary indecs DD N. Paskin ELECTRONIC COMMUNICATION OF LICENCE TERMS The <indecs> Data Dictionary Norman Paskin, International DOI Foundation (c) IDF 2004

Influenced by CIS and FRBR: <indecs> 1998-2000: Interoperability of Data in E-Commerce Systems: www.indecs.org Focus: generic intellectual property and how to make data about it interoperable EC + groups from the content, author, creator, library, publisher and rights communities Pioneered a model of event-based metadata as a solution for integrating rights. For “e-commerce” read “automation” Influenced by CIS and FRBR: 1995+ : Common Information System “CIS” (CISAC) – music rights 1998: Functional Requirements of Bibliographic Records “FRBR” (IFLA) – library cataloguing Has been used and developed further

doi> Why do we need a “data dictionary”? There’s lots of metadata already Which should be (re-) used People use different schemes So we need to map from one scheme to another Data (identifiers, metadata) assigned in one context or scheme may be encountered, and may be re-used, in another place (or time or scheme) - without consulting the assigner. You can’t assume that your assumptions will be known to someone else. Interoperability = the possibility of use in services outside the direct control of the issuing assigner This is a prerequisite for communication (of rights terms or anything else) Does “owner” in scheme A mean “owner” in scheme B? We need to map meanings A prerequisite for extensibility

doi> What is a “data dictionary”? A set of terms, with their definitions used in a computerized system Some data dictionaries are structured, with terms related to other terms through hierarchies and other relationships: structured data dictionaries are derived from ontologies. An ontology combines a data dictionary with a logical data model, providing a consistent and logical world view. An interoperable data dictionary contains terms from multiple computerized systems or metadata schemes, and shows the relationships they have with one another in a formal way. The purpose of an interoperable data dictionary is to support the use together of terms from different systems. Indecs DD is structured (ontology based) and interoperable

Agreed term-by-term mapping or “Crosswalk” Metadata scheme e.g. ONIX Metadata scheme e.g. SCORM Agreed term-by-term mapping or “Crosswalk”

Metadata scheme e.g. ONIX Metadata scheme e.g. SCORM

Data Dictionary Metadata scheme Metadata scheme e.g. ONIX e.g. SCORM Term “Author” Data Dictionary Metadata Scheme NormanRights Term “Writer” ONIX:Author = NormanRights:Writer

doi> Metadata interoperability: semantic problems But such mappings are not simple: Different names (and languages) for the same thing (journal_article vs SerialArticleWork) Same name for different things (title, Title) Data elements at different levels of speciality (title vs FullTitle, AlternativeTitle). Different allowed values for elements (pii vs not pii) Data at different levels of granularity (journal_article vs SerialArticleWork/SerialArticleVersion). Data in different structures (article as attribute of journal or vice versa). Data from different sources (local codes vs ONIX codes). Different contextual meaning (DOI of what…?) Different representation (1 title vs n titles). Different mandatory requirements (ISSN mandatory vs optional) Schemas are being updated all the time. . . . . etc. Requires a coherent structured approach.

doi> So how do we make sense of this? Data dictionary uses an “ontology” “An explicit formal specification of how to represent the objects, concepts and other entities that are assumed to exist in some area of interest and the relationships that hold among them” Because relationships can be complex

doi> The dictionary model The methodology is the <indecs> one (as developed in more detail for the MPEG RDD) This has also been developed further (OntologyX) It uses the “context model” – i.e. events based (a common ontology approach) We think of metadata as “thing” or “people” based. static views e.g. about “creation B” But then how do we link things, e.g. to describe rights activities? By describing “events”; relating things and people dynamic views e.g. “A created B” Events description is also the key to rights metadata all rights transactions are events

The dictionary model doi> Agent Time Place Resource

doi> Agent Norman Paskin Time Place London 2004-12-02 Resource The dictionary model doi> Agent Norman Paskin Time Place London 2004-12-02 Resource 041202BICNISO.ppt

doi> Agent Event: Norman Paskin presented 041202.ppt The dictionary model doi> Agent Event: Norman Paskin presented 041202.ppt in London on 2 Dec 2004 Time Place Resource

Building views of “metadata”… Q: “This isn’t how I think of my metadata! ” ..”it’s just a series of “things about” something. How does this more complex approach fit what I have? A: This is simply a deeper view for the purposes of analysis.. You don’t need to change your own approach. The “events” view builds from the simple “things about” view:

Building views of “metadata”… entity attribute 1. attribute view – simplest, most direct: “things about…” isbn “0297816470” Author S Pinker (values may be strings, IDs etc)

Building views of “metadata”… relationship entity 2. association or relationship view – richer, more indirect: book “0297816470” hasTitle “Words & Rules” treats attributes as defined entities and others e.g. book “0297816470” hasAuthor “Stephen Pinker” allows multiple occurrences

Building views of “metadata”… agent context resource time place 3. context view – richest, most indirect publishingEvent hasAgentType publisher “Weidenfeld” publishingEvent hasResourceType book “0297816470” publishingEvent hasTimeType dateOfPublication “2002” publishingEvent hasPlaceType placeOfPublication “UK” Analysis moves from attribution to attribution process (Event) Most efficient handling of complex multiple metadata e.g. a rights catalogue (“all rights transactions are events”) Allows analysis of complex relationships and meaning

An ontology approach uses the deeper view of metadata Three levels of attribution, moving from simple (static) to richer (dynamic events): entity attribute Attribute (static view) relationship entity Relationship agent context resource time place Context (dynamic view)

Tested iDD has a long history and is used in several major activities. Built using methodology from the <indecs> framework Used as the basis for DOI data model Used as basis for the MPEG-21 Rights Data Dictionary (RDD) Heavily influenced the current development of messaging systems for the publishing industry (ONIX) and music industry (MI3P). Methodology has been validated against the W3C ontology language OWL-DL Methodology for constructing interoperable Data Dictionaries which underlies iDD is in use commercially (Ontologyx). The International DOI Foundation (IDF) and EDItEUR intend to harmonise ONIX and DOI metadata through the use of this common data dictionary and welcome collaboration with others adopting a similar approach

Neutral as to business model The semantic analysis underlying the iDD is independent of any implementation model. It was fundamental to indecs (despite “e-commerce” in its name) that it had no inherent commercial model, and it remains so for all the work that has followed it. It is just as critical to be able to say "this is not subject to copyright" as to say the opposite; any "non-commercial" person or organization has is to be able to state that something is freely available and under what circumstances. A broad ontology, supporting rights expressions, must be able to support any kind of expression of any kind of right, agreement or licence or any terms (or none). Most organizations have the need for both freedom and protection of intellectual property in different contexts. The iDD is not solely a tool for intellectual property as “commercial property” but is neutral as to the intellectual property regime being used.

Does not mandate one metadata scheme The aim of the iDD is to facilitate mapping between schemes The more precise the input, the more precise the output e.g. a mapping from simple DC to SCORM will of necessity be “lossy” Some uses will set minimum standards e.g. DOI Registration Agencies have rules that must be followed in the DOI application to ensure that the metadata can be mapped into the iDD to declare Application Profiles Any user is otherwise free to use their own metadata schemes for gathering, storing or disseminating metadata. iDD facilitates input and output to others schemes = semantic interoperability

If reciprocally agreed, then can be an assured mapping Provides authority Every term entered into the iDD carries information on its status as to origin and mapping agreement If reciprocally agreed, then can be an assured mapping which will enable users of the dictionary to interpolate mappings from their own schemes, through iDD, to scheme A and know that this will be considered authoritative by scheme A.. Anyone contributing terms to the iDD can specify who is allowed to see or specify their own terms. Some terms will be accessible to all: e.g. ONIX, some kernel DOI terms, and the MPEG21 RDD.

Construction Based on DD methodology and Contextual Ontologyx Architecture tools, terms from various sources (ONIX, RDD, DOI) …But users need not understand the underlying concepts and construction of the iDD. It is no more a requirement to know the details than it is for the designer of a web page to read all the underlying internet protocol RFCs. A fundamental role of the IDF and others with the iDD is to provide assurance to users that the work has been peer-reviewed and tested, and make available tools. Some key features are: Extensible and granular to whatever level of detail is required. Multiple, different, specialized views are available: these include a Rights Model, based on a set of specialized Contexts. Local terms: local (internal) data elements and names can be added into the ontology External terms: incorporates external and standard schemes such ISO territory, currency and language codes, and sector specific external schemes

Use Current use of the dictionary is on a project–by-project basis using technical consultancy An automated web based look-up system for the Dictionary is under development for IDF use (and potentially others e.g. RDD) Access will be granular: those with authority to access the Dictionary able to view what is appropriate private terms are kept confidential.

iDD and the MPEG Rights Data Dictionary (RDD) ISO MPEG-21 Rights Data Dictionary is another notable data dictionary built on similar principles. The MPEG Rights Data Dictionary provides semantic interoperability for use of rights expression languages and other tools. Derives from work funded by IDF and others using the same methodology as the iDD, so closely related and fully integrated. All terms in the RDD are mapped into the iDD; RDD is one of the authorities specifying terms within iDD. RDD is therefore at present a sub set of iDD. Some future RDD terms might be added to the RDD which are not within iDD; the two Data Dictionaries would then overlap and share some common terms. The MPEG 21 RDD requirements in terms of management and availability for MPEG use are very similar to those of the iDD in relation to DOI implementation. IDF is to be the Registration Authority for the MPEG-21 RDD, and will subcontract management of the RDD and iDD to Ontologyx.

Data dictionaries 2004 IDF is authority RightsCom (Mi3p etc) IDF + Development of <indecs> 1998-2004 Black = what Red = who 2004 ISO MPEG21 RDD IDF is authority OntologyX RightsCom (Mi3p etc) indecsDD IDF + ONIX indecs (2000) EC plus many others: Framework IFPI/RIAA, MPA, IDF, DentsuMMG, Rightscom: methodology for DD CONTECS (2001+)

The <indecs> Data Dictionary indecs DD N. Paskin ELECTRONIC COMMUNICATION OF LICENCE TERMS The <indecs> Data Dictionary Norman Paskin, International DOI Foundation www.doi.org n.paskin@doi.org (c) IDF 2004