Metadata issues and DOI doi>. Presentation based on one given at the IDF members conference, July 2000… and Forthcoming additional Handbook material on.

Slides:



Advertisements
Similar presentations
The <indecs> Data Dictionary
Advertisements

Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
DOI update on progress Norman Paskin DOIs and journal publishing Ed Pentz, CrossRef DOIs and book publishing David Sidman, CDI [DOIs and non-English language.
Digital preservation: identifiers and rights
Doi> DOI Standardisation DOI Tools and Technologies.
DOI update on progress Norman Paskin DOIs and journal publishing Ed Pentz, CrossRef DOIs and book publishing David Sidman, CDI [DOIs and non-English language.
Registration agencies: DOI deployment doi>. POLICIES Any form of identifier NUMBERING DESCRIPTION framework: DOI can describe any form of intellectual.
doi> Digital Object Identifier: overview
Metadata issues and DOI doi>. overview of presentation... Background Three conclusions The metadata landscape: which schemes matter most to DOI? DOI metadata.
Resource description and access for the digital world Gordon Dunsire Centre for Digital Library Research University of Strathclyde Scotland.
IDF Open Meeting 2008: Resource Access for a Digital World International DOI Foundation Brussels, June
Digital Object Identifier workshop doi> Norman Paskin The International DOI Foundation.
IDF open meeting 2007 doi>. Eight possible innovations doi> Innovative uses of the DOI System.
Godfrey Rust, Ontologyx © Rightscom 2004Presentation to DOI Members meeting June 2004 Godfrey Rust, Ontologyx DOI Metadata consultant Metadata interoperability.
Doi> DOI – new applications panel IDF Annual Members meeting Bologna 2005.
Dc:DC November 1998 Godfrry Rust, Data Definitions Godfrey Rust, Data Definitions.
Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
DOI System: overview Norman Paskin International DOI Foundation.
The analysis Godfrey Rust, Data Definitions, London W3C DRM workshop, January 2001 January 2001.
Effective management Accurate tracking Easier automation.
RDA: A New Standard Supporting Resource Discovery Presentation given at the CLA conference session The Future of Resource Discovery: Promoting Resource.
1 MPEG-21 : Goals and Achievements Ian Burnett, Rik Van de Walle, Keith Hill, Jan Bormans and Fernando Pereira IEEE Multimedia, October-November 2003.
Philips Research France Delivery Context in MPEG-21 Sylvain Devillers Philips Research France Anthony Vetro Mitsubishi Electric Research Laboratories.
1 CS 502: Computing Methods for Digital Libraries Lecture 17 Descriptive Metadata: Dublin Core.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
The Digital Object Identifier: A Tool for E-Commerce and Rights Management doi> Glen Secor 26 Nov 01.
MPEG-21 Multimedia Framework: Status and Directions January 8, 2003 John R. Smith Pervasive Media Management Group IBM T. J. Watson Research Center 19.
1 CS 502: Computing Methods for Digital Libraries Lecture 4 Identifiers and Reference Links.
Metadata and identifiers for e- journals Copenhagen Juha Hakala Helsinki University Library
Publishing Digital Content to a LOR Publishing Digital Content to a LOR 1.
Addressing Metadata in the MPEG-21 and PDF-A ISO Standards NISO Workshop: Metadata on the Cutting Edge May 2004 William G. LeFurgy U.S. Library of Congress.
8/28/97Organization of Information in Collections Introduction to Description: Dublin Core and History University of California, Berkeley School of Information.
Piero Attanasio mEDRA: the European DOI agency The DOI as a tool for interoperability between private and public sector Athens, 14 January.
1 Chuck Koscher, CrossRef New Developments Relating to Linking Metadata Metadata Practices on the Cutting Edge May 20, 2004 Chuck Koscher Technology Director,
‘The Universal Catalogue’ a cultural sector viewpoint David Dawson Senior Policy Adviser (Digital Futures) Museums, Libraries and archives Council.
An Overview of MPEG-21 Cory McKay. Introduction Built on top of MPEG-4 and MPEG-7 standards Much more than just an audiovisual standard Meant to be a.
Metadata Xiangming Mu. What is metadata? What is metadata? (cont’) Data about data –Any data aids in the identification, description and location of.
MPEG-21 : Overview MUMT 611 Doug Van Nort. Introduction Rather than audiovisual content, purpose is set of standards to deliver multimedia in secure environment.
8th November 2000 SGML UK Meeting: XML in Publishing 1 XML and e-commerce in book and serials publishing Francis J Cave Francis Cave Digital Publishing.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
RDA : Resource Description and Access Deirdre Kiorgaard Australian Committee on Cataloguing Representative to the Joint Steering Committee for the Development.
Persistent Identifiers: A Publisher’s Perspective Cliff Morgan, John Wiley & Sons Ltd ERPANET Seminar on Persistent Identifiers University College Cork,
Lifecycle Metadata for Digital Objects (INF 389K) September 18, 2006 The Big Metadata Picture, Web Access, and the W3C Context.
1 Annual Meeting 2004 CrossRef Publishers International Linking Association, Inc Charles Hotel, Cambridge, MA November 9 th, 2004.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
Resource Description and Access Deirdre Kiorgaard Australian Committee on Cataloguing Representative to the Joint Steering Committee for the Development.
Metadata Bridget Jones Information Architecture I February 23, 2009.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Resource Description and Access (RDA) information session Deirdre Kiorgaard Australian Committee on Cataloguing Representative to the Joint Steering Committee.
C-2-C Industry workshop The future starts with DRM.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Digital Object Identifier doi> Norman Paskin The International DOI Foundation W3C DRM workshop January 22/
ISO TC46 / SC9 What is it? What does it do? So what? Brian Green.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
Identifiers for a Digital World June 29, 2010 Patricia Payton Senior Director of Publisher Relations & Content Development
Resources of a Resource By, Anupama Atmakur Pooja Adudodla.
5/29/2001Y. D. Wu & M. Liu1 Content Management for Digital Library May 29, 2001.
Metadata & Repositories Jackie Knowles RSP Support Officer.
Online Information and Education Conference 2004, Bangkok Dr. Britta Woldering, German National Library Metadata development in The European Library.
International Planetary Data Alliance Registry Project Update September 16, 2011.
1 Metadata: an overview Alan Hopkinson ILRS Middlesex University.
Norman Paskin International DOI Foundation
DRM in Proprietary Products and Digital Archive
An Overview of MPEG-21 Cory McKay.
Multimedia Content Description Interface
IDEALS at the University Of Illinois: A Case Study of Integration Between an IR and Library Discovery Systems Sarah L. Shreeves University of Illinois.
Session 2: Metadata and Catalogues
Metadata in Digital Preservation: Setting the Scene
MUMT611: Music Information Acquisition, Preservation, and Retrieval
FRBR and FRAD as Implemented in RDA
Presentation transcript:

Metadata issues and DOI doi>

Presentation based on one given at the IDF members conference, July 2000… and Forthcoming additional Handbook material on DOI Genres Metadata issues and DOI

overview of presentation... Background Three conclusions The metadata landscape Which schemes matter most to DOI? DOI metadata - practical implications DOI Genres DOI Kernels Handle and metadata Conclusion Metadata issues and DOI

Definitions of metadata popular... Metadata is data about data. Everyone logical... An item of metadata is a relationship that someone claims exists between two entities. framework functional... Metadata is the life-blood of e-commerce. John Erickson (HP)

#1: All metadata is just a view e.g. Views of a person: some (generic) ways in which you might be identified in metadata schemes... Son Legal person Agent Alien Scholar Library user Composer credit card holder Shoe purchaser Author Lottery entrant Hospital patient Citizen Car driver Rights owner Marathon runner Software licensee Parent Tax payer Club member e-consumer Back account holder Husband Charity giver Hotel guest Speeding ticket recipient DisneyWorld visitor Frequent Flyer Concert-goer Passenger Employee Voter Dog owner In each of these roles you will have different IDs and attributes. Three conclusions

#1: All metadata is just a view Creations are the same. An identifier for a published article may refer to... A manuscript The abstract work A draft A (class of) physical copy in a publication A (class of) digital copy (not in a publication) A (class of) digital copy in a publication A (class of) digital format A specific digital copy A (class of) paper copy A specific paper copy An edition A reprint A translation etc…and many combinations of the above Similar views apply to other types of creations. Three conclusions

#1: All metadata is just a view Views must not be confused for digital content and rights management. Mistaken identity can be catastrophic. Increasingly, views need to be interoperable (eg production workflow, rights, marketing within one business). The need for automated, interoperable views in d- commerce will be enormous. Three conclusions

#2: (Almost) all terms need identifiers Each of the values of a view must be defined and identified if other views are to recognize them (what do you mean by an abstract work? an edition? a format? a scholar? a dog owner? a name?) So views need comprehensive controlled vocabularies (nb our reliance on ISO language, territory, currency, time codes). Automation needs disambiguity. Terms of rights must be unambiguous. Anything may be a term of an agreement. Emergence of the value of structured ontologies for commerce (like the indecs model). Three conclusions

#3: Events are the key to interoperability Most metadata is thing or people based. static views e.g. a creation In the Web future, metadata interoperability will be achieved by describing events; relating things and people dynamic views e.g. A created B Event descriptions will also be the key to rights metadata (transactions are events) Three conclusions

These conclusions are being reached increasingly often elsewhere. There is an explosion of metadata activity: Models, Identifiers, Vocabularies, Dictionaries, Ontologies. XML/RDF schemas. Registries/Repositories/crosswalks. Technical standards. The metadata landscape

The metadata landscape for creations

Books Audio Audiovisual Libraries Copyright Journals Magazines Newspapers Standards Education Music Texts Technology Archives Museums

The metadata landscape for creations Books Audio Audiovisual Libraries Copyright Journals Magazines Newspapers Standards Education MARC CAE ISBN ISSN Music Texts EAN Technology Archives Museums UPC ISO codes 1980s

The metadata landscape for creations Books Audio Audiovisual Multimedia Libraries Copyright Journals Magazines Newspapers Standards Education MARC ISRC CAE ISBN ISSN ISAN Music ISMN CIS Texts Dublin Core EAN Technology DOI IIM Archives Museums ISWC FRBR UPC url urn Handle ISO codes mid 90s IMS

The metadata landscape for creations Books Audio Audiovisual Multimedia Libraries Copyright Journals Magazines Newspapers Standards Education MARC ISRC CAE ISBN ISSN ISAN Music ISMN CIS UMID TextsISTC Dublin Core SMPTE DMCS EPICS ONIX EAN IMS LOM abc MPEG7 MPEG21 ISO11179 RDF Technology XML schema DOI IPDA PRISM eBooks EBooks IIM NITF Archives Museums CIDOC CROSSREF ISWC P/META XrML FRBR UPC urluri urn Handle BICI SICIISO codes today

Convergence All serious schemes are becoming... Granular (parts and versions) Modular (creations within creations) Multimedia Multinational Multilingual Multipurpose EPICS/ONIX (text) SMPTE (audiovisual) SDMI/DCMS (audio/music) eBooks DOI genres CIDOC (museums/archives) FRBR (libraries) Dublin Core CIS (copyright societies) PRISM (magazines) NITF (newspapers) MPEG7 (multimedia) Result: major sector schemes are now trying to define metadata with broadly the same scope, only different emphases.

Which initiatives matter most to DOI? MPEG21 SMPTE data dictionary EPICS/ONIX XrML Criteria... Strong underlying data model Multi-purpose Extensive, structured vocabulary Commercial critical mass Outward-looking

MPEG21 Began 2000 (ISO Motion Picture Expert Group). Possible umbrella for digital multimedia standards. Place to bring technology and content standards together. MPEG track record of disciplined standards development. Most major players getting involved. Not many lawyers (yet). Short-term perception problem: MPEG is audiovisual. Is the challenge too great?

SMPTE Data Dictionary/UMID Began 1998 (Society of Motion Picture and Television Engineers). Well-structured multimedia technically-oriented data dictionary. ISO metadata registry based, good governance and update procedure. SMPTE track record of disciplined standards development. UMID (Unique Media Identifier) for digital material - complementary to editorial identifiers like DOI. Guaranteed implementation in home sector. Start point for MPEG7 metadata work.

EPICS & ONIX International EDItEUR (EPICS) and AAP (ONIX) convergence (May 2000). Substantial and extensible EPICS metadata dictionary, -model based, from which ONIX XML-tagged subset(s) are taken. Commerce-driven (Amazon etc) with transatlantic industry support and International Steering Group. Likely to be used by eBooks, ISTC. ONIX for video (Amazon initiative)? ONIX for audio? Best chance of e-commerce multimedia vocabulary and schema (and maybe d-commerce?).

Standard controlled vocabularies Existing… Territories, Language, Currency, Date/Time (ISO) Measures (U.C.U.M) Needed… Creation types Derivation types (adaptation, sample, compilation…) Contributor roles (author, translator, cameraman…) Title types (abbreviated, inverted, formal... etc) Media types (formats) Name types Identifier types Encoding types Tools/instruments User roles etc...and many identifiers need establishing or creating (Parties, Agreements, ISWC, ISTC, ISAN, UMID etc)

XrML and Rights metadata DRM (Digital Rights Management) systems at present are for unitary rights: doesnt deal with modularity. Holdup 1: Rights vocabularies need descriptive vocabularies - not yet ready. Holdup 2: Events model needed to integrate descriptions and rights - event-based tools not yet developed. XrML likely focal point for next stage before more mature interoperable developments start to emerge. DOI-R? Interested partners in a prototype?

DOI metadata - practical implications DOI Genres DOI Kernels Handle and metadata Conclusion Metadata issues and DOI

DOI Genres A genre is a DOI view: mechanism for unity in diversity. Genre based on any interest groups view of a type of creation. Functional granularity: create a genre when you need it. Genres can overlap: creations can be in multiple genres. Genre has metadata kernel, Registration Agency, Genre Development / Steering Group? Base Genre for new, unplaced DOIs. Zero Genre for legacy DOIs.

Each Genre starts from Base Genre kernel (8 elements) and may add whatever else it needs. IDF/indecs to develop a kernel extension model this September. DOI Genre vocabulary to be developed - bin tandem with EPICS/ONIX? Can/should coincide with or provide sector requirements (eg ISBN, ISRC, ISWC etc). Different Genres metadata will interoperate if vocabularies are developed within indecs/EPICS/ONIX model. DOI Metadata Kernel

DOI /ISBN DOI GenreBook Identifier ISBN Title Two for the dough Type Manifestation DerivationOriginal Primary AgentJanet Evanovich Agent Role Author DOI Base Genre Kernel Contains critical minimum metadata for basic recognition (but not complete disambiguation). Standard base vocabulary (eg manifestation, version) mean all DOI applications can expect base genre metadata. DOI Genre type (eg book) must be analysable in terms of other attributes (eg media, mode, content, subject).

DOI Genre Kernel Extensions IDF to develop an extended catalogue for all extended metadata requirements from indecs/EPICS model and vocabulary, along these lines... DOI DOI Genre Identifier(s) Title(s) + Types, Languages Type Derivation Media Encoding Genre(s) Form(s) Subject(s) Content Language + Use Type Measures + Units of Measure Content Creations Content Link Sequence, Measure Related Creations + Link Type Creation Event + Type Primary Agent + Agent Role + Tool Source Creation Date(s) Location(s) Availability Event + Type Agent + Agent Role Date(s) Location(s) Price + Type

Metadata declarations Either local webpage or central repository or both (as decided by Genre). Base kernel metadata must be declared. Genre-specific metadata is a matter for the Genre (Development Group/Registration Agency) to decide. Automated access to metadata declaration via Handle data types. XML schemas.

Roles of declared metadata = Functional spec of the DOI kernel (a) to assign a unique DOI to the creation [DOI] (b) to link the DOI to the principal local identifier of a creation (if any) to enable the integration of DOI-related applications and metadata with others [Identifier] (c) to enable a searcher or application to identify the creation by its most common name and the parties(s) responsible for its creation or publication [Title, Main Creator, Role]

Roles of declared metadata (continued) (d) to enable a searcher or application to distinguish the fundamental type of creation (abstract, physical, digital or spatio-temporal), and thereby also to distinguish between creations of different types with the same names and creators. [Type] (e) to enable a searcher or application or distinguish the mode in which the creation is (abstract, audio, visual, audiovisual, multimedia). [Mode] (f) to enable a searcher or application to determine to which DOI Genre the creation belongs [DOI Genre].

Roles of declared metadata (continued) (g) to enable basic metadata about creations from different DOI Genres to be searched, stored or processed in common applications [All, plus controlled values of Role, Type, Mode, DOI Genre].

Handle and metadata Handle data types can create a distributed database. eg etc Data types (and results) must be consistent, so the Handle data type vocabulary must be developed with great care within indecs-based model. Some data types could be genre specific.

Conclusion: DOI - the Integrator? G.Rust 1998: DOI is the most ambitious identifier in the history of the world. But now several things are becoming established... …it has a persistent, granular, flexible, unique identifier which can be a wrapper for other IDs. Not competitive - enhances legacy identifiers functionality in d-commerce. DOI as the integrating digital identifier?...a strong, established metadata model and vocabulary. …a controlled but flexible development structure. …it does not confuse names with addresses. …allows multiple, standardised automated actions. Nothing else comes close...

The DOI model Identifier DescriptionAction doi>

The DOI model Identifier DescriptionAction Rights doi> DOI for parties and events in future?

Metadata tasks for DOI ? Mapping ONIX to –reconcile any differences data dictionary –elements and iids tested in depth; for mappings maintaining iid registry –database –available to anyone building a genre schema, but not need to be public applications based on iid registry –technology tools to ease genre building developing rights management aspects of dictionary.

Metadata issues and DOI