Lifecycle Metadata for Digital Objects September 25, 2006 Major archival and digital library metadata schemes: How (or how not) to go about scheming.

Slides:



Advertisements
Similar presentations
METS Awareness Training An Introduction to METS Digital libraries – where are we now? Digitisation technology now well established and well-understood.
Advertisements

Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
The PREMIS Data Dictionary Michael Day Digital Curation Centre UKOLN, University of Bath JORUM, JISC and DCC.
UKOLN, University of Bath
Preservation Metadata Initiatives: Practicality, Sustainability, and Interoperability Michael Day UKOLN, University of Bath ERPANET Training.
InterPARES Project Joanne Evans and Lori Lindberg Description Cross-domain Describing and analyzing the recordkeeping capabilities of metadata sets Joanne.
Developing a Metadata Exchange Format for Mathematical Literature David Ruddy Project Euclid Cornell University Library DML 2010 Paris 7 July 2010.
Mark Evans, Tessella Digital Preservation Boot Camp – PASIG meeting, Washington DC, 22 nd May 2013 PREMIS Practical Strategies For Preservation Metadata.
InterPARES Project Joanne Evans, School of Information Management and Systems, Monash University Description Cross-domain Description Cross Domain - Metadata.
An Introduction to Metadata by Wendy Duff ECURE 2000 October 6, 2000.
Metadata: An Introduction By Wendy Duff October 13, 2001 ECURE.
© Tefko Saracevic, Rutgers University1 metadata considerations for digital libraries.
The RDF meta model: a closer look Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations.
Metadata : Setting the Scene or a Basic Introduction Wendy Duff University of Toronto, Faculty of Information Studies.
Improving access to digital resources: a mandate for order mandate: managing digital assets in tertiary education craig green,
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
UKOLUG - July Metadata for the Web RDF and the Dublin Core Andy Powell UKOLN, University of Bath UKOLN.
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
By Carrie Moran. To examine the Metadata Object Description Schema (MODS) metadata scheme to determine its utility based on structure, interoperability.
Addressing Metadata in the MPEG-21 and PDF-A ISO Standards NISO Workshop: Metadata on the Cutting Edge May 2004 William G. LeFurgy U.S. Library of Congress.
University of Illinois at Urbana-Champaign OAI Alpha Experiences Timothy W. Cole Thomas G. Habing Grainger Engineering.
Metadata: An Overview Katie Dunn Technology & Metadata Librarian
Demystifying library standards Stanislav Pejša NEES IT Meeting, DLC This work is licensed under a Creative Commons Attribution 3.0 Unported License.Creative.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
TEXT ENCODING INITIATIVE (TEI) Inf 384C Block II, Module C.
Lifecycle Metadata for Digital Objects October 4, 2004 Creation Metadata.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Metadata: Essential Standards for Management of Digital Libraries ALI Digital Library Workshop Linda Cantara, Metadata Librarian Indiana University, Bloomington.
Lifecycle Metadata for Digital Objects (INF 389K) September 18, 2006 The Big Metadata Picture, Web Access, and the W3C Context.
In Dublin’s fair city, where the metadata are so pretty… John Roberts Archives New Zealand.
Lifecycle Metadata for Digital Objects September 11, 2002 Major archival and digital library metadata schemes.
1 Metadata –Information about information – Different objects, different forms – e.g. Library catalogue record Property:Value: Author Ian Beardwell Publisher.
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
Resource Description and Access Deirdre Kiorgaard Australian Committee on Cataloguing Representative to the Joint Steering Committee for the Development.
Discovery Metadata for Special Collections Concepts, Considerations, Choices William E. Moen School of Library and Information Sciences Texas Center for.
Metadata Bridget Jones Information Architecture I February 23, 2009.
OCLC Online Computer Library Center Preservation Metadata Standards PREMIS & METS Taylor Surface, OCLC.
2nd Concertation Day 18 February 2000 The Charity Centre RSLP Collection Description.
METS: Implementing a metadata standard in the digital library Richard Gartner Oxford University Library Services
Lifecycle Metadata for Digital Objects November 1, 2004 Descriptive Metadata: “Modeling the World”
Evidence from Metadata INST 734 Doug Oard Module 8.
Introduction to Metadata Jenn Riley Metadata Librarian IU Digital Library Program.
Slavic Digital Text Workshop 2006 The Open Archives Initiative Protocol for Metadata Harvesting: an Opportunity for Sharing Content in a Distributed Environment.
Metadata for digital preservation: a review of recent developments Michael Day UKOLN, University of Bath ECDL2001, 5th European Conference.
Integrating Access to Digital Content Sarah Shreeves University of Illinois at Urbana-Champaign Visual Resources Association 23 rd Annual Conference Miami.
A centre of expertise in digital information managementwww.ukoln.ac.uk DCMI Affiliates: Implications for Institutions Rosemary Russell UKOLN University.
The RDF meta model Basic ideas of the RDF Resource instance descriptions in the RDF format Application-specific RDF schemas Limitations of XML compared.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
RECORDKEEPING METADATA STANDARDS: THE INTERNATIONAL CONTEXT Barbara Reed, Director, Recordkeeping Innovation.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
Lifecycle Metadata for Digital Objects October 23, 2006 Creation Metadata.
Differences and distinctions: metadata types and their uses Stephen Winch Information Architecture Officer, SLIC.
Lifecycle Metadata for Digital Objects November 15, 2004 Preservation Metadata.
Collection Management Systems
Lifecycle Metadata for Digital Objects September 4, 2002 Overall framework: OZ meets WC3.
Cedars work on metadata Michael Day UKOLN, University of Bath Cedars Workshop Manchester, February 2002.
Describing resources II: Dublin Core CERN-UNESCO School on Digital Libraries Rabat, Nov 22-26, 2010 Annette Holtkamp CERN.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
A RCHIVAL COLLECTIONS IN A D IGITAL W ORLD Cheryl Walters Nov. 6, 2008.
Lifecycle Metadata for Digital Objects With an emphasis on preservation…
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
RECORDKEEPING METADATA STANDARDS: THE INTERNATIONAL CONTEXT
Lifecycle Metadata for Digital Objects
Lifecycle Metadata for Digital Objects
Metadata for research outputs management
Cataloging the Internet
Metadata for Digital Objects
Metadata in Digital Preservation: Setting the Scene
Oya Y. Rieger Cornell University Library May 2004
Presentation transcript:

Lifecycle Metadata for Digital Objects September 25, 2006 Major archival and digital library metadata schemes: How (or how not) to go about scheming

NHPRC Initiatives, Research Issues in Electronic Records publication (no longer online) ectronic_records/research_issues_report.html#recom mendations ectronic_records/research_issues_report.html#recom mendations 1996 conference on electronic records research du/e-recs/Report du/e-recs/Report review of research agenda, MN Historical Society ml ml

NHPRC Initiatives, cont. For links to extant online NHPRC project results, see: interpares/bibliography/NHPRC.htm interpares/bibliography/NHPRC.htm For a list of all funded NHPRC e- records projects, see: /electronic-records/projects.html /electronic-records/projects.html

University of Pittsburgh Project NHPRC funding, Overlapped with “Camp Pitt”, (which spread the word about need) “Business Acceptable Communications” assured by warrant from non-archival contexts Emphasis on evidence and on postcustodial strategies for managing records Emphasis on transactions as records

Warrant by Functional Requirements (for system) Conscientious organization 1 Compliant Accountable recordkeeping system 2 Responsible 3 Implemented 4 Consistent Captured Records 5 Comprehensive 6 Identifiable 7 Complete 8 Authorized Maintained Records 9 Preserved 10 Removable Usable Records 11 Exportable 12 Accessible 13 Redactable Major elements here are organization, recordkeeping system, records

Warrant by Practice Lawyers Auditors Records Managers Information Technologists Managers (mostly ISO 9000, 9001) Medical Professions

Pittsburgh metadata reference model in six layers Note now available (rescued from loss at Pittsburgh) at ml ml Handle [URI] Terms & Conditions [IP, privacy, etc] Structura l Contextual Content Use History [entire life history] Ordering depends on the assumption that metadata will be encapsulated as part of the record

I. Handle layer (ID + description) Unique identifier Record declaration (i.e., as record) Transaction domain (creation context) Transaction instance (date-stamp etc.) Discovery metadata Description standard (e.g. namespace) Descriptors Language

II. Terms & Conditions Layer Restrictions status (any “holds” on data) Access conditions (for restricted records) Use conditions (licenses, redactions, etc. Disposition requirements (retention, destruction)

III. Structural Layer (technical + preservation) File identification metadata (of constituent files) File encoding metadata (standards used) File rendering metadata (standards required) Record rendering metadata (standards required) Content structure metadata (for e.g. databases) Source metadata (creator + capture event)

IV. Contextual Layer (provenance + evidence) Transaction context metadata (people + transaction) Responsibility metadata (Organizational information; org chart, etc.) System accountability metadata (system audit)

V. Content Layer Actual data Any constituent files Any internal markup

VI. Use History Layer Type Instance User Consequences

Indiana University test of BAC Funded by NHPRC Evaluating administrative recordkeeping systems at IU Testing functional requirements Mapping metadata requirements Elimination of “metadata encapsulated objects” (record separated from metadata) Reduction in structural metadata Pulled back from record-level metadata to record, file, class levels in many cases Influenced by MoReq (I.e., )

InterPARES Project Funded by NHPRC, SSHRC Initially a University of British Columbia project that led to DoD STD Aim to establish characteristics of a reliable and authentic electronic record InterPARES is international project funded by NHPRC, SSRC, etc. Aim to establish rest of record life cycle (i.e., after creation and classification)

InterPARES case studies Examine digital recordkeeping systems in wide variety of contexts worldwide Qualitative methods used to discover how records are used, carry out functional analysis Data used to provide basis for modeling preservation processes

InterPARES basis in diplomatics 16 th -19 th -century method for establishing genuineness of documents Defines four types of records: Dispositive (form is essence of evidence) Probative (written form part of evidence) Supporting (written form discretionary, procedurally linked to action) Narrative (provides context)

InterPARES Authenticity template Documentary form Extrinsic elements Intrinsic elements Annotations Medium Context

A quick example e/ delay2.html Do we believe this document? Why? What metadata does it incorporate?

InterPARES findings, 2002 Hopes for a clear typology of record forms dashed after four rounds Contemporary systems too fluid for model No fixed form or content No annotations Embedded in social contexts Managed procedurally

InterPARES 2 Follow-on from InterPARES 1 Addresses “new” file formats: Experiential Dynamic Interactive Description cross-domain Metadata Schema Registry Metadata Specification Model Literary Warrant Database

Meanwhile, in Australia… Bearman and Australia (postcustodialism) Specifics of Australian problems Relatively young government Relatively small government Swift computerization Context: Australian national government recordkeeping + regulated industries

Sue McKemmish, “Yesterday, Today, and Tomorrow” Application of Frank Upward’s records continuum model Integration of continuum process Importance of continuum (compare to life cycle) Governance and accountability Collective memory and identity Records as assets Can be expanded to the idea of any kind of information, not just records Does not require centralized custodianship Heals records manager/archivist split

Identity Evidentiality Trans- actionality Recordkeeping containers Dimension 1 CREATE Dimension 2 CAPTURE Dimension 3 ORGANISE Dimension 4 PLURALISE Actor(s) Trace Transaction [Archival] Document Unit(s) evidence Activity record(s) Organisation Corporate / Individual Memory Function archive Institution Collective Memory Purpose Archives The Records Continuum c.Frank Upward, all rights reserved

“Yesterday” II Four dimensions of the continuum (all active through life of record): Create: actors, acts, documents, trace Capture: reliable recordkeeping systems Organize: entire recordkeeping regime Pluralize: social/archival context for access Liberatory assumptions provide for multiple views

“Create Once, Use Many Times” project Reuse and inheritance of metadata from many contexts Avoid retrospective description Use standards in schema registries Use web services [Use ontology-matching] “Metadata Broker” concept Note: most archives and library activity so far is concentrated at the attribute space/value space level (see next slide)

“Create Once” metadata layers concept Layer 3 (a) Attribute Space (e.g. LOM, Dublin Core, MES, indecs) (b) Value Space (e.g. ontologies, classifications, controlled vocabularies, taxonomies) Layer ? Left out of model: covers cultural and temporal conceptual change Layer 2 Representation (e.g. XML, RDF, DAML-OIL) Layer 1 Transport and Exchange (e.g. HTTP Get, OAI Protocol for Metadata Harvesting)

RKMS Australian Metadata standard Agent Rights Management Title Subject Description Language Relation Coverage Function Date Type Aggregation Level Format Record Identifier Management History Use History Preservation History Location Disposal Mandate

What about EAD? Supported by SAA Archival description focuses on aggregates of objects EAD was created to mark up finding aids, not actual objects So far only experimental use has attempted to add document-level records to EAD descriptions Addition of EAC for detailed name authority control:

Dublin Core Metadata Initiative Supported by OCLC Primarily a surrogate/discovery metadata scheme Does not aim to document everything Goal to be easy to use: a “boundary object” for many communities of practice Especially useful for management of active digital objects

Dublin Core elements Title Creator Subject Description Publisher Contributor Date Type Format Identifier Source Language Relation Coverage Rights

Dublin Core development Initial development of simple elements Subelements and user communities: Qualified Dublin Core Warwick Framework container architecture (ca. 1996) RDF and XML: toward namespaces within DC or including DC

Dublin Core in HTML environment Example: MDAH

Library of Congress Metadata Efforts See webpage: MARC (21, AMC, etc.; MARCXML) FRBR multiple-manifestation issue ISBD(G) insistence on nonrepeatable source fields MODS and MADS (Metadata Object/Authority Description Schema) METS (Metadata Encoding and Transmission Standard)

Metadata Encoding and Transmission Standard (METS) Developed out of LoC’s MOA project, available 2003 Initially designed to support maintenance of library of digitized digital objects Note importance of transmission emphasis: not meant to be an internal format Compare to the “manifest” idea for structuring multimedia objects (MPEG-21) Three overall types of metadata, introduced by header for METS document creation context

METS Descriptive metadata External (e.g., finding aid) Internal (part of the document) Provides namespace reference for any XML-encoded non-METS metadata LoC is using MODS, DC, and MARCXML Provides an XML “wrapper” for non-XML metadata (e.g. MARC)

METS Administrative metadata Technical metadata (including e.g. NISO still/MIX) Intellectual property rights metadata Source metadata (re analog source) Digital provenance metadata Relations between files Migration/transformation history data

METS Structural metadata File group list (all the files included) Structural map (defines hierarchical relations between files and METS element structure) Behavior segment (associates executable methods like viewers, render engines, etc. with specific content elements)

METS and XML The METS XML schema ts_xsd/mets.html

Why is it all so complicated? Existing library/archives professions have lacked IT skills for substantive innovation Little impetus to re-vision practice Little awareness of successful external practice (cf. data archives) Existing professions have had different goals Providing access to text blocks Providing high-level access to undescribed collections Preserving physical objects

Why should we care about library/archives schemes? Long track record in descriptive metadata Understanding of ontology construction Indeed most of ALL activity on metadata at the attribute space/value space level is based on library/archives understandings, even if restructured Australian “clever recordkeeping” project is a glimmer of what is needed to blend with Semantic Web/web services/distributed intelligence views of a worldwide information appliance…