and Transmission Standard overview – and case study

Slides:



Advertisements
Similar presentations
METS: Metadata Encoding & Transmission Standard Merrilee Proffitt Society of American Archivists August 2002.
Advertisements

METS Awareness Training An Introduction to METS Digital libraries – where are we now? Digitisation technology now well established and well-understood.
Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
Introduction to METS (Metadata Encoding and Transmission Standard) Jerome McDonough New York University
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
October 28, 2003Copyright MIT, 2003 METS repositories: DSpace MacKenzie Smith Associate Director for Technology MIT Libraries.
Standards showcase: MODS, METS, MARCXML ALA Annual 2006 Rebecca Guenther and Jackie Radebaugh Network Development and MARC Standards Office Library of.
METS: An Introduction Towards a Digital Object Standard Rick Beaubien Library Systems Office U.C. Berkeley.
METS: An Introduction Structuring Digital Content.
METS In order to reconstruct the archive, we will need to understand the METS files. METS is schema that provides a flexible mechanism for encoding descriptive,
METS Dr. Heike Neuroth EMANI – Project Meeting February 14 th - 16 th, 2002 Springer-Verlag Heidelberg Göttingen State and University Library (SUB)
Creating METS Application Profiles using METS and MODS Morgan Cundiff Network Development and MARC Standards Office Library of Congress.
Fedora 3.0 and METS: A Partnership for the Organization, Presentation and Preservation of Digital Objects Open Repositories Georgia Tech, Atlanta,
1 Extending the Implementation of PREMIS to Geospatial Resources in the Stanford Digital Repository: An Exploration By Nancy J. Hoebelheinrich Metadata.
Joachim Bauer Senior System Engineer, CCS
Mark Evans, Tessella Digital Preservation Boot Camp – PASIG meeting, Washington DC, 22 nd May 2013 PREMIS Practical Strategies For Preservation Metadata.
R.Jantz, August 31, Two-day forum on PREMIS Preservation Metadata and the Trusted Digital Repositories August 31, September 1 National Library of.
METS What is METS ? What is METS ? A schema that provides a flexible mechanism for encoding descriptive, administrative, and structural metadata for a.
Keeping the pieces together: The Role of METS in the Preservation of Digital Content Robin Wendler Harvard University Library January 16, 2005 [Men in.
METS Metadata Encoding and Transmission Standard Metadata Working Group Forum April 19, 2002.
DigiTool METS Profile DigiTool Version 3.0. DigiTool METS Profile 2 What is METS? A Digital Library Federation initiative built upon the work of MOA2.
US GPO AIP Independence Test CS 496A – Senior Design Fall 2010 Team members: Antonio Castillo, Johnny Ng, Aram Weintraub, Tin-Shuk Wong.
AIP Archival Information Package – Defines how digital objects and its associated metadata are packaged using XML based files. METS (binding file) MODS.
Metadata: use of METS with Fedora Marie Lagerwall Technical Officer Centre for Learning Technology London School of Economics and.
METS: An Introduction Part III METS and MOA2. MOA2: A Brief History Digital Library Federation project started in 1997 Main goal was to create a digital.
METS: An Introduction Part II
METS: Metadata Encoding and Transmission Standard Richard Gartner Oxford University Library Services
Incompatible or Interoperable? A METS bridge for a small gap between two digital preservation software packages Lucas Mak Metadata & CatalogLibrarian
Use of METS in CDL Digital Special Collections Brian Tingle.
A METS Application Profile for Historical Newspapers
Create and Manage METS in retrodigitization Markus Enders Goettingen State and University Library
OCLC Online Computer Library Center OCLC’s Digital Archive – Disseminating with METS Jay Goodkin Software Engineer Digital Collection and Preservation.
Guest Lecture LIS 656, Spring 2011 Kathryn Lybarger.
Metadata standards, tools and processes for audio preservation at the British Library: An overview of new systems for audio description, preservation and.
Metadata Standards and Applications 4. Metadata Syntaxes and Containers.
By Carrie Moran. To examine the Metadata Object Description Schema (MODS) metadata scheme to determine its utility based on structure, interoperability.
METS Intro & Overview Mets Opening Day Germany May 7, 2007 Nancy J. Hoebelheinrich Stanford University Libraries.
Creating a Simple Page: HTML Overview
Metadata Standards and Applications 5. Applying Metadata Standards: Application Profiles.
METS-Based Cataloging Toolkit for Digital Library Management System Dong, Li Tsinghua University Library
1 The Universal Object Format - A METS Profile for an archiving and exchange format for digital objects.
METS Dissemination: Interfaces METS Opening Day 28 October, 2003 Leslie Myrick.
Web based METS creation Ralf Stockmann case study.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
Gathering Audio Metadata for the Monterey Jazz Festival Concerts OLAC 2006 By Nancy J. Hoebelheinrich, Stanford University Libraries.
An Introduction to METS Morgan Cundiff Network Development and MARC Standards Office Library of Congress Metadata Encoding and Transmission Standard.
Preservation Audio Using METS: The Sound Directions Project Robin Wendler Harvard University Library 7 May 2007.
PREMIS Rathachai Chawuthai Information Management CSIM / AIT.
METS at UC Berkeley Generating METS Objects. Background Kinds of materials: –primarily imaged content & tei encoded content archival materials: manuscripts.
HUB AND SPOKE TOOL SUITE PREMIS Implementation Fair – 7 October 2009 Bill Ingram Visiting Research Programmer University of Illinois at Urbana-Champaign.
Implementor’s Panel: BL’s eJournal Archiving solution using METS, MODS and PREMIS Markus Enders, British Library DC2008, Berlin.
Implementation of PREMIS in METS Rebecca Guenther Sr. Networking & Standards Specialist, Library of Congress PREMIS Implementation Fair San.
METS Navigator Jenn Riley John Walsh Michelle Dalmau David Jiao Indiana University Digital Library Program Digital Library Federation Spring Forum
Habing1 Integrating PREMIS and METS PREMIS Tutorial Implementers’ Panel June 21, 2007, 9:00-5:30 Library of Congress, Jefferson Building, Whittall.
OCLC Online Computer Library Center Preservation Metadata Standards PREMIS & METS Taylor Surface, OCLC.
PREMIS Implementation Fair – SF 2009 PREMIS use in Rosetta Yair Brama – Ex Libris.
METS: Implementing a metadata standard in the digital library Richard Gartner Oxford University Library Services
METS Application Profiles Morgan Cundiff Network Development and MARC Standards Office Library of Congress.
PREMIS Implementation Fair, San Francisco, CA October 7, Stanford Digital Repository PREMIS & Geospatial Resources Nancy J. Hoebelheinrich Knowledge.
IMPLEMENTATION ISSUES. How PREMIS can be used  For systems in development as a basis for metadata definition  For existing repositories as a checklist.
PREMIS at the British Library Markus Enders, The British Library PREMIS Implementation Fair, San Fransisco, CA 07 October 2009.
5. Applying metadata standards: Application profiles Metadata Standards and Applications Workshop.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
The Care and Feeding of Digital Collections Amy Jackson March 14, 2005.
Lifecycle Metadata for Digital Objects The Final Curtain December 4, 2006.
Integrating PREMIS and METS
Metadata for research outputs management
METS, MODS and PREMIS, Oh My! (and a little MIX and other schema too)
METS, MODS and PREMIS, Oh My! (and a little MIX and other schema too)
Introduction to METS (Metadata Encoding and Transmission Standard)
Presentation transcript:

and Transmission Standard overview – and case study Metadata Encoding and Transmission Standard overview – and case study Markus Enders, SUB Göttingen enders@sub.uni-goettingen.de

METS overview METS was derived from „Making of America“ format --> generalize format; usage for other media types Funded by Digital Library Federation (DLF) multiple structures are possible; type attribute can be "logical, physical" etc... nested div elements Editorial Board is steering the development helds “Mets Opening Days”

METS overview structMap central object mandatory <mets:div TYPE=”Monograph” LABEL=”From Hamburg to San Fransisco” ORDER=”1” ID="DMD1"> structMap <div> central object mandatory nested <div> store structure multiple structures (type attribute) multiple structures are possible; type attribute can be "logical, physical" etc... nested div elements

METS overview structLink structMap central object mandatory <mets:structLink> <mets:smLink xlink:from=”div1” xlink:to=”div2”> structLink structMap <div> central object mandatory nested <div> store structure multiple structures (type attribute) structLink: stores links between two <div> elements link between two div elements from different <structMap>

METS overview contains file groups structLink structMap (nested) files are contained in file groups basic technical metadata as attributes link from a <div> to one or more files structLink structMap <div> <fptr>: FileSec parallel or sequential file groups can be nested; Files are always contained in file Groups. files: attributes for checksums, size, mime type – further technical metadata can be stored in metadata section <fptr> = file pointer <par> or <seq>: several file pointers for each <div> possible; files can be parallel or sequential can link into a file: -- images: HTML coordinates -- byte-offsets -- XML IDs -- time codes (streaming media) link to streams <FileGrp> <file> link into a file

METS overview Descriptive metadata vs. Administrative Metadata metadata can be embedded or referenced XML or binary metadata extensions schemas used: MODS, DC, premis etc... m:n relationship between metadata and <div> od <file> Desc. MD extension schema Admin. MD Administrative Metadata: seperate sections: technical metadata, digital provenance metadata, rights metadata, source metadata METS does not come with an own metadata schema, but enables to plug in different extensions schemas extension schema techMD digiProvMD rightsMD sourceMD

METS overview structLink StructMap Desc. MD FileSec Admin. MD <div> extension schema FileSec Admin. MD several metadata sections for each <div> or <file> possible a single metadata section can be used by several <div> or <file> objects <FileGrp> extension schema <file> techMD digiProvMD rightsMD sourceMD

METS overview METS Header structLink StructMap Desc. MD FileSec <div> extension schema FileSec Admin. MD METS header contains information about the METS object (mets file), NOT about the content <FileGrp> extension schema <file> techMD digiProvMD rightsMD sourceMD

METS overview How does the linking work (in XML): XML IDs are used: each target must have a unique ID <mets:dmdSec ID="DMD1"> Metadata: DMDID and ADMID are of the type IDREFs <mets:div DMDID="DMD1 DMD2"> ID need only locally unique (within the same file) IDREFS: space separated pointers may point everywhere in the file: even from DMDID to <file>: file will validate Not a problem of METS data model but of XML representation File pointer: <mets:fptr FILEID="FN10081"/>

METS example (1) Digitization Centre Simple Document model (single structure) several content files per document (single TIFF image per page) bibliographic metadata logical structure for the document (table fo content) direct relationships between logical structure entities and content files This model was developed in mid 90ies, stored in XML with a proprietary metadata set

METS example (1) Digitization Centre Simple logical document model Logical structure <structMap> Content files <fileSec> Monograph 00000001.tif 00000002.tif Chapter 00000003.tif Chapter 00000004.tif 00000005.tif Max. eine Datei pro Seite; Namenskonvention bestimmt die Reihenfolge Chapter 00000006.tif Chapter 00000007.tif Chapter 00000008.tif

METS example (1) Digitization Centre Simple logical document model Logical structure <structMap> Content files <fileSec> Metadaten Monograph Metadaten 00000001.tif 00000002.tif Chapter 00000003.tif Chapter 00000004.tif 00000005.tif file can belong to several document structure entities Chapter 00000006.tif Chapter 00000007.tif Chapter 00000008.tif

METS example (1) Digitization Centre Simple logical document model Logical structure <METS:structMap TYPE="LOGICAL"> <METS:div TYPE="Monograph"DMDID="dmdlog0001"> <METS:div TYPE="TitlePage" ID="log0002"> <METS:fptr FILEID="bitonal0001"/> </METS:div> <METS:div TYPE="Dedication" ID="log0003"/> <METS:fptr FILEID="bitonal0002"/> ...... </METS:structMap> file can belong to several document structure entities

METS example (1) Digitization Centre Simple logical document model Metadata <METS:dmdSec ID="dmdlog0001"> <METS:mdWrap MDTYPE="MODS"> <METS:xmlData> <MODS:mods> ...... </MODS:mods> </METS:xmlData> </METS:mdWrap> </METS:dmdSec> MODS metadata embedded in METS

METS example (1) Digitization Centre Simple logical document model ContentFiles <METS:fileSec> <METS:fileGrp> <METS:file ID="bitonal0001" MIMETYPE="image/tiff"> <METS:FLocat LOCTYPE="URL" xlink:href="file://./00000001.tif"/> </METS:file> </METS:fileGrp> </METS:fileSec> Files are only referenced no metadata section for files; basic technical metadata is included as attributes: size, mimetype and checksum...

METS example (2) Digitization Centre Document model with two structures logical structure (TOC) physical structure (bound book, page) realtionships between structures Every structure entity has its own metadata section content files are linked to physical structure entities

METS example (2) Digitization Centre Document model with two structures Logical structure Phys. structure Content files Monograph Bound Book 00000001.tif Page 00000002.tif Chapter Page 00000003.tif Chapter Page 00000004.tif Page 00000005.tif page area: column Chapter page area 00000006.tif Chapter Page 00000007.tif Chapter Page 00000008.tif Page HiRes01.jpg Page Fulltext.xml

METS example (2) Digitization Centre Document model with two structures Map two structures <METS:structMap TYPE="LOGICAL"> <METS:div TYPE="Monograph" ID="log0001" DMDID="dmdlog0001"/> </METS:structMap> <METS:structMap TYPE="PHYSICAL"> <METS:div TYPE="BoundBook" ID="phys0001" DMDID="dmdphys0001"> <METS:div TYPE="page" ID="phys0002" DMDID="dmdphys0002"/> </METS:div> </METS:structMap>

METS example (2) Digitization Centre Document model with two structures Map two structures <METS:structLink TYPE="xxx"> <!--Monograph --> <METS:smLink from="log0001" to="phys0001"/> <!—title page--> <METS:smLink from="log0002" to="phys0002"/> </METS:structLink> link from logical to physical (pages)

METS example (2) Digitization Centre Document model with two structures Link to several files <METS:div TYPE="page" ID="phys0002" DMDID="dmdphys0002"> <METS:fptr FILEID="bitonal0001"/> <METS:fptr FILEID="hires0001"/> </METS:div> Link to page area files are neither sequential nor parallel, but alternative versions link to page area: COORDS attribute contains information where the column is <METS:div TYPE="column" ID="phys0003" DMDID="dmdphys0002"> <METS:fptr> <METS:area FILEID="bitonal00000001" COORDS="40x40x150x250"/> </METS:fptr> </METS:div>

METS example (2) Digitization Centre Document model with two structures Logical structure Phys. structure Content files Monograph Bound Book 00000001.tif Page 00000002.tif Chapter Page 00000003.tif Chapter Page 00000004.tif Page 00000005.tif Link to full text: single fulltext file (TEI) for the whole monograph Chapter page area 00000006.tif Chapter Page 00000007.tif Chapter Page 00000008.tif Page HiRes01.jpg Page Fulltext.xml

METS example (2) Digitization Centre Document model with two structures Link to fulltext (TEI): <METS:div TYPE="page"> <METS:fptr> <METS:area FILEID="teixml01" BEGIN="xx02" END"xx02"BETYPE="IDREF"/> </METS:fptr> </METS:div> <METS:div TYPE="page"> <METS:fptr> <METS:area FILEID="teixml01" BEGIN="xx02" END"xx02"BETYPE="IDREF"/> </METS:fptr> </METS:div> files are neither sequential nor parallel, but alternative versions link to page area: COORDS attribute contains information where the column is <TEI:p> <TEI:q id="xx01">....</TEI:q> <TEI:q id="xx02">....</TEI:q> <TEI:pb n="13"/> <TEI:q id="xx03">...</TEI:q> </TEI:p>

METS example (2) Digitization Centre Document model with two structures Fulltext is referenced, not embedded in METS file due to file sizes. METS file is about 2 – 3 MB fulltext is about 20 MB Use MODS for descriptive metadata for logical structure entities files are neither sequential nor parallel, but alternative versions link to page area: COORDS attribute contains information where the column is Own descriptive metadata schema for physical structure entites – storing page numbers

METS example (2) Digitization Centre Why did the GDZ choose METS: easily extendable: may start with image digitization and may add fulltext later complex structure needs to be stored Fulltext format not flexible enough: (1) TEI knows only one kind of structure (logical); does not know any pages (just page breaks). (2) no extensive metadata model --> fulltext files needs to be linked to a METS file

METS creation: By hand in XML editor (structMap the only required object) special tools for certain purposes, e.g: - conversion tools for web-archiving - ... At GDZ: GOOBI workflow management tool to do: General METS API, which implements the data model.

METS presentation: Depends on your METS file: - simple XSLT transformations - repository systems (ContentDM, Fedora etc.) - some page turners available (for digitized content)

METS-Profile Documentation Documentation is necessary: Describe objects and relationships in you document model: What objects are available What metadata are attached to those objects How are objects related to each other (trees) How to store unambiguous order? Are there non-hierarical relationships between objects? Which content files are available? How's the access granularity?

METS-Profile Documentation Documentation should not describe a format generally, but the precise usage of a packaging format. Example: How to inheirit relationships between two structure-trees? Chapter Page Page Need the column be linked to the chapter directly or is an indirect link sufficient? Page Column

METS-Profile Documentation Documentation should not describe a format generally, but the precise usage of a packaging format. Examples: How to link into fulltext files? Usage of BEGIN and END attributes How to store the order of <div> elements? What kind of <div> elements are available? Developing and sharing documentation encourages the usage of „complex document formats“ even for simple documents.  documents can be enriched with additional information later on.

METS-Profile Documentation METS Profile describes the usage of METS for a special scenario: - what extension schemas are used? - what authority files? - usage of attributes and elements METS-profile schema available; profile is an XML file, which is not machine readable. „registry“ on METS website available ähnliche Dokumente:

http://www.loc.gov/mets ähnliche Dokumente: