Joachim Bauer Senior System Engineer, CCS


METS with docWorks

- What is docWorks?
- How is METS used in docWorks?
- What does the data model look like?

Illustration of docWorks
docWorks is conversion software that is typically integrated into a full digitization workflow: cropping / OCR / structuring / exporting. The steps depend on the material:
- Manuscripts: no OCR
- Born-digital material: no cropping
- Catalog cards: recording of metadata (MARC records), with METS used in a totally different scope
- Newspaper reels: splitting into single issues
The illustration of a production line is a very good match. docWorks is used for detailed and precise metadata enrichment by libraries as well as by service providers for mass digitization.
- Automated processes run as background services (servers)
- Manual quality control / correction / enrichment is done in the client application

Role of METS within docWorks
- METS is not used within docWorks: an internal data model is used within docWorks to keep intermediate data
- METS is used as the standard output format
- One METS file for each digital object: newspaper issue, book, journal issue
Default output:
- METS
- ALTO
- Master images
- Derivatives (PDF, ePUB, lossy images)

What the dW-METS files look like
- METS header <metsHdr>
- Descriptive metadata section <dmdSec>
- Administrative metadata section <amdSec>
- File inventory section <fileSec>
- Structural map <structMap>
Not used in the default output of docWorks:
- Structural map linking <structLink>
- Behavior section <behaviorSec>

METS: physical structMap
Structural map <structMap TYPE="PHYSICAL">
<div ID="DIVL1" TYPE="Newspaper">
<div ID="DIVP2" TYPE="PAGE">
<div ID="DIVP3" TYPE="PAGE">
<div ID="DIVP4" TYPE="PAGE">
[Illustration: pages with ORDER 1, 2, 3, ... 12, ...; LABEL II, III, IV, V, VI; ORDERLABEL I]
The physical structMap records:
- the page-level references
- the page numbering (printed page numbers)

METS: structural map <structMap TYPE="PHYSICAL">
Sample XML: physical structure of a newspaper with four pages
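The sample XML from this slide is not part of the transcript. As a rough sketch only, a four-page physical structMap could be generated as follows. The helper names, the ID scheme (DIVP1, DIVP2, ...) and the printed page numbers are assumptions made for illustration; they are not taken from docWorks output.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
ET.register_namespace("mets", METS_NS)


def mets(tag):
    """Return the namespace-qualified name of a METS element."""
    return f"{{{METS_NS}}}{tag}"


def build_physical_structmap(printed_page_numbers):
    """Build a physical structMap with one PAGE div per scanned page.

    The ID scheme used here is hypothetical.
    """
    struct_map = ET.Element(mets("structMap"), {"TYPE": "PHYSICAL"})
    issue = ET.SubElement(struct_map, mets("div"), {"ID": "DIVP1", "TYPE": "Newspaper"})
    for order, printed in enumerate(printed_page_numbers, start=1):
        ET.SubElement(issue, mets("div"), {
            "ID": f"DIVP{order + 1}",
            "TYPE": "PAGE",
            "ORDER": str(order),    # position in the scan sequence
            "ORDERLABEL": printed,  # page number as printed on the page
        })
    return struct_map


# Hypothetical newspaper issue with four pages
physical = build_physical_structmap(["1", "2", "3", "4"])
print(ET.tostring(physical, encoding="unicode"))
```

ORDER carries the scan sequence and ORDERLABEL the printed page number, which is how the two kinds of page numbering on the previous slide can be kept apart.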

METS: logical structMap
Structural map <structMap TYPE="LOGICAL">
<div ID="DIVL1" TYPE="Newspaper">
<div ID="DIVL2" TYPE="Issue">
<div TYPE="Article" LABEL="My first article">
<div TYPE="Article" LABEL="My second article">
The logical structMap records:
- the reading sequence
- the references to ALTO content
- the segmentation into articles, chapters, ...

METS: structural map <structMap TYPE="LOGICAL">
Sample XML: logical structure of a newspaper issue with several elements in its title section
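The sample XML itself is missing from the transcript. The sketch below shows one way a logical structMap could be read back in reading order, together with its references to ALTO content. The embedded sample, the file IDs (ALTO00001, ...) and the block IDs (TB1, ...) are invented for illustration, and pointing into ALTO through <fptr>/<area> is an assumption about how the references are encoded.

```python
import xml.etree.ElementTree as ET

NS = {"mets": "http://www.loc.gov/METS/"}
DIV = "{http://www.loc.gov/METS/}div"

# Hypothetical logical structMap: two articles, the second spans two ALTO files
SAMPLE_LOGICAL = """
<mets:structMap xmlns:mets="http://www.loc.gov/METS/" TYPE="LOGICAL">
  <mets:div ID="DIVL1" TYPE="Newspaper">
    <mets:div ID="DIVL2" TYPE="Issue">
      <mets:div ID="DIVL3" TYPE="Article" LABEL="My first article">
        <mets:fptr><mets:area FILEID="ALTO00001" BETYPE="IDREF" BEGIN="TB1"/></mets:fptr>
      </mets:div>
      <mets:div ID="DIVL4" TYPE="Article" LABEL="My second article">
        <mets:fptr><mets:area FILEID="ALTO00001" BETYPE="IDREF" BEGIN="TB2"/></mets:fptr>
        <mets:fptr><mets:area FILEID="ALTO00002" BETYPE="IDREF" BEGIN="TB1"/></mets:fptr>
      </mets:div>
    </mets:div>
  </mets:div>
</mets:structMap>
""".strip()


def articles_in_reading_order(struct_map):
    """Return (label, [(alto_file_id, alto_block_id), ...]) per article.

    Element.iter() walks the tree in document order, which is taken
    here to be the reading sequence.
    """
    result = []
    for div in struct_map.iter(DIV):
        if div.get("TYPE") != "Article":
            continue
        areas = [(a.get("FILEID"), a.get("BEGIN"))
                 for a in div.findall("mets:fptr/mets:area", NS)]
        result.append((div.get("LABEL"), areas))
    return result


articles = articles_in_reading_order(ET.fromstring(SAMPLE_LOGICAL))
for label, areas in articles:
    print(label, areas)
```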

METS: file inventory section (fileSec)
The fileSec references all files of the digital object, with one file group for each file type:
- Master images
- ALTO XML
- Further derivatives / thumbnails
- PDF (per page / whole document)
- ePUB
Adaptations are based on the customer's requirements for the repository / presentation system (ID and USE attributes).

METS: file inventory section (fileSec)
Sample XML: file section with two file groups
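Since the sample XML is not included in the transcript, here is a minimal sketch of a fileSec with two file groups, one for master images and one for ALTO. The group IDs, USE values, file IDs and paths are all hypothetical; as the previous slide notes, ID and USE are adapted to the target repository or presentation system.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
XLINK_NS = "http://www.w3.org/1999/xlink"
ET.register_namespace("mets", METS_NS)
ET.register_namespace("xlink", XLINK_NS)


def build_file_sec(groups):
    """Build a fileSec from (group_id, use, mimetype, paths) tuples."""
    file_sec = ET.Element(f"{{{METS_NS}}}fileSec")
    for group_id, use, mimetype, paths in groups:
        grp = ET.SubElement(file_sec, f"{{{METS_NS}}}fileGrp",
                            {"ID": group_id, "USE": use})
        for number, path in enumerate(paths, start=1):
            file_el = ET.SubElement(grp, f"{{{METS_NS}}}file", {
                "ID": f"{group_id}{number:05d}",  # hypothetical ID scheme
                "MIMETYPE": mimetype,
            })
            ET.SubElement(file_el, f"{{{METS_NS}}}FLocat", {
                "LOCTYPE": "URL",
                f"{{{XLINK_NS}}}href": path,
            })
    return file_sec


# Hypothetical two-page object: one group per file type
file_sec = build_file_sec([
    ("IMG", "Images", "image/tiff", ["file://./master/00001.tif", "file://./master/00002.tif"]),
    ("ALTO", "Text", "text/xml", ["file://./alto/00001.xml", "file://./alto/00002.xml"]),
])
print(ET.tostring(file_sec, encoding="unicode"))
```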

METS: administrative metadata sections (amdSec)
- One amdSec for each master image
- MIX metadata embedded
- Adaptations are based on customer requirements, e.g. scanner details taken from workflow recordings, PREMIS for copyright details or detailed recording of processing steps, or ...

METS: administrative metadata sections (amdSec)
Sample XML: administrative metadata integration into the METS file (here: MIX)
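The sample XML is not reproduced in the transcript. The following sketch shows how one amdSec per master image with embedded MIX could be assembled. It assumes MIX 2.0 (namespace http://www.loc.gov/mix/v20) wrapped in a techMD with MDTYPE="NISOIMG"; the slides do not state the MIX version, and the ID scheme and image dimensions are invented.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
MIX_NS = "http://www.loc.gov/mix/v20"  # assumption: MIX 2.0
ET.register_namespace("mets", METS_NS)
ET.register_namespace("mix", MIX_NS)


def build_amd_sec(page_number, width_px, height_px):
    """Build one amdSec holding basic MIX image characteristics."""
    amd_sec = ET.Element(f"{{{METS_NS}}}amdSec", {"ID": f"AMD{page_number:05d}"})
    tech_md = ET.SubElement(amd_sec, f"{{{METS_NS}}}techMD",
                            {"ID": f"TECHMD{page_number:05d}"})
    md_wrap = ET.SubElement(tech_md, f"{{{METS_NS}}}mdWrap", {"MDTYPE": "NISOIMG"})
    xml_data = ET.SubElement(md_wrap, f"{{{METS_NS}}}xmlData")
    mix = ET.SubElement(xml_data, f"{{{MIX_NS}}}mix")
    info = ET.SubElement(mix, f"{{{MIX_NS}}}BasicImageInformation")
    chars = ET.SubElement(info, f"{{{MIX_NS}}}BasicImageCharacteristics")
    ET.SubElement(chars, f"{{{MIX_NS}}}imageWidth").text = str(width_px)
    ET.SubElement(chars, f"{{{MIX_NS}}}imageHeight").text = str(height_px)
    return amd_sec


# Hypothetical master images: (width, height) in pixels, one entry per page
master_sizes = [(2480, 3508), (2480, 3508)]
amd_secs = [build_amd_sec(n, w, h) for n, (w, h) in enumerate(master_sizes, start=1)]
for amd_sec in amd_secs:
    print(ET.tostring(amd_sec, encoding="unicode"))
```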

METS: descriptive metadata section <dmdSec>
- One dmdSec for the whole item (book, newspaper issue, object), in MODS / MARC / DC
- One <dmdSec> for each structural unit, down to any level. Typically:
  - Chapters (books)
  - Articles (newspapers)
  - Illustrations
  - Advertisements

METS: descriptive metadata section (dmdSec)
Sample XML: descriptive metadata integration into the METS file (here: MODS)
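The sample XML is not part of the transcript. As an illustrative sketch, the code below resolves the MODS title for each structural unit by following the DMDID attribute of a logical <div> to its <dmdSec>. The trimmed sample document, its IDs and its titles are invented, and a single ID per DMDID is assumed (METS allows a space-separated list).

```python
import xml.etree.ElementTree as ET

NS = {
    "mets": "http://www.loc.gov/METS/",
    "mods": "http://www.loc.gov/mods/v3",
}
DIV = "{http://www.loc.gov/METS/}div"

# Hypothetical, heavily trimmed METS: one dmdSec for the issue, one for an article
SAMPLE_METS = """
<mets:mets xmlns:mets="http://www.loc.gov/METS/" xmlns:mods="http://www.loc.gov/mods/v3">
  <mets:dmdSec ID="MODSMD_ISSUE1">
    <mets:mdWrap MDTYPE="MODS"><mets:xmlData>
      <mods:mods><mods:titleInfo><mods:title>Example Gazette</mods:title></mods:titleInfo></mods:mods>
    </mets:xmlData></mets:mdWrap>
  </mets:dmdSec>
  <mets:dmdSec ID="MODSMD_ARTICLE1">
    <mets:mdWrap MDTYPE="MODS"><mets:xmlData>
      <mods:mods><mods:titleInfo><mods:title>My first article</mods:title></mods:titleInfo></mods:mods>
    </mets:xmlData></mets:mdWrap>
  </mets:dmdSec>
  <mets:structMap TYPE="LOGICAL">
    <mets:div ID="DIVL1" TYPE="Issue" DMDID="MODSMD_ISSUE1">
      <mets:div ID="DIVL2" TYPE="Article" DMDID="MODSMD_ARTICLE1"/>
    </mets:div>
  </mets:structMap>
</mets:mets>
""".strip()


def titles_by_unit(root):
    """Return (div TYPE, MODS title) for every div of the logical structMap."""
    titles = {}
    for sec in root.findall("mets:dmdSec", NS):
        title = sec.find(".//mods:title", NS)
        if title is not None:
            titles[sec.get("ID")] = title.text
    logical = root.find("mets:structMap[@TYPE='LOGICAL']", NS)
    return [(div.get("TYPE"), titles.get(div.get("DMDID")))
            for div in logical.iter(DIV)]


units = titles_by_unit(ET.fromstring(SAMPLE_METS))
for unit_type, title in units:
    print(unit_type, title)
```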

METS: METS header <metsHdr>
The METS header contains by default:
- Identifier
- Agent for CREATOR (software)
- Agent for CREATOR (library / company)
It is often customized to client needs, as specified by repositories / presentation systems.

METS: METS header (metsHdr)
Sample XML: header with basic document metadata
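The header sample is not reproduced in the transcript. A minimal sketch of a metsHdr with an identifier and two CREATOR agents might look like this. Storing the identifier in <altRecordID>, the TYPE/OTHERTYPE values on the agents, and all names, dates and IDs are assumptions for illustration; the slides only say that the header is often customized.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
ET.register_namespace("mets", METS_NS)


def build_mets_hdr(created, software_name, organization_name, record_id):
    """Build a metsHdr with a software agent, an organization agent and an ID."""
    hdr = ET.Element(f"{{{METS_NS}}}metsHdr", {"CREATEDATE": created})

    # Agent for the creating software
    software = ET.SubElement(hdr, f"{{{METS_NS}}}agent", {
        "ROLE": "CREATOR", "TYPE": "OTHER", "OTHERTYPE": "SOFTWARE",
    })
    ET.SubElement(software, f"{{{METS_NS}}}name").text = software_name

    # Agent for the creating library / company
    organization = ET.SubElement(hdr, f"{{{METS_NS}}}agent", {
        "ROLE": "CREATOR", "TYPE": "ORGANIZATION",
    })
    ET.SubElement(organization, f"{{{METS_NS}}}name").text = organization_name

    # Assumption: the identifier is carried in altRecordID (after the agents)
    ET.SubElement(hdr, f"{{{METS_NS}}}altRecordID").text = record_id
    return hdr


# All values below are hypothetical
header = build_mets_hdr("2014-01-01T00:00:00", "docWorks", "Example Library", "issue-0001")
print(ET.tostring(header, encoding="unicode"))
```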

What the dW-METS looks like
- METS header (metsHdr): 1 x <metsHdr>
- Descriptive metadata section (dmdSec): 1 x <dmdSec> for the whole unit, 1 x <dmdSec> for each structural unit
- Administrative metadata sections (amdSec): 1 x <amdSec> for each page (master)
- File inventory section (fileSec): 1 x <fileGrp> for each file type
- Structural map (structMap): 1 x <structMap TYPE="PHYSICAL">, 1 x <structMap TYPE="LOGICAL">
- Structural map linking (structLink)
- Behavior section (behaviorSec)
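The cardinalities listed above lend themselves to a simple structural check. The sketch below is not a docWorks tool and not a substitute for schema validation; the function name and the skeleton document (which is deliberately empty and not schema-valid) are hypothetical.

```python
import xml.etree.ElementTree as ET

NS = {"mets": "http://www.loc.gov/METS/"}
DIV = "{http://www.loc.gov/METS/}div"

# Hypothetical skeleton with one page; section contents are omitted on purpose
SKELETON = """<mets:mets xmlns:mets="http://www.loc.gov/METS/">
  <mets:metsHdr/>
  <mets:dmdSec ID="DMD1"/>
  <mets:amdSec ID="AMD1"/>
  <mets:fileSec><mets:fileGrp ID="IMGGRP" USE="Images"/></mets:fileSec>
  <mets:structMap TYPE="PHYSICAL">
    <mets:div TYPE="Newspaper"><mets:div TYPE="PAGE"/></mets:div>
  </mets:structMap>
  <mets:structMap TYPE="LOGICAL"><mets:div TYPE="Newspaper"/></mets:structMap>
</mets:mets>"""


def check_dw_mets(root):
    """Return a list of deviations from the section counts listed above."""
    problems = []
    if len(root.findall("mets:metsHdr", NS)) != 1:
        problems.append("expected exactly one metsHdr")
    if not root.findall("mets:dmdSec", NS):
        problems.append("expected at least one dmdSec")
    if not root.findall("mets:fileSec/mets:fileGrp", NS):
        problems.append("expected at least one fileGrp")

    types = sorted(sm.get("TYPE") or "" for sm in root.findall("mets:structMap", NS))
    if types != ["LOGICAL", "PHYSICAL"]:
        problems.append("expected one PHYSICAL and one LOGICAL structMap")

    # One amdSec per page: compare with the PAGE divs of the physical structMap
    physical = root.find("mets:structMap[@TYPE='PHYSICAL']", NS)
    pages = [] if physical is None else [
        d for d in physical.iter(DIV) if d.get("TYPE") == "PAGE"
    ]
    if len(root.findall("mets:amdSec", NS)) != len(pages):
        problems.append("expected one amdSec per page")
    return problems


good_result = check_dw_mets(ET.fromstring(SKELETON))
bad_result = check_dw_mets(ET.fromstring(SKELETON.replace('TYPE="LOGICAL"', 'TYPE="OTHER"')))
print(good_result)
print(bad_result)
```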

Summary: dW-METS data model
- METS is the main digital object container
- One METS file for each newspaper issue / book / journal issue
- All files are referenced from METS
- Metadata is embedded as MODS, MARC or DC
- Two <structMap> elements, for physical and logical structure
- All text content is in ALTO; all transformations to other formats (e.g. PDF, EPUB) are done from the standard METS/ALTO output
Sample METS: http://www.content-conversion.com/docworks/data/sample-mets.xml


Disclaimer
All of the information in this document is the property of CCS Content Conversion Specialists GmbH (CCS). It may NOT, under any circumstances, be distributed, transmitted, copied, or displayed without the written permission of CCS. The information contained in this document has been prepared for the sole purpose of providing information about the theme described in the title. The material contained herein has been prepared in good faith; however, CCS disclaims any obligation or warranty as to its accuracy and/or suitability for any usage or purpose other than that for which it is intended.
© CCS Content Conversion Specialists GmbH, 2014