
Descriptive Metadata
o When will mods.xml be used by METS (aip.xml)?
  - METS uses mods.xml to encode descriptive metadata: information that describes, classifies, and characterizes the identity of the content.
o How will METS (aip.xml) access MODS (mods.xml)?
  - METS uses a pointer to metadata located outside the METS document. Specifically, it uses an xlink:href attribute to indicate the location of the mods.xml file.
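The pointer mechanism can be sketched as follows. This is an illustration only, not the code from the original slide: it assumes the pointer is a METS mdRef element inside a dmdSec, and the ID value and the relative path "mods.xml" are hypothetical.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
XLINK_NS = "http://www.w3.org/1999/xlink"

# Hypothetical aip.xml fragment; the ID and the relative path are assumptions.
AIP_XML = """\
<mets:mets xmlns:mets="http://www.loc.gov/METS/"
           xmlns:xlink="http://www.w3.org/1999/xlink">
  <mets:dmdSec ID="DMD1">
    <mets:mdRef LOCTYPE="URL" MDTYPE="MODS" xlink:href="mods.xml"/>
  </mets:dmdSec>
</mets:mets>
"""


def external_metadata_refs(mets_xml, mdtype):
    """Return the xlink:href of every mdRef whose MDTYPE matches mdtype."""
    root = ET.fromstring(mets_xml)
    hrefs = []
    for md_ref in root.iter(f"{{{METS_NS}}}mdRef"):
        if md_ref.get("MDTYPE") == mdtype:
            # xlink:href lives in the XLink namespace, not the METS one.
            hrefs.append(md_ref.get(f"{{{XLINK_NS}}}href"))
    return hrefs
```

Calling external_metadata_refs(AIP_XML, "MODS") on this fragment yields the single location "mods.xml".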

MODS
o The MODS elements considered mandatory are those essential to ingesting the content into a repository:
  - originInfo
  - language
  - identifier
  - location
  - physicalDescription
  - typeOfResource
  - recordInfo
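A minimal sketch of how the mandatory list could be checked before ingest. It assumes the record uses the MODS version 3 namespace and that the mandatory elements appear as direct children of the root; the sample record and the function name are hypothetical.

```python
import xml.etree.ElementTree as ET

MODS_NS = "http://www.loc.gov/mods/v3"

# The mandatory top-level elements named on the slide.
REQUIRED = [
    "originInfo", "language", "identifier", "location",
    "physicalDescription", "typeOfResource", "recordInfo",
]

# Hypothetical, deliberately incomplete record used to exercise the check.
SAMPLE_MODS = """\
<mods:mods xmlns:mods="http://www.loc.gov/mods/v3">
  <mods:typeOfResource>text</mods:typeOfResource>
  <mods:identifier type="local">V0b002ee180b003e5</mods:identifier>
</mods:mods>
"""


def missing_mandatory_elements(mods_xml):
    """List the mandatory elements that are absent from the record root."""
    root = ET.fromstring(mods_xml)
    present = {child.tag for child in root}
    return [name for name in REQUIRED if f"{{{MODS_NS}}}{name}" not in present]
```

A repository could refuse ingest whenever the returned list is not empty.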

originInfo
o originInfo: information about the origin of the resource
  - Recommended-if-applicable sub-elements:
    - publisher: name of the entity that published the resource
    - dateIssued: date published
  - Optional sub-element:
    - issuance: indicates how the resource was issued
  - For example: publisher "U.S. Government Printing Office", issuance "monographic"
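The originInfo values above could be assembled like this. It is a sketch under the assumption that the sub-elements are the standard MODS publisher, dateIssued, and issuance elements; the helper name is hypothetical, and dateIssued is left out because no date survives on the slide.

```python
import xml.etree.ElementTree as ET

MODS_NS = "http://www.loc.gov/mods/v3"
ET.register_namespace("mods", MODS_NS)


def build_origin_info(publisher, issuance, date_issued=None):
    """Build an originInfo element; dateIssued is added only when supplied."""
    origin = ET.Element(f"{{{MODS_NS}}}originInfo")
    ET.SubElement(origin, f"{{{MODS_NS}}}publisher").text = publisher
    if date_issued is not None:
        ET.SubElement(origin, f"{{{MODS_NS}}}dateIssued").text = date_issued
    ET.SubElement(origin, f"{{{MODS_NS}}}issuance").text = issuance
    return origin


origin_info = build_origin_info("U.S. Government Printing Office", "monographic")
origin_xml = ET.tostring(origin_info, encoding="unicode")
```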

language
o language: the language of the resource
  - Required sub-element:
    - languageTerm: a repeatable sub-element that gives the language of the resource in either textual or coded form
  - For example: eng
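A short sketch of reading the language terms. It assumes the textual or coded form is signalled by the type attribute of languageTerm; the authority value and the second, textual term are added for illustration and do not come from the slide.

```python
import xml.etree.ElementTree as ET

MODS_NS = "http://www.loc.gov/mods/v3"

# "eng" is the slide's value; the textual term is an assumed companion entry.
LANGUAGE_XML = """\
<mods:language xmlns:mods="http://www.loc.gov/mods/v3">
  <mods:languageTerm type="code" authority="iso639-2b">eng</mods:languageTerm>
  <mods:languageTerm type="text">English</mods:languageTerm>
</mods:language>
"""


def language_terms(language_xml):
    """Return (form, value) pairs, where form is 'code' or 'text'."""
    root = ET.fromstring(language_xml)
    return [
        (term.get("type"), term.text)
        for term in root.findall(f"{{{MODS_NS}}}languageTerm")
    ]
```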

identifier
o identifier: a unique number or code that identifies a resource
  - type: a required attribute that indicates the type of identifier. For example, the value "local" refers to a local identifier.
  - For example: an identifier with the value V0b002ee180b003e5
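The required type attribute can be enforced when reading an identifier. This sketch assumes the slide's identifier carries type "local"; the attribute value in the sample and the function name are assumptions.

```python
import xml.etree.ElementTree as ET

# The value is from the slide; type="local" is an assumed attribute value.
IDENTIFIER_XML = (
    '<mods:identifier xmlns:mods="http://www.loc.gov/mods/v3" type="local">'
    "V0b002ee180b003e5</mods:identifier>"
)


def read_identifier(identifier_xml):
    """Return (type, value); reject identifiers without a type attribute."""
    elem = ET.fromstring(identifier_xml)
    id_type = elem.get("type")
    if id_type is None:
        raise ValueError("identifier is missing the required type attribute")
    return id_type, elem.text
```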

location
o location: indicates the repository holding the resource or a URL where the resource is available
  - url: a mandatory sub-element that holds the Uniform Resource Locator for the resource
    - displayLabel: an attribute that provides descriptive information associated with the location
    - access: an attribute that indicates the type of resource that will be accessed via the URL

location example
  - For example:
    - 111s3880is/html/BILLS-111s3880is.htm
    - 111s3880is/pdf/BILLS-111s3880is.pdf
    - 111s3880is/xml/BILLS-111s3880is.xml
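A sketch of listing the renditions recorded under location. The URL texts are kept exactly as they survive on the slide, with the leading part missing; the displayLabel and access values are assumptions chosen for illustration.

```python
import xml.etree.ElementTree as ET

MODS_NS = "http://www.loc.gov/mods/v3"

# URL texts are the truncated values from the slide; attribute values are assumed.
LOCATION_XML = """\
<mods:location xmlns:mods="http://www.loc.gov/mods/v3">
  <mods:url displayLabel="HTML rendition" access="raw object">111s3880is/html/BILLS-111s3880is.htm</mods:url>
  <mods:url displayLabel="PDF rendition" access="raw object">111s3880is/pdf/BILLS-111s3880is.pdf</mods:url>
  <mods:url displayLabel="XML rendition" access="raw object">111s3880is/xml/BILLS-111s3880is.xml</mods:url>
</mods:location>
"""


def list_urls(location_xml):
    """Return one dict per url element with its label, access type, and value."""
    root = ET.fromstring(location_xml)
    return [
        {"label": u.get("displayLabel"), "access": u.get("access"), "url": u.text}
        for u in root.findall(f"{{{MODS_NS}}}url")
    ]
```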

physicalDescription
o physicalDescription: contains all sub-elements that relate to the physical description of the resource
  - note: a recommended-if-applicable sub-element that holds physical description information not covered by any other sub-element
  - digitalOrigin: a required sub-element that describes the method used to achieve the digital form of the resource
  - extent: a recommended-if-applicable sub-element that describes the number of units that make up the resource

physicalDescription Example
  - For example: deposited; born digital; 7 p.
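The three example values can be read back as follows. This sketch assumes "deposited" is the note, "born digital" the digitalOrigin, and "7 p." the extent; that mapping is inferred from the sub-element descriptions, not stated on the slide.

```python
import xml.etree.ElementTree as ET

MODS_NS = "http://www.loc.gov/mods/v3"

# Assumed mapping of the slide's three values onto MODS sub-elements.
PHYSICAL_XML = """\
<mods:physicalDescription xmlns:mods="http://www.loc.gov/mods/v3">
  <mods:note>deposited</mods:note>
  <mods:digitalOrigin>born digital</mods:digitalOrigin>
  <mods:extent>7 p.</mods:extent>
</mods:physicalDescription>
"""


def physical_description(physical_xml):
    """Return the sub-element values; digitalOrigin is treated as required."""
    root = ET.fromstring(physical_xml)
    fields = {
        name: root.findtext(f"{{{MODS_NS}}}{name}")
        for name in ("note", "digitalOrigin", "extent")
    }
    if fields["digitalOrigin"] is None:
        raise ValueError("physicalDescription is missing digitalOrigin")
    return fields
```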

typeOfResource
o typeOfResource: information describing the form of the resource
o For example: text

recordInfo
o recordInfo: contains information about the metadata record itself
  - Required sub-elements:
    - languageOfCataloging: the language of the text of the MODS record
      o languageTerm: required; gives the language of the metadata record
  - Recommended sub-elements:
    - recordContentSource: information about the original metadata record, such as who created or modified it
    - recordOrigin: shows the origin of the MODS record
  - Optional sub-elements:
    - recordCreationDate: the date the record was created
    - recordChangeDate: the date the record was modified
    - recordIdentifier: contains the system control number assigned by the organization that holds the record

recordInfo Example
  - For example: record content source "DGPO"; record identifier "BILLS-111s3880is"; record origin "machine generated"; language of cataloging "eng"
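A sketch of pulling the recordInfo values out of a record. The element assignments are assumed from the sub-element descriptions; the creation and change dates are omitted because their values do not survive on the slide.

```python
import xml.etree.ElementTree as ET

NS = {"mods": "http://www.loc.gov/mods/v3"}

# Values are from the slide; which element holds which value is an assumption.
RECORD_INFO_XML = """\
<mods:recordInfo xmlns:mods="http://www.loc.gov/mods/v3">
  <mods:recordContentSource>DGPO</mods:recordContentSource>
  <mods:recordIdentifier>BILLS-111s3880is</mods:recordIdentifier>
  <mods:recordOrigin>machine generated</mods:recordOrigin>
  <mods:languageOfCataloging>
    <mods:languageTerm type="code">eng</mods:languageTerm>
  </mods:languageOfCataloging>
</mods:recordInfo>
"""


def record_summary(record_info_xml):
    """Return the source, identifier, origin, and cataloging language."""
    root = ET.fromstring(record_info_xml)
    return {
        "source": root.findtext("mods:recordContentSource", namespaces=NS),
        "identifier": root.findtext("mods:recordIdentifier", namespaces=NS),
        "origin": root.findtext("mods:recordOrigin", namespaces=NS),
        "language": root.findtext(
            "mods:languageOfCataloging/mods:languageTerm", namespaces=NS
        ),
    }
```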

Preservation Metadata
o When will premis.xml be used by METS (aip.xml)?
  - METS uses premis.xml to encode preservation metadata: "information a repository uses to support the digital preservation process". This includes information such as:
    o Provenance: who has had ownership of the digital object
    o Authenticity: whether the digital object is what it claims to be
    o Preservation activity: the activities that have been carried out to preserve the digital object
    o Technical environment: what is required to interpret and use the digital object
    o Rights management: the intellectual property rights that must be declared

Preservation Metadata
o How will METS (aip.xml) access PREMIS (premis.xml)?
  - METS uses a pointer to metadata located outside the METS document. Specifically, it uses an xlink:href attribute to indicate the location of the premis.xml file.

PREMIS Data Model

PREMIS Intellectual Entity
o Intellectual Entity: content that can be described as a unit (e.g. books, maps, articles)

PREMIS Object Entity
Objects are units of information in digital form. PREMIS defines three kinds of object: a file, a bitstream, or a representation.
o File: a computer file, such as a PDF, TXT, or JPEG file
o Bitstream: data bits within a file that share common properties for preservation purposes

PREMIS Object Entity
o Representation: the set of files, including structural metadata, that must be identified, stored, and maintained in order to assemble a complete rendition of an Intellectual Entity.
  - For example, the text files and image files of a magazine are together required to form a representation.

PREMIS Object Entity
o The PREMIS Data Dictionary defines the mandatory semantic units (elements) of the Object entity:
  - objectIdentifier *
  - objectCategory
  - objectCharacteristics *
  - format *
  - storage *
  * Indicates a repeatable semantic unit

objectIdentifier
o objectIdentifier: the unique identifier of the object
  - objectIdentifierType: the classification of the domain that creates the object identifier
  - objectIdentifierValue: the value of the object identifier

objectIdentifier Example
  - For example: FDsys ACP R0b002ee180b003b0
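A sketch of reading object identifiers from a PREMIS object. It assumes the PREMIS version 2 namespace and that "FDsys ACP" is the identifier type with R0b002ee180b003b0 as its value; both the split and the fragment (which omits the other mandatory units) are assumptions.

```python
import xml.etree.ElementTree as ET

NS = {"premis": "info:lc/xmlns/premis-v2"}

# Fragment only: a complete object also needs its other mandatory units.
OBJECT_XML = """\
<premis:object xmlns:premis="info:lc/xmlns/premis-v2">
  <premis:objectIdentifier>
    <premis:objectIdentifierType>FDsys ACP</premis:objectIdentifierType>
    <premis:objectIdentifierValue>R0b002ee180b003b0</premis:objectIdentifierValue>
  </premis:objectIdentifier>
</premis:object>
"""


def object_identifiers(object_xml):
    """Return (type, value) for each objectIdentifier; the unit is repeatable."""
    root = ET.fromstring(object_xml)
    return [
        (
            ident.findtext("premis:objectIdentifierType", namespaces=NS),
            ident.findtext("premis:objectIdentifierValue", namespaces=NS),
        )
        for ident in root.findall("premis:objectIdentifier", NS)
    ]
```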

objectCharacteristics
o objectCharacteristics: the technical properties of a file
  - compositionLevel: indicates whether the object is subject to one or more processes of decoding or unbundling
  - fixity: used to verify whether an object has been changed in an undocumented or unauthorized way
  - size: the size of the object
  - format: the format information of the object

objectCharacteristics Example
  - composition level: 0
  - fixity: SHA b92f0bb2642c6be368ad68a8d1d1c5dbbb db781f56a860b0a1 FDsys
  - size: 9326
  - format: text/plain PRONOM x-fmt/111 Plain Text File
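The fixity check described above amounts to recomputing a message digest and comparing it with the recorded one. A minimal sketch, assuming SHA-256 (the slide's algorithm name is cut off) and using made-up content rather than the slide's file:

```python
import hashlib


def compute_digest(data, algorithm="sha256"):
    """Return the hex digest of data using the named hashlib algorithm."""
    return hashlib.new(algorithm, data).hexdigest()


def fixity_ok(data, recorded_digest, algorithm="sha256"):
    """True when the recomputed digest matches the digest stored in PREMIS."""
    return compute_digest(data, algorithm) == recorded_digest.lower()


# Hypothetical content standing in for a stored file.
content = b"plain text rendition"
recorded = compute_digest(content)  # what the repository recorded at ingest
size_in_bytes = len(content)        # the value that would go in the size unit
```

Any later change to the bytes, even a single character, makes fixity_ok return False.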

storage
o storage: information about where and how files are stored in the repository
  - contentLocation: information needed to retrieve a file from a storage system
  - contentLocationType: the means of referencing the location of the content
  - contentLocationValue: the "location of the content used by the storage system"
  - storageMedium: the medium on which an object is stored

storage Example
  - content location type: URI
  - content location value: file:/u02/app/emc/documentum/data/fdsysprod1/fdsysprod1/content_storage_01/00002ee1/80/55/b0/48.txt
  - storage medium: hard disk
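A sketch of resolving the storage information to a file path. It assumes the PREMIS version 2 namespace and that the spaces in the slide's path are line-wrap artifacts, so they are removed here.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlparse

NS = {"premis": "info:lc/xmlns/premis-v2"}

# Path taken from the slide with the wrap-induced spaces removed (assumption).
STORAGE_XML = """\
<premis:storage xmlns:premis="info:lc/xmlns/premis-v2">
  <premis:contentLocation>
    <premis:contentLocationType>URI</premis:contentLocationType>
    <premis:contentLocationValue>file:/u02/app/emc/documentum/data/fdsysprod1/fdsysprod1/content_storage_01/00002ee1/80/55/b0/48.txt</premis:contentLocationValue>
  </premis:contentLocation>
  <premis:storageMedium>hard disk</premis:storageMedium>
</premis:storage>
"""


def read_storage(storage_xml):
    """Return the location type, raw value, filesystem path, and medium."""
    root = ET.fromstring(storage_xml)
    location = root.find("premis:contentLocation", NS)
    value = location.findtext("premis:contentLocationValue", namespaces=NS)
    return {
        "type": location.findtext("premis:contentLocationType", namespaces=NS),
        "value": value,
        "path": urlparse(value).path,  # strips the "file:" scheme
        "medium": root.findtext("premis:storageMedium", namespaces=NS),
    }
```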

PREMIS Event Entity
Events are actions that involve an object and an agent known to the system.
o Events are critical for maintaining the digital provenance of an object, which helps demonstrate the authenticity of the object.
Examples of events:
o Modifying a document
o Actions that create new relationships
  - An object can become related to another object as the result of a particular event, for instance if a program takes file 1 and generates a different version known as file 2.
o Actions that check the validity and integrity of objects (e.g. a virus scan)

PREMIS Event Entity
o The PREMIS Data Dictionary defines the mandatory semantic units (elements) of the Event entity:
  - eventIdentifier
  - eventType
  - eventDateTime

Event Entity
o eventIdentifier: unique identifier for the event
  - eventIdentifierType: the classification of the domain that creates the event identifier
  - eventIdentifierValue: the value of the event identifier
o eventType: classifies the nature of the event
o eventDateTime: the date and time at which the event occurred
o eventDetail: a detailed description of the event
o eventOutcomeInformation: the outcome of the event
o eventOutcome: indicates whether the event was a success, partial success, or failure

Event Entity
o linkingAgentIdentifier: the agents involved in the event and their specific roles. Agent roles are defined here because an agent can perform different roles in different events.
o linkingAgentIdentifierType: the classification of the domain that creates the linking agent identifier
o linkingAgentIdentifierValue: "value of the linking agent identifier"
o linkingAgentRole: indicates the role of the agent associated with the event
o linkingObjectIdentifier: the objects involved in the event and their specific roles
o linkingObjectIdentifierType: the classification of the domain that creates the linking object identifier
o linkingObjectIdentifierValue: "value of the linking object identifier"
o linkingObjectRole: indicates the role of the object associated with the event

PREMIS Event Example
  - event identifier: FDsys:event 1cdd2b6c-5a2d-449b-b386-ebb15eb4af11
  - event type: Rendition Submitted
  - event date and time: T19:38:47-04:00
  - event detail: Rendition R0b002ee180b003b0, uploaded by hotfolderadmin, was submitted in the Submission Information Package P0b002ee180b003af
  - event outcome: Success
  - linking agent: FDsys:agent hotfolderadmin, role implementer
  - linking object: FDsys R0b002ee180b003b0, role outcome
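A sketch that checks an event for the mandatory semantic units and lists its linked agents and objects. It assumes the PREMIS version 2 namespace and element layout. The date portion of the event time is cut off on the slide, so the sample leaves eventDateTime out and the check reports it as missing.

```python
import xml.etree.ElementTree as ET

NS = {"premis": "info:lc/xmlns/premis-v2"}
MANDATORY = ("eventIdentifier", "eventType", "eventDateTime")

# Values from the slide; eventDateTime is omitted because its date is cut off.
EVENT_XML = """\
<premis:event xmlns:premis="info:lc/xmlns/premis-v2">
  <premis:eventIdentifier>
    <premis:eventIdentifierType>FDsys:event</premis:eventIdentifierType>
    <premis:eventIdentifierValue>1cdd2b6c-5a2d-449b-b386-ebb15eb4af11</premis:eventIdentifierValue>
  </premis:eventIdentifier>
  <premis:eventType>Rendition Submitted</premis:eventType>
  <premis:eventOutcomeInformation>
    <premis:eventOutcome>Success</premis:eventOutcome>
  </premis:eventOutcomeInformation>
  <premis:linkingAgentIdentifier>
    <premis:linkingAgentIdentifierType>FDsys:agent</premis:linkingAgentIdentifierType>
    <premis:linkingAgentIdentifierValue>hotfolderadmin</premis:linkingAgentIdentifierValue>
    <premis:linkingAgentRole>implementer</premis:linkingAgentRole>
  </premis:linkingAgentIdentifier>
  <premis:linkingObjectIdentifier>
    <premis:linkingObjectIdentifierType>FDsys</premis:linkingObjectIdentifierType>
    <premis:linkingObjectIdentifierValue>R0b002ee180b003b0</premis:linkingObjectIdentifierValue>
    <premis:linkingObjectRole>outcome</premis:linkingObjectRole>
  </premis:linkingObjectIdentifier>
</premis:event>
"""


def missing_event_units(event_xml):
    """List the mandatory semantic units absent from the event."""
    root = ET.fromstring(event_xml)
    return [n for n in MANDATORY if root.find(f"premis:{n}", NS) is None]


def links(event_xml, kind):
    """Return (identifier value, role) pairs; kind is 'Agent' or 'Object'."""
    root = ET.fromstring(event_xml)
    prefix = f"premis:linking{kind}"
    return [
        (
            link.findtext(f"{prefix}IdentifierValue", namespaces=NS),
            link.findtext(f"{prefix}Role", namespaces=NS),
        )
        for link in root.findall(f"{prefix}Identifier", NS)
    ]
```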

PREMIS Agent Entity
o Agents are people, organizations, or software associated with the events of an object, specifically its preservation events.
  - In the data model diagram there is no arrow from the Agent entity to the Object entity, because Agents influence Objects only indirectly, through Events.

PREMIS Agent Entity
o The PREMIS Data Dictionary defines the mandatory semantic unit (element) of the Agent entity:
  - agentIdentifier: a repeatable semantic unit

Agent Entity
o agentIdentifier: unique identifier for the agent
  - agentIdentifierType: the classification of the domain that creates the agent identifier
  - agentIdentifierValue: "value of the agent identifier"
o agentName: the agent's name
o agentType: the type of agent (person, organization, or software)

PREMIS Agent Example
  - agent identifier: FDsys:agent hotfolderadmin
  - agent name: hotfolderadmin
  - agent type: Person
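The agent record above could be serialized as follows. This is a sketch assuming the PREMIS version 2 namespace; the helper names are hypothetical.

```python
import xml.etree.ElementTree as ET

PREMIS_NS = "info:lc/xmlns/premis-v2"
ET.register_namespace("premis", PREMIS_NS)


def q(name):
    """Qualify a local element name with the PREMIS namespace."""
    return f"{{{PREMIS_NS}}}{name}"


def build_agent(id_type, id_value, name, agent_type):
    """Build a PREMIS agent with one identifier, a name, and a type."""
    agent = ET.Element(q("agent"))
    identifier = ET.SubElement(agent, q("agentIdentifier"))
    ET.SubElement(identifier, q("agentIdentifierType")).text = id_type
    ET.SubElement(identifier, q("agentIdentifierValue")).text = id_value
    ET.SubElement(agent, q("agentName")).text = name
    ET.SubElement(agent, q("agentType")).text = agent_type
    return agent


agent = build_agent("FDsys:agent", "hotfolderadmin", "hotfolderadmin", "Person")
agent_xml = ET.tostring(agent, encoding="unicode")
```

The identifier value "hotfolderadmin" is the same value the event example uses in its linking agent identifier, which is how an event points back to its agent.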

References
o Data Dictionary for Preservation Metadata
o Digital Library Federation/Aquifer Implementation Guidelines for Shareable MODS Records
o FDsys Requirements Document
o MODS User Guidelines
o MODS: Uses and Features
o Understanding PREMIS
o W3C Schools