H a r v a r d U n i v e r s i t y L i b r a r y Global Digital Format Registry An Update July 2006.

Slides:



Advertisements
Similar presentations
How to Set Up a System for Teaching Files, Conferences, and Clinical Trials Medical Imaging Resource Center.
Advertisements

NATIONAL LIBRARY OF MEDICINE PubMed Central Edwin Sequeira National Library of Medicine May 26, 2004.
Making and Moving Metadata: Two Library of Congress Initiatives Sally McCallum NDMSO, Library of Congress NISO/BISG Forum - June 22, 2012.
Putting together a METS profile. Questions to ask when setting down the METS path Should you design your own profile? Should you use someone elses off.
Collaborating to Compile Information about Formats The vision, the current state, and the challenges for format registries Caroline R. Arms Library of.
Preserving and Sharing Digital Data Greg Colati, Director, Archives and Special Collections May 11, 2012.
Spatial Data Infrastructure: Concepts and Components Geog 458: Map Sources and Errors March 6, 2006.
LIFECYCLE METADATA FOR DIGITAL OBJECTS Danielle Cunniff Plumer School of Information The University of Texas at Austin Summer 2014.
Overview of OASIS SOA Reference Architecture Foundation (SOA-RAF)
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
UKOLN is supported by: OAI-ORE a perspective on compound information objects ( Defining Image Access.
3. Technical and administrative metadata standards Metadata Standards and Applications.
1 ISO – Metadata Next Generation International consensus being built on structured metadata within a broader Geomatics Standard under ISO Technical.
1 Using Scalable and Secure Web Technologies to Design Global Format Registry Muluwork Geremew, Sangchul Song and Joseph JaJa Institute for Advanced Computer.
UKOLN is supported by: A non-technical introduction to: OAI-ORE ( Defining Image Access project meeting.
COMP 6703 eScience Project Semantic Web for Museums Student : Lei Junran Client/Technical Supervisor : Tom Worthington Academic Supervisor : Peter Strazdins.
BitstreamFormat Renovation: DSpace Gets Real Technical Metadata.
A Registry for controlled vocabularies at the Library of Congress
Data Sources & Using VIVO Data Visualizing Scholarship VIVO provides network analysis and visualization tools to maximize the benefits afforded by the.
Introduction to UDDI From: OASIS, Introduction to UDDI: Important Features and Functional Concepts.
An Overview of Selected ISO Standards Applicable to Digital Archives Science Archives in the 21st Century 25 April 2007 Donald Sawyer - NASA/GSFC/NSSDC.
UKOLUG - July Metadata for the Web RDF and the Dublin Core Andy Powell UKOLN, University of Bath UKOLN.
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
Robert Sharpe, Tessella PRELIDA Workshop 2013 ENSURE Linked Data Registry.
Catherine Masi, National Geospatial Digital Archive May 16, 2005 NGDA Format Registry  Why do we need a FR? We are designing with long-term storage in.
SC32 WG2 Metadata Standards Tutorial Metadata Registries and Big Data WG2 N1945 June 9, 2014 Beijing, China.
Data Management Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
Methods For Web Page Design 6. Methods Why use one? What it covers –Possibly all stages Feasibility Analysis Design Implementation Testing –Maybe just.
International Council on Archives Section on University and Research Institution Archives Michigan State University September 7, 2005 Preserving Electronic.
How to build your own Dark Archive (in your spare time) Priscilla Caplan FCLA.
H ARVARD U NIVERSITY L IBRARY The Global Digital Format Registry (GDFR) Project Stephen Abrams Harvard University Andreas Stanescu OCLC CNI Fall Task Force.
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
MPEG-21 : Overview MUMT 611 Doug Van Nort. Introduction Rather than audiovisual content, purpose is set of standards to deliver multimedia in secure environment.
Update on UDFR (Unified Digital Format Registry) NDIIPP Meeting June 25, 2009 Andrea Goethals.
Preservation and Archiving Special Interest Group Spring Meeting San Francisco, May 2008 Preservation Characterization Stephen Abrams California.
ESRI User Conference, August 8, 2006 Long-term archiving of geospatial data: the NGDA project Julie Sweetkind-Singer John Banning Stanford University.
Interfacing Registry Systems December 2000.
An Introduction to METS Morgan Cundiff Network Development and MARC Standards Office Library of Congress Metadata Encoding and Transmission Standard.
File format registries - a global infrastructure for local persistence Andreas Aschenbrenner, ERPANET.
XML Web Services Architecture Siddharth Ruchandani CS 6362 – SW Architecture & Design Summer /11/05.
Ocean Observatories Initiative Data Management (DM) Subsystem Overview Michael Meisinger September 29, 2009.
Use Cases and Functional Requirements Goal: Agree on prioritization and scope of requirements Sources – UDFR Technical Working Group: The Functional Requirements.
9 th Open Forum on Metadata Registries Harmonization of Terminology, Ontology and Metadata 20th – 22nd March, 2006, Kobe Japan. Presentation Title: Day:
John Mark OckerbloomMay 10, 2004 The Typed Object Model Support for diverse formats John Mark Ockerbloom File Formats for Preservation Seminar May 10,
Evolving MARC 21 for the future Rebecca Guenther CCS Forum, ALA Annual July 10, 2009.
METS Application Profiles Morgan Cundiff Network Development and MARC Standards Office Library of Congress.
Global Digital Format Registry Progress Andrea Goethals, Harvard University Library NDIIPP Digital Preservation Partners’ Meeting Arlington, VA July 9,
Information Architecture WG: Report of the Spring 2004 Meeting May 13, 2004 Dan Crichton, NASA/JPL.
Introduction to the Semantic Web and Linked Data
Overview of SC 32/WG 2 Standards Projects Supporting Semantics Management Open Forum 2005 on Metadata Registries 14:45 to 15:30 13 April 2005 Larry Fitzwater.
Preservation Program Digital Preservation Program Digital Preservation Services: Extending tools to meet campus needs Patricia Cruse, Director, Digital.
Public Access: Update on Progress National Science Foundation April 2, 2014.
How to Set Up a System for Teaching Files, Conferences, and Clinical Trials Medical Imaging Resource Center.
PREMIS Data Dictionary and the Future of Preservation Metadata Brian Lavoie Research Scientist OCLC Research Society of American Archivists.
E-Government Initiative Geospatial Information One-Stop FGDC Coordination Group January 10, 2002 John Moeller.
Cedars work on metadata Michael Day UKOLN, University of Bath Cedars Workshop Manchester, February 2002.
Repository-specific Spoke Scripts Content Repository JSR-170/283 Content Repository for Java Technology API Normalized H&S METS Files METS Import/ExportMETS.
Digitizing Historical Newspapers South Carolina Digital Newspaper Program's participation with the Library of Congress' Chronicling America: Historic American.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
The National Archives Washington DC July 10, 2008
The Global Digital Format Registry (GDFR) Project
? What is Institutional Repository for Rutgers University
Knowledge Management Systems
Global Digital Format Registry (GDFR)
CS 501: Software Engineering Fall 1999
Web services, WSDL, SOAP and UDDI
PREMIS Tools and Services
Executive Sponsor: Tom Church, Cabinet Secretary
Presentation transcript:

H a r v a r d U n i v e r s i t y L i b r a r y Global Digital Format Registry An Update July 2006

H a r v a r d U n i v e r s i t y L i b r a r y Global Digital Format Registry “The Global Digital Format Registry (GDFR) will provide sustainable services to collect, review, store, discover, and deliver significant representation information about digital formats.” –Centrally-organized collection and review –Distributed storage, discovery, and delivery via a peer-to-peer network

H a r v a r d U n i v e r s i t y L i b r a r y The GDFR project Harvard University Library (HUL) funded for 2 years by the Mellon Foundation Staffing and technical work subcontracted by HUL to OCLC (June 2006) Project oversight –Steering Committee (SC) for policy oversight –Technical Working Group (TWG) for technical oversight –Active solicitation of the international stakeholder community for review and comment

H a r v a r d U n i v e r s i t y L i b r a r y Deliverables Functional requirements Technical specifications Implementation plan (technology platform) Inter-nodal protocol Reference software implementation for nodes –Released under LGPL Editorial process Initial population Succession plan

H a r v a r d U n i v e r s i t y L i b r a r y Schedule Month 1Staffing, establish public web site Months 2-6Consultation, design, prototyping Public discussion planned for DLF Fall Forum, Boston, November 2006 Months 7-12Protocol, node implementation Months 13-18Initial population, inter-nodal testing Months 19-24Integration testing

H a r v a r d U n i v e r s i t y L i b r a r y What is a format? “A serialization of an abstract information model” –A set of syntactic and semantic rules for mapping from an information model to a byte stream (and, in most instances, for mapping back) Encompasses the nominal sense of “file format” as well as a range of conceptual models from the micro to the macro level –IEEE 754 floating point number … File system

H a r v a r d U n i v e r s i t y L i b r a r y GDFR network Peer-to-peer network communicating over a common protocol Structured delegation for distribution –DNS analogy “Root” node Top-level nodes –Distribution classes Local data Unvetted data Vetted data

H a r v a r d U n i v e r s i t y L i b r a r y Representation Information Identifiers Responsibility Classification Relationships Specifications Signatures Grammar Tools Assessment

H a r v a r d U n i v e r s i t y L i b r a r y Identifiers Canonical and alias identifiers in a variety of naming systems –Common usage“TIFF” –MIME“image/tiff” –PRONOM PUID“fmt/10” –LC FDD“fdd000022” Canonical GDFR-defined identifier in the “info” URI scheme

H a r v a r d U n i v e r s i t y L i b r a r y Responsibility Creator Owner Maintenance agency and process Legal conditions for use

H a r v a r d U n i v e r s i t y L i b r a r y Classification Ontological CLASSES, abstract families, concrete formats, and relationships BYTESTREAM IMAGE STILL RASTER GIF GIF87a GIF89ais-new-version-ofGIF87a JPEG ISO JFIFis-subtype-ofISO TIFF TIFF 4.0 TIFF 5.0is-new-version-ofTIFF 4.0 TIFF 6.0is-new-version-ofTIFF 5.0 TIFF/ITis-subtype-ofTIFF 6.0 TIFF/IT/CTis-subtype-ofTIFF/IT TIFF/IT/CT/P1is-subtype-ofTIFF/IT/CT

H a r v a r d U n i v e r s i t y L i b r a r y Relationships Subtype ASCIIis-subtype-ofUTF-8 UTF-8has-subtypeASCII Version TIFF 6.0is-version-ofTIFF 5.0 TIFF 5.0has-versionTIFF 6.0 Encapsulation WAVEcan-containμ-law μ-lawis-contained-byWAVE Affinity JPEGis-similar-toSPIFF SPIFFis-similar-toJPEG

H a r v a r d U n i v e r s i t y L i b r a r y Specifications Bibliographic citation, including descriptive (e.g. ISBN) and actionable (e.g. (URI) identifiers IP considerations probably prohibit the free distribution of specification documents

H a r v a r d U n i v e r s i t y L i b r a r y Signatures External –Generally indicative –File extension(s) Internal –Generally dispositive –Magic number –Other well-defined internal syntactic structures

H a r v a r d U n i v e r s i t y L i b r a r y Grammar Formal notation of a format Typed to permit multiple parallel formulations, e.g. BNF, ABNF, BSDL, DFDL, EAST May be feasible only for relatively simple formats

H a r v a r d U n i v e r s i t y L i b r a r y Tools Services, systems, and tools using formats as inputs or outputs Described in terms of some functional taxonomy, e.g. edit, transform, render

H a r v a r d U n i v e r s i t y L i b r a r y Assessment Format-specific risk assessment Typed to permit multiple parallel formulations –LC Sustainability/Quality & Functionality (SQF) –OCLC INFORM –DSTC PANIC –Cornell Virtual Remote Control (VRC)

H a r v a r d U n i v e r s i t y L i b r a r y General development goals First create a generalized registry framework, then specialize it for the GDFR application –To the extent that this does not effect other goals and schedules Platform/network transport independent Full information content of GDFR is expressible in XML form GDFR network is re-instantiatable from its XML expression

H a r v a r d U n i v e r s i t y L i b r a r y Related Work PRONOM Representation Information Registry/Repository dev.dcc.ac.uk/twiki/bin/view/Main/DCCRegRepV04 LC Digital Formats Web NARA GDFR governance investigation