Catherine Masi, National Geospatial Digital Archive, May 16, 2005

NGDA Format Registry
- Why do we need a format registry (FR)?
  - We are designing with long-term storage in mind (> 100 years).
  - We cannot depend on a format spec being available via URL, or even on a format registry that might no longer be up to date or in existence.
  - Thus the semantic definition of a format must be archived with the object itself.
  - This semantic definition must be comprehensive, so that the format can be accessed even if current access mechanisms no longer exist.
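The idea of archiving the format definition with the object can be illustrated with a minimal sketch. The function name and the package layout (`data/` and `format_definition/`) are assumptions made for illustration only; the slides do not describe an actual NGDA package layout.

```python
import shutil
import tempfile
from pathlib import Path


def package_with_definition(obj: Path, definition_dir: Path, package: Path) -> Path:
    """Copy a data object and its full format definition into one package.

    Hypothetical layout: package/data/<object> and package/format_definition/.
    """
    (package / "data").mkdir(parents=True, exist_ok=True)
    shutil.copy2(obj, package / "data" / obj.name)
    # Copy the definition itself, not a link to it, so the package stays
    # interpretable if the registry or the original URL disappears.
    shutil.copytree(definition_dir, package / "format_definition", dirs_exist_ok=True)
    return package


if __name__ == "__main__":
    tmp = Path(tempfile.mkdtemp())
    (tmp / "definition").mkdir()
    (tmp / "definition" / "spec.txt").write_text("placeholder specification text")
    (tmp / "map.tif").write_bytes(b"placeholder bytes")
    pkg = package_with_definition(tmp / "map.tif", tmp / "definition", tmp / "package")
    print(sorted(str(p.relative_to(pkg)) for p in pkg.rglob("*") if p.is_file()))
```

Copying rather than linking costs storage, but it is the only variant that survives the loss of the external source, which is the failure case the slide is concerned with.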

NGDA Format Registry
- Two major tasks
  - Analyze and define spatial data formats (Meredith Williams)
  - Develop local format registry with programmatic interface to existing authoritative/collaborative FR (Catherine Masi)

Analyze and define spatial data formats
- Is there a comprehensive list of geospatial formats? Are they defined? How?
- List of Spatial Data Formats (MW):
  - Digital Map Formats
  - Vector File Formats
  - Raster File Formats
  - Other categories: TIN, ASCII, 3D, Tabular Databases
  - Unacceptable Formats

Analyze and define spatial data formats
- What formats do we have in ADL? How do we define them?
- ADL format documentation
  - ADL website: t/BucketDescrip.htm
  - MIME types:
  - ADL literature/presentations: Format type: hierarchical vocabulary: ADL Object Format Thesaurus
    - loosely based on MIME
    - multiple values: union
    - compare: DC.Format
  - ADL Webclient list:

Analyze and define spatial data formats
- What are our preferred formats for NGDA, if any?
  - MW tested three geospatial formats using a Sustainability Test derived from the LCDF Sustainability Test.
  - GJ: "we can ingest anything if we have the definition representation information"
  - Decided to limit allowed formats to a few in the first year: CASIL test suite (geotiff, shapefile).
  - What if there is free proprietary software, such as from ESRI, that allows one to look at the files? Should we request and archive that as well? No (UCSB).

Analyze and define spatial data formats
- How will we define our formats?
  - Using Meredith's list of Spatial Data Formats
  - Begin defining using LoC Digital Formats as an example
- How do we know that we have sufficient semantic information to define each geospatial format?
- What information is required to make the format usable? Ask the users.
- What information is required to programmatically access the format if current access mechanisms become obsolete?
- Prioritize and start with the most important/ubiquitous formats for our archive
- Coordinate with format definitions in JHOVE

Develop local format registry with programmatic interface to existing authoritative/collaborative FR
- What format registries are out there?
  - Library of Congress Digital Formats (LCDF)
  - Global Digital Format Registry (GDFR) - Harvard
    - Global Digital Format Registry Description
    - Ockerbloom's Format Registry Demonstrator (FRED)
  - PRONOM - File format registry - UK archives
    - Practical, in use, not geo-spatial

Develop local format registry with programmatic interface to existing authoritative/collaborative FR
- Coordinate our efforts with the LCDF, GDFR, FRED, TOM
  - NH initiated contact (Stephen Abrams, John Ockerbloom, Steve Morris, etc.) at DLF
  - Questions for DLF meeting to get discussion started
- The questions that we formulated showed that we have to solve a lot of these problems on our own, especially with regard to the technical aspects of building a FR and the interaction mechanisms between LC, GDFR and our local FR

Develop local format registry with programmatic interface to existing authoritative/collaborative FR
- Do the existing format registries contain geospatial formats?
  - No. In the future we will contribute geospatial formats to an existing registry effort such as LCDF or GDFR.

Develop local format registry with programmatic interface to existing authoritative/collaborative FR
- Do the existing format registries support access and contribution mechanisms?
  - No.

Develop local format registry with programmatic interface to existing authoritative/collaborative FR
- How are Library of Congress Digital Formats stored internally? Database? XML? Directory structure?
  - In MS Word files

Develop local format registry with programmatic interface to existing authoritative/collaborative FR
- Is there a data dictionary or other mechanism for defining fields in LCDF?
  - FDD

Develop local format registry with programmatic interface to existing authoritative/collaborative FR
- CM contacted Steve Morris (NCSU - NDIIPP), Stephen Abrams (Harvard - GDFR) and John Mark Ockerbloom (Penn - FRED) to open up a discussion on the technical aspects of developing a geospatial format registry.
- S. Abrams responded that GDFR is still only an idea rather than a reality, and that a technical discussion of how our GIS formats should be managed in a GDFR-conformant way is a bit premature.

Develop local format registry with programmatic interface to existing authoritative/collaborative FR
- What are the requirements for the NGDA Format Registry?
  - independent
  - contains sufficient semantic information to programmatically access format (UCSB)
  - contains geospatial reference information
  - definitions exist in simple documented format in simple directory structure
  - access/search mechanism not necessary for access
  - interfaces with collaborative authoritative FR for updates and contributions

First steps:
- CM began prototyping the physical structure of the format registry using two CASIL formats, geotiff and shapefile.
  - Created a directory-based registry.
  - Incorporated info from MW's documents Spatial Data Formats and Sustainability Test.
  - Created a record layout loosely based on Library of Congress Digital Formats but including spatial reference information.
  - Included the format spec as a local website (in the case of geotiff) and as a local pdf file (in the case of shapefile).
  - All links on a record refer to local copies of format information.
  - All documentation about a format is located locally in that format's directory.
  - Entries are not complete. This is just a first pass at what the html-rendered format entries will look like. The focus here is on physical structure rather than content.
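A directory-based registry of this kind might be sketched as follows. This is not the NGDA prototype: the field names in `RECORD_FIELDS`, the `record.txt` file name and both function names are hypothetical, since the slides only say that the layout is loosely based on Library of Congress Digital Formats with spatial reference information added. The sketch shows one directory per format, a spec stored locally in that directory, and a check that no record value points to a remote URL.

```python
import tempfile
from pathlib import Path

# Hypothetical field names, chosen for illustration only.
RECORD_FIELDS = ("name", "description", "spatial_reference", "specification")


def create_entry(registry: Path, format_id: str, record: dict, spec_bytes: bytes) -> Path:
    """Create one format directory holding the record and a local spec copy."""
    entry = registry / format_id
    entry.mkdir(parents=True, exist_ok=True)
    # Store the spec locally; the record refers to it by relative file name only.
    (entry / record["specification"]).write_bytes(spec_bytes)
    lines = [f"{name}: {record[name]}" for name in RECORD_FIELDS]
    (entry / "record.txt").write_text("\n".join(lines) + "\n", encoding="utf-8")
    return entry


def non_local_links(entry: Path) -> list:
    """Return record values that look like remote URLs rather than local files."""
    text = (entry / "record.txt").read_text(encoding="utf-8")
    values = [line.split(": ", 1)[1] for line in text.splitlines() if ": " in line]
    return [v for v in values if v.startswith(("http://", "https://"))]


if __name__ == "__main__":
    registry = Path(tempfile.mkdtemp()) / "format_registry"
    entry = create_entry(
        registry,
        "shapefile",
        {
            "name": "Shapefile",
            "description": "placeholder description",
            "spatial_reference": "placeholder",
            "specification": "shapefile_spec.pdf",
        },
        b"placeholder bytes, not a real specification",
    )
    print(sorted(p.name for p in entry.iterdir()))
    print(non_local_links(entry))
```

Because the registry is just files in directories, it stays readable without any access or search software.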

First steps:
- Refining content using input from DV, MW and from actual data users as to what is needed to adequately define a format.
  - Determine sufficient semantic info to define geospatial formats.
  - Review CASIL formats. Began to flesh out sufficient semantic info. Started with geotiff, shapefile.
  - Review record layout and add, change and delete fields.

Next steps
- Make sure the format spec is complete and all information is located locally where possible.
- Determine where we draw the line between format registry information, policy, and higher-level descriptive metadata. The format registry will stick to the format spec and a few other important fields only.
- Develop an xml stylesheet of the record layout. Decided that html, xml and pdf are acceptable archivable formats for format registry information.
- Flatten the directory structure (hierarchy) because tfw, for example, is not a subtype of geotiff but can be attached to a tiff or another format. Work more on trying to find a sensible organization for the files in our FR.
- Link to other parts of the Archive (Descriptive Metadata) from within the FR.
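One way to picture the flattened structure is to keep every format as a top-level entry and record "can be attached to" as a relation instead of a parent directory. The sketch below is an illustration under that assumption: the class, the function names and the XML element names (`format`, `name`, `attachesTo`) are all hypothetical, and the sample relation (tfw attaches to tiff) is taken only from the tfw example on the slide.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass, field


@dataclass
class FormatEntry:
    format_id: str
    name: str
    # Formats this one may accompany: a relation, not a parent directory.
    attaches_to: list = field(default_factory=list)


def to_xml(entry: FormatEntry) -> str:
    """Serialize one entry to a small XML record (element names are hypothetical)."""
    root = ET.Element("format", id=entry.format_id)
    ET.SubElement(root, "name").text = entry.name
    for target in entry.attaches_to:
        ET.SubElement(root, "attachesTo", ref=target)
    return ET.tostring(root, encoding="unicode")


def companions(registry: dict, format_id: str) -> list:
    """List ids of formats that declare they can attach to format_id."""
    return sorted(e.format_id for e in registry.values() if format_id in e.attaches_to)


# Flat registry: every format is a top-level entry keyed by its id.
registry = {
    e.format_id: e
    for e in [
        FormatEntry("tiff", "TIFF"),
        FormatEntry("geotiff", "GeoTIFF"),
        FormatEntry("tfw", "TIFF world file", attaches_to=["tiff"]),
    ]
}

if __name__ == "__main__":
    print(to_xml(registry["tfw"]))
    print(companions(registry, "tiff"))
```

With relations stored in the records, one companion format can be tied to several formats without being duplicated under each of them.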

Later
- Develop method of search, retrieval, update
- Begin to develop programmatic interface to LoC Digital Formats or other authoritative/collaborative format registry