NOAA/NESDIS/National Oceanographic Data Center Following the Flow of Two Underway Data Streams Within the U. S. National Oceanographic Data Center Steven.

Slides:



Advertisements
Similar presentations
Rolling Deck to Repository: Transforming the United States Academic Fleet Into an Integrated Global Observing System Suzanne M. Carbotte, Robert Arko,
Advertisements

WOCE Global Data V3 WOCE-DPC Report Nathan Bindoff and David M. Legler Co-Chairs, WOCE DPC WOCE Conference November 2002 All of it.
Geospatial One-Stop A Federal Gateway to Federal, State & Local Geographic Data
Visualizing Fitness for Purpose Bob Groman and Dicky Allison Biological and Chemical Oceanography Data Management Office Woods Hole Oceanographic Institution.
The NODC Glider Technical Specification Tom Ryan, Dan Seidov, John Relph (NODC) and James Bennett (University of Washington) U.S. IOOS National Glider.
1 NODC, Russia GISC & DCPC developers meeting Langen, 29 – 31 March E2EDM technology implementation for WIS GISC development S. Sukhonosov, S. Belov.
1 Archiving Workflow between a Local Repository and the National Library Archive Experiences from the DiVA Project Eva Müller, Peter Hansson, Uwe Klosa,
Caro-COOPS Data Management: Metadata. Cast-Net addresses the need for improved connectivity among coastal observing systems by creating a regional framework.
Operational Dataset Update Functionality Included in the NCAR Research Data Archive Management System 1 Zaihua Ji Doug Schuster Steven Worley Computational.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
NOAA Metadata Update Ted Habermann. NOAA EDMC Documentation Directive This Procedural Directive establishes 1) a metadata content standard (International.
IQuOD Data Flow Tim Boyer NODC. Inflow How will IQuOD quality controlled data get into the World Ocean Database?
GHRSST Data Access Tutorial GHRSST Data Access How to Access GODAE High Resolution SST Products from the GDAC and LTSRF Kenneth.
Steve Rutz NOAA/NESDIS National Oceanographic Data Center NODC Observing Systems Team Leader June 21, 2011.
2 nd Training Workshop 4 – 5 June 2007 Common Data Index - CDI By Dick M.A Schaap Technical Coordinator SeaDataNet.
October 16-18, Research Data Set Archives Steven Worley Scientific Computing Division Data Support Section.
Metadata (for the data users downstream) RFC GIS Workshop July 2007 NOAA/NESDIS/NGDC Documentation.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Research Data at NCAR 1 August, 2002 Steven Worley Scientific Computing Division Data Support Section.
Bringing it All Together: NODC’s Geoportal Server as an Integration Tool for Interoperable Data Services Kenneth S. Casey, Ph.D. YuanJie Li NOAA National.
WP 9 (former Task 1b of WP 1): Data infrastructure Robert Huber UNI-HB Esonet 2nd all regions workshop, Paris
Geospatial One-Stop FGDC and GOS: Working as One to Build the NSDI Rob Dollison Geospatial One-Stop Program Office.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
UDDI ebXML(?) and such Essential Web Services Directory and Discovery.
IODE Ocean Data Portal - technological framework of new IODE system Dr. Sergey Belov, et al. Partnership Centre for the IODE Ocean Data Portal MINCyT,
FGDC and GOS Metadata: Foundations to Build the NSDI Sharon Shin FGDC Secretariat / Geospatial One-Stop.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Implementor’s Panel: BL’s eJournal Archiving solution using METS, MODS and PREMIS Markus Enders, British Library DC2008, Berlin.
NODC ↔ Data Consumers Steve Rutz NOAA/NESDIS National Oceanographic Data Center NODC Observing Systems Team Leader June 21, 2011.
NODC Metadata Management for Geoportal Server and Beyond John Relph NOAA National Oceanographic Data Center.
IODE Ocean Data Portal - ODP  The objective of the IODE Ocean Data Portal (ODP) is to facilitate and promote the exchange and dissemination of marine.
GPO’s Federal Digital System December 10, 2009 U.S. Government Printing Office.
1 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION NCEI-IOOS Project Updates Mathew Biddle May 28th, 2015 IOOS DMAC Meeting, IOOS.
Creating Good Documentation NOAA National Geophysical Data Center
November 16, 2009 Page 1 of 28 Data and Data Management: Introduction to the BCO-DMO Presented to Professor Keiichi Uchida November 16, 2009 Robert C.
The US Long Term Ecological Research (LTER) Network: Site and Network Level Information Management Kristin Vanderbilt Department of Biology University.
NOAAServer: Unified access to distributed NOAA data Ernest Daddio, NOAA/ESDIM Steve Hankin, NOAA/PMEL Donald Denbo, NOAA/PMEL/JISAO Nancy Soreide, NOAA/PMEL.
Hussein Suleman University of Cape Town Department of Computer Science Digital Libraries Laboratory February 2008 Data Curation Repositories:
29 March 2004 Steven Worley, NSF/NCAR/SCD 1 Research Data Stewardship and Access Steven Worley, CISL/SCD Cyberinfrastructure meeting with Priscilla Nelson.
An Introduction to the Argo Data Sytem South Pacific Workshop 11 – 14 October 2005 Mark Ignaszewski FNMOC.
U.S. Environmental Protection Agency Central Data Exchange Pilot Project Promoting Geospatial Data Exchange Between EPA and State Partners. April 25, 2007.
10 th Argo data management 2009 Toulouse What is new at GDACs ?
Global Collecting Centres ETMC-5 Activities of the GCCs Geneva 2015 Activities of the GCCs ETMC-5 22 nd – 25 th June 2015, Geneva, Switzerland.
1 1 NOAA Office of Ocean Exploration End-to-End Data Management: A Success Story NOAA Tech Conference November 2005 Susan Gottfried National Coastal Data.
The Research Data Archive at NCAR: A System Designed to Handle Diverse Datasets Bob Dattore and Steven Worley National Center for Atmospheric Research.
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
ARIADNE is funded by the European Commission's Seventh Framework Programme Archiving and Repositories Holly Wright.
Global Change Master Directory (GCMD) Mission “To assist the scientific community in the discovery of Earth science data, related services, and ancillary.
Distributed Data Servers and Web Interface in the Climate Data Portal Willa H. Zhu Joint Institute for the Study of Ocean and Atmosphere University of.
A Climate Data Portal Focused on realtime and retrospective in situ data Nancy Soreide, Don Denbo, Willa Zhu, PMEL Charles Sun, NODC Bernie Kilonsky, U.
NOAA Shipboard Data Douglas Perry NOAA Office of Marine & Aviation Operations May 3, 2006.
Data Management System to Collect, Quality Control, Distribute, and Archive Near Real-time Marine Data Jeremy J. Rolph, Jacob T. Rettig, Mark A. Bourassa,
Open Access data at VLIZ Experience in retrieving data from EMODnet “Data ingestion, archiving, citation and DOI” June 26, 2014.
5-7 May 2003 SCD Exec_Retr 1 Research Data, May Archive Content New Archive Developments Archive Access and Provision.
Physical Oceanography Distributed Active Archive Center THUANG June 9-13, 20089th GHRSST-PP Science Team Meeting GHRSST GDAC and EOSDIS PO.DAAC.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
1. 2 NOAA’s Mission To describe and predict changes in the Earth’s environment. To conserve and manage the Nation’s coastal and marine resources to ensure.
Data Management: Data Processing Types of Data Processing at USGS There are several ways to classify Data Processing activities at USGS, and here are some.
GOSUD 5 th meeting Boulder, USA 2-4 May, Status GOSUD Meeting 2-4 May, 2006 Review of progress has been extracted from the Annual Report prepared.
R2R ↔ NODC Steve Rutz NODC Observing Systems Team Leader May 12, 2011 Presented by L. Pikula, IODE OceanTeacher Course Data Management for Information.
IODE Ocean Data Portal - technological framework of new IODE system Dr. Sergey Belov, et al. Partnership Centre for the IODE Ocean Data Portal.
NASA Earth Science Data Stewardship
Data Browsing/Mining/Metadata
Flanders Marine Institute (VLIZ)
Data and Data Management: Introduction to the BCO-DMO
VI-SEEM Data Repository
Distributed Marine Data System:
Send2NCEI: Fostering Producer-Archive Propinquity..
Prepared by: Jennifer Saleem Arrigo, Program Manager
Robert Dattore and Steven Worley
Presentation transcript:

NOAA/NESDIS/National Oceanographic Data Center Following the Flow of Two Underway Data Streams Within the U. S. National Oceanographic Data Center Steven B. Rutz NOAA/NESDIS/NODC, 1315 East West Hwy SSMC3 Fourth Floor, Silver Spring, MD The stewardship of the nation's oceanographic data archive is an essential responsibility of the U.S. National Oceanographic Data Center (NODC). NODC’s focus continues to be on the long-term preservation, integrity, and accessibility of irreplaceable observational data through multiple technological and scientific generations. NODC has implemented processes to ensure that its data archive stewardship responsibilities are met, that online data discovery and retrieval services are expanded, and that adequate supporting metadata are available to guide use of the provided data. The NODC Archive Management System (AMS), launched in 2004, enables datasets to be accessioned, archived, and disseminated in a web-enabled, browser-based environment ( For the first time, NODC’s collection of over 20,000 unique accessioned data collections, ranging from individual observations to large datasets of major programs, are managed in a unified system. Two such data collections managed within the AMS are from (1) the Global Data Assembly Center (GDAC) of the International Oceanographic Data and Information Exchange’s Global Ocean Surface Underway Data Project (GOSUD), and (2) the NOAA Office of Marine and Aviation Operations (NMAO), which maintains the Scientific Computer System (SCS) aboard NOAA vessels. The following pages describe NODC’s AMS and how these two data collections (or streams) are managed within it. This poster was based on the contributions by Donald W. Collins, Eric J. Ogata, Francis J. Mitchell, Joseph Shirley and Thaila Thailambal (NODC). GOSUD contributions were aided by Thierry Carval (IFREMER) and John Relph (NODC). ARCHIVE FILE MANAGEMENT Some of the features of the AMS’s file management are: Generates a uniform directory tree structure for each unique, original dataset submission; Creates MD5 checksums for file validation Performs virus checks; Implements dataset versioning; and Provides for automated backups Each dataset archived at NODC is assigned a unique Accession Number. For each new accession, a file directory structure is automatically created. Original data and metadata files are placed in the data directory, while NODC-created information are placed in the about directory. When the ATDB record is complete, each file in the storage area is automatically checked for viruses, MD5 checksums are calculated, and then the Accession is published online for the public. If it is necessary to update an accession, these files are checked out, updated, and then re-published as a new version with the same Accession Number. ARCHIVE MANAGEMENT SYSTEM (AMS) The NODC Archive Management System (AMS) enables datasets to be accessioned, archived, and disseminated in a web-enabled, browser-based environment. NODC’s 20,000 unique accessioned datasets, ranging from individual observations to large collections by major programs, are managed in a unified system. Three major components of the AMS covered in this poster are the Archive File Management, Accession Tracking Database, and Archive Search and Retrieval. Some of the advantages of the AMS for data producers and data consumers are: Long-term data management at no charge to data producer; Data accessible to a worldwide audience long after the data producer is gone; Fulfills contractual obligations of federally funded research; and Low-cost access to global data from a reliable source. NODC ARCHIVE ACCESSION TRACKING DATA BASE (ATDB) Some of the features of the AMS’s ATDB are:  Generates a unique Accession Number for tracking each dataset submitted to NODC;  Captures basic metadata for data discovery by the public; Exports metadata into XML files that follow the FGDC Content Standard for Digital Geospatial Metadata (CSDGM);  Uses a controlled vocabulary for dataset descriptions; and  Provides a mechanism for NODC data managers to oversee the management of each dataset. When a record is created in the ATDB for a dataset submitted to NODC, an Accession number is automatically assigned. The Accession number, a unique dataset identifier, is the primary key in the ATDB. The ATDB record also contains a limited amount of descriptive metadata about each accession such as observation date ranges, submitting person and institution, data types, and geographical bounding coordinates. Upon completion of the ATDB record, the data are ready to be published online for the public. ARCHIVE SEARCH AND FILE RETRIEVAL Some of the features of the AMS’s archive search and file retrieval interface (the Ocean Archive System) are: Consumer-driven search, discovery, and retrieval of datasets archived at NODC via a web browser; Searches on nearly two dozen parameters, including data submitting and collecting institutions; Downloads an entire accession at once or individual data files; Provides checksums to ensure validity of downloaded data files. All archived data may be searched by the public at Searches are performed on the metadata in the ATDB. Data that are identified as relevant to the information needs of the search can be downloaded as a whole or as individual files. DATA PULLED Via FTP, the GDAC serves the GOSUD data in netCDF files. Each netCDF file has an associated MD5 checksum file, which is generated by the GDAC. The MD5 checksum is regenerated whenever its associated netCDF file is updated. The MD5 file is served from the same FTP directory as its associated netCDF data file. The GOSUD data are assigned NODC Accession Number Once a day, NODC pulls the netCDF and MD5 files from the GDAC into the Accession directory, which is managed within the AMS. Once downloaded, the MD5 checksums are generated for each netCDF file and compared to the MD5 checksum generated by the GDAC. If the files are corrupted, the transfer is tried again (if still corrupted, an NODC data manager is notified). Uncorrupted files are left in place within the directory structure of the AMS, where the data are served via OPeNDAP (see figure below). GOSUD GDAC INTRODUCTION IFREMER serves as the GDAC for the GOSUD project. The GDAC serves the data via FTP, OPeNDAP, and a Geographical Information System portal. The GDAC receives and QCs real-time and historical underway data from several vessels and sources. For more information on GOSUD and for data from the GDAC, go to NODC mirrors the GDAC and serves as a long- term archive for the GOSUD data. Below is a description of how the GOSUD data stream flows into NODC and is managed within the AMS. GOSUD GDAC METADATA The GOSUD GDAC data are accessioned within the AMS under Accession Metadata to track these data within NODC and for the public to find the data are stored in the ATDB (see figure below). FINDING GOSUD DATA The GOSUD data are available through: NODC OPeNDAP server ( ); HTTP ( ); FTP ( ftp://data.nodc.noaa.gov/iode/gosud ); and Ocean Archive System, the public archive search and file retrieval interface ( ). Through the Ocean Archive System, the data can be found by searching for GOSUD in the title or as the project or by searching for as the Accession number (see figure below). DATA SUBMITTED SCS data are submitted to NODC by shipboard technicians on CD-ROM or uploaded to the NODC’s FTP server. Each SCS submission, which consists of data from several cruises, is assigned an NODC Accession Number and then the files are moved to the archive file management directory structure for that Accession (see figure below). After all data and documentation (e.g., file format description) files are complete and all the required metadata are entered into the ATDB, then the Accession is published online for the public. NMAO SCS INTRODUCTION The NOAA Office of Marine and Aviation Operations (NMAO) developed the Scientific Computer System (SCS) to uniformly log data from a variety of instrument packages aboard NOAA vessels. The SCS is used on U.S. Coast Guard and other vessels, also. NODC serves as a long-term archive for a subset of the NMAO SCS data. Below is a description of how the NMAO SCS data stream flows into NODC and is managed within the AMS. NMAO SCS METADATA The SCS metadata entered into the ATDB are in the header files and are extracted from the data files (see figure below). FINDING SCS DATA The NMAO SCS data archived at NODC are available through the Ocean Archive System ( the public interface to the AMS. Through the Ocean Archive System, the data can be found by searching for NSSDAC (NOAA Shipboard Sensor Data Acquisition Database) as the project or for one of the NOAA vessels as a platform (see figure below).