1 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION NCEI-IOOS Project Updates Mathew Biddle May 28th, 2015 IOOS DMAC Meeting, IOOS.

Slides:



Advertisements
Similar presentations
Archive Requirements Working Group A NOAA clearinghouse for requirement planning in support of the science objectives related to archive, access, and reprocessing.
Advertisements

PRINCIPLES OF A CALIBRATION MANAGEMENT SYSTEM
Data and Information Framework: Principles Sue Barrell Bureau of Meteorology, Australia CBS-Ext.(14), Asuncion, September 2014.
Peter Griffith and Megan McGroddy 4 th NACP All Investigators Meeting February 3, 2013 Expectations and Opportunities for NACP Investigators to Share and.
Integrated Ocean Observing System (IOOS) Data Management and Communication (DMAC) Standards Process Julie Bosch NOAA National Coastal Data Development.
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
Symposium on Digital Curation in the Era of Big Data: Career Opportunities and Educational Requirements Workforce Demand and Career Opportunities From.
The NODC Glider Technical Specification Tom Ryan, Dan Seidov, John Relph (NODC) and James Bennett (University of Washington) U.S. IOOS National Glider.
QARTOD III November 2-4, 2005 Metadata in the IOOS Community Julie Bosch NOAA Coastal Data Development Center QARTOD III November 2–4, 2005.
Euseden INTERNAL AUDIT & ASSURANCE SERVICES.
NOAA Metadata Update Ted Habermann. NOAA EDMC Documentation Directive This Procedural Directive establishes 1) a metadata content standard (International.
Developing an IOOS Data Management Plan that Satisfies NOAA’s Requirements and IOOS Certification Requirements John Ten Hoeve 9/11/13.
Steve Rutz NOAA/NESDIS National Oceanographic Data Center NODC Observing Systems Team Leader June 21, 2011.
Ingest and Dissemination with DAITSS Presented by Randy Fischer, Programmer, Florida Center for Library Automation, University of Florida DigCCurr2007.
The Case for Data Stewardship: Preserving the Scientific Record Matthew Mayernik National Center for Atmospheric Research Version 2.0 [Review Date]
What is an Inventory Program for? Dr. Emilio Moceo Ph.D Director of Studies Meet international obligations and expectations Inform international, national,
1 Quality Assurance In moving information from statistical programs into the hands of users we have to guard against the introduction of error. Quality.
Earth Observing System Data and Information System (EOSDIS) provides access to more than 3,000 types of Earth science data products and specialized services.
Bringing it All Together: NODC’s Geoportal Server as an Integration Tool for Interoperable Data Services Kenneth S. Casey, Ph.D. YuanJie Li NOAA National.
Demystifying the Business Analysis Body of Knowledge Central Iowa IIBA Chapter December 7, 2005.
material assembled from the web pages at
1 SPSRB Decision Brief on Declaring a Product Operational Instructions / Guidance This template will be used by NESDIS personnel to recommend to the SPSRB.
M u l t I b e a m III W o r k s h o p M u l t I b e a m III W o r k s h o p National Geophysical Data Center / World Data Centers NOAA Slide 1 End-to-End.
NOAA Administrative Order : Management of Environmental and Geospatial Data and Information Jeff Arnfield NOAA’s National Climatic Data Center Version.
NODC ↔ Data Consumers Steve Rutz NOAA/NESDIS National Oceanographic Data Center NODC Observing Systems Team Leader June 21, 2011.
1 NOAA Use of the Open Archival Information System Reference Model (OAIS-RM) Ken McDonald NOAA NESDIS ESIP Federation Meeting July 9, 2009.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
Categorization Recommendations for Implementing the E-Gov Act of 2002 Richard Huffine U.S. Environmental Protection Agency Co-chair, Categorization Working.
US IOOS Independent Cost Estimate: Summary Information.
Archival Workshop on Ingest, Identification, and Certification Standards Certification (Best Practices) Checklist Does the archive have a written plan.
RDA Data Foundation and Terminology (DFT) WG: Overview  Prepared for Collab Chairs Meeting, NIST, Nov 13-14, 2014  Gary Berg-Cross, Raphael Ritz, Peter.
NIST Data Science SymposiumMarch 4, 2014 NIST Data Science SymposiumMarch 4, Climate Archives in NOAA: Challenges and Opportunities March 4, 2014.
Implementation Strategy July 2002 STANDARDS DEVELOPMENT LIFECYCLE PROCESS ORP Publishes & Maintains 8 Standing Committee Recommends Approval / Disapproval.
NODC Metadata Management for Geoportal Server and Beyond John Relph NOAA National Oceanographic Data Center.
GPO’s Federal Digital System December 10, 2009 U.S. Government Printing Office.
IOOS National Glider Data Assembly Center
Thoughts on Stewardship, Archive, and Access to the National Multi- Model Ensemble (NMME) Prediction System Data Sets John Bates, Chief Remote Sensing.
DRAFT EDMC Procedural Directives NOAA Environmental Data Management Committee 12/3/2015 1
NOAA/NESDIS/National Oceanographic Data Center Following the Flow of Two Underway Data Streams Within the U. S. National Oceanographic Data Center Steven.
Data Integrity Issues: How to Proceed? Engineering Node Elizabeth Rye August 3, 2006
Science Data in the Science Mission Directorate (SMD) Jeffrey J.E. Hayes Program Executive for MO & DA, Heliophysics Division August 17, 2011.
Space Observations Ocean Observations Land Surface Observations Atmospheric Observations Environmental Data at NOAA.
A Proposed Short Course on Data Stewardship Scott Hausman Deputy Director NOAA’s National Climatic Data Center Preparing Scientists to Steward Their Data.
Copyright 2010, The World Bank Group. All Rights Reserved. Recommended Tabulations and Dissemination Section B.
EO Dataset Preservation Workflow Data Stewardship Interest Group WGISS-37 Meeting Cocoa Beach (Florida-US) - April 14-18, 2014.
11 Proposed A-16 Portfolio Metrics Lifecycle Management Workgroup Geospatial Line of Business For Preliminary Discussion FGDC Coordination Group (09/21/10)
Standard Metadata in Scientific Data Formats September 19, 2007 Flash at:
June 21, 2011EDMC Workshop in Silver Spring, MDDan Kowal Submission Agreements: The role they play in supporting the relationship between the Data Producer/Provider.
Ed Kearns National Climatic Data Center Asheville, NC.
Providing access to your data: Determining your audience Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Working with your archive organization: Broadening your user community Robert R. Downs, PhD Socioeconomic Data and Applications Center (SEDAC) Center for.
Future needs and plans for ocean observing in the Arctic AOOS Arctic Town Hall Futur Zdenka Willis Integrated Ocean Observing System National Program Office.
ISWG / SIF / GEOSS OOSSIW - November, 2008 GEOSS “Interoperability” Steven F. Browdy (ISWG, SIF, SCC)
1. 2 NOAA’s Mission To describe and predict changes in the Earth’s environment. To conserve and manage the Nation’s coastal and marine resources to ensure.
Data Management: Data Processing Types of Data Processing at USGS There are several ways to classify Data Processing activities at USGS, and here are some.
NOAA EDMC Ocean Observatories Initiative Cyberinfrastructure Karen Stocks OOI CI Data Curator University of California, San Diego Ocean Observatories.
ISWG / SIF / GEOSS OOS - August, 2008 GEOSS Interoperability Steven F. Browdy (ISWG, SIF, SCC)
Data Stewardship Lifecycle A framework for data service professionals Protectors of data.
IOOS Biological Data Services Three Steps to Enrollment Philip Goldstein (University of Colorado, OBIS-USA) Hassan Moustahfid (NOAA US IOOS) May 28, 2014.
OAIS (archive) OAIS (archive) Producer Management Consumer.
R2R ↔ NODC Steve Rutz NODC Observing Systems Team Leader May 12, 2011 Presented by L. Pikula, IODE OceanTeacher Course Data Management for Information.
April 7, 2016 NOAA Satellite and Information Service | National Centers for Environmental Information Mike Tanner Director, Center for Weather and Climate.
Ingest and Dissemination with DAITSS
Implementation Strategy July 2002
Agency Requirements: NOAA Administrative Order Management of environmental and geospatial data and information This training module is part of.
Send2NCEI: Fostering Producer-Archive Propinquity..
Essential Climate Variable (ECV) Inventory
Improving the Archiving of NOS Data
Prepared by: Jennifer Saleem Arrigo, Program Manager
Reportnet 3.0 Database Feasibility Study – Approach
Presentation transcript:

1 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION NCEI-IOOS Project Updates Mathew Biddle May 28th, 2015 IOOS DMAC Meeting, IOOS Program Office Silver Spring, MD

2 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION Outline Information about NCEINCEI Certification Archiving Access Future

3 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION National Centers for Environmental Information Revealing the Past, Interpreting the Present, and Informing the Future NOAA’s National Centers for Environmental Information (NCEI) is the merger of the National Climatic Data Center, National Geophysical Data Center, and National Oceanographic Data Center as approved in the Consolidated and Further Continuing Appropriations Act, 2015, Public Law The newly merged organization under NESDIS is called the National Centers for Environmental Information (NCEI). NOAA requested the merger to increase integration across the three centers. By using consistent data stewardship tools and practices across all of our science disciplines and by forging an improved data management paradigm, we expect to provide users with improved access to environmental data and information archive products. (archive, IT, administration). The merger will allow the Data Centers to continue the successful tradition and mission of stewarding the Nation’s environmental data and providing outstanding use-inspired products and services to the American public. It will provide much-needed information from and access to oceanographic, geophysical, and climatic data in a fully integrated way. A top priority during the merger will be to build on the full spectrum of climatic, oceanographic, coastal, and geophysical science and services the Data Centers currently deliver.

4 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION NCEI Tiers of Data Stewardship 1: Long Term Preservation and Basic Access 2: Enhanced Access and Basic Quality Assurance ●Create complete metadata to enable automated quality assurance and statistics collection ●Provide enhanced data access through specialized software services for users and applications 3: Scientific Improvements ●Improve data quality or accuracy with scientific quality assessments, controls, warning flags, and corrections ●Reprocess data sets to new, improved versions and distribute to users 4: Derived Products ●Build upon archived data to create new products that are more broadly useful ●Distill, combine, or analyze products and data to create new or blended scientific data products 5: Authoritative Records ●Combine multiple time series into a single, inter-calibrated product ●Establish authoritative quality, uncertainties, and provenance ●Ensure products are fully documented and reproducible 6: National Services and International Leadership ●Lead, coordinate, or implement scientific stewardship activities for a community or across disciplines ●Establish highly specialized levels of data services and product assessments ●Archive only necessary data using appropriate retention schedules●Provide data citation services by minting DOIs ●Serve as expert advisors on standards for data providers.●Coordinate support agreements for sustainable data archiving ●Preserve original data with metadata for discovery and access●Safeguard data over its entire life-cycle

5 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION NCEI Tiers of Data Stewardship 1: Long Term Preservation and Basic Access 2: Enhanced Access and Basic Quality Assurance ●Create complete metadata to enable automated quality assurance and statistics collection ●Provide enhanced data access through specialized software services for users and applications 3: Scientific Improvements ●Improve data quality or accuracy with scientific quality assessments, controls, warning flags, and corrections ●Reprocess data sets to new, improved versions and distribute to users 4: Derived Products ●Build upon archived data to create new products that are more broadly useful ●Distill, combine, or analyze products and data to create new or blended scientific data products 5: Authoritative Records ●Combine multiple time series into a single, inter-calibrated product ●Establish authoritative quality, uncertainties, and provenance ●Ensure products are fully documented and reproducible 6: National Services and International Leadership ●Lead, coordinate, or implement scientific stewardship activities for a community or across disciplines ●Establish highly specialized levels of data services and product assessments ●Archive only necessary data using appropriate retention schedules●Provide data citation services by minting DOIs ●Serve as expert advisors on standards for data providers.●Coordinate support agreements for sustainable data archiving ●Preserve original data with metadata for discovery and access●Safeguard data over its entire life-cycle

6 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION Archiving submission process Two types of submissions: 1.One-off: a. One time or very infrequent submissions. b. Non-standard data sets. c. Now you can submit through the new Send2NCEI webtool! 2.Automation: a. Recurring submission. b. Well structured, consistent data sets. c. Develop the submission procedures with NCEI. i.ATRAC (Advanced Tracking and Resource tool for Archive Collections) d. NCEI Pipeline proposal.

7 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION Send2NCEI

8 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION Archiving submission process Two types of submissions: 1.One-off: a. One time or very infrequent submissions. b. Non-standard data sets. c. Now you can submit through the new Send2NCEI webtool! 2.Automation: a. Recurring submission. b. Well structured, consistent data sets. c. Develop the submission procedures with NCEI. i.Submission Information Form (SIF). ii.ATRAC (Advanced Tracking and Resource tool for Archive Collections) d. NCEI Pipeline proposal.

9 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION RA-NCEI Data Pipeline Proposal Build a pipeline for each CF Feature Type (Time Series, Profile, etc.). If the data has a DAC, send it to the DAC (HF Radar, Glider, CDIP, etc.). Start developing the pipeline with a simple feature type first. Want to focus on non-Federal assets initially.

10 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION RA CertificationArchive Collection Level Record netCDF Data Files Access Manifest Auto-Harvest SIP and Archive RA-NCEI Data Pipeline Proposal NCEI RA SIP - Submission Information Package

11 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION RA Certification Archive Collection Level Record netCDF Data Files Access Manifest Auto-Harvest SIP and Archive RA-NCEI Data Pipeline Proposal NCEI RA SIP - Submission Information Package

12 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION Certification Requirements: Section (f) of the certification form is complete. Any contracts with private industry must be cited. Any litigation holds are clearly stated and supporting documentation Any/all data you handle must be documented. Specific attention to: –data flow –data conversions –QA/QC Checklist: In development. Fundamentally based on the guidance provided by the IOOS Program Office. Collection Level Record If a Certification exists: Developed from Certification documents. Some iteration between RA and NCEI will occur. If a Certification does not exist: Iteration between RA and NCEI will develop the record.

13 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION RA Certification Archive Collection Level Record netCDF Data Files Access Manifest Auto-Harvest SIP and Archive RA-NCEI Data Pipeline Proposal NCEI RA SIP - Submission Information Package

14 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION NODC NetCDF Templates Provides guidance for formatting your data in netCDF. Primarily follow ACDD 1.2 and CF 1.6 with a few added attributes/variables. Decision Tree. Data can be served to the public through NCEI’s various web services (FTP, HTTP, DAP, THREDDS...). Tier 2 Stewardship. Updates: Working on updates for ACDD 1.3. Broadening the application of our templates to fit the NCEI scope.

15 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION RA CertificationArchive Collection Level Record netCDF Data Files Access Manifest Auto-Harvest SIP and Archive RA-NCEI Data Pipeline Proposal NCEI RA SIP - Submission Information Package

16 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION Current IOOS RA Automations SECOORA GLOS AOOS CSESP Key: "Integrated Ocean Observing System Data Assembly Centers Data Stewardship Program"

17 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION NetCDF data files are posted on SECOORA FTP for NCEI to harvest. Validation through checksums. Disseminate and generate Archival Information Packages (AIP). (e.g. Each AIP is one station that gets updated monthly. Automation started on May 28, Current volumes (as of 5/20/2015): –min = MB* –max = MB* –average = MB* –total = MB (~12 months, 62 AIP) Data Access Statistics – – How do we manage SECOORA data? *The sizes are per AIP.

18 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION RA Certification Archive Collection Level Record netCDF Data Files Access Manifest Auto-Harvest SIP and Archive NCEI RA SIP - Submission Information Package SECOORA

19 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION NetCDF data files are posted on GLOS FTP for NCEI to harvest. Validation through checksums. Disseminate and generate Archival Information Packages (AIP). (e.g. Each AIP is one station that gets updated monthly. Automation started on December 11, Current volumes (as of 5/20/2015): –min = MB* –max = MB* –average = MB* –total = MB (~6 months, 22 AIP) How do we manage GLOS data? *The sizes are per AIP.

20 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION RA Certification Archive Collection Level Record netCDF Data Files Access Manifest Auto-Harvest SIP and Archive NCEI RA SIP - Submission Information Package GLOS

21 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION Six years of data Yearly updates. Data is posted on workspace… when a manifest appears, we pull the data. Disseminate and generate Archival Information Packages (AIP). Current volumes: –total = MB (1 AIP) How do we manage AOOS CSESP data?

22 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION RA Certification Archive Collection Level Record Data Files Access Manifest Auto-Harvest SIP and Archive NCEI RA SIP - Submission Information Package AOOS CSESP

23 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION Lessons Learned Consistency is key. Definitions for non CF keywords. Triage for data streams has been developed. Data not following CF and ACDD in netCDF requires more iteration.

24 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION RA CertificationArchive Collection Level Record netCDF Data Files Access Manifest Auto-Harvest SIP and Archive RA-NCEI Data Pipeline Proposal NCEI RA SIP - Submission Information Package X11 X1

25 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION RA-NCEI Data Pipeline Proposal cont. RA requirements: a.Data is formatted in ioos compliance checker validated netCDF. b.Certification contains all relevant documentation about the data set. c.RA will host the data on FTP/HTTP/DAP/THREDDS. d.Manifest to document Submission Information Package (SIP). NCEI requirements: a.Develop a collection record, with limited feedback from RA (provided the information is available). b.Develop an acquisition procedure to pull the data, generate metadata, archive, and publish. c.Provide various access mechanisms for the AIP’s.

26 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION Access tegrated%20Ocean%20Observing%20System%20Data%20Assembly%20 Centers%20Data%20Stewardship%20Program%22&start=1&max=2500& contentOption=intersecting&f=searchPagehttp://data.nodc.noaa.gov/geoportal/rest/find/document?searchText=%22In tegrated%20Ocean%20Observing%20System%20Data%20Assembly%20 Centers%20Data%20Stewardship%20Program%22&start=1&max=2500& contentOption=intersecting&f=searchPage ftp://ftp.nodc.noaa.gov/pub/data.nodc/ioos/

27 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION Future Feasibility test for the pipeline: –Develop the pipeline for the already automated SECOORA and GLOS process. NCEI Certification Checklist. Cookbook to submit data to NCEI.

28 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION Fun Stuff /catalog.html?dataset=testdata/mbiddle/aggregate_SECO ORA_carocoops.cap2.buoy_joinExisting.ncml atalog/ioos/secoora/carocoops.cap 2.buoy/catalog.html?dataset=ioos/s ecoora/carocoops.cap2.buoy/caroc oops.cap2.buoy_2015_05_01_18.n c

29 NATIONAL CENTERS FOR ENVIRONMENTAL INFORMATION Thanks! Questions?