Steven Worley, NCAR Scott Woodruff, NOAA/ESRL Eric Freeman, NOAA/NCDC

Slides:



Advertisements
Similar presentations
Use of VOS data in Climate Products Elizabeth Kent and Scott Woodruff National Oceanography Centre, Southampton NOAA Earth System Research Laboratory.
Advertisements

1 NOAA’S NATIONAL CLIMATIC DATA CENTER – ASHEVILLE, NORTH CAROLINA NCDC Report to US Port Meteorological Officers August 2014 Eric Freeman 1,2, Scott.
XHTML Basics.
THE USE OF REMOTE SENSING DATA/INFORMATION AS PROXY OF WEATHER AND CLIMATE IN THE GREATER HORN OF AFRICA Gilbert O Ouma IGAD Climate Applications and Prediction.
RECLAIM and ICOADS Scott Woodruff 1, Philip Brohan 2, Eric Freeman 3, Elizabeth Kent 4, Sandy Lubker 1, Clive Wilkinson 5, and Steve Worley 6 1) NOAA Earth.
ICOADS Archive Practices at NCAR JCOMM ETMC-III 9-12 February 2010 Steven Worley.
First Marine Board Forum – 15 May Oostende Marine Data Challenges: from Observation to Information From observation to data.
Symposium on Digital Curation in the Era of Big Data: Career Opportunities and Educational Requirements Workforce Demand and Career Opportunities From.
Surface Marine Data International Comprehensive Ocean-Atmosphere Data Set (ICOADS) Steven Worley, NCAR Scott Woodruff, NOAA/ERSL Eric Freeman, NOAA/NCDC.
Report from the GCOS Archive/Analysis Center Matthew Menne NOAA/National Centers for Environmental Information Center for Weather and Climate (NCEI-Asheville)
Research Data at NCAR 1 August, 2002 Steven Worley Scientific Computing Division Data Support Section.
The Digital Library for Earth System Education: A Community Resource
The Climate Prediction Project Global Climate Information for Regional Adaptation and Decision-Making in the 21 st Century.
The National Center for Atmospheric Research is operated by the University Corporation for Atmospheric Research under sponsorship of the National Science.
Preserving the Scientific Record: Preserving a Record of Environmental Change Matthew Mayernik National Center for Atmospheric Research Version 1.0 [Review.
ADVANCED KNOWLEDGE IS POWER Protect Life and Property Promote Economic Vitality Environmental Stewardship Promote Fundamental Understanding.
CC&E Best Data Management Practices, April 19, 2015 Please take the Workshop Survey 1.
IVAD development supported through a grant from the NOAA Climate Program Office. IVAD – Enhancing Marine Climate Data Records Shawn R. Smith 1, Scott D.
Technical Working Group, II Teruko Manabe Steven Worley Miroslaw Mietus Shawn Smith Simon Tett Volker Wagner Scott Woodruff David Berry Liz Kent.
Archive and Access Practices that Support Data Reuse and Transparency Steven Worley Doug Schuster Bob Dattore National Center for Atmospheric Research.
Marine Climatology from Research Vessels Shawn R. Smith 1, Scott D. Woodruff 2, and Steve Worley 3 1 Center for Ocean-Atmospheric Prediction Studies, FSU,
Data Access to Marine Surface Observations and Products from COADS 29 January, 2002 Steven Worley National Center for Atmospheric Research.
Page 1© Crown copyright Report of the Global Collecting Centres Elanor Gowland, GCC UK SOT-IV, 16 th - 21 st April 2007.
Automated Weather Observations from Ships and Buoys: A Future Resource for Climatologists Shawn R. Smith Center for Ocean-Atmospheric Prediction Studies.
ICOADS: Update Status and Data Distribution Steven J. Worley Scott D. Woodruff Sandra J. Lubker Ziahua Ji J. Eric Freeman NCAR, NOAA/ESRL, NOAA/NCDC CLIMAR-III,
Geosciences - Observations (Bob Wilhelmson) The geosciences in NSF’s world consists of atmospheric science, ocean science, and earth science Many of the.
Center for Satellite Applications and Research (STAR) Review 09 – 11 March 2010 In Situ SST for Satellite Cal/Val and Quality Control Alexander Ignatov.
Data Discovery and Access to The International Surface Pressure Databank (ISPD) 1 Thomas Cram Gilbert P. Compo* Doug Schuster Chesley McColl* Steven Worley.
1 OSTI - Accelerating Science Information Dr. Walter L. Warnick Director U.S. Department of Energy Office of Scientific and Technical Information Federal.
NOAA/NESDIS/National Oceanographic Data Center Following the Flow of Two Underway Data Streams Within the U. S. National Oceanographic Data Center Steven.
WGISS and GEO Activities Kathy Fontaine NASA March 13, 2007 eGY Boulder, CO.
NOAA’s National Climatic Data Center Climate Service Partnership Activities At NOAA’s National Climatic Data Center Tim Owen Climate Prediction Applications.
Outcomes of CLIMAR-IV DAVID I. BERRY ETMC-V, 22 – 25 JUNE 2015.
© Crown copyright Met Office Upgrading VOS to VOSClim Sarah North.
Marine Surface and Climate Data Gaps in the archives at the National Center for Atmospheric Research for 15 th U.S. – China Marine and Fishery Science.
The Research Data Archive at NCAR: A System Designed to Handle Diverse Datasets Bob Dattore and Steven Worley National Center for Atmospheric Research.
JCOMM: Perspectives & Contributions 1 Val Swail - Environment Canada, Toronto 2 Scott Woodruff - NOAA Earth System Research Laboratory, Boulder Elizabeth.
I-COADS Data and Products Steven J. Worley Scott D. Woodruff Richard W. Reynolds.
Data Discovery and Access to The International Surface Pressure Databank (ISPD) 1 Thomas Cram Gilbert P. Compo* Doug Schuster Chesley McColl* Steven Worley.
Climate Database Modernization Program (CDMP) Marine Tasks Joe Elms Program Manager.
1. 2 NOAA’s Mission To describe and predict changes in the Earth’s environment. To conserve and manage the Nation’s coastal and marine resources to ensure.
Understanding and Improving Marine Air Temperatures David I. Berry and Elizabeth C. Kent National Oceanography Centre, Southampton
NASA Earth Science Data Stewardship
USGS EROS LCMAP System Status Briefing for CEOS
Partner Toolbox Cloud Infrastructure & Management
Implementing the New JCOMM Marine Climate Data System (MCDS)
OceanDocs Digital Repository of Marine Science Research Outputs
Associate Director for Research, Education and Marine Operations
Persistent Identifiers Implementation in EOSDIS
Joseph JaJa, Mike Smorul, and Sangchul Song
Integrating Data and Information Across Observing System
Observing Climate Variability and Change
Operational Dataset Update Functionality Included in the NCAR Research Data Archive Management System Zaihua Ji Doug Schuster Steven Worley Computational.
Guidance for managing international precipitation data
Coordinating Operational Oceanography and Marine Meteorology
GODAE Quality Control Pilot Project
Eric Freeman2, Catherine Marzin3,
English East India Company Project
Research Data Archives at NCAR
Coding issues BUFR Binary Universal code Form for the Representation of meteorological data Binary Table driven code form (BUFR, CREX) Efficient compression.
OPeNDAP: Accessing Data in a Distributed, Heterogeneous Environment
Brian Matthews STFC EOSCpilot Brian Matthews STFC
Bird of Feather Session
CISL’s Research Data Archive (RDA) : Description and Methods
Long-Lived Data Collections
Data Management Components for a Research Data Archive
Robert Dattore and Steven Worley
ICOADS: Data, Products, and Access
Data Curation in Climate and Weather
Comeaux and Worley, NSF/NCAR/SCD
Presentation transcript:

Steven Worley, NCAR Scott Woodruff, NOAA/ESRL Eric Freeman, NOAA/NCDC ICOADS: A multinational data rescue, digitization, archiving, and access success for the ocean Steven Worley, NCAR Scott Woodruff, NOAA/ESRL Eric Freeman, NOAA/NCDC International Comprehensive Ocean-Atmosphere Data Set 5/12/20195/12/2019

Topics Introduction to ICOADS Lessons Learned Data Management International Partnerships Data User and Project Expectations ICOADS Value-Added Database 5/12/20195/12/2019

ICOADS – Status Today Release 2.5 ICOADS Release 2.5: extensions and enhancements to the surface marine meteorological archive, IJC, First Published Online 2 Mar 2010, DOI: 10.1002/joc.2103 Scott D. Woodruff, Steven J. Worley, Sandra J. Lubker, Zaihua Ji, J. Eric Freeman, David I. Berry, Philip Brohan, Elizabeth C. Kent, Richard W. Reynolds, Shawn R. Smith and Clive Wilkinson Recovery of logbooks and international marine data: the RECLAIM project, IJC, First Published Online 2 Mar 2010, DOI: 10.1002/joc.2103 Clive Wilkinson, Scott D. Woodruff, Philip Brohan, Stefan Claesson, Eric Freeman, Frits Koek, Sandra J. Lubker, Catherine Marzin and Dennis Wheeler 5/12/20195/12/2019

ICOADS – Status Today What it is: A surface marine meteorological and oceanographic dataset. Date Range: Rel. 2.5 1662-2007, Preliminary Extension 2008-8/2010 (ongoing) Number of Records: 261 M (not including P. Ext.) Number of Unique Users: ≅ 400 per year Number of Data Sources: ≅ 80 Number of Major Updates/Releases: ≅ 10 5/12/20195/12/2019

ICOADS – Status Today U.S. Primary Partners Number of People NOAA ESRL 1.5 NOAA NCDC 0.5 (CDMP ~ 1.0) NCAR 0.5 International/National Partners and Contributors Met Office Hadley Centre National Oceanography Centre - Southampton Center for Ocean-Atmospheric Prediction Studies, FSU Deutscher Wetterdienst, DWD Koninklijk Nederlands Meteorologisch Instituut, KNMI Climate Research Unit, University of East Anglia Many others currently and in the past! Discuss CRU 5/12/20195/12/2019

Early Data Mixture Changes: Science Impacts Relative importance of national collections in early period. Changing mixture over time – has science impact WWII is drastically different – Thompson et al. 2008, Nature, noted discontinuity in global mean temperature. Annual distribution (1820-1949) of source makeup of Release 2.5 shown as thousands of reports per year, grouped according to roughly national categories, plus Historical Sea Surface Temperature (HSST) Project data. Note e.g. abrupt data mixture transitions around World War II. Forest and Reynolds (2008) and Thompson et al. (2008) discuss changes in SST patterns around 1945, which appear to be associated with similar abrupt changes in the national data mixture composition (but based on pre-Release 2.5 ICOADS data 5/12/20195/12/2019

Platform Mixture WMO Pub. 47 VOS metadata 1966-2007 Annual distribution (1937-2007) of major platform types in Release 2.5 shown as millions of reports per year. For clarity the vertical scale is truncated at 9M; years 2005-07 have 13M, 15M, and 16M total reports (not visible) in Release 2.5, respectively. The red line curve shows the Release 2.4 annual counts. Ships (including some research vessels), buoys, and oceanographic are self explanatory, Ocean (permanent) Station Vessel = OSV, Coastal-Marine Automated Network = C-MAN, ocean drilling rigs/platforms and other small entities = other, and unidentified platform types = missing 5/12/20195/12/2019

Project Foundation NOAA special report cover page Comprehensive Ocean-Atmosphere Data Set Release 1, April 1985 CIRES University of Colorado/NOAA Cooperative Institute of Research in Environmental Sciences Ralph J. Slutz, Sandra J. Lubker, Jane D. Hiscox ERL U.S. Department of Commerce National Oceanic and Atmospheric Administration (NOAA) Environmental Research Laboratories Scott Woodruff NCAR National Science Foundation sponsored National Center for Atmospheric Research Roy L. Jenne, Dennis H. Joseph NCDC National Environmental Satellite, Data, and Information Service National Climatic Data Center Peter M. Steurer, Joe D. Elms 5/12/20195/12/2019

Forward to Release 1, by Joe Fletcher (NOAA) "The incentive for developing the Comprehensive Ocean-Atmosphere Data Set (COADS) was to make this record available to the individual investigator in a form that is reliable and easy to use. The global marine surface data set contains the most detailed record we will ever have of the dynamics of the global climate system over the last century and more. It should trigger rapid progress in understanding by making it possible to delineate the spatial and temporal characteristics of the several sharp adjustments of the global circulation that have occurred, and to glean from them clues to the nature and causes of global climate variability. COADS provides the material for diagnostic research to identify and explore the key questions. It also provides the needed boundary conditions for model simulation of the climate system variability. It has taken four years and much effort by many individuals and several institutions to obtain and process the hundreds of tapes containing the basic data input. All of this effort was provided from ongoing activities; there was no appropriation identified for the task. It is a tribute to the spirit of cooperation among the participating organizations that the task has been successfully completed." 5/12/20195/12/2019

First Big Announcement Woodruff, Scott D. , Ralph J. Slutz, Roy L First Big Announcement Woodruff, Scott D., Ralph J. Slutz, Roy L. Jenne, Peter M. Steurer, 1987: A Comprehensive Ocean-Atmosphere Data Set. Bull. Amer. Meteor. Soc., 68, 1239-1250 5/12/20195/12/2019

A successful multi-organization team requires balance Data Management Lessons Learned A successful multi-organization team requires balance Balance elements Significant contribution of people and facilities Mutually supportive expertise in areas like operations, archival, research Commitment for an extended period ICOADS was forged out of ongoing activities Institutional commitment led to sustainability After 4 years, a strong foundation had been established 5/12/20195/12/2019

Support (people and $,£,€) is always a challenge - multiple organizations can help buffer the crises Organizations can shift focus and impact progress Climate data archives are research development efforts, but bridge with operational activities at near real time. Too much emphasis on operations will effect resources to improve the historical record. When support ebbs at one organization it may increase at another – progress continues possibly in different areas 5/12/20195/12/2019

Long-term involvement of key individuals is essential Need key focal points with deep knowledge Need individuals to take charge of - or document - the hard questions Historical archives have many subtle characteristics Answers come from knowing the history and evolution of the data 5/12/20195/12/2019

Organizing data sources is very difficult Data provenance - not well documented Many evaluations are needed Inventories, distributions (time and space), duplicate assessment, date and time / location problems http://icoads.noaa.gov/r2.5.html http://climateaudit.org/2010/08/30/a-first-look-at-icoads/ Study formats - QC'd data . Descriptions ≠ findings. Dutch deck 193: empty box (data never received?) 5/12/20195/12/2019

Finding duplicates between overlapping sources can be difficult Some Factors Truncated values, inaccurate units conversion Proper handling of 'missing' values, 00 ≠ null/blank Early technology influence 80 character punch cards - leading to abbreviated records. Challenge to match GTS and delayed mode archives Source data and documentation need long-term preservation Track the details Data, documentation, event logs Make backup copies of everything CDMP has funded small project to image legacy ICOADS documents Climate Data Modernization Program (CDMP)/NOAA 5/12/20195/12/2019

Design and use a robust and extensible data record format. ICOADS, IMMA: flexible format (International Maritime Meteorological Archive) core + optional attachments core icoads immt meta model . . . suppl ASCII, based on established formats, expanded with metadata for source tracking Core is multi-parameter, satisfies +95% of users Ship/instrument metadata – thanks to NOCS Easy to use and suitable for long-term archiving Does well with these competing objectives Key requirement: attm of original (“suppl.”) data Advantage: exact copy of original permits re-translation and cross-checks at any time. 5/12/20195/12/2019

Data translation into a uniform format is necessary Brings differently formatted sources together Carefully done translation is the "foundation" for success Undetected errors create negative downstream impacts Keep emphasis on the “data” - system look and feel are secondary Mitigate difficulties by standardizing software to write records Create, use, share, standardized units-conversion libraries Assure source data format understanding/interpretation Use two-person blind translations (expensive) Amazing number of subtle QC issues are found Animate this slide. 5/12/20195/12/2019

Evolution of technology will have impact Data storage concerns (1985) => compact binary formats Evolution factors OSs, compilers, languages, applications, rapidly multiplied Diversity of the user community demanded easier approaches Conversion to ASCII (IMMA) (~2000) Impact factors Software stack modifications – producers, providers Documentation rewrite Access interfaces upgrade Pushes unwanted changes onto regular data users Large resource drain - seldom adequately funded Risky but necessary 5/12/20195/12/2019

International Partnerships Lessons Learned International partnerships are a substantial benefit (COADS to ICOADS in 2002) Add value through: Content building, data discovery Promoting broad community recognition Sharing the workload Detecting errors and resolving problems Partnership conundrum - requirements and administration commitments grow without concomitant national resources support In the balance ICOADS is very positive on partnerships 5/12/20195/12/2019

Recognition by international bodies aids development Easier to get national resources to support the project if it has received 'approval’ of international organizations World Meteorological Organization (WMO), Intergovernmental Oceanographic Commission (IOC) ICOADS is seeking formal international recognition Through the Joint WMO-IOC Technical Commission on Oceanography and Marine Meteorology Goal: Create mirrored/shared development of ICOADS at WMO-IOC Centres 5/12/20195/12/2019

Formal announcement expected 1 October Regular international meetings, with a stated data focus, help drive progress and develop shared project ownership ICOADS, benefited @ CLIMAR and MARCDAT meeting series http://icoads.noaa.gov/marcdat3/ Formal announcement expected 1 October 5/12/20195/12/2019

Data User and Project Expectations Lessons Learned ICOADS had over-expectations on how well the users would understand the data record format ICOADS archive heterogeneity => complex QC flagging Early years – many users ignored the flags Results, erroneous extreme values, ship on land Solution – offer alternative data access pathways Provide a web interface with standard QC defaults and selectable options Serve access software with bulk file downloads 5/12/20195/12/2019

User expectations will shift over time Early years, simple monthly summary statistics Now, individual observations are most demanded Maybe - simple statistics replaced by climate-quality derived products Observational datasets are now small compared personal computers. ICOADS is 90 GB. 5/12/20195/12/2019

Today data access is expected to be immediate in 'my' favorite format ICOADS has not fully addressed some areas Interoperable access, web services GIS Organizations can share access services - each addressing different user communities Critical point, data must be identical Open data access can lead to out-dated secondary archives that negatively impact uninformed users Release 1a COADS (1993) can still be found on the web! Important, have a known authoritative location 5/12/20195/12/2019

The scientific reach is surprising The ICOADS data users are now more than physical meteorologist and oceanographers. Examples: bird migration, turtle population studies, ship engine exhaust impacts, coral reef ecology, etc… This interdisciplinary nature is also expanding because of historical data rescue Program support from: RECLAIM, ACRE, CDMP Steve: Reclaim is undefined about. How about a fig? RECovery of Logbooks And International Marine data (RECLAIM) Project Atmospheric Circulation Reconstructions for the Earth (ACRE) Climate Data Modernization Program (CDMP) 5/12/20195/12/2019

Version of ICOADS need to be reproducible upon request Need for precise data citation and version control is of hyper new importance Growing movement for accurate data referencing in publications, e.g. digital object identifier (DOI) for data Version of ICOADS need to be reproducible upon request Our approach has been ad hoc – we will formalize it in future 5/12/20195/12/2019

ICOADS Value-Added Database (IVAD) Status of a new proposal 3-years, submitted to NOAA Climate Observations and Monitoring Program Partners COAPS FSU, NOAA ESRL & NCDC, NCAR Main Features Enhancement of IMMA data format Deploy ICOADS in a database Receive and include some parameter adjustments Provide user access to original and adjusted parameters Demonstrate impact on marine surface flux estimates 5/12/20195/12/2019

http://icoads.noaa.gov/ Last 32 month platform type time series Last month geo-located platforms, SLP, and SST departure Others – long-term plots of platform counts and parameters 5/12/20195/12/2019

ICOADS: A multinational data rescue, digitization, archiving, and access success for the ocean International Comprehensive Ocean-Atmosphere Data Set 5/12/20195/12/2019