Ocean glider data management: Argo concepts, GROOM, OGC sensor observation services Justin Buck + many collaborators British.

Slides:



Advertisements
Similar presentations
Groom-gliders data-management workshop Brest, December 2012 Groom gliders data management n In 2012 : vertical profiles from 26 platforms.
Advertisements

Ocean Data Interoperability Platform EU-US-Australia collaborative project Grant Number: Call: FP7-INFRASTRUCTURES INFSO Activity: INFRA :
Preparing CMOR for CMIP6 and other WCRP Projects
FP7-Infra : Design studies for European Research Infrastrutures 1st October 2011 – 31st December 2014 Duration 39 months – Periods : 2 (month.
The NODC Glider Technical Specification Tom Ryan, Dan Seidov, John Relph (NODC) and James Bennett (University of Washington) U.S. IOOS National Glider.
Argo QC with an emphasis on the North Atlantic Justin Buck British Oceanographic Data Centre Joseph Proudman Building 6 Brownlow Street Liverpool L3 5DA,
1 NODC, Russia GISC & DCPC developers meeting Langen, 29 – 31 March E2EDM technology implementation for WIS GISC development S. Sukhonosov, S. Belov.
1 The IIPC Web Curator Tool: Steve Knight The National Library of New Zealand Philip Beresford and Arun Persad The British Library An Open Source Solution.
DCS Architecture Bob Krzaczek. Key Design Requirement Distilled from the DCS Mission statement and the results of the Conceptual Design Review (June 1999):
SMOS SAG, Villafranca November 2-3, 2006 Development of a Global In-Situ Soil Moisture Network: A SMOS Project Contribution P.J. van Oevelen.
Serving society Stimulating innovation Supporting legislation Meeting on OGC Sensor Observation Service (SOS) for INSPIRE Mickael.
Managing Data Interoperability with FME Tony Kent Applications Engineer IMGS.
26-28 th April 2004BioXHIT Kick-off Meeting: WP 5.2Slide 1 WorkPackage 5.2: Implementation of Data management and Project Tracking in Structure Solution.
Discussion and conclusion The OGC SOS describes a global standard for storing and recalling sensor data and the associated metadata. The standard covers.
Status of upgrading CDI service (user interface, harvesting via GeoNetwork, CDI interoperability options following SeaDataNet D8.7) By Dick M.A. Schaap.
Data Management Practices: BCO-DMO’s Successes and Challenges Bob Groman BCO-DMO Woods Hole Oceanographic Institution NERACOOS/NeCODP Data Management Workshop.
AIRNow-International The future of the United States real-time air quality reporting and forecasting program and GEOSS participation John E. White U.S.
EARTH SCIENCE MARKUP LANGUAGE “Define Once Use Anywhere” INFORMATION TECHNOLOGY AND SYSTEMS CENTER UNIVERSITY OF ALABAMA IN HUNTSVILLE.
WP 9 (former Task 1b of WP 1): Data infrastructure Robert Huber UNI-HB Esonet 2nd all regions workshop, Paris
SeaDataNet A Pan-European Infrastructure for Ocean and Marine Data Management: Achievements and challenges.
Mapping between SOS standard specifications and INSPIRE legislation. Relationship between SOS and D2.9 Matthes Rieke, Dr. Albert Remke (m.rieke,
IODE Ocean Data Portal - technological framework of new IODE system Dr. Sergey Belov, et al. Partnership Centre for the IODE Ocean Data Portal MINCyT,
DELIVERING ENVIRONMENTAL WEB SERVICES (DEWS) Partners: UK Met Office (Lead Partner), British Atmospheric Data Centre (BADC), British Maritime Technology.
IIIrd SeaSearch Full Group Meeting Limassol, Cyprus Tuesday May 25-27, 2004 SeaDataNet-Design Study proposal.
EGO – COST - Groom-gliders Kiel meeting Kiel, June 2014 Data-management activity n Within Groom and with COST support, the glider data- management group.
GROOM Annual meeting Data Management issues Trieste 4-5 June 2013 Quoi de Neuf à Coriolis en 2008 ? GMMC Octobre 2008 S Pouliquen & Coriolis team.
1ECOOP Plenary Meeting Athens 2008 EUROMISS EUROpean Marine Information System of Systems S Pouliquen, Thomas Loubrieu ECOOP plenary meeting.
GBIF Mid Term Meetings 2011 Biodiversity Data Portals for GBIF Participants: The NPT Global Biodiversity Information Facility (GBIF) 3 rd May 2011.
EuroGOOS Annual Meeting 2009/10/ /10/08 2b: Contribution of open ocean observatories to GEO Data Core – Biodiversity, Ecosystems and Health Richard.
Future Perspectives of Ocean Observatories in Germany [Name of the infrastructure / site / time series…] Contact person: [name, ] [Institution(s)
MEDIN Work Plan for By March 2011 MEDIN will be 3 years into the original 5 year development plan started in Would normally ask for continued.
RAINEX Data Management UCAR Joint Office for Science Support José Meitín Jim Moore Dick Dirks UCAR Joint Office for Science Support José Meitín Jim Moore.
IOOS National Glider Data Assembly Center
Selene Dalecky March 20, 2007 FDsys: GPO’s Digital Content System.
31 March 2009 MMI OntDev 1 Autonomous Mission Operations for Sensor Webs Al Underbrink, Sentar, Inc.
NOAA/NESDIS/National Oceanographic Data Center Following the Flow of Two Underway Data Streams Within the U. S. National Oceanographic Data Center Steven.
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
Silver Spring HRMM Workshop April Global Ocean Surface Underway Data project (GOSUD) Theirry Carval – Coriolis Ifremer Bob Keely – MEDS Thierry.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
An Introduction to the Argo Data Sytem South Pacific Workshop 11 – 14 October 2005 Mark Ignaszewski FNMOC.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
Internet Documentation and Integration of Metadata (IDIOM) Presented by Ahmet E. Topcu Advisor: Prof. Geoffrey C. Fox 1/14/2009.
U.S. Environmental Protection Agency Central Data Exchange Pilot Project Promoting Geospatial Data Exchange Between EPA and State Partners. April 25, 2007.
Eurostat 1.SDMX: Background and purpose 1 Edward Cook Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October 2015.
1 SIMDAT Simdat Project –GTD. Meteo Activity – SIMDAT Meteo Activity OGF June 2008 Barcelona Marta Gutierrez, Baudouin Raoult, Cristina.
Data Management System to Collect, Quality Control, Distribute, and Archive Near Real-time Marine Data Jeremy J. Rolph, Jacob T. Rettig, Mark A. Bourassa,
Climate-SDM (1) Climate analysis use case –Described by: Marcia Branstetter Use case description –Data obtained from ESG –Using a sequence steps in analysis,
1 2.5 DISTRIBUTED DATA INTEGRATION WTF-CEOP (WGISS Test Facility for CEOP) May 2007 Yonsook Enloe (NASA/SGT) Chris Lynnes (NASA)
5-7 May 2003 SCD Exec_Retr 1 Research Data, May Archive Content New Archive Developments Archive Access and Provision.
Download Manager software Training Workshop Ostend, Belgium, 20 th May 2014 D.M.A. Schaap - Technical Coordinator.
EMODNet Physics + My Ocean – SeaDataNet Dick Schaap (MARIS) – Brussels – 1 March 2013.
NOAA EDMC Ocean Observatories Initiative Cyberinfrastructure Karen Stocks OOI CI Data Curator University of California, San Diego Ocean Observatories.
Metadata V1 By Dick M.A. Schaap – technical coordinator Oostende, June 08.
Sensor Web Enablement (SWE) developments for fixed monitoring platforms and research vessels By Dick M.A. Schaap – SeaDataNet Technical Coordinator with.
Born Semantic: Linking data from sensors to users and balancing hardware limitations with data standards Justin Buck 1 Adam Leadbetter.
Introduction to BODC and GEOTRACES data office Edward Mawji British Oceanographic Data Centre
Botts – August 2004 Sensor Web Enablement Sensor Web Enablement WG (SWE-WG)
IODE Ocean Data Portal - technological framework of new IODE system Dr. Sergey Belov, et al. Partnership Centre for the IODE Ocean Data Portal.
Data Browsing/Mining/Metadata
Justin Buck OceanSITES data Incentives for participation: Data citation & data services Justin Buck
Flanders Marine Institute (VLIZ)
Outline RTQC goals to achieve Description of current proposal Tests
Distributed Marine Data System:
Dick M.A. Schaap – Technical Coordinator SeaDataNet Training Workshop
Future Data Architectures Big Data Workshop – April 2018
AGU2017 fall meeting 2017, New Oreleans, 11th Decmember 2017
JCOMM in-situ Observations Programme Support Centre www. jcommops
Prepared by: Jennifer Saleem Arrigo, Program Manager
JCOMM in-situ Observations Programme Support Centre www. jcommops
EMODNet Physics + My Ocean – SeaDataNet
Presentation transcript:

Ocean glider data management: Argo concepts, GROOM, OGC sensor observation services Justin Buck + many collaborators British Oceanographic Data Centre 6 th EGO Meeting – 16 th June 2014

The numerous collaborators Sylvie Pouliquen Thierry Carval Jean-Philippe Rannou Mark Hebden Lise Quesnel Adam Leadbetter Task Data management Cost action ES0904 Data STSM partners Marine Autonomous Robotic Systems (MARS) facility

Outline Introduction GROOM data flow Common data tools Sensor observation services

Introduction

GOOS status (JCOMMOPS)

User expectations Data easily accessible from a unique point/portal Data coherent in terms of: – Data format – Data Quality – Processing chain (clearly documented) Additional requirements for Monitoring and forecasting users: – Data are available in near real time (within 24 hours) – Data are available in delayed mode after calibration and /or validation (typically within 12 months)

Stakeholder expectations Opportunity to use more observations than they could afford alone Operate jointly part of the network Benefit from the other partners' experience from design to implementation to data management and user uptake A key phrase for funding bodies: Acquire once, use multiple times

However... Broadly speaking, two distinct types of deployment: 1)‘Process’ type studies – short duration, spatially restricted, typically associated with a cruise. Designed to answer a specific question and can have data restrictions. 2)‘Sustained observation’ studies – longer duration, regional scale missions. Typically repeated sections. Of interest to the ocean modelling and forecasting community. So there is potential for conflict between ‘project’ interests and ‘operational’ interests

Gliders/GROOM/EGO

GROOM Task 3.2 Data system goals

A common data exchange format EGO glider data format established by the GROOM community (October 2012): Climate and Forecast (CF) and SeaDataNet compliant NetCDF. Interoperable with data standards being developed internationally (e.g. IMOS in Australia and IOOS in the U.S). Standard quality control protocols for both near real-time and ‘delayed-mode’ glider datasets (utilising Argo). Ensures that glider data, metadata and technical information are stored and distributed in a consistent manner.

Common data tools

BODC workflow Data Provider Data Scientist Unauthorised user Authorised user Source data Source meta Delayed mode meta/data Database meta Database data File System meta/data Register Arrival Archive Convert to standards [meta/data] QC/Calibration Merge as relevant Authorise Authenticate Prepare request Store as relevant External Database Discovery Manual Processing Automated (Provider) upload Automated download/upload Automated (BODC) download Checkout Error Handling & Reporting

BODC workflow & common tools Data Provider Data Scientist Unauthorised user Authorised user Source data Source meta Delayed mode meta/data Database meta Database data File System meta/data Register Arrival Archive Convert to standards [meta/data] QC/Calibration Merge as relevant Authorise Authenticate Prepare request Store as relevant External Database Discovery Manual Processing Automated (Provider) upload Automated download/upload Automated (BODC) download Checkout Error Handling & Reporting Data collection and secure archive Internal storage Data delivery Reformatting & processing (EGO code)

STSM to develop common tools Brest, December 2012 Goal to develop first version of common tools – Tomeu Garau, Daniele Cecchi (NURC) – Thierry Carval, Jean-Philippe Rannou (Ifremer) – Justin Buck, Mark Hebden, Lise Quesnel (BODC)

Common tools requirements Modular [for easy modifications and maintenance] Consistent input/output interfaces between modules Simple! Flexible enough to handle varying use cases Robust and handles any error with known rollback points Reports every failure Similar tasks such as reading from the database are done using similar approaches

Common tools workflow Source data Seaglider Slocum Other gliders Processing modules Data delivery Collection of source data readers, multiple readers per platform type. Data from single transmission converted to a.mat Conversion of.mat output to EGO format NetCDF for transmission Merge single transmission files to produce EGO NetCDF containing trajectory for deployment Time series quality control routines Single transmission quality control routines Corrections/calibration routines OriginatorGDACGTS

JSON files drive a generic system EGO NetCDF writer Raw data files Merged EGO NetCDF file JSON files describing EGO format JSON files describing variables to transfer JSON files describing deployment, glider and sensor configurations *The flexibility means system is usable in other projects e.g. SMRU animal tags

JSON files EGO NETCDF writer controlled by JSON files. Example on right describes CTD sensor metadata.

Real time quality control Test are adaptations of established Argo QC RTQC configured via JSON files Presently implemented tests – Valid range (e.g. TEMP, PRES, speed etc) – Regional range – Gradient – Spike – Stationary – Position on land – Density inversion

Sharing of code Code is available on the Ifremer SVN repository with Mantis used for bug tracking Pooling of common code from multiple centres Code has a reciprocal public license A SeaDataNet login is required Thierry Carval is administrator and can grant access

Sensor Observation Services

BODC workflow & common tools Data Provider Data Scientist Unauthorised user Authorised user Source data Source meta Delayed mode meta/data Database meta Database data File System meta/data Register Arrival Archive Convert to standards [meta/data] QC/Calibration Merge as relevant Authorise Authenticate Prepare request Store as relevant External Database Discovery Manual Processing Automated (Provider) upload Automated download/upload Automated (BODC) download Checkout Error Handling & Reporting Data collection and secure archive Internal storage Data delivery Reformatting & processing (EGO code)

Sensor webs - connecting data From

OGC, SWE, O&M? OGC – Open Geospatial Consortium SWE – Sensor Web Enablement initiative OGC Defined prototyped and tested sensor web components: Sensor Model Language (SensorML) Observations & Measurements (O&M) Sensor Observation Service (SOS) Sensor Web Enablement Sensor Model Language Observations & Measurements...

Why SWE? Standardized web services will exist for accessing sensor information and sensor observations Sensor systems will be capable of real-time mining of observations to find phenomena of immediate interest Sensors will be capable of issuing alerts based on observations, as well as be able to respond to alerts issued by other sensors O&M required for EC INSPIRE directive

OGC Sensor Observation Service (SOS) The SOS standard is applicable to use cases in which sensor data needs to be managed in an interoperable way. Defines a Web service interface which allows querying observations, sensor metadata, as well as representations of observed features. Defines means to register new sensors and to remove existing ones. Defines operations to insert new sensor observations.

International Harmonisation Meeting with IOOS and IMOS during December 2013 to harmonise data formats Adjusted format includes metadata structure changes for OGC SOS compatibility

ncSOS – NetCDF implementation SOS service run on a repository of NetCDF files via a THREDDS serverTHREDDS – C.F1.6 and Attribute Convention for Dataset Discover (ACDD) attributes required C.F1.6ACDD IOOS adapting THREDDS for 3D trajectory data BODC investigating the addition of access control methods for restricted data

Potential BODC implementation NRT Source data Gliders NRT Ship underway Others e.g. Animal tags, Argo, sea level, etc... Processing modules Data delivery Accession of data Single transmission quality control routines Originator via SFAGDAC NODB (when delayed mode ready as per IDP project) Met Office for push to GTS Storage in repository of files THREDDS server with ncSOS Methods to interrogate & package data Application of calibrations Conversion to CF NetCDF with ACDD compliant attributes web

Summary First concepts for glider data management based on OceanSites, Argo, etc formed the basis of GROOM activity – Basic common tools available for conversion of glider data to an international exchange format Todays technology permits more advanced data delivery methods than a decade ago and a prototype SOS services being developed – Several projects contributing ODIP & SenseOCEAN (active), AltantOS (proposed)

Questions?