A Fleet-wide Approach to Optimizing Data Quality Vicki Ferrini, Suzanne O’Hara (LDEO) Paul Johnson, Kevin Jerram (UNH)

Slides:



Advertisements
Similar presentations
CSUS I-Scan Group California State University, Sacramento
Advertisements

Future Directions and Initiatives in the Use of Remote Sensing for Water Quality.
Prototype Phase SIO Accomplishments
Rolling Deck to Repository: Transforming the United States Academic Fleet Into an Integrated Global Observing System Suzanne M. Carbotte, Robert Arko,
Providing access to your data: Determining your audience Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Symposium on Digital Curation in the Era of Big Data: Career Opportunities and Educational Requirements: A Data Scientist Perspective Dr. Vicki Lynn Ferrini.
Open Data at the World Bank. Open Data at the World Bank Open about what we do Open about what we.
Most efforts are project-based Coverage therefore patchy, often repeat returns Deep ocean practically unsurveyed at resolution
Optimizing Multibeam Data Quality Across the Fleet TIM GATES (Gates Acoustic Services) VICKI FERRINI (LDEO) JONATHAN BEAUDOIN (CCOM-UNH) PAUL JOHNSON (CCOM-UNH)
® IBM India Research Lab © 2006 IBM Corporation Challenges in Building a Strategic Information Integration Infrastructure Mukesh Mohania IBM India Research.
Data Portal for the “Climate changes Spatial Planning” program Henk Klein Baltink (KNMI) Fred Bosveld (KNMI) Hans de Wolf (Dutch Space)
CEOS System Engineering Toolset (CSET) CSET is a Software Framework + Suite of Tools (Apps) that leverages a Common Architecture, Unified Data Model, Common.
Esri UC2013. Technical Workshop.Editing & Maintaining Parcels with ArcGIS.
NOAA Metadata Update Ted Habermann. NOAA EDMC Documentation Directive This Procedural Directive establishes 1) a metadata content standard (International.
Environment Environnement Canada Modernization of Environment Canada’s Water Quality Data and Services: Chris Lochner 1, Gino Sardella 2 and team 1 National.
Metadata (for the data users downstream) RFC GIS Workshop July 2007 NOAA/NESDIS/NGDC Documentation.
Inter Regional Coordination Committee Paper by the IHB Considerations on the development of the General Bathymetric Chart of the Oceans (GEBCO data store.
Bringing it All Together: NODC’s Geoportal Server as an Integration Tool for Interoperable Data Services Kenneth S. Casey, Ph.D. YuanJie Li NOAA National.
Updates from EOSDIS -- as they relate to LANCE Kevin Murphy LANCE UWG, 23rd September
North Carolina Geospatial Data Archiving Project (NCGDAP) JISC/NDIIPP Joint Digital Preservation Workshop – May 2006 Presented by: Rob Farrell, Steve Morris,
IV-3.1 JCOMMOPS SOT Technical Coordinator. 2 JCOMMOPS structure Programmes currently supported –Ship Observations Team (30% Mathieu Belbeoch) –Argo Profiling.
A survey based analysis on training opportunities Dr. Jūratė Kuprienė Framing the digital curation curriculum International Conference Florence, Italy.
1 Global Systems Division (GSD) Earth System Research Laboratory (ESRL) NextGen Weather Data Cube Chris MacDermaid October, 2010.
M u l t I b e a m III W o r k s h o p M u l t I b e a m III W o r k s h o p National Geophysical Data Center / World Data Centers NOAA Slide 1 End-to-End.
PDS Geosciences Node Page 1 Archiving Mars Mission Data Sets with the Planetary Data System Report to MEPAG Edward A. Guinness Dept. of Earth and Planetary.
GeoPlannerSM for ArcGIS®: An Introduction
NOAA National Geophysical Data Center & collocated World Data Centers, Boulder CO USA World Data Center for Marine Geology and Geophysics, Boulder, CO.
NODC Metadata Management for Geoportal Server and Beyond John Relph NOAA National Oceanographic Data Center.
Presented at AMSR Science Team Meeting September 23-24, 2014 AMSR2 NRT Land, Atmosphere Near real-time Capability for EOS (LANCE) Helen Conover Information.
UAF/OSMC Presenters: Kevin O’Brien and Eugene Burger Abstract: Kevin O’Brien and Eugene Burger are from NOAA’s Pacific Marine Environmental Laboratory.
DRAFT EDMC Procedural Directives NOAA Environmental Data Management Committee 12/3/2015 1
1 Understanding Cataloging with DLESE Metadata Karon Kelly Katy Ginger Holly Devaul
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
1-2-3 February 2006 –Page 1 Mersea Integrated System How to improve Access/Downloading services ? How far do we go in terms of standardization ?
TSS Database Inventory. CIRA has… Received and imported the 2002 and 2018 modeling data Decided to initially store only IMPROVE site-specific data Decided.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
AMSR-E and AMSR-E Validation Status at NSIDC Amanda Leon, NSIDC AMSR-E Lead Joint AMSR-E Science Team Meeting Asheville, NC June 2011.
Serving Multidisciplinary Data For Ridge2000 and MARGINS Programs William B. F. Ryan, Suzanne Carbotte and MGDS Team.
Proprietary and confidential. © 2003 Perot Systems Corporation. All rights reserved. All registered trademarks are the property of their respective owners.
Earth System Curator and Model Metadata Discovery and Display for CMIP5 Sylvia Murphy and Cecelia Deluca (NOAA/CIRES) Hannah Wilcox (NCAR/CISL) Metafor.
ESRI Education User Conference – July 6-8, 2001 ESRI Education User Conference – July 6-8, 2001 Introducing ArcCatalog: Tools for Metadata and Data Management.
Welcome to the PRECIS training workshop
1 Beyond Content Packaging: LETSI’s Open Learning Architecture Avron Barr letsi.org LETSI is an international non-profit federation committed to open standards.
1 1 NOAA Office of Ocean Exploration End-to-End Data Management: A Success Story NOAA Tech Conference November 2005 Susan Gottfried National Coastal Data.
The Proliferation of Metadata Standards and the Evolution of NASA’s Global Change Master Directory (GCMD) Standard for Uses in Earth Science Data Discovery.
SEDAC Long-Term Archive Development Robert R. Downs Socioeconomic Data and Applications Center Center for International Earth Science Information Network.
Providing access to your data: Determining your audience Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Semantic Concepts in Expedition Metadata Semantic Concepts in Expedition Metadata Bob Arko Lamont-Doherty Earth Observatory OOSSI Workshop Nov. 18, 2008.
The Earth Information Exchange. Portal Structure Portal Functions/Capabilities Portal Content ESIP Portal and Geospatial One-Stop ESIP Portal and NOAA.
Rolling Deck to Repository (R2R): How to Systematically Document Quality for the New Era of Data Re-Usability? AGU Poster IN21B-1048 AGU Fall Meeting December.
SIOExplorer: Digital Library Projects R/V Alexander Agassiz November, 1907 UCSD Libraries Scripps Institution of Oceanography San Diego Supercomputer Center.
CHARTER SCHOOLS PROGRAM MONITORING AND DATA COLLECTION CONTRACT SOLICITATION NO.: ED-OII-15-R-0014.
IPDA Registry Definitions Project Dan Crichton Pedro Osuna Alain Sarkissian.
A Quick Tour of the NOAA Environmental Software Infrastructure and Interoperability Group Cecelia DeLuca Dr. Robert Detrick visit March 28, 2012
Center of Excellence for Oceans and Human Health at the Hollings Marine Laboratory Metadata Development in Support of the Oceans and Human Health Tidal.
Microsoft® System Center Virtual Machine Manager 2008
Scientific Information Management Approaches Needed to Support Global Assessments: The USEPA's Experience and Information Resources Jeffrey B. Frithsen.
NOAA Data Management Perspective & Plans NSF RDMI Workshop
Working with your archive organization Broadening your user community
LSI-VC Jenn Lacey, USGS, LSI-VC Co-Lead CEOS SIT-33
Prepared by: Jennifer Saleem Arrigo, Program Manager
LSI-VC User Requirements
Google Sky.
Seabed 2030 Project Overview
A Science Community Perspective
Lamont-Doherty Earth Observatory of Columbia University
Moving from Data Aggregation to Decision-as-a-Service | US Hydro Conference | Wetherbee Dorshow & Guy Noll.
Cloud Optimized Processing for Hydrographic Data
Arctic SDI: Interoperability Framework
Presentation transcript:

A Fleet-wide Approach to Optimizing Data Quality Vicki Ferrini, Suzanne O’Hara (LDEO) Paul Johnson, Kevin Jerram (UNH)

MBES Data Workflow Acquisition Analysis & Interpretation Products

Increasing Emphasis on Open Data Access Acquisition Costs Spatial & Temporal Change Scientific Reproducibility Federal Data Policy Compliance Data Syntheses & Big Data Enable New Analyses

*NSF-funded cruises can have 2-year proprietary restriction

A lot of this high-value data is acquired opportunistically!

How can we cost-effectively optimize data quality? The Economist, 2010

GMRT 1992 R2R 2009 MAC 2011 Multibeam Sonar Data Continuum GOAL: Well-documented high-quality publicly available data

Multibeam Advisory Committee Optimize data quality at acquisition – Encourage opportunistic data acquisition Consolidate Tools, Resources & Expertise US Academic Research Fleet Technical Teams / Ship Visits Data Resources – BIST Database – Reference Surfaces – Patch Test Locations Help Desk More details tomorrow…

MAC-Supported Ships +2 new ships coming online 2016

GMRT 1992 R2R 2009 MAC 2011 Multibeam Sonar Data Continuum GOAL: Well-documented high-quality publicly available data

Data Stewardship of Underway Data Unprocessed data from permanent sensors Cruise Catalog Cruise and data set metadata Optimize delivery to National Data Centers Programmatic Quality Assessment Rolling Deck to Repository (R2R)

Identify potential problems in data No judgment on scientific utility Provide Feedback Vessel Operators – address problems Down-stream data users (scientists/engineers) – facilitate data use/re-use Enable evaluation of fleet-wide system performance over time R2R Quality Assessment: Goals

R2R: MB Quality Assessment Lead: S. O’Hara (LDEO) Leverage open source (MB System) Programmatically introspect data files Fully document tests, results, and ranking criteria/thresholds in I/O XML Customizable test thresholds Output includes QA Test Results, Ranking (R,Y,G) and other relevant info

R2R: MB Quality Assessment Lead: S. O’Hara (LDEO)

GMRT 1992 R2R 2009 MAC 2011 Multibeam Sonar Data Continuum GOAL: Well-documented high-quality publicly available data

GMRT Synthesis: Overview Open-access bathymetry product Support specialists & non-specialists Multi-resolutional synthesis – GEBCO + MBES + land + grids – Full-native resolution of MBES (100m+) Tiled Global Compilation – Images, grids, mask – Mercator, South Polar, North Polar – 2 scheduled releases / year (~80 cruises) Attribution to data contributors Access to source data

GMRT – Open Data Access Java Applications (GeoMapApp, Virtual Ocean) Web Application (GMRT MapTool) iPhone App (Earth Observer) Web Services Grid Server, Image Server, Attribution Service WMS (Mercator, SP, + NP 2016) Point Service + Profile Service (Dec 2015) Broad distribution through collaborations GEBCO, Google, ESRI, NOAA NCEI

GMRT: MB QA/QC Bad navigation Noisy outer beams Attitude problems Bad soundings Instrument problems Bad weather Sound velocity Slow speed in turns Quality assessment –Grid weighting –Grid resolution

Raw MB Files* MB QA/QC Processed MB files rDB *source data in public domain Tiled images, grids, mask GMRT Services GMRT – MBES Workflow MB System

GMRT – MBES Content GMRT v3.1 released Nov 2015 >175,000 data files + metadata ~4.4 million ship-track km of data 875 cruises 26 Ships 21 Swath File Formats 15 Sonar Systems – Most modern data acquired with Kongsberg systems

GMRT Metadata Per Data Set Processing notes Make/Model/Ship Quality, resolution Contributor Per Data File Metrics (from mbinfo) Track-line geometry Under Development Polygon geometry Area mapped Processing metrics Per Data Set Processing notes Make/Model/Ship Quality, resolution Contributor Per Data File Metrics (from mbinfo) Track-line geometry Under Development Polygon geometry Area mapped Processing metrics

GMRT 1992 R2R 2009 MAC 2011 Multibeam Sonar Data Continuum GOAL: Well-documented high-quality publicly available data

Pulling the pieces together…

Consolidated access to resources

Fleet-wide Review of R2R MBQA Tests & Results Which tests correspond with issues corrected in GMRT processing? Are test thresholds correct? Other tests needed?

Fleet-wide Review of GMRT Processed Data Statistics Are there metrics can we programmatically assemble from processed data that can help improve data quality at acquisition?

Compare/Combine Results Preliminary

Next Steps… Quantitatively compare R2R MBQA results with GMRT Processed Data Statistics –Identify which tests are working –Refine MBQA tests/parameters Code-sharing between MAC-GMRT-R2R Use fleet-wide data review (MBQA tests + GMRT results) to help improve MAC guidelines/best practices