Development of Quality Monitoring System on satellite Sea Surface Salinity Products Yongsheng Zhang NESDIS/NODC – UMD College Park/ESSIC/CICS.

Slides:



Advertisements
Similar presentations
Conversion of CPC Monitoring and Forecast Products to GIS Format Viviane Silva Lloyd Thomas, Mike Halpert and Wayne Higgins.
Advertisements

1 NASA CEOP Status & Demo CEOS WGISS-25 Sanya, China February 27, 2008 Yonsook Enloe.
Chapter 2 Exploring Data with Graphs and Numerical Summaries
SMOS-BEC – Barcelona (Spain) CP34/BEC L3-L4 maps internal production chain BEC team SMOS Barcelona Expert Centre Pg. Marítim de la Barceloneta 37-49, Barcelona.
OSE meeting GODAE, Toulouse 4-5 June 2009 Interest of assimilating future Sea Surface Salinity measurements.
Class Session #2 Numerically Summarizing Data
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
1 A new set of tools for all altimetry users J. Benveniste (ESA), N. Picot (CNES), V. Rosmorduc (CLS) et al.
MEASURES OF SPREAD – VARIABILITY- DIVERSITY- VARIATION-DISPERSION
Homework Questions. Quiz! Shhh…. Once you are finished you can work on the warm- up (grab a handout)!
Lesson 4 Compare datas.
Vocabulary for Box and Whisker Plots. Box and Whisker Plot: A diagram that summarizes data using the median, the upper and lowers quartiles, and the extreme.
TPAC Digital Library Talk Overview Presenter:Glenn Hyland Tasmanian Partnership for Advanced Computing & Australian Antarctic Division Outline: TPAC Overview.
Exploratory Data Analysis. Height and Weight 1.Data checking, identifying problems and characteristics Data exploration and Statistical analysis.
(c) 2007 IUPUI SPEA K300 (4392) Outline: Numerical Methods Measures of Central Tendency Representative value Mean Median, mode, midrange Measures of Dispersion.
Objectives 1.2 Describing distributions with numbers
Summary statistics Using a single value to summarize some characteristic of a dataset. For example, the arithmetic mean (or average) is a summary statistic.
Bringing it All Together: NODC’s Geoportal Server as an Integration Tool for Interoperable Data Services Kenneth S. Casey, Ph.D. YuanJie Li NOAA National.
Measures of Variability In addition to knowing where the center of the distribution is, it is often helpful to know the degree to which individual values.
STA Lecture 131 STA 291 Lecture 13, Chap. 6 Describing Quantitative Data – Measures of Central Location – Measures of Variability (spread)
John T. Gunn, AVDS Manager Earth & Space Research 1910 Fairview Ave. E., Suite 210 Seattle, WA Ph: ; 1 st Joint GOSUD/SAMOS.
Calibration and Validation Studies for Aquarius Salinity Retrieval PI: Shannon Brown Co-Is: Shailen Desai and Anthony Scodary Jet Propulsion Laboratory,
Exploratory Data Analysis
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
Measures of Position & Exploratory Data Analysis
Discovery and Web Services in Support of SST Datasets at the PO.DAAC Edward Armstrong, Jorge Vazquez Toshio M. Chin, Charles Thompson Jet Propulsion Laboratory/California.
Data Access to Marine Surface Observations and Products from COADS 29 January, 2002 Steven Worley National Center for Atmospheric Research.
The Practice of Statistics Third Edition Chapter 1: Exploring Data 1.2 Describing Distributions with Numbers Copyright © 2008 by W. H. Freeman & Company.
1 Dapper and Argo Joe Sirott PMEL/NOAA. 2 What is Dapper? Web server that provides distributed access to in-situ data via OPeNDAP protocol Clients include.
OceanBrowser viewing service, overview and upgrades Alexander Barth (1), Charles Troupin (2), Aida Alvera Azcárate (1), Jean-Marie Beckers (1) (1) University.
NODC ↔ Data Consumers Steve Rutz NOAA/NESDIS National Oceanographic Data Center NODC Observing Systems Team Leader June 21, 2011.
Opendap dev - meeting, Boulder, Feb 2007 OPeNDAP infrastructure in European Operational Oceanography T Loubrieu (IFREMER) T Jolibois (CLS)
ICOADS: Update Status and Data Distribution Steven J. Worley Scott D. Woodruff Sandra J. Lubker Ziahua Ji J. Eric Freeman NCAR, NOAA/ESRL, NOAA/NCDC CLIMAR-III,
VIPER Quality Assessment Overview Presenter: Sathyadev Ramachandran, SPS Inc.
INVESTIGATION 1.
T T03-01 Calculate Descriptive Statistics Purpose Allows the analyst to analyze quantitative data by summarizing it in sorted format, scattergram.
Argo workshop in Ghana, December Argo data status & data access.
IODE Ocean Data Portal - ODP  The objective of the IODE Ocean Data Portal (ODP) is to facilitate and promote the exchange and dissemination of marine.
Chapter 2: Describing Location In a Distribution Section 2.1 Measures of Relative Standing And Density Curves.
Introduction to Statistics Santosh Kumar Director (iCISA)
UAF/OSMC Presenters: Kevin O’Brien and Eugene Burger Abstract: Kevin O’Brien and Eugene Burger are from NOAA’s Pacific Marine Environmental Laboratory.
NQuery: A Network-enabled Data-based Query Tool for Multi-disciplinary Earth-science Datasets John R. Osborne.
© Crown copyright Met Office The EN4 dataset of quality controlled ocean temperature and salinity profiles and monthly objective analyses Simon Good.
s Donna J. Scott, Marilyn Kaminski, Jason Wolfe, Terry Haran NSIDC's MODIS Snow and Sea Ice Products NSIDC provides a suite.
Ed Armstrong – PI Luca Cinquini Chris Mattmann NASA Jet Propulsion Laboratory Frank O’Brien Zach Siegrist System Science Applications, Inc. 18 July 2012.
Central Region Snowfall Analysis Brian P. Walawender NWS Central Region Headquarters Matt W. Davis NWS WFO La Crosse, WI 5/26/2011.
Yield Cleaning Software and Techniques OFPE Meeting
National Aeronautics and Space Administration Aquarius Validation Data System Overview and Status Algorithm Workshop March 21-22, 2007 John Gunn Earth.
00/XXXX 1 Data Processing in PRISM Introduction. COCO (CDMS Overloaded for CF Objects) What is it. Why is COCO written in Python. Implementation Data Operations.
1 Adventures in Web Services for Large Geophysical Datasets Joe Sirott PMEL/NOAA.
An Introduction to the Argo Data Sytem South Pacific Workshop 11 – 14 October 2005 Mark Ignaszewski FNMOC.
US Army Engineer Research and Development Center U.S. IOOS MODEL VALIDATION CAPABILITY SURA SUPER-REGIONAL TEST BED Working across agencies to bring observations.
Cal/Val for physics MED-MFC internal meeting CMCC-INGV-SOCIB Lecce E. Clementi, INGV.
1 WHY WE USE EXPLORATORY DATA ANALYSIS DATA YES NO ESTIMATES BASED ON NORMAL DISTRIB. KURTOSIS, SKEWNESS TRANSFORMATIONS QUANTILE (ROBUST) ESTIMATES OUTLIERS.
What is a box-and-whisker plot? 5-number summary Quartile 1 st, 2 nd, and 3 rd quartiles Interquartile Range Outliers.
QWG-10 – 4-6 February 2013 – ESRIN (Italy) SMOS Level3 and Level 4 Research Products Provided by the Barcelona Expert Center Jordi Font and BEC team SMOS.
Data Management System to Collect, Quality Control, Distribute, and Archive Near Real-time Marine Data Jeremy J. Rolph, Jacob T. Rettig, Mark A. Bourassa,
Climate-SDM (1) Climate analysis use case –Described by: Marcia Branstetter Use case description –Data obtained from ESG –Using a sequence steps in analysis,
1 2.5 DISTRIBUTED DATA INTEGRATION WTF-CEOP (WGISS Test Facility for CEOP) May 2007 Yonsook Enloe (NASA/SGT) Chris Lynnes (NASA)
Last chapter... Four Corners: Go to your corner based on if your birthday falls in the Winter, Spring, Summer, or Fall; 1 minute In your group, come to.
New HR-DDS User Services David J. S. Poulter, National Oceanography Centre, UK
Proposal of Satellite Data Center India Meteorological Department A.K.Sharma (Chairman), Virendera Singh (Member), R.K.Giri (Member) and N.Puviarasan (Member.
New Australian High Resolution AVHRR SST Products from the Integrated Marine Observing System Presented at the GHRSST Users Symposium, Santa Rosa, USA,
NQuery: A Network-enabled Data-based Query Tool for Multi-disciplinary Earth-science Datasets John R. Osborne 1, Kevin T. McHugh 2, and Donald W. Denbo.
Sea Surface Temperature Distribution from the Physical Oceanography DAAC Ed Armstrong JPL PO.DAAC MODIS Science Team Meeting.
Data Browsing/Mining/Metadata
Exploring Data: Summary Statistics and Visualizations
Description of Data (Summary and Variability measures)
2.1 Density Curve and the Normal Distributions
Program This course will be dived into 3 parts: Part 1 Descriptive statistics and introduction to continuous outcome variables Part 2 Continuous outcome.
Presentation transcript:

Development of Quality Monitoring System on satellite Sea Surface Salinity Products Yongsheng Zhang NESDIS/NODC – UMD College Park/ESSIC/CICS

In the past four years, NODC has developed a quality monitoring system known as the data Rich Inventory (RI) for Jason-2 level-2 products. The concept of RI is originally developed by NGDC/NESDIS for data management in CLASS. The basic idea is to extract existing metadata and calculate the QA statistics in a granule, store them in a database, and make them available as part of a data discovery system. NODC’s Jason-2 data quality monitoring system provides: 1) real-time original data process and visualization, statistical value calculation and QA information packing, and conversion of the outputs in CF compliant NetCDF format. 2) Provided data achiever and user the data quality information in numerical presentation via various data access tools (http, ftp and OPeNDAP) and graphical one via web interface. Development of Quality Monitoring System at NODC

1.To develop automation processing and visualizing tools for near real-time level-2 SMOS data in swath, daily, 3-day and monthly mean time frames; 2.To develop tools for calculation of QA statistical values in a SMOS level-2 swath data file and visualization (valid observation number, mean, standard deviation, minimum & maximum) ; 3.To perform and develop tools and methods for histogram analyses for comparing the monthly mean SSS among SMOS, Aquarius and NODC objective analysis from In-situ observations; 4.To Conduct investigation on SMOS QA flags choices Current tasks and goals of QA monitoring on SMOS/Aquarius Sea surface salinity products

1.0x1.0 longitude/latitude box average; Applied Quality Flags: 1. Dg_quality_sss_# (model #): Quality index for sss, lower=better, values over 300 are filtered 2. Control_Flags_# (model #): Total = 32, but only contains 27 meaningful control flags, all applied except mask(tag # 1), roughness correction (tag # 16), availability of ECMWF(tag # 19) and grid point measurement discrimination test (tag # 19). Software: FORTRAN,C-shell, GrADS and Linux image convert; Outputs: EPS and PNG format. Development visualizations of level-2 SMOS data in swath, daily, 3-day and monthly mean

Development of tools for calculation of quality statistics a level-2 granule Six QA statistics: – Valid number of observations – Number of observations over 3-sigma – Mean – Standard deviation – Minimum & Maximum Provide visualizations (EPS and PNG); Convert the data into CF compliant NetCDF format and provide data access via LAS/OPeNDAP/THREDDS servers (future work)

Data filters: SSS1=-999; Dg_quality_SSS_1=999, Control flags not applied Zero Obs NO

Data Sets and Histogram Analysis Data sets: monthly 1x1 degree, April Aquarius: JPL PO.DDAC Level-3 mapped and smoothed version 1.3. SMOS: gridded by NODC from level-2 swath data with Dq_quality and control flag filters. NODC monthly objective analysis. Histogram analysis References: “The SST quality Monitor (SQUAM)”, P. Dash et al Median and Robust Standard Deviation (RSD) are used to construct thresholds to remove extreme values, compared to conventional mean and Standard Deviation (SD) : SD= RSD=IOR/S IOR is the interquartile range: difference of the values of 75 th and 25 th percentile in an ordered dataset. S=1.348 for an ideal normal distribution Outliers: median ± 4xRSD

Difference of monthly SSS between SMOS and Aquarius

Histograms of SSS difference between SMOS and Aquarius April 2012 Number Density (%) SMOS Model 1SMOS Model 2SMOS Model 3 Left outlier: Value median+4xRSD

Histograms of SSS difference between NODC OA and SMOS April 2012 SMOS Model 1SMOS Model 2SMOS Model 3 Number Density (%)

Histogram of SSS difference between NODC OA and Aquarius April 2012

Histogram statistics for monthly difference among NODC OA, SMOS, and Aquarius SSS, April 2012 Observation N meanmedian Standard Deviation Robust Standard Deviation N: left outlier N: right outlier SMOS(M1)-Aqu SMOS(M2)-Aqu SMOS(M3)-Aqu NODC OA-SMOS (M1) NODC OA-SMOS (M2) NODC OA-SMOS (M3) NODC OA-Aqu

SMOS level-2 data quality flags choices and impact in monthly mean results Test and Applied Quality Flags: Dg_quality_sss_# – Quality index for sss, lower=better, values over 300 are filtered Control_Flags_# – Total 32, but only contains 27 meaningful control flags, all applied except: – Mask (tag # 1) – Roughness correction (tag # 16) – Availability of ECMWF(tag # 19) – Grid point measurement discrimination test (tag # 19).

Distributions of number density of Dg_quality_sss_# in April 2012 (849 swath files) number density (10**6) Values of Dg_quality_sss_# flags

Dg_quality_SSS_1 GoodBad

Data filtered: Dg_quality_SSS_1 <= 300

Data filtered: Dg_quality_SSS_1 >= 300

SMOS Filter: Dg_quality_SSS_1 =999 SMOS Filter: Dg_quality_SSS_1 <300 SMOS Filters: Dg_quality_SSS_1 =999; Control_flags_1, except tag# 1,6,16,18,19 SMOS Filters: Dg_quality_SSS_1 <300; Control_flags_1, except tag# 1,6,16,18,19

Future Work Application of more flags in data quality analysis in SMOS level-2: Control_Flags_#, Science_Flags_# ; Compare difference among SMOS Models 1, 2 and 3. Develop NetCDF format for the QA statistics and data visualization tools; Histogram analysis and development NetCDF format for the histogram statistical data and visualizations; Development of software for automation processing and user service interfaces