The homogenization of GPS Integrated Water Vapour time series: methodology and benchmarking the algorithms on synthetic datasets R. Van Malderen1, E. Pottiaux2,

Slides:



Advertisements
Similar presentations
EURANDOM & KNMI, May 2009 Analysis of extremes in a changing climate in support of informed decisions for adaptation
Advertisements

Clima en España: Pasado, presente y futuro Madrid, Spain, 11 – 13 February 1 IMEDEA (UIB - CSIC), Mallorca, SPAIN. 2 National Oceanography Centre, Southampton,
Evaluating Potential Impacts of Climate Change on Surface Water Resource Availability of Upper Awash Sub-basin, Ethiopia rift valley basin. By Mekonnen.
Collaboration FST-ULCO 1. Context and objective of the work  Water level : ECEF Localization of the water surface in order to get a referenced water.
“Real-time” Transient Detection Algorithms Dr. Kang Hyeun Ji, Thomas Herring MIT.
Benchmark database based on surrogate climate records Victor Venema.
Benchmark database inhomogeneous data, surrogate data and synthetic data Victor Venema.
Benchmark database inhomogeneous data, surrogate data and synthetic data Victor Venema.
Two and a half problems in homogenization of climate series concluding remarks to Daily Stew Ralf Lindau.
R. Van Malderen 1,5, E. Pottiaux 2,5, H. Brenot 3 and S. Beirle 4 R OYAL M ETEOROLOGICAL I NSTITUTE OF B ELGIUM 1 R OYAL O BSERVATORY OF B ELGIUM 2 B ELGIAN.
A new prototype AMSR-E SWE operational algorithm M. Tedesco The City College of New York, CUNY, NYC With contributions from : Chris Derksen, Jouni Pulliainen,
COSTOC Olivier MestreMétéo-FranceFrance Ingebor AuerZAMGAustria Enric AguilarU. Rovirat i VirgiliSpain Paul Della-MartaMeteoSwissSwitzerland Vesselin.
SCIENTIFIC REPORT ON COST SHORT TERM SCIENTIFIC MISSION Tania Marinova National Institute of Meteorology and Hydrology at the Bulgarian Academy of Sciences,
Reducing Canada's vulnerability to climate change - ESS J28 Earth Science for National Action on Climate Change Canada Water Accounts AET estimates for.
European Climate Assessment CCl/CLIVAR ETCCDMI meeting Norwich, UK November 2003 Albert Klein Tank KNMI, the Netherlands.
Combining CMORPH with Gauge Analysis over
Quality control of daily data on example of Central European series of air temperature, relative humidity and precipitation P. Štěpánek (1), P. Zahradníček.
Modern Era Retrospective-analysis for Research and Applications: Introduction to NASA’s Modern Era Retrospective-analysis for Research and Applications:
Eurostat – Unit D5 Key indicators for European policies Third International Seminar on Early Warning and Business Cycle Indicators Annotated outline of.
Issues in GPS Error Analysis What are the sources of the errors ? How much of the error can we remove by better modeling ? Do we have enough information.
September Advertisement: CM SAF Data freely available in netcdf-format User-friendly data access via the Web User Interface:
Benchmark database inhomogeneous data, surrogate data and synthetic data Victor Venema.
Evapotranspiration Estimates over Canada based on Observed, GR2 and NARR forcings Korolevich, V., Fernandes, R., Wang, S., Simic, A., Gong, F. Natural.
Development and testing of homogenisation methods: Moving parameter experiments Peter Domonkos and Dimitrios Efthymiadis Centre for Climate Change University.
Meteorological Observatory Lindenberg – Richard Assmann Observatory (2010) GCOS Reference Upper Air Network Holger Vömel Meteorological Observatory Lindenberg.
Bob Livezey NWS Climate Services Seminar February 13, 2013.
ACTION COST-ES0601: Advances in homogenisation methods of climate series: an integrated approach (HOME), WG Meeting, Palma de Mallorca, January, 25-27,
Developing long-term homogenized climate Data sets Olivier Mestre Météo-France Ecole Nationale de la Météorologie Université Paul Sabatier, Toulouse.
Water vapour estimates over Antarctica from 12 years of globally reprocessed GPS solutions Ian Thomas, Matt King, Peter Clarke Newcastle University, UK.
Of what use is a statistician in climate modeling? Peter Guttorp University of Washington Norwegian Computing Center
The ENSEMBLES high- resolution gridded daily observed dataset Malcolm Haylock, Phil Jones, Climatic Research Unit, UK WP5.1 team: KNMI, MeteoSwiss, Oxford.
Chalmers University of Technology Site-Dependent Electromagnetic Effects in High-Accuracy Applications of GNSS Jan Johansson and Tong Ning Chalmers University.
1 Detection of discontinuities using an approach based on regression models and application to benchmark temperature by Lucie Vincent Climate Research.
Data quality control for the ENSEMBLES grid Evelyn Zenklusen Michael Begert Christof Appenzeller Christian Häberli Mark Liniger Thomas Schlegel.
The joint influence of break and noise variance on break detection Ralf Lindau & Victor Venema University of Bonn Germany.
GWDAW11 – Potsdam Results by the IGEC2 collaboration on 2005 data Gabriele Vedovato for the IGEC2 collaboration.
IGARSS 2011, Vancuver, Canada July 28, of 14 Chalmers University of Technology Monitoring Long Term Variability in the Atmospheric Water Vapor Content.
A global, 2-hourly atmospheric precipitable water dataset from ground-based GPS measurements and its scientific applications Junhong (June) Wang NCAR/EOL/T.
William Prosser April 15, Introduction to Probability of Detection (POD) for Nondestructive Evaluation (NDE) This briefing is for status only and.
The first AURIGA-TAMA joint analysis proposal BAGGIO Lucio ICRR, University of Tokyo A Memorandum of Understanding between the AURIGA experiment and the.
Analyzing circadian expression data by harmonic regression based on autoregressive spectral estimation Rendong Yang and Zhen Su Division of Bioinformatics,
Actions & Activities Report PP8 – Potsdam Institute for Climate Impact Research, Germany 2.1Compilation of Meteorological Observations, 2.2Analysis of.
The validation and user assessment of the SL-CCI products has been performed through: - Internal consistency check and comparison with in-situ data. -
ESA Climate Change Initiative Sea-level-CCI project A.Cazenave (Science Leader), G.Larnicol /Y.Faugere(Project Leader), M.Ablain (EO) MARCDAT-III meeting.
COST ES1206 (GNSS4SWEC) M.C. Meeting Reykjavik, Iceland NATIONAL REPORT OF BELGIUM, 2016 Eric Pottiaux, Royal Observatory of Belgium (ROB) Roeland van.
Recent GNSS activities in Germany for COST
Ann Mari Fjæraa Philipp Schneider Tove Svendby
Royal Meteorological Institute of Belgium
Recent GNSS activities in Germany for COST and E-GVAP
WG3: GNSS for climate monitoring
Synthetic benchmark datasets for homogenization of IWV time series
Royal Meteorological Institute of Belgium
Overview of GNSS & GNSS-Met situation in France
A global model for the conversion of ZWD to IWV
Online Conditional Outlier Detection in Nonstationary Time Series
The homogenization of GPS Integrated Water Vapour time series: methodology and benchmarking the algorithms on synthetic datasets R. Van Malderen1, E. Pottiaux2,
Royal Meteorological Institute of Belgium
Change in Flood Risk across Canada under Changing Climate
Carsten Boldsen Hansen Economic Statistics Section, UNECE
On instrumental errors and related correction strategies of ozonesondes: possible effect on calculated ozone trends for the nearby sites Uccle and De Bilt.
Royal Meteorological Institute of Belgium
EMS Annual Meeting 2017 September 4-8, 2017 Dublin, Ireland
SVY207: Lecture 16 GPS Field Procedures and Computations
4th Joint EU-OECD Workshop on BCS, Brussels, October 12-13
R. Van Malderen1,6, E. Pottiaux2,6, G. Stankunavicius3, S. Beirle4,
SLOPE: A MATLAB Revival
ESOC IGS Reprocessing T. Springer, F. Dilssner, E. Schoenemann,
Royal Meteorological Institute of Belgium
Instrumental Surface Temperature Record
Reanalyses – A sharing of juicy tidbits* and gory details**
Presentation transcript:

The homogenization of GPS Integrated Water Vapour time series: methodology and benchmarking the algorithms on synthetic datasets R. Van Malderen1, E. Pottiaux2, A. Klos3, O. Bock4, J. Bogusz3, B. Chimani5, M. Elias6, M. Gruszczynska3, J. Guijarro7, S. Zengin Kazancı8 and T. Ning9 1 2 3 4 5 6 7 8 9 Royal Observatory of Belgium

Outline Motivation and introduction Methodology Synthetic dataset generation Assessment of the performance of the homogenization tools on the synthetic dataset Outlook

Motivation & introduction Performance of homogen tools Methodology Synthetic datasets Performance of homogen tools Outlook Motivation COST action GNSS4SWEC ‘Advanced Global Navigation Satellite Systems tropospheric products for monitoring severe weather events and climate’ WG3: Use of GNSS tropospheric products for climate monitoring (e.g. Integrated Water Vapour (IWV) = Precipitable Water (PW)) From different presentations at different GNSS4SWEC workshops, it turned out that different groups were showing results from time series analyses, sometimes based on the same datasets. They were dealing/struggling with the homogenization of their datasets.  common activity, on a reference IWV dataset.

Motivation & introduction Performance of homogen tools Methodology Synthetic datasets Performance of homogen tools Outlook Introduction: the reference IWV dataset From A. Klos 120 sites worldwide, with homogeneous reprocessing from 1995 – 2010 IGS repro 1: International GNSS Service, first reprocessing screened, outlier removed, and ZTD converted to IWV by O. Bock

Motivation & introduction Performance of homogen tools 20.06.2013 Motivation & introduction Methodology Synthetic datasets Performance of homogen tools Outlook Methodology: use of a reference IWV dataset (ERA-interim) good correlation between IGS repro 1 and ERA-interim model output but: not fully independent datasets ERA-interim is used to screen the GPS data. ERA-interim is used to convert the Zenith Total Delay from GPS to IWV (surface pressure, weighted mean temperature). ERA-interim not homogenous either! From O. Bock ERA-interim: 79km horizontal resolution Denkdag projecten 28,36 en 37 - Belspo

Motivation & introduction Performance of homogen tools Methodology Synthetic datasets Performance of homogen tools Outlook Methodology: use of a reference IWV dataset (ERA-interim) daily monthly From O. Bock We will look for break points/change points in the GPS – ERA-interim IWV differences series.

Motivation & introduction Performance of homogen tools Methodology Synthetic datasets Performance of homogen tools Outlook Generation of synthetic dataset: how? based on the comparison of the epochs identified by different tools , we decided to build synthetic datasets with known inserted offsets for the assessment of the performance of the different tools a sensitivity analysis of this performance on the properties of the datasets synthetic datasets of the IWV differences, 1. based on the characteristics of the real GPS – ERA-interim IWV differences: analysis of significant frequencies noise model linear trends gaps 2. based on the characteristics of the offsets in the real IWV differences: number typical amplitudes computations were performed with Maximum Likelihood Estimation (MLE) in the Hector software (Bos et al., 2013). around 1000 epochs of instrumental changes, reported in the stations metadata, were manually checked in the IWV differences (164 confirmed + 57 new ones)  also used for 1st order correction of the data

Motivation & introduction Performance of homogen tools Methodology Synthetic datasets Performance of homogen tools Outlook Noise model combination of autoregressive process of first order plus white noise (AR(1)+WN) characterized by: coefficient, fraction and amplitude (AR), amplitude (WN).

Motivation & introduction Performance of homogen tools Methodology Synthetic datasets Performance of homogen tools Outlook Noise model combination of autoregressive process of first order plus white noise (AR(1)+WN) characterized by: coefficient, fraction and amplitude (AR), amplitude (WN).

Motivation & introduction Performance of homogen tools Methodology Synthetic datasets Performance of homogen tools Outlook 3 variants of synthetic time series of IWV differences For every station (120 stations!), 3 variants of synthetic time series were created: ‘easy’ dataset: seasonal signals (annual, semi-annual, ter- and quater-annual, if present for a particular station) + offsets + white noise (WN) ‘less-complicated’ dataset: same as 1. + autoregressive process of the first order (noise model = AR(1)+WN) ‘fully-complicated’ dataset: same as 2. + trend + gaps (up to 20% of missing data = overshoot!)

Motivation & introduction Performance of homogen tools Methodology Synthetic datasets Performance of homogen tools Outlook 3 variants of synthetic time series of IWV differences For every station (120 stations!), 3 variants of synthetic time series were created: ‘easy’ dataset: seasonal signals (annual, semi-annual, ter- and quater-annual, if present for a particular station) + offsets + white noise (WN) ‘less-complicated’ dataset: same as 1. + autoregressive process of the first order (noise model = AR(1)+WN) ‘fully-complicated’ dataset: same as 2. + trend + gaps (up to 20% of missing data)

Motivation & introduction Performance of homogen tools 20.06.2013 Motivation & introduction Methodology Synthetic datasets Performance of homogen tools Outlook Which homogenization tools? Climatol J. Guijarro HOMOP B. Chimani PMTred T. Ning Non-parametric tests R. Van Malderen 2-sample t-test M. Elias Pettitt test S. Zengin Kazancı Method 1 Method 2 Method 3 Method 4 Method 5 Method 6 Method 7 Operator M. Elias R. Van Malderen J. Guijarro T. Ning S. Zengin B. Chimani Method / SW 2-sample t-test 2 of 3 PMW CLIMATOL PMTred Pettitt HOMOP Daily/Monthly D+M D X Easy/Less/Full E+L+F L+F E+F HOMOP: PRODIGE+SPLIDHOME PMTred: a penalized maximal t test (PMT) to empirically construct a penalty function that evens out the U-shaped false-alarm distribution over the relative position in the time series being tested. PMTred accounts for first-order autoregressive noise. Denkdag projecten 28,36 en 37 - Belspo

Motivation & introduction Performance of homogen tools Methodology Synthetic datasets Performance of homogen tools Outlook Which homogenization tools?

Motivation & introduction Performance of homogen tools Methodology Synthetic datasets Performance of homogen tools Outlook Assessment of the performance of tools on… … the identification of the epochs of the inserted breakpoints (+ sensitivity analysis) in the synthetic datasets. … the estimation of the trends that were or were not imposed to the 3 sets of synthetic IWV differences. Venema et al. 2012, Benchmarking homogenization algorithms for monthly data, Climate of the Past, 8, 89-115, doi:10.5194/cp-8-89-2012. Gazeaux et al. 2013, Detecting offsets in GPS time series: first results from the detection of offsets in GPS experiment, J. Geophys. Res. Solid Earth, 118, doi:10.1002/jgrb.50152.

Motivation & introduction Performance of homogen tools Methodology Synthetic datasets Performance of homogen tools Outlook 1. Identification of epochs of offsets: defining a proper time window daily monthly Mean epoch difference daily monthly Mean epoch difference  In this presentation, a time window of 62d (2 months), will be assumed.

Motivation & introduction Performance of homogen tools Methodology Synthetic datasets Performance of homogen tools Outlook 1. Identification of epochs of offsets: detection scores LESS COMPLICATED + A.R. Noise EASY good performance TP + TN > 40% FP < 40% FN < 40% Performance Increase Adapted from Gazeaux et al. 2013 good performance for the majority of the tools for the easy and less complicated dataset Performance Increase

Motivation & introduction Performance of homogen tools Methodology Synthetic datasets Performance of homogen tools Outlook 1. Identification of epochs of offsets: detection scores LESS COMPLICATED + gaps and trend FULLY COMPLICATED Adapted from Gazeaux et al. 2013 performance decreases drastically for almost all the tools when adding gaps and a trend in the benchmark time series

Motivation & introduction Performance of homogen tools Methodology Synthetic datasets Performance of homogen tools Outlook 1. Identification of epochs of offsets: detection scores = POD - POFD EASY Venema et al. 2012 T: 0.10  0.63 Precip. : 0.04  0.22 PSS LESS COMPLICATED PSS FULLY COMPLICATED PSS

Motivation & introduction Performance of homogen tools Methodology Synthetic datasets Performance of homogen tools Outlook 1. Identification of epochs of offsets: detection scores FULLY COMPLICATED POD POFD

Motivation & introduction Performance of homogen tools Methodology Synthetic datasets Performance of homogen tools Outlook 1. Identification of epochs of offsets: amplitudes of offsets EASY EASY DAILY TRUTH 

Motivation & introduction Performance of homogen tools Methodology Synthetic datasets Performance of homogen tools Outlook 1. Identification of epochs of offsets: amplitudes of offsets EASY EASY DAILY MONTHLY

Motivation & introduction Performance of homogen tools Methodology Synthetic datasets Performance of homogen tools Outlook 2. Trend estimation EASY FULLY COMPLICATED “Hector “ = output from MLE with simulated offsets large variety of trends (and uncertainties) for the different corrected time series

Motivation & introduction Performance of homogen tools Methodology Synthetic datasets Performance of homogen tools Outlook 2. Trend estimation FULLY COMPLICATED “Hector “ = output from MLE with simulated offsets most trends differ within ±0.05 mm/yr.

Motivation & introduction Performance of homogen tools Methodology Synthetic datasets Performance of homogen tools Outlook Workplan more detailed assessment of the performance of the different tools (and sensitivity analysis): CRMSE, … The epochs and amplitudes (and feedback) of the offsets in the synthetic datasets is given to the participants who already provided their solutions  fine-tuning of their tools We highly welcome other contributions (tools, solutions) in our activity! Interested? Please contact me at roeland@meteo.be. A next generation of a fully complicated synthetic dataset will be generated: fully complicated II? gaps decoupled from trend(s)? based on the difference of the synthetic IGS repro 1 minus the real ERA- interim? A second round of blind homogenization on this next generation dataset(s) will end in September.

Motivation & introduction Performance of homogen tools Methodology Synthetic datasets Performance of homogen tools Outlook Outlook application of the good performing tools on the IGS repro 1 define a common strategy to correct the IGS repro 1 dataset, based on criteria as (examples!): break points should be detected by a minimum of homogenization tools. break point should be present in the metadata file of the station. the amplitude of the offset should be above a certain limit. break points should be detected in other IWV differences time series (e.g. IGS minus NCEPNCAR reanalysis) Thereafter, the focus will be on a GNSS IWV dataset centred above Europe, homogeneously reprocessed, with about 100 sites (out of 280) with data from 1996-2014 ( neighbour-based approach possible?) We are looking forward to getting feedback/input/participation from your community!

Thank you for your attention

breakpoints detected in metadata & visual inspection, but not by any of the groups? breakpoints detected by a number (all) tools, but no metadata information? time window! When are breakpoints coincident?

If significant frequencies were not accounted for, it may lead to wrong trend estimates. Computations were performed with Maximum Likelihood Estimation (MLE) in the Hector software.

Characterization of real data: Annual signal. The largest differences between seasonal curves were found for KIT3, IISC and POL2 due to a shift in phase.

Why did we go for synthetic differences (DIFF_synt)? We examined ERAI_synt-GPS_synt and we couldn’t have covered some part of power… There is remaining unmodelled “signals and noise” in the time series of the real differences.