Benchmark database based on surrogate climate records Victor Venema.

Slides:



Advertisements
Similar presentations
Statistical modelling of precipitation time series including probability assessments of extreme events Silke Trömel and Christian-D. Schönwiese Institute.
Advertisements

Noise & Data Reduction. Paired Sample t Test Data Transformation - Overview From Covariance Matrix to PCA and Dimension Reduction Fourier Analysis - Spectrum.
Developing a Caribbean Climate Interactive Database (CCID) Rainaldo F. Crosbourne, Michael A. Taylor, A. M. D. Amarakoon** CLIMATE STUDIES GROUP MONA Department.
Kriging.
Repeat station crustal biases and accuracy determined from regional field models M. Korte, E. Thébault* and M. Mandea, GeoForschungsZentrum Potsdam (*now.
Budapest May 27, 2008 Unifying mixed linear models and the MASH algorithm for breakpoint detection and correction Anders Grimvall, Sackmone Sirisack, Agne.
Bayesian Image Super-resolution, Continued Lyndsey C. Pickup, David P. Capel, Stephen J. Roberts and Andrew Zisserman, Robotics Research Group, University.
REFERENCES Begert M., Schlegel T., Kirchhofer W., Homogeneous temperature and precipitation series of Switzerland from 1864 to Int. J. Climatol.,
Stratospheric Temperature Variations and Trends: Recent Radiosonde Results Dian Seidel, Melissa Free NOAA Air Resources Laboratory Silver Spring, MD SPARC.
TR32 time series comparison Victor Venema. Content  Jan Schween –Wind game: measurement and synthetic –Temporal resolution of 0.1 seconds  Heye Bogena.
Benchmark database inhomogeneous data, surrogate data and synthetic data Victor Venema.
Sorin CHEVAL*, Tamás SZENTIMREY**, Ancuţa MANEA*** *National Meteorological Administration, Bucharest, Romania and Euro-Mediterranean Centre for Climate.
Global analysis of recent frequency component changes in interannual climate variability Murray Peel 1 & Tom McMahon 1 1 Civil & Environmental Engineering,
Statistical characteristics of surrogate data based on geophysical measurements Victor Venema 1, Henning W. Rust 2, Susanne Bachner 1, and Clemens Simmer.
Daily Stew Kickoff – 27. January 2011 First Results of the Daily Stew Project Ralf Lindau.
Benchmark database inhomogeneous data, surrogate data and synthetic data Victor Venema.
Detected Inhomogeneities In Wind Direction And Speed Data From Ireland Predrag Petrović Republic Hydrometeorological Service of Serbia Mary Curley Met.
Two and a half problems in homogenization of climate series concluding remarks to Daily Stew Ralf Lindau.
Nynke Hofstra and Mark New Oxford University Centre for the Environment Trends in extremes in the ENSEMBLES daily gridded observational datasets for Europe.
Spatial Interpolation of monthly precipitation by Kriging method
Detection of inhomogeneities in Daily climate records to Study Trends in Extreme Weather Detection of Breaks in Random Data, in Data Containing True Breaks,
Benchmark dataset processing P. Štěpánek, P. Zahradníček Czech Hydrometeorological Institute (CHMI), Regional Office Brno, Czech Republic, COST-ESO601.
After HOME : Progress in the practical application of statistical homogenisation Peter Domonkos Dimitrios Efthymiadis Centre for Climate Change University.
COSTOC Olivier MestreMétéo-FranceFrance Ingebor AuerZAMGAustria Enric AguilarU. Rovirat i VirgiliSpain Paul Della-MartaMeteoSwissSwitzerland Vesselin.
SCIENTIFIC REPORT ON COST SHORT TERM SCIENTIFIC MISSION Tania Marinova National Institute of Meteorology and Hydrology at the Bulgarian Academy of Sciences,
Gridding Daily Climate Variables for use in ENSEMBLES Malcolm Haylock, Climatic Research Unit Nynke Hofstra, Mark New, Phil Jones.
Secular variation in Germany from repeat station data and a recent global field model Monika Korte and Vincent Lesur Helmholtz Centre Potsdam, German Research.
On the multiple breakpoint problem and the number of significant breaks in homogenisation of climate records Separation of true from spurious breaks Ralf.
Temporal Analysis using Sci2 Ted Polley and Dr. Katy Börner Cyberinfrastructure for Network Science Center Information Visualization Laboratory School.
Dataset Development within the Surface Processes Group David I. Berry and Elizabeth C. Kent.
Breaks in Daily Climate Records Ralf Lindau University of Bonn Germany.
Progress Toward a New Weather Generator Eric Schmidt, Colorado State University - Pueblo Dr. James O’Brien, Florida State University Anthony Arguez, Florida.
National Climate Monitoring Products Andrew Watkins and John Kennedy (updated 28/4/2014)
European Climate Assessment CCl/CLIVAR ETCCDMI meeting Norwich, UK November 2003 Albert Klein Tank KNMI, the Netherlands.
Quality control and homogenization of the COST benchmark dataset Petr Štěpánek Pavel Zahradníček Czech Hydrometeorological Institute, regional office Brno.
Correction of daily values for inhomogeneities P. Štěpánek Czech Hydrometeorological Institute, Regional Office Brno, Czech Republic
Quality control of daily data on example of Central European series of air temperature, relative humidity and precipitation P. Štěpánek (1), P. Zahradníček.
Gap-filling and Fault-detection for the life under your feet dataset.
WFM 6311: Climate Risk Management © Dr. Akm Saiful Islam WFM 6311: Climate Change Risk Management Akm Saiful Islam Lecture-7:Extereme Climate Indicators.
Regional Climate Model Evaluation System based on satellite and other observations for application to CMIP/AR downscaling Peter Lean 1, Jinwon Kim 1,3,
September Advertisement: CM SAF Data freely available in netcdf-format User-friendly data access via the Web User Interface:
On the reliability of using the maximum explained variance as criterion for optimum segmentations Ralf Lindau & Victor Venema University of Bonn Germany.
Benchmark database inhomogeneous data, surrogate data and synthetic data Victor Venema.
A novel methodology for identification of inhomogeneities in climate time series Andrés Farall 1, Jean-Phillipe Boulanger 1, Liliana Orellana 2 1 CLARIS.
Experience regarding detecting inhomogeneities in temperature time series using MASH Lita Lizuma, Valentina Protopopova and Agrita Briede 6TH Homogenization.
ACTION COST-ES0601: Advances in homogenisation methods of climate series: an integrated approach (HOME), WG Meeting, Palma de Mallorca, January, 25-27,
“Building the daily observations database for the European Climate Assessment” KNMI.nl CLARIS meeting, 7 july 2005.
Homogenization of Chinese daily surface air temperatures:An update for CHHT1.0 Li Qingxiang, Xu Wenhui, Xiaolan Wang, and coauthors (National Meteorological.
Developing long-term homogenized climate Data sets Olivier Mestre Météo-France Ecole Nationale de la Météorologie Université Paul Sabatier, Toulouse.
Perspectives on Historical Observing Practices and Homogeneity of the Snowfall Record Kenneth E. Kunkel NOAA Cooperative Institute for Climate and Satellites.
The ENSEMBLES high- resolution gridded daily observed dataset Malcolm Haylock, Phil Jones, Climatic Research Unit, UK WP5.1 team: KNMI, MeteoSwiss, Oxford.
1 Detection of discontinuities using an approach based on regression models and application to benchmark temperature by Lucie Vincent Climate Research.
Data quality control for the ENSEMBLES grid Evelyn Zenklusen Michael Begert Christof Appenzeller Christian Häberli Mark Liniger Thomas Schlegel.
The joint influence of break and noise variance on break detection Ralf Lindau & Victor Venema University of Bonn Germany.
ENVIRONMENTAL AGENCY OF THE REPUBLIC OF SLOVENIA COST benchmark dataset homogenisation: issues and remarks of the “Slovenian team” Presentation.
Calorimeter global commissioning: progress and plans Patrick Robbe, LAL Orsay & CERN, 25 jun 2008.
Homogenization of daily data series for extreme climate index calculation Lakatos, M., Szentimey T. Bihari, Z., Szalai, S. Meeting of COST-ES0601 (HOME)
Inhomogeneities in temperature records deceive long-range dependence estimators Victor Venema Olivier Mestre Henning W. Rust Presentation is based on:
The SeaDataNet data products regional temperature and salinity historical data collections S. Simoncelli 1, C. Coatanoan 2, O. Bäck 3, H. Sagen 4, S.
Actions & Activities Report PP8 – Potsdam Institute for Climate Impact Research, Germany 2.1Compilation of Meteorological Observations, 2.2Analysis of.
Benchmark database Victor Venema, Olivier Mestre, Enric Aguilar, Ingeborg Auer, José A. Guijarro, Petr Stepanek, Claude.N.Williams, Matthew Menne, Peter.
The homogenization of GPS Integrated Water Vapour time series: methodology and benchmarking the algorithms on synthetic datasets R. Van Malderen1, E. Pottiaux2,
Downloading Weather Observations
The homogenization of GPS Integrated Water Vapour time series: methodology and benchmarking the algorithms on synthetic datasets R. Van Malderen1, E. Pottiaux2,
Break and Noise Variance
The break signal in climate records: Random walk or random deviations
Meeting of COST-ES0601 (HOME) Mallorca JAN 2010
Dipdoc Seminar – 15. October 2018
Presentation of GSICS Inter-Calibration Results - Web Displays
Defining the Products: ‘GSICS Correction’
Presentation transcript:

Benchmark database based on surrogate climate records Victor Venema

Goals of COST-HOME working group 1  Literature survey  Benchmark dataset –Known inhomogeneities –Test the homogenisation algorithms (HA)

Benchmark dataset 1)Real (inhomogeneous) climate records  Most realistic case  Investigate if various HA find the same breaks  Good meta-data 2)Synthetic data  For example, Gaussian white noise  Insert know inhomogeneities  Test performance 3)Surrogate data  Empirical distribution and correlations  Insert know inhomogeneities  Compare to synthetic data: test of assumptions

Creation benchmark – Outline talk 1)Start with homogeneous data 2)Multiple surrogate and synthetic realisations 3)Mask surrogate records 4)Add global trend 5)Insert inhomogeneities in station time series 6)Published on the web 7)Homogenize by COST participants and third parties 8)Analyse the results and publish

1) Start with homogeneous data  Monthly mean temperature and precip (France)  Later also daily data  Later maybe other variables  Homogeneous  No missing data  Detrended  20 to 30 years is enough for good statistics  Longer surrogates are based on multiple copies –Larger scale correlations are small –Distribution well defined with 30a data  Generated networks are: 50, 100 and 200 a long

2) Multiple surrogate realisations  Multiple surrogate realisations –Temporal correlations –Station cross-correlations –Empirical distribution function  Annual cycle removed before, added at the end  Number of stations between 5 and 20  Cross correlation varies as much as possible  Show plot temporal structure of surrogates  Show plot cross correlations

One station – with annual cycle

One station – anomalies

Multiple stations – 10 year zoom

IAAFT algorithm smoothes jumps

3) Mask surrogate records  Beginning of records jagged (rough)  Linear increase in number of stations  Last station after 25% of full time  End of record all stations are measuring  Influence of jagged edge on detection and correction  But trend is also increasing in time (i.e. different)!  Is this a problem?

3) Mask surrogate records

4) Add global trend  NASA GISS GISS Surface Temperature Analysis (GISTEMP) by J. Hansen  Global mean surface temperature  Last year of any surrogate network is 1999

5) Insert inhomogeneities in stations  Random breaks (implemented)  Frequency of breaks 1/20a, 1/40a  Size constants for temperature: 0.25, 0.5, 1.0 °C  Size factors for rain: 0.8, 0.9, 1.1, 1.2  Simultaneous breaks  Frequency of breaks 1/50a  In 10 to 50 % of network

5) Insert inhomogeneities in stations  Outliers  Frequency: 1 – 3 %  Size: 99 and 99.9 percentiles  Local trends (only temperature)  Linear increase or decrease in one station  Duration: 30, 60a  Maximum size: 0.2 to 1.5 °C  Frequency: once in 10 % of the stations

6) Published on the web  Inhomogeneous data will be published on the COST-HOME homepage  Everyone is welcome to download and homogenize the data

7) Homogenize by participants  Return homogenised data  Should be in COST-HOME file format (next slide)  Return break detections –BREAK –OUTLI –BEGTR –ENDTR  Multiple breaks at one data possible

7) Homogenize by participants  COST-HOME file format: venema/themes/homogenisation/costhome_fileformat.pdf  For benchmark & COST homogenisation software  One data and one quality-flag file per station  Filename: variable, resolution, quality, station  ASCII network-file with station names  ASCII break-file with dates and station names

COST-HOME file format – monthly data

COST-HOME file format – network file

8) Analyse the results  Detailed analysis will be performed in the working groups –Detection –Correction –Daily data homogenisation  Synthetic and surrogate data –RMS Error –No. breaks detected (function of size) –Application: reduction in the scatter in the trends  Performance difference between synthetic (Gaussian, white noise) and surrogate data

Work in progress  Monthly precipitation  Implement some inhomogeneity types  Daily data: other inhomogeneities  Synthetic data (Gaussian white noise)  More input data!  Agree on the details of the benchmark –Next meeting?  Set deadline for the availability benchmark  Deadline for the return of the homogeneous data

Questions  Ideas for a better benchmark  For example, for other inhomogeneities, constants  Types of inhomogeneities for daily data  Automatic processing –In the order of 100 networks

7) Homogenize by participants  COST-HOME file format: venema/themes/homogenisation/costhome_fileformat.pdf  For benchmark & COST homogenisation software  Regular ASCII matrix (columns)  One data and one quality-flag file per station  Yearly, daily, subdaily data: columns for time, one for data  Monthly data: year column, 12 columns for data  Filename: variable, resolution, quality, station  ASCII network-file with station names  ASCII break-file with dates and station names