Data Transformation and Uncertainty in Geostatistical Combination of Radar and Rain Gauges CSI 991 Dan Basinger.

Slides:



Advertisements
Similar presentations
Pattern Recognition and Machine Learning
Advertisements

Spatial point patterns and Geostatistics an introduction
Chapter 3 Properties of Random Variables
1 12. Principles of Parameter Estimation The purpose of this lecture is to illustrate the usefulness of the various concepts introduced and studied in.
Sensitivity Analysis In deterministic analysis, single fixed values (typically, mean values) of representative samples or strength parameters or slope.
Forest Hydrology: Lect. 18
Basic geostatistics Austin Troy.
Cox Model With Intermitten and Error-Prone Covariate Observation Yury Gubman PhD thesis in Statistics Supervisors: Prof. David Zucker, Prof. Orly Manor.
1 Alberto Montanari University of Bologna Simulation of synthetic series through stochastic processes.
FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions.
PROVIDING DISTRIBUTED FORECASTS OF PRECIPITATION USING A STATISTICAL NOWCAST SCHEME Neil I. Fox and Chris K. Wikle University of Missouri- Columbia.
Introduction This project deals with conversion of Vector based Probable Maximum Precipitation (PMP) data into Raster based PMP data using different interpolation.
Curve-Fitting Regression
Spatial Interpolation
Evaluating Hypotheses
Applied Geostatistics
WFM 5201: Data Management and Statistical Analysis
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
Deterministic Solutions Geostatistical Solutions
Topic 3: Regression.
Ordinary Kriging Process in ArcGIS
Spatial statistics 2 Stat 518 Sp 08. Ordinary kriging where and kriging variance.
Applications in GIS (Kriging Interpolation)
Method of Soil Analysis 1. 5 Geostatistics Introduction 1. 5
Transforming the data Modified from: Gotelli and Allison Chapter 8; Sokal and Rohlf 2000 Chapter 13.
Lecture II-2: Probability Review
Modern Navigation Thomas Herring
Hydrologic Statistics
Statistical Methods For Engineers ChE 477 (UO Lab) Larry Baxter & Stan Harding Brigham Young University.
Alternative Measures of Risk. The Optimal Risk Measure Desirable Properties for Risk Measure A risk measure maps the whole distribution of one dollar.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Statistical Techniques I EXST7005 Review. Objectives n Develop an understanding and appreciation of Statistical Inference - particularly Hypothesis testing.
Spatial Interpolation of monthly precipitation by Kriging method
Edoardo PIZZOLI, Chiara PICCINI NTTS New Techniques and Technologies for Statistics SPATIAL DATA REPRESENTATION: AN IMPROVEMENT OF STATISTICAL DISSEMINATION.
Gridding Daily Climate Variables for use in ENSEMBLES Malcolm Haylock, Climatic Research Unit Nynke Hofstra, Mark New, Phil Jones.
Geo597 Geostatistics Ch9 Random Function Models.
Geographic Information Science
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
CS 782 – Machine Learning Lecture 4 Linear Models for Classification  Probabilistic generative models  Probabilistic discriminative models.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Statistical Methods II&III: Confidence Intervals ChE 477 (UO Lab) Lecture 5 Larry Baxter, William Hecker, & Ron Terry Brigham Young University.
Spatial Interpolation III
Ping Zhu, AHC5 234, Office Hours: M/W/F 10AM - 12 PM, or by appointment M/W/F,
Spatial Interpolation Chapter 13. Introduction Land surface in Chapter 13 Land surface in Chapter 13 Also a non-existing surface, but visualized as a.
Chapter 16 Data Analysis: Testing for Associations.
9.3 and 9.4 The Spatial Model And Spatial Prediction and the Kriging Paradigm.
Spatial Analysis & Geostatistics Methods of Interpolation Linear interpolation using an equation to compute z at any point on a triangle.
5-1 ANSYS, Inc. Proprietary © 2009 ANSYS, Inc. All rights reserved. May 28, 2009 Inventory # Chapter 5 Six Sigma.
Interpolation and evaluation of probable Maximum Precipitation (PMP) patterns using different methods by: tarun gill.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Lab for Remote Sensing Hydrology and Spatial Modeling Dept of Bioenvironmental Systems Engineering National Taiwan University 1/45 GEOSTATISTICS INTRODUCTION.
1 HETEROSCEDASTICITY: WEIGHTED AND LOGARITHMIC REGRESSIONS This sequence presents two methods for dealing with the problem of heteroscedasticity. We will.
Stochastic Hydrology Random Field Simulation Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
Spatial Point Processes Eric Feigelson Institut d’Astrophysique April 2014.
G. Cowan Lectures on Statistical Data Analysis Lecture 9 page 1 Statistical Data Analysis: Lecture 9 1Probability, Bayes’ theorem 2Random variables and.
Goal of Stochastic Hydrology Develop analytical tools to systematically deal with uncertainty and spatial variability in hydrologic systems Examples of.
Density Estimation in R Ha Le and Nikolaos Sarafianos COSC 7362 – Advanced Machine Learning Professor: Dr. Christoph F. Eick 1.
Application of Probability Density Function - Optimal Interpolation in Hourly Gauge-Satellite Merged Precipitation Analysis over China Yan Shen, Yang Pan,
Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.
Fundamentals of Data Analysis Lecture 11 Methods of parametric estimation.
CWR 6536 Stochastic Subsurface Hydrology Optimal Estimation of Hydrologic Parameters.
SUR-2250 Error Theory.
RAINGUAGE NETWORK DESIGN
12. Principles of Parameter Estimation
Ch3: Model Building through Regression
Inference for Geostatistical Data: Kriging for Spatial Interpolation
Stochastic Hydrology Random Field Simulation
Concepts and Applications of Kriging
12. Principles of Parameter Estimation
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

Data Transformation and Uncertainty in Geostatistical Combination of Radar and Rain Gauges CSI 991 Dan Basinger

Abstract Quantitative Precipitation Estimates (QPE) can be performed using methods from geostatistics. Estimates are currently made by combining radar and rain gauge data. Problems arise, however, because of skewed and heteroscedastic nature of rainfall. Trans-Gaussian kriging techniques have been applied to correct for these problems.

Abstract (Contnued) Utilizing Kriging with External Drift (KED) in concert with the Box-Cox Power Transform has been applied to combined radar and rain gauge data.. Hourly precipitation data was collected in several places in Switzerland. KED measurements were performed with and without transformation.

Abstract (Continued) The effect of the transformation was examined in terms of accuracy of both point and probabilistic estimates. As a result, data transformation via kriging improved the analysis of the samples vis-à-vis the model Problems persist, however, in the case of dry gauges. In general, caution is emphasized in overutilizing the transformation due to positive bias.

Introduction Real-time radar-gauge combinations have been analyzed using different methodologies. – Interpolation with deterministic weights – Interpolation with splines – Objective analysis – Bayesian conditioning – For this study, a stochastic interpolation algorithm known as kriging is utilized.

Kriging for Geostatistical Analysis Variations of kriging have been used in the past for radar-rainfall gauge combinations. – Basic kriging – Kriging with external drift (which is utilized in this study) – Cokriging – Indicator kriging The choice of these methods can vary depending on the study scenario. Kriging with external drift (KED) has been selected for this study. The spatial covariance is derived from the data itself.

Kriging for Geostatistical Analysis (Continued) Kriging applied to radar-gauge combinations has resulted in improved point estimates. Probabilistic precipitation estimates have been generated in the form of a probability density function (PDF). – These probabilistic estimates drive the stochastic simulation of QPE estimates. – These in turn would find application in future hydrological forecasts.

Kriging for Geostatistical Analysis (Continued) The problem with geostatistical radar-gauge combinations is the fact that the underlying stochastic structure is based on the Gaussian distribution and the stationarity of: – Mean – Spatial covariance Rainfall is scattered in terms of wet and dry regions and thus, results in a skewed distribution. The Box-Cox Power Transform has been added to correct for problems related to this skewness.

Trans-Gaussian Kriging with the Box-Cox Transform Kriging with a power transform such as the Box-Cox method is called trans-Gaussian kriging. Similar applications include: – Mapping of hake distributions – Mapping of lead magnitudes in soils For radar rainfall gauge combinations throughout Switzerland, trans-Gaussian kriging is utilized to account for the non-Gaussian nature of the data. Kriging with external drift (KED) techniques are performed on the data and compared with results when no transform is applied. Hourly rainfall measurements from radar and rain gauges are utilized in this study.

Combination of Radar and Rainfall Gauges Analysis methods are based on the concept that precipitation data are perceived as a singular realization of a multivariate random variable. This variable is assumed to have a Gaussian distribution. – Mean also known as the drift of trend [deterministic] – Covariance function which describes the spatial dependence between the residuals [random] This distribution for precipitation generates a probabilistic estimate of the variable at any location.

Combination of Radar and Rainfall Gauges (Continued) The kriging process computes the expected value also known as the point estimate as well as the variance which is the prediction uncertainty within the distribution, Kriging can produce maps of the following: – Point estimates – Quartiles of the probabilistic estimates – Uncertainty estimates – Simulated realizations based on the observations

Combination of Radar and Rainfall Gauges (Continued) On this study, kriging with external drift (KED) has been selected. Mathematically, we have: Where Prec is precipitation with i,j being indices, alpha and beta are the trend parameters, the intercept and radar coefficient, and Z is a random process modeling the deviations of precipitation from the scaled radar field (Page 1334)

Combination of Radar and Rainfall Gauges (Continued) The data is collected at 75 gauge stations throughout Switzerland. Most of the time, a large fraction of these stations report dry conditions. The method of Restricted Maximum Likelihood (REML) is utilized to estimate trend and covariance parameters in the context of a relatively limited sample size.

Combination of Radar and Rainfall Gauges (Continued) Every data pair (radar and rainfall gauge) is examined individually. The radar- rainfall gauge matching is performed using the Nearest Neighbor Algorithm. – The radar value at a particular gauge location is derived from the radar pixel (1 km X 1 km). – This experiment is performed in such a way so as to decrease inconsistencies in spatial sampling between radar and rainfall gauges.

Application of Kriging with External Drift (KED) KED presupposes the random-process stochastic Z (i,j) to be Gaussian and stationary. – Constant Mean – Constant Variance – Constant Covariance As it turns out, the deviations from the radar- rainfall gauge relation are not Gaussian, but rather, positively skewed. – Precipitation values are truncated to 0 – Variance of the deviations is not constant, but increase with the precipitation values.

Application of Trans-Gaussian Kriging The algorithm utilized to correct for the deviations is trans-Gaussian kriging. – The method is used to transform the variable along with the covariates with a monotonic function. – The requirements for the Gaussian distribution are better fulfilled. Kriging is then performed on these transformed values. A back transformation is subsequently performed on the predictions resulting from the kriging process.

The Box-Cox Power Transform A useful set of transformations is the Box-Cox Power Transform. It is applicable to nonnegative data and has the format: Where Y and Y* are the original and transformed variables and λ is the transformation parameter. (Page 1334)

The Box-Cox Power Transform (Continued) Values of λ > 0 and λ < 1 are useful in correction for positive skewness. The case in which λ = 0 is not useful because of the preponderance of dry conditions. For positive values of λ, the minimum for Y* is -1/λ. For trans-Gaussian KED, this Box-Cox Power Transform is applied to both radar and rainfall gauge data. This is intended to correct for the skewness within both variables.

The Box-Cox Power Transform (Continued) The same value for λ is used for both sets of variables. Application of KED on the transformed data generates a probabilistic estimate at each location. – This yields a genuine Gaussian distribution with a mean value given by the point estimate. – This also yields a new variance corresponding to the kriging variance. The new Gaussian distribution is subsequently subjected to the Inverse Box-Cox Transform in order to obtain the probabilistic estimates from the original untransformed data.

The Box-Cox Power Transform (Continued) The point estimate for the precipitation values is approximated by the mean of the back- transformed quantiles. In the case of the lowest of the quantiles, the transformed values are set to -1/λ. Physically, these correspond to the dry precipitation cases. These quantiles are subsequently set to 0.

The Box-Cox Power Transform (Continued) The KED process is performed with four settings of L in the Box-Cox Transform. – If λ = 1, we have the classical application without transformation. – If λ = 0.5, we have the square root transformation which has been determined to be very suitable for radar-rainfall gauge combinations. – If λ = 0.1, we have an expression which is very close to a logarithmic transformation and not very suitable for radar-rainfall gauge combinations. – In a fourth case, we set λ = λ which is estimated case by case individually. e

The Box-Cox Power Transform (Continued) The fourth case deals with an estimate that turns out to work out very well to hourly precipitation values. There are several criteria that drive the selection of a value for case-dependent λ : – Gaussian distribution of residuals e – Relative linearity of the radar and rainfall gauge relationship.

The Box-Cox Power Transform (Continued) Applications of the Box-Cox transform over the course of a year have yielded values of 0.2 to 0.53 for λ. e Estimates of λ vary only slightly between seasons. e Values of λ vary more distinctly with rainfall variances e

The Box-Cox Power Transform (Continued) Large values of λ (> 0.3) usually occur in hours e of even and widespread rainfall. Small values of λ ( < 0.22) usually occur in e relatively limited and/or variable precipitation patterns. All general calculations are performed in “R” and the Box-Cox algorithm is performed in the “geoR” package.

Evaluation of the Methodology Several parameters are calculated in this study to access the quality of the point estimate. The ratio of total estimated to measured water amounts (RTW) measures the overall bias. The parameter called SCATTER is a measure of the error spread at wet locations. The Hansen-Kuipers Discriminant (HK) is utilized to measure the ability to differentiate between wet and dry areas.

Evaluation of the Methodology (Continued) The gauge measurements are compared with the probability density function applying the concept of a probability integral transform. We then calculate the frequency of observations that fall in between predefined quantiles. The Ratio of Observed to Expected frequencies based on interquantile bins measures the reliability of the kriging algorithm.

Evaluation of the Methodology (Continued) The Stable Equitable Error in Probability Space (SEEPS) is a triple category contingency value measuring the correspondence of dry, light, and intense rainfall conditions. Light rainfall is defined as the lower 67 % Intense rainfall is defined as the upper 33 %. The mean root transformed error is calculated as: n 2 MRTE= (1/n)Sum (SQRT(pred)–SQRT(obs) ) (Page 1336) i = 1 i i where “pred” refers to the point estimate “obs” refers to the gauge measurement.

Data Sources The source of the data for the trans-Gaussian radar and rainfall gauge measurements is hourly rainfall over the whole of Switzerland. A radar composite of hourly accumulated rainfall (in mm) covering the whole of Switzerland is placed on a grid space of 1 km X 1km. Rainfall gauge data is based on hourly accumulated rainfall with stations distributed evenly over the country.

Results Trans-Gaussian kriging has been utilized in the analysis of combined radar and rainfall gauge data. This was based on the hypothesis that data transformation can correct for the non-Gaussian nature of the data distributions. Kriging with external drift (KED) in conjunction with the Box-Cox Power Transform was utilized to modify the data distribution.

Results (Continued) Many improvements were observed when KED was utilized to generate a proper transformation as compared with the untransformed data. The underlying assumption of a Gaussian residual distribution along with relative stationarity of residual variance was better satisfied. Transformation changes the probabilistic estimate into a positively skewed probability distribution function. This properly scales the precipitation quantities correcting for unrealistic dependencies from KED without the Box-Cox Transform.

Results (Continued) KED without the Box-Cox Transform yielded rainfall distributions very similar to those from a parameter with exponents between 0.2 and 0.5. External transformations close to the logarithmic function (Box-Cox parameters less than 0.2) tended to produce a positive bias. Thus, in the overall analysis, it is important to avoid transformations close to the logarithmic. Other problems that the transformation has yet to rectify: – Cases with a high number of zeroes – Cases involving heteroscedasticity Cases in which the square root transform was derived produced good results.

Results (Continued) Overall, the best reliability for the probabilistic estimate was found in those case-dependent values of the transformation parameter λ. These resulted in optimization of Gaussian residuals. Case-by-case estimates can be generated on alternative criteria such as the linearity of the radar rainfall gauge relationship.

Conclusions Utilizing data transformations in the context of geostatistical applications is one way of analyzing the complex nature of rainfall data. Also there are currently limitations to this approach. These limitations can be potentially overcome by extending the trans-Gaussian kriging process by introducing additional terms such as: – Radar Quality – Topography – Precipitation Patterns and Structure

Selected References Box, G. E. P. and Cox, D.R. 1964: “An Analysis of Transformations” J. Roy. Stat. Soc. 26A: Diggle, P.J. and Ribeiro, P.J Model-Based Geostatistics Springer-Verlag Haberland, U “Geostatistical Interpolation of Hourly Precipitation from Rain Gauges and Radar from a Large-Scale Extreme Rainfall Event” J. Hydrology 332:

Selected References (Continued) Hoel, P.G., Port, S.C., Stone, C.J Introduction to Probability Theory Houghton Mifflin Schabenberger, O. and Gotway, C.A Statistical Methods for Spatial Data Analysis Chapman and Hall/CRC

Selected References (Continued) Schneider, S. and Steinacker, R “Utilization of Radar Information to Refine Precipitation Fields by a Variational Approach” Journal of Meteor.Atmos. Physics Sinclair, S. and Pegram, G “Combining Radar and Rain Gauge Rainfall Estimates Using Conditional Merging” Atmospheric Science Letters 6: 19-22