Introduction to kriging: The Best Linear Unbiased Estimator (BLUE)

Slides:



Advertisements
Similar presentations
Spatial point patterns and Geostatistics an introduction
Advertisements

Spatial point patterns and Geostatistics an introduction
Introduction to Data Assimilation NCEO Data-assimilation training days 5-7 July 2010 Peter Jan van Leeuwen Data Assimilation Research Center (DARC) University.
Introduction to Smoothing and Spatial Regression
Point Estimation Notes of STAT 6205 by Dr. Fan.
Kriging.
Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
STAT 497 APPLIED TIME SERIES ANALYSIS
Sampling Distributions
FOUR METHODS OF ESTIMATING PM 2.5 ANNUAL AVERAGES Yan Liu and Amy Nail Department of Statistics North Carolina State University EPA Office of Air Quality,
Applied Geostatistics
Deterministic Solutions Geostatistical Solutions
Economics 20 - Prof. Anderson
FOUR METHODS OF ESTIMATING PM 2.5 ANNUAL AVERAGES Yan Liu and Amy Nail Department of Statistics North Carolina State University EPA Office of Air Quality,
A) Transformation method (for continuous distributions) U(0,1) : uniform distribution f(x) : arbitrary distribution f(x) dx = U(0,1)(u) du When inverse.
Geostatistics Mike Goodchild. Spatial interpolation n A field –variable is interval/ratio –z = f(x,y) –sampled at a set of points n How to estimate/guess.
Method of Soil Analysis 1. 5 Geostatistics Introduction 1. 5
Lecture II-2: Probability Review
Gaussian process modelling
EXPLORING SPATIAL CORRELATION IN RIVERS by Joshua French.
1 LES of Turbulent Flows: Lecture 1 Supplement (ME EN ) Prof. Rob Stoll Department of Mechanical Engineering University of Utah Fall 2014.
Monte Carlo Simulation CWR 6536 Stochastic Subsurface Hydrology.
Geo479/579: Geostatistics Ch12. Ordinary Kriging (1)
Gridding Daily Climate Variables for use in ENSEMBLES Malcolm Haylock, Climatic Research Unit Nynke Hofstra, Mark New, Phil Jones.
1 G Lect 3b G Lecture 3b Why are means and variances so useful? Recap of random variables and expectations with examples Further consideration.
Geog. 579: GIS and Spatial Analysis - Lecture 21 Overheads 1 Point Estimation: 3. Methods: 3.6 Ordinary Kriging Topics: Lecture 23: Spatial Interpolation.
Geographic Information Science
Geo479/579: Geostatistics Ch16. Modeling the Sample Variogram.
9.3 and 9.4 The Spatial Model And Spatial Prediction and the Kriging Paradigm.
Week 21 Stochastic Process - Introduction Stochastic processes are processes that proceed randomly in time. Rather than consider fixed random variables.
Review of Random Process Theory CWR 6536 Stochastic Subsurface Hydrology.
Geology 5670/6670 Inverse Theory 21 Jan 2015 © A.R. Lowry 2015 Read for Fri 23 Jan: Menke Ch 3 (39-68) Last time: Ordinary Least Squares Inversion Ordinary.
Introduction to kriging: The Best Linear Unbiased Estimator (BLUE) for space/time mapping.
Review of Probability. Important Topics 1 Random Variables and Probability Distributions 2 Expected Values, Mean, and Variance 3 Two Random Variables.
Gaussian Processes For Regression, Classification, and Prediction.
Advanced Tutorial on : Global offset and residual covariance ENVR 468 Prahlad Jat and Marc Serre.
Stochastic Hydrology Random Field Simulation Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
Geostatistics GLY 560: GIS for Earth Scientists. 2/22/2016UB Geology GLY560: GIS Introduction Premise: One cannot obtain error-free estimates of unknowns.
Spatial Point Processes Eric Feigelson Institut d’Astrophysique April 2014.
CWR 6536 Stochastic Subsurface Hydrology
Modeling Space/Time Variability with BMEGUI Prahlad Jat (1) and Marc Serre (1) (1) University of North Carolina at Chapel Hill.
Exposure Prediction and Measurement Error in Air Pollution and Health Studies Lianne Sheppard Adam A. Szpiro, Sun-Young Kim University of Washington CMAS.
CWR 6536 Stochastic Subsurface Hydrology Optimal Estimation of Hydrologic Parameters.
I. Statistical Methods for Genome-Enabled Prediction of Complex Traits OUTLINE THE CHALLENGES OF PREDICTING COMPLEX TRAITS ORDINARY LEAST SQUARES (OLS)
Study of PM10 Annual Arithmetic Mean in USA  Particulate matter is the term for solid or liquid particles found in the air  The smaller particles penetrate.
Stochastic Process - Introduction
Multiple Random Variables and Joint Distributions
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005
STATISTICS POINT ESTIMATION
水分子不時受撞,跳格子(c.p. 車行) 投骰子 (最穩定) 股票 (價格是不穏定,但報酬過程略穩定) 地震的次數 (不穩定)
Probability.
ICS 280 Learning in Graphical Models
Ch3: Model Building through Regression
Modeling Space/Time Variability with BMEGUI
STATISTICS Random Variables and Distribution Functions
Ch9 Random Function Models (II)
Presentation of the project of Yasuyuki Akita Temporal GIS Fall 2004
Review of Probability Theory
CSCI 5822 Probabilistic Models of Human and Machine Learning
Inference for Geostatistical Data: Kriging for Spatial Interpolation
Propagating Uncertainty In POMDP Value Iteration with Gaussian Process
Stochastic Hydrology Random Field Simulation
Generalized Spatial Dirichlet Process Models
Interpolation & Contour Maps
STOCHASTIC HYDROLOGY Random Processes
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005
What are BLUP? and why they are useful?
CS723 - Probability and Stochastic Processes
数据的矩阵描述.
Presentation transcript:

Introduction to kriging: The Best Linear Unbiased Estimator (BLUE) for space/time mapping

Definition of Space Time Random Fields Spatiotemporal Continuum p=(s,t) denotes a location in the space/time domain E=SxT Spatiotemporal Field A field is the distribution c across space/time of some parameter X Space/Time Random Field (S/TRF) A S/TRF is a collection of possible realizations c of the field, X(p)={p, c} The collection of realizations represents the randomness (uncertainty and variability) in X(p) X(p) Space s Time t Realization c(1) X(p) Space s Time t Realization c(2)

Multivariate PDF for the mapping points Defining a S/TRF at a set of mapping points We restrict Space/Time to a set of n mapping points, pmap=(p1,…, pn) Each field realization reduces to a set of n values, xmap= (x1,…, xn) The S/TRF reduces to set of n random variables, Xmap= (X1,…, Xn) The multivariate PDF The multivariate PDF fX characterizes the joint event Xmap≈ xmap as Prob.[xmap< Xmap< xmap+ dxmap] = fX(xmap) dxmap hence the multivariate PDF provides a complete stochastic description of trends and dependencies of the S/TRF X(p) at its mapping points Marginal PDFs The marginal PDF for a subset Xa of Xmap= (Xa, Xb) is fX(xa) = ∫ dxb fX(xa , xb) hence we can define any marginal PDF from fX(xmap)

Statistical moments Stochastic Expectation Mean trend and covariance The stochastic expectation of some function g(X(p), X(p’), …) of the S/TRF is E [g(X(p), X(p’), …)] = ∫ dx1 dx2 ... g(x1, x2 , ...) fX(x1, x2 , ...; p ; p’ , ...) Mean trend and covariance The mean trend mX(p) =E [X(p)] and covariance cX(p, p’) =E [ (X(p)-m(p)) (X(p’)-m(p’)) ] are statistical moments of order 1 and 2, respectively, that characterizes the consistent tendencies and dependencies, respectively, of X(p)

Homogeneous/Stationary S/TRF A homogeneous/stationary S/TRF is defined by A mean trend that is constant over space (homogeneity) and time (stationarity) mX(p) = mX A covariance between point p =(s,t) and p’ =(s’,t’) that is only a function of spatial lag r=||s-s’|| and the temporal lag t = |t-t’| cX(p, p’) = cX ( (s,t), (s’,t’) ) = cX( r=||s-s’|| , t=|t-t’| ) A homogeneous/stationary S/TRFs has the following properties It’s variance is constant, i.e. sX2(p)= sX2 Proof: sX2(p)= E[(X(p)- mX(p))2] = cX(p, p) = cX( r=0, t=0 ) is not a function of p It’s covariance can be written as cX(r , t)= E[X(s,t)X(s’,t’)] ||s-s’|| =r, |t-t’| =t- mX2 ,  This is a useful equation to estimate the covariance

Experimental estimation of covariance When having site-specific data, and assuming that the S/TRF is homogeneous/stationary, then we obtain experimental values for it’s covariance using the following estimator where N(r,t) is the number of pairs of points with values (xhead, xtail) separated by a distance of r and a time of t. In practice we use a tolerance dr and dt, i.e. such that r-dr ≤ ||shead-stail|| ≤ r+dr and t-dt ≤ ||thead-ttail|| ≤ t+dt

Spatial covariance models Gaussian model: cX(r) = co exp-(3r2/ar2) co = sill = variance ar = spatial range Very smooth processes Exponential model: cX(r) = co exp-(3r/ar) more variability Nugget effect model cX(r) = co d(r) purely random Nested models cX(r) = c1(r) + c2(r) + … where c1(r), c2(r), etc. are permissible covariance models Example: Arsenic cX(r) = 0.7sX2 exp-(3r/7Km) + 0.3sX2 exp-(3r/40Km) where the first structure represents variability over short distances (7Km), e.g. geology, the second structure represents variability over longer distances (40Km) e.g. aquifers.

Space/time covariance models cX(r,t) is a 2D function with spatial component cX(r,t=0) and temporal component cX(r=0,t) Space/time separable covariance model cX(r,t) = cXr(r) cXt(t) , where cXr(r) and cXt(t) are permissible models Nested space/time separable models cX(r,t) = cr1(r) ct1(t) + cr2(r)ct2 (t) + … Example: Yearly Particulate Matter concentration (ppm) across the US cX(r,t) = c1 exp(-3r/ar1-3t/at1) + c2 exp(-3r/ar2-3t/at2) 1st structure c1=0.0141(log mg/m3)2, ar1=448 Km, at1=1years is weather driven 2nd structure c1=0.0141(log mg/m3)2, ar1=17 Km, at1=45years due to human activities

The simple kriging (SK) estimator Gather the data xhard=[x1, x2, x3 , …]T and obtain the experimental covariance Fit a covariance model cX(r) to the experimental covariance Simple kriging (SK) is a linear estimator Xk(SK) = l0 + l T Xhard SK is unbiased E[Xk(SK) ] = E[Xk] ═► Xk(SK) = mk +l T (Xhard - mhard) SK minimizes the estimation variance sSK2 = E[(xk - xk(SK) )2] ∂sSK2 / ∂lT = 0 ═► lT = Ck,hard Chard,hard-1 Hence the SK estimator is given by xk(SK) = mk + Ck,hard Chard,hard-1 (xhard. - mhard) T And its variance is sSK2 = sk2 - Ck,hard Chard,hard-1 Chard,k

Example of kriging maps introToKrigingExample.m Run Kriging Example introToKrigingExample.m

Example of kriging maps Observations Only hard data are considered Exactitude property at the data points Kriging estimates tend to the (prior) expected value away from the data points Hence, kriging maps are characterized by “islands” around data points Kriging variance is only a function to the distance from the data points Limitations of kriging Kriging does not provide a rigorous framework to integrate hard and soft data Kriging is a linear combination of data (i.e. it is the “best” only among linear estimators, but it might be a poor estimator compared to non-linear estimators) The estimation variance does not account for the uncertainty in the data itself Kriging assumes that the data is Gaussian, whereas in reality uncertainty may be non-Gaussian Traditionally kriging has been implemented for spatial estimation, and space/time is merely viewed as adding another spatial dimension (this is wrong because it is lacking any explicit space/time metric)