Bayesian kriging Instead of estimating the parameters, we put a prior distribution on them, and update the distribution using the data. Model: Prior: Posterior:

Slides:

Advertisements

Similar presentations

Bayesian Learning & Estimation Theory

Advertisements

Pattern Recognition and Machine Learning

Pattern Recognition and Machine Learning

Spatial point patterns and Geostatistics an introduction

Spatial point patterns and Geostatistics an introduction

Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.

Pattern Recognition and Machine Learning

Basic geostatistics Austin Troy.

Visual Recognition Tutorial

Classification and risk prediction

Meet the professor Friday, January 23 at SFU 4:30 Beer and snacks reception.

G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem, random variables, pdfs 2Functions.

Nonstationary covariance structures II NRCSE. Drawbacks with deformation approach Oversmoothing of large areas Local deformations not local part of global.

Some remarks about Lab 1. image(parana.krige,val=sqrt(parana.kri ge$krige.var)) contour(parana.krige,loc=loci,add=T)

Nonstationary covariance structure I: Deformations.

Spatial Statistics IV Stat 518 Sp08. Recall Method of moments: square of all pairwise differences, smoothed over lag bins Problems: Not necessarily a.

Applied Geostatistics

Machine Learning CMPT 726 Simon Fraser University

Machine Learning CUNY Graduate Center Lecture 3: Linear Regression.

Continuity and covariance Recall that for stationary processes so if C is continuous then Z is mean square continuous. To get results about sample paths.

Variograms/Covariances and their estimation

Modelling non-stationarity in space and time for air quality data Peter Guttorp University of Washington NRCSE.

Deterministic Solutions Geostatistical Solutions

7. Least squares 7.1 Method of least squares K. Desch – Statistical methods of data analysis SS10 Another important method to estimate parameters Connection.

Bayesian kriging Instead of estimating the parameters, we put a prior distribution on them, and update the distribution using the data. Model: Matrix with.

Nonstationary covariance structures II NRCSE. Drawbacks with deformation approach Oversmoothing of large areas Local deformations not local part of global.

Visual Recognition Tutorial

Global processes Problems such as global warming require modeling of processes that take place on the globe (an oriented sphere). Optimal prediction of.

Computer vision: models, learning and inference

Spatial Statistics III Stat 518 Sp08. Bayesian kriging Instead of estimating the parameters, we put a prior distribution on them, and update the distribution.

G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.

Stat 592A Spatial Statistical Methods NRCSE.

Statistical Tools for Environmental Problems NRCSE.

Modern Navigation Thomas Herring

Principles of the Global Positioning System Lecture 10 Prof. Thomas Herring Room A;

Binary Variables (1) Coin flipping: heads=1, tails=0 Bernoulli Distribution.

PATTERN RECOGNITION AND MACHINE LEARNING

Using ESRI ArcGIS 9.3 Spatial Analyst

Prof. Dr. S. K. Bhattacharjee Department of Statistics University of Rajshahi.

G. Cowan Lectures on Statistical Data Analysis Lecture 3 page 1 Lecture 3 1 Probability (90 min.) Definition, Bayes’ theorem, probability densities and.

Geographic Information Science

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 3: LINEAR MODELS FOR REGRESSION.

Semivariogram Analysis and Estimation Tanya, Nick Caroline.

Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.

BCS547 Neural Decoding. Population Code Tuning CurvesPattern of activity (r) Direction (deg) Activity

Point Estimation of Parameters and Sampling Distributions Outlines:  Sampling Distributions and the central limit theorem  Point estimation  Methods.

Short course on space-time modeling Instructors: Peter Guttorp Johan Lindström Paul Sampson.

Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.

Maximum likelihood estimators Example: Random data X i drawn from a Poisson distribution with unknown  We want to determine  For any assumed value of.

Geostatistics GLY 560: GIS for Earth Scientists. 2/22/2016UB Geology GLY560: GIS Introduction Premise: One cannot obtain error-free estimates of unknowns.

ESTIMATION METHODS We know how to calculate confidence intervals for estimates of  and  2 Now, we need procedures to calculate  and  2, themselves.

Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”

G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.

Space-time processes NRCSE. Separability Separable covariance structure: Cov(Z(x,t),Z(y,s))=C S (x,y)C T (s,t) Nonseparable alternatives Temporally varying.

CS Statistical Machine learning Lecture 7 Yuan (Alan) Qi Purdue CS Sept Acknowledgement: Sargur Srihari’s slides.

Biointelligence Laboratory, Seoul National University

Probability Theory and Parameter Estimation I

LECTURE 09: BAYESIAN ESTIMATION (Cont.)

Ch3: Model Building through Regression

CH 5: Multivariate Methods

NRCSE 2. Covariances.

Special Topics In Scientific Computing

More about Posterior Distributions

Modelling data and curve fitting

Paul D. Sampson Peter Guttorp

Computing and Statistical Data Analysis / Stat 8

Lecture 3 1 Probability Definition, Bayes’ theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests general.

Parametric Methods Berlin Chen, 2005 References:

Concepts and Applications of Kriging

Probabilistic Surrogate Models

Presentation transcript:

Bayesian kriging Instead of estimating the parameters, we put a prior distribution on them, and update the distribution using the data. Model: Prior: Posterior:

geoR Default covariance model exponential Prior is assigned to  and. The latter assumed zero unless specified. The distributions can be discretized. Default prior on  is flat (if not specified, assumed constant). (Lots of different assignments are possible)

Prior/posterior of 

Variogram estimates mean median mode

Predictive mean

Kriging variances BayesClassical range (52,307) range (170,278)

2. Covariances NRCSE

Valid covariance functions Bochner’s theorem: The class of covariance functions is the class of positive definite functions C: Why?

Spectral representation By the spectral representation any isotropic continuous correlation on R d is of the form By isotropy, the expectation depends only on the distribution G of. Let Y be uniform on the unit sphere. Then

Isotropic correlation J v (u) is a Bessel function of the first kind and order v. Hence and in the case d=2 (Hankel transform)

The Bessel function J 0 –0.403

The exponential correlation A commonly used correlation function is  (v) = e –v/ . Corresponds to a Gaussian process with continuous but not differentiable sample paths. More generally,  (v) = c(v=0) + (1-c)e –v/  has a nugget c, corresponding to measurement error and spatial correlation at small distances. All isotropic correlations are a mixture of a nugget and a continuous isotropic correlation.

The squared exponential Usingyields corresponding to an underlying Gaussian field with analytic paths. This is sometimes called the Gaussian covariance, for no really good reason. A generalization is the power(ed) exponential correlation function,

The spherical Corresponding variogram nugget sillrange

The Matérn class where is a modified Bessel function of the third kind and order . It corresponds to a spatial field with  –1 continuous derivatives  =1/2 is exponential;  =1 is Whittle’s spatial correlation; yields squared exponential.

Some other covariance/variogram families NameCovarianceVariogram Wave Rational quadratic LinearNone Power lawNone

Recall Method of moments: square of all pairwise differences, smoothed over lag bins Problems: Not necessarily a valid variogram Not very robust Estimation of variograms

A robust empirical variogram estimator (Z(x)-Z(y)) 2 is chi-squared for Gaussian data Fourth root is variance stabilizing Cressie and Hawkins:

Least squares Minimize Alternatives: fourth root transformation weighting by 1/  2 generalized least squares

Maximum likelihood Z~N n ( ,  )  =  [  (s i -s j ;  )] =  V(  ) Maximize and  maximizes the profile likelihood

A peculiar ml fit

Some more fits

All together now...

Asymptotics Increasing domain asymptotics: let region of interest grow. Station density stays the same Bad estimation at short distances, but effectively independent blocks far apart Infill asymptotics: let station density grow, keeping region fixed. Good estimates at short distances. No effectively independent blocks, so technically trickier

Stein’s result Covariance functions C 0 and C 1 are compatible if their Gaussian measures are mutually absolutely continuous. Sample at {s i, i=1,...,n}, predict at s (limit point of sampling points). Let e i (n) be kriging prediction error at s for C i, and V 0 the variance under C 0 of some random variable. If lim n V 0 (e 0 (n))=0, then

Global processes Problems such as global warming require modeling of processes that take place on the globe (an oriented sphere). Optimal prediction of quantities such as global mean temperature need models for global covariances. Note: spherical covariances can take values in [-1,1]–not just imbedded in R 3. Also, stationarity and isotropy are identical concepts on the sphere.

Isotropic covariances on the sphere Isotropic covariances on a sphere are of the form where p and q are directions,  pq the angle between them, and P i the Legendre polynomials. Example: a i =(2i+1)  i

Global temperature Global Historical Climatology Network 7280 stations with at least 10 years of data. Subset with 839 stations with data selected.

Isotropic correlations