Applied Geostatistics

Slides:



Advertisements
Similar presentations
Spatial point patterns and Geostatistics an introduction
Advertisements

Spatial point patterns and Geostatistics an introduction
Kin 304 Regression Linear Regression Least Sum of Squares
Correlation and regression
Statistical Techniques I EXST7005 Simple Linear Regression.
Forecasting Using the Simple Linear Regression Model and Correlation
Correlation and Regression
AP Statistics Chapters 3 & 4 Measuring Relationships Between 2 Variables.
Basic geostatistics Austin Troy.
Spatial Autocorrelation Basics NR 245 Austin Troy University of Vermont.
Spatial Interpolation
Correlation and Autocorrelation
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #18.
Deterministic Solutions Geostatistical Solutions
Regionalized Variables take on values according to spatial location. Given: Where: A “structural” coarse scale forcing or trend A random” Local spatial.
Deterministic Solutions Geostatistical Solutions
Basics: Notation: Sum:. PARAMETERS MEAN: Sample Variance: Standard Deviation: * the statistical average * the central tendency * the spread of the values.
SA basics Lack of independence for nearby obs
REGRESSION AND CORRELATION
Ordinary Kriging Process in ArcGIS
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
Business Statistics - QBM117 Statistical inference for regression.
Applications in GIS (Kriging Interpolation)
Correlation and Regression Analysis
Method of Soil Analysis 1. 5 Geostatistics Introduction 1. 5
Introduction to Regression Analysis, Chapter 13,
Correlation & Regression
Correlation and Regression
Regression and Correlation Methods Judy Zhong Ph.D.
Introduction to Linear Regression and Correlation Analysis
Inference for regression - Simple linear regression
EXPLORING SPATIAL CORRELATION IN RIVERS by Joshua French.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Spatial Interpolation of monthly precipitation by Kriging method
Using ESRI ArcGIS 9.3 Spatial Analyst
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Quantitative Skills 1: Graphing
Basic geostatistics Austin Troy.
Chapter 12 Examining Relationships in Quantitative Research Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Ecosystems are: Hierarchically structured, Metastable, Far from equilibrium Spatial Relationships Theoretical Framework: “An Introduction to Applied Geostatistics“,
The Examination of Residuals. Examination of Residuals The fitting of models to data is done using an iterative approach. The first step is to fit a simple.
Explorations in Geostatistical Simulation Deven Barnett Spring 2010.
Geographic Information Science
Geo479/579: Geostatistics Ch16. Modeling the Sample Variogram.
Examining Relationships in Quantitative Research
Spatial Statistics in Ecology: Continuous Data Lecture Three.
GEOSTATISICAL ANALYSIS Course: Special Topics in Remote Sensing & GIS Mirza Muhammad Waqar Contact: EXT:2257.
The Semivariogram in Remote Sensing: An Introduction P. J. Curran, Remote Sensing of Environment 24: (1988). Presented by Dahl Winters Geog 577,
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Spatial Analysis & Geostatistics Methods of Interpolation Linear interpolation using an equation to compute z at any point on a triangle.
Week 21 Stochastic Process - Introduction Stochastic processes are processes that proceed randomly in time. Rather than consider fixed random variables.
Geo479/579: Geostatistics Ch4. Spatial Description.
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
Examining Relationships in Quantitative Research
Lab for Remote Sensing Hydrology and Spatial Modeling Dept of Bioenvironmental Systems Engineering National Taiwan University 1/90 GEOSTATISTICS VARIOGRAM.
Lecture 6: Point Interpolation
Geo479/579: Geostatistics Ch7. Spatial Continuity.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Geostatistics GLY 560: GIS for Earth Scientists. 2/22/2016UB Geology GLY560: GIS Introduction Premise: One cannot obtain error-free estimates of unknowns.
BPS - 5th Ed. Chapter 231 Inference for Regression.
INTERPOLATION Procedure to predict values of attributes at unsampled points within the region sampled Why?Examples: -Can not measure all locations: - temperature.
Synthesis.
Spatial statistics: Spatial Autocorrelation
Simple Linear Regression - Introduction
Spatial Analysis Longley et al..
CHAPTER 29: Multiple Regression*
Basic Practice of Statistics - 3rd Edition Inference for Regression
Concepts and Applications of Kriging
Presentation transcript:

Applied Geostatistics Notes on Introduction to Spatial Autocorrelation Week 4 – April 18, 2005 ---------------------------------------------------------------- Geostatistical methods were developed for interpreting data that varies continuously over a predefined, fixed spatial region. The study of geostatistics assumes that at least some of the spatial variation observed for natural phenomena can be modeled by random processes with spatial autocorrelation. Geostatistics is based on the theory of regionalized variables, variable distributed in space (or time). Geostatiscal theory supports that any measurement of regionalozed variables can be viewed as a realization of a random function (or random process, or random field, or stochastic process)

Describing spatial structure is useful for: Geostatisticals techniques are designed to evaluate the spatial structure of a variable, or the relationship between a value measured at a point in one place, versus a value from another point measured a certain distance away. Describing spatial structure is useful for: Indicating intensity of pattern and the scale at which that pattern is exposed Interpolating to predict values at unmeasured points across the domain (e.g. kriging) Assessing independence of variables before applying parametric tests of significance

+ Regionalized Variable Zi Function Z in domain D = a set of space dependent values + Z1 Z3 Zn Z4 Zi Z2 Histogram of samples zi Z(x) Z(x+h) Variables are spatially correlated, Therefore: Z(x+h) can be estimated from Z(x) by using a regression model. ** This assumption holds true with a recognized increased in error, from other lest square models. Cov(Z(x),Z(x+h))

Regionalized Variable Regionalized Variables take on values according to spatial location. Given a variable z, measured at a location i , the variability in z can be broken down into three components: Usually removed by detrending Where: A “structural” coarse scale forcing or trend What we are interested in A random” Local spatial dependency error variance (considered normally distributed) Coarse scale forcing or trends can be removed by fitting a surface to the trend using regression and then working with regression residuals

As θ decreases, a/b goes to 0 Correlation: calculated using the geometric rather than the arithmetric mean Two distributions One distribution θ2 ) a a ) θ ) θ1 b b As θ decreases, a/b goes to 0 data deviates X 1 3 2 : µ Y 3 2 5 : µ x y x·y x2 y2 If you were to calculate correlation by hand …. You would produce these Terms. Deviations Sum of squares Product of Deviations

Spatial Autocorrelation A variable is thought to be autocorrelated if it is possible to predict its value at a given location, by knowing its value at other nearby locations. Positive autocorrelation: Negative autocorrelation: No autocorrelation: Autocorrelation is evaluated using structure functions that assess the spatial structure or dependency of the variable. Two of these functions are autocorrelation and semivariance which are graphed as a correlogram and semivariogram, respectively. Both functions plot the spatial dependence of the variable against the spatial separation or lag distance.

Moran I (1950) A cross-product statistic that is used to describe autocorrelation Compares value of a variable at one location with values at all other locations Where: n is the number of pairs Zi is the deviation from the mean for value at location i (i.e., Zi = yi – y for variable y) Zj is the deviation from the mean for value at location j (i.e., Zj = yj – y for variable y) wij is an indicator function or weight at distance d (e.g. wij = 1, if j is in distance class d from point i, otherwise = 0) Wij is the sum of all weights (number of pairs in distance class) The numerator is a covariance (cross-product) term; the denominator is a variance term. Values range from [-1, 1] Value = 1 : Perfect positive correlation Value = -1: Perfect negative correlation

The numerator in this equation is a distance term that gets squared. GEARY C (1954) A squared difference statistic for assessing spatial autocorrelation Considers differences in values between pairs of observations, rather than the covariation between the pairs (Moran I) The numerator in this equation is a distance term that gets squared. The Geary C statistic is more sensitive to extreme values & clustering than the Moran I. Values range from [0,3] Value = 1 : No autocorrelation Value = 0 : Positive autocorrelation Value > 1 : Negative autocorrelation

Semivariance The geostatistical measure that describes the rate of change of the regionalized variable is known as the semivariance. Semivariance is used for descriptive analysis where the spatial structure of the data is investigated using the semivariogram and for predictive applications where the semivariogram is fitted to a theoretical model, parameterized, and used to predict the regionalized variable at other non-measured points (kriging). Where : j is a point at distance d from i nd is the number of points in that distance class (i.e., the sum of the weights wij for that distance class) wij is an indicator function set to 1 if the pair of points is within the distance class.

A semivariogram is a plot of the structure function that, like autocorrelation, describes the relationship between measurements taken some distance apart. Semivariograms define the range or distance over which spatial dependence exists. The nugget is the semivariance at a distance 0.0, (the y –intercept) The sill is the value at which the semivariogram levels off (its asymptotic value) The range is the distance at which the semivariogram levels off (the spatial extent of structure in the data)

Autocorrelation assumes stationarity, meaning that the spatial structure of the variable is consistent over the entire domain of the dataset. The stationarity of interest is second-order (weak) stationarity, requiring that: the mean is constant over the region variance is constant and finite; and covariance depends only on between-sample spacing In many cases this is not true because of larger trends in the data In these cases, the data are often detrended before analysis. One way to detrend data is to fit a regression to the trend, and use only the residuals for autocorrelation analysis

Anistotropy Autocorrelation also assumes isotropy, meaning that the spatial structure of the variable is consistent in all directions. Often this is not the case, and the variable exhibits anisotropy, meaning that there is a direction-dependent trend in the data. If a variable exhibits different ranges in different directions, then there is a geometric anisotropy. For example, in a dune deposit, permeability might have a larger range in the wind direction compared to the range perpendicular to the wind direction. If the variable exhibits different sills in different directions, then there is a zonal anisotropy. For example, a variogram in a vertical wellbore typically shows a bigger sill than a variogram in the horizontal direction. Some variograms are a combination of both geometric and zonal anisotropies.

Assumes no sill or range For predictions, the empirical semivariogram is converted to a theoretic one by fitting a statistical model (curve) to describe its range, sill, & nugget. There are four common models used to fit semivariograms: Assumes no sill or range Linear: Exponential: Spherical: Where: c0 = nugget b = regression slope a = range c0+ c = sill Gaussian:

Variogram Modeling Suggestions Check for enough number of pairs at each lag distance (from 30 to 50). Removal of outliers Truncate at half the maximum lag distance to ensure enough pairs Use a larger lag tolerance to get more pairs and a smoother variogram Start with an omnidirectional variogram before trying directional variograms Use other variogram measures to take into account lag means and variances (e.g., inverted covariance, correlogram, or relative variograms) Use transforms of the data for skewed distributions (e.g. logarithmic transforms). Use the mean absolute difference or median absolute difference to derive the range