Spatial statistics: Spatial Autocorrelation

Slides:



Advertisements
Similar presentations
Spatial point patterns and Geostatistics an introduction
Advertisements

Spatial autoregressive methods
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Managerial Economics in a Global Economy
Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Statistical Techniques I EXST7005 Simple Linear Regression.
Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
Objectives (BPS chapter 24)
Basic geostatistics Austin Troy.
Spatial Autocorrelation Basics NR 245 Austin Troy University of Vermont.
Correlation and Autocorrelation
Deterministic Solutions Geostatistical Solutions
The Simple Regression Model
Applied Geostatistics
Deterministic Solutions Geostatistical Solutions
SA basics Lack of independence for nearby obs
Why Geography is important.
Business Statistics - QBM117 Statistical inference for regression.
Applications in GIS (Kriging Interpolation)
Correlation and Regression Analysis
Method of Soil Analysis 1. 5 Geostatistics Introduction 1. 5
Introduction to Regression Analysis, Chapter 13,
Simple Linear Regression Analysis
IS415 Geospatial Analytics for Business Intelligence
Correlation & Regression
Introduction to Linear Regression and Correlation Analysis
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Inference for regression - Simple linear regression
Chapter 6 & 7 Linear Regression & Correlation
Spatial Statistics in Ecology: Area Data Lecture Four.
Ecosystems are: Hierarchically structured, Metastable, Far from equilibrium Spatial Relationships Theoretical Framework: “An Introduction to Applied Geostatistics“,
The Examination of Residuals. Examination of Residuals The fitting of models to data is done using an iterative approach. The first step is to fit a simple.
©2006 Thomson/South-Western 1 Chapter 13 – Correlation and Simple Regression Slides prepared by Jeff Heyl Lincoln University ©2006 Thomson/South-Western.
Geographic Information Science
Examining Relationships in Quantitative Research
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Spatial Analysis & Geostatistics Methods of Interpolation Linear interpolation using an equation to compute z at any point on a triangle.
Geo479/579: Geostatistics Ch4. Spatial Description.
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
Exploratory Spatial Data Analysis (ESDA) Analysis through Visualization.
Geo479/579: Geostatistics Ch7. Spatial Continuity.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Statistical methods for real estate data prof. RNDr. Beáta Stehlíková, CSc
INTERPOLATION Procedure to predict values of attributes at unsampled points within the region sampled Why?Examples: -Can not measure all locations: - temperature.
URBDP 422 URBAN AND REGIONAL GEO-SPATIAL ANALYSIS
Why Model? Make predictions or forecasts where we don’t have data.
Synthesis.
Regression and Correlation
Statistical Data Analysis - Lecture /04/03
Correlation and Simple Linear Regression
Chapter 5 Part B: Spatial Autocorrelation and regression modelling.
Chapter 11: Simple Linear Regression
Correlation and Simple Linear Regression
Correlation and Regression
Chapter 14 – Correlation and Simple Regression
Spatial Autocorrelation
Correlation and Simple Linear Regression
Spatial Data Analysis: Intro to Spatial Statistical Concepts
Descriptive and Inferential
Simple Linear Regression
Simple Linear Regression and Correlation
Product moment correlation
SPATIAL ANALYSIS IN MACROECOLOGY
Modeling Spatial Phenomena
REGRESSION ANALYSIS 11/28/2019.
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Spatial statistics: Spatial Autocorrelation Spatial modelling of biological ecosystem properties course: Spatial statistics: Spatial Autocorrelation Andrew K. Skidmore Babak Naimi

What is Spatial Autocorrelation?! Basic Concept What is Spatial Autocorrelation?! Spatial Autocorrelation?!!! It ‘s too complicated….

What is Spatial Autocorrelation?! Basic Concept What is Spatial Autocorrelation?! Correlation is a statistical relationship between two variables X Y 12 1 78 8 24 3 .

What is Spatial Autocorrelation?! Basic Concept What is Spatial Autocorrelation?! The correlation of values in a variable with itself !! Similarity between observations as a function of time Similarity in Time

Basic Concept What is Spatial Autocorrelation?! Similarity in Space Similarity between observations as a function of Space / Distance

Spatial Autocorrelation Waldo Tobler First Law of Geography “Everything is related to everything else, but near things are more related than distant things.” Waldo Tobler (1970)

Spatial Autocorrelation - Examples Soil Nitrogen Source: http://www.unl.edu/nac/atlas/Map_Html/Stable_and_Productive_Soils/National/Kjeldahl_Soil_Nitrogen/Kjeldahl_Soil_Nitrogen.htm Soil Phosphors, Potassium and PH for a farm

Spatial Autocorrelation - Examples

How to measure ? Moran’s I Geary’s C Semi-Variance By measuring similarity/dissimilarity for pairs of observations, it can be quantified as a function of distance Moran’s I An index of similarity (ranges between -1 and 1) An index of dissimilarity (ranges between 0 and 2) Geary’s C Semi-Variance An index of dissimilarity

Values at locations of i and j (pair samples) How to measure ? By Measuring Similarity/dissimilarity for pairs of observations, it can be quantified as a function of distance Moran’s I Values at locations of i and j (pair samples) Average of values Number of samples Specifies whether the pair samples of i and j is within a given distance Values range from [-1, 1] Value = 1 : Perfect positive correlation Value = -1: Perfect negative correlation

How to measure ? Geary’s C By Measuring Similarity/dissimilarity for pairs of observations, it can be quantified as a function of distance Geary’s C Sum of wij It measure differences of values for pairs of observations, while Moran’s I measures co-variation Moran’s I is inversely related to Geary’s C but not identical It ranges between [0,2] Value < 1 : Positive autocorrelation Value = 1 : No autocorrelation Value > 1 : Negative autocorrelation

Don’t worry! You will find later what it is  How to measure ? By Measuring Similarity/dissimilarity for pairs of observations, it can be quantified as a function of distance Semi-Variance Number of samples in distance class (lag) ? Don’t worry! You will find later what it is 

How to measure ? By Measuring Similarity/dissimilarity for pairs of observations, it can be quantified as a function of distance Xi : Value of variable at location (i) Site j Xj : Value of variable at location (j) Distance Dij : Distance between locations (i) and (j) Site i - So, How many pairs is there?!! - What if the sample size is large?!!

How to measure ? By Measuring Similarity/dissimilarity for pairs of observations, it can be quantified as a function of distance Usually, the number of possible pairs is very large, therefore: A lag parameter can be specified. It is a distance interval in which the similarity/dissimilarity of all existing pairs are averaged to simplify the calculation 2 km A cutoff distance threshold can be specified up to which point pairs are included in similarity/dissimilarity measurement

Plotting Similarity/Dissimilarity measures against Distance How to measure ? Spatial structure in study area: Plotting Similarity/Dissimilarity measures against Distance Correlogram Plotting Moran’s I against Distance Semi-Variogram Plotting Semi-Variance against Distance Source: Webster & Oliver (2007), Geostatistics for Environmental Statistics

How to measure ? Spatial structure in study area: Variogram + Lag = 2 km Variogram cloud Variogram + Variogram model 2 4 6 8 Distance Semi-variance

How to measure ? Spatial structure in study area: Semi-Variogram Sill: maximum semi-variance; represents variability in the absence of spatial dependence Range: separation between point-pairs at which the sill is reached; distance at which there is no evidence of spatial dependence Nugget: semi-variance as the separation approaches zero; represents variability at a point that can’t be explained by spatial structure. Semi-variance Lag (Distance) Sill Nugget Range

Spatial structure-Examples Which one is more Spatially Autocorrelated?

Spatial structure-Examples

How to measure ? Spatial structure in study area: Correlogram Positive Autocorrelation in short distances Moran’s I Null Autocorrelation in long distances Distance

Spatial structure-Examples

Stationary assumption Keep in mind…! Stationary assumption Autocorrelation assumes stationary, meaning that the spatial structure of the variable is consistent over the entire field, requiring that: the mean is constant over the region variance is constant and finite; and covariance depends only on between-sample spacing In many cases this is not true because of larger trends in the data In these cases, the data are often de-trended before analysis. Stratify the data into homogenous areas Fit a regression to the trend, and use only the residuals for autocorrelation analysis

Isotropy/Anistotropy Keep in mind…! Isotropy/Anistotropy Autocorrelation also assumes isotropy, meaning that the spatial structure of the variable is consistent in all directions Often this is not the case, and the variable exhibits anisotropy, meaning that there is a direction-dependent trend in the data. If a variable exhibits different ranges in different directions, then there is a geometric anisotropy. For example, in a dune deposit, permeability might have a larger range in the wind direction compared to the range perpendicular to the wind direction. If the variable exhibits different sills in different directions, then there is a zonal anisotropy. For example, a variogram in a vertical wellbore typically shows a bigger sill than a variogram in the horizontal direction.

Spatial Autocorrelation- Enemy or Friend?! many of the standard techniques and methods documented in standard statistics textbooks have significant problems when we try to apply them to the analysis of the spatial distributions. Why ?!

Spatial Autocorrelation- Enemy or Friend?! It is assumed by OLS that residuals are Normally distributed without any structure, i.e. they are independent Yi=0+1Xi +i Y Intercept (a) slope (b) Least squares (OLS) Positive spatial autocorrelation violates this, because samples taken from nearby areas are related to each other and are not independent X

Dealing with SA Sampling design: When sampling pattern tends to be clustered, it is likely that residuals become spatially autocorrelated!

Dealing with SA Ripley’s K Can be used to check if the pattern is homogenous. Ripley's K function detects deviations from spatial homogeneity where dij is the Euclidean distance between the ith and jth points in a data set of n points, and λ is the average density of points, generally estimated as n/A, where A is the area of the region containing all points. If the points are approximately homogeneous, the estimated value of Ripley’s K  should be approximately equal to πs2.

Measure spatial structure in residuals! Dealing with SA Test spatial autocorrelation for regression: Measure spatial structure in residuals! Are the residuals spatially autocorrelated? No, you’re safe! Yes, nonspatial models are not reliable!

Dealing with SA Traditional Statistics Spatial Statistics Mean, StDev (Normal Curve) Central Tendency Typical Response (scalar) Minimum= 5.4 ppm Maximum= 103.0 ppm Mean= 22.4 ppm StDEV= 15.5 Spatial Statistics Map of the Variance (gradient) Spatial Distribution Numerical Spatial Relationships (Surface)

Dealing with SA

Dealing with SA Autocovariate regression Different techniques are developed as spatial statistical modeling: Autocovariate regression Generalized Least Squares (GLS) Simultaneous Autoregressive Model (SAR) Conditional Autoregressive Model (CAR) Generalized Linear Mixed Model (GLMM) Generalized Estimating Equations (GEE)

Dealing with SA Autocovariate regression (Autoregression) : The idea of this techniques is to add a new predictor (the autocovariate) given by: y = Wy + X +  W models neighborhood relationships  models strength of spatial dependencies The Y values are a function of all other Ys value, “weighted” by the relationship in W matrix (i.e., geographic distances) Y1 = Y2*W12+Y3*W13+Y4*W14+...Yn*W1n