Spatial Autocorrelation Basics NR 245 Austin Troy University of Vermont.

Slides:



Advertisements
Similar presentations
Chapter 3 Properties of Random Variables
Advertisements

Spatial autoregressive methods
Test of (µ 1 – µ 2 ),  1 =  2, Populations Normal Test Statistic and df = n 1 + n 2 – 2 2– )1– 2 ( 2 1 )1– 1 ( 2 where ] 2 – 1 [–
Managerial Economics in a Global Economy
Lesson 10: Linear Regression and Correlation
CORRELATION. Overview of Correlation u What is a Correlation? u Correlation Coefficients u Coefficient of Determination u Test for Significance u Correlation.
The Multiple Regression Model.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Spatial Autocorrelation using GIS
1 SSS II Lecture 1: Correlation and Regression Graduate School 2008/2009 Social Science Statistics II Gwilym Pryce
Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
Spatial statistics Lecture 3.
Introduction to Applied Spatial Econometrics Attila Varga DIMETIC Pécs, July 3, 2009.
Basic geostatistics Austin Troy.
Local Measures of Spatial Autocorrelation
Workshop 2: Spatial scale and dependence in biogeographical patterns Objective: introduce the following fundamental concepts on spatial data analysis:
CORRELATION. Overview of Correlation u What is a Correlation? u Correlation Coefficients u Coefficient of Determination u Test for Significance u Correlation.
Review of Basics. REVIEW OF BASICS PART I Measurement Descriptive Statistics Frequency Distributions.
ANOVA notes NR 245 Austin Troy
Correlation and Autocorrelation
PSY 307 – Statistics for the Behavioral Sciences
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
Spatial autoregressive methods Nr245 Austin Troy Based on Spatial Analysis by Fortin and Dale, Chapter 5.
Applied Geostatistics Geostatistical techniques are designed to evaluate the spatial structure of a variable, or the relationship between a value measured.
Topic 2: Statistical Concepts and Market Returns
Applied Geostatistics
SA basics Lack of independence for nearby obs
Chapter 2 Simple Comparative Experiments
Why Geography is important.
Linear Regression Example Data
Analysis of Variance & Multivariate Analysis of Variance
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Introduction to Regression Analysis, Chapter 13,
University of Wisconsin-Milwaukee Geographic Information Science Geography 625 Intermediate Geographic Information Science Instructor: Changshan Wu Department.
Global Measures of Spatial Autocorrelation
Correlation and Linear Regression
AM Recitation 2/10/11.
Geo479/579: Geostatistics Ch13. Block Kriging. Block Estimate  Requirements An estimate of the average value of a variable within a prescribed local.
Fundamentals of Data Analysis Lecture 7 ANOVA. Program for today F Analysis of variance; F One factor design; F Many factors design; F Latin square scheme.
Concepts and Notions for Econometrics Probability and Statistics.
1 Tests with two+ groups We have examined tests of means for a single group, and for a difference if we have a matched sample (as in husbands and wives)
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 2 – Slide 1 of 25 Chapter 11 Section 2 Inference about Two Means: Independent.
T-Tests and Chi2 Does your sample data reflect the population from which it is drawn from?
Dr. Marina Gavrilova 1.  Autocorrelation  Line Pattern Analyzers  Polygon Pattern Analyzers  Network Pattern Analyzes 2.
Spatial Statistics in Ecology: Area Data Lecture Four.
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
Ecosystems are: Hierarchically structured, Metastable, Far from equilibrium Spatial Relationships Theoretical Framework: “An Introduction to Applied Geostatistics“,
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
© Copyright McGraw-Hill 2000
Geo479/579: Geostatistics Ch4. Spatial Description.
Local Spatial Statistics Local statistics are developed to measure dependence in only a portion of the area. They measure the association between Xi and.
Analyzing Expression Data: Clustering and Stats Chapter 16.
T tests comparing two means t tests comparing two means.
T Test for Two Independent Samples. t test for two independent samples Basic Assumptions Independent samples are not paired with other observations Null.
Exploratory Spatial Data Analysis (ESDA) Analysis through Visualization.
Statistical methods for real estate data prof. RNDr. Beáta Stehlíková, CSc
Material from Prof. Briggs UT Dallas
Chapter 13 Understanding research results: statistical inference.
Synthesis.
Moran’s I and Correlation Coefficient r Differences and Similarities
Spatial statistics: Spatial Autocorrelation
Task 2. Average Nearest Neighborhood
Two-Sample Hypothesis Testing
Kakhramon Yusupov June 15th, :30pm – 3:00pm Session 3
Chapter 5 Part B: Spatial Autocorrelation and regression modelling.
Two Sample Tests When do use independent
Fundamentals of regression analysis
Spatial Autocorrelation
Statistics II: An Overview of Statistics
Presentation transcript:

Spatial Autocorrelation Basics NR 245 Austin Troy University of Vermont

SA basics Lack of independence for nearby obs Negative vs. positive vs. random Induced vs inherent spatial autocorrelation First order (gradient) vs. second order (patchiness) Within patch vs between patch Directional patterns: anisotropy Measured based on point pairs

Spatial lags Source: ESRI, ArcGIS help

Statistical ramifications Spatial version of redundancy/ pseudo-replication OLS estimator biased and confidence intervals too narrow. fewer number of independent observations than degrees of freedom indicate Estimate of the standard errors will be too low. Type 1 errors Systematic bias towards variables that are correlated in space

Tests Null hypothesis: observed spatial pattern of values is equally likely as any other spatial pattern Test if values observed at a location do not depend on values observed at neighboring locations

Moran’s I See for more detailhttp:// Ratio of two expressions: similarity of pairs adjusted for number of items, over variance Similarity based on difference from global mean

Moran’s I Where N is the number of cases Xi is the variable value at a particular location Xj is the variable value at another location is the mean of the variable Wij is a weight applied to the comparison between location i and location j See for more detailhttp://

Moran’s I See for more detailhttp:// # of connections In matrix Deviation from global mean for j W ij is a contiguity matrix. Can be: Adjacency based Inverse distance-based (1/d ij ) Or can use both

Moran’s I See for more detailhttp:// “Cross product”: deviations from mean of all neighboring features, multiplied together and summed When values of pair are either both larger than mean or both smaller, cross-product positive. When one is smaller than the mean and other larger than mean, the cross-product negative. The larger the deviation from the mean, the larger the cross-product result.

Moran’s I Varies between –1.0 and –When autocorrelation is high, the coefficient is high –A high I value indicates positive autocorrelation –Zero indicates negative and positive cross products balance each other out, so no correlation Significance tested with Z statistic Z scores are standard deviations from normal dist Source: ESRI ArcGIS help

Geary’s C One prob with Moran’s I is that it’s based on global averages so easily biased by skewed distribution with outliers. Geary’s C deals with this because: Interaction is not the cross-product of the deviations from the mean like Moran, but the deviations in intensities of each observation location with one another

Geary’s C Value typically range between 0 and 2 C=1: spatial randomness C< 1: positive SA C>1: negative SA Inversely related to Moran’s I Emphasizes diff in values between pairs of observations, rather than the covariation between the pairs. Moran more global indicator, whereas the Geary coefficient is more sensitive to differences in small neighborhoods.

Scale effects Can measure I or C at different spatial lags to see scale dependency with spatial correlogram Source:

Source: Fortin and Dale, Spatial Analysis

LISA Local version of Moran: maps individual covariance components of global Moran Require some adjustment: standardize row total in weight matrix (number of neighbors) to sum to 1—allows for weighted averaging of neighbors’ influence Also use n-1 instead of n as multiplier Usually standardized with z-scores +/ is usually a critical threshold value for Z Displays HH vs LL vs HL vs LH where And expected value

Local Getis-Ord Statistic (High/low Clustering) Indicates both clustering and whether clustered values are high or low Appropriate when fairly even distribution of values and looking for spatial spikes of high values. When both the high and low values spike, they tend to cancel each other out For a chosen critical distance d, where xi is the value of ith point, w(d) is the weight for point i and j for distance d. Only difference between numerator and denominator is weighting. Hence, w/ binary weights, range is from 0 to 1.

Median House Price

Local Moran: polygon adjacency

Local Moran: inverse distance

m threshold

Local Moran: inverse distance z values

Local Moran: inverse distance: p values

Local Getis: inverse distance: Z score