Spatial Data Analysis: Intro to Spatial Statistical Concepts

Slides:



Advertisements
Similar presentations
Autocorrelation and Heteroskedasticity
Advertisements

Regression Analysis.
Spatial Autocorrelation using GIS
Introduction to Applied Spatial Econometrics Attila Varga DIMETIC Pécs, July 3, 2009.
Objectives (BPS chapter 24)
Spatial Autocorrelation Basics NR 245 Austin Troy University of Vermont.
GIS and Spatial Statistics: Methods and Applications in Public Health
Correlation and Autocorrelation
SA basics Lack of independence for nearby obs
Why Geography is important.
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
Business Statistics - QBM117 Statistical inference for regression.
Inferential Statistics
Understanding Statistics
CJT 765: Structural Equation Modeling Class 7: fitting a model, fit indices, comparingmodels, statistical power.
Name: Angelica F. White WEMBA10. Teach students how to make sound decisions and recommendations that are based on reliable quantitative information During.
Spatial Statistics in Ecology: Area Data Lecture Four.
Why Is It There? Getting Started with Geographic Information Systems Chapter 6.
Correlation & Regression
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Chapter 1 Measurement, Statistics, and Research. What is Measurement? Measurement is the process of comparing a value to a standard Measurement is the.
INTRODUCTION TO ANALYSIS OF VARIANCE (ANOVA). COURSE CONTENT WHAT IS ANOVA DIFFERENT TYPES OF ANOVA ANOVA THEORY WORKED EXAMPLE IN EXCEL –GENERATING THE.
Spatial Interpolation III
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Geo479/579: Geostatistics Ch4. Spatial Description.
Question paper 1997.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
Correlation & Regression Analysis
ANOVA, Regression and Multiple Regression March
Exploratory Spatial Data Analysis (ESDA) Analysis through Visualization.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Statistical methods for real estate data prof. RNDr. Beáta Stehlíková, CSc
1 HETEROSCEDASTICITY: WEIGHTED AND LOGARITHMIC REGRESSIONS This sequence presents two methods for dealing with the problem of heteroscedasticity. We will.
Why Is It There? Chapter 6. Review: Dueker’s (1979) Definition “a geographic information system is a special case of information systems where the database.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Some Terminology experiment vs. correlational study IV vs. DV descriptive vs. inferential statistics sample vs. population statistic vs. parameter H 0.
Stats Methods at IC Lecture 3: Regression.
Comparing Counts Chi Square Tests Independence.
Advanced Data Analytics
Chapter 13 Simple Linear Regression
Intro to Research Methods
Why Model? Make predictions or forecasts where we don’t have data.
Synthesis.
Inference for Regression
Spatial statistics: Spatial Autocorrelation
Understanding Results
CJT 765: Structural Equation Modeling
Chapter 25 Comparing Counts.
Elementary Statistics
POSC 202A: Lecture Lecture: Substantive Significance, Relationship between Variables 1.
Introduction to Inferential Statistics
Correlation and Regression
Regression model Y represents a value of the response variable.
Module 8 Statistical Reasoning in Everyday Life
Autocorrelation.
No notecard for this quiz!!
Spatial Autocorrelation
Spatial Data Analysis: Intro to Spatial Statistical Concepts
Chapter 7 – Correlation & Differential (Quasi)
Basic Practice of Statistics - 3rd Edition Inference for Regression
Chapter 26 Comparing Counts.
Product moment correlation
Regression Assumptions
The Examination of Residuals
Chapter 26 Comparing Counts Copyright © 2009 Pearson Education, Inc.
Chapter Nine: Using Statistics to Answer Questions
Autocorrelation.
Data and Data Collection
Chapter 26 Comparing Counts.
Regression Assumptions
Presentation transcript:

Spatial Data Analysis: Intro to Spatial Statistical Concepts Spatial Structures in the Social Sciences Spatial Data Analysis: Intro to Spatial Statistical Concepts Scott Bell GIS Institute Global Positioning Systems

Spatial Stats rely on Spatial Data Spatial Structures in the Social Sciences Spatial Stats rely on Spatial Data Traditional statistics are based on distributions of data along a single axis Spatial data by its nature exists on two axes (X and Y) I. E. the median in traditional statistics in the sum of all values divided by the number of observations Spatial mean is the X, Y coordinate result from calculating the means of X and Y Global Positioning Systems

Exploratory Spatial Data Analysis Spatial Structures in the Social Sciences Exploratory Spatial Data Analysis Used like descriptive statistics Potentially more options Related to Thematic Mapping and Geo-visualization Pattern identification/Hypothesis generation Global Positioning Systems

Traditional vs Spatial Spatial Structures in the Social Sciences Traditional vs Spatial “Independence of observations” Assumption Spatial Statistics operate on data that are assumed to be spatially dependent Spatial statistics (Spatial autocorrelation(SA)) have been developed to account for SA so distribution theory can be applied Global Positioning Systems

Traditional vs Spatial Spatial Structures in the Social Sciences Traditional vs Spatial “Replication” Assumption Spatial (and other systems) are complex and hard to replicate Precise Data Samples drawn from hypothetical universe In ability to replicate (and size and complexity of system) usually means our sample spatial data is the universe Distribution under null can be obtained by creating an experiment (environment) in which the null is true due to sample being universe it is virtually impossible to obtain the distribution under null hypothesis conditions First, they assume designed experiments that can be replicated. This notion of "replication" forms the philosophical basis of distribution theory. Usually, although not always, classical statistics assume precise data so that an observation's value could be known exactly if we employed sufficiently accurate and precise measuring instruments. This differs subtly from the notion of experimental error that is incorporated into most classical statistical models. Third, classical statistics assume the samples are drawn from a hypothetical universe or population. Again, this notion of a universe is part of the philosophical basis needed for distribution theory to apply. Fourth, classical statistics assume the distribution of the test statistic under the null hypothesis can be obtained by replicating a null experiment. By null experiment we mean the experiment conducted in a situation under which the null hypothesis is true. Examples of classical statistics include analysis of variance, t-test, regression, general linear models and so on. Spatial randomization tests are based on assumptions that differ substantially from those of classical tests. First, randomization tests assume observational data that cannot be replicated. Spatial systems are often large and complex, and it is usually difficult or impossible to conduct experiments on such systems. For this reason, we are often faced with only the data set at hand. This lack of replication means the sample is taken to be the universe -- we cannot sample from a larger population because the study itself cannot be replicated. These constraints mean the distribution under the null hypothesis cannot be obtained with any ease from distribution theory. Randomization tests side step this problem by randomizing the observations within the sample. A large body of literature has arisen around such randomization tests. For spatial randomization the text by Brian Manly is a good introduction. Global Positioning Systems

Spatial Autocorrelation Spatial Structures in the Social Sciences Spatial Autocorrelation What is it? Uses of spatial autocorrelation Types of spatial dependence Distance K-nearest neighbors Contiguity Rooks, bishops, and Kings cases “Everything is related to everything, but near things are more related.” (Tobler, 1976) Spatial autocorrelation is an assessment of the correlation of a variable in reference to spatial location of the variable. Assess if the values are interrelated, and if so is there a spatial pattern to the correlation, there is spatial autocorrelation. Spatial autocorrelation measures the level of interdependence between the variables, the nature and strength of the interdependence. Spatial autocorrelation may be classified as either positive or negative. Positive spatial autocorrelation has all similar values appearing together, while negative spatial autocorrelation has dissimilar values appearing in close association. Does the non-spatial variable (attribute) in adjacent spaces vary with the time or space separating them. Uses of assessment of spatial autocorrelation: - identification of patterns which may reveal an underlying process, - describe a spatial pattern and use as evidence, such as a diagnostic tool for the nature of residuals in a regression analysis, - as an inferential statistic to buttress assumptions about the data, - data interpolation technique. Global Positioning Systems

Spatial Autocorrelation Spatial Structures in the Social Sciences Spatial Autocorrelation Deal simultaneously with similarities in the location (space) of objects and their (non-spatial) attributes. (Goodchild, et. al. 2001) Similar location/Similar attribute = high spatial autocorrelation Similar location/dissimilar attributes = negative spatial autocorrelation Attributes are independent of location = zero/low correlation Global Positioning Systems

Spatial Structures in the Social Sciences Global Positioning Systems

Spatial Structures in the Social Sciences Correlation= -1.00 Correlation= -.393 Correlation= 0 Correlation= +.393 Correlation= +.857 Global Positioning Systems

Spatial Structures in the Social Sciences Global Positioning Systems

Spatial Regression (in GeoDa and ArcGIS) Allows for control of spatially auto-correlated error or DV (non-independent observations) Error: Unexplained variation in DV is related to nearby values of error Lag: spatial dependence in DV, additional IV term added to model