Spatial Statistics II RESM 575 Spring 2010 Lecture 8.

Slides:



Advertisements
Similar presentations
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Advertisements

Spatial Autocorrelation using GIS
Spatial statistics Lecture 3.
2-5 : Normal Distribution
Correlation and Autocorrelation
QUANTITATIVE DATA ANALYSIS
Applied Geostatistics Geostatistical techniques are designed to evaluate the spatial structure of a variable, or the relationship between a value measured.
GG313 Lecture 8 9/15/05 Parametric Tests. Cruise Meeting 1:30 PM tomorrow, POST 703 Surf’s Up “Peak Oil and the Future of Civilization” 12:30 PM tomorrow.
Chi-square Test of Independence
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Introduction to Mapping Sciences: Lecture #5 (Form and Structure) Form and Structure Describing primary and secondary spatial elements Explanation of spatial.
CHAPTER 6 Statistical Analysis of Experimental Data
OMS 201 Review. Range The range of a data set is the difference between the largest and smallest data values. It is the simplest measure of dispersion.
SA basics Lack of independence for nearby obs
You would not go and count every single flower
Advanced GIS Using ESRI ArcGIS 9.3 Arc ToolBox 5 (Spatial Statistics)
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Intro. To GIS Lecture 6 Spatial Analysis April 8th, 2013
Density vs Hot Spot Analysis. Density Density analysis takes known quantities of some phenomenon and spreads them across the landscape based on the quantity.
Measures of Central Tendency
Point Pattern Analysis
+ Quantitative Statistics: Chi-Square ScWk 242 – Session 7 Slides.
Problem A newly married couple plans to have four children and would like to have three girls and a boy. What are the chances (probability) their desire.
STAT 5372: Experimental Statistics Wayne Woodward Office: Office: 143 Heroy Phone: Phone: (214) URL: URL: faculty.smu.edu/waynew.
Data Presentation.
Spatial Statistics Applied to point data.
Applied Cartography and Introduction to GIS GEOG 2017 EL
Estimation of Statistical Parameters
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
CS433: Modeling and Simulation Dr. Anis Koubâa Al-Imam Mohammad bin Saud University 15 October 2010 Lecture 05: Statistical Analysis Tools.
Spatial Association Defining the relationship between two variables.
The Examination of Residuals. Examination of Residuals The fitting of models to data is done using an iterative approach. The first step is to fit a simple.
BINOMIALDISTRIBUTION AND ITS APPLICATION. Binomial Distribution  The binomial probability density function –f(x) = n C x p x q n-x for x=0,1,2,3…,n for.
Sampling Populations Ideal situation - Perfect knowledge Not possible in many cases - Size & cost Not necessary - appropriate subset  adequate estimates.
Biostatistics Class 3 Discrete Probability Distributions 2/8/2000.
TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.
QUANTITATIVE RESEARCH AND BASIC STATISTICS. TODAYS AGENDA Progress, challenges and support needed Response to TAP Check-in, Warm-up responses and TAP.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Local Indicators of Spatial Autocorrelation (LISA) Autocorrelation Distance.
Statistical Analysis IB Topic 1. Why study statistics?  Scientists use the scientific method when designing experiments  Observations and experiments.
CHAPTER 11 VECTOR DATA ANALYSIS 11.1 Buffering
Spatial Statistics in Ecology: Point Pattern Analysis Lecture Two.
NR 143 Study Overview: part 1 By Austin Troy University of Vermont Using GIS-- Introduction to GIS.
KNR 445 Statistics t-tests Slide 1 Introduction to Hypothesis Testing The z-test.
What’s the Point? Working with 0-D Spatial Data in ArcGIS
Point Pattern Analysis Point Patterns fall between the two extremes, highly clustered and highly dispersed. Most tests of point patterns compare the observed.
So, what’s the “point” to all of this?….
Final Project : 460 VALLEY CRIMES. Chontanat Suwan Geography 460 : Spatial Analysis Prof. Steven Graves, Ph.D.
12/23/2015Slide 1 The chi-square test of independence is one of the most frequently used hypothesis tests in the social sciences because it can be used.
PCB 3043L - General Ecology Data Analysis.
STATISTICS FOR SCIENCE RESEARCH (The Basics). Why Stats? Scientists analyze data collected in an experiment to look for patterns or relationships among.
Probability and Distributions. Deterministic vs. Random Processes In deterministic processes, the outcome can be predicted exactly in advance Eg. Force.
Week 6 Dr. Jenne Meyer.  Article review  Rules of variance  Keep unaccounted variance small (you want to be able to explain why the variance occurs)
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Material from Prof. Briggs UT Dallas
1 Statistical Analysis - Graphical Techniques Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND.
CHAPTER 11 Mean and Standard Deviation. BOX AND WHISKER PLOTS  Worksheet on Interpreting and making a box and whisker plot in the calculator.
Spatial statistics Lecture 3 2/4/2008. What are spatial statistics Not like traditional, a-spatial or non-spatial statistics But specific methods that.
PCB 3043L - General Ecology Data Analysis Organizing an ecological study What is the aim of the study? What is the main question being asked? What are.
Statistical analysis.
Task 2. Average Nearest Neighborhood
STATISTICS FOR SCIENCE RESEARCH
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Statistical analysis.
PCB 3043L - General Ecology Data Analysis.
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Test Review: Ch. 7-9
Nicholas A. Procopio, Ph.D, GISP
CHAPTER – 1.2 UNCERTAINTIES IN MEASUREMENTS.
CHAPTER – 1.2 UNCERTAINTIES IN MEASUREMENTS.
Presentation transcript:

Spatial Statistics II RESM 575 Spring 2010 Lecture 8

Last time Part A: Spatial statistics What are spatial measurements and statistics? Part B: Measuring geographic distributions Testing statistical significance Identifying patterns

Review Used in lab7

What we will use spatial stats to find How features are distributed Where are the clusters What is the pattern created by the features What are the relationships between sets of features or values

Identifying patterns Part A: Measuring the pattern of discrete or noncontinuous features (points, lines, polygons) Part B: Measuring the spatial pattern of feature values

Measuring the pattern of discrete or noncontinuous features (points, lines, polygons) Options: Overlaying areas of equal size Calculating the average distance between features Counting the number of features within defined distances

Overlaying areas of equal size Termed Quadrat analysis GIS overlays areas of equal size on the study area and counts the number of features in each Calculates the expected counts for a random distribution If fewer areas than expected contain most of the features and more areas than expected contain few or no features, the features form a clustered pattern . . . .

Notes Traditionally, squares are used for the quadrats The quadrat size will affect whether any patterns are identified! What size to use? Originally done with archeological digs where squares were easiest to lay out with stakes and a string

Quadrat size Quadrat size = twice the size of the mean area per feature Length of a side of quadrat = sqrt[{2*(Area of the extent / # of features)}]

Example Area = 6,000m * 6,000m = 36,000,000m2 n = 36 Length of a side of quadrat = sqrt[{2*(36,000,000 / 36)}] 1412.14 m

Hawth’s spatial ecology tools Already loaded in 317

Use Hawth’s tools to create quadrat ArcMap ->View -> Toolbars -> 1412

Quadrat analysis Because the method counts the number of features per unit area, it measures the density of features It doesn’t take into account the proximity of features to each other or their arrangement

Calculating the quadrat analysis stat GIS can be used to count the number of features in each quadrat and then summarize it to create a frequency table We want a table listing the number of quadrats containing 0, 1, 2, etc features How do we create this in GIS??????

Creating a frequency table Spatially join the point features to the quadrat polygons (right click on the quadrat polygons ->joins and relates -> join…) Make sure it is on spatial location join The Point features Sum because we want total # of points per quadrat Creates a new output shapefile to be used in step 2

Creating a frequency table Now summarize the count field from the output shapefile (right click Count field) Will create final frequency table shown on next slide

Finished frequency table # of points per quad # of quads that had that many points in them Right clicking on Options will allow us to export dbf file for stat tests in Excel

Calculating the quadrat analysis stat We then need a frequency table for the expected distribution usually based on a Poisson distribution Calculating the probability that a given number of features will occur in a quadrat λ = n / k Where λ = the ave # of features per quadrat n = the # of features k = the # of quadrats

Finding the prob of a # of features occurring in any given quadrat P(x) P(x) = e-λλx / x! e is Euler’s constant, with a value of 2.7183 X! is x factorial (if x is 3 then 3! is 3*2*1= 6 Excel can be used to easily calculate this equation and create a frequency table NOTE: After calculating P(x), or the frequency table, we then multiply the results by the total # of features to get the number of quadrats expected to contain that number of features

Analysis By comparing the two frequency tables, you can see whether the features create a pattern If the observed distribution table has more features than the table for the random distribution, then the features create a clustered pattern

Kolmogorov-Smirnov test Stat tests to find how much the frequency tables (and thus the patterns) differ Kolmogorov-Smirnov test Compares the difference in frequencies between the tables Chi-Square test Compares the cumulative difference in frequencies

Kolmogorov-Smirnov test Calculates the proportion of quads in each class for each line in the frequency table (the number of quads in the class divided by the total number of quads) Then creates a running cumulative total of proportions from top of the table to the bottom – the total of all classes equals 1 (100% of the quads)

Kolmogorov-Smirnov test Calculated difference in proportions Compares this value to the critical value for a confidence level you specify

Critical value of Kolmogorov-Smirnov test Critical value = K / sqrt(m) Where m = number of quads k = is the constant for a confidence interval you’re using For a CI of .20 (80%), K is 1.07 If 63 quads, Critical value = .135 If the calculated difference in proportions between the observed and random distribution is greater than the critical value, the difference can be considered statistically significant.

Chi-Square Also based on the frequency table Test to find if two sets of frequencies (or two distributions) are significantly different

Chi-Square test You then compare the test value with the critical value If the X2 value is greater than the critical value, the distributions are significantly different, and you can consider the observed distribution to not be random, but either clustered or dispersed

Factors influencing the results of quadrat analysis Works best with small, tight clusters (earthquakes, bird studies) Since it does not measure the relationship or distance between features (just if they fall in the quad or not) it may not recognize certain patterns . . . . . . .

Measuring the pattern of discrete or noncontinuous features (points, lines, polygons) Options: Overlaying areas of equal size Calculating the average distance between features Counting the number of features within defined distances

Calculating the Average Distance Between Features Referred to as the Nearest Neighbor index Measures the distance between each feature and its nearest neighbor an then calculates the average Distributions that have a smaller ave distance between features than a random distribution would have are considered clustered . . . . .

Notes on Nearest Neighbor Use if there is a direct interaction between features Good for data analyzed along a line (ie plants or wildlife observations collected along a transect) Stat is calculated using the distance between features

Calculating the index in GIS

Interpretation of results

Testing the results statistically of NNI The null hypoth is that the features are randomly distributed A spatially random distribution would result in a normal distribution of the NNI Z score

Testing the results statistically of NNI A positive Z score indicates a dispersed pattern, negative Z score is clustered At 95% CI the z would have to be > 1.96 or < -1.96 to be significantly significant Can compare distributions (the higher the z the more clustered) Make sure study areas are the same size

Factors influencing the NNI For lines, NNI could be misleading if endpoints are close and midpoints aren’t If single continuous lines are really several joined lines Centroids of large, convoluted polygons or of different sizes, area boundaries may be close while the centroids are far apart

Side note: finding points of lines, or points of polygons (centroids) in ArcGIS Select source tab

Factors influencing the NNI Extent of the study area

Measuring the pattern of discrete or noncontinuous features (points, lines, polygons) Options: Overlaying areas of equal size Calculating the average distance between features Counting the number of features within defined distances

Counting the number of features within defined distances Known as the K-function, or Ripley’s K-function Here you specify a distance interval and use GIS to calculate the average number of neighboring features within the distance of each feature If the ave # of features found at a distance is greater than the ave concentration of features throughout the study area, then the distribution is considered clustered at that distance . . . . . . K is the distance bands

Notes on K-function Good for if you are interested in how the pattern changes at different scales of analysis Includes all distances to neighbors within a given distance Emergency calls, bird nests

K function in GIS Finds the distance from each point to every other point Then, for each point, counts up the number of surrounding points within a given distance

Interpreting the K stat

Paper on class website for today.

Factors influencing the K function Points near the edge of a study area are likely to have less neighboring points At larger distances its more likely that fewer neighboring points will be found than actually exist

Comparing methods for measuring the pattern of feature locations