Pattern Statistics Michael F. Goodchild University of California Santa Barbara.

Slides:



Advertisements
Similar presentations
“Students” t-test.
Advertisements

Sampling Distributions Suppose I throw a dice times and count the number of times each face turns up: Each score has a similar frequency (uniform.
Likelihood Ratio, Wald, and Lagrange Multiplier (Score) Tests
Zakaria A. Khamis GE 2110 GEOGRAPHICAL STATISTICS GE 2110.
Spatial Statistics II RESM 575 Spring 2010 Lecture 8.
GIS and Spatial Statistics: Methods and Applications in Public Health
Correlation and Autocorrelation
QUANTITATIVE DATA ANALYSIS
Applied Geostatistics Geostatistical techniques are designed to evaluate the spatial structure of a variable, or the relationship between a value measured.
Horng-Chyi HorngStatistics II127 Summary Table of Influence Procedures for a Single Sample (I) &4-8 (&8-6)
Anomaly Detection. Anomaly/Outlier Detection  What are anomalies/outliers? The set of data points that are considerably different than the remainder.
Modern methods The classical approach: MethodProsCons Time series regression Easy to implement Fairly easy to interpret Covariates may be added (normalization)
SA basics Lack of independence for nearby obs
Personality, 9e Jerry M. Burger
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Ch 5 Practical Point Pattern Analysis Spatial Stats & Data Analysis by Magdaléna Dohnalová.
Inferential statistics Hypothesis testing. Questions statistics can help us answer Is the mean score (or variance) for a given population different from.
Point Pattern Analysis
University of Wisconsin-Milwaukee Geographic Information Science Geography 625 Intermediate Geographic Information Science Instructor: Changshan Wu Department.
AM Recitation 2/10/11.
Methods in Medical Image Analysis Statistics of Pattern Recognition: Classification and Clustering Some content provided by Milos Hauskrecht, University.
Chapter 1: Introduction to Statistics
Single-Sample T-Test Quantitative Methods in HPELS 440:210.
Chapter 3 Fundamentals: Maps as Outcomes of Processes By: Mindy Syfert.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Spatial Statistics in Ecology: Area Data Lecture Four.
Causal inferences During the last two lectures we have been discussing ways to make inferences about the causal relationships between variables. One of.
15. Descriptive Summary, Design, and Inference. Outline Data mining Descriptive summaries Optimization Hypothesis testing.
Spatial Association Defining the relationship between two variables.
Slides to accompany Weathington, Cunningham & Pittenger (2010), Chapter 3: The Foundations of Research 1.
Extending Spatial Hot Spot Detection Techniques to Temporal Dimensions Sungsoon Hwang Department of Geography State University of New York at Buffalo DMGIS.
Sampling Populations Ideal situation - Perfect knowledge Not possible in many cases - Size & cost Not necessary - appropriate subset  adequate estimates.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 18 Inference for Counts.
CPS 270: Artificial Intelligence Machine learning Instructor: Vincent Conitzer.
Chapter 4 – Distance methods
1 Chapter 6 – Analysis of mapped point patterns This chapter will introduce methods for analyzing and modeling the spatial distribution of mapped point.
+ Chi Square Test Homogeneity or Independence( Association)
Sampling  When we want to study populations.  We don’t need to count the whole population.  We take a sample that will REPRESENT the whole population.
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Data Mining Anomaly Detection Lecture Notes for Chapter 10 Introduction to Data Mining by.
Spatial Statistics in Ecology: Point Pattern Analysis Lecture Two.
What’s the Point? Working with 0-D Spatial Data in ArcGIS
Three Broad Purposes of Quantitative Research 1. Description 2. Theory Testing 3. Theory Generation.
Point Pattern Analysis Point Patterns fall between the two extremes, highly clustered and highly dispersed. Most tests of point patterns compare the observed.
So, what’s the “point” to all of this?….
Point Pattern Analysis. Methods for analyzing completely censused population data F Entire extent of study area or F Each unit of an array of contiguous.
Dynamic biological processes influence population density, dispersion, and demographics Chapter 53, Section 1.
Chapter 14: Inference for Regression. A brief review of chapter 4... (Regression Analysis: Exploring Association BetweenVariables )  Bi-variate data.
Methods for point patterns. Methods consider first-order effects (e.g., changes in mean values [intensity] over space) or second-order effects (e.g.,
Point Pattern Analysis
STATISTICS FOR SCIENCE RESEARCH (The Basics). Why Stats? Scientists analyze data collected in an experiment to look for patterns or relationships among.
Applied Quantitative Analysis and Practices LECTURE#14 By Dr. Osman Sadiq Paracha.
Statistical methods for real estate data prof. RNDr. Beáta Stehlíková, CSc
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Zakaria A. Khamis GE 2110 GEOGRAPHICAL STATISTICS GE 2110.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 11 Inference for Distributions of Categorical.
METU, GGIT 538 CHAPTER V MODELING OF POINT PATTERNS.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Cases and controls A case is an individual with a disease, whose location can be represented by a point on the map (red dot). In this table we examine.
STATISTICS FOR SCIENCE RESEARCH
Quantifying Scale and Pattern Lecture 7 February 15, 2005
Summary of Prev. Lecture
Spatial Point Pattern Analysis
Quantitative Methods in HPELS HPELS 6210
Comparing two Rates Farrokh Alemi Ph.D.
Comparing Two Proportions
Local Habitat Sampling
Comparing Two Proportions
Section 11.1: Significance Tests: Basics
Instructor: Vincent Conitzer
Presentation transcript:

Pattern Statistics Michael F. Goodchild University of California Santa Barbara

Outline n Some examples of analysis n Objectives of analysis n Cross-sectional analysis n Point patterns

What are we trying to do? n Infer process –processes leave distinct fingerprints on the landscape –several processes can leave the same fingerprints enlist time to resolve ambiguity invoke Occam's Razor confirm a previously identified hypothesis

Alternatives n Expose aspects of pattern that are otherwise invisible –Openshaw –Cova n Expose anomalies, patterns n Convince others of the existence of patterns, problems, anomalies

Cross-sectional analysis n Social data collected in cross-section –longitudinal data are difficult to construct –difficult for bureaucracies to sustain –compare temporal resolution of process to temporal resolution of bureaucracy n Cross-sectional perspectives are rich in context –can never confirm process –though they can perhaps falsify –useful source of hypotheses, insights

What kinds of patterns are of interest? n Unlabeled objects –how does density vary? –do locations influence each other? –are there clusters? n Labeled objects –is the arrangement of labels random? –or do similar labels cluster? –or do dissimilar labels cluster?

First-order effects n Random process (CSR) –all locations are equally likely –an event does not make other events more likely in the immediate vicinity n First-order effect –events are more likely in some locations than others –events may still be independent –varying density

Second-order effects n Event makes others more or less likely in the immediate vicinity –clustering –but is a cluster the result of first- or second- order effects? –is there a prior reason to expect variation in density?

Testing methods n Counts by quadrat –Poisson distribution

Deaths by horse-kick in the Prussian army n Mean m = 0.61, n = 200 Deaths per yr01234 Probability Number of years expected Number of years observed

Towns in Iowa n 1173 towns, 154 quadrats 20mi by 10mi Chisquare with 8 df = 12.7 Accept H 0

Distance to nearest neighbor n Observed mean distance r o n Expected mean distance r e = 1/2  d –where d is density per unit area n Test statistic:

Towns in Iowa n 622 points tested n 643 per unit area n Observed mean distance 3.52 n Expected mean distance 3.46 n Test statistic 0.82 n Accept H 0

But what about scale? n A pattern can be clustered at one scale and random or dispersed at another n Poisson test –scale reflected in quadrat size n Nearest-neighbor test –scale reflected in choosing nearest neighbor –higher-order neighbors could be analyzed

Weaknesses of these simple methods n Difficulty of dealing with scale n Second-order effects only –density assumed uniform n Better methods are needed

K-function analysis n K(h) = expected number of events within h of an arbitrarily chosen event, divided by d n How to estimate K? –take an event i –for every event j lying within h of i: score 1

Allowing for edge effects score < 1

The K function n In CSR K(h) =  h 2 n So instead plot:

What about labeled points? n How are the points located? –random, clustered, dispersed n How are the values assigned among the points? –among possible arrangments –random –clustered –dispersed

Moran and Geary indices