Ch 5 Practical Point Pattern Analysis Spatial Stats & Data Analysis by Magdaléna Dohnalová.

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

Clustering.
Learning Trajectory Patterns by Clustering: Comparative Evaluation Group D.
Hotspot/cluster detection methods(1) Spatial Scan Statistics: Hypothesis testing – Input: data – Using continuous Poisson model Null hypothesis H0: points.
Statistical approaches for detecting clusters of disease. Feb. 26, 2013 Thomas Talbot New York State Department of Health Bureau of Environmental and Occupational.
The Evolution of Spatial Outlier Detection Algorithms - An Analysis of Design CSci 8715 Spatial Databases Ryan Stello Kriti Mehra.
Spatial statistics Lecture 3.
Image Segmentation Image segmentation (segmentace obrazu) –division or separation of the image into segments (connected regions) of similar properties.
Generated Waypoint Efficiency: The efficiency considered here is defined as follows: As can be seen from the graph, for the obstruction radius values (200,
1st Meeting Industrial Geometry Computational Geometry ---- Some Basic Structures 1st IG-Meeting.
Zakaria A. Khamis GE 2110 GEOGRAPHICAL STATISTICS GE 2110.
Approaches to Data Acquisition The LCA depends upon data acquisition Qualitative vs. Quantitative –While some quantitative analysis is appropriate, inappropriate.
Spatial Statistics II RESM 575 Spring 2010 Lecture 8.
University of Wisconsin-Milwaukee Geographic Information Science Geography 625 Intermediate Geographic Information Science Instructor: Changshan Wu Department.
GIS and Spatial Statistics: Methods and Applications in Public Health
Correlation and Autocorrelation
QUANTITATIVE DATA ANALYSIS
Applied Geostatistics Geostatistical techniques are designed to evaluate the spatial structure of a variable, or the relationship between a value measured.
Why Geography is important.
University of Wisconsin-Milwaukee Geographic Information Science Geography 625 Intermediate Geographic Information Science Instructor: Changshan Wu Department.
Geographic Information Science
Point Pattern Analysis
University of Wisconsin-Milwaukee Geographic Information Science Geography 625 Intermediate Geographic Information Science Instructor: Changshan Wu Department.
Descriptive Statistics for Spatial Distributions Chapter 3 of the textbook Pages
ETM 607 – Random Number and Random Variates
Area Objects and Spatial Autocorrelation Chapter 7 Geographic Information Analysis O’Sullivan and Unwin.
Data Mining Techniques
Spatial Statistics Applied to point data.
Applied Cartography and Introduction to GIS GEOG 2017 EL
Dr. Marina Gavrilova 1.  Autocorrelation  Line Pattern Analyzers  Polygon Pattern Analyzers  Network Pattern Analyzes 2.
Chapter 11 Chi-Square Procedures 11.3 Chi-Square Test for Independence; Homogeneity of Proportions.
Texture. Texture is an innate property of all surfaces (clouds, trees, bricks, hair etc…). It refers to visual patterns of homogeneity and does not result.
Location Fingerprint Analyses Toward Efficient Indoor Positioning
Extending Spatial Hot Spot Detection Techniques to Temporal Dimensions Sungsoon Hwang Department of Geography State University of New York at Buffalo DMGIS.
Sampling Populations Ideal situation - Perfect knowledge Not possible in many cases - Size & cost Not necessary - appropriate subset  adequate estimates.
Chapter 4 – Distance methods
Testing Hypothesis That Data Fit a Given Probability Distribution Problem: We have a sample of size n. Determine if the data fits a probability distribution.
Sampling  When we want to study populations.  We don’t need to count the whole population.  We take a sample that will REPRESENT the whole population.
Geo479/579: Geostatistics Ch4. Spatial Description.
Spatial Statistics in Ecology: Point Pattern Analysis Lecture Two.
Point Pattern Analysis Point Patterns fall between the two extremes, highly clustered and highly dispersed. Most tests of point patterns compare the observed.
So, what’s the “point” to all of this?….
Grid-based Map Analysis Techniques and Modeling Workshop
Local Indicators of Categorical Data Boots, B. (2003). Developing local measures of spatial association for categorical data. Journal of Geographical Systems,
Methods for point patterns. Methods consider first-order effects (e.g., changes in mean values [intensity] over space) or second-order effects (e.g.,
Point Pattern Analysis
1 CLUSTER VALIDITY  Clustering tendency Facts  Most clustering algorithms impose a clustering structure to the data set X at hand.  However, X may not.
Probability and Distributions. Deterministic vs. Random Processes In deterministic processes, the outcome can be predicted exactly in advance Eg. Force.
Colour and Texture. Extract 3-D information Using Vision Extract 3-D information for performing certain tasks such as manipulation, navigation, and recognition.
MAKING MEANING OUT OF DATA Statistics for IB-SL Biology.
Zakaria A. Khamis GE 2110 GEOGRAPHICAL STATISTICS GE 2110.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Non-parametric Methods for Clustering Continuous and Categorical Data Steven X. Wang Dept. of Math. and Stat. York University May 13, 2010.
+ Mortality. + Starter for 10…. In pairs write on a post it note: One statistic that we use to measure mortality On another post it note write down: A.
METU, GGIT 538 CHAPTER V MODELING OF POINT PATTERNS.
Cases and controls A case is an individual with a disease, whose location can be represented by a point on the map (red dot). In this table we examine.
Introduction to Spatial Statistical Analysis
Spatial analysis Measurements - Points: centroid, clustering, density
Chapter 2: The Pitfalls and Potential of Spatial Data
Quantifying Scale and Pattern Lecture 7 February 15, 2005
Summary of Prev. Lecture
Probabilistic Data Management
Statistics in medicine
Lecture 6 Implementing Spatial Analysis
CSE572, CBS598: Data Mining by H. Liu
Spatial Point Pattern Analysis
Spatial Data Analysis: Intro to Spatial Statistical Concepts
CSE572, CBS572: Data Mining by H. Liu
Spatial Data Analysis: Intro to Spatial Statistical Concepts
CSE572: Data Mining by H. Liu
Presentation transcript:

Ch 5 Practical Point Pattern Analysis Spatial Stats & Data Analysis by Magdaléna Dohnalová

Problems of pure Spatial Statistical Analysis  Null Hypothesis: Is that IRP/CSR? –Insufficient description  First-order influence  Process-Pattern Matching –Either it does or it doesn’t –Global technique

In fact, what we need to know is..  Where the pattern deviates from expectations >>> CLUSTER DETECTION  Where are the Clusters?

Case Study: Sellafield Leukemia Study, UK  Children leukemia deaths clustered around nuclear plant  Proved that THERE WAS a cluster, but missing evidence of linking cause –Apparent clusters occur naturally in many diseases –The actual number in cluster was very low –Similar clusters have been found around nonnuclear plants

Cluster analysis of Point Patterns  Problem with small clusters  Distance Rings –Rates of occurrence –Distance form the plant  Geographical Analysis Machine (GAM) –Automated cluster detector for point patterns

GAM…how the  Two dimensional grid  Series of different circles –various size and density  Number of events within each circle  Exceeds threshold? (Monte Carlo simulation of expected pattern) –If YES, draw circle on the map  END RESULT: map of significant circles

Pattern of Circles used by GAM

About Cluster Detectors  More recent genetic algorithms (intelligent) –Map Explorer (MAPEX) & Space Time Attribute Creature (STAC)  Data Availability –When aggregate data -> MAUP  Variation in Background Rate –Assume uniform geography –Overlapping of significant circles not independent Setting variable threshold!!!  Time problem –Snapshot effect –Aggregation over time, similar to MAUP

Extension of Basic Point Pattern  Multiple Sets of Events –Contingency table analysis Chi-Square Test Discards location information –Cross Functions (G and K functions) Cumulative Nearest-Neighbor function Distance from event in each pattern (G) Events counts within in distance to the other (K) Random if events are independent of each other

Extension of Basic Point Pattern  When was it Clustered? –Clustering in space and time together! –Knox test Distance in space (near-far) and time (close-distant) Contingency table + Chi-square Threshold decision – similar to MAUP –Mantel Test Distance and space distance for all objects –Modified K function Combining two K functions in Contingency table Test difference between the two

Point Pattern Analysis: Proximity Polygons  Using DENSITY and DISTANCE  Geographical Space is not random!  Delaunay triangulation of proximity polygons  Neighborhood relations are defined in respect to local patterns!

Point Pattern Analysis: Proximity Polygons  Delaunay proximity polygons –Distribution of area –The number of neighbors –Lengths of Edges –Minimum Spanning Tree (from Gabriel graph)

Point Pattern Analysis: Distance Based Methods  Distance Matrices –Large amount of data (not the most efficient but convenient for computer calculations) –Underlines shortest distance (nearest neighbor & G function)  Convert to Adjacency Matrices (K function)  Derived Matrices (F function)

Questions  What are the two major questions we ask about clusters?  What is the final product of GAM?  What are the main challenges in cluster detection?  What are the strengths of using Proximity Polygons for cluster detection? Describe the minimum spanning tree.