Introduction to Smoothing and Spatial Regression

Slides:



Advertisements
Similar presentations
Spatial modelling an introduction
Advertisements

Spatial point patterns and Geostatistics an introduction
Spatial point patterns and Geostatistics an introduction
Geography 360 Principles of Cartography May 15~17, 2006.
Basic geostatistics Austin Troy.
Spatial Interpolation
Steve Kopp and Steve Lynch
University of Wisconsin-Milwaukee Geographic Information Science Geography 625 Intermediate Geographic Information Science Instructor: Changshan Wu Department.
GIS and Spatial Statistics: Methods and Applications in Public Health
Correlation and Autocorrelation
WFM 6202: Remote Sensing and GIS in Water Management © Dr. Akm Saiful IslamDr. Akm Saiful Islam WFM 6202: Remote Sensing and GIS in Water Management Akm.
Deterministic Solutions Geostatistical Solutions
Spatial Analysis Longley et al., Ch 14,15. Transformations Buffering (Point, Line, Area) Point-in-polygon Polygon Overlay Spatial Interpolation –Theissen.
Spatial Interpolation
Applied Geostatistics
Deterministic Solutions Geostatistical Solutions
Why Geography is important.
Ordinary Kriging Process in ArcGIS
Geostatistics Mike Goodchild. Spatial interpolation n A field –variable is interval/ratio –z = f(x,y) –sampled at a set of points n How to estimate/guess.
Statistical Tools for Environmental Problems NRCSE.
Applications in GIS (Kriging Interpolation)
Method of Soil Analysis 1. 5 Geostatistics Introduction 1. 5
Geostatistic Analysis
Applied Geostatistics geog. buffalo. edu/~lbian/GEO497_597
Spatial Interpolation
Spatial Interpolation of monthly precipitation by Kriging method
Using ESRI ArcGIS 9.3 Spatial Analyst
$88.65 $ $22.05/A profit increase Improving Wheat Profits Eakly, OK Irrigated, Behind Cotton.
Geo479/579: Geostatistics Ch12. Ordinary Kriging (1)
Basic geostatistics Austin Troy.
Ecosystems are: Hierarchically structured, Metastable, Far from equilibrium Spatial Relationships Theoretical Framework: “An Introduction to Applied Geostatistics“,
Explorations in Geostatistical Simulation Deven Barnett Spring 2010.
Geographic Information Science
Spatial Statistics in Ecology: Continuous Data Lecture Three.
GEOSTATISICAL ANALYSIS Course: Special Topics in Remote Sensing & GIS Mirza Muhammad Waqar Contact: EXT:2257.
Data Types Entities and fields can be transformed to the other type Vectors compared to rasters.
Spatial Interpolation III
Interpolation Content Point data Interpolation Review Simple Interpolation Geostatistical Analyst in ArcGIS IDW in Geostatistical Analyst Semivariograms.
Spatial Interpolation Chapter 13. Introduction Land surface in Chapter 13 Land surface in Chapter 13 Also a non-existing surface, but visualized as a.
9.3 and 9.4 The Spatial Model And Spatial Prediction and the Kriging Paradigm.
Concepts and Applications of Kriging
Lab for Remote Sensing Hydrology and Spatial Modeling Dept of Bioenvironmental Systems Engineering National Taiwan University 1/90 GEOSTATISTICS VARIOGRAM.
Grid-based Map Analysis Techniques and Modeling Workshop
Esri UC 2014 | Technical Workshop | Concepts and Applications of Kriging Eric Krause Konstantin Krivoruchko.
Statistical Surfaces Any geographic entity that can be thought of as containing a Z value for each X,Y location –topographic elevation being the most obvious.
Lecture 6: Point Interpolation
Short course on space-time modeling Instructors: Peter Guttorp Johan Lindström Paul Sampson.
Interpolation and evaluation of probable Maximum Precipitation (PMP) patterns using different methods by: tarun gill.
Special Topics in Geo-Business Data Analysis Week 3 Covering Topic 6 Spatial Interpolation.
Geostatistics GLY 560: GIS for Earth Scientists. 2/22/2016UB Geology GLY560: GIS Introduction Premise: One cannot obtain error-free estimates of unknowns.
Geo479/579: Geostatistics Ch12. Ordinary Kriging (2)
Spatial Point Processes Eric Feigelson Institut d’Astrophysique April 2014.
INTERPOLATION Procedure to predict values of attributes at unsampled points within the region sampled Why?Examples: -Can not measure all locations: - temperature.
Exposure Prediction and Measurement Error in Air Pollution and Health Studies Lianne Sheppard Adam A. Szpiro, Sun-Young Kim University of Washington CMAS.
URBDP 422 URBAN AND REGIONAL GEO-SPATIAL ANALYSIS
Synthesis.
Labs Put your name on your labs! Layouts: Site 1 Photos Title Legend
Spatial statistics: Spatial Autocorrelation
Ch9 Random Function Models (II)
Labs Labs 5 & 6 are on the website. Lab 5 is due Tuesday (Oct. 31st).
Creating Surfaces Steve Kopp Steve Lynch.
Spatial Analysis Longley et al..
Inference for Geostatistical Data: Kriging for Spatial Interpolation
Paul D. Sampson Peter Guttorp
Interpolation & Contour Maps
Spatial interpolation
Concepts and Applications of Kriging
Concepts and Applications of Kriging
Modeling Spatial Phenomena
Presentation transcript:

Introduction to Smoothing and Spatial Regression University at Albany School of Public Health EPI 621, Geographic Information Systems and Public Health Introduction to Smoothing and Spatial Regression Glen Johnson, PhD Lehman College / CUNY School of Public Health glen.johnson@lehman.cuny.edu

Consider points distributed in space “Pure” Point process: Only coordinates locating some “events”. Set of points, S ={s1, s2, … , sn} Points represent locations of something that is measured. Values of a random variable, Z, are observed for a set S of locations, such that the set of measurements are Z(s) ={Z(s1), Z(s2), … , Z(sn)} _____________________ Examples include location of burglaries location of disease cases location of trees, etc. ___________________________ Examples include cases and controls (binary outcome) identified by location of residence Population-based count (integer outcome) tied to geographic centroids PCBs measured in mg/kg (continuous outcome) in soil cores taken at specific point locations

Example of a Pure Point Process: Baltimore Crime Events Question: How to interpolate a smoothed surface that shows varying “intensity” of the points? (source: http://www.people.fas.harvard.edu/~zhukov/spatial.html)

Kernel Density Estimation From: Cromely and McLafferty. 2002. GIS and Public Health.

Kernel Density Estimation Estimate “intensity” of events at regular grid points as a function of nearby observed events. General formula for any point x is: where xi are “observed” points for i = 1, …, n locations in the study area, k(.) is a kernel function that assigns decreasing weight to observed points as they approach the bandwidth h. Points that lie beyond the bandwidth, h, are given zero weighting.

Results from Kernel Density Smoothing in R

Kernel Density Surface of Bike Share Locations in NYC Source: http://spatialityblog.com/2011/09/29/spatial-analysis-of-nyc-bikeshare-maps/

Examples of Values Observed at Point Locations, Z(s) : Question: How to interpolate a smoothed surface that captures variation in Z(s)?

First, consider “deterministic” approaches to spatial interpolation: Deterministic models do not acknowledge uncertainty. Only real advantage is simplicity; good for exploratory analysis Several options, all with limitations. We will consider Inverse Distance Weighted (IDW) because of its common usage.

Inverse Distance Weighted Surface Interpolation Define search parameters Define power of distance-decay function

Illustration: Tampa Bay sediment total organic carbon

True “geostatistical” models assume the data, Z(S) = {Z(s1), Z(s2), … , Z(sn)}, are a partial realization of a random field. Note that the set of locations S are a subset of some 2-dimensional spatial domain D, that is a subset of the real plane.

General Protocol: Characterize properties of spatial autocorrelation through variogram modeling; Predict values for spatial locations where no data exist, through Kriging.

A semivariogram is defined as for distance h between the two locations, and is estimated as for nh pairs separated by distance hj (called a “lag”). After repeating for different lags, say j =1, … 10, the semivariance can be plotted as a function of distance.

Given any location si, all other locations are treated as within distance h if they fall within a search window defined by the direction, lag h, angular tolerance and bandwidth. bandwidth Adapted from Waller and Gotway. Applied Spatial Statistics for Public Health. Wiley, 2004.

Example semivariogram cloud for pairwise differences (red dots) , with the average semivariance for each lag (blue +), and a fitted semivariogram model (solid blue line)

Characteristics of a semivariogram Range = the distance within which positive spatial autocorrelation exists Nugget = spatial discontinuity + observation error Sill = maximum semivariance

If it does depend on direction, it is anisotropic. If the variogram form does not depend on direction, the spatial process is isotropic. If it does depend on direction, it is anisotropic. Multiple semi-variograms for different directions. Note changing parameter is the range. Surface map of semivariance shows values more similar in NW-SE direction and more different in SW-NE direction.

Kriging then uses semivariogram model results to define weights used for interpolating values where no data exists. The result is called the “Best Linear Unbiased Predictor”. The basic form is Where the λi assign weights to neighboring values according to semivariogram modeling that defines a distance-decay relation within the range, beyond which the weight goes to zero.

Several variations of Kriging: Simple (assumes known mean) Ordinary (assumes constant mean, though unknown) [our focus this week] Universal (non-stationary mean) Cokriging (prediction based on more than one inter-related spatial processes) Indicator (probability mapping based on binary variable) [you will see in the lab work] Block (areal prediction from point data) And other variations …

Example of two types of Kriging for California O3: Ordinary Kriging (Detrended, Anisotropic) -continuous surface Indicator Kriging - probability isolines

What if point locations are centroids of polygons and the value Z(si) represents aggregation within polygon i ?

With polygon data, we can still define neighbors as some function of Euclidean distance between polygon centroids, as we do for point-level data, but now we have other ways to define neighbors and their weights …

Defining spatial “Neighborhoods” Raster or Lattice: Rook Queen - 1st order Queen 2nd order

Spatial Regression Modeling as a method for both assessing the effects of covariates and… smoothing a response variable