Geographic Access to Family Practice Physicians in New Mexico: Exploratory Spatial Analysis Prepared for: Geography 580L (Spatial Statistics) Spring Semester,

Slides:



Advertisements
Similar presentations
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Notes on Residuals Simple Linear Regression Models.
Advertisements

Homoscedasticity equal error variance. One of the assumption of OLS regression is that error terms have a constant variance across all value so f independent.
Geographic Access to Primary Care Physicians in New Mexico Prepared for: Geog. 491(Problems) and Geog. 499 (Python programming) Fall Semester, 2014 Larry.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
SPATIAL DATA ANALYSIS Tony E. Smith University of Pennsylvania Point Pattern Analysis Spatial Regression Analysis Continuous Pattern Analysis.
Spatial Autocorrelation Basics NR 245 Austin Troy University of Vermont.
Correlation and Autocorrelation
Multivariate Data Analysis Chapter 4 – Multiple Regression.
SA basics Lack of independence for nearby obs
Why Geography is important.
Advanced GIS Using ESRI ArcGIS 9.3 Arc ToolBox 5 (Spatial Statistics)
Business Statistics - QBM117 Statistical inference for regression.
Geographic Access Gravity Model 1.Statement of Problem (Measurement) 2.The Theory and Method (Potential Accessibility) 3.Applications (Preliminary Maps)
Geographic Access Gravity Model 1.Statement of Problem (Measurement) 2.The Theory and Method (Potential Accessibility) 3.Applications (Examples -Preliminary.
Title: Spatial Data Mining in Geo-Business. Overview  Twisting the Perspective of Map Surfaces — describes the character of spatial distributions through.
Modeling US County Premature Mortality James L. Wilson Department of Geography, Northern Illinois University & Christopher J. Mansfield Center for Health.
Esri International User Conference | San Diego, CA Technical Workshops | Spatial Statistics: Best Practices Lauren Rosenshein, MS Lauren M. Scott, PhD.
© 2004 Prentice-Hall, Inc.Chap 15-1 Basic Business Statistics (9 th Edition) Chapter 15 Multiple Regression Model Building.
Food Store Location Analysis Albuquerque New Mexico, 2010 Prepared for: Geography 586L - Spring Semester, 2014 Larry Spear M.A., GISP Sr. Research Scientist.
Food Store Location Analysis Albuquerque New Mexico, 2010 Prepared for: Geography 586L - Spring Semester, 2014 Larry Spear M.A., GISP Sr. Research Scientist.
VI. Evaluate Model Fit Basic questions that modelers must address are: How well does the model fit the data? Do changes to a model, such as reparameterization,
Spatially Assessing Model Error Using Geographically Weighted Regression Shawn Laffan Geography Dept ANU.
Model Selection and Validation. Model-Building Process 1. Data collection and preparation 2. Reduction of explanatory or predictor variables (for exploratory.
Food Store Location Analysis Albuquerque New Mexico, 2010 Prepared for: Geography 591(Problems) and 580L(Quant. Methods) Fall Semester 2015 – Presented.
Linear Regression Linear Regression. Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Purpose Understand Linear Regression. Use R functions.
Chapter 15 Inference for Regression. How is this similar to what we have done in the past few chapters?  We have been using statistics to estimate parameters.
Yandell – Econ 216 Chap 15-1 Chapter 15 Multiple Regression Model Building.
Stats Methods at IC Lecture 3: Regression.
Migration Modelling using Global Population Projections
Correlation and Linear Regression
Determining How Costs Behave
F-tests continued.
Computer aided teaching of statistics: advantages and disadvantages
Statistical Data Analysis - Lecture /04/03
Linear Regression.
Travelling to School.
Chow test.
AMPO Annual Conference October 22, 2014
Spatial Analysis & Dissemination of Census Data
Chapter 11: Simple Linear Regression
A Statistical and GIS Approach to Analyzing a Museum’s Customer Base
Statistics in MSmcDESPOT
Multivariate Regression
Newcomer Service Access: Census and Postal Code Mapping
Elementary Statistics
Geographic Access to Primary Care Physicians in New Mexico: ArcPy to Python 3 (GeoPandas) Prepared for: Geography 528 (Advanced Programming for GIS) Spring.
Linear Regression Models
Diagnostics and Transformation for SLR
Food Store Location Analysis Albuquerque New Mexico, 2010 Prepared for: Geography 591(Problems) and 580L(Quant. Methods) Fall Semester 2015 Larry Spear.
The Use of Enrollment Forecasts in School Redistricting May 17, 2018 McKibben Demographic Research Jerome McKibben, Ph.D. Rock Hill, SC
Tabulations and Statistics
EQ: How well does the line fit the data?
Why are Spatial Data Special?
Ch11 Curve Fitting II.
Spatial Data Analysis: Intro to Spatial Statistical Concepts
Adequacy of Linear Regression Models
Statistics II: An Overview of Statistics
Product moment correlation
Regression III.
Adequacy of Linear Regression Models
Inferential Statistics
BEC 30325: MANAGERIAL ECONOMICS
Adequacy of Linear Regression Models
Adequacy of Linear Regression Models
Concepts and Applications of Kriging
Diagnostics and Transformation for SLR
Interpreting Data for use in Charts and Graphs.
BEC 30325: MANAGERIAL ECONOMICS
STA 291 Spring 2008 Lecture 23 Dustin Lueker.
Presentation transcript:

Geographic Access to Family Practice Physicians in New Mexico: Exploratory Spatial Analysis Prepared for: Geography 580L (Spatial Statistics) Spring Semester, 2017 Larry Spear M.A., GISP Sr. Research Scientist (Ret.) Division of Government Research University of New Mexico http://www.unm.edu/~lspear Version 2.3 – 5/3/17

Background Project originally prepared by the Division of Government Research at the University of New Mexico (DGR, UNM) for the New Mexico Health Policy Commission (NM HPC). Most of GIS work done under contract as part of a larger body of data management work and special reports including Hospital Inpatient Discharge Data (HIDD) from 1998 to 2002. No formal academic publication (NM HPC Quick Facts). Used both SAS (Statistical Analysis System) and ArcGIS in combination (dBase file transfer). Note: SAS had data set (data frame) since 1970s.

Background SAS used to program a Gravity Model to measure potential accessibility of patients at a given location to all physicians at other locations. (Spatial Interaction) ArcMap used to produce map display of results. Current project goals: to develop a R Script Tool to program the Gravity Model and display the results on a map entirely within R without SAS. (QGIS and ArcGIS Pro maps if necessary, Next R function development) Also use GWR (Geographically Weighted Regression) to evaluate the relationship of gravity model results (dependent var.) to socio-economic data (independent vars.) Note: Previous developments - ArcGIS Model Builder and Python (ArcPy), also QGIS Python (PyQGIS)

2002 ESRI SWUG (Taos, NM) – Winning Poster

DGR used well developed commercial software packages SAS for data management and data analysis ArcGIS for GIS spatial data display and analysis University educational versions (affordable at UNM)

The Problem - Measurement How to measure geographic access (spatial interaction) of population to healthcare providers and facilities? Solution – Develop a reliable method to measure (and compare) the distribution of facilities and providers and the population. Note: Does not solve physician shortages, provides a better way to monitor change. Reliable measurement requires a geographic framework in which to collect and organize observations. This project used Census Tract (centroids) for calculations and boundaries (polygons) for data display. Note: Family practice physician data (2013) from infogroup (Esri business partner, purchased by NM DOIT) and population/socio-economic data (2013) from Census ACS.

The Problem(s) Reliable measurement requires a common scale that allows for comparison of values. Reliable measurement requires a method to handle arbitrary boundaries imposed by data collection units (geographic framework). Traditionally used Federal and State service capacity standard is a ratio of persons per physician by county geography. (regional availability approach) Note: Big counties and patients cross boundaries.

A Common Measurement Scale Service Capacity Standards (traditional measure - Fed. and State guidelines). Ratio of provider or facilities per population. Can be expressed as either: One M.D. per 1,500 persons (Prov./ Pop.) 1,500 persons per M.D. (Pop. / Prov.) **What we used, and NM service capacity goal ( NM GADS workgroup?) - Geographic Access Data System NM Legislature SJM 36 (1996) also 45minute maximum travel time (about 35 miles distance).

Potential Accessibility DGR’s Gravity Model PAj = Potential accessibility for Census Tract Popi = Population of a Census Tract Provi = Number of Providers / Facilities in Census Tract

Analysis Steps using R (ArcGIS Pro & QGIS) Read in data from ArcGIS Pro (R-ArcGIS Bridge) Create SPDF’s (Spatial Point/Poly Data Frames) Examine data and make test plots (R maps) Subset data (tract centroids) for gravity model Run the gravity model and plot (R & Arc Map) results Merge gravity model results (point & poly) with Census ACS data ArcGIS Pro Exploratory Regression Perform R OLS ( diagnostics and Moran’s I) Perform R GWR (diagnostic maps with QGIS & ArcGIS Pro) Note: R - merge and cbind back to tract polygons

DGR Gravity Model (R Version) Note: Nested for loops, no attach for spdf

Note: Hard to control class breaks in R

Gravity Model Results (ArcGIS Pro)

Note: Geographic (socio-economic) data are rarely normally distributed even after transform

Transform Gravity Model Results (gmratio to ln_gmratio)

ArcGIS Pro – Exploratory Regression lnGMRatio = -House_Avg +PerCapita_Inc +White +HispLat

R – OLS Results

R – OLS Diagnostics

R – OLS Diagnostics

R – Residual Spatial Autocorrelation

R – GWR Results

R – Residual Spatial Autocorrelation

QGIS Makes Better Maps than R

GWR – Residuals (Stud_resid) - Blue and Green over-predict, enough physicians under used by population? - Yellow good predictions, adequate number of physicians? - Red Orange under-predicted, not enough physicians? * Lower Moran’s I Less Clustered ->

GWR – Local R2 (Local_R2) -Better fit high population areas. -NW Reservations number of physicians not available. -SE people go to Texas for care.

Special Note: - The interpretation of GWR coefficients is based on some understanding of the underlying processes in the area of study. These interpretations can be difficult and subjective. However, they can provide new insight for further study.

Switch to ArcGIS Pro Maps Note: Beta Coefficients very small, problem with QGIS Maps * Could redo analysis without transformed GMRatio to LN_GMRatio (easier to interpret, maybe next time) Additional Note: Linear regression does not require normally distributed dependent or Independent variables. However, transformations are preferred for more accurate results. The only normality assumption is for resulting residuals (errors).

GWR Coefficient Influence - HOUSEHOLD_AVG – Why Different? Negative coefficients - an inverse relationship, as HOUSEHOLD_AVG increases LN_GMRATIO decreases. Trending to less population per physician. (SF & LA) Positive coefficients – a direct relationship, as HOUSEHOLD_AVG increases LN_GMRATIO increases . Trending to more population per physician. (ABQ & LC)

GWR Coefficient Influence – PER_CAPITA_INC Negative coefficients - an inverse relationship, as PER_CAPITA_INC increases LN_GMRATIO decreases. Trending to less population per physician. (Rural pop. loss, SF & LA gain MDs) Positive coefficients – a direct relationship, as PER_CAPITA_INC increases LN_GMRATIO increases . Trending to more population per physician. (SE – go to Texas for care)

GWR Coefficient Influence – WHITE Negative coefficients - an inverse relationship, as WHITE increases LN_GMRATIO decreases. Trending to less population per physician. (ABQ & LC attract MDs) Positive coefficients – a direct relationship, as WHITE increases LN_GMRATIO increases . Trending to more population per physician. (Rural & SE)

GWR Coefficient Influence – HISPLAT Negative coefficients - an inverse relationship, as HISPLAT increases LN_GMRATIO decreases. Trending to less population per physician. (N. & W. out-migration, aging) Positive coefficients – a direct relationship, as HISPLAT increases LN_GMRATIO increases . Trending to more population per physician. (ABQ, SF, not LC)

Conclusions and Comments The quality of the infogroup physician data may have strongly affected the results – not an accurate count of physicians. Census ACS are also just survey estimates! The DGR gravity model produced reasonably realistic results – drive time distance an improvement. (Next – ArcGIS PRO Network O-D Matrix & Python - PANDAS) GWR produced interesting, and sometimes difficult to explain localized spatial patterns. Subjective Interpretation. More study to compare DGR gravity model to recently developed more elaborate methods (2SFCA, E2SFCA). More study to reevaluate selection of socio-economic factors to use with GWR (SES, Esri Tapestry) – better understand local influences for physician distribution.