Assessing Quality of Geocoded Data The Florida Registry Experience.

Slides:



Advertisements
Similar presentations
Graphic Data Clean-Up Issues. Graphic Data Clean-up Very Important if you plan to use existing CAD or GIS data from another agency, department, or private.
Advertisements

A Joint Project Between OSDH and DEQ Using GIS Elizabeth Kruger, OSDH Hub Baggett, DEQ OK-PHETS May 2005.
Ellen F. Heineman, Ph.D. National Cancer Institute Epidemiology & Genetics Research Program The GIS for the Long Island Breast Cancer Study Project.
Conflation with JCS Automated and Human-Assisted Conflation Using the JCS Conflation Suite Presented at GeoTec 2003 Vancouver, BC Martin Davis, Technical.
Geographic Access to Primary Care Physicians in New Mexico Prepared for: Geog. 491(Problems) and Geog. 499 (Python programming) Fall Semester, 2014 Larry.
Implementing “CLEAN Address” Address Validation Software at Oakland University.
The Geographic Support System Initiative (GSS-I) Partnership Program Donna Bullock U.S. Census Bureau November 2,
Modeling travel distance to health care using geographic information systems Anupam Goel, MD Wayne State University Detroit, MI (USA)
Use of GIS Systems in Official Statistics. Overview of Presentation  Overview of GIS systems  Current use within the ABS  Potential future applications.
GIS Tutorial 1 Lecture 7 Geocoding. Outline  Geocoding overview  Linear (street) geocoding  Problems and solutions  Street map sources  Polygon geocoding.
The Geographical and Temporal Distribution of Vital Statistics in Champaign County, from 2005 to 2009 Lan Luo Supervisor: Awais Vaid (C-UPHD) Dr.Ruiz (University.
Perinatal Periods of Risk Starting Over in Miami-Dade County Presented by:
Vector-Based GIS Data Processing Chapter 6. Vector Data Model Feature Classes points lines polygons Layers limited to one class of data Figure p. 186.
Population and Housing Estimates for Census Blocks: The San Diego Experience Jeff Tayman San Diego Association of Governments.
Geog 458: Map Sources and Errors Uncertainty January 23, 2006.
Lecture 16: Data input 1: Digitizing and Geocoding By Austin Troy University of Vermont Using GIS-- Introduction to GIS.
Fundamentals of GIS Materials by Austin Troy © 2008 Lecture 18: Data Input: Geocoding and Digitizing By Austin Troy University of Vermont NR 143.
Asthma Distribution patterns and their relationship with the urban landscape and social conditions in Newark NJ Authors: Francisco Artigas, Leonard Beilory,
Getting the Map into the Computer Getting Started with Geographic Information Systems Chapter 4.
Geocoding: - Table to geocode may be an ASCII, spreadsheet, dBase, or MapInfo table - Referred to as the “target” table - The target table is the attribute.
Geocoding - Advanced Techniques
GIS Internet Map Servers for Health Applications Carol L. Hanchette, Ph.D. Rebecca D. Martin, Ph.D. Research Triangle Institute Research Triangle Park,
INCORPORATING E911 ADDRESSING INTO COUNTY GIS Des Moines County GIS and E911 Matt Warner GIS Coordinator Des Moines County GIS Commission.
Welcome to Geocoding using ArcGIS Presented by: The Nevada Division of Public and Behavioral Health, Office of Public Health Informatics and Epidemiology.
Using ArcGIS/SaTScan to detect higher than expected breast cancer incidence Jim Files, BS Appathurai Balamurugan, MD, MPH.
COMMUTE Atlanta A Comparison of Geocoding Methodologies for Transportation Planning Applications Jennifer Indech Nelson Dr. Randall Guensler Dr. Hainan.
Overview of Administrative Records on Population and Housing
Creating a Statewide Geocoding Service for West Virginia 2008 West Virginia GIS Conference.
Iterative Batch Geocoding With Atlas GIS Version 4.0.
Fundamentals of GIS Materials by Austin Troy © 2008 Lecture 18: Data Input: Geocoding and Digitizing By Austin Troy University of Vermont.
Indianapolis/Marion County GIS & DigSmart Ryan R. Johnson March 8, 2006.
GIS Modeling for Primary Stroke Center Development Anna Kate Sokol, M.U.P. Sr. GIS Specialist City of South Bend, IN.
Introduction to ArcGIS for Environmental Scientists Module 3 – GIS Analysis Address Geocoding.
National Institute for Public Health and the Environment The Dutch RIF Experiences and possibilities.
Wende Mix, Buffalo State College GIS Research Needs Strategic Planning.
Type of Historical Data Collected Contextual Level (Place of Residence)
Improvements in Ohio’s Vital Statistics residence data with geo-coding software Presented at NAPHSIS/VSCP meeting Cincinnati, Ohio June 5, 2005 John O.
ArcGIS Data Reviewer: An Introduction
GIS Tutorial 1 Lecture 9 Spatial Analysis.
GIS 1 Geocoding. GIS 2 Geocoding Address Geocoding - Process of creating geometric representations for locations (such as point features) from descriptions.
Validation and Outreach of Oklahoma State Broadband Initiative Presenters: Scott Cox, Ying Zhang OU Center for Spatial Analysis (OUCSA) CoIT Meeting, May.
Harry Williams, Cartography1 INTRODUCTION TO GIS A Geographic Information System is a combination of software and hardware that can store, manipulate,
1 Data capture in production of geostatistics ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environment.
GIS in Business Presentation American Collegiate Retailing Association March 13, 2003 Colleen M. Schelde ESRI Inc.
Data Creation and Editing Based in part on notes by Prof. Joseph Ferreira and Michael Flaxman Lulu Xue | Nov. 3, :A Workshop on Geographical.
Geocoding The Foundation for Spatial Analysis. PB Location Intelligence Disciplines The capability to organize and understand complex relationships between.
Geography Matters ! GIS Day. Why is Geography Important to You?
Introduction to Geographic Information Systems Fall 2013 (INF 385T-28620) Dr. David Arctur Research Fellow, Adjunct Faculty University of Texas at Austin.
Lab 6: Geocoding You have received a dBase file that contains the address list of over 500 homes in your neighborhood that have had reports of lead poisoning.
Raster Data Models: Data Compression Why? –Save disk space by reducing information content –Methods Run-length codes Raster chain codes Block codes Quadtrees.
“It’s Not a Sprint – It’s a Marathon” GIS 101 For Local Government ICIT Midyear Conference June 17, 2010 Jeff Miller, Dubuque County Matt Boeck, Story.
NAACCR’S National Provider Identifier Resource Kevin Henry Frank Boscoe Chris Johnson NAACCR GIS Committee June 10, 2008.
NAACCR 2006 Conference NAACCR SHOWCASE Great Circle Distance Calculator Chris Johnson NAACCR GIS Committee Cancer Data Registry of Idaho.
Screening for increased cancer risk near toxic waste sites Recinda Sherman, MPH CTR Florida Cancer Data Systems NAACCR Annual Meeting, June 2008.
Kevin A Henry, Ph.D New Jersey Cancer Registry Cancer Epidemiology Services Frank Boscoe, Ph.D New York State Cancer Registry Estimating the accuracy of.
GEOGRAPHIC CLUSTERS OF HEAD & NECK CANCER IN FLORIDA Recinda Sherman, MPH, CTR Florida Cancer Data Systems NAACCR Detroit, June 7, 2007.
GIS Database. Why - Geography Gene x Environment Flights x Environment Environment: the surroundings of a physical system that may interact with the system.
RESIDENTIAL MOVEMENT BETWEEN TIME OF CANCER DIAGNOSIS & DEATH Recinda Sherman, MPH, CTR Florida Cancer Data System.
Geocoding Chapter 16 GISV431 &GEN405 Dr W Britz. Georeferencing, Transformations and Geocoding Georeferencing is the aligning of geographic data to a.
Geocoding Chapter 16 GISV431 &GEN405 Dr W Britz. Georeferencing, Transformations and Geocoding Georeferencing is the aligning of geographic data to a.
Using the National Change of Address Database to Locate Children Designated as Moved or Gone Elsewhere CHEAR Unit, Division of General Pediatrics, University.
State of GIS Activities Among NAACCR Member Registries
An Addressing Success Story
Addressing address quality in public health surveillance data
GIS Lecture: Geocoding
Accurate GIS is more important than EVER
Evaluation of Geocoding Quality in Montana
Quality assurance and assessment in the vital statistics system
Chris DuClos, MS, GISP Program Manager
A Review of the Address Coverage Enhancement Scheme for In-person Household Surveys Michael Jones, Westat Sylvia Dohrmann, Graham Kalton, Jean Opsomer,
Presentation transcript:

Assessing Quality of Geocoded Data The Florida Registry Experience

Overview What is geocoding quality? Florida’s geocoding experience – Identifying geocoding errors – Results before and after improved geocoding – Monitoring for geocoding problems 2

What is Geocoding? Spatially enable Assign geocode – Latitude/Longitude – FIPS—Census Units Match address to street file – Batch (automated) – Interactive (manual 5-10%) 3

Geocoding Quality Components Match rate – Coverage, % with spatial location Precision – Scale County center versus census block – NAACCR Items #366,#364,#365 GIS Coordinate Quality, Census Tract Certainty Accuracy – Correct location 4

Geocoding Match Software – Deterministic, Probabilistic – Parsing algorithm, Assumptions (ties) – “Black box” Underlying street files Quality of address data Batch versus manual NE 2nd, Miami, FL Did you mean: 133 NE 2nd St, Miami, 133 SE 2nd Ave, Miami, 133 NW 2nd Ave, Miami, 133 SW 2nd St, Miami, 133 SW 2nd Ave, Miami, 133 SE 2nd St, Miami,

Geocoding Precision Parcel match – “gold standard” – Match to building footprint Street level match – Most common – Interpolate along street segment Centroid – Center of polygon Block, tract, zipcode, county – Population center, physical 6

Geocoding Accuracy 7

FCDS Geocoding Proprietary, local vendor Problems found via use – Reported county does not match geocoded county – Representativeness of cases – Cases assigned to invalid or zero population block groups Problems found via scrutiny – Cases in nautical areas (not islands) – Vendor assumptions 8

Geocoding Project Test file – Created “gold standard” files – FIPS (cancer cases) – Long/Lat (well locations) Selected a vendor – Based on logistics rather than quality New vendor re-geocoded entire registry Compared Results – Before and After 9

Old versus Improved Vendor: County Match Problem 10

Old versus Improved Vendor: County Match Problem 11

Old versus Improved Vendor: % Matched 12

Old versus Improved Vendor: Representativeness of Cases Environmental Health – Re-geocoded our data Census Data – 96% Black Old Geocoding Vendor – 15% Black Cases New Geocoding Vendor – 85% Black Cases 13

Old versus Improved Vendor: Nautical, Invalid, Zero pop Cases assigned to the sea – 0 cases from new vendor Cases assigned to invalid bg – 0 cases from new vendor Cases assigned to 0, 1, 10 population bgs – 5,765 cases – 743 cases (3+ more years of data) – SF1 vs. SF3; Overlay 14

Specificity ? Old Data:Improved Data: 15

Sensitivity ? Old Data:Improved Data: 16

Validity ? Old Data : Oral Cancer by SES Wealthy – 34.0 ref Mid High – 36.6RR 1.08 Mid Low – 39.1RR 1.15 Poorest – 46.3RR 1.36 New Data : Oral Cancer by SES Wealthy – 37.3 ref Mid High – 40.1RR 1.08 Mid Low – 45.4RR 1.22 Poorest – 49.2 RR

Monitoring Geocoding Quality % County match – Florida zipcodes; military addresses geocoded to NJ % Contiguous counties Incorrect FIPS Nautical FIPS # Zero Pops Representativeness 18

Impact Fewer, smaller, lower risk clusters Greater % ungeocodable – More accurate – Less specific Ungeocodable – Rural, Poor, Old – Potential bias – Manual geocoding 19

Addressing Ungeocodables Address quality? – Implemented edits Software development – Improve matching algorithm Specific to our data Link with administrative databases DMV, Medicaid, Medicare Geo-imputation – Kevin Henry Requires institutional priority ! 20

21

Acknowledgements Dr. Greg Kearny – Environmental Health, FL DOH N. Dean Powell – FCDS Jackie Button – FCDS Dr. Monique Hernandez – FCDS We acknowledge the CDC for financial support under cooperative agreement U58/DP Contents are responsibility of authors and do not represent views of CDC, FL DOH, or FCDS 22