Adding Value to Registries through Geospatial Big Data Fusion Geospatial Health Context Big Table Facilitating Geospatial Analysis in Health Research.

Slides:



Advertisements
Similar presentations
Diabetes Hotspots Mapping Using GIS tools to Target Quality Gaps CQI “Right Care” Initiative Nicole Lurie, M.D., MSPH & Allen Fremont, MD, PhD September.
Advertisements

Linking Dispatch, Paramedic, Hospital, and Regional Planning Data in Portland, Oregon: Christopher Bangs, MS Department of Emergency Medicine, Oregon.
The Virginia Health Chart Book Steve Sedlock Executive Director, VANGHR Presentation: Public Health Network of Virginia Tech November 12, 2012.
GIS Level 2 MIT GIS Services
NPS Introduction to GIS: Lecture 1
Dr. David Liu Objectives  Understand what a GIS is  Understand how a GIS functions  Spatial data representation  GIS application.
GIS Internet Map Servers for Health Applications Carol L. Hanchette, Ph.D. Rebecca D. Martin, Ph.D. Research Triangle Institute Research Triangle Park,
Introduction to the Use of Geographic Information Systems in Public Health Elio Spinello, MPH California State University, Northridge.
Preparing Data for Analysis and Analyzing Spatial Data/ Geoprocessing Class 11 GISG 110.
Exploratory Analysis of Disease Data & Introduction to UNC’s GIS Reference Library Prepared originally by Kristen Hampton Updated and maintained by Ben.
Exploratory Analysis of Disease Data & Introduction to UNC’s GIS Reference Library.
GIS FOR COMMUNITY DEVELOPMENT. WHAT DOES GIS STAND FOR? Hardware and Software Data Mapping Standards GIS Savvy Users GIS G eographic I nformation S ystems.
Health Datasets in Spatial Analyses: The General Overview Lukáš MAREK Department of Geoinformatics, Faculty.
Chapter 4 & 5: GIS Database & Vector Analysis. Chapter Four: GIS Database 2.
The Statistical Spatial Framework for Australia - enabling location analysis Gemma Van Halderen First Assistant Statistician Population, Education & Data.
Support the spread of “good practice” in generating, managing, analysing and communicating spatial information Introduction to GIS for the Purpose of Practising.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
U.S. Census Data & TIGER/Line Files
Introduction to Geographic Information Systems Fall 2013 (INF 385T-28620) Dr. David Arctur Research Fellow, Adjunct Faculty University of Texas at Austin.
Location Intelligence
TruVue LLC Visual Decision Support Tools TruVue provides location-based solutions to the healthcare industry for facility and physician network optimization.
Geospatial Mapping, Analysis, & Data (GeoMAD) Unit For more information, contact: Ariann Nassel – GeoMAD Director – –
The BOP (Billion Object Platform) and WorldMap / Dataverse Integration Harvard Center for Geographic Analysis Tuesday, July 12, 2016 Ben Lewis, Mercè Crosas,
Public Health February 2017
Geo-referenced data and DLI aggregate data sources
State of GIS Activities Among NAACCR Member Registries
Key Terms Attribute join Target table Join table Spatial join.
Research using Registries
Introduction to the VRAM
Introduction to Spatial Statistical Analysis
Geographic Information System [GIS]
GIS Institute Center for Geographic Analysis
Designing a Spatial/GIS Project
GIS Basic Training June 7, 2007 – ICIT Midyear Conference
INTRODUCTION TO GEOGRAPHICAL INFORMATION SYSTEM
GIS MAP OVERLAY ANALYSIS
The Center for Geographic Analysis Harvard University Our mission, services, process, staff, and sample projects Jeff Blossom, Center for Geographic.
Dept of Biostatistics, Emory University
pSCANNER’s Value: Beckstrom’s Law at Work
Flanders Marine Institute (VLIZ)
New Information - New Technology - New Directions
Illustrating HIV/AIDS in the United States
Geographic Information Systems
Federal Land Manager Environmental Database (FED)
CyberGIS: Reston, VA, September 22, 2018
Spatial Data Processing
Preliminaries: -- vector, raster, shapefiles, feature classes.
Illustrating HIV/AIDS in the United States
Drinking Water Mapping Application
Illustrating HIV/AIDS in the United States
Data Queries Raster & Vector Data Models
Illustrating HIV/AIDS in the United States
Illustrating HIV/AIDS in the United States
FAC Net kick-off - Boise, ID - April 10-11th, 2013
Illustrating HIV/AIDS in the United States
GIS Lecture: Selection
Italian National Institute of Statistics Major developments in 2010
Geographic & Resources Analysis in Primary Health Care
TerraPop Goals Lower barriers to conducting interdisciplinary human-environment interactions research by making data with different formats from different.
The Big Data ecosystem is supported by the NSF CNS
GIS Institute Center for Geographic Analysis
CYDL Project One Symposium
Geographic Information System [GIS]
Geographic Information System [GIS]
Illustrating HIV/AIDS in the United States
GIS Institute Center for Geographic Analysis
Scottish Cancer Registry and Intelligence Service (SCRIS)
NPS Introduction to GIS: Lecture 1 Based on NIMC and Other Sources.
ENVIRONMENT AND PUBLIC SAFETY COMMITTEE SESSION ON RESILIENCY
GIS, Data Democratization, and Public Health
Presentation transcript:

Adding Value to Registries through Geospatial Big Data Fusion Geospatial Health Context Big Table Facilitating Geospatial Analysis in Health Research Tim Haithcoat & Chi-Ren Shyu University of Missouri Informatics Institute June 13, 2019

THE GOAL Develop robust processes for health researchers and practitioners to more easily incorporate spatially integrated health, social, cultural, access, infrastructure, and environmental parameters/factors and spatial context in their research using scalable geospatially enabled databases, analytics, and visualizations.

Unique Infrastructure Typical Relational DB Typical Geospatial DB Talk about current state-of-the-art and shortcomings of raster and vector approaches Why designed this way. Column based – why not row based? Why did we pick this method and design;

Tessellation over Census blocks Block centroids = 343,565 points

Thiessen Proximal Polygons Tessellation with Census Centroids Thiessen Proximal Polygons

Extent of the Data Table Defined a point file with 318 million points for contiguous 48 states. How many columns (attributes)? Projection  10,000+ How many data sets? US Data.gov – Federal GIS > 1,000 What is the size of the table? 1.5 Gb/attribute Growth Projection  90 Tb Using Spark big data ecosystem Australian Cancer Atlas Determined Main Common Keys Census Geography Zip Code Watershed Etc. Created point summary counts for all geographies to use for analytics How deal with various resolutions and scale across database How & why set up regions and to speed query and retrieval How mapped? KD Tree: how make it work and make it efficient

Establishing Context Inter-layer Distance measures Coded 1st & 2nd Order Relationships

Registry Data Loading Registry Data Records

Leveraging Geospatial in Registries Geocoding of Registry Attach an X,Y coordinate to each record with associated confidence (strongest) Attach a primary key(s) (i.e. Census ID, Zip Code Tabulation Area) based on geocode of address to create ‘easy’ linkage to associated data when needed. Use geocoded location to determine association with a primary key to move attributes of interest directly to the registry record. Determine what information, and at what geographic summarization level, registry data gets shared

Using the Big Data Table Geospatial Health Context Big Table Data Required Socio-Economic Demographic Infrastructure Environmental Cultural Derived Physical Modeled LIFESTYLE 50% HEALTH CARE 25% BIOLOGY 15% ENVIRON 10% User Data Address Zip Code Tract County Inquiry Type Exploratory Simple Question Complex Question Complex Question w Temporal Aggregation Unit Zip Code Tract Block Group County Watershed School Dist Health Service Area

Choose an Issue Right-Sizing Care: Over the next decade, the aging American population is expected to place increased demands on the U.S. healthcare system. For older Americans, a review of medical records, found that 38% of doctor visits, including 27% of Emergency Room (E.R.) visits could have been replaced with telemedicine. Effort Required Census data tables (2 hrs) Census geography (1 hr) Hospital types (2 hrs) Road network zones (time and/or distance) (1 week) Broadband type (2 hrs) Query Elements Age > 60 years Gender Hospital Service Area Broadband Service The Data Needed Census age & gender Hospital locations Attributed road network Broadband attributes Census geography Select & retrieval MAUP can be addressed simply Key geographies

GeoHCBT: A case study of Leukemia

Example Complex Questions What factors in different demographic groups or locations discourage people from cancer treatment? How can we update our healthcare delivery strategy based on availability of medical services with relation to cancer risk based on population growth, ageing, and cancer type? Can we identify any new relationships between cancer occurrence and environmental, socio-cultural, infrastructural, or other data to explore or generate new hypotheses? What is the magnitude of population cancer disparities in an area, where are they located, and what factors might be creating these ‘hot spots’?

Relevance The Geospatial Health Context Big Table provides: Cancer Researchers an integrated big data repository to: Search - Enable stronger research designs (i.e. develop sampling / surveillance approached). Explore - Understand spatial interaction of a multitude of attributes. Ability to add contextual information based on neighborhood Decision Makers with a new tool to evaluate policy implications and focus on areas / populations affected. Public Health Professionals an ability to identify, mitigate, and potentially prevent health disparities in cancer incidence.

Acknowledgments Looking for research collaborations: Collaborators: Chi-Ren Shyu, PhD Richard D. Hammer, M.D. Tim Matisziw, PhD Iris Zachary, PhD Eileen Avery, PhD Kelly Bowers, D.O. Mirna Becevic, PhD This work is supported by the NIH BD2K T32 Training grant (5T32LM012410-02) The Big Data ecosystem is supported by the NSF CNS-1429294 Looking for research collaborations: Contact: HaithcoatT@missouri.edu