Utah Department of Health 1 1 Identifying Peer Areas for Community Health Collaboration and Data Smoothing Brian Paoli Utah Department of Health 6/6/2007.

Slides:



Advertisements
Similar presentations
ANALYSIS OF VARIANCE (ONE WAY)
Advertisements

Discriminant Analysis Database Marketing Instructor:Nanda Kumar.
1 A Common Measure of Identity and Value Disclosure Risk Krish Muralidhar University of Kentucky Rathin Sarathy Oklahoma State University.
What is a sample? Epidemiology matters: a new introduction to methodological foundations Chapter 4.
11 ACS Public Use Microdata Samples of 2005 and 2006 – How to Use the Replicate Weights B. Dale Garrett and Michael Starsinic U.S. Census Bureau AAPOR.
QUANTITATIVE DATA ANALYSIS
Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson2-1 Lesson 2: Descriptive Statistics.
Chap 3-1 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 3 Describing Data: Numerical.
DESCRIBING DATA: 2. Numerical summaries of data using measures of central tendency and dispersion.
Measures of Income, Poverty and Health Insurance Wesley Basel, U.S. Census Bureau Presented at the Walter Cronkite School of Journalism June 17, :00.
Regression and Correlation
Clustered or Multilevel Data
1 Pertemuan 13 Uji Koefisien Korelasi dan Regresi Matakuliah: A0392 – Statistik Ekonomi Tahun: 2006.
Measures of Variability or Dispersion
The process of [social research theory/model/framework conceptual relationships hypotheses working hypotheses and measurement research design data collection.
1 Basic statistics Week 10 Lecture 1. Thursday, May 20, 2004 ISYS3015 Analytic methods for IS professionals School of IT, University of Sydney 2 Meanings.
Business and Economics 7th Edition
Personality, 9e Jerry M. Burger
Stat 112: Lecture 9 Notes Homework 3: Due next Thursday
Your Community by the Numbers Accessing the most current and relevant Census data Alexandra Barker Data Dissemination Specialist U.S Census Bureau New.
Socio-Economic & Demographic Data Tools for Proactive Planning Robin Blakely-Armitage STATE OF NEW YORK CITIES: Creative Responses to Fiscal Stress March.
Regression and Correlation Methods Judy Zhong Ph.D.
COMMUNITY PROFILE: TULSA 2014 Prepared by the Community Service Council, with support from the Metropolitan Human Services Commission November 2014.
Measurement Tools for Science Observation Hypothesis generation Hypothesis testing.
MGQ 201 WEEK 4 VICTORIA LOJACONO. Help Me Solve This Tool.
Tabulate, chart, map, download: Pre-tabulated health indicators.
Adaptive Kernel Density in Demographic Analysis Richard Lycan Institute on Aging Portland State University.
Sampling: Theory and Methods
1/26/09 1 Community Health Assessment in Small Populations: Tools for Working With “Small Numbers” Region 2 Quarterly Meeting January 26, 2009.
Essentials of Marketing Research
Introduction to the Public Use Microdata Sample (PUMS) File from the American Community Survey Updated February 2013.
Slide 1 Estimating Performance Below the National Level Applying Simulation Methods to TIMSS Fourth Annual IES Research Conference Dan Sherman, Ph.D. American.
A Demographic Evaluation of the Stability of American Community Survey Estimates for Selected Test Sites: 2000 to 2011 J. Gregory Robinson and Eric B.
American Community Survey Overview September 4, 2013 Tim Gilbert American Community Survey Office.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Sampling Class 7. Goals of Sampling Representation of a population Representation of a population Representation of a specific phenomenon or behavior.
1 Things That May Affect Estimates from the American Community Survey.
Urbanisation and spatial inequalities in health in Brazil and India
When trying to explain some of the patterns you have observed in your species and community data, it sometimes helps to have a look at relationships between.
Multilevel Data in Outcomes Research Types of multilevel data common in outcomes research Random versus fixed effects Statistical Model Choices “Shrinkage.
Studying Injuries Using the National Hospital Discharge Survey Marni Hall, Ph.D. Hospital Care Statistics Branch, Division of Health Care Statistics.
Small Area Health Insurance Estimates (SAHIE) Program Joanna Turner, Robin Fisher, David Waddington, and Rick Denby U.S. Census Bureau October 6, 2004.
2008 Wisconsin County Health Rankings Online Webinar Available November 14, 2008 Kyla Taylor.
Linear correlation and linear regression + summary of tests
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
CHAPTER 12 Descriptive, Program Evaluation, and Advanced Methods.
Things that May Affect the Estimates from the American Community Survey Updated February 2013.
Educ 200C Wed. Oct 3, Variation What is it? What does it look like in a data set?
September 18-19, 2006 – Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development Conducting and interpreting multivariate analyses.
Relative Values. Statistical Terms n Mean:  the average of the data  sensitive to outlying data n Median:  the middle of the data  not sensitive to.
NCHS July 11, A Semiparametric Approach to Forecasting US Mortality Age Patterns Presenter: Rong Wei 1 Coauthors: Guanhua Lu 2, Benjamin Kedem 2.
Introduction to Basic Statistical Tools for Research OCED 5443 Interpreting Research in OCED Dr. Ausburn OCED 5443 Interpreting Research in OCED Dr. Ausburn.
1 Module One: Measurements and Uncertainties No measurement can perfectly determine the value of the quantity being measured. The uncertainty of a measurement.
Quality Control: Analysis Of Data Pawan Angra MS Division of Laboratory Systems Public Health Practice Program Office Centers for Disease Control and.
1 Part09: Applications of Multi- level Models to Spatial Epidemiology Francesca Dominici & Scott L Zeger.
1 Module IV: Applications of Multi-level Models to Spatial Epidemiology Francesca Dominici & Scott L Zeger.
Life expectancy Stuart Harris Public Health Intelligence Analyst Course – Day 3.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Central Bank of Egypt Basic statistics. Central Bank of Egypt 2 Index I.Measures of Central Tendency II.Measures of variability of distribution III.Covariance.
Residential Segregation: A Key Connector Between Race and Environmental Health Disparities Jennifer Davis, Sacoby Wilson, Muhammad Salaam, Rahnuma Hassan.
Descriptive Statistics ( )
Analysis for Designs with Assignment of Both Clusters and Individuals
Statistics: The Z score and the normal distribution
Summary descriptive statistics: means and standard deviations:
BA 275 Quantitative Business Methods
Summary descriptive statistics: means and standard deviations:
Chapter 3 Statistical Concepts.
Business and Economics 7th Edition
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

Utah Department of Health 1 1 Identifying Peer Areas for Community Health Collaboration and Data Smoothing Brian Paoli Utah Department of Health 6/6/2007

Utah Department of Health 2 Acknowledgments - Dr. Lois Haggard, UDOH - Dr. David Mason, Univ. of Utah - Mohammed Chaara, Univ. of Utah - Michael Friedrichs, UDOH - Kathryn Marti, UDOH

Utah Department of Health 3 Outline - Background: –Why Peer Areas? –Data Smoothing –Previous Peer Area Attempts - Methodology and Procedures –Utah’s 61 Small Areas –Demographic Similarity –Producing Smoothed Estimates

Utah Department of Health 4 Why Peer Areas? - Community Collaboration –Identify areas that are similar for purposes of comparison –Collaborate on strategies, interventions - Data Smoothing –“Borrow strength” from geographic areas that are similar. –Especially useful when multi-year trend data are not available

Utah Department of Health 5 Data Smoothing - Why smooth? –We calculate measures, such as rates of death and disease, to assess the underlying disease risk in a population. –Measures from small populations are inherently erratic – subject to sampling variation. –Rare events such as infant mortality can vary widely from year to year.

Utah Department of Health 6

7 Data Smoothing - Most methods smooth data over time, requiring data from multiple years –Pool multiple years (e.g., 3- or 5-year averages) –Moving average –Combine areas to increase the number of cases

Utah Department of Health 8 Goal - Produce reliable (smooth, not erratic) and timely estimates. - Make appropriate inferences about the underlying disease risk in each community - Method must be simple to apply – and easy to implement

Utah Department of Health 9 29 UT Counties

Utah Department of Health 10

Utah Department of Health 11 Utah Small Areas - 61 small areas were defined using both ZIP code and County –Each area is either a ZIP code area, –two or more contiguous ZIP codes, or –a combination of ZIP code and county information.

Utah Department of Health UT Small Areas

Utah Department of Health 13

Utah Department of Health 14 Previous Attempts - Geographic adjacency –rooks’s adjacency –queen’s adjacency –geographic centroid –population centroid

Utah Department of Health 15 Previous Attempts - Cluster analysis –odd-sized groups –odd-ball areas –groups mutually exclusive (okay, but not necessary)

Utah Department of Health 16 Develop Methodology to: - Identify Peer Areas –Create “Demographic Distance” matrix - Smooth Data –Median? Pooled? Weighted? - Measure Our Success –Reliability of results –Appropriateness of making inference to index area from smoothed rates

Utah Department of Health Identify Peer Areas - Demographic Characteristics –Use available demographic information from the U.S. Bureau of the Census –Use demographic variables that are associated with population health –Select a small number of these demographic variables –Produce a methodology others can replicate

Utah Department of Health Identify Peer Areas - Factor analysis to reduce dimensionality. –Foreign born/Hispanic –Education –Income/Poverty –Employment –Age –Urban and Rural

Utah Department of Health Identify Peer Areas - Selected 5 variables based on factor analysis and correlation with health outcomes (e.g., infant mortality, heart disease, etc.) –% Hispanic –% age 25+ with Bachelor’s degree –% children in poverty –% owner-occupied housing –% age 65+

Utah Department of Health Identify Peer Areas - Create distance matrix The distance d(x,y) between two areas with n dimensional observations x=[x 1,x 2,…,x n ]’ and y=[y 1,y 2,…,y n ]’ is: d(x,y)= ([x-y]’S -1 [x-y]) 1/2 The matrix S contains the variances and covariances of the n variables.

Utah Department of Health 21 Identify Peer Areas - Demographic distance

Utah Department of Health Identify Peer Areas - Which are the Peer Areas for purposes of collaboration? –3 (or some #) areas with smallest distances? –All areas within a certain distance?

Utah Department of Health Identify Peer Areas - Which are the Peer Areas for purposes of data smoothing? –Same areas as for collaboration? - Need to think about the smoothing algorithm.

Utah Department of Health Smooth the Data - Options –Weighted Median rate using a group of five areas –Pool a selected number of areas together and treat them as a single area (crude rate for the combined areas) –Pool all areas together and weight them by a function of their distances to the index area (closer areas -> more weight)

Utah Department of Health 25 Close Neighbors Distant Neighbors

Utah Department of Health 26 Areas that are close contribute more to the smoothed rate Areas that are distant contribute little to the smoothed rate

Utah Department of Health 27

Utah Department of Health 28

Utah Department of Health 29

Utah Department of Health 30

Utah Department of Health Measure Our Success - Reliability –Did the data get smoother? –Intraclass Correlation Coefficient (ICC) Ratio of the amount of variance between areas to the sum of the variance within and between areas (MS between – MS within )/( MS between +(k-1)Ms within ) Range from 0 to 1 1 = perfectly smooth and level, only variance in the data is from one area to the next

Utah Department of Health Measure Our Success - Appropriateness of Inference –Is it appropriate to infer that the smoothed rate represents the true underlying disease risk in the community? (Overall, are the smoothed scores in the ballpark?) –Sum of Squared Differences (SS) from smoothed data to original data. Smoothed estimate should be close to the index area’s crude rate

Utah Department of Health Measure Our Success - ICC – Want high scores, close to 1 - SS – Want low scores, given high ICC - HOW DID THE SMOOTHED RATES PERFORM?

Utah Department of Health 34 Smoothed ICC=.901 Crude ICC=.835

Utah Department of Health 35 Smoothed ICC=.901 Crude ICC=.835

Utah Department of Health 36 Smoothed ICC=.901 Crude ICC=.835

Utah Department of Health 37 Smoothed ICC=.901 Crude ICC=.835

Utah Department of Health 38 Summary - A small number of demographic variables were identified –Capture the demographic variability –Related to health outcomes - Peer Areas were identified –Groupings seem intuitive

Utah Department of Health 39 Summary - Smoothing algorithm was identified –Had characteristics we liked Index area gets highest weight Peer areas get high weights Dissimilar areas weight=0

Utah Department of Health 40 Summary - Smoothed rates performed generally well –They were smooth (ICC ~ 1.0) –They represented the underlying risk in the index area (SS relatively small)

Utah Department of Health 41 Summary - Easy to replicate? –Excel spreadsheet You: –Enter your demographic variables –Enter health outcomes for the same areas –Change smoothing parameters (if desired) Excel: –Calculates distance matrix –Generates smoothed rates –Generates performance measures

Utah Department of Health 42 Challenges/Limitations - Demographic characteristics change, distance scores will need to be updated (decennial census years?) - How much smoothing to use is a subjective decision. - Smoothing may not seem credible to members of community - Peer Groups are not symmetric

Utah Department of Health 43 Excel Spreadsheet The spreadsheet is free and the files can be downloaded from the IBIS website. Go to Look for “Peer Area Analysis Tool” under the “News and System Enhancements” heading.

Utah Department of Health 44 - Contact Information: - Brian Paoli Office of Public Health Assessment Utah Department of Health 288 North 1460 West P.O. Box Salt Lake City, Utah