1/26/09 1 Community Health Assessment in Small Populations: Tools for Working With “Small Numbers” Region 2 Quarterly Meeting January 26, 2009.

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

SAMPLE DESIGN: HOW MANY WILL BE IN THE SAMPLE—DESCRIPTIVE STUDIES ?
Parameter Estimation Chapter 8 Homework: 1-7, 9, 10 Focus: when  is known (use z table)
Adjusted Rates Nancy D. Barker. Adjusted Rates Crude Rates Table 1.
Stat 301 – Day 28 Review. Last Time - Handout (a) Make sure you discuss shape, center, and spread, and cite graphical and numerical evidence, in context.
Clustered or Multilevel Data
CONFIDENCE INTERVALS What is the Purpose of a Confidence Interval?
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
BCOR 1020 Business Statistics
Mapping Rates and Proportions. Incidence rates Mortality rates Birth rates Prevalence Proportions Percentages.
How do cancer rates in your area compare to those in other areas?
1 Psych 5500/6500 Statistics and Parameters Fall, 2008.
BC Jung A Brief Introduction to Epidemiology - IV ( Overview of Vital Statistics & Demographic Methods) Betty C. Jung, RN, MPH, CHES.
Measures of Central Tendency
Review of normal distribution. Exercise Solution.
Medical Statistics (full English class) Ji-Qian Fang School of Public Health Sun Yat-Sen University.
Lecture 3: Measuring the Occurrence of Disease
Hypothesis Testing II The Two-Sample Case.
Ch 10 Comparing Two Proportions Target Goal: I can determine the significance of a two sample proportion. 10.1b h.w: pg 623: 15, 17, 21, 23.
Multiple Choice Questions for discussion
Sampling Distribution of the Mean Central Limit Theorem Given population with and the sampling distribution will have: A mean A variance Standard Error.
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
Chapter 11: Estimation Estimation Defined Confidence Levels
A Sampling Distribution
Statistics: Concepts and Controversies What Is a Confidence Interval?
Health and Disease in Populations 2001 Sources of variation (2) Jane Hutton (Paul Burton)
Chapter 7 Statistical Inference: Confidence Intervals
ESTIMATION. STATISTICAL INFERENCE It is the procedure where inference about a population is made on the basis of the results obtained from a sample drawn.
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Estimation PowerPoint Prepared by Alfred P. Rovai.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
Chapter 9 Hypothesis Testing II: two samples Test of significance for sample means (large samples) The difference between “statistical significance” and.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
Chapter 3: Measures of Morbidity and Mortality Used in Epidemiology
Data Analysis & Interpretation Intermediate Injury Prevention August 23-26, 2011 Billings, MT.
Statistical estimation, confidence intervals
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Measures of central tendency are statistics that express the most typical or average scores in a distribution These measures are: The Mode The Median.
Sampling Error.  When we take a sample, our results will not exactly equal the correct results for the whole population. That is, our results will be.
Standardizing Rates Nam Bains October 15 th, 2007 Statistics and Analysis in Public Health APHEO.
Crude Rates and Standardisation Standardisation: used widely when making comparisons of rates between population groups and over time (ie. Number of health.
Instructor Resource Chapter 9 Copyright © Scott B. Patten, Permission granted for classroom use with Epidemiology for Canadian Students: Principles,
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Estimation PowerPoint Prepared by Alfred P. Rovai.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 4th Lesson Estimating Population Values part 2.
PPA 501 – Analytical Methods in Administration Lecture 6a – Normal Curve, Z- Scores, and Estimation.
© Copyright McGraw-Hill 2000
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved. Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and.
Standardization of Rates. Rates of Disease Are the basic measure of disease occurrence because they most clearly express probability or risk of disease.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 8. Parameter Estimation Using Confidence Intervals.
Measures of Disease Frequency
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Ex St 801 Statistical Methods Inference about a Single Population Mean (CI)
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Margin of Error S-IC.4 Use data from a sample survey to estimate a population mean or proportion; develop a margin of error through the use of simulation.
Confidence Intervals and Hypothesis Testing Mark Dancox Public Health Intelligence Course – Day 3.
Estimating a Population Proportion ADM 2304 – Winter 2012 ©Tony Quon.
Standardisation Alexander Ives Public Health England, South West.
Confidence Intervals Dr. Amjad El-Shanti MD, PMH,Dr PH University of Palestine 2016.
Sampling and Sampling Distributions Chapter 8.4 – 8.7.
 Confidence Intervals  Around a proportion  Significance Tests  Not Every Difference Counts  Difference in Proportions  Difference in Means.
Chapter 10: The t Test For Two Independent Samples.
Comparing Two Proportions Chapter 21. In a two-sample problem, we want to compare two populations or the responses to two treatments based on two independent.
Statistical analysis.
ESTIMATION.
Statistical analysis.
Measures of Mortality Part 2
ESTIMATION.
Presentation transcript:

1/26/09 1 Community Health Assessment in Small Populations: Tools for Working With “Small Numbers” Region 2 Quarterly Meeting January 26, 2009

1/26/09 New Mexico Department of Health 2 Outline  Description of the Problem –Random variation –Survey samples versus complete count datasets –Observed events versus underlying risk  Statistical Tools –Confidence intervals –Combining data –SMR

1/26/09 3 Small Numbers: The Problem

1/26/09 New Mexico Department of Health 4 Random Variation  Exercise: –Select a sample –Calculate the median age  State of New Mexico Median age 1 –36.0  Why are they different? American Community Survey, U.S. Census Bureau. Downloaded on 1/21/09 from

1/26/09 New Mexico Department of Health 5 Random Variation and Sample Size  What if we had a sample of New Mexico residents that was: –Randomly selected –n=5,000  Would it better match the state Census Bureau estimate?

1/26/09 New Mexico Department of Health 6 Size Matters  The larger sample helps to “cancel out” the effects of random variation. –Some sample subjects are older than the median. –Some sample subjects are younger than the median. –As you increase the number of sample subjects, the differences cancel out, and you get closer to the median.

1/26/09 New Mexico Department of Health 7 Reliability and Validity  The term "accuracy" is often used in relation to validity, while the term, "precision" is used to describe reliability.

1/26/09 New Mexico Department of Health 8 Numerator vs. Denominator  A large sample size means we have a large denominator, but the numerator also matters.  Some methods use a “Poisson distribution,” which considers ONLY the numerator size when assessing precision.  If we have only 1 event in one year, and 2 the next year, the addition of a single event doubles the rate of occurrence.

1/26/09 New Mexico Department of Health 9 Random Variation and Complete Count Datasets  What are some “complete count” datasets?  How do we use them for community health assessment?

1/26/09 New Mexico Department of Health 10

1/26/09 New Mexico Department of Health 11

1/26/09 New Mexico Department of Health 12

1/26/09 New Mexico Department of Health 13

1/26/09 New Mexico Department of Health 14

1/26/09 New Mexico Department of Health 15

1/26/09 New Mexico Department of Health 16

1/26/09 New Mexico Department of Health 17 Summary of the Problem  Measurements are subject to sampling variability, also known as random error.  Even complete count datasets are subject to random error because we use them as a reflection of the underlying disease risk or rate.

1/26/09 New Mexico Department of Health 18 Summary of the Problem A larger sample (denominator, population size) helps to “cancel out” the effects of random variation. Size matters, in both the numerator and the denominator. A measure that is relatively free from the effects of random variation is called “precise,” “reliable,” and “stable.” Those terms are synonymous.

1/26/09 19 Small Numbers: Statistical Tools

1/26/09 New Mexico Department of Health 20 Tool #1. Confidence Intervals  Use confidence intervals to help you decide whether the rate is stable.  Won’t solve the problem, but will provide information to help you interpret the rates.  The stability of an observed rate is important when comparing areas or assessing whether disease risk has increased or decreased.

1/26/09 New Mexico Department of Health 21

1/26/09 New Mexico Department of Health 22

1/26/09 New Mexico Department of Health 23

1/26/09 New Mexico Department of Health 24

1/26/09 New Mexico Department of Health 25 Calculation of 95% C.I.  The 95% confidence interval is calculated as 1.96 x Standard Error of the estimate (s.e.).  s.e. is calculated as  So the 95% C.I. is 1.96

1/26/09 New Mexico Department of Health 26 The “Normal” Distribution

1/26/09 New Mexico Department of Health 27 Poisson Distribution

1/26/09 New Mexico Department of Health 28 Calculation of 95% C.I.  p stands for “probability.” It is the rate without the multiplier (e.g., 100,000 for deaths). q is the complement of the probability (1 minus P).  In Union County, there were 2 diabetes deaths among the 4,470 population, for a probability of (45 in 100,000)

1/26/09 New Mexico Department of Health 29 Calculation of 95% C.I.  Formula: 1.96  p= , q= , n=4,470  (pq)/n = / 4470 =  √(pq)/n = ………………… =  1.96√(pq)/n = 1.96 x =  Then we need to add the multiplier back in, so the confidence interval is: + 100,000* = + 62

1/26/09 New Mexico Department of Health 30 Calculation of 95% C.I.  The diabetes death rate was 44.7 per 100,000.  The confidence interval statistic is applied both above and below the rate.  C.I. LL (lower limit) is: = -17.3, and since we cannot have a negative rate, we’ll call it “0”  C.I. UL (upper limit) is: =  The diabetes death rate for Union County in 2006 was 44.7 per 100,000 (95% C.I., 0 to 106.7)

1/26/09 New Mexico Department of Health 31

1/26/09 New Mexico Department of Health 32

1/26/09 New Mexico Department of Health 33

1/26/09 New Mexico Department of Health 34 Confidence Interval Factoids  The confidence interval may be thought of as the range of probable true values for a statistic.  The confidence interval is an indication of the precision (stability, reliability) of the estimate.  A confidence interval is typically expressed as a symmetric value (e.g., "plus or minus 5%"). But for percentages, when the point estimate is close to 0% or 100%, a confidence interval with an asymmetric shape can be used.

1/26/09 New Mexico Department of Health 35 More Confidence Interval Factoids  The 95% confidence interval (calculated as 1.96 times the standard error of a statistic) indicates the range of values within which the statistic would fall 95% of the time if the researcher were to calculate the statistic from an infinite number of samples of the same size drawn from the same base population. Unless otherwise stated, a confidence interval will be the "95% confidence interval."

1/26/09 New Mexico Department of Health 36 More Confidence Interval Factoids  The 90% confidence interval, also commonly used, is calculated as 1.65 times the standard error of the estimate.  To calculate a confidence interval when the number of health events = 0, you may use 0 as the lower confidence limit, and for the upper confidence limit, assume a count of 3 health events in the same population.

1/26/09 New Mexico Department of Health 37 Tool #2. Combine Data  Combine years  Combine geographic areas (e.g., use the regional estimate rather than the county estimate)  Use a broader age group

1/26/09 New Mexico Department of Health 38

1/26/09 New Mexico Department of Health 39

1/26/09 New Mexico Department of Health 40

1/26/09 New Mexico Department of Health 41 Interpretation of Diabetes Deaths in Union County  Union County’s diabetes death rate ( ) was higher than the state, overall rate, but was not statistically significantly higher.  In other words, the Union County rate was “marginally higher” than the New Mexico state rate.  Was it higher than Santa Fe County?

1/26/09 New Mexico Department of Health 42 Differences Between Two Rates SStatistical significance of a change in a rate from time 1 to time 2 SStatistical significance of the difference between two rates in one time period (e.g., Union County versus Santa Fe County). **Test of Proportions**

1/26/09 New Mexico Department of Health 43 Test of Proportions  Proportion1: Union County diabetes death rate: 41.3/100,000 =  Proportion 2: Santa Fe County diabetes death rate: 20.4/100,000 =  Difference between the two proportions: =

1/26/09 New Mexico Department of Health 44 Test of Proportions (cont’d)  The difference between the two rates ( ) must be considered in the context of the standard error of the difference between two rates (pooled standard error), computed as:  If the difference between the two rates, , is greater than 1.96 x s.e. diff, then the difference is considered statistically significant. Bruning, J.L., and Kintz, B.L. (1977) Computational Handbook of Statistics. Scott, Foresman and Company: London.

1/26/09 New Mexico Department of Health 45 Calculation of s.e. diff  Union County: –p1= –q1= –n1=33,929  Santa Fe County: –p2= –q2= –n2=1,092,565  p=proportion, q=(1-p), n is the person-years at risk, or the sum of the population counts across all eight years.

1/26/09 New Mexico Department of Health 46 Calculation of s.e. diff

1/26/09 New Mexico Department of Health 47 Evaluation of the Difference  Union County: 41.3/100,000 =  Santa Fe County: 20.4/100,000 =  Difference: =  s.e. diff =  1.96 * s.e. diff =  Is greater than ?  No. Union County’s rate is greater than Santa Fe County’s rate, but the difference is NOT statistically significant.

1/26/09 New Mexico Department of Health 48 Tool #3. SMR and ISR  Standardized Mortality (or Morbidity) Ratio (SMR) –Estimates the number of deaths (or health events) one would EXPECT, based on … The age- and sex-specific rates in a standard population (e.g., New Mexico rate) The age and sex distribution of the index area.  Indirectly Standardized Rates –Use SMR to perform age adjustment when the number of cases is less than 20.

1/26/09 New Mexico Department of Health 49 Standardized Mortality Ratio  The all-cause death rate in New Mexico in 2006 was deaths per 100,000 population.  All other things being equal, we should expect the same death rate in Union County.

1/26/09 New Mexico Department of Health 50 Standardized Mortality Ratio  BUT all other things are NOT equal. –2006, % of population over age 65 was 18.9% in Union County, compared with 12.3% statewide.  In an older population, we would expect a higher death rate.

1/26/09 New Mexico Department of Health 51 Standardized Mortality Ratio  And Union County’s death rate is higher: deaths per 100,000.  IF we adjust the New Mexico death rate to account for Union County’s older population, THAT is how many deaths we should EXPECT.

1/26/09 New Mexico Department of Health 52 SMR (Observed/Expected) (Rate x Pop) / 100,000 Standardized Mortality Ratio for 2006 Union County, All-cause Mortality

1/26/09 New Mexico Department of Health 53 SMR, Union County  An SMR <1.0 indicates less-than-expected mortality.  An SMR >1.0 indicates greater-than- expected mortality (also known as “excess mortality).  Union County’s SMR was 1.28, so the county had excess mortality in  Was it significantly more than expected?

1/26/09 New Mexico Department of Health 54 Indirect Age-Standardization  You should not use direct age adjustment when there are fewer than 20 (some say 25) health events. If you multiply the New Mexico crude rate by the Union County SMR, you get the indirectly age-adjusted rate for Union County. –Union Co. crude all-cause death rate: –NM crude all-cause death rate: –Union County SMR: 1.28 –Union County indirectly age-standardized rate: (still higher than the state rate, but the effects of Union County’s age distribution have been removed).

1/26/09 New Mexico Department of Health 55 Confidence Interval for SMR  Observed deaths: 61 (# deaths from Vital Records data)  Expected deaths: 47.7 (# expected from SMR calculation)  SMR: 1.28 (observed / expected)  StdErr for SMR: 0.16 (SQRT(observed)) / expected  95% Confidence Interval: 0.32 (1.96 x StdErr)  Significance TestDoes the 95% confidence interval include 1.0? –If "yes" -> not significant –If "no" -> statistically significant

1/26/09 New Mexico Department of Health 56 Summary: Statistical Tools  Use confidence intervals assess the stability of a rate.  Use C.I. to see if your local rate is significantly different from the state rate.  A statistic called a “Test of Proportions” uses the “pooled standard error” to test whether two local rates are significantly different.

1/26/09 New Mexico Department of Health 57 Summary: Statistical Tools  Combine data to improve the stability of your rate. –Combine persons (e.g., broader age group) –Combine place (larger area) –Combine time (more years)

1/26/09 New Mexico Department of Health 58 Summary: Statistical Tools  Use the Standardized Mortality (Morbidity) Ratio (SMR) to compare a local rate to a standard population (e.g., state overall).  The SMR “expected” can be used for indirect age-adjustment when the number of health events is fewer than 20, or if the age-specific death rates are not known.

1/26/09 59 Thanks! Lois M. Haggard, PhD Community Health Assessment Program, NMDOH