1 Case Study 1: How to Deal with Estimates with Low Reliability 2009 Population Association of America ACS Workshop April 29, 2009.

Slides:



Advertisements
Similar presentations
Multiple Indicator Cluster Surveys Survey Design Workshop
Advertisements

Statistical Significance and Population Controls Presented to the New Jersey SDC Annual Network Meeting June 6, 2007 Tony Tersine, U.S. Census Bureau.
Using American FactFinder John DeWitt Project Manager Social Science Data Analysis Network Lisa Neidert Data Services Population Studies Center.
1 Case Study 2: Choosing between ACS 1-, 3-, and 5-Year Estimates 2009 Population Association of America ACS Workshop April 29, 2009.
“It Don’t Come Easy”, Ringo Starr.  Period Estimates – not point in time, not easy for people to understand or explain  Different residence rules –
Chap 8-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 8 Estimation: Single Population Statistics for Business and Economics.
Technical Issues Associated with the American Community Survey Lisa Neidert NPC Poverty/American Community Survey Workshop June 22-26, 2009.
Point and Confidence Interval Estimation of a Population Proportion, p
Limitations of Analytical Methods l The function of the analyst is to obtain a result as near to the true value as possible by the correct application.
Timed. Transects Statistics indicate that overall species Richness varies only as a function of method and that there is no difference between sites.
Technical Issues Associated with the American Community Survey Lisa Neidert NPC Poverty/American Community Survey Workshop July , 2010.
11 American Community Survey Summary Data Products.
7-2 Estimating a Population Proportion
Technical Issues Associated with the American Community Survey Lisa Neidert NPC Poverty/American Community Survey Workshop June 23-27, 2008.
11 American Community Survey Data Products. 2 What do I need to know before using ACS data and data products?
American Community Survey Continuous Survey Methodology 250,000 Households sampled per month About 1 in 40 Households sampled per.
Case Study 3: Making Comparisons 2009 Population Association of America ACS Workshop April 29, 2009.
Measures of Regression and Prediction Intervals
1 What is a “Statistical Calculator”? Presented by Doug Hillmer Independent Consultant.
Transportation leadership you can trust. presented to TRB Census Data for Transportation Planning Meeting presented by Kevin Tierney Cambridge Systematics,
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Confidence Interval Estimation
Liesl Eathington Iowa Community Indicators Program Iowa State University October 2014.
Exploring Error and the American Community Survey.
1/26/09 1 Community Health Assessment in Small Populations: Tools for Working With “Small Numbers” Region 2 Quarterly Meeting January 26, 2009.
Estimation Statistics with Confidence. Estimation Before we collect our sample, we know:  -3z -2z -1z 0z 1z 2z 3z Repeated sampling sample means would.
Chap 8-1 Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall Chapter 8 Confidence Interval Estimation Business Statistics: A First Course.
Working with the data
Using IPUMS.org Katie Genadek Minnesota Population Center University of Minnesota The IPUMS projects are funded by the National Science.
Biostatistics: Measures of Central Tendency and Variance in Medical Laboratory Settings Module 5 1.
Case 5 Introduction to Demographic Research Using Aggregated ACS Data for Ecological Regression: Changes in County Poverty Katherine Curtis Adam Slez Jennifer.
Using the American Community Survey (ACS) Maryland Sate Data Center Affiliate Meeting April 4, 2007.
A Demographic Evaluation of the Stability of American Community Survey Estimates for Selected Test Sites: 2000 to 2011 J. Gregory Robinson and Eric B.
Using the ACS: Issues with studying small areas and change over time Presented to Association of Public Data Users January 20, 2011.
1 Things That May Affect Estimates from the American Community Survey.
American Community Survey Getting the Most Out of ACS Jane Traynham Maryland State Data Center.
American Community Survey Maryland State Data Center Affiliate Meeting September 16, 2010.
1 The American Community Survey An Update Pamela Klein American Community Survey Office Washington Metropolitan Council on Governments Cooperative Forecasting.
American Community Survey (ACS) 1 Oregon State Data Center Meeting Portland State University April 14,
Using ACS and Census 2010 in Communities and Neighborhoods: Guidelines and Tools POPULATION REFERENCE BUREAU | PRESENTATION BY MARK MATHER.
Some ACS Data Issues and Statistical Significance (MOEs) Table Release Rules Statistical Filtering & Collapsing Disclosure Review Board Statistical Significance.
American Community Survey “It Don’t Come Easy”, Ringo Starr Jane Traynham Maryland State Data Center March 15, 2011.
Copyright © 2012 Pearson Education. All rights reserved © 2010 Pearson Education Copyright © 2012 Pearson Education. All rights reserved. Chapter.
Things that May Affect the Estimates from the American Community Survey Updated February 2013.
The American Community Survey: The Census Bureau’s new annual survey of America Will “Chip” Sawyer Vermont State Data Center.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
American Community Survey (ACS) Product Types: Tables and Maps Samples Revised
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
1 ACS Statistical Issues and Challenges: One-, Three-, and Five-year Period Estimates Alfredo Navarro U.S. Census Bureau Association of Professional Data.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 8. Parameter Estimation Using Confidence Intervals.
Can We Trust Data Users to Consider Data Quality? Presented at the 2008 European Conference on Quality in Official Statistics.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.1 Categorical Response: Comparing Two Proportions.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Statistics for Political Science Levin and Fox Chapter Seven
Iowa State University Extension and Outreach Staff Professional Development October 20, 2015 Sandra Burke, Bailey Hanson, Christopher Seeger, and Biswa.
Confidence Intervals for a Population Proportion Excel.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics: A First Course 5 th Edition.
The American Community Survey Getting Ready for a New Decade of Socio-Economic Data.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Estimation and Confidence Intervals Chapter 9.
Variability. The differences between individuals in a population Measured by calculations such as Standard Error, Confidence Interval and Sampling Error.
Variability.
And distribution of sample means
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
Addressing Conservation Issues Using IMBCR Data and Results
Confidence Interval Estimation
Lecture Slides Elementary Statistics Twelfth Edition
Presentation transcript:

1 Case Study 1: How to Deal with Estimates with Low Reliability 2009 Population Association of America ACS Workshop April 29, 2009

2 What is Reliability? Sampling Error is the uncertainty associated with an estimate that is based on data gathered from a sample of the population rather than the full population Measures of sampling error give users an idea of how reliable, or precise, estimates are and speak to their fitness-for-use

3 Measures of Sampling Error Standard Error (SE) – foundational measure of the variability of an estimate due to sampling Margin of Error (MOE) – precision of an estimate at a given level of confidence Confidence Interval (CI) - a range (based on a fixed level of confidence) that is expected to contain the population value of the characteristic Coefficient of Variation (CV) - The relative amount of sampling error associated with a sample estimate

4 Calculating Measures of Sampling Error At a 90 percent confidence level MOE = SE x SE = MOE / CI = Estimate +/- MOE CV = SE / Estimate * 100%

5 ACS Displays Margins of Error

6 Example 1 – Calculating Sampling Errors 2007 ACS 1-year estimates for Washington, DC Estimate of the percent of married couple families = 22.2% with a MOE of 1.2% SE = MOE/1.645 = 1.2% / = 0.729% CI = Estimate +/- MOE = 22.2% +/- 1.2% = 21.0% to 23.4% CV = SE/Estimate * 100% = 0.729% / 22.2% * 100% = 3.28%

7 Interpreting Coefficients of Variation CVs are a standardized indicator of reliability that tell us the relative amount of sampling error in the estimate Estimates with CVs that are less than 15% are generally considered reliable, while estimates with CVs that are greater than 30% are generally considered unreliable

8 Distinguishing Between Reliable and Unreliable Estimates There are no specific rules about acceptable levels of sampling error – the classification as “reliable” will vary based on the application Some estimates warrant greater precision than others due to the consequences of their use Reliability should always be considered when making comparisons

9 Example 2 – Assessing Utility A mayor of a small town can receive funding to support a language program if the proportion of the population speaking Vietnamese exceeds 5 percent. The 2007 ACS 1-year estimates shows the rate to be 1.2% with a MOE of 1.1%. The CV of this estimate is over 50% and the estimates would be deemed unreliable, but the mayor can with confidence conclude that the Vietnamese-speaking population is less than 5%.

10 Example 3 – Assessing Utility Officials in Savannah city, GA, are considering an outreach program to the foreign-born population of the city using the public transportation system as advertising. Officials need to know how many foreign-born people use public transportation. What do the 2007 ACS 1-year estimates show?

11

12 Example 3 – Assessing Utility The 2007 ACS 1-year estimate of the foreign-born using public transportation is 229 with a MOE of +/ This indicates a confidence interval of 0 to 589 and a CV of over 95%. This is a highly unreliable estimate and shouldn’t be used alone in an application such as this.

13 Example 4 – What to do with unreliable estimates Officials in Cook County, IL are looking to improve the quality of life for the elderly population by identifying sub county areas with people over 65 who are poor or near poor. An analyst finds a detailed table (B17024) from the 2007 ACS 1-year estimates that includes poverty data by age, providing a detailed series of income- to-poverty ratios.

14 Example 4 – Detailed Table In this table (B17024), data are available separately for people years and 75 years and over and for 12 income-to-poverty ratios CVs are high – for example, the estimate of 403 persons 75 and over with a ratio of 1.25 to 1.49, has a MOE of 314 and a CV of 47.4%

15 Option 1 Consider the collapsed version of a table You will find two versions of most detailed tables – one with full detail and another with detailed cells that have been collapsed Collapsed tables include fewer estimates that are usually more reliable

16 Option 1 Check out the collapsed version of this table In Table C17024 the two elderly age groups are combined and the 12 detailed income-to- poverty ratios are collapsed into 8 ratios CVs are still high, but better; for example, the CV for the estimate of persons 65 and over with a ratio of 1.25 to 1.99 is 18.3%

17 Option 2 Consider additional collapsing of detail In our example, we don’t need the detail in the collapsed table. It is sufficient to identify the “poor and near poor” as including all people with an income-to-poverty ratio of less than 2.0. We can collapse 4 detailed categories – under 0.5, 0.50 to 0.99, 1.00 to 1.24, and 1.25 to 1.99 to create a new category of “Under 2.00”

18 Option 2 Consider additional collapsing of detail While summing estimates of people in poverty across four income-to-poverty ratios provides the combined estimate, summing MOEs will not produce the correct MOE. The MOE of an aggregate estimate is determined by obtaining each component estimate’s MOE, squaring it, summing these, and taking the square root of that sum.

19 Option 2 - Calculations Age by Ratio of Income to Poverty Level – Bloom Township, Cook County, IL 65 years and over EstimateMOEMOE 2 Square root of sum Under to to to 1.992, Under 2.00 Source: 2007 ACS 1-year Estimates, Table C17024

20 Option 2 - Calculations Age by Ratio of Income to Poverty Level – Bloom Township, Cook County, IL 65 years and over EstimateMOEMOE 2 Square root of sum Under , to , to , to 1.992, ,944 Under 2.003,727937, Source: 2007 ACS 1-year Estimates, Table C17024

21 Option 2 - Results Age by Ratio of Income to Poverty Level – Bloom Township, Cook County, IL 65 years and overEstimateMOESECV Under % 0.50 to % 1.00 to % 1.25 to 1.992, % Under 2.003, % Source: 2007 ACS 1-year Estimates, Table C17024

22 Option 2 Summary The analyst should probably not directly use the estimates for each of the four income-to-poverty ratios to guide program planning (the CVs are very high for all but the last estimate) Collapsing the four detailed ratios into one ratio with less detail results in a more reliable estimate

23 Option 3 Consider combining geographic areas In our example, Bloom township is one sub county area in Cook County. It has two neighboring townships – Rich and Thornton If the geographic detail isn’t critical, estimates for these 3 areas could be combined

24 Option 3 - Calculations Age by Ratio of Income to Poverty Level – Bloom, Rich, and Thornton Townships, Cook County, IL 65 years and over EstimateMOEMOE 2 Square root of sum Under 0.50 Bloom Rich Thornton Combined Source: 2007 ACS 1-year Estimates, Table C17024

25 Option 3 - Calculations Age by Ratio of Income to Poverty Level – Bloom, Rich, and Thornton Townships, Cook County, IL 65 years and over EstimateMOEMOE 2 Square root of sum Under 0.50 Bloom ,225 Rich ,225 Thornton ,441 Combined1,253452, Source: 2007 ACS 1-year Estimates, Table C17024

26 Option 3 - Results Age by Ratio of Income to Poverty Level – Bloom, Rich, and Thornton Townships, Cook County, IL 65 years and overEstimateMOESECV Under 0.501, % 0.50 to 0.992, % 1.00 to 1.242, % 1.25 to 1.996, % Source: 2007 ACS 1-year Estimates, Table C17024

27 Option 3 - Calculations Age by Ratio of Income to Poverty Level – Bloom, Rich, and Thornton Townships, Cook County, IL 65 years and over EstimateMOEMOE 2 Square root of sum Under 0.501, , to 0.992, , to 1.242,6841,0211,042, to 1.996,6351,2791,635,841 Under ,1813,835,1321,958 Source: 2007 ACS 1-year Estimates, Table C17024

28 Option 3 - Results Age by Ratio of Income to Poverty Level – Bloom, Rich, and Thornton Townships, Cook County, IL 65 years and overEstimateMOESECV Under 0.501, % 0.50 to 0.992, % 1.00 to 1.242, % 1.25 to 1.996, % Under , % Source: 2007 ACS 1-year Estimates, Table C17024

29 Option 3 Summary Combining data for 3 neighboring areas improved the reliability of the detailed poverty data; collapsing this detail improved the estimate even more Users need to consider the most important dimensions – geography or characteristic detail when considering collapsing If both are critical, consider option 4

30 Option 4 Consider Multiyear Estimates This will be covered in the next two case studies

31 Summary Extrapolation to Large Data Sets While these case studies referenced the use of a single set of estimates for a limited number of geographic areas, the underlying logic applies to analysts working with large data sets covering many areas Be aware of the reliability limitations of the data before conducting your analyses, consider options to access or create more reliable estimates

32 What have we learned about dealing with ACS estimates with low reliability? You should review the collapsed version of a detailed table to see if the collapsed values are sufficient for your needs You can improve the reliability of ACS estimates by collapsing characteristic detail or combining geographies

33 Contact Debbie Griffin U.S. Census Bureau