Estimating the Level of Underreporting of Expenditures among Expenditure Reporters: A Further Micro-Level Latent Class Analysis Clyde Tucker Bureau of.

Slides:



Advertisements
Similar presentations
The Wealth Index MICS3 Data Analysis and Report Writing Workshop.
Advertisements

National Seminar on Developing a Program for the Implementation of the 2008 SNA and Supporting Statistics in Turkey Derya TUNCER 10 September 2013 Ankara.
Paul Biemer, UNC and RTI Bac Tran, US Census Bureau Jane Zavisca, University of Arizona SAMSI Conference, 11/10/2005 Latent Class Analysis of Rotation.
Welsh Health Survey Anne Kingdon Welsh Assembly Government Health Promotion Division.
Outline of talk The ONS surveys Why should we weight?
Nonparametric estimation of non- response distribution in the Israeli Social Survey Yury Gubman Dmitri Romanov JSM 2009 Washington DC 4/8/2009.
Preparing Data for Quantitative Analysis
Preliminary Results from the 2008 Oklahoma Health Care Insurance and Access Survey Presentation to the Oklahoma Health Care Authority Board November 13,
1 Avalaura L. Gaither and Eric C. Newburger Population Division U.S. Census Bureau Washington, D.C. June 2000 Population Division Working Paper No. 44.
National Center for Health Statistics DCC CENTERS FOR DISEASE CONTROL AND PREVENTION Changes in Race Differentials: The Impact of the New OMB Standards.
UNECE Gender Statistics Session, October Dissemination and Use of Time Use Data The New Zealand Experience UNECE Gender Statistics Session Geneva,
1 Incorporating Statistical Process Control and Statistical Quality Control Techniques into a Quality Assurance Program Robyn Sirkis U.S. Census Bureau.
Propensity Score Models for Nonresponse and Measurement Error John Dixon U.S. Bureau of Labor Statistics June 20, 2011 The opinions expressed.
Predictors of Emergency Department Utilization by Homeless Persons: A National Study Clarilee Hauser PhD, RN.
Exploring Administrative Records Use for Race and Hispanic Origin Item Non-Response Sonya Rastogi, Leticia Fernandez, James Noon, Ellen Zapata and Renuka.
ICES 3° International Conference on Educational Sciences 2014
11 ACS Public Use Microdata Samples of 2005 and 2006 – How to Use the Replicate Weights B. Dale Garrett and Michael Starsinic U.S. Census Bureau AAPOR.
Incorporating Nonresponse in a Markov Latent Class Measurement Error Model of Consumer Expenditure Brian Meekins, Clyde Tucker Bureau of Labor Statistics.
Combining the strengths of UMIST and The Victoria University of Manchester An analysis of the relationship between time spent on active leisure and educational.
CE Overview Jay T. Ryan Chief, Division of Consumer Expenditure Survey December 8, 2010.
© John M. Abowd 2005, all rights reserved Household Samples John M. Abowd March 2005.
Consumer Expenditure Survey Redesign Jennifer Edgar Bureau of Labor Statistics COPAFS Quarterly Meeting March 4, 2011.
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
Using Substantive Diagnostics to Evaluate the Validity of Micro-level Latent Class Indicators of Measurement Error Clyde Tucker and Brian Meekins.
SW388R7 Data Analysis & Computers II Slide 1 Multiple Regression – Split Sample Validation General criteria for split sample validation Sample problems.
The Gender Gap in Educational Attainment: Variation by Age, Race, Ethnicity, and Nativity in the United States Sarah R. Crissey, U.S. Census Bureau Nicole.
Page 1 Walk Through the 2011 Recreational Boating Statistical Abstract Section 1 Boating Population 1.Participation Estimates Participation Study.
Household Surveys ACS – CPS - AHS INFO 7470 / ECON 8500 Warren A. Brown University of Georgia February 22,
A Deeper Understanding of Avery Fitness Center Customers
Effects of Income Imputation on Traditional Poverty Estimates The views expressed here are the authors and do not represent the official positions.
Survey on Violence against women: experience of the Republic of Moldova UNECE Work Session on Gender statistics, March 2012, Geneva, Switzerland.
Housing Units with Negative Equity, George R. Carter III, Ph.D. U.S. Census Bureau HUD Data Users Conference Washington, DC March 8,
Target Markets: Segmentation and Evaluation
HS499 Bachelor’s Capstone Week 6 Seminar Research Analysis on Community Health.
TURKISH STATISTICAL INSTITUTE Social Sector Statistics Department Tourism Statistics Group
Using IPUMS.org Katie Genadek Minnesota Population Center University of Minnesota The IPUMS projects are funded by the National Science.
The Uninsured in Alameda County 2010 December 2010.
HPRP: New Reports HPRP new reports and data entry reporting review April 2010.
Register-based migration statistics and using additional administrative data sources Barica Razpotnik Statistical Office of the Republic of Slovenia UNECE.
The relationship between error rates and parameter estimation in the probabilistic record linkage context Tiziana Tuoto, Nicoletta Cibella, Marco Fortini.
1 Data Linkage for Educational Research Royal Statistical Society March 19th 2007 Andrew Jenkins and Rosalind Levačić Institute of Education, University.
Current Population Survey Sponsor: Bureau of Labor Statistics Collector: Census Bureau Purpose: Monthly Data for Analysis of Labor Market Conditions –CPS.
Economics and Statistics Administration U.S. CENSUS BUREAU U.S. Department of Commerce Assessing the “Year of Naturalization” Data in the American Community.
Introduction Introduction Alcohol Abuse Characteristics Results and Conclusions Results and Conclusions Analyses comparing primary substance of abuse indicated.
Barriers to Independence Among TANF Recipients: Comparing Caseworker Records & Client Surveys Correne Saunders Pamela C. Ovwigho Catherine E. Born Paper.
3.14 X AXIS 6.65 BASE MARGIN 5.95 TOP MARGIN 4.52 CHART TOP LEFT MARGIN RIGHT MARGIN ©TNS 2013 Are ‘better’ interviewers more successful at.
CHAPTERS HYPOTHESIS TESTING, AND DETERMINING AND INTERPRETING BETWEEN TWO VARIABLES.
RESEARCH METHODS IN TOURISM Nicos Rodosthenous PhD 14/03/ /3/20131Dr Nicos Rodosthenous.
RESEARCH METHODS Lecture 26
1 The Mortality of China’s Oldest Old: Comparisons from the Healthy Longevity Survey (HLS) and the 2000 Census Daniel Goodkind International Programs.
Pre-printing experiences at Statistics Sweden Anders Holmberg Department of Research & Development Statistics Sweden SE Örebro Sweden Tel:
Muskie School of Public Service University of Southern Maine Who Uses Individual Health Insurance & For How Long? Erika Ziller Andrew Coburn Timothy McBride.
2009 Survey of Disability, Ageing and Carers (SDAC) – emerging data Presentation to Carers NSW Biennial Conference 17 March 2011 Steve Gelsi Assistant.
Assessing Nonresponse Bias and Measurement Error Using Statistical Matching John Dixon U.S. Bureau of Labor Statistics June 15, 2010 The opinions.
BY: ALEJANDRA REYES DALILA OCHOA MARY GARCIA Part A Introduction to Research Methods Topics 1-5.
Using Data from the National Survey of Children with Special Health Care Needs Centers for Disease Control and Prevention National Center for Health Statistics.
1 Economically Active Population Survey Dong-Wook JEONG Employment Statistics Div. Statistics Korea.
Does High School Homework Increase Academic Achievement? Charlene Marie Kalenkoski Texas Tech University Sabrina Wulff Pabilonia U.S. Bureau of Labor Statistics.
Evaluating a Product Line Extension Opportunity
Transportation-related Injuries among US Immigrants: Findings from National Health Interview Survey.
Survey Research and Methodology UNL-GALLUP Research Center Evaluating Data quality in Time Diary Surveys Using Paradata Ana Lucía Córdova Cazar Robert.
AC 1.2 present the survey methodology and sampling frame used
CHS Community Health Survey
Trena M. Ezzati-Rice, Frederick Rohde, Robert Baskin
Prague EU-SILC Best Practice Workshop, 14th and 15th September 2017
Identifying Gender Minorities in Population-based Proxy Surveys
Multiple Regression – Split Sample Validation
Andrew Jenkins and Rosalind Levačić
Integration of inconsistent data sources using Hidden Markov Models (HMMs) Paulina Pankowska, Bart Bakker, Daniel Oberski & Dimitris Pavlopoulos.
Presentation transcript:

Estimating the Level of Underreporting of Expenditures among Expenditure Reporters: A Further Micro-Level Latent Class Analysis Clyde Tucker Bureau of Labor Statistics Paul P. Biemer Research Triangle Institute Brian Meekins Bureau of Labor Statistics DISCLAIMER: The views expressed in this presentation are those of the authors and do not necessarily represent the views of the Bureau of Labor Statistics or the Department of Labor.

Outline Background Research Goals Past Approach Past Results Current Approach Results Future Research

U.S. Consumer Expenditure (CE) Interview Survey ~ 6,000 households/year –Interviewed every 3 months about prior 3 months expenditures –5 consecutive interviews for each household 6 years of CE data: 1996 – 2002

Research Goals This multi-year project has three goals: 1.Identify patterns of underreporting of expenditures in different commodities 2.Identify the characteristics of respondents contributing most to the underreports 3.Use the knowledge gained to design new procedures for overcoming underreporting

Phases of Research Phase 1 (2003) –Markov LCA on macro level data –Non-reporters only Phase 2 (2004) –(Ordered) LCA on micro level data for total consumer expenditures –Reporters only Phase 2 (current) –(Ordered) LCA on micro level data for separate commodity expenditures –Examination of possible casual linkages to respondent characteristics –Reporters only Phase 3 (future) –Combine the macro and micro analyses –All sample –Produce overall estimates of underreporting by category and respondent characteristics

Phase 1: Approach Used 4 consecutive CE interviews Since the 1st of (month, 3 months ago), have you (or any members of your household) had any expenses for __________? Used 1 st and 2 nd order Markov LCA to fit models to dichotomous response to screening question Explored effect on underreporting of: –family size, income, age, family type, gender, education, record use, interview length

Phase 1: Design Obtained estimates of false negative probability i.e. P(no purchase reported | made a purchase) Produced estimates for each commodity of: –True proportion of purchasers – Accuracy rate i.e. P(report a purchase | truly made a purchase) Used these estimates to examine relationships between demographic variables and probability of accurate reporting

Phase 1: Conclusions Model fit was adequate for all commodities Levels of underreporting vary by commodity Variables were found to be positively related to accurate reporting included: Education Family Size Income Use of Records Length of Interview The effect of age was highly variable

Phase 2 in 2004 Differences between Phase 1 and Phase 2: –Used only Interview 2 data, not Markov LCA –Micro level analysis –Reporters only –Latent variable represents level of underreporting, as opposed to purchasing status as in Phase 1

Approach Analysis Plan –Ran both ordered latent class models and unordered. Order was determined based on theoretical relationship between values of indicators and level of underreporting. –Ran all combinations of indicators in groups of 3 Using only reporters Using only 2 nd interview data

Application of Model For the final model: –Each combination of indicator was assigned to a latent class –The probability of being in that class given the value of the indicators was used to assign classes –Each respondent was assigned to a latent class given the value of their indicator variables –Expenditure means were found for each latent class.

Summary of Findings in 2004 Levels of underreporting were found to vary by interview level characteristics including: 1.Number of contacts 2.Missing income data 3.Type and frequency of records used 4.Length of interview Total expenditure means for respondents assigned to each latent class confirmed this

Current Phase 2 Using same general methodology as 2004 Refine indicators Apply methodology to separate commodity categories Identify best model for each commodity and assign respondents to latent classes Examine the pattern of mean expenditures for each latent class to confirm results Run demographic analysis to identify characteristics of members of each latent class

Indicators Interview level indicators considered: 1. Number of contacts 2. Ratio of respondents/household members 3. Missing income data 4. Type and frequency of records used 5. Length of interview 6. Ratio of expenditures in last month to quarter 7. Combination of type of record and interview length

Indicator Coding #contacts (1=0-2; 2=3-5; 3=6+) Resp/hh size (1= <.5; 2=.5+) Income missing (1=present; 2=missing) Records use (1=never; 2=single type or sometimes; 3=multiple types and always) Interview length (1= <45; 2=45-90; 3= 90+) Month3 expn/all (1= <.25; 2=.25-.5; 3= +.5) Combined records and length (1= poor; 2= fair; 3=good)

Demographic Coding CU size (1=1; 2=2; 3=3+) Age (1= 30<; 2= 30-49; 3=50+) Education (1=< H.S.; 2= H.S.+) Income rank (1= <=.25; 2= and missing; 3=+.75) Race (1= White; 2= Other) Tenure (1= renter; 2= owner) Urban (1= urban; 2= rural)

BICDiss Index Kids Clothing Womens Clothing Mens Clothing Furniture Electricity Minor Vehicle Repairs Kitchen Accessories LCM Fit by Commodity

Value of Latent Class 1 = Poor23 = Good Kids Clothing Mean44.90(a)59.62(b)71.09(c) n24,66612,0016,331 Womens Clothing Mean99.00(b)148.08(a)152.94(a) n21,31611,28110,401 Mens Clothing Mean78.98(b)107.04(a)105.46(a) n36, ,076 Furniture Mean117.25(a)66.22(b)266.63(c) n23,43716,3153,246 Electricity Mean230.47(a)198.87(b)223.30(c) n32,9054,3775,716 Minor Vehicle Expenditures Mean39.47(a)57.03(b)82.28(c) n9,86426,2886,846 Kitchen Accessories Mean23.27(b)52.51(a)47.58(a) n26,5892,93413,.475 Expenditure Means by Latent Class

Exp(b)PR(X 2 ) Famsize 1.801<.0001 Famsize <.0001 Age 1.862<.0001 Age Educ Inclass 1.776<.0001 Inclass 2.762<.0001 Race1.056<.0001 Tenure.909<.0001 urban1.104<.0001 Max-rescaled R Proportional Odds Model Results for Minor Vehicle

Future Research –Other categories and total expenditures –Add a Markov component –Combine the macro and micro analyses (underreporting for both reporters and nonreporters) –Produce overall estimates of underreporting by category and respondent characteristics