Incorporating Nonresponse in a Markov Latent Class Measurement Error Model of Consumer Expenditure Brian Meekins, Clyde Tucker Bureau of Labor Statistics.

Slides:



Advertisements
Similar presentations
National Seminar on Developing a Program for the Implementation of the 2008 SNA and Supporting Statistics in Turkey Derya TUNCER 10 September 2013 Ankara.
Advertisements

International Conference on Establishment Surveys Montreal, 2007 Richard Rosen, U.S. Bureau of Labor Statistics
Estimating the Level of Underreporting of Expenditures among Expenditure Reporters: A Further Micro-Level Latent Class Analysis Clyde Tucker Bureau of.
Paul Biemer, UNC and RTI Bac Tran, US Census Bureau Jane Zavisca, University of Arizona SAMSI Conference, 11/10/2005 Latent Class Analysis of Rotation.
Paul Smith Office for National Statistics
Survey Methodology Nonresponse EPID 626 Lecture 6.
Tobacco Use Supplement To The Current Population Survey Users’ Workshop June 2009 Tips and Tricks of Handling the TUS Data James “Todd” Gibson Information.
An example of longitudinal LFS weights
Brian A. Harris-Kojetin, Ph.D. Statistical and Science Policy
1 Incorporating Statistical Process Control and Statistical Quality Control Techniques into a Quality Assurance Program Robyn Sirkis U.S. Census Bureau.
Propensity Score Models for Nonresponse and Measurement Error John Dixon U.S. Bureau of Labor Statistics June 20, 2011 The opinions expressed.
Multiple Indicator Cluster Surveys Data dissemination and further analysis workshop Literacy Education MICS4 Data Dissemination and Further Analysis Workshop.
CE Overview Jay T. Ryan Chief, Division of Consumer Expenditure Survey December 8, 2010.
© 2004 Prentice-Hall, Inc.Chap 1-1 Basic Business Statistics (9 th Edition) Chapter 1 Introduction and Data Collection.
Documentation and survey quality. Introduction.
Consumer Expenditure Survey Redesign Jennifer Edgar Bureau of Labor Statistics COPAFS Quarterly Meeting March 4, 2011.
Labor Statistics in the United States Grace York March 2004.
Domestic Tourism Destination Choices- A Choice Modelling Analysis Assignment 3 Group 3 Hari Hara Sharan Nagalur Subraveti Kasun Dilhara Wimalasena Kento.
REPRESENTATIVE RESPONSE Some Examples ITSEW Peter Lundquist Statistics Sweden.
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
CHAPTER 14, QUANTITATIVE DATA ANALYSIS. Chapter Outline  Quantification of Data  Univariate Analysis  Subgroup Comparisons  Bivariate Analysis  Introduction.
Using Substantive Diagnostics to Evaluate the Validity of Micro-level Latent Class Indicators of Measurement Error Clyde Tucker and Brian Meekins.
1 Health Status and The Retirement Decision Among the Early-Retirement-Age Population Shailesh Bhandari Economist Labor Force Statistics Branch Housing.
PEAS wprkshop 2 Non-response and what to do about it Gillian Raab Professor of Applied Statistics Napier University.
Measurement in Survey Research MKTG 3342 Fall 2008 Professor Edward Fox.
Eurostat Repeated surveys. Presented by Eva Elvers Statistics Sweden.
Household Surveys ACS – CPS - AHS INFO 7470 / ECON 8500 Warren A. Brown University of Georgia February 22,
Effects of Income Imputation on Traditional Poverty Estimates The views expressed here are the authors and do not represent the official positions.
LSS Black Belt Training Forecasting. Forecasting Models Forecasting Techniques Qualitative Models Delphi Method Jury of Executive Opinion Sales Force.
Organizing Your Data for Statistical Analysis in SPSS
Father Involvement and Child Well-Being: 2006 Survey of Income and Program Participation (SIPP) Child Well-Being Topical Module 1 By Jane Lawler Dye Fertility.
TRANSLATING VISITS INTO PATIENTS USING AMBULATORY VISIT DATA (Hypertensive patient case study) by Esther Hing, M.P.H. and Julia Holmes, Ph.D U.S. DEPARTMENT.
Copyright 2010, The World Bank Group. All Rights Reserved. Estimation and Weighting, Part I.
“... providing timely, accurate, and useful statistics in service to U.S. agriculture.” Wendy Barboza, Darcy Miller, Nathan Cruze United States Department.
12th Meeting of the Group of Experts on Business Registers
A Latent Class Call-back Model for Survey Nonresponse Paul P. Biemer RTI International and UNC-CH Michael W. Link Centers for Disease Control and Prevention.
Pushing forward with ASPIRE A System for Product Improvement, Review and Evaluation Heather Bergdahl, Paul Biemer, Dennis Trewin Q2014.
MBA7020_05.ppt/June 27, 2005/Page 1 Georgia State University - Confidential MBA 7020 Business Analysis Foundations Time Series Forecasting June 27, 2005.
Eurostat Overall design. Presented by Eva Elvers Statistics Sweden.
Scot Exec Course Nov/Dec 04 Survey design overview Gillian Raab Professor of Applied Statistics Napier University.
The relationship between error rates and parameter estimation in the probabilistic record linkage context Tiziana Tuoto, Nicoletta Cibella, Marco Fortini.
The Practice of Social Research Chapter 14 – Quantitative Data Analysis.
Panel Study of Entrepreneurial Dynamics Richard Curtin University of Michigan.
Economics and Statistics Administration U.S. CENSUS BUREAU U.S. Department of Commerce Assessing the “Year of Naturalization” Data in the American Community.
1 Unobserved common causes of measurement and nonresponse error on the 2008 ANES Panel Survey International Total Survey Error Workshop Stowe, VT - June.
The Challenge of Non- Response in Surveys. The Overall Response Rate The number of complete interviews divided by the number of eligible units in the.
Copyright 2010, The World Bank Group. All Rights Reserved. Reducing Non-Response Section A 1.
Measurement of Income in NCHS Surveys Diane M. Makuc NCHS Data Users Conference July 12, 2006 Centers for Disease Control and Prevention National Center.
1 SIPP IMPUTATION SCHEME AND DISCUSSION ITEMS Presenters: Nat McKee - Branch Chief Census Bureau Demographic Surveys Division (DSD) Income Surveys Programming.
Building Wave Response Rates in a Longitudinal Survey: Essential for Nonsampling Error Reduction or Last In - First Out? Steven B. Cohen Fred Rohde and.
© John M. Abowd 2007, all rights reserved General Methods for Missing Data John M. Abowd March 2007.
United Nations Workshop on Revision 3 of Principles and Recommendations for Population and Housing Censuses and Evaluation of Census Data, Amman 19 – 23.
Interviewer Effects on Paradata Predictors of Nonresponse Rachael Walsh, US Census Bureau James Dahlhamer, NCHS European Survey Research Association, 2015.
Welcome to MM305 Unit 5 Seminar Prof Greg Forecasting.
11 How Much of Interviewer Variance is Really Nonresponse Error Variance? Brady T. West Michigan Program in Survey Methodology University of Michigan-Ann.
Does Inclusion of Both Partial and Complete Interviews from the Source Sampling Frame Have an Effect on Nonresponse Error and Measurement Error in a National.
Pg 1 Mode Effects Measurement and Correction: A Case Study Stas Kolenikov, Courtney Kennedy, Ali Ackermann, and Chintan Turakhia Abt SRBI Michael O. Emerson.
1 Kuo-hsien Su, National Taiwan University Nan Lin, Academia Sinica and Duke University Measurement of Social Capital: Recall Errors and Bias Estimations.
An Indicator of Nonresponse Bias Derived from Call-back Analysis Paul P. Biemer RTI International and UNC.
Assessing Nonresponse Bias and Measurement Error Using Statistical Matching John Dixon U.S. Bureau of Labor Statistics June 15, 2010 The opinions.
© 2006 IMS Health Incorporated or its affiliates. All rights reserved. June 20, 2007 Determination of Target Sample Sizes for Physicians Surveys Darrell.
1 Using Household Surveys to Collect Data on Remittances: Lessons Learned from the CPS Migration Supplement Elizabeth M. Grieco Chief, Immigration Statistics.
Small area estimation combining information from several sources Jae-Kwang Kim, Iowa State University Seo-Young Kim, Statistical Research Institute July.
Utility of an overlapping panel design in the MEPS Steven B. Cohen, Ph.D.
Real Personal Income and Regional Price Parities
Jonathan Eggleston and Lori Reeder
Multiple Imputation Using Stata
Share of median income (%)
Economic Performance Chapter 13.
SEM evaluation and rules
Presentation transcript:

Incorporating Nonresponse in a Markov Latent Class Measurement Error Model of Consumer Expenditure Brian Meekins, Clyde Tucker Bureau of Labor Statistics Paul Biemer RTI International Any opinions expressed in this paper are those of the authors and do not constitute policy of the Bureau of Labor Statistics

Uses repeated measurements from panel survey data to estimate classification error Used successfully in evaluation of labor force data (e.g., Biemer & Bushery, 2001 Tucker, Biemer, Meekins 2012) Does not require external validation data; estimates of error directly from panel data Applied to reports of expenditures in the CEIS Markov Latent Class Analysis

U.S. Consumer Expenditure Interview Survey (CEIS) ~ 6,000 CU’s/year CU’s interviewed every 3 months about prior 3 months expenditures 4 consecutive interviews on each CU New data: N=23,719 From to Unweighted analysis

ElectricityMen’s Clothing OnlyMinor Vehicle Repairs CableWomen’s Clothing OnlyVehicle Oil TrashMen’s AccessoriesMajor Vehicle Repairs GasWomen’s AccessoriesVehicle Other TVMen’s ShoesDental SportsWomen’s ShoesEyes FurnitureKid’s ApparelDrugs* Other HouseholdKid’s Clothing OnlyPets Kitchen AccessoriesKid’s Accessories Men’s ApparelKid’s Shoes Commodity Categories 4

1 st Order Markov Model 5 ZYX W Where

Definition of Indicator Variables Define for Interview 1, 1, if reported as a purchaser for the quarter 2, if reported as non-purchaser with similar definition for X, Y, Z for 2nd, 3rd, and 4th interviews W=

2 nd Order Markov Model 7 ZYX W

Mover-Stayer 8 ZYX W M 1, P(W=1) = P(X =1) = P(Y =1) = P(Z = 1) = 1 M = 2, P(W=1) = P( X =1) = P(Y =1) = P(Z = 1) = 0 3, P(W),P(X), P(Y), and P(Z) are unconstrained.

Measurement Error Model 9 ADCB ZYX W = = = =

Markov Latent Class Model Notation True expenditure status is a latent variable Latent Var. Indicator Var. Interview 1W A Interview 2X B Interview 3Y C Interview 4Z D

Definition of Latent Variables Where, 1, if one or more purchases of an item during the W=quarter (“purchaser”) 2, if no purchase (“non-purchaser”) with similar definition for X, Y, Z for 2nd, 3rd, and 4th interview

Markov or Mover-Stayer model assumptions Equal measurement error across all interviews No False Positives Model Assumptions

Model Selection Limitations on Lem forced estimation of separate pieces of the full model: ME & NR Multiple iterations to avoid local maxima Best refusal, noncontact, and measurement error variables were selected based on fit in these component models Using best variables constructed combined model

Estimates ME Model No covariates P(A=1|W=1) = P(B=1|X=1) = P(C=1|Y=1) = P(D=1|Z=1) =

Measurement Error Model 15 ADCB ZYX W = = = = Measurement Error Indicators

Measurement Error Model 16 ADCB ZYX W Measurement Error Indicators

ME Variables 1. Missing income data (CU) 2. Type and frequency of records used (interiew) 3. Length of interview (interview) 4. Ratio of expend. in last month to entire quarter (interview) 5. Combination of types of record and int length (interview) 6. Number of expenditure questions imputed/allocated (interview) 7. Completion mode (interview) 8. Family size (CU)

ME Model w/Covariates 18 ME Model w/ Covariates P(A=1|W=1)0.781 P(B=1|X=1)0.733 P(C=1|Y=1)0.696 P(D=1|Z=1)0.687

ME Model: Covariates 19 Income MissingP(A=1|W=1) Missing0.779 Not Missing0.796 ModeP(A=1|W=1) Telephone0.766 In person0.814 Record use/Int timeP(A=1|W=1) Short int/few records0.664 Medium int/med records0.779 Long int/lots of records0.982

Nonresponse Error Model 20 EHGF Noncontact Covariates 1 Reports All Qtr 2 Reports Some 3 Reports None M Refusal Covariates

Nonresponse Error Model 21 EHGF Noncontact Covariates 1 Latent.. likelihood c of Interview M Refusal Covariates

Noncontact Variables 1. Number of noncontact problems reported in CHI (interview) 2. Age (CU) 3. Owner/Renter (CU) 4. Urbanicity (CU)

Noncontact Estimates 23 WaveProb of Responding E1.000 F0.877 G0.823 H0.798

Noncontact Estimates 24 Noncontact Problems P(F=1) % Samp No mentions or 2 mentions Three or more P(F=1) % Samp Rents Owns

Noncontact Estimates 25 AgeP(F=1) % Samp < UrbanicityP(F=1) % Samp Urban Rural

Refusal Variables 1. Factor1 variables – Reluctance privacy concerns (CU) 2. Factor2 variables – Reluctance time concerns (CU) 3. Any reluctance mentioned (CU) 4. Conversion Refusal – only wave 1 available? (CU) 5. Region (CU) 26

Refusal Estimates 27 Refusal Reason Factor 2P(M=3) Worst % Samp Never mentioned Avg 1 per interview More Refusal Reason Factor 1P(M=3) Worst % Samp Mentioned Not mentioned

Refusal Estimates 28 RegionP(M=3) Worst % Samp Northeast Midwest South West Any Conversion RefusalP(M=3) Worst % Samp Yes No

29 EHGF Noncontact Covariates Refusal Covariates ADCB ZYX W = = = = Measurement Error Indicators 1 Reports All Qtr 2 Reports Some 3 Reports None M Combined ModelCombined Model

Estimates: Full Model Measurement Error (Item Nonresponse) P(A=1|W=1)0.781 P(B=1|X=1)0.758 P(C=1|Y=1)0.739 P(D=1|Z=1)0.745 Unit Nonresponse P(F=1|X=1)0.957 P(G=1|Y=1)0.953 P(H=1|Z=1)0.957

Missing Reports Of the estimated 1,939 missed reports  1,721 were a result of item nonresponse  218 were a result of unit nonresponse Reminder: nonresponse conditioned on first wave response 31

Discussion Can be conducted for each commodity category Models need further specification & adjustments, thorough evaluation of covariates Estimates can be used to inform:  Allocation of resources  Adjustment: relationship of NR and ME  Fatigue, conditioning, etc. 32

Contact Information Brian Meekins Office of Survey Methods Research U.S. Bureau of Labor Statistics

Objective Diagnositics Fit Statistics  L-square Dissimilarity Index BIC

Subjective Diagnostics True purchase pattern given estimated purchase pattern Mover-stayer classification by purchase pattern Accuracy rates by subgroup  Accuracy is the percent of true purchasers that reported purchasing that commodity