Survey nonresponse and the distribution of income Emanuela Galasso* Development Research Group, World Bank * Based on joint work by Martin Ravallion, Anton.

Slides:



Advertisements
Similar presentations
Survey Methodology Nonresponse EPID 626 Lecture 6.
Advertisements

The estimation strategy of the National Household Survey (NHS) François Verret, Mike Bankier, Wesley Benjamin & Lisa Hayden Statistics Canada Presentation.
Monitoring and Evaluating the Poverty Reduction and Social Protection: A Case Study of China Shaohua Chen Development Research Group, World Bank Further.
Further Updating Poverty Mapping in Albania Gianni Betti*, Andrew Dabalen**, Celine Ferrè** and Laura Neri* * University of Siena, Italy, ** The World.
United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan,
Copyright 2010, The World Bank Group. All Rights Reserved. Estimation and Weighting Part II.
Life Opportunities Survey (LOS) Wave 2 Weighting Andy Fallows and Sangeetha Gallagher.
Calculating The Estimates And Weights For The Vacancy Rate Prepared for the 2010 SDC-CIC Joint Steering Committee Meetings February 24, 2010 Edward C.
NLSCY – Non-response. Non-response There are various reasons why there is non-response to a survey  Some related to the survey process Timing Poor frame.
Sampling: Final and Initial Sample Size Determination
CE Overview Jay T. Ryan Chief, Division of Consumer Expenditure Survey December 8, 2010.
1 Daniel Felsenstein Eyal Ashbel Simultaneous Modeling of Developer Behavior and Land Prices in UrbanSim UrbanSim European Users Group meeting, ETH Zurich,
The Effort to Develop Disability Questions for the Current Population Survey Terence M. McMenamin U.S. Bureau of Labor Statistics October 5, 2006.
Why sample? Diversity in populations Practicality and cost.
Questions re Internet Use How many/what percent use it How often, how much For what purposes From where (home, office, public-access computer) Differences.
Chapter 2 – Tools of Positive Analysis
The Excel NORMDIST Function Computes the cumulative probability to the value X Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Hypothesis Tests Regarding a Parameter 10.
En Route Surveys Chapter 38 Research Methodologies.
Methodology for a school- leavers’ survey Irena Kogan MZES, University of Mannheim.
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
Distribution of Income & Income Inequality The Lorenz Curve & The Gini Index.
How data quality affects poverty and inequality measurement PovcalNet team DECPI The World Bank.
Household Surveys ACS – CPS - AHS INFO 7470 / ECON 8500 Warren A. Brown University of Georgia February 22,
Effects of Income Imputation on Traditional Poverty Estimates The views expressed here are the authors and do not represent the official positions.
Copyright 2010, The World Bank Group. All Rights Reserved. Estimation and Weighting, Part I.
Sampling: What you don’t know can hurt you Juan Muñoz.
Nonresponse issues in ICT surveys Vasja Vehovar, Univerza v Ljubljani, FDV Bled, June 5, 2006.
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
Poverty measurement: experience of the Republic of Moldova UNECE, Measuring poverty, 4 May 2015.
Evaluating Job Training Programs: What have we learned? Haeil Jung and Maureen Pirog School of Public and Environmental Affairs Indiana University Bloomington.
Faculty of Economics and Business Administration Page 1 Discussion on: Estimating the impact of a policy reform on welfare participation:
The Challenge of Non- Response in Surveys. The Overall Response Rate The number of complete interviews divided by the number of eligible units in the.
Copyright 2010, The World Bank Group. All Rights Reserved. Reducing Non-Response Section B 1.
AFRICA IMPACT EVALUATION INITIATIVE, AFTRL Africa Program for Education Impact Evaluation David Evans Impact Evaluation Cluster, AFTRL Slides by Paul J.
Issues in Estimation Data Generating Process:
SAMPLE SELECTION in Earnings Equation Cheti Nicoletti ISER, University of Essex.
1 SIPP IMPUTATION SCHEME AND DISCUSSION ITEMS Presenters: Nat McKee - Branch Chief Census Bureau Demographic Surveys Division (DSD) Income Surveys Programming.
A discussion of Comparing register and survey wealth data ( F. Johansson and A. Klevmarken) & The Impact of Methodological Decisions around Imputation.
1 Ministry of Finance and Ministry of Samurdhi Welfare Benefits Board The Targeting Formula: Analysis Using Pilot Data Welfare Workshop Colombo,
© John M. Abowd 2007, all rights reserved General Methods for Missing Data John M. Abowd March 2007.
Household Survey Data on Remittances in Sending Countries Johan A. Mistiaen International Technical meeting on Measuring Remittances Washington DC - January.
Targeting of Public Spending Menno Pradhan Senior Poverty Economist The World Bank office, Jakarta.
Data Weighting Issues Adult Equivalent Scales, Stratified Sampling and general Population Weighting Issues.
Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.
Randomized Assignment Difference-in-Differences
By R. Gambacorta and A. Neri Bank of Italy - Statistical Analysis Directorate Wealth and its returns: economic inequality in Italy, The Bank.
1 Aspects of Sampling for Household Surveys Kathleen Beegle Workshop 17, Session 1c Designing and Implementing Household Surveys March 31, 2009.
1 Chapter 13 Collecting the Data: Field Procedures and Nonsampling Error © 2005 Thomson/South-Western.
1 Data Collection and Sampling ST Methods of Collecting Data The reliability and accuracy of the data affect the validity of the results of a statistical.
Common Pitfalls in Randomized Evaluations Jenny C. Aker Tufts University.
1 Survey Nonresponse Survey Research Laboratory University of Illinois at Chicago March 16, 2010.
Introduction Data may be presented in a way that seems flawless, but upon further review, we might question conclusions that are drawn and assumptions.
Modeling Poverty Martin Ravallion Development Research Group, World Bank.
Kotchikpa Gabriel Lawin Lota Dabio Tamini
Sampling Why use sampling? Terms and definitions
Sampling: Final and Initial Sample Size Determination
Impact evaluation: The quantitative methods with applications
Matching Methods & Propensity Scores
The European Statistical Training Programme (ESTP)
Matching Methods & Propensity Scores
Sampling: Final and Initial Sample Size Determination
The European Statistical Training Programme (ESTP)
Matching Methods & Propensity Scores
Microeconometric Modeling
Sampling: Final and Initial Sample Size Determination
Evaluating Impacts: An Overview of Quantitative Methods
Chapter 13: Item nonresponse
Presentation transcript:

Survey nonresponse and the distribution of income Emanuela Galasso* Development Research Group, World Bank * Based on joint work by Martin Ravallion, Anton Korinek and Johan Mistiaen

2 1: Why are we concerned about non response? 2: Implications for measurement of poverty and inequality 3: Evidence for the US –Estimation methods –Results 4: An example for China

3 1: Why do we care?

4 Types of nonresponse Item-nonresponse (participation to the survey but non-response on single questions) –Imputation methods using matching Lillard et al. (1986); Little and Rubin (1987)

5 Types of nonresponse Item-nonresponse –Imputation methods using matching Lillard et al. (1986); Little and Rubin (1987) The idea: –For sub-sample with complete data: –Then impute missing data using: Observations with… XY complete dataYes missing dataYesNo

6 Types of nonresponse Unit-nonresponse (“non-compliance”) (non-participation to the survey altogether)

7 Unit-nonresponse: possible solutions Ex-ante: –Replace non respondents with similar households –Increase the sample size to compensate for it –Using call-backs, monetary incentives: Van Praag et al. (1983), Alho (1990), Nijman and Verbeek (1992) Ex-post: Corrections by re-weighting the data –Use imputation techniques (hot-deck, cold-deck, warm-deck, etc.) to simulate the answers of nonrespondents

8 Unit-nonresponse: possible solutions Ex-ante: –Replace nonrespondents with similar households –Increase the sample size to compensate for it –Using call-backs, monetary incentives: Van Praag et al. (1983), Alho (1990), Nijman and Verbeek (1992) Ex-post: Corrections by re-weighting the data –Use imputation techniques (hot-deck, cold-deck, warm-deck, etc.) to simulate the answers of nonrespondents None of the above…

The best way to deal with unit-nonresponse is to prevent it Lohr, Sharon L. Sampling: Design & Analysis (1999)

10 Total Nonresponse Interviewers Type of survey Respondents Training Work LoadMotivation QualificationData collection method Demographic Socio-economic Economic Burden Motivation Proxy Availability Source: “Some factors affecting Non-Response.” by R. Platek Survey Methodology

11 Rising concern about unit- nonresponse High nonresponse rates of 10-30% are now common –LSMS: 0-26% nonresponse (Scott and Steele, 2002) –UK surveys: 15-30% –US: 10-20% Concerns that the problem might be increasing

12 Nonresponse is a choice, so we need to understand behavior Survey participation is a matter of choice –nobody is obliged to comply with the statistician’s randomized assignment There is a perceived utility gain from compliance –the satisfaction of doing one’s civic duty But there is a cost too An income effect can be expected

13 Nonresponse bias in measuring poverty and inequality Compliance is unlikely to be random: Rich people have: –higher opportunity cost of time –more to hide (tax reasons) –more likely to be away from home? –multiple earners Poorest might also not comply: –alienated from society? –homeless

14 2: Implications for poverty and inequality measures

15 Implications for poverty F(y) is the true income distribution, density f(y) is the observed distribution, density Note: and

16 Implications for poverty F(y) is the true income distribution, density f(y) is the observed distribution, density Note: and Definition: correction factor w(y) such that:

17 Implications for poverty cont., If compliance falls with income then poverty is overestimated for all measures and poverty lines. i.e., first-order dominance: if w’(y) > 0 for all y  ( y P, y R ), then for all y  (y P, y R )

18 First-order dominance w’(y) > 0

19 Example

20 Implications for inequality If compliance falls with income ( w’(y) > 0 ) then the implications for inequality are ambiguous Lorenz curves intersect so some inequality measures will show higher inequality, some lower

21 Example of crossing Lorenz Curves

22 3: Evidence for the U.S.

23 Current Population Survey Source: CPS March supplement, 1998 – 2002, Census Bureau 3 types of “non-interviews:” type A: individual refused to respond or could not be reached  what we define as “non-response” type B: housing unit vacant; type C: housing unit demolished  we ignore type B/C in our analysis

24 Dependence of response rate on income Response rate and average per-capita income for 51 US states, CPS March supplement 2002

25 Dependence of response rate on income Response rate and average per-capita income for 51 US states, CPS March supplement 2002

26 Estimation method In survey data, the income of non-responding households is by definition unobservable. However, we can observe the survey compliance rates by geographical areas. The observed characteristics of responding households, in conjunction with the observed compliance rates of the areas in which they live, allow one to estimate the household-specific probability of survey response. Thus we can correct for selective compliance by re-weighting the survey data.

27 Estimation method cont., {( X ij, m ij )} … set of households in state j s.t. m ij households each carry characteristics X ij, where X ij includes e.g. ln( y ij ), a constant, etc. total number of households in state j: M j representative sample S j in state j with sampled households m j =  m ij for each sampled household ε there’s a probability of response D εij {0,1}

28 Estimation method cont., The observed mass of respondents of group i in state/area j is Then summing up for a given j yields: Now let’s define: This is known!These are the individual weights

29 Estimation method cont., where obviously Then we can estimate

30 Estimation method cont., Optimal weighting matrix W = Var(()) … Hansen (1982) Assume for single state j: This can be estimated as Finally, where

31 Alternative Specifications

32 Results From Specification 2 P = logit( 1 +  2 ln(y))

33 Graph of specification 2: Probability of compliance as a function of income

34 Empirical and Corrected Cumulative Income Distribution

35 Income Distribution: Magnification

36 Correction by Percentile of Income

37 Empirical and Corrected Lorenz Curve

38 Lorenz Curves: Magnification

39 Specifications with Other Variables Specifications 10 – 18, P = logit( 1 +  2 ln(y)+  3 X 1 +  4 X 2 ):

40 4: China

41 Example for China Urban Household Survey of NBS Two stages in sampling –Stage 1: Large national random sample with very short questionnairre and high repsonse rate –Stage 2: Random sample drawn from Stage 1 sample, given very detailed survey, including daily diary, regular visits etc Use Stage 1 data to model determinants of compliance Then re-weight the data

42 Further reading Korinek, Anton, Johan Mistiaen and Martin Ravallion, “An Econometric Method of Correcting for unit Nonresponse Bias in Surveys,” Journal of Econometrics, (2007), 136: Korinek, Anton, Johan Mistiaen and Martin Ravallion, “Survey Nonresponse and the Distribution of Income.” Journal of Economic Inequality, (2006), 4:33-55