Challenges in small area estimation of poverty indicators

Slides:



Advertisements
Similar presentations
Conducting of EU - SILC in the Republic of Macedonia, 2010 REPUBLIC OF MACEDONIA STATE STATISTICAL OFFICE State Statistical Office of Republic of Macedonia.
Advertisements

Chapter 3 Properties of Random Variables
The role of models in model- assisted and model-dependent estimation for domains and small areas Risto Lehtonen University of Helsinki
Pinpointing the security boundary in high-dimensional spaces using importance sampling Simon Tindemans, Ioannis Konstantelos, Goran Strbac Risk and Reliability.
1.2.4 Statistical Methods in Poverty Estimation 1 MEASUREMENT AND POVERTY MAPPING UPA Package 1, Module 2.
Riku Salonen Regression composite estimation for the Finnish LFS from a practical perspective.
1 1 Effects of attrition in the Norwegian Survey on statistics on income and living conditions Marit Wilhelmsen Statistics Norway Q European Conference.
Model- vs. design-based sampling and variance estimation on continuous domains Cynthia Cooper OSU Statistics September 11, 2004 R
Approaches to Data Acquisition The LCA depends upon data acquisition Qualitative vs. Quantitative –While some quantitative analysis is appropriate, inappropriate.
Copyright © 2006 Pearson Addison-Wesley. All rights reserved. Lecture 4: Mathematical Tools for Econometrics Statistical Appendix (Chapter 3.1–3.2)
OUTLIER, HETEROSKEDASTICITY,AND NORMALITY
Complex Surveys Sunday, April 16, 2017.
Small area Estimation of Italian poverty and social exclusion indicators Stefano Falorsi Michele D’Alò Loredana Di Consiglio Fabrizio Solari Matteo Mazziotta.
Chapter 10 Sampling and Sampling Distributions
STAT262: Lecture 5 (Ratio estimation)
A) Transformation method (for continuous distributions) U(0,1) : uniform distribution f(x) : arbitrary distribution f(x) dx = U(0,1)(u) du When inverse.
Increasing Survey Statistics Precision Using Split Questionnaire Design: An Application of Small Area Estimation 1.
 Deviation is a measure of difference for interval and ratio variables between the observed value and the mean.  The sign of deviation (positive or.
Estimating social inequalities in Healthy Life Years in Belgium Estimating social inequalities in HLE: Challenges and opportunities 10 February, 2012 Rana.
Arun Srivastava. Small Areas What is a small area? Sub - population Domain The Domain need not necessarily be geographical. Examples Geographical Subpopulations.
Joint UNECE/Eurostat Meeting on Population and Housing Censuses (28-30 October 2009) Accuracy evaluation of Nuts level 2 hypercubes with the adoption of.
A Monte Carlo exploration of methods to determine the UHECR composition with the Pierre Auger Observatory D.D’Urso for the Pierre Auger Collaboration
Use of Administrative Data in Statistics Canada’s Annual Survey of Manufactures Steve Matthews and Wesley Yung May 16, 2004 The United Nations Statistical.
Optimal Allocation in the Multi-way Stratification Design for Business Surveys (*) Paolo Righi, Piero Demetrio Falorsi 
1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.
ISTAT - Italian National Institute of Statistics Labour Force Survey Division Unit “Methods for LFS data treatment” European Conference on Quality in Official.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
1 Enhancing Small Area Estimation Methods Applications to Istat’s Survey Data Ranalli M.G. ~ Università di Perugia D’Alo’ M., Di Consiglio L., Falorsi.
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
Handbook on Residential Property Price Indices Chapter 5: Methods Jan de Haan UNECE/ILO Meeting, May 2010.
Using administrative registers in sample surveys European Conference on Quality in Official Statistics 3-–6 May 2010 Kaja Sõstra Statistics Estonia.
Poverty Estimation in Small Areas Agne Bikauskaite European Conference on Quality in Official Statistics (Q2014) Vienna, 3-5 June 2014.
Weighting and estimation methods: description in the Memobust handbook Loredana di Consiglio, Fabrizio Solari 2013 European Establishment Statistics Workshop.
Chapter 5 Parameter estimation. What is sample inference? Distinguish between managerial & financial accounting. Understand how managers can use accounting.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
Eurostat Weighting and Estimation. Presented by Loredana Di Consiglio Istituto Nazionale di Statistica, ISTAT.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
Outlier Treatment in HCSO Present and future. Outline Outlier detection – types, editing, estimation Description of the current method Alternatives Future.
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
Improving of Household Sample Surveys Data Quality on Base of Statistical Matching Approaches Ganna Tereshchenko Institute for Demography and Social Research,
Representativity Indicators for Survey Quality Programme: Cooperation Theme: Socio-economic sciences and Humanities Activity: Socio-economic and scientific.
Multivariate selective editing via mixture models: first applications to Italian structural business surveys Orietta Luzi, Guarnera U., Silvestri F., Buglielli.
Statistics Canada Citizenship and Immigration Canada Methodological issues.
Quality Analysis in a Survey on Transportation of Goods by Road Juris Breidaks University of Latvia Central Statistical Bureau of Latvia.
Chapter 5 Joint Probability Distributions and Random Samples  Jointly Distributed Random Variables.2 - Expected Values, Covariance, and Correlation.3.
Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson0-1 Supplement 2: Comparing the two estimators of population variance by simulations.
IAOS Shanghai – Reshaping Official Statistics Some Initiatives on Combining Data to Support Small Area Statistics and Analytical Requirements at.
Marginal Distribution Conditional Distribution. Side by Side Bar Graph Segmented Bar Graph Dotplot Stemplot Histogram.
1 AAEC 4302 ADVANCED STATISTICAL METHODS IN AGRICULTURAL RESEARCH Part II: Theory and Estimation of Regression Models Chapter 5: Simple Regression Theory.
Small area estimation combining information from several sources Jae-Kwang Kim, Iowa State University Seo-Young Kim, Statistical Research Institute July.
Enrico Fabrizi°, Maria Rosaria Ferrante* , Carlo Trivisano*
Planning a Simulation Study
Conducting of EU - SILC in the Republic of Macedonia, 2010
Statistics and Art: Sampling, Response Error, Mixed Models, Missing Data, and Inference Ed Stanek And others: Recai Yucel, Julio Singer, and others on.
Tetiana Ianevych and Veronika Serhiienko
Random Sampling Population Random sample: Statistics Point estimate
Flash estimates on income distribution dynamics
Estimation of Sampling Errors, CV, Confidence Intervals
Effect of Panel Length and Following Rules on Cross-Sectional Estimates of Income Distribution: Empirical Evidence from FI-SILC Marjo Pyy-Martikainen Workshop.
Statistical Assumptions for SLR
The European Statistical Training Programme (ESTP)
Estimation of Employment for Cities, Towns and Rural Districts
New Techniques and Technologies for Statistics 2017  Estimation of Response Propensities and Indicators of Representative Response Using Population-Level.
Marie Reijo, Population and Social Statistics
Sampling and estimation
Small area estimation with calibration methods
Small area estimation for the Dutch Investment Survey
Workshop on best practices for EU-SILC revision, −
Stratification, calibration and reducing attrition rate in the Dutch EU-SILC Judit Arends.
Presentation transcript:

Challenges in small area estimation of poverty indicators Risto Lehtonen, Ari Veijanen, Maria Valaste (University of Helsinki) , and Mikko Myrskylä (Max Planck Institute for Demographic Research, Rostock) Ameli 2010 Conference, 25-26 February 2010, Vienna

Outline Background Material and methods Results Discussion References

EU/FP7 Project AMELI Advanced Methodology for European Laeken Indicators (2008-2011) The project is supported by European Commission funding from the Seventh Framework Programme for Research DoW: The study will include research on data quality including Measurement of quality Treatment of outliers and nonresponse Small area estimation The measurement of development over time

Material and methods Investigation of statistical properties (bias and accuracy) of estimators of selected Laeken indicators for population subgroups or domains and small areas Method: Design-based Monte Carlo simulation experiments based on real data Data: Statistical register data based on merging of administrative register data at the unit level (Finland)

Laeken indicators based on binary variables At-risk-of poverty rate Direct estimators Horvitz-Thompson estimators HT Indirect estimators Model-assisted GREG and MC estimators Model-based EBLUP and EB estimators Modelling framework Generalized linear mixed models GLMM Lehtonen and Veijanen (2009) Rao (2003), Jiang and Lahiri (2006)

Laeken indicators based on medians or quantiles Indicators based on medians or quantiles of cumulative distribution function of the underlying continuous variable Relative median at-risk-of poverty gap Quintile share ratio (S20/S80 ratio) Gini coefficient Direct estimators DEFAULT Synthetic estimators SYN Expanded prediction SYN estimators EP-SYN Composite estimators COMP Simulation-based methods

Generalized linear mixed models

Design-based GREG type estimators for poverty rate

Model-based estimators for poverty rate

Poverty gap for domains Relative median at-risk-of poverty gap Poverty gap in domain d describes the difference between the poor people's median income and the at-risk-of-poverty threshold t

Estimators of poverty gap

Estimators of poverty gap

Estimators of poverty gap

Estimators of poverty gap

MSE estimation for direct estimator DEFAULT

MSE estimation for SYN estimator

Monte Carlo simulation Fixed finite population of 1,000,000 persons D = 70 domains of interest Cross-classification of NUTS 3 with sex and age group (7x2x5) Y-variables Equivalized income (based on register data) Binary indicator for persons in poverty X-variables (binary or continuous variables) house _owner (binary) education_level (7 classes) and educ_thh lfs_code (3 classes) and empmohh socstrat (6 classes) sex_class and age_class (5 age classes) NUTS3

Sampling designs SRSWOR sampling Stratified SRSWOR Sample size n = 5,000 persons Stratified SRSWOR Stratification by education level of HH head H = 7 strata Unequal inclusion probabilities Design weights vary between strata Min: 185, Max: 783 K = 1000 independent samples

Quality measures of estimators Design bias Absolute relative bias ARB (%) Accuracy Relative root mean squared error RRMSE (%)

Discussion: Poverty rate Indirect design-based estimator MLGREG Design unbiased Large variance in small domains Small variance in large domains Indirect model-based estimator EB Design biased Small variance also in small domains Accuracy: EB outperformed MLGREG Might be the best choice at least for small domains unless it is important to avoid design bias

Discussion: Poverty gap Direct estimator DEFAULT Small design bias but large variance Indirect model-based SYN Very large bias but small variance Indirect model-based EP-SYN based on expanded predictions Much smaller bias and variance than in SYN Composite (DEFAULT with EP-SYN) Small domains: good compromise Large domains: bias can still dominate the MSE

Thank you for your attention!