Weighting and estimation methods: description in the Memobust handbook Loredana di Consiglio, Fabrizio Solari 2013 European Establishment Statistics Workshop.

Slides:



Advertisements
Similar presentations
By: Saad Rais, Statistics Canada Zdenek Patak, Statistics Canada
Advertisements

Investigation of Treatment of Influential Values Mary H. Mulry Roxanne M. Feldpausch.
1 ESTIMATION IN THE PRESENCE OF TAX DATA IN BUSINESS SURVEYS David Haziza, Gordon Kuromi and Joana Bérubé Université de Montréal & Statistics Canada ICESIII.
Innovation data collection: Methodological procedures & basic forms Regional Workshop on Science, Technology and Innovation (STI) Indicators.
Innovation data collection: Advice from the Oslo Manual South East Asian Regional Workshop on Science, Technology and Innovation Statistics.
Innovation Surveys: Advice from the Oslo Manual South Asian Regional Workshop on Science, Technology and Innovation Statistics Kathmandu,
Innovation Surveys: Advice from the Oslo Manual National training workshop Amman, Jordan October 2010.
Unido.org/statistics International workshop on industrial statistics 8 – 10 July, Beijing Non response in industrial surveys Shyam Upadhyaya.
Treatment of missing values
Brian A. Harris-Kojetin, Ph.D. Statistical and Science Policy
The estimation strategy of the National Household Survey (NHS) François Verret, Mike Bankier, Wesley Benjamin & Lisa Hayden Statistics Canada Presentation.
Deliverable 2.8: Outliers Gary Brown Office for National Statistics UK.
Riku Salonen Regression composite estimation for the Finnish LFS from a practical perspective.
Selection of Research Participants: Sampling Procedures
Results and next steps from the ESSnet Admin Data Alison Pritchard Business Outputs & Developments, Office for National Statistics, UK 4 December 2012.
Small area Estimation of Italian poverty and social exclusion indicators Stefano Falorsi Michele D’Alò Loredana Di Consiglio Fabrizio Solari Matteo Mazziotta.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Nonparametric, Model-Assisted Estimation for a Two-Stage Sampling Design Mark Delorey Joint work with F. Jay Breidt and Jean Opsomer September 8, 2005.
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
Increasing Survey Statistics Precision Using Split Questionnaire Design: An Application of Small Area Estimation 1.
Overview of Robust Methods Analysis Jinxia Ma November 7, 2013.
Eurostat Repeated surveys. Presented by Eva Elvers Statistics Sweden.
Definitions Observation unit Target population Sample Sampled population Sampling unit Sampling frame.
1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.
1 OECD Workshop on Business and Consumer Tendency Surveys, Rome, 19 September 2006 SESSION 1 RECENTLY DEVELOPED INTERNATIONAL GUIDELINES FOR OPINION SURVEYS.
Eurostat The impact of the Memobust project results.
Sander Scholtus and Leon Willenborg Editing and Imputation in the Memobust Handbook.
The MEMOBUST project Ágnes Andics EESW Nürnberg 1.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
1 Enhancing Small Area Estimation Methods Applications to Istat’s Survey Data Ranalli M.G. ~ Università di Perugia D’Alo’ M., Di Consiglio L., Falorsi.
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
The Memobust Handbook on Methodology for Modern Business Statistics Sander Scholtus Rob van de Laar Leon Willenborg
1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.
for statistics based on multiple sources
CHAPTER 12 Descriptive, Program Evaluation, and Advanced Methods.
Evaluating generalised calibration / Fay-Herriot model in CAPEX Tracy Jones, Angharad Walters, Ria Sanderson and Salah Merad (Office for National Statistics)
A Theoretical Framework for Adaptive Collection Designs Jean-François Beaumont, Statistics Canada David Haziza, Université de Montréal International Total.
Eurostat Statistical matching when samples are drawn according to complex survey designs Training Course «Statistical Matching» Rome, 6-8 November 2013.
Eurostat Weighting and Estimation. Presented by Loredana Di Consiglio Istituto Nazionale di Statistica, ISTAT.
The challenge of a mixed-mode design survey and new IT tools application: the case of the Italian Structure Earning Surveys Fabiana Rocci Stefania Cardinleschi.
Outlining a Process Model for Editing With Quality Indicators Pauli Ollila (part 1) Outi Ahti-Miettinen (part 2) Statistics Finland.
Multivariate selective editing via mixture models: first applications to Italian structural business surveys Orietta Luzi, Guarnera U., Silvestri F., Buglielli.
Eurostat Accuracy of Results of Statistical Matching Training Course «Statistical Matching» Rome, 6-8 November 2013 Marcello D’Orazio Dept. National Accounts.
Chapter 6 Conducting & Reading Research Baumgartner et al Chapter 6 Selection of Research Participants: Sampling Procedures.
Handbook for Health Care Research, Second Edition Chapter 7 © 2010 Jones and Bartlett Publishers, LLC CHAPTER 7 Designing the Experiment.
Statistics Canada Citizenship and Immigration Canada Methodological issues.
Sampling Design and Analysis MTH 494 Lecture-21 Ossam Chohan Assistant Professor CIIT Abbottabad.
Investigating the Potential of Using Non-Probability Samples Debbie Cooper, ONS.
Introduction to statistics Definitions Why is statistics important?
Workshop on MDG, Bangkok, Jan.2009 MDG 3.2: Share of women in wage employment in the non-agricultural sector National and global data.
1 General Recommendations of the DIME Task Force on Accuracy WG on HBS, Luxembourg, 13 May 2011.
Small area estimation combining information from several sources Jae-Kwang Kim, Iowa State University Seo-Young Kim, Statistical Research Institute July.
How to deal with quality aspects in estimating national results Annalisa Pallotti Short Term Expert Asa 3st Joint Workshop on Pesticides Indicators Valletta.
Theme (i): New and emerging methods
Multiple Imputation using SOLAS for Missing Data Analysis
Introduction to Survey Data Analysis
Regression composite estimation for the Finnish LFS from a practical perspective Riku Salonen.
An Active Collection using Intermediate Estimates to Manage Follow-Up of Non-Response and Measurement Errors Jeannine Claveau, Serge Godbout and Claude.
Essnet on Methodology for Modern Business Statistics
The European Statistical Training Programme (ESTP)
Chapter 8: Weighting adjustment
A New Business Statistics in Finland - Quarterly Investments
Chapter 10: Selection of auxiliary variables
The European Statistical Training Programme (ESTP)
Chapter: 9: Propensity scores
New Techniques and Technologies for Statistics 2017  Estimation of Response Propensities and Indicators of Representative Response Using Population-Level.
Sampling and estimation
The European Statistical Training Programme (ESTP)
Chapter 13: Item nonresponse
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

Weighting and estimation methods: description in the Memobust handbook Loredana di Consiglio, Fabrizio Solari 2013 European Establishment Statistics Workshop Nuremberg, 10 September 2013

Project MEMOBUST (MEthodology for MOdern BUsiness STatistics) Main goals: identification of best practices and the development of common methodologies and guidelines supporting the production of business statistics aiming at fostering efficiency and integration of processes. replacement of a methodological handbook for the production of business statistics. This is intended to be an update of the existing “Handbook on design and implementation of business statistics” (edited by Willeboordse in 1998). MEMOBUST 2 Weighting and estimation methods: description in the Memobust handbook, Fabrizio Solari – Nuremberg, 10/09/2013

Modules are written for statisticians/managers responsible for running and producing business statistics methodologists Two types of modules are used: theme modules (of general nature; broad readership) method module (more technical; especially for methodologists) Handbook on Business Statistics 3 Weighting and estimation methods: description in the Memobust handbook, Fabrizio Solari – Nuremberg, 10/09/2013

Method modules contained in the theme module “Weighting and estimation” can be roughly grouped in the following topics: Calibration Robust estimation in presence of outliers Preliminary estimates Small area estimation Estimation with administrative data Weighting and Estimation (theme module) 4 Weighting and estimation methods: description in the Memobust handbook, Fabrizio Solari – Nuremberg, 10/09/2013

XIX. Weighting and Estimation 0. Main theme module 1. Design of estimation 2.c. Calibration 2.d. GREG 4.a. Outlier treatment (robust estimation) 4.b. Winsorisation 4.c. Weight trimming 5. Model-based estimation 7.a. Estimation for short term statistics 7.b. Preliminary estimation with design-based methods 7.c. Preliminary estimation with model-based methods 8.a. Small area estimation 8.b. Synthetic estimator 8.c. Sample size dependant and composite estimator 8.e. EBLUP area level for small area estimation 8.f. EBLUP unit level for small area estimation 8.g. Small area estimation methods for time series data 9. Estimation with administrative data Weighting and Estimation (theme module) 5 Weighting and estimation methods: description in the Memobust handbook, Fabrizio Solari – Nuremberg, 10/09/2013

Design weights equal the inverse of the inclusion probability and can be thought as the number of units of the population each unit in the sample is representative of. Hence, a simple method to obtain estimates is to use these design weights to inflate the sample observations. Design weights can also be adjusted to consider non-response. Design weights can be modified to take into account auxiliary information (Sändal et al.., 1992) by applying calibration estimation. Calibration estimation 6 Weighting and estimation methods: description in the Memobust handbook, Fabrizio Solari – Nuremberg, 10/09/2013

The principle of weighting is also applied to account for unit non- response. Design weights can be adjusted also to consider non-response in order to reduce the possible bias of resulting estimates. For example, the sample can be partitioned into sub-groups of units where the response rates are assumed to be constant, and where it can be assumed that non-respondents behave similarly to respondents. Equivalently, the method is based on the assumption that the non- response depends on auxiliary variables defining a partition of the population, but conditionally on these variables it is independent of the target variable (i.e. non-response is MAR, see Little and Rubin, 2002). Weighting adjustment for non-response 7 Weighting and estimation methods: description in the Memobust handbook, Fabrizio Solari – Nuremberg, 10/09/2013

The effect of outliers on estimation can be significant, since in such situations estimators don’t retain their properties in terms of bias or efficiency.Outliers treatment at estimation stage (robust estimation) aims at reducing the effect on variance of outliers, also controlling the possible bias of the estimator. The module presents some nonstandard robust estimation techniques: modification of the Greg estimator as proposed by Chambers, Falvey, Hedlin and Kokic (2001), Winsor estimator (see Mackin and Preston, 2002), local regression (Kim, Breidt and Opsomer, 2001) Robust estimation in presence of outliers 8 Weighting and estimation methods: description in the Memobust handbook, Fabrizio Solari – Nuremberg, 10/09/2013

Modification of the Greg estimator The modification proposed by Chambers, Falvey, Hedlin and Kokic (2001) refers to GREG estimators assuming heteroscedasticity. They propose to reduce the proportion of disjunctive observations by their substitution or by post-stratification. Robust estimation in presence of outliers 9 Weighting and estimation methods: description in the Memobust handbook, Fabrizio Solari – Nuremberg, 10/09/2013

Winsor estimation Units drawn into the sample, for which the variable takes a value beyond specified border points, are changed (Kokic and Bell, 1994, and Chambers, 1996). The biggest problem in Winsor estimation is the designation of adequate border points, which are crucial in indicating outliers. Robust estimation in presence of outliers 10 Weighting and estimation methods: description in the Memobust handbook, Fabrizio Solari – Nuremberg, 10/09/2013

Preliminary (or provisional) estimates are the estimates that are computed using the statistical information available on the basis of the sample that is observed at time of first release of the estimates. The main problem that has to be faced off in preliminary estimation context concerns the possible self-selection of early respondents, since self-selection can lead to biased estimates. Preliminary estimation 11 Weighting and estimation methods: description in the Memobust handbook, Fabrizio Solari – Nuremberg, 10/09/2013

Preliminary estimation methods may be classified in function of the stage on which the preliminary method is applied: 1.at the sampling design stage, by selecting a preliminary subsample of the final sample; 2. at the estimation stage, in the following ways: a)by means of imputation techniques of missing data, that are applied to non respondent units; b)by means of weighting adjustment; c)by applying direct and indirect estimators and/or time series of preliminary and final estimates Preliminary estimation 12 Weighting and estimation methods: description in the Memobust handbook, Fabrizio Solari – Nuremberg, 10/09/2013

The aim of small area estimation methods is to produce reliable estimstes for unplanned domains for which direct estimates cannot be considered reliable (in some cases direct estimator cannot be even computed when no sampling units are observed for some specific domain). The main idea underlying small area techniques is to increase their effective sample size, (see Rao, 2003). An improvement in the efficiency of the estimates can be achieved by assuming, implicitly or explicitly, a relationship linking together sampling units in the small area of interest and sampling units in the small areas which behave similarly to the small area of interest. Furthermore, an increase in efficiency can be obtained using information coming from exploiting the sample for the same areas at different times. Small area estimation 13 Weighting and estimation methods: description in the Memobust handbook, Fabrizio Solari – Nuremberg, 10/09/2013

SAE methods described in the modules can be divided in: design based methods: -Sample-size dependent estimator (Drew et al., 1982), the James- Stein estimator (see Rao, 2003). model based methods: -Model based methods based on Linerar Mixed Models (see Rao, 2003). A specific module is dedicated to small area estimation methods for time series data. Small area estimation 14 Weighting and estimation methods: description in the Memobust handbook, Fabrizio Solari – Nuremberg, 10/09/2013

Two situations can be distinguished when producing estimates in an administrative data based system with the largest enterprises being surveyed. Situation A: the available administrative data provide good coverage when the estimates have to be made. Situation B: no or only few administrative data are available for the STS estimates. The module provides the technical descriptions for producing estimates for situations A and B respectively. Estimation with administrative data 15 Weighting and estimation methods: description in the Memobust handbook, Fabrizio Solari – Nuremberg, 10/09/2013

Draft results from MEMOBUST project are available at (registration needed) 16 Weighting and estimation methods: description in the Memobust handbook, Fabrizio Solari – Nuremberg, 10/09/2013