Joint UNECE/Eurostat Meeting on Population and Housing Censuses (28-30 October 2009) Accuracy evaluation of Nuts level 2 hypercubes with the adoption of.

Slides:



Advertisements
Similar presentations
Multiple Indicator Cluster Surveys Survey Design Workshop
Advertisements

Innovation data collection: Advice from the Oslo Manual South East Asian Regional Workshop on Science, Technology and Innovation Statistics.
Quality in Italian consumer price survey: optimal allocation of resources and indicators to monitor the data collection process Federico Polidoro, Rosabel.
United Nations Expert Group Meeting on Revising the Principles and Recommendations for Population and Housing Censuses New York, 29 October – 1 November.
Taejin Jung, Ph.D. Week 8: Sampling Messages and People
Evaluation.
United Nations Workshop on Revision 3 of Principles and recommendations for Population and Housing Censuses and Census Evaluation Amman, Jordan, 19 – 23.
Trade and business statistics: use of administrative data Lunch Seminar Enrico Giovannini Italian National Statistical Institute (ISTAT) New York, February,
Joint UNECE/Eurostat Meeting on Population and Housing Censuses (13-15 May 2008) Sample results expected accuracy in the Italian Population and Housing.
Joint UNECE/Eurostat Work Session on Migration Statistics 3 March, 2008, Geneva, Switzerland Selected methods to improve emigration estimates MEASURING.
Sample Design.
United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan,
Eurostat Repeated surveys. Presented by Eva Elvers Statistics Sweden.
Improving Quality in the Office for National Statistics’ Annual Earnings Statistics Pete Brodie & Kevin Moore UK Office for National Statistics.
United Nations Workshop on Revision 3 of Principles and recommendations for Population and Housing Censuses and Census Evaluation Amman, Jordan, 19 – 23.
Innovations on methods and survey process for the 2011 Italian population census European Conference on Quality in Official Statistics 8-11 July, 2008.
Module E Overview of Sampling ACCT-40801Mudule E.
Giovanna Brancato, Marina Signore Istat Work Session on Statistical Metadata (METIS) Metadata and Quality Indicators Reuse for Quality reporting Geneva,
5 Marzo 2007 EMERGING METHODOLOGIES OF CONTINUOUS USE OF REGISTERS AND GEOCODED DATABASES IN THE ITALIAN POPULATION AND HOUSING CENSUS Fabio Crescenzi,
Crop area estimates with area frames in the presence of measurement errors Elisabetta Carfagna University of Bologna Department.
9 th Workshop on Labour Force Survey Methodology – Rome, May 2014 The Italian LFS sampling design: recent and future developments 9 th Workshop on.
Optimal Allocation in the Multi-way Stratification Design for Business Surveys (*) Paolo Righi, Piero Demetrio Falorsi 
Emerging methodologies for the census in the UNECE region Paolo Valente United Nations Economic Commission for Europe Statistical Division International.
1 Sources of gender statistics Angela Me UNECE Statistics Division.
United Nations Economic Commission for Europe Statistical Division Sources of gender statistics Angela Me UNECE Statistics Division.
Use of web scraping and text mining techniques in the Istat survey on “Information and Communication Technology in enterprises” Giulio Barcaroli(*), Alessandra.
Eurostat Overall design. Presented by Eva Elvers Statistics Sweden.
Sampling strategy for the dual-system correction of the under-coverage in the Register Supported 2011 Italian Population Census Loredana Di Consiglio,
1 Improving Statistics for Food Security, Sustainable Agriculture and Rural Development – Action Plan for Africa THE RESEARCH COMPONENT OF THE IMPLEMENTATION.
European Conference on Quality in Official Statistics Session 26: Quality Issues in Census « Rome, 10 July 2008 « Quality Assurance and Control Programme.
CZECH STATISTICAL OFFICE | Na padesátém 81, Prague 10 | czso.cz1/16 Ondřej Nývlt, Ilona Nováková, Lukáš Savko EFFECTIVE DISTRIBUTION OF SAMPLE OVER.
Managerial Economics Demand Estimation & Forecasting.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Bangkok,
JOINT UN-ECE/EUROSTAT MEETING ON POPULATION AND HOUSING CENSUSES GENEVA, OCTOBER 2009 ADOPTING THE CES RECOMMENDATIONS IN THE 2011 UK CENSUS IAN.
Handbook on Precision Requirements and Variance Estimation for ESS Household Surveys Denisa Florescu, Eurostat European Conference on Quality in Official.
Sub-regional Workshop on Census Data Evaluation, Phnom Penh, Cambodia, November 2011 Evaluation of Age and Sex Distribution United Nations Statistics.
Statistical Expertise for Sound Decision Making Quality Assurance for Census Data Processing Jean-Michel Durr 28/1/20111Fourth meeting of the TCG - Lubjana.
JOINT UN-ECE/EUROSTAT MEETING ON POPULATION AND HOUSING CENSUSES GENEVA, MAY 2009 DETERMINING USER NEEDS FOR THE 2011 UK CENSUS IAN WHITE, Office.
The challenge of a mixed-mode design survey and new IT tools application: the case of the Italian Structure Earning Surveys Fabiana Rocci Stefania Cardinleschi.
Sampling Sources: -EPIET Introductory course, Thomas Grein, Denis Coulombier, Philippe Sudre, Mike Catchpole -IDEA Brigitte Helynck, Philippe Malfait,
Joint UNECE/Eurostat Work Session on Migration Statistics, April 2010,Geneva Monthly Demographic Now Cast. Monthly estimates of migration flows in.
United Nations Workshop on Revision 3 of Principles and Recommendations for Population and Housing Censuses and Evaluation of Census Data, Amman 19 – 23.
Topic (i): Selective editing / macro editing Discussants Orietta Luzi - Italian National Statistical Institute Rudi Seljak - Statistical Office of Slovenia.
1 1 Confidentiality protection of large frequency data cubes UNECE Workshop on Statistical Confidentiality Ottawa October 2013 Johan Heldal and Svetlana.
Marketing Information System A Marketing Information System is the structure of people, equipment, and procedures used to gather, analyze, and distribute.
Overview and challenges in the use of administrative data in official statistics IAOS Conference Shanghai, October 2008 Heli Jeskanen-Sundström Statistics.
Sampling technique  It is a procedure where we select a group of subjects (a sample) for study from a larger group (a population)
JOINT UN-ECE/EUROSTAT MEETING ON POPULATION AND HOUSING CENSUSES GENEVA, 7-9 JULY 2010 A QUALITY ASSURANCE STRATEGY FOR THE 2011 CENSUS IN ENGLAND AND.
SUITLAND WORKING GROUP: Task Force on Improving Migration and Migrant Data Using Household Surveys and Other Sources Eric B. Jensen Population Division.
Census quality evaluation: Considerations from an international perspective Bernard Baffour and Paolo Valente UNECE Statistical Division Joint UNECE/Eurostat.
Joint UNECE-Eurostat worksession on confidentiality, 2011, Tarragona Sampling as a way to reduce risk and create a Public Use File maintaining weighted.
Census Office Fernando Casimiro Geneva, July 2010 Portugal – Census results tailored to user needs «
An Overview of Editing and Imputation Methods for the next Italian Censuses Gianpiero Bianchi, Antonia Manzari, Alessandra Reale UNECE-Eurostat Meeting.
Meeting of the Group of Experts on Population and Housing Censuses Geneva, 23 – 26 September 2014 Principles and Recommendations for Population and Housing.
COMBINING SURVEY AND ADMINISTRATIVE DATA IN THE ITALIAN EU-SILC EXPERIENCE: POSITIVE AND CRITICAL ASPECTS National Institute of Statistics - Italy Claudio.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Addis.
Evaluating imputation of sex and age for substitutes in substitute households Michael Ryan 2008 UNECE Work Session on Statistical Data Editing.
11 Measuring Disclosure Risk and Data Utility for Flexible Table Generators Natalie Shlomo, Laszlo Antal, Mark Elliot University of Manchester
Session topic (i) – Editing Administrative and Census data Discussants Orietta Luzi and Heather Wagstaff UNECE Worksession on Statistical Data Editing.
Sampling and Sampling Distribution
Chapter 4 Basic Estimation Techniques
Module E Overview of Sampling ACCT-4080 Mudule E.
Basic Estimation Techniques
Dual Mode of Data Collection – A New Approach in the Population, Housing and Dwelling Census in Slovakia in 2011 European Conference on Quality in Official.
Basic Estimation Techniques
Survey phases, survey errors and quality control system
Working Group on Population and Housing Censuses
Survey phases, survey errors and quality control system
Teodora Brandmuller Head of Section – Regional and urban statistics
Silvia Losco, ISTAT, Strategies and approaches for managing risks in the official statistics production: ISTAT experience in the.
Presentation transcript:

Joint UNECE/Eurostat Meeting on Population and Housing Censuses (28-30 October 2009) Accuracy evaluation of Nuts level 2 hypercubes with the adoption of a sampling strategy in the 2011 Italian Population Census Giancarlo Carbonetti, Mariangela Verrascina Istat – Italian National Institute of Statistics Division for General Censuses Geneva, October 29th 2009

Joint UNECE Eurostat Meeting 2 Why do we adopt sampling techniques in the Italian Census? Sampling is crucial for the new census strategy.  The main solutions proposed are related to: use of population registers; census forms mail out; mixed mode of data collection.  A high response rate is needed.  The 2011 population census has been planned in order to: improve the efficiency of the survey operations; reduce the workload of the municipalities; minimize the statistical burden for the people.

Joint UNECE Eurostat Meeting 3 Which effects by adopting a sampling technique?  To keep high level of quality (reducing non-sampling error sources) → this is an opportunity → this is a constraint  Timeliness for a smaller amount of data to process (hypercubes must be delivered to Eurostat by 1 April 2014) → this is a constraint Advantages Disadvantages  Introduction of sampling error → an evaluation of accuracy of the sampling estimates is required

Joint UNECE Eurostat Meeting 4 The framework  POPULATION: private households.  LISTS: population registers managed by the municipalities.  VARIABLES: non-demographic variables.  DOMAINS: Census Areas.  DIFFERENT STRATEGY: municipality demographic thresholds. A simulation study has been conducted in order to define which methodology performs the most accurate estimation.

Joint UNECE Eurostat Meeting 5  Simple Random Sampling of HOUseholds (SRSHOU) from population registers.  Area Frame Sampling where reliable population registers are not available.  Calibrated Estimators.  Definition of Census Areas of about 15,000 inhabitants.  Sampling Ratio of 33%. Some results of the simulation study

Joint UNECE Eurostat Meeting 6 Distribution of average and maximum cv% for classes of cell counts for three tested sampling ratios (SRSHOU design) - for cells of 1,000 units cv is about 4% - for cells of 100 units cv is about 13% - for cells of 10 units cv is about 40%

Joint UNECE Eurostat Meeting 7 Curves of sampling errors drawn by the simulation results

Joint UNECE Eurostat Meeting 8 Relevant issue  Which is the impact of the sampling error on the dissemination hypercubes? The answer is the core of this presentation where the impact of the sampling strategy on the final results will be carefully explained.

Joint UNECE Eurostat Meeting 9 “When can the quality of the statistical table be considered acceptable?” Example 1: if less than 1/3 of cell counts have a cv>12.5% Example 2: if less than 10% of persons are classified in cell counts where cv>12.5% Impact of sampling errors on dissemination hypercubes For a fixed cv (for instance, a critical level should be 12.5%), the global quality of a dissemination hypercube can be acceptable: → if the percentage of cell counts estimated with a cv higher than the critical value is low; → if the percentage of persons classified in those cells is low. Having chosen the sampling strategy (SRSHOU; calibrated estimator), for an area and a dissemination hypercube:

Joint UNECE Eurostat Meeting 10 Evaluation of the sets of estimates with critical accuracy by means of a sampling errors curve cv_max 12.5% TSTS Set of estimates with cv>12.5% High sampling errors Absolute frequencies estimated with a critical quality Set of estimates with cv<12.5% Absolute Frequencies T cv critical threshold sampling error The lower the amount of information estimated with high levels of cv (referred to persons classified in cells with absolute frequencies lower than the threshold T S ), the higher the quality of the related dissemination hypercubes.

Joint UNECE Eurostat Meeting 11  Evaluations are related to 8 Eurostat hypercubes crossing demographic variables with one or more long form variables and referred to NUTS level 2.  The considered hypercubes contain topics with breakdowns used in 2001 Italian Census dissemination, close (in terms of number and information content) to breakdowns to be provided for the next census round.  The number of cells goes from 1,000 to more than 20,000 depending on the complexity of the statistical table. Quality evaluations for hypercubes at NUTS level2

Joint UNECE Eurostat Meeting 12 Each non-demographic variable has been individually crossed with sex and age (single ages). Hypercubes at NUTS level2 considered in the study (draft version, April 2009) More than one non-demographic variables have been crossed with sex and age (age classes). Hypercube computations are simulated with 2001 Census data Long Form variables

Joint UNECE Eurostat Meeting 13 Number of potential cells and acceptable cells for each hypercube considered in the study Number of potential cells = the product of the number of categories Number of acceptable cells = the number of potential cells without “structural zeros”

Joint UNECE Eurostat Meeting 14 Indicators of global accuracy Two indicators are proposed to measure the global accuracy of census data produced by adopting a sampling strategy and referred to a dissemination hypercube: 1) Percentage of critical cells = number of cell counts (>0) lower than the critical threshold T s / number of acceptable cells 2) Percentage of persons in critical cells = persons classified in critical cells / total of persons In particular, the second indicator quantifies the percentage of people classified in cells which will be estimated with a low accuracy (10% could be considered a tolerable limit).

Joint UNECE Eurostat Meeting 15 Example 1: Hypercube H.B1.E1.R3. Quality indicators related to NUTS2 areas of Italy: Molise, Marche and Sicilia MoliseMarcheSicilia Sampling ratio threshold Ts % of critical cells % of persons in critical cells threshold Ts % of critical cells % of persons in critical cells threshold Ts % of critical cells % of persons in critical cells 10% % % Hypercube H.B1.E1.R3: sex (2) by age (21) by current activity status (6) by industry (17) by educational attainment (7). Number of acceptable cells = 5,574 (no structural zeros). The cells are critical if the related absolute frequency is lower than the threshold T S observed in correspondence of cv_max =12.5%.

Joint UNECE Eurostat Meeting 16 Example 2: Hypercube H.B1.E1.R4. Quality indicators related to NUTS2 areas of Italy: Molise, Marche and Sicilia MoliseMarcheSicilia Sampling ratio threshold Ts % of critical cells % of persons in critical cells threshold Ts % of critical cells % of persons in critical cells threshold Ts % of critical cells % of persons in critical cells 10% % % Hypercube H.B1.E1.R4: sex (2) by age (13) by occupation (10) by industry (17) by educational attainment (7). Number of acceptable cells = 26,350 (no structural zeros). The cells are critical if the related absolute frequency is lower than the threshold T S observed in correspondence of cv_max =12.5%.

Joint UNECE Eurostat Meeting 17 Expected quality for hypercubes at NUTS level2 Distribution of all 20 Italian Nuts2 areas by percentage of persons classified in critical cells for the Eurostat hypercubes considered in the study and the three tested sampling ratios. * * * * * * * * *

Joint UNECE Eurostat Meeting 18 Concluding remarks  The adoption of a sampling strategy doesn’t seem to bring a reduction of accuracy.  The sampling error could have a considerable impact only to estimate very small frequencies.  NUTS2 hypercubes with different complexity could be estimated with good accuracy even for lower sampling ratios.  The revised version of the hypercubes considered in the work seems to be less detailed. This will hopefully bring more accuracy.

Joint UNECE Eurostat Meeting 19 Some solutions to enhance accuracy  Adopting small area estimators.  Increasing the set of variables to be observed on the whole population, reducing the set of variables that have to be surveyed on samples of households: adoption of a medium/long form. Enhancements of estimates regarding rare events and small domains in order to increase their efficiency and to reduce the number of critical cells.

Joint UNECE Eurostat Meeting 20 Thank you for your attention. -