Testing the use of administrative data to edit the 2009 Agriculture Census Dolores Lorca National Statistical Institute of Spain.

Slides:



Advertisements
Similar presentations
Innovation data collection: Advice from the Oslo Manual South East Asian Regional Workshop on Science, Technology and Innovation Statistics.
Advertisements

Innovation Surveys: Advice from the Oslo Manual South Asian Regional Workshop on Science, Technology and Innovation Statistics Kathmandu,
Innovation Surveys: Advice from the Oslo Manual National training workshop Amman, Jordan October 2010.
“Students” t-test.
Hypothesis Testing. To define a statistical Test we 1.Choose a statistic (called the test statistic) 2.Divide the range of possible values for the test.
One Sample T-tests One sample t-tests are used in the following two situations The size of a sample is less than 25 The population standard deviation is.
1 Incorporating Statistical Process Control and Statistical Quality Control Techniques into a Quality Assurance Program Robyn Sirkis U.S. Census Bureau.
ECONOMIC STATISTICS AND NATIONAL ACCOUNT IN ETHIOPIA By Sehin Merawi Central Statistical Agency of Ethiopia.
Editing and Imputing VAT Data for the Purpose of Producing Mixed- Source Turnover Estimates Hannah Finselbach and Daniel Lewis Office for National Statistics,
Business Statistics for Managerial Decision
© 2004 Prentice-Hall, Inc.Chap 1-1 Basic Business Statistics (9 th Edition) Chapter 1 Introduction and Data Collection.
Clustered or Multilevel Data
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview of Lecture Independent and Dependent Variables Between and Within Designs.
CHAPTER 8 Estimating with Confidence
Maintenance of Selective Editing in ONS Business Surveys Daniel Lewis.
Data Editing United Nations Statistics Division (UNSD) 16 March 2011 Santiago, Chile.
Social Research Methods
Vienna, 23 April 2008 UNECE Work Session on SDE Topic (v) Editing on results (post-editing) 1 Topic (v): Editing based on results Discussants: Maria M.
Improving Quality in the Office for National Statistics’ Annual Earnings Statistics Pete Brodie & Kevin Moore UK Office for National Statistics.
Improvements Of Sample Design For Rural Statistical Surveys In China Michael Steiner National Agricultural Statistics Service United States Department.
1 WORLD TOURISM ORGANIZATION (UNWTO) MEASURING TOURISM EXPENDITURE: A UNWTO PROPOSAL SESRIC-UNWTO WORKSHOP ON TOURISM STATISTICS AND THE ELABORATION OF.
Ch 10 Comparing Two Proportions Target Goal: I can determine the significance of a two sample proportion. 10.1b h.w: pg 623: 15, 17, 21, 23.
Ilie Dumitrescu National Institute of Statistics Third International Conference on Agricultural Statistics MEXSAI-ICAS-III Cancun- Mexico, 2-4 November.
Rudi Seljak, Metka Zaletel Statistical Office of the Republic of Slovenia TAX DATA AS A MEANS FOR THE ESSENTIAL REDUCTION OF THE SHORT-TERM SURVEYS RESPONSE.
Deliverable 2.6: Selective Editing Hannah Finselbach 1 and Orietta Luzi 2 1 ONS, UK 2 ISTAT, Italy.
Chapter 6 Lecture 3 Sections: 6.4 – 6.5.
Sampling is the other method of getting data, along with experimentation. It involves looking at a sample from a population with the hope of making inferences.
CHAPTER 14 Introduction to Inference BPS - 5TH ED.CHAPTER 14 1.
Scot Exec Course Nov/Dec 04 Survey design overview Gillian Raab Professor of Applied Statistics Napier University.
The application of selective editing to the ONS Monthly Business Survey Emma Hooper Office for National Statistics
Surveys on agricultural distribution and agricultural use of fertilizers in Italy Mario Adua ISTAT- National Institute of Statistics.
Jeroen Pannekoek - Statistics Netherlands Work Session on Statistical Data Editing Oslo, Norway, 24 September 2012 Topic (I) Selective and macro editing.
System of Economic Surveys in Egypt. Agenda Introduction Survey design stages What types of surveys are needed Challenges in surveying the informal sector.
European Conference on Quality in Official Statistics Session 26: Quality Issues in Census « Rome, 10 July 2008 « Quality Assurance and Control Programme.
15 may 2008 The validation of the french population census Olivier Lefebvre, Insee.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Stop the Madness: Use Quality Targets Laurie Reedman.
October 28-30, 2009 UNECE Geneva Quality Assessment of 2008 Integrated Census - Israel Pnina ZADKA Central Bureau of Statistics Israel.
CHAPTER 8 Estimating with Confidence
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
BPS - 3rd Ed. Chapter 131 Confidence Intervals: The Basics.
Quality Assurance Programme of the Canadian Census of Population Expert Group Meeting on Population and Housing Censuses Geneva July 7-9, 2010.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Section 8-2 Basics of Hypothesis Testing.
AP STATISTICS LESSON INFERENCE FOR A POPULATION PROPORTION.
Interval Estimation and Hypothesis Testing Prepared by Vera Tabakova, East Carolina University.
CBS-SSB STATISTICS NETHERLANDS – STATISTICS NORWAY Work Session on Statistical Data Editing Oslo, Norway, September 2012 Jeroen Pannekoek and Li-Chun.
1 Selective data editing Development & implementation Q 2010 Helsinki Jörgen Svensson Process Owner Statistics Sweden (SCB)
1 C. ARRIBAS, D. LORCA, A. SALINERO & A. COLMENERO Measuring statistical quality at the Spanish National Statistical Institute.
StatisticalData Editing Anders Norberg, Statistics Sweden (SCB)
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Section 7-3 Estimating a Population Mean: σ Known.
Survey Research. In Case of a System Glitch… After forming into your usual teams: –Create a brief survey that seeks to discern citizens’ attitudes about.
Measuring Disability: Results from the 2001 Census and the 2001 Post-Censal Disability Survey Statistics Canada January 10, 2003.
Chapter 6: 1 Sampling. Introduction Sampling - the process of selecting observations Often not possible to collect information from all persons or other.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
USE OF AERIAL FRAMES FOR AGRICULTURAL CENSUSES AND SURVEYS Fiji Experience.
The hidden side of successful story – implication of wide use of administrative data sources at national statistical institutes Metka Zaletel, Irena Križman.
Chapter 6 Lecture 3 Sections: 6.4 – 6.5. Sampling Distributions and Estimators What we want to do is find out the sampling distribution of a statistic.
Excursions in Modern Mathematics, 7e: Copyright © 2010 Pearson Education, Inc. 13 Collecting Statistical Data 13.1The Population 13.2Sampling.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Estimating with Confidence Section 12.1 Estimating a Population Proportion.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
Population vs Sample Population = The full set of cases Sample = A portion of population The need to sample: More practical Budget constraint Time constraint.
R&D statistics in Denmark organization of data collection, and dissemination of R&D statistics.
How to deal with quality aspects in estimating national results Annalisa Pallotti Short Term Expert Asa 3st Joint Workshop on Pesticides Indicators Valletta.
Improving the efficiency of editing in ONS business surveys
Elementary Statistics
Comparing Populations
Interval Estimation and Hypothesis Testing
The change of data sources in the Spanish SILC
Presentation transcript:

Testing the use of administrative data to edit the 2009 Agriculture Census Dolores Lorca National Statistical Institute of Spain

Summary A selective editing procedure is applied to test the use of administrative data to edit the Agriculture Census Study case: Using data from previous Agriculture Census and the 2005 Farm Structural Survey (FSS)

1. Introduction The Spanish NSI carries out every 10 years an Agriculture Census The large number of questionnaires collected by many interviewers during a short time about different kind of holdings can have quite errors to amend in editing

1. Introduction Different editing approaches are applied to the complex set of collected data: –In the data collection phase simple checks are applied using build-in edits in a CAPI system –Selective editing is applied to determine the units that will be manually reviewed –Automatic editing is applied to rest of units using DIA system –Macroediting

2. Selective editing procedure Using administrative data for editing census data Selective editing procedure: To determine and prioritise the suspect units to be manually reviewed

2. Selective editing procedure Using simple expansion estimators: W i is the sample weight for the unit i n is the sample size X i denotes the X variable value for the unit i

2. Selective editing procedure Local score function: X i is the reported value is the administrative value w i =1 ( census data)

2. Selective editing procedure Scaled local score is the total estimate of variable X calculated using administrative data

2. Selective editing procedure Global score GS i =max (LS i )

2. Selective editing procedure Selection of the thresholds: the simulation study approach by Lawrence and Mackenzie (2000): Absolute pseudo-bias of Latouche and Berthelot (1992) is the total estimate obtained by replacing all reported values with a global score larger than the pre- determined threshold (the percentile p) by their administrative values and leaving reported values in place for the others

2. Selective editing procedure Recontact rate: the number of units with a global score larger than the pre-determined threshold (percentile p) divided by the total number of units To choose the thresholds trying to balance between a low pseudo-bias and a low re- contact rate

3. Study case Selected variable: area of olive grove planted 1999 Agriculture Census and administrative register: 253,038 units 2005 FSS and administrative register: 5,804 units Selection of thresholds by geographical area: PB p p=95, 90, 85, 80, 75, 70 Recontacts for each p

3. Study case

With a re-contact rate of 5% the reduction of the pseudo-bias is much greater than the rest of rates At least, for most region, we would use the global score distribution percentile p=95 as threshold

4. Conclusions The pilot study shows that this selective editing approach could help us to prioritise follow-up actions for those units with significant discrepancies with administrative data