Migration of a large survey onto a micro-economic platform Val Cox April 2014.

Slides:



Advertisements
Similar presentations
1 « June, 6 and 7, 2007 Paris « Satellite Account for Education for Portugal: Implementation process and links with the National Accounts and Questionnaire.
Advertisements

Scope and method of pilot survey in China Yang kuan kuan Deputy director-general of office on Leading group of the Second National Economic Census under.
Statistics NZs experience in using Administrative Data in an Integrated Programme of Economic Vince Galvin General Manager Strategy & Communications.
A Statistical Architecture for Economic Statistics Ron McKenzie ICES III.
Evaluating the Effects of Business Register Updates on Monthly Survey Estimates Daniel Lewis.
Slide 1Slide Slide 1 International Conference on Establishment Surveys III Montreal June 18-21, 2007 United States Department of Agriculture National Agricultural.
Improved Questionnaire Design Yields Better Data: Experiences from the UKs Annual Survey of Hours and Earnings Jacqui Jones, Pete Brodie, Sarah Williams.
Simulating Publicly Subsidized Reinsurance Strategies In Three States Lisa Clemans-Cope, Ph.D. (presenter) Randall R. Bovbjerg, J.D. (PI for Reinsurance.
ESRC UK Longitudinal Studies Centre A Framework for Quality Profiles Nick Buck and Peter Lynn Institute for Social and Economic Research University of.
1 Session 10 Sampling Weights: an appreciation. 2 To provide you with an overview of the role of sampling weights in estimating population parameters.
SADC Course in Statistics Samples and Populations (Session 02)
Work Session on Statistical Data Editing Paris, France, April 2014 Topic (i): Selective editing / macro editing Experiences from Selective Editing.
Data Imputation United Nations Statistics Division (UNSD) 16 March 2011 Santiago, Chile.
New Paradigms for Measuring Savings
5.9 + = 10 a)3.6 b)4.1 c)5.3 Question 1: Good Answer!! Well Done!! = 10 Question 1:
Statistics 2020 and Platform Approach Te Käpehu Whetü May 2011.
TAJSTAT: Strengthening the National Statistical System Project Mustafa Dinc TLSS and MICS Conference Dushanbe, Tajikistan July 1, 2008.
UNECE Work Session on Statistical Data Editing Vienna April 2008 Topic ii – Editing Administrative Data and Combined Sources.
1 Editing Administrative Data and Combined Data Sources Introduction.
NLSCY – Non-response. Non-response There are various reasons why there is non-response to a survey  Some related to the survey process Timing Poor frame.
An Integrated Approach to Economic Statistics “ The Canadian Experience” UNSD – IBGE Workshop on Manufacturing Statistics Kevin Roberts Rio de Janeiro,
OECD Short-Term Economic Statistics Working PartyJune Analysis of revisions for short-term economic statistics Richard McKenzie OECD OECD Short.
Beyond 2011 – A new paradigm for population statistics? Pete Benton, Beyond 2011 Programme Director Office for National Statistics, UK.
Vienna, 23 April 2008 UNECE Work Session on SDE Topic (v) Editing on results (post-editing) 1 Topic (v): Editing based on results Discussants: Maria M.
Household Surveys ACS – CPS - AHS INFO 7470 / ECON 8500 Warren A. Brown University of Georgia February 22,
IAOS 2014 Da Nang: An agile approach to question testing and satisfying a new data requirement Pete Brodie ONS, UK.
Regional GDP Workshop. Purpose of the Project October Regional GDP Workshop Regional GDP Scope Annual Current price (nominal) GDP By region.
Combining administrative and survey data: potential benefits and impact on editing and imputation for a structural business survey UNECE Work Session on.
Administrative Data at Statistics Canada – Current Uses and the Way Forward 27 th Voorburg Group Meeting Warsaw, Poland André Loranger October 4, 2012.
Measuring the quality of regional estimates from the ABS Jennie Davies and Daniel Ayoubkhani.
Electronic reporting in Poland 27th Voorburg Group Meeting Warsaw, Poland October 1st to October 5th, 2012 Central Statistical Office of Poland.
1 Business Register: Quality Practices Eddie Salyers
The Canadian Integrated Approach to Economic Surveys Marie Brodeur, Peter Koumanakos, Jean Leduc, Éric Rancourt, Karen Wilson Statistics Canada International.
Rudi Seljak, Metka Zaletel Statistical Office of the Republic of Slovenia TAX DATA AS A MEANS FOR THE ESSENTIAL REDUCTION OF THE SHORT-TERM SURVEYS RESPONSE.
Improving the Design of UK Business Surveys Gareth James Methodology Directorate UK Office for National Statistics.
Use of Administrative Data in Statistics Canada’s Annual Survey of Manufactures Steve Matthews and Wesley Yung May 16, 2004 The United Nations Statistical.
All the answers? Statistics New Zealand’s Integrated Data Infrastructure Paper by Felibel Zabala, Rodney Jer, Jamas Enright and Allyson Seyb Presented.
Impact of using fiscal data on the imputation strategy of the Unified Enterprise Survey of Statistics Canada Ryan Chepita, Yi Li, Jean-Sébastien Provençal,
Integration of Annual Economic Collections – The Australian Experience ICESIII, Canada, 2007 Presented by Eden Brinkley.
Collecting Electronic Data From the Carriers: the Key to Success in the Canadian Trucking Commodity Origin and Destination Survey François Gagnon and Krista.
Eurostat Overall design. Presented by Eva Elvers Statistics Sweden.
Copyright 2010, The World Bank Group. All Rights Reserved. Managing processes Core business of the NSO Part 2 Strengthening Statistics Produced in Collaboration.
On Tap: Developments in Statistical Data Editing at Statistics New Zealand Paper by Allyson Seyb, Felibel Zabala and Les Cochran Presented by Felibel Zabala.
Topic (ii): New and Emerging Methods Maria Garcia (USA) Jeroen Pannekoek (Netherlands) UNECE Work Session on Statistical Data Editing Paris, France,
Editing a Mixture of Canadian 2006 Census and Tax Data Mike Bankier Statistics Canada 2006 Work Session on Statistical Data Editing
Transforming how we produce statistics – an inside perspective Michelle Feyen Statistics New Zealand October 2014.
Direction and system changes impacting on data editing and imputation at Statistics New Zealand Paper by Emma Bentley and Felibel Zabala, presented by.
Census Quality: another dimension! Paper for Q2008 conference, Rome Louisa Blackwell Quality Assurance Manager, 2011 Census.
Use of Administrative Data Seminar on Developing a Programme on Integrated Statistics in support of the Implementation of the SNA for CARICOM countries.
Editing of linked micro files for statistics and research.
Lyne Guertin Census Data Processing and Estimation Section Social Survey Methods Division Methodology Branch, Statistics Canada UNECE April 28-30, 2014.
SNA seminar in the Caribbean Integrated questionnaires Marie Brodeur Director General, Industry Statistics Branch, Statistics Canada St. Lucia February,
Outlining a Process Model for Editing With Quality Indicators Pauli Ollila (part 1) Outi Ahti-Miettinen (part 2) Statistics Finland.
Furthering the use of administrative data in sub-annual financial statistics International Association for Official Statistics Conference Da Nang, Viet.
Developing and applying business process models in practice Statistics Norway Jenny Linnerud and Anne Gro Hustoft.
Integrated Approach Processing Marie Brodeur Director General, Industry Statistics Branch, Statistics Canada St. Lucia February, 2014 SNA seminar in the.
Copyright 2010, The World Bank Group. All Rights Reserved. Economic statistics, part 2 Business statistics; core element of economic statistics 1 Business.
Process reengineering at Statistics Sweden Bo Sundgren
Elaborating on the Business Architecture of SN Robbert Renssen Statistics Netherlands Standard Process Steps.
First meeting of the Technical Cooperation Group for the Population and Housing Censuses in South East Europe Vienna, March 2010 POST-ENUMERATION.
Using administrative data to produce official social statistics New Zealand’s experience.
Administrative Data at Statistics Canada – Current Uses and the Way Forward Wesley Yung and Peter Lys, Statistics Canada.
Use of the Statistics New Zealand Business Register for the agriculture industry and the not for profit sector Geoff Mead
FDI - Imputation. Overview Introduction Overview of Imputation Methods Overview of Outliering methods Overview of Estimation methods Aggregation Disclosure.
4-6 September 2013, Vilnius Quality in Statistics: Administrative Data and Official Statistics USING ADMINISTRATIVE DATA SOURCES IN OFFICIAL.
Redesigning French structural business statistics, using more administrative data ICESIII, Montréal, june 2007.
Andris Fisenko and Jānis Lapiņš
Survey phases, survey errors and quality control system
Survey phases, survey errors and quality control system
Tomaž Špeh, Rudi Seljak Statistical Office of the Republic of Slovenia
Presentation transcript:

Migration of a large survey onto a micro-economic platform Val Cox April 2014

Micro-economic Platform (MEP) Standardises and automates processes - Provides more efficient processing, more analysis Enables Statistics NZ to gain more from available data - Basic principle: use administrative data wherever possible, with surveys filling the gaps - Objective: bring core information about every business in the economy into the Longitudinal Business DB to allow Statistics NZ to respond quickly to changing needs for economic statistics 2

Aim of paper To discuss the challenges of building a non- response imputation package for a large survey on the MEP - Rationalises the use of  Banff for outlier detection and imputation  SEVANI (System for Estimation of Variance due to Nonresponse and Imputation) to estimate sampling and non-sampling errors 3

Annual Enterprise Survey(AES) Provides statistics on the financial performance and position of New Zealand businesses - Captures about 90% of New Zealand's GDP Uses four different major data sources -Three administrative (covers 72% of the population) -One postal survey 4

AES before MEP 5

Editing strategy of AES on MEP Guided by the Methodological Standard for E&I Key objective of standard - Editing is fit-for-purpose and enables continuous improvement of processes and data quality Key principles used -Automate editing processes where possible -Use Statistics NZ standard editing tools, wherever possible, to achieve standardisation 6

Editing system of AES in MEP Uses Banff to automate and standardise editing and imputation processes Uses analytical views to assess the quality of the edited data 7

Challenges and solutions A.Sheer volume of data -28 questionnaires, 113 industries and 180 variables Solution: Use of a “thin slice” approach -Restrict dataset to one questionnaire and one industry to show all stages of E&I are working -Once successful, expand dataset to include more industries until all 28 questionnaires are replicated -Successful in determining optimal level of automation for correcting failed edits 8

Challenges and solutions B.Determining which variable is erroneous when groups of variables must add or subtract to a total - Banff “errorloc” procedure always recommends to change one variable by a large amount -Change is done by “deterministic” procedure Solution: Assign weights to variables -Assign lower weights to more reliable variables so Banff doesn’t change their values Examples: totals, gross profit, since respondents use this to determine the tax they pay 9

Challenges and solutions C.Outlier detection -Old system detects outlier in 3 key variables but unlinks whole unit (all variables) - Banff does univariate outlier detection Solution: Compared 2 E&I runs of data -1 st run had only the 3 key variables set as outliers and 2 nd had all variables included in outlier steps -Decision: Choose variables to be set as outliers based on the effect on the totals 10

Challenges and solutions D.Running imputation one variable at a time would have been very time-consuming Solution: Group variables -By imputation method (4 methods) -By industry (some industries have different characteristics) -By type of variable (e.g. some variables can be negative) 11

Challenges and solutions E.Imputation failed for some variables -Some imputation cells were too small Solution: Merged small imputation cells -Each imputation stage was run twice, the first without cell merging and the second with cell merging, resulting in 8 imputation stages -Use of a “catch-all” stage at the end (9 th stage) to carry out mean imputation by industry 12

Challenges and solutions F.Challenges with no solutions -Analysis of improvements in the E&I was slow as it took several hours to run E&I and write back to the main data storage area to view data in a cube -Attempt to replicate published results as closely as possible created a dilemma: When to stop trying? -What was the “right” answer? 13

SEVANI Provided a standardised and automated method to report on estimates of variances due to sampling as well as non-response and imputation Challenges: - Can produce output for one variable at a time - SEVANI required a lot of parameters to set-up - MEP is unit-based so can’t easily output SEVANI results Solution: -Use of a macro to identify variable names -Created a SAS code to set-up parameters -Output SEVANI results outside MEP 14

Next steps Educate the users of the new system on MEP Identify potential areas to make improvements in the editing and imputation system Create a new MEP collection for Charities data to include its own editing and imputation system 15