Enumeration using frozen versions (based on slides produced by Peter Stoltze, Chief Consultant, Statistical Methods, SD)


Foundation of the Statistics
The definition of the population is essential for the interpretation of the statistics.
If we do not have a firm grip on the population, everything else is unimportant!
The SBR is central in this respect:
  Updated by administrative sources
  Updated by information from different Statistical Divisions, a benefit for all Statistical Divisions

Definition of the Population (1)
Population of interest: the collection of objects in which we are interested.
  Example: All businesses in Ukraine
Target population: the section of the population of interest that we, for practical reasons, must confine ourselves to observing.
  Example: All businesses with at least 10 employees
Sampling frame: the data representation of the target population available to us; it is from here that the sample is drawn.
  Example: Extracts from SBR

Definition of the Population (2)
[Figure labels: Sample, Sampling frame, Target population, Population of interest]

Frame Imperfections
The difference between the target population and the sampling frame arises because our registers are not perfect.
Over-coverage: Businesses which are included in the sampling frame, but ought not to be included
  Can be discovered during data collection
  Example: The business went bankrupt long before the starting date of the reference period
Under-coverage: Businesses which ought to be included in the sampling frame, but are not included
  Can be discovered if we have knowledge of the area via other sources
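
The two kinds of frame imperfection can be read as set differences between the target population and the sampling frame. The sketch below is only an illustration: the unit identifiers are hypothetical and do not come from the slides, and in practice the target population is not known unit by unit.

```python
# Hypothetical unit identifiers; neither set comes from the slides.
target_population = {"A", "B", "C", "D", "E"}    # units that ought to be covered
sampling_frame = {"B", "C", "D", "E", "F", "G"}  # units present in the SBR extract

# Over-coverage: in the sampling frame, but ought not to be included
over_coverage = sampling_frame - target_population

# Under-coverage: ought to be included, but missing from the sampling frame
under_coverage = target_population - sampling_frame

print(sorted(over_coverage))   # ['F', 'G']
print(sorted(under_coverage))  # ['A']
```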

Estimation based on an updated population
Design weights are sacred
Selection probabilities are sacred
Stratum changes should be handled by calibration and domain estimation
Estimation may account for cut-off sampling

Dynamic Frame Population
[Figure: current and historic versions of the frame population at Time t, Time t+1 and Time t+2; axis marked t, t+1, t+2]

Frozen Frame Population
[Figure: current, historic and frozen versions of the frame population at Time t to Time t+4; axis marked t, t+1, t+2]

Population at Estimation stage
[Figure: current, historic and frozen versions together with the sample; estimation of the structural survey and estimation of the short-term survey; axis marked t, t+1, t+2]

SBS statistics (all kinds) (1)
Purpose:
  Give information about the structure
  Be able to compare across statistics
When: Year t (a period) or Ultimo t (a point in time)
Based on: A survey
  100 %   Big enterprises
  50 % ?  Medium-sized enterprises
  25 % ?  Small enterprises
  0 % ?   Micro enterprises
  possibly divided into sub-strata
The survey is drawn on the basis of a frozen SBR version, 15th Nov year t
The survey is carried out during e.g. March-June t+1
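
A minimal sketch of drawing such a survey from a frozen frame, assuming simple random sampling without replacement within each size class. The sampling fractions are the tentative values from the slide (all but the first are marked "?"), and the function name, the frame and the unit identifiers are hypothetical.

```python
import random

# Sampling fractions per size class; tentative on the slide, illustrative here.
SAMPLING_FRACTIONS = {"big": 1.00, "medium": 0.50, "small": 0.25, "micro": 0.00}


def draw_stratified_sample(frame, fractions, seed=0):
    """Draw a simple random sample without replacement within each size class.

    `frame` is a list of (unit_id, size_class) pairs taken from a frozen version.
    Returns a dict mapping each selected unit_id to its design weight N_h / n_h.
    """
    rng = random.Random(seed)
    strata = {}
    for unit_id, size_class in frame:
        strata.setdefault(size_class, []).append(unit_id)

    weights = {}
    for size_class, units in strata.items():
        n_h = round(fractions[size_class] * len(units))
        if n_h == 0:
            continue  # cut-off stratum: no units are selected
        for unit_id in rng.sample(units, n_h):
            weights[unit_id] = len(units) / n_h
    return weights


# Hypothetical frozen frame: 4 big, 10 medium, 20 small and 50 micro enterprises
frozen_frame = (
    [(f"B{i}", "big") for i in range(4)]
    + [(f"M{i}", "medium") for i in range(10)]
    + [(f"S{i}", "small") for i in range(20)]
    + [(f"X{i}", "micro") for i in range(50)]
)
design_weights = draw_stratified_sample(frozen_frame, SAMPLING_FRACTIONS)
```

With this frame the sample holds 4 + 5 + 5 = 14 units, and the design weights sum to 34, the number of frame units outside the cut-off stratum.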

SBS statistics (2)
New information on year t requires updating of the frozen SBR
What?
  All enterprises and other units (e.g. LKAU) active during year t have to be in the frozen version
  All relevant changes/corrections (that is, changes related to year t and not t+1) have to be in the frozen version
  But be aware of possible bias: not only information from surveys has to be taken in
  What could be the sources for updating the SBR for year t?
When?
  Before the first SBS statistic is produced
  Hopefully this is also when the information is available
How should it be interpreted that not only information from surveys is to be incorporated?

SBS statistics (3)
And now to the Enumeration
The sample was drawn 15th Nov t
The new frozen version is formed ddmmyy year t+1?
Principle:
  At the estimation stage we discover that a unit selected in stratum ha with π = 0.1 has moved to stratum hb
  We then have to believe that 9 other (unobserved) units from ha have made a similar move
  Instead of changing the selection probabilities, the combinations Activity*Size are regarded as domains, and calibration is conducted on the basis of these new domains
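
The principle can be sketched with a small worked example: each unit keeps the design weight 1/π from the stratum in which it was selected, and estimates are formed per domain as observed at the estimation stage. All units, probabilities and turnover figures below are hypothetical, and only the domain-estimation step is shown; the subsequent calibration to the new domains is not.

```python
# Hypothetical sample: each unit keeps the selection probability of the
# stratum it was drawn in, even if it has since moved to another domain.
sample = [
    # (unit, stratum at selection, pi, domain at estimation, turnover)
    ("u1", "ha", 0.1, "ha", 120.0),
    ("u2", "ha", 0.1, "hb", 300.0),  # moved from ha to hb after selection
    ("u3", "hb", 0.5, "hb", 900.0),
    ("u4", "hb", 0.5, "hb", 700.0),
]


def domain_estimates(sample):
    """Pi-expansion estimates of unit count and turnover total per domain."""
    counts, totals = {}, {}
    for _unit, _stratum, pi, domain, turnover in sample:
        weight = 1.0 / pi  # design weight, never recomputed after a move
        counts[domain] = counts.get(domain, 0.0) + weight
        totals[domain] = totals.get(domain, 0.0) + weight * turnover
    return counts, totals


counts, totals = domain_estimates(sample)
```

Here unit u2 represents itself and 9 unobserved units that are assumed to have made the same move, so domain hb is estimated at 10 + 2 + 2 = 14 units and domain ha at 10 units.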

SBS statistics (4)
And what does that mean?
(The table has been removed because it is not as simple as it was shown. Regression analysis has to be used. What is important is to know the population at the time of enumeration! See theory!!)

SBS statistics (5)
A few names:
Horvitz-Thompson estimator or pi-expansion
  The design weights summed over the sample within a stratum have to equal the size of the stratum
Calibration can be implemented in the form of a regression estimator
  SD uses SCB CLAN survey (a collection of Swedish macros for SAS, http://www.amstat.org/meetings/ices/2000/proceedings/S09.pdf)
  Other possibilities exist, e.g. the survey package for R by Thomas Lumley
  Google: "regression estimator sampling", "model assisted survey sampling" or "SCB CLAN survey"
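
For reference, the estimators named above can be written out in standard textbook notation. The symbols are assumptions of this sketch and do not appear on the slides: s is the sample, s_h the sample in stratum h, d_k = 1/π_k the design weight, x_k a vector of auxiliary variables with known population total t_x, and simple random sampling within strata is assumed for the second line.

```latex
\begin{align*}
  % Horvitz-Thompson (pi-expansion) estimator of the total of y
  \hat{t}_{y\pi} &= \sum_{k \in s} d_k \, y_k, \qquad d_k = \frac{1}{\pi_k} \\
  % With \pi_k = n_h / N_h in stratum h, the design weights sum to the stratum size
  \sum_{k \in s_h} d_k &= n_h \cdot \frac{N_h}{n_h} = N_h \\
  % Regression estimator: the pi-expansion estimate plus a correction term
  \hat{t}_{yr} &= \hat{t}_{y\pi} + \bigl(t_x - \hat{t}_{x\pi}\bigr)^{\top} \hat{B},
  \qquad
  \hat{B} = \Bigl(\sum_{k \in s} d_k \, x_k x_k^{\top}\Bigr)^{-1} \sum_{k \in s} d_k \, x_k \, y_k \\
  % The same estimator written with calibrated weights w_k
  \hat{t}_{yr} &= \sum_{k \in s} w_k \, y_k,
  \qquad
  w_k = d_k \Bigl[\, 1 + \bigl(t_x - \hat{t}_{x\pi}\bigr)^{\top}
        \Bigl(\sum_{l \in s} d_l \, x_l x_l^{\top}\Bigr)^{-1} x_k \Bigr]
\end{align*}
```

The calibrated weights reproduce the known totals exactly, that is, the sum of w_k x_k over the sample equals t_x. When x_k consists of indicators for the Activity*Size domains, this reduces to scaling the design weights within each domain to the domain size in the updated frozen version.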

SBS statistics (6)
Problems
How do you get to know the 'correct' population when frozen version 2 is formed?
How do you distribute between strata?
  But it is risky to include only information from surveys
New units should not be included in the sample
Deceased units have to be placed in the stratum for deceased units so that they get a weight, but it could be tricky to estimate the size of that stratum (it depends on whether the information is from the survey and not from the population (frozen version))

STS statistics (1)
Purpose: Give information about development
When: Quarter x Year t+1 (a period) or Ultimo quarter x Year t+1 (a point in time)
Based on: A survey
  100 %   Big enterprises
  50 % ?  Medium-sized enterprises
  25 % ?  Small enterprises
  0 % ?   Micro enterprises
  divided into sub-strata
The survey is drawn on the basis of a frozen SBR version, 15th Nov year t
The survey is carried out during April t+1, July t+1, October t+1 and January t+2

STS statistics (2)
What is the problem?
What about new enterprises?
  In year t (from 15th Nov to 31st Dec)
What about any change from 31st Dec year t to April, July, … t+1(2)?

STS statistics (3)
What is the solution?
Two possibilities:
1. Keep the frozen version and look at changes
  Disadvantage: this does not take into account new enterprises
  Advantage: easy
2. Make new frozen versions for each quarter (or even use the current version of SBR*) and continue as for SBS
  Disadvantages: time consuming; what are the sources for producing new versions?; in addition, those mentioned for SBS
  Advantage: More correct description of the development
Either possibility makes it possible to compare SBS and STS
* But it is important to know the whole population and be able to distribute it to strata

Frozen versions Statistics Denmark
SBS year t
  Version 1 (Temporary): t+1 5th March (Turnover/Employees 15th March)
  Version 2 (Temporary): t+1 5th Sept. (Turnover/Employees 15th Sept.)
  Version 3 (Final): t+1 5th Dec. (Turnover/Employees 15th Dec.)
STS 1st quarter year t+1
  Version 1 (Temporary): t+1 5th May (Turnover/Employees 15th May)
  Version 2 (Final): t+1 5th Aug. (Turnover/Employees 15th Aug.)
Samples might be drawn from any version before enumeration