Sampling Strategy for Establishment Surveys International Workshop on Industrial Statistics Beijing, China, 8-10 July 2013.

Slides:



Advertisements
Similar presentations
Multiple Indicator Cluster Surveys Survey Design Workshop
Advertisements

Multiple Indicator Cluster Surveys Survey Design Workshop
Innovation data collection: Advice from the Oslo Manual South East Asian Regional Workshop on Science, Technology and Innovation Statistics.
Innovation Surveys: Advice from the Oslo Manual South Asian Regional Workshop on Science, Technology and Innovation Statistics Kathmandu,
Innovation Surveys: Advice from the Oslo Manual National training workshop Amman, Jordan October 2010.
Unido.org/statistics International workshop on industrial statistics 8 – 10 July, Beijing Non response in industrial surveys Shyam Upadhyaya.
Outline Major programmes for collecting industrial statistics Decennial Census of Industries Annual survey of industries Survey of construction industries.
Determining How to Select a Sample
Estimates and sampling errors for Establishment Surveys International Workshop on Industrial Statistics Beijing, China, 8-10 July 2013.
Sampling Frames for Establishment Surveys International Workshop on Industrial Statistics Beijing, China, 8-10 July 2013.
Unido.org/statistics Effect of the Cut-off Size on International Comparability of Industrial Statistics Shyam Upadhyaya International workshop on industrial.
Riku Salonen Regression composite estimation for the Finnish LFS from a practical perspective.
Multiple Indicator Cluster Surveys Survey Design Workshop
Ch 4: Stratified Random Sampling (STS)
Who and How And How to Mess It up
Sampling.
Sampling.
Sampling and Sample Size Determination
Chapter 11 Sampling Design. Chapter 11 Sampling Design.
STAT262: Lecture 5 (Ratio estimation)
A new sampling method: stratified sampling
Stratified Simple Random Sampling (Chapter 5, Textbook, Barnett, V
STAT 4060 Design and Analysis of Surveys Exam: 60% Mid Test: 20% Mini Project: 10% Continuous assessment: 10%
1 Marketing Research Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides.
Formalizing the Concepts: STRATIFICATION. These objectives are often contradictory in practice Sampling weights need to be used to analyze the data Sampling.
Sampling Design.
Sampling Concepts Population: Population refers to any group of people or objects that form the subject of study in a particular survey and are similar.
Sampling Designs and Sampling Procedures
Key terms in Sampling Sample: A fraction or portion of the population of interest e.g. consumers, brands, companies, products, etc Population: All the.
Sample Design.
United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan,
Copyright 2010, The World Bank Group. All Rights Reserved. Agricultural Census Sampling Frames and Sampling Section A 1.
Determining the Sampling Plan
Sample Design Establishments Surveys Stuart Brown Research, Design & Evaluation January 2013 STATISTICAL INSTITUTE OF JAMAICA.
Definitions Observation unit Target population Sample Sampled population Sampling unit Sampling frame.
Near East Regional Workshop - Linking Population and Housing Censuses with Agricultural Censuses. Amman, Jordan, June 2012 Improving Efficiency.
Sampling: Theory and Methods
Multiple Indicator Cluster Surveys Survey Design Workshop Sampling: Overview MICS Survey Design Workshop.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Central egency for public mobilization and statistics.
Scot Exec Course Nov/Dec 04 Survey design overview Gillian Raab Professor of Applied Statistics Napier University.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Bangkok,
Optimum sampling strategy for National Households In Abu Dhabi (The Household Master Sample 2012) Authors : Mohammed Al Rifai (Ph.D.) Mariam.
Optimum sampling strategy for National Households In Abu Dhabi (The Household Master Sample 2012) Authors : Mohammed Al Rifai (Ph.D.) Mariam.
ICES 2007 Labour Cost Index and Sample Allocation Outi Ahti-Miettinen and Seppo Laaksonen Statistics Finland (+ University of Helsinki) Labour cost index.
Copyright 2010, The World Bank Group. All Rights Reserved. Part 1 Sample Design Produced in Collaboration between World Bank Institute and the Development.
Chapter 6: 1 Sampling. Introduction Sampling - the process of selecting observations Often not possible to collect information from all persons or other.
Improving of Household Sample Surveys Data Quality on Base of Statistical Matching Approaches Ganna Tereshchenko Institute for Demography and Social Research,
McGraw-Hill/IrwinCopyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved. SAMPLING Chapter 14.
Statistics Canada Citizenship and Immigration Canada Methodological issues.
STATISTICAL DATA GATHERING: Sampling a Population.
Sampling technique  It is a procedure where we select a group of subjects (a sample) for study from a larger group (a population)
Quarterly and monthly establishment surveys International Workshop on Industrial Statistics Beijing, China, 8-10 July 2013.
1. 2 DRAWING SIMPLE RANDOM SAMPLING 1.Use random # table 2.Assign each element a # 3.Use random # table to select elements in a sample.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition Instructor’s Presentation Slides 1.
Sampling Concepts Nursing Research. Population  Population the group you are ultimately interested in knowing more about “entire aggregation of cases.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Addis.
Sampling Design and Procedure
Variability. The differences between individuals in a population Measured by calculations such as Standard Error, Confidence Interval and Sampling Error.
Variability.
Sampling.
Graduate School of Business Leadership
Meeting-6 SAMPLING DESIGN
Basic Sampling Issues.
2. Stratified Random Sampling.
Random sampling Carlo Azzarri IFPRI Datathon APSU, Dhaka
Sampling and estimation
SMALL AREA ESTIMATION FOR CITY STATISTICS
Presentation transcript:

Sampling Strategy for Establishment Surveys International Workshop on Industrial Statistics Beijing, China, 8-10 July 2013

Annual and Quarterly/Monthly Surveys Annual establishment surveys Annual establishment surveys Generally larger sample size to cover all sectorsGenerally larger sample size to cover all sectors Results needed for national accounts, planning for economic developmentResults needed for national accounts, planning for economic development Quarterly/monthly surveys Quarterly/monthly surveys Smaller sample sizeSmaller sample size May be based on subsample of annual survey, or cut-off samplingMay be based on subsample of annual survey, or cut-off sampling Designed to measure short-term trendsDesigned to measure short-term trends

Stratification of Sampling Frame Stratification is key concept in sampling Stratification is key concept in sampling Sampling frame divided into homogeneous groups (strata) Sampling frame divided into homogeneous groups (strata) Sample selection is independent within each stratum Sample selection is independent within each stratum Variance is calculated only within strata, not between strata Variance is calculated only within strata, not between strata Stratification can reduce the sampling error of survey estimates considerably, especially for economic surveys Stratification can reduce the sampling error of survey estimates considerably, especially for economic surveys

Stratification criteria For business register or list frame of establishments, main variables used for stratification are: For business register or list frame of establishments, main variables used for stratification are: ISIC groupISIC group Number of employeesNumber of employees Other measure of size (such as revenue)Other measure of size (such as revenue) ISIC groups can be defined at different levels (2 to 4 digits) ISIC groups can be defined at different levels (2 to 4 digits) Some 4-digit ISICs with many establishments may be individual stratumSome 4-digit ISICs with many establishments may be individual stratum Group similar ISICs that have less establishmentsGroup similar ISICs that have less establishments

Stratification criteria (continued) Stratification by region also possible depending on domains of analysis Stratification by region also possible depending on domains of analysis ISIC and size strata within each regionISIC and size strata within each region Increases sample size considerably – minimum sample size required for each regionIncreases sample size considerably – minimum sample size required for each region

Stratification by size Size of establishment important to provide efficient allocation of sample Size of establishment important to provide efficient allocation of sample Size strata defined within ISIC group Size strata defined within ISIC group Establish certainty stratum for largest establishments, such as 20+ or 50+ employees Establish certainty stratum for largest establishments, such as 20+ or 50+ employees Certainty cut-off may vary by ISIC groupCertainty cut-off may vary by ISIC group

Stratification by size (continued) Establish other strata by reasonable ranges of employment size, for example: Establish other strata by reasonable ranges of employment size, for example: 1 employee1 employee 2-4 employees2-4 employees 5-9 employees5-9 employees employees10-19 employees employees20-49 employees 50+ employees50+ employees Sampling rates will vary by size stratum Sampling rates will vary by size stratum

Misclassification or Change in ISIC or Size Group Sometimes economic activity may be misclassified in sampling frame Sometimes economic activity may be misclassified in sampling frame It is possible that an establishment may change the predominant activity It is possible that an establishment may change the predominant activity Number of employees can vary over time Number of employees can vary over time Probabilities of selection based on original frame Probabilities of selection based on original frame

Change in ISIC or size group (continued) Important to calculate weights and sampling errors based on ORIGINAL stratification Important to calculate weights and sampling errors based on ORIGINAL stratification Possible to post-stratify the sample by the correct ISIC and size groups for the tabulations Possible to post-stratify the sample by the correct ISIC and size groups for the tabulations Both sets of classification codes should be included in the economic survey data file Both sets of classification codes should be included in the economic survey data file

Sample size Sample size depends on the level of precision required for survey estimates as well as resource constraints Sample size depends on the level of precision required for survey estimates as well as resource constraints Precision measured by sampling error or coefficient of variation (CV) Precision measured by sampling error or coefficient of variation (CV) Sampling errors inversely proportional to square root of sample sizeSampling errors inversely proportional to square root of sample size

Sample size (continued) Required level of precision should be specified for each domain that will be included in the tabulations Required level of precision should be specified for each domain that will be included in the tabulations Accuracy also depends on bias from nonsampling errors Accuracy also depends on bias from nonsampling errors Nonsampling errors generally increase with the sample size, since quality and operational control become more difficult Nonsampling errors generally increase with the sample size, since quality and operational control become more difficult

Calculate sampling errors for estimates from previous surveys Previous economic survey data very useful for determining required sample size Previous economic survey data very useful for determining required sample size Estimate sampling errors for most important indicators by domain Estimate sampling errors for most important indicators by domain Identify domains where it is necessary to increase or decrease the sample size Identify domains where it is necessary to increase or decrease the sample size

Use of sampling frame data for estimating expected precision If sampling frame has data on number of employees, revenue or other variables, possible to estimate expected level of precision If sampling frame has data on number of employees, revenue or other variables, possible to estimate expected level of precision Simulation study to calculate the sampling error for each domain based on a specific sample size Simulation study to calculate the sampling error for each domain based on a specific sample size Use the information from the frame for the final sample to estimate the sampling errors Use the information from the frame for the final sample to estimate the sampling errors Possible to adjust the final sample size for some domains if necessaryPossible to adjust the final sample size for some domains if necessary

Sample allocation First level of sample allocation depends on the domains of analysis for the survey tables First level of sample allocation depends on the domains of analysis for the survey tables Minimum sample required for each ISIC group that will appear in tables Minimum sample required for each ISIC group that will appear in tables Examine the distribution of frame of establishments by ISIC group and size Examine the distribution of frame of establishments by ISIC group and size Any rare but important ISIC group can be included in the sample with certainty Any rare but important ISIC group can be included in the sample with certainty

Sample allocation (continued) After establishing size cut-off for certainty stratum (for example, 50+ employees, determine total number of certainty establishments After establishing size cut-off for certainty stratum (for example, 50+ employees, determine total number of certainty establishments Subtract number of certainty establishments from maximum sample size to determine sample to be allocated to remaining strata Subtract number of certainty establishments from maximum sample size to determine sample to be allocated to remaining strata

Sample allocation (continued) Some countries determine a practical and approximately optimum sampling rate by size stratum Some countries determine a practical and approximately optimum sampling rate by size stratum Example: Example: Sampling Rate Employment Size Stratum (Number of Persons engaged) %5%10%25%50%100%

Optimum allocation Neyman optimum allocation Neyman optimum allocation Used when costs per unit are similar for each stratum Used when costs per unit are similar for each stratum Formula: Formula:

Example of Neyman allocation ParameterSize Stratum (Number of Persons Engaged) Total NhNh 3,26311,3318,06022,654 ShSh N h x S h Optimum n h ,120 Sampling Rate1.5%4.4%7.1%

Sample allocation (continued) Sampling rates by stratum from Neyman allocation used as guideline to determine sampling rates by size for each ISIC group Sampling rates by stratum from Neyman allocation used as guideline to determine sampling rates by size for each ISIC group Ensure that each ISIC group has sufficient number of observations for reliable estimates Ensure that each ISIC group has sufficient number of observations for reliable estimates

Cut-off sampling Select only establishments above a certain size category Select only establishments above a certain size category Remaining establishments below cut-off excluded from sampling frame Remaining establishments below cut-off excluded from sampling frame Not a probability sample Not a probability sample The establishments above cut-off may represent 90% or more of the employment in the frame The establishments above cut-off may represent 90% or more of the employment in the frame Sometimes used for monthly indicators of trends in production for largest manufacturing establishments, for example Sometimes used for monthly indicators of trends in production for largest manufacturing establishments, for example

Sample selection procedures – list frame from census or business register Separate all the establishments to be included in the sample with certainty Separate all the establishments to be included in the sample with certainty Sort the remaining establishments by ISIC group and size stratum, perhaps also by geography or number of employees to provide further implicit stratification Sort the remaining establishments by ISIC group and size stratum, perhaps also by geography or number of employees to provide further implicit stratification Separate individual ISIC group and size strata Separate individual ISIC group and size strata Select sample systematically with equal probability Select sample systematically with equal probability Use sampling rate established for the stratum Use sampling rate established for the stratum

Selection of establishments with probability proportional to size PPS sampling procedure used less frequently for selecting establishments PPS sampling procedure used less frequently for selecting establishments Within each ISIC group stratum, establishments can be selected with PPS, based on employment, revenue or another measure of size Within each ISIC group stratum, establishments can be selected with PPS, based on employment, revenue or another measure of size

PPS selection (continued) Advantage – larger establishments selected with higher probability Advantage – larger establishments selected with higher probability Disadvantage – weighting procedures more complex Disadvantage – weighting procedures more complex Weights vary by establishmentWeights vary by establishment Stratification by size may be simpler to implement Stratification by size may be simpler to implement

Sample selection procedures – area sample Determine measure of size for each segment (enumeration area) Determine measure of size for each segment (enumeration area) Total number of establishments (excluding those covered in list frame)Total number of establishments (excluding those covered in list frame) Total number of employeesTotal number of employees For each ISIC group, calculate the proportion of employees or revenue in each EA, and sum across EAs to obtain measure of sizeFor each ISIC group, calculate the proportion of employees or revenue in each EA, and sum across EAs to obtain measure of size Within each geographic stratum, select sample of EAs systematically with PPS Within each geographic stratum, select sample of EAs systematically with PPS

Selection of areas (continued) Conduct a listing of all establishments in each sample EA Conduct a listing of all establishments in each sample EA Match listing to business register to exclude establishments included in list frame Match listing to business register to exclude establishments included in list frame Select sample establishments systematically with equal probability within each EA, stratified at second stage by major economic activities Select sample establishments systematically with equal probability within each EA, stratified at second stage by major economic activities