Sampling and estimation

Slides:



Advertisements
Similar presentations
Innovation data collection: Methodological procedures & basic forms Regional Workshop on Science, Technology and Innovation (STI) Indicators.
Advertisements

Innovation data collection: Advice from the Oslo Manual South East Asian Regional Workshop on Science, Technology and Innovation Statistics.
Innovation Surveys: Advice from the Oslo Manual South Asian Regional Workshop on Science, Technology and Innovation Statistics Kathmandu,
Innovation Surveys: Advice from the Oslo Manual National training workshop Amman, Jordan October 2010.
Introduction Simple Random Sampling Stratified Random Sampling
Sampling Strategy for Establishment Surveys International Workshop on Industrial Statistics Beijing, China, 8-10 July 2013.
Sampling.
Sampling and Sample Size Determination
Chapter 17 Additional Topics in Sampling
Chapter 11 Sampling Design. Chapter 11 Sampling Design.
Formalizing the Concepts: STRATIFICATION. These objectives are often contradictory in practice Sampling weights need to be used to analyze the data Sampling.
Sampling Concepts Population: Population refers to any group of people or objects that form the subject of study in a particular survey and are similar.
Determining the Sampling Plan
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 13.
Sampling: Theory and Methods
Multiple Indicator Cluster Surveys Survey Design Workshop Sampling: Overview MICS Survey Design Workshop.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Agricultural and Biological Statistics. Sampling and Sampling Distributions Chapter 5.
Population and Sampling
CHAPTER 12 DETERMINING THE SAMPLE PLAN. Important Topics of This Chapter Differences between population and sample. Sampling frame and frame error. Developing.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
STANDARD ERROR Standard error is the standard deviation of the means of different samples of population. Standard error of the mean S.E. is a measure.
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
Sampling Methods. Probability Sampling Techniques Simple Random Sampling Cluster Sampling Stratified Sampling Systematic Sampling Copyright © 2012 Pearson.
7.1Sampling Methods 7.2Introduction to Sampling Distribution 7.0 Sampling and Sampling Distribution.
Eurostat Weighting and Estimation. Presented by Loredana Di Consiglio Istituto Nazionale di Statistica, ISTAT.
Sampling and Statistical Analysis for Decision Making A. A. Elimam College of Business San Francisco State University.
 When every unit of the population is examined. This is known as Census method.  On the other hand when a small group selected as representatives of.
RESEARCH METHODS Lecture 28. TYPES OF PROBABILITY SAMPLING Requires more work than nonrandom sampling. Researcher must identify sampling elements. Necessary.
Institute of Professional Studies School of Research and Graduate Studies Selecting Samples and Negotiating Access Lecture Eight.
AC 1.2 present the survey methodology and sampling frame used
Enumeration using frozen versions (based on slides produced by Peter Stoltze, Chief Consultant, Statistical Methods, SD)
Sample design strategies
Sampling Chapter 5.
Sampling.
Marketing Research Aaker, Kumar, Leone and Day Eleventh Edition
2a. WHO of RESEARCH Quantitative Research
Sampling Why use sampling? Terms and definitions
Social Research Methods
RESEARCH METHODS Lecture 28
Graduate School of Business Leadership
SAMPLING (Zikmund, Chapter 12.
Meeting-6 SAMPLING DESIGN
Sampling: Design and Procedures
Sampling: Theory and Methods
Social Research Methods
STRATIFIED SAMPLING.
Dublin, april 2012 Role of Business Register in coordinated sampling
Sampling Design.
An Active Collection using Intermediate Estimates to Manage Follow-Up of Non-Response and Measurement Errors Jeannine Claveau, Serge Godbout and Claude.
Chapter 7 Sampling Distributions
2. Stratified Random Sampling.
Variables and Measurement (2.1)
SAMPLING.
Sampling Sampling relates to the degree to which those surveyed are representative of a specific population The sample frame is the set of people who have.
Sampling Lecture 10.
Estimation of Sampling Errors, CV, Confidence Intervals
2. Stratified Random Sampling.
Random sampling Carlo Azzarri IFPRI Datathon APSU, Dhaka
Sampling Sampling relates to the degree to which those surveyed are representative of a specific population The sample frame is the set of people who have.
Chapter 8: Weighting adjustment
Sampling and Sample Size Calculations
SAMPLING (Zikmund, Chapter 12).
Sampling and Power Slides by Jishnu Das.
The European Statistical Training Programme (ESTP)
Chapter: 9: Propensity scores
Istat - Structural Business Statistics
The European Statistical Training Programme (ESTP)
Social Research Methods
Metode Penelitian Pertemuan 10.
Presentation transcript:

Sampling and estimation ESTP course on SBS, 17 – 19 October 2012 Photis Stavropoulos

Probability sample surveys Sample survey: the collection of information in an organised and methodical manner from a fraction (the sample) of the units of a population. Probability sample survey: the members of the sample are selected at random, with known probabilities of selection. Material borrowed from Statistics Canada (2003) Survey methods and practices. Ottawa: Statistics Canada. This slide replaces the earlier slide about what is survey methodology.

Population and sample Target population Example: all enterprises with principal activity ‘manufacture of leather and related products’ Frame population Survey population Sample

Stratified random sampling (StrRS) Stratification variables: Activity Size class by: employment turnover Additional: region Frame population Stratification by Activity Stratification by Size Class 1. Removed the bullet point about combination of survey with administrative data. Moved StrRS to the heading. 2. I did not understand the term “unplanned domain” next to stratum h and I removed it. For me domain is a sub-population therefore the stratum seems planned, i.e. you know how many members of it will be in the sample. Stratum h Sample drawn by Simple Random Sampling (SRS)

Applicability of StrRS The variables of interest (e.g. purchases) vary widely across elements of population The enterprises can be grouped (e.g. by number of employees) and within groups (strata) their variables of interest (e.g. purchases) are close in value Stratification variables (e.g. number of employees) are available and of good quality in the frame

Pro’s and con’s of StrRS Lower sampling errors as compared to SRS All strata are represented in the sample Frame population is already divided into sub-populations Well suited to business surveys Stratification criteria have to be adequate Highly reliant on register data Some strata may be too small 1. Added a fourth advantage od StRS, that it is suitable for skewed business populations (I will mention skewness while speaking).

Sampling and estimation Population in stratum h = Nh Sample in stratum h = nh= gross sample Inclusion probability of units of stratum h: Sampling weight in stratum h:

Estimation Estimation of totals (e.g. total value of purchases): One term per stratum. Example for stratum 1 (h=1): N1 = 10; n 1 = 5 (small numbers only for simplification of the example) y (value of purchases) for the 5 sampled enterprises = 4; 6; 10; 20; 10 Horvitz-Thomson estimator Interpretation: each sampled enterprise represents enterprises within each stratum

The importance of good stratification for estimation Example: Variable of interest: value of purchases; N=4; n=2. Case Unit code Unknown true value of purchases (for N) Value of purchases known for sample (n) Estimated value of purchases, for N, from sample (n) 1 2 3 4 8 9 10 Total 36 34 (close to 36) 17 20 (not close to 36)

The importance of a good stratification for estimation (2) Example: Value of purchases for 14 enterprises Stratification alternatives: a) b) 2, 2, 2, 3, 5, 6, 7, 8, 9, 9, 10, 11, 12, 13 2, 2, 2, 3 5, 6, 7 8, 9, 9, 10 11, 12, 13 2, 9, 10 3, 9, 13 2, 6, 7, 12 2, 5, 8, 11 Case a: the strata are very homogeneous (Lower within-strata variance ) Case b: the strata are heterogeneous and similar with each other (Lower between-strata variance ) 1. Changed the text of the bullet points below the figures.

Precision and bias of estimates Precise and not biased Precise but biased not precise and biased Not biased but not precise

Mean squared error Total survey error of an estimator ~ measure of accuracy 1. Introduced the term “accuracy”.

Precision of estimates Sampling error = the random error = distance between the sample estimates and the expected value of the estimator The expected value of the estimator = the average of all sample estimates generated in the long run over repeated sampling 1. Changed “expected population parameter” into “expected value of the estimator”.

Bias of estimates Bias = the systematic error = the expected value of the estimator - the true population parameter Examples: Deficiencies of frame population Bad combination of the sample design and the estimator Characteristics of non-respondents and respondents differ

Sampling variance Variance of the estimator of a total: : population variance of stratum h. Estimate of the variance: : variance of the sample from stratum h

Strategies for calculation of sample size Bottom-up approach: Aim: a certain reliability at stratum level Top-down approach: Aim: a certain reliability at overall population level Optimization approach Aim: both a certain reliability at stratum and population level Inflate sample sizes taking into account the anticipated response rates!

Top-down approach Proportional allocation of the overall sample Equal inclusion probabilities in all strata: Gains in precision accrue to all survey measures Risk: High sampling errors for small strata which are heterogeneous 1. Removed the reference to sampling weights.

Top-down approach (2) Optimal (Neymann) allocation of the overall sample The smallest possible sampling errors at overall level of a given variable Risks: - High sampling errors for some small strata - Higher sampling errors at overall level for other variables = measure of heterogeneity within stratum h (standard deviation) Not the standard deviation of the sample from h

Estimation with non-response Sampling weight: Adjustment of weight for unit non-response Assumption: the characteristics of respondents are the same as the characteristics of non-respondents If not: bias! Population in stratum h = Nh Sample in stratum h = nh=gross sample Respondents in stratum h =mh= net sample

Improving estimation Calibration match the estimated totals of auxiliary variables with reliable statistics from other sources: - adjust the sampling weights so that matches the total of the same auxiliary variable from a reliable source minimize the distance between the sampling weights and the final calibration weights software e.g. CALMAR, SAS macro Introduced reference to auxiliary variables to avoid confusion with the variables of interest.

Estimation formulae Equivalent formulae, with final weights replacing sampling weights

Treatment of enterprises according to their size Take-all strata containing large enterprises Inclusion probabilities proportional to size Cut-off sampling

ESS methodological documents More in… ESS methodological documents Handbook on the design and implementation of business surveys (1997) Survey sampling reference guidelines – Introduction to sample design and estimation techniques (2008) Monographs of official statistics - Variance estimation methods in the European Union (2002) Handbook of Recommended Practices for Questionnaire Development and Testing in the ESS (2004) Download from Eurostat website: path: Eurostat home page -> About Eurostat -> Research and Methodology -> ESS Methodological Documents or directly: http://epp.eurostat.ec.europa.eu/portal/page/portal/research_methodology/metho dology/ess_methodological_documents Not sure whether the list needs updating. E-mail: ESTAT-METHODOLOGY@ec.europa.eu

Thank you for your attention!