1 A prediction approach to representative sampling Ib Thomsen & Li-Chun Zhang Statistics Norway

Slides:



Advertisements
Similar presentations
Social Statistics Estimation and complex survey design Ian Plewis, CCSR, University of Manchester.
Advertisements

Introduction Simple Random Sampling Stratified Random Sampling
Statistics for Managers Using Microsoft® Excel 5th Edition
Prediction and Imputation in ISEE - Tools for more efficient use of combined data sources Li-Chun Zhang, Statistics Norway Svein Nordbotton, University.
3.3 Toward Statistical Inference. What is statistical inference? Statistical inference is using a fact about a sample to estimate the truth about the.
Complex Surveys Sunday, April 16, 2017.
Research Methods in MIS: Sampling Design
Chapter 7 Sampling Distributions
Ranked Set Sampling: Improving Estimates from a Stratified Simple Random Sample Christopher Sroka, Elizabeth Stasny, and Douglas Wolfe Department of Statistics.
Who and How And How to Mess It up
Sampling.
1 Field Research Methods Lecture 4: Sampling & Sampling Error Edmund Malesky, Ph.D., UCSD.
Why sample? Diversity in populations Practicality and cost.
Chapter 11 Sampling Design. Chapter 11 Sampling Design.
Chapter 12 Sample Surveys. At the end of this chapter, you should be able to Identify populations, samples, parameters and statistics for a given problem.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics 10 th Edition.
Responding driven sampling Principles of Sampling Session 1.
STAT262: Lecture 5 (Ratio estimation)
A new sampling method: stratified sampling
Stratified Simple Random Sampling (Chapter 5, Textbook, Barnett, V
STAT 4060 Design and Analysis of Surveys Exam: 60% Mid Test: 20% Mini Project: 10% Continuous assessment: 10%
Formalizing the Concepts: Simple Random Sampling.
Correlation and Regression Analysis
STRATIFIED SAMPLING DEFINITION Strata: groups of members that share common characteristics Stratified sampling: the population is divided into subpopulations.
Impact Evaluation Session VII Sampling and Power Jishnu Das November 2006.
Chapter 5: Descriptive Research Describe patterns of behavior, thoughts, and emotions among a group of individuals. Provide information about characteristics.
Sampling Design  M. Burgman & J. Carey Types of Samples Point samples (including neighbour distance samples) Transects line intercept sampling.
Chapter 12: AP Statistics
1st NRC Meeting, October 2006, Amsterdam 1 ICCS Sampling Design.
Statistical Methods, part 1 Module 2: Latent Class Analysis of Survey Error Models for measurement errors Dan Hedlin Stockholm University November 2012.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Use of Administrative Data in Statistics Canada’s Annual Survey of Manufactures Steve Matthews and Wesley Yung May 16, 2004 The United Nations Statistical.
7-1 Chapter Seven SAMPLING DESIGN. 7-2 Selection of Elements Population Element the individual subject on which the measurement is taken; e.g., the population.
1 Ratio estimation under SRS Assume Absence of nonsampling error SRS of size n from a pop of size N Ratio estimation is alternative to under SRS, uses.
Jeroen Pannekoek - Statistics Netherlands Work Session on Statistical Data Editing Oslo, Norway, 24 September 2012 Topic (I) Selective and macro editing.
Sampling Methods.
Sampling “Sampling is the process of choosing sample which is a group of people, items and objects. That are taken from population for measurement and.
Defining Success Understanding Statistical Vocabulary.
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
Sampling. BASIC DEFINITIONS Population=aggregate set of all case that conform some set of specifications Subpopulation=population stratum = stratum Population.
Tahir Mahmood Lecturer Department of Statistics. Outlines: E xplain the role of sampling in the research process D istinguish between probability and.
Chapter 7 The Logic Of Sampling The History of Sampling Nonprobability Sampling The Theory and Logic of Probability Sampling Populations and Sampling Frames.
1 1 A two-phase life-cycle model of integrated statistical micro data Li-Chun Zhang Statistics Norway
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics.
Introduction to Secondary Data Analysis Young Ik Cho, PhD Research Associate Professor Survey Research Laboratory University of Illinois at Chicago Fall,
Sampling Sources: -EPIET Introductory course, Thomas Grein, Denis Coulombier, Philippe Sudre, Mike Catchpole -IDEA Brigitte Helynck, Philippe Malfait,
Research Design ED 592A Fall Research Concepts 1. Quantitative vs. Qualitative & Mixed Methods 2. Sampling 3. Instrumentation 4. Validity and Reliability.
Oversampling the capital cities in the EU SAfety SUrvey (EU-SASU) Task Force on Victimization Eurostat, February 2010 Guillaume Osier Service Central.
Chapter Ten Copyright © 2006 John Wiley & Sons, Inc. Basic Sampling Issues.
STATISTICAL DATA GATHERING: Sampling a Population.
1 STK 4600: Statistical methods for social sciences. Survey sampling and statistical demography Surveys for households and individuals.
SAMPLING Why sample? Practical consideration – limited budget, convenience, simplicity. Generalizability –representativeness, desire to establish the broadest.
Learning Objectives Determine when to use sampling. Determine the pros and cons of various sampling techniques. Be aware of the different types of errors.
Selecting a Sample. outline Difference between sampling in quantitative & qualitative research.
We’ve been limited to date being given to us. But we can collect it ourselves using specific sampling techniques. Chapter 12: Sample Surveys.
Plan for Today: Chapter 1: Where Do Data Come From? Chapter 2: Samples, Good and Bad Chapter 3: What Do Samples Tell US? Chapter 4: Sample Surveys in the.
1. 2 DRAWING SIMPLE RANDOM SAMPLING 1.Use random # table 2.Assign each element a # 3.Use random # table to select elements in a sample.
Sampling Design and Analysis MTH 494 LECTURE-11 Ossam Chohan Assistant Professor CIIT Abbottabad.
ESTIMATING RATIOS OF MEANS IN SURVEY SAMPLING Olivia Smith March 3, 2016.
11-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
1 A theoretical framework for register-based statistics --- Can we carry on without it? Li-Chun Zhang Statistics Norway
Sampling Why use sampling? Terms and definitions
Ratio and regression estimation STAT262, Fall 2017
جمعیت –نمونه –روشهای نمونه گیری دکتر محسن عسکرشاهی دکترای آمار زيستی
Random sampling Carlo Azzarri IFPRI Datathon APSU, Dhaka
Model Comparison: some basic concepts
Types of Control I. Measurement Control II. Statistical Control
5 points piece Homework Check
STAT 245.
Sampling Techniques Statistics.
Presentation transcript:

1 A prediction approach to representative sampling Ib Thomsen & Li-Chun Zhang Statistics Norway

The birth of representative method Kruskal and Mosteller (1979a,b,c): origins and development of the concept representative sampling N. Kiær’s representative method (ISI meeting, 1895, Bern) –A three-stage design, with 1890 census as frame:  1st: 128 counties and 23 towns throughout the country  2nd: cohorts of males of age 17, 22, 27, 32, etc.  3rd: persons with surname initial A, B, C, L, M, N –Comparison of sample marginal averages with census averages ISI committee in 1924 & report at the following meeting: “I think I may venture to say that nowadays there is hardly one statistician, who in principle will contest the legitimacy of the representative method”. (Jensen) Bowley (1926) member of the committee.

Rise and fall of the representative method: Balance vs. randomization Kiær did not take a probabilistic point of view. –Representative sample surveys instead of representative sampling –Idea of variability of population over time (quote) –Miniature population  multivariate simple balance Design-based approach: –Neyman (1934): representative sampling = randomization (quote) –Subsequent development: Hansen & co., Deming, Kish, Cochran, Mahalanobis, etc. –Godambe (1955): no minimum variance linear estimator –Representative sampling vs. efficient estimation Prediction approach: –Royall (1970): purposive sample –Royall and Eberhardt (1975): Simple balance for bias protection (quote) –Representative sample vs. efficiency

A definition of representative sampling from a prediction point of view Prediction of each individual in the population Representative sampling connected to individual mean squared error of prediction (IMSEP), i.e. Conditional IMSEP: zero inside the sample, positive outside Use randomization design to control unconditional IMSEP, i.e. expected amount of information about each population unit. Control of individual prediction as a design criterion, i.e.

An example under ratio model

Motivating familiar but seemingly unconnected sampling techniques from a unified point of view Constant mean and variance throughout the population: equal prediction  epsem/SRS Constant mean and variance in subpopulation groups: stratified equal prediction  stratified epsem/SRS; relative equal prediction w.r.t. individual variance  stratified epsem/SRS with proportional allocation Business survey: –Division of take-all, take-some and take-none units –Stratified SRS with progressive allocation Two-stage sampling: –PPS-SRS and SRS-SRS are equal prediction designs, respectively, provided zero or unity intra-cluster correlation –Stratified SRS-SRS with progressive first-stage allocation

Three principle advantages of CIP as a design-criterion Model-based inference as a mode of inference –Prediction of individual impossible under design-based perspective Randomization designs motivated by prediction –Simple random sampling (SRS) unmotivated for efficiency –SRS yields non-informative sampling, but so can any randomization. –SRS targets at simple balance, but it is not effective for that. Combination with optimality/efficiency for total (OPT) –Need for population totals –Need for socio-economic micro-data –Need for statistics at more detailed levels