Download presentation
Presentation is loading. Please wait.
Published byStanley Harper Modified over 8 years ago
1
1 A prediction approach to representative sampling Ib Thomsen & Li-Chun Zhang Statistics Norway E-mail: lcz@ssb.no
2
The birth of representative method Kruskal and Mosteller (1979a,b,c): origins and development of the concept representative sampling N. Kiær’s representative method (ISI meeting, 1895, Bern) –A three-stage design, with 1890 census as frame: 1st: 128 counties and 23 towns throughout the country 2nd: cohorts of males of age 17, 22, 27, 32, etc. 3rd: persons with surname initial A, B, C, L, M, N –Comparison of sample marginal averages with census averages ISI committee in 1924 & report at the following meeting: “I think I may venture to say that nowadays there is hardly one statistician, who in principle will contest the legitimacy of the representative method”. (Jensen) Bowley (1926) member of the committee.
3
Rise and fall of the representative method: Balance vs. randomization Kiær did not take a probabilistic point of view. –Representative sample surveys instead of representative sampling –Idea of variability of population over time (quote) –Miniature population multivariate simple balance Design-based approach: –Neyman (1934): representative sampling = randomization (quote) –Subsequent development: Hansen & co., Deming, Kish, Cochran, Mahalanobis, etc. –Godambe (1955): no minimum variance linear estimator –Representative sampling vs. efficient estimation Prediction approach: –Royall (1970): purposive sample –Royall and Eberhardt (1975): Simple balance for bias protection (quote) –Representative sample vs. efficiency
4
A definition of representative sampling from a prediction point of view Prediction of each individual in the population Representative sampling connected to individual mean squared error of prediction (IMSEP), i.e. Conditional IMSEP: zero inside the sample, positive outside Use randomization design to control unconditional IMSEP, i.e. expected amount of information about each population unit. Control of individual prediction as a design criterion, i.e.
5
An example under ratio model
6
Motivating familiar but seemingly unconnected sampling techniques from a unified point of view Constant mean and variance throughout the population: equal prediction epsem/SRS Constant mean and variance in subpopulation groups: stratified equal prediction stratified epsem/SRS; relative equal prediction w.r.t. individual variance stratified epsem/SRS with proportional allocation Business survey: –Division of take-all, take-some and take-none units –Stratified SRS with progressive allocation Two-stage sampling: –PPS-SRS and SRS-SRS are equal prediction designs, respectively, provided zero or unity intra-cluster correlation –Stratified SRS-SRS with progressive first-stage allocation
7
Three principle advantages of CIP as a design-criterion Model-based inference as a mode of inference –Prediction of individual impossible under design-based perspective Randomization designs motivated by prediction –Simple random sampling (SRS) unmotivated for efficiency –SRS yields non-informative sampling, but so can any randomization. –SRS targets at simple balance, but it is not effective for that. Combination with optimality/efficiency for total (OPT) –Need for population totals –Need for socio-economic micro-data –Need for statistics at more detailed levels
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.