On the use of auxiliary variables in agricultural surveys design

Slides:



Advertisements
Similar presentations
Linearization Variance Estimators for Survey Data: Some Recent Work
Advertisements

Social Statistics Estimation and complex survey design Ian Plewis, CCSR, University of Manchester.
Module B-4: Processing ICT survey data TRAINING COURSE ON THE PRODUCTION OF STATISTICS ON THE INFORMATION ECONOMY Module B-4 Processing ICT Survey data.
1 Multiple Frame Surveys Tracy Xu Kim Williamson Department of Statistical Science Southern Methodist University.
Riku Salonen Regression composite estimation for the Finnish LFS from a practical perspective.
1 STRATIFIED SAMPLING Stratification: The elements in the population are divided into layers/groups/ strata based on their values on one/several.
Chapter 5 Stratified Random Sampling n Advantages of stratified random sampling n How to select stratified random sample n Estimating population mean and.
Optimal Sampling Strategies for Multidomain, Multivariate Case with different amount of auxiliary information Piero Demetrio Falorsi, Paolo Righi 
Ranked Set Sampling: Improving Estimates from a Stratified Simple Random Sample Christopher Sroka, Elizabeth Stasny, and Douglas Wolfe Department of Statistics.
NEW ESTIMATORS FOR THE POPULATION MEDIAN IN SIMPLE RANDOM SAMPLING Sibel Al Hacettepe University, Department of Statistics, Ankara, Turkey
Sampling.
Why sample? Diversity in populations Practicality and cost.
STAT262: Lecture 5 (Ratio estimation)
A new sampling method: stratified sampling
STAT 4060 Design and Analysis of Surveys Exam: 60% Mid Test: 20% Mini Project: 10% Continuous assessment: 10%
Formalizing the Concepts: Simple Random Sampling.
5.10: Stratification after Selection of Sample – Post Stratification n Situations can arise in which we cannot place sampling units into their correct.
Course Content Introduction to the Research Process
Increasing Survey Statistics Precision Using Split Questionnaire Design: An Application of Small Area Estimation 1.
Lecture 30 sampling and field work
Joint UNECE/Eurostat Meeting on Population and Housing Censuses (13-15 May 2008) Sample results expected accuracy in the Italian Population and Housing.
1. An Overview of the Data Analysis and Probability Standard for School Mathematics? 2.
Chapter 7 Estimation: Single Population
Copyright 2010, The World Bank Group. All Rights Reserved. Agricultural Census Sampling Frames and Sampling Section A 1.
Joint UNECE/Eurostat Meeting on Population and Housing Censuses (28-30 October 2009) Accuracy evaluation of Nuts level 2 hypercubes with the adoption of.
Crop area estimates with area frames in the presence of measurement errors Elisabetta Carfagna University of Bologna Department.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
9 th Workshop on Labour Force Survey Methodology – Rome, May 2014 The Italian LFS sampling design: recent and future developments 9 th Workshop on.
Optimal Allocation in the Multi-way Stratification Design for Business Surveys (*) Paolo Righi, Piero Demetrio Falorsi 
1 Ratio estimation under SRS Assume Absence of nonsampling error SRS of size n from a pop of size N Ratio estimation is alternative to under SRS, uses.
Chapter 18 Additional Topics in Sampling ©. Steps in Sampling Study Step 1: Information Required? Step 2: Relevant Population? Step 3: Sample Selection?
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Introduction Since 1995, the Municipality of Firenze designed a quarterly labour force (LF) survey, parallel to that of ISTAT, to cope with the unavailability,
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
Elementary Statistics (Math 145) September 8, 2010.
Eurostat Statistical matching when samples are drawn according to complex survey designs Training Course «Statistical Matching» Rome, 6-8 November 2013.
Math 341 January 23, Outline 1. Recap 2. Other Sampling Designs 3. Graphical methods.
Eurostat Weighting and Estimation. Presented by Loredana Di Consiglio Istituto Nazionale di Statistica, ISTAT.
Sampling Design in Regional Fine Mapping of a Quantitative Trait Shelley B. Bull, Lunenfeld-Tanenbaum Research Institute, & Dalla Lana School of Public.
Eurostat Accuracy of Results of Statistical Matching Training Course «Statistical Matching» Rome, 6-8 November 2013 Marcello D’Orazio Dept. National Accounts.
 When every unit of the population is examined. This is known as Census method.  On the other hand when a small group selected as representatives of.
Math 145 January 29, Outline 1. Recap 2. Sampling Designs 3. Graphical methods.
Joint UNECE-Eurostat worksession on confidentiality, 2011, Tarragona Sampling as a way to reduce risk and create a Public Use File maintaining weighted.
Sampling Designs Outline
1. 2 DRAWING SIMPLE RANDOM SAMPLING 1.Use random # table 2.Assign each element a # 3.Use random # table to select elements in a sample.
Sampling Design and Analysis MTH 494 LECTURE-11 Ossam Chohan Assistant Professor CIIT Abbottabad.
Elementary Statistics (Math 145) June 19, Statistics is the science of collecting, analyzing, interpreting, and presenting data. is the science.
Survey sampling Outline (1 hr) Survey sampling (sources of variation) Sampling design features Replication Randomization Control of variation Some designs.
How to deal with quality aspects in estimating national results Annalisa Pallotti Short Term Expert Asa 3st Joint Workshop on Pesticides Indicators Valletta.
Chapter 9: Means and Proportions as Random Variables 9.1 Understanding dissimilarity among samples 9.2 Sampling distributions for sample proportions 9.3.
Sample design strategies
Math 145 May 27, 2009.
Sampling Why use sampling? Terms and definitions
Weighting and reweighting in surveys
Math 145 June 25, 2013.
Chapter Six Normal Curves and Sampling Probability Distributions
STRATIFIED SAMPLING.
Math 145.
STAT 145.
Math 145 January 28, 2015.
Estimation of Employment for Cities, Towns and Rural Districts
Multivariate Linear Regression Models
The European Statistical Training Programme (ESTP)
Chapter: 9: Propensity scores
ANALYSIS OF POSSIBILITY TO USE TAX AUTHORITY DATA IN STS. RESULTS
STAT 245.
Sampling and estimation
Small area estimation with calibration methods
Math 145 September 3, 2008.
Math 145 May 23, 2016.
Presentation transcript:

On the use of auxiliary variables in agricultural surveys design Federica Piersimoni ISTAT - Italian National Institute of Statistics Roberto Benedetti University “G.d’Annunzio” of Chieti-Pescara, Italy Giuseppe Espa Universy of Trento, Italy

Contents Actual situation Proposal Estimators Sampling designs Data description Simulation Analysis of the results Conclusions

Actual situation Population units Sample units

in sample surveys

2001 scatter plot matrix tc1= cattle slaughterings 2001 tc2= sheep and goats slaughterings 2001 tc3= pigs slaughterings 2001 tc4= equines slaughterings 2001

2000 scatter plot matrix tc10= cattle slaughterings 2000 tc20= sheep and goats slaughterings 2000 tc30= pigs slaughterings 2000 tc40= equines slaughterings 2000

1999 scatter plot matrix tc19= cattle slaughterings 1999 tc29= sheep and goats slaughterings 1999 tc39= pigs slaughterings 1999 tc49= equines slaughterings 1999

SCATTER PLOTS tc1: cattle slaughterings 2001 tc2: sheep and goats slaughterings 2001 tc3: pigs slaughterings 2001 tc4: equines slaughterings 2001 tc10: cattle slaughterings 2000 tc20: sheep and goats slaughterings 2000 tc30: pigs slaughterings 2000 tc40: equines slaughterings 2000 tc19: cattle slaughterings 1999 tc29: sheep and goats slaughterings 1999 tc39: pigs slaughterings 1999 tc49: equines slaughterings 1999

Year 2001 Year 2000 Year 1999

Sampling frame: N = 2.211 units (enterprises) and 12 variables: number of: cattle, pigs, sheep and goats, equines slaughtered at the census surveys of 1999, 2000 e 2001. 

the auxiliary variables  2000 samples of size n = 200… …using as auxiliary information the complete frame at 1999 and at 2000 to obtain estimates at 2001! Estimates obtained through the HorvitzThompson expansion estimator and the calibration estimator (PV) by Deville and Särndal (1992): Vector of the totals of the auxiliary variables Distance function

Samples selection simple random sampling (SRS) stratified sampling (ST) ranked set sampling (RSS) probability proportional to size (PS) balanced sampling PS + balanced sampling   

SRS: direct estimate doesn’t use auxiliary information   ST: auxiliary information is used ex ante the strata setting up; five planned strata; multivariate allocation model by Bethel (1989).   

RSS: original formulation:   Selection SRS without reinsertion of a first sample of n units; Ranking in increasing order of the n units of the sample with respect to an auxiliary variable x known for every population unit; The interest variable y is measured on the first unit only; A second SRS is drawn and ranked; The interest variable y is measured on the second unit only; ….and so on till n replications.

Ranking variable: with k =1,…,N, i =1,4 and t=1999, 2000. For the units k: 

PS: If y  positive auxiliary variable x  selection with probability  x. Such ex ante probability is

BALANCED SAMPLING and PS + BALANCED SAMPLING: The balance constraint has been imposed for the four variables to be estimated. The difference between the two criteria: in the second case the constraint is imposed ex post to PS samples

TOTAL

Conclusions but…... with efficient sampling designs may reduce It is better to impose the balance constraints in design phase, than in ex post (cf. RMSE SRS - RMSE BAL) Best performances: balanced PS selections and PS with calibration a joint use of complex estimators together with efficient sampling designs may reduce considerably the variability of the estimates but…...

but…... PS and PS with calibration selection criteria more efficient less robust of the others when outliers are present bad performance of RSS method forced univariate use of the auxiliary information for the ranking setting up when linear independence is present

Simulated sampling distribution of the tc2 estimates in the case of pps, with calibration estimator based on auxiliary variables of 2000 TRUE VALUE

Simulated sampling distribution of the tc3 estimates in the case of pps, with calibration estimator based on auxiliary variables of 1999 TRUE VALUE

Simulated sampling distribution of the tc4 direct estimates in the case of balanced pps, based on auxiliary variables of 1999 TRUE VALUE

Simulated sampling distribution of the tc2 direct estimates in the case of balanced pps, based on auxiliary variables of 2000 TRUE VALUE

References AlSaleh M.F., AlOmari A.I. (2002) Multistage ranked set sampling, Journal of Statistical Planning and Inference, 102, 273286. Bai Z., Chen Z. (2003) On the theory of rankedset sampling and its ramifications, Journal of Statistical Planning and Inference, 109, 8199. Bethel J. (1989) Sample allocation in multivariate surveys, Survey Methodology, 15, 4757. Deville J.C., Särndal C.E. (1992) Calibration Estimators in Survey Sampling, Journal of the American Statistical Association, 87, 418, 376382. Dorfman A.H., Valliant R. (2000) Stratification by size revised, Journal of Official Statistics, 16, 2, 139154. Espa G., Benedetti R., Piersimoni F. (2001) Prospettive e soluzioni per il data editing nelle rilevazioni in agricoltura, Statistica Applicata, 13, 4, 363391. Hidiroglou M.A. (1986) The construction of a self-representing stratum of large units in survey design, The American Statistician, 40, 1, 2731. Li D., Sinha B.K., Perron F. (1999) Random selection in ranked set sampling and its applications, Journal of Statistical Planning and Inference, 76, 185201. McIntyre G.A. (1952) A method for unbiased selective sampling, using ranked set, The Australian Journal of Agricultural and Resource Economics, 3, 385390. Patil G.P., Sinha A.K., Taillie C. (1994a) Ranked set sampling, in G.P. Patil and C.R. Rao (eds) Handbook of Statistics, Volume 12, Environmental Statistics, North Holland Elsevier, New York, 167–200. Patil G.P., Sinha A.K., Taillie C. (1994b) Ranked set sampling for multiple characteristics, International Journal of Ecology and Environmental Sciences, 20, 94–109. Ridout M.S. (2003) On ranked set sampling for multiple characteristics, Environmental and Ecological Statistics, 10, 255–262. Rosén B. (1997) On sampling with probability proportional to size, Journal of Statistical Planning and Inference, 62, 159191. Royall R.M. (1970) On finite population sampling theory under certain linear regression models, Biometrika, 57, 2, 377387. Royall R.M. (1992) Robustness and optimal design under prediction models for finite populations, Survey Methodology, 18, 179185. Royall R.M., Herson J. (1973a) Robust estimation in finite populations I, Journal of the American Statistical Association, 68, 344, 880889. Royall R.M., Herson J. (1973b) Robust estimation in finite population II: stratification on a size variable, Journal of the American Statistical Association, 68, 344, 890893. Särndal C-E, Swensson B., Wretman J. (1992) Model Assisted Survey Sampling, Springer Verlag, New York.

THANK YOU FOR YOUR ATTENTION!