Optimal Sampling Strategies for Multidomain, Multivariate Case with different amount of auxiliary information Piero Demetrio Falorsi, Paolo Righi 

Presentation transcript:

Optimal Sampling Strategies for Multidomain, Multivariate Case with different amount of auxiliary information Piero Demetrio Falorsi, Paolo Righi   Italian National Statistical Institute Seminar UNECE, 12 June 2012

Outline
- Aim of the talk
- Statement of the problem
- (The unified approach for) sampling design
- (Mgreg) Estimator
- Experimental results
- Conclusions

Aim of the talk: An overall strategy

Statement of the problem

Statement of the problem: Challenging informative context
Multiple sources of auxiliary information

Statement of the problem: Design

Statement of the problem: Estimation
- The standard solution for estimation (calibration estimators) may allow calibration at domain level only for the register variables; it does not calibrate on the existing domain totals that derive from auxiliary data sources.
- Main drawbacks:
  - sample size too small for some domains;
  - risk that estimates of variables that could be derived from an administrative data source differ significantly from the known totals;
  - biased estimation for small domains;
  - effect of nonresponse or measurement error.
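To make the idea of a calibration estimator concrete, here is a minimal sketch of linear (chi-square distance) calibration. It is not the Mgreg estimator of the talk. The function name `linear_calibration_weights`, the synthetic data and the target totals are all assumptions made for illustration only.

```python
import numpy as np

def linear_calibration_weights(d, X, totals):
    """Return weights w = d * (1 + X @ lam) whose weighted X-totals equal `totals`.

    d      : design weights (inverse inclusion probabilities), shape (n,)
    X      : auxiliary variables observed on the sample, shape (n, p)
    totals : known population totals of the auxiliary variables, shape (p,)
    """
    d = np.asarray(d, dtype=float)
    X = np.asarray(X, dtype=float)
    totals = np.asarray(totals, dtype=float)
    ht_totals = X.T @ d                      # Horvitz-Thompson estimates of the totals
    M = X.T @ (d[:, None] * X)               # sum over the sample of d_k * x_k x_k^T
    lam = np.linalg.solve(M, totals - ht_totals)
    return d * (1.0 + X @ lam)

# Synthetic illustration (hypothetical numbers, not taken from the talk)
rng = np.random.default_rng(0)
n = 50
d = np.full(n, 20.0)                                      # equal design weights
X = np.column_stack([np.ones(n), rng.uniform(1, 99, n)])  # intercept and a size variable
totals = np.array([1000.0, 48000.0])                      # assumed known register totals
w = linear_calibration_weights(d, X, totals)
calibrated_totals = X.T @ w                               # equals `totals` up to rounding
```

With these weights the estimates of the auxiliary totals reproduce the known totals exactly. The drawback listed above arises when some domain totals are known from other sources but are not included among the calibration constraints.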

Sampling Design: Multiple sources of auxiliary information

Estimation: Multiple sources of auxiliary information

Estimation: The Working model

Estimation: The Mgreg Estimator

Estimation: Properties

Estimation: Properties - auxiliary=interest

Empirical Results: Population of simulation
Italian enterprises from 1 to 99 employees, Computer and related economic activities (2-digit NACE Rev.1). (Slide footer: ITACOSM, June 2011, Pisa, Italy.)
[Table not recoverable from the transcript. Column headers: Population size, Number of cross-classified strata, Cumulative (%) distribution; a row label begins "More than".]
The domains of interest (44): (1) geographical region, with 20 marginal domains (DOM1); (2) economic activity group by size class (24 domains).
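The domain structure described on this slide (20 regional domains plus 24 activity-by-size domains, 44 in total) can be encoded as a matrix of membership indicators. The sketch below uses a synthetic population: the population size `N`, the sample size `n`, the random domain codes and the equal inclusion probabilities are assumptions, not figures from the talk.

```python
import numpy as np

rng = np.random.default_rng(1)
N = 2000                                   # hypothetical population size
region = rng.integers(0, 20, size=N)       # partition 1: 20 geographical regions
act_size = rng.integers(0, 24, size=N)     # partition 2: 24 activity-by-size domains

# Indicator matrix: one column per marginal domain (20 + 24 = 44 columns).
delta = np.zeros((N, 44))
delta[np.arange(N), region] = 1.0
delta[np.arange(N), 20 + act_size] = 1.0

# Under equal inclusion probabilities pi_k = n / N, the expected sample size
# in each domain is the sum of pi_k over the units belonging to that domain.
n = 200                                    # hypothetical overall sample size
pi = np.full(N, n / N)
expected_sizes = delta.T @ pi              # one expected size per marginal domain
```

Each unit belongs to exactly one domain in each partition, so every row of `delta` sums to 2 and the expected sizes within each partition add up to `n`. The design then only has to control these 44 marginal sizes, not the size of every cross-classified stratum.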

Empirical Results: Simulation: allocation comparison between the one-way and multi-way design
Prediction models: M1, M2
[Table not recoverable from the transcript. It reports percentages (%) by model for Value added and Labour cost.]

Empirical Results: multiple sources of auxiliary information: example, efficiency of the proposed strategy
Sampling distributions over the partition with different auxiliary information

Conclusions

- The latest result (the unified approach) of a research project that has lasted almost 6 years
- Papers: Survey Methodology (2008), Statistics in Transition (2006)
- 2 books published by Franco Angeli illustrating the main findings of a research project of strategic interest financed by the Ministry of University and Research
- Presentations: NTTS (2011), Neuchatel (2011)
- Invited talk at the next scientific conference of the Italian Society of Statistics
- Accepted talk for the ICES

References
- Bethel J. (1989) Sample Allocation in Multivariate Surveys, Survey Methodology, 15.
- Chromy J. (1987) Design Optimization with Multiple Objectives, Proceedings of the Survey Research Methods Section, American Statistical Association.
- Deville J.-C., Tillé Y. (2004) Efficient Balanced Sampling: the Cube Method, Biometrika, 91.
- Deville J.-C., Tillé Y. (2005) Variance approximation under balanced sampling, Journal of Statistical Planning and Inference, 128.
- Falorsi P. D., Righi P. (2008) A Balanced Sampling Approach for Multi-way Stratification Designs for Small Area Estimation, Survey Methodology, 34.
- Falorsi P. D., Orsini D., Righi P. (2006) Balanced and Coordinated Sampling Designs for Small Domain Estimation, Statistics in Transition, 7.
- Isaki C.T., Fuller W.A. (1982) Survey design under a regression superpopulation model, Journal of the American Statistical Association, 77, 89-96.