Increasing Survey Statistics Precision Using Split Questionnaire Design: An Application of Small Area Estimation 1.

Slides:



Advertisements
Similar presentations
Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
Advertisements

7 (a) Under what circumstances is stratified random sampling procedure is considered appropriate?How would you select such samples?Explain by means of.
SAMPLING METHODS OR TECHNIQUES
Research on Improvements to Current SIPP Imputation Methods ASA-SRM SIPP Working Group September 16, 2008 Martha Stinson.
Uncertainty in fall time surrogate Prediction variance vs. data sensitivity – Non-uniform noise – Example Uncertainty in fall time data Bootstrapping.
Split Questionnaire Designs for Consumer Expenditure Survey Trivellore Raghunathan (Raghu) University of Michigan BLS Workshop December 8-9, 2010.
Estimates and sampling errors for Establishment Surveys International Workshop on Industrial Statistics Beijing, China, 8-10 July 2013.
The estimation strategy of the National Household Survey (NHS) François Verret, Mike Bankier, Wesley Benjamin & Lisa Hayden Statistics Canada Presentation.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Sampling: Final and Initial Sample Size Determination
Statistics for Managers Using Microsoft® Excel 5th Edition
Small Area Prediction under Alternative Model Specifications By Wayne A. Fuller and Andreea L. Erciulescu Department of Statistics, Iowa State University.
Random effects estimation RANDOM EFFECTS REGRESSIONS When the observed variables of interest are constant for each individual, a fixed effects regression.
QBM117 Business Statistics Statistical Inference Sampling 1.
Chapter 7 Sampling Distributions
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
Ranked Set Sampling: Improving Estimates from a Stratified Simple Random Sample Christopher Sroka, Elizabeth Stasny, and Douglas Wolfe Department of Statistics.
Who and How And How to Mess It up
Data Sources The most sophisticated forecasting model will fail if it is applied to unreliable data Data should be reliable and accurate Data should be.
Sampling.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics 10 th Edition.
7-1 Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Chapter 7 Sampling and Sampling Distributions Statistics for Managers using Microsoft.
STAT 4060 Design and Analysis of Surveys Exam: 60% Mid Test: 20% Mini Project: 10% Continuous assessment: 10%
Partially Missing At Random and Ignorable Inferences for Parameter Subsets with Missing Data Roderick Little Rennes
Modelling health care costs: practical examples and applications Andrew Briggs Philip Clarke University of Oxford & Daniel Polsky Henry Glick University.
Course Content Introduction to the Research Process
UNECE Workshop on Confidentiality Manchester, December 2007 Comparing Fully and Partially Synthetic Data Sets for Statistical Disclosure Control.
Introduction to plausible values National Research Coordinators Meeting Madrid, February 2010.
2015 AprilUNIVERSITY OF HAIFA, DEPARTMENT OF STATISTICS, SEMINAR FOR M.A 1 Hastie, Tibshirani and Friedman.The Elements of Statistical Learning (2nd edition,
by B. Zadrozny and C. Elkan
Optimal Allocation in the Multi-way Stratification Design for Business Surveys (*) Paolo Righi, Piero Demetrio Falorsi 
Use of web scraping and text mining techniques in the Istat survey on “Information and Communication Technology in enterprises” Giulio Barcaroli(*), Alessandra.
1 Ratio estimation under SRS Assume Absence of nonsampling error SRS of size n from a pop of size N Ratio estimation is alternative to under SRS, uses.
Copyright ©2011 Pearson Education 7-1 Chapter 7 Sampling and Sampling Distributions Statistics for Managers using Microsoft Excel 6 th Global Edition.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Performance of Resampling Variance Estimation Techniques with Imputed Survey data.
Defining Success Understanding Statistical Vocabulary.
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
Evaluating generalised calibration / Fay-Herriot model in CAPEX Tracy Jones, Angharad Walters, Ria Sanderson and Salah Merad (Office for National Statistics)
Eurostat Statistical matching when samples are drawn according to complex survey designs Training Course «Statistical Matching» Rome, 6-8 November 2013.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics.
CROSS-VALIDATION AND MODEL SELECTION Many Slides are from: Dr. Thomas Jensen -Expedia.com and Prof. Olga Veksler - CS Learning and Computer Vision.
ICCS 2009 IDB Workshop, 18 th February 2010, Madrid 1 Training Workshop on the ICCS 2009 database Weighting and Variance Estimation picture.
Notes 1.3 (Part 1) An Overview of Statistics. What you will learn 1. How to design a statistical study 2. How to collect data by taking a census, using.
Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Sampling and Sampling Distributions Basic Business Statistics 11 th Edition.
Chapter 6 Conducting & Reading Research Baumgartner et al Chapter 6 Selection of Research Participants: Sampling Procedures.
Basic Business Statistics, 8e © 2002 Prentice-Hall, Inc. Chap 1-1 Inferential Statistics for Forecasting Dr. Ghada Abo-zaid Inferential Statistics for.
Basic Business Statistics
ANOVA Overview of Major Designs. Between or Within Subjects Between-subjects (completely randomized) designs –Subjects are nested within treatment conditions.
Chapter 12 Vocabulary. Matching: any attempt to force a sample to resemble specified attributed of the population Population Parameter: a numerically.
1 SPSS MACROS FOR COMPUTING STANDARD ERRORS WITH PLAUSIBLE VALUES.
Computacion Inteligente Least-Square Methods for System Identification.
ESTIMATING RATIOS OF MEANS IN SURVEY SAMPLING Olivia Smith March 3, 2016.
Synthetic Approaches to Data Linkage Mark Elliot, University of Manchester Jerry Reiter Duke University Cathie Marsh Centre.
1 General Recommendations of the DIME Task Force on Accuracy WG on HBS, Luxembourg, 13 May 2011.
© Copyright McGraw-Hill CHAPTER 14 Sampling and Simulation.
Presentation : “ Maximum Likelihood Estimation” Presented By : Jesu Kiran Spurgen Date :
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Multiple Imputation using SOLAS for Missing Data Analysis
CH 5: Multivariate Methods
Chapter 7 Sampling Distributions
The European Statistical Training Programme (ESTP)
Chapter 8: Weighting adjustment
Sampling and estimation
The European Statistical Training Programme (ESTP)
Principal Component Analysis
Chapter 13: Item nonresponse
Presentation transcript:

Increasing Survey Statistics Precision Using Split Questionnaire Design: An Application of Small Area Estimation 1

Content IntroductionSplit Questionnaire DesignPopulation Characteristics Estimation Introduction Small Area Models Nested Error Regression Model A Simulation Study Steps of Procedure Measures of Comparisons Results 2

Introduction Issue: – Effects of a lengthy survey questionnaire on: Increasing response burden declining response rate and precision of survey statistics. A solution: Splitting the questionnaire into sub-questionnaires and assigning each one to a group of sample units. Procedure of sub-sample selection is at random, therefore, the resulting nonresponse is completely at random. The resulting nonresponse would be imputed by the common imputation methods. Our method: – Designing and analyzing the split questionnaire, using Small Area Estimation technique. – The method is applicable where the efficient survey estimates are required. – Complete data set is not provided in our method. 3

Content IntroductionSplit Questionnaire DesignPopulation Characteristics Estimation Introduction Small Area Models Nested Error Regression Model A Simulation Study Steps of Procedure Measures of Comparisons Results 4

Split Questionnaire Design (To apply small area estimation) Design steps: i.The original questionnaire is divided to (m) sub-questionnaires. Some common items as covariates are assigned to the all sub-questionnaires. Therefore, all sample units respond to them. ii.All sample units are classified with respect to a known auxiliary variable. Consequently, we make homogeneity within classes. Each class is considered as an area. iii.Sample units belong to each area randomly divided into (m) sub-samples. In each class, each sub-questionnaire is administrated to a sub-sample. iv. Step iii is repeated for all classes. Note: In each class, the number of sub-questionnaires and number of sub samples should be equal. 5

Pattern of administering subquestionnaires to sub-samples in small area estimation approach: Split Questionnaire Design (To apply small area estimation) (continued) 6

Content IntroductionSplit Questionnaire DesignPopulation Characteristics Estimation Introduction Small Area Models Nested Error Regression Model A Simulation Study Steps of Procedure Measures of Comparisons Results 7

Population Characteristics Estimation: Introduction There is not large enough sample to support direct estimates of appropriate precision based on the proposed design. Small area estimation method as a solution of insufficient sample size in split questionnaire method would be useful, in order to improve the efficiency of survey statistics. 8

Area level models – Fay-Herriot Model – Model with Correlated Sampling Errors – … Unit level models – Nested Error Regression Model ✓ – Random Error Variance Linear Model – … Population characteristics Estimation: Small Area Models 9

Population characteristics Estimation Nested Error Regression model (Rao 2003) 10

Population characteristics Estimation: Nested Error Regression model (Continued) (Rao 2003) 11

Population characteristics Estimation: Nested Error Regression model (Continued) (Rao 2003) 12

Population characteristics Estimation Nested Error Regression Model (Continued) 13

Content IntroductionSplit Questionnaire DesignPopulation Characteristics Estimation Introduction Small Area Models Nested Error Regression Model A Simulation Study Steps of Procedure Measures of Comparisons Results 14

Split questionnaire design: i.Creating a questionnaire with 17 questions. ii.Splitting the questionnaire into five different components (based on split questionnaire design (Raghunathan and Grizzle 1995)). iii.Considering five items (which are highly correlated with other twelve items) as a core part. iv. Administering the core part to all sample units. v.Assigning three items to each component in such a way that the within component correlation is small whereas, items in different components are highly correlated. vi.Creating 6 subquestionnaires consist of each double combination of four components plus the core part. A Simulation Study: Steps of Procedure 15

Data generator i.Generating a multivariate normal random vector (50,000 times), under the described correlation pattern. ii.Producing a multinomial variable as a stratification variable which is strongly correlated with the other variables. iii.Classifying the population units based on the stratification variable. iv.Selecting a simple random sample (without replacement) of a fixed size n=2000 from the population. v.Assigning sample units in each stratum to the all 6 subquestionnaires. vi.Estimating the population mean of each item by applying multiple imputation approach using the predictive mean matching method (Rubin 1987) and the small area estimation technique. vii.Generating 1000 simulated bootstrap samples to compare two approaches. A Simulation Study Steps of Procedure (Continued) 16

A simulation Study: Measures of Comparisons 17

A simulation Study: Measures of Comparisons (Continued) 18

Results of the Study The estimation of absolute relative bias, MSE and relative efficiency for 1000 bootstrap samples using sample auxiliary information 19

Results of the Study (Continued) Absolute relative bias, MSE and relative efficiency for 1000 bootstrap samples using population auxiliary information 20

Small area estimators mostly have lower ARB respect to multiple imputation based estimators. There were no cases in which the multiple imputation approach gave a smaller MSE than the small area method across all items. Small area estimates are more efficient than multiple imputation approach estimates. Small Area technique requires less computation compare to multiple imputation method. Small Area method does not require to produce data, Hence it would be more applicable, where the goals is estimation of population auxiliary and not the improvement of data quality. Results of the Study (Continued) 21

References 22

References (Continued) 23

Thanks for your attention… 24