Arun Srivastava. Variance Estimation in Complex Surveys Linearization (Taylor’s series) Random Group Methods Balanced Repeated Replication (BRR) Re-sampling.

Slides:



Advertisements
Similar presentations
Calculation of Sampling Errors MICS3 Regional Workshop on Data Archiving and Dissemination Alexandria, Egypt 3-7 March, 2007.
Advertisements

Calculation of Sampling Errors MICS3 Data Analysis and Report Writing Workshop.
Variance Estimation When Donor Imputation is Used to Fill in Missing Values Jean-François Beaumont and Cynthia Bocci Statistics Canada Third International.
Variance Estimation in Complex Surveys Third International Conference on Establishment Surveys Montreal, Quebec June 18-21, 2007 Presented by: Kirk Wolter,
Introduction Simple Random Sampling Stratified Random Sampling
Variance Estimation in Complex Surveys Drew Hardin Kinfemichael Gedif.
Variance Estimation in EU-SILC Survey
Module B-4: Processing ICT survey data TRAINING COURSE ON THE PRODUCTION OF STATISTICS ON THE INFORMATION ECONOMY Module B-4 Processing ICT Survey data.
Uncertainty in fall time surrogate Prediction variance vs. data sensitivity – Non-uniform noise – Example Uncertainty in fall time data Bootstrapping.
9. Weighting and Weighted Standard Errors. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
Complex Surveys Sunday, April 16, 2017.
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
Ranked Set Sampling: Improving Estimates from a Stratified Simple Random Sample Christopher Sroka, Elizabeth Stasny, and Douglas Wolfe Department of Statistics.
Resampling techniques
Chapter 17 Additional Topics in Sampling
Linear statistical models 2008 Model diagnostics  Residual analysis  Outliers  Dependence  Heteroscedasticity  Violations of distributional assumptions.
2008 Chingchun 1 Bootstrap Chingchun Huang ( 黃敬群 ) Vision Lab, NCTU.
STAT 4060 Design and Analysis of Surveys Exam: 60% Mid Test: 20% Mini Project: 10% Continuous assessment: 10%
Basics of regression analysis
Linear and generalised linear models Purpose of linear models Least-squares solution for linear models Analysis of diagnostics Exponential family and generalised.
STRATIFIED SAMPLING DEFINITION Strata: groups of members that share common characteristics Stratified sampling: the population is divided into subpopulations.
Increasing Survey Statistics Precision Using Split Questionnaire Design: An Application of Small Area Estimation 1.
Principles of the Global Positioning System Lecture 10 Prof. Thomas Herring Room A;
STAT 572: Bootstrap Project Group Members: Cindy Bothwell Erik Barry Erhardt Nina Greenberg Casey Richardson Zachary Taylor.
The Neymann-Pearson Lemma Suppose that the data x 1, …, x n has joint density function f(x 1, …, x n ;  ) where  is either  1 or  2. Let g(x 1, …,
Arun Srivastava. Types of Non-sampling Errors Specification errors, Coverage errors, Measurement or response errors, Non-response errors and Processing.
Complexities of Complex Survey Design Analysis. Why worry about this? Many government studies use these designs – CDC National Health Interview Survey.
Definitions Observation unit Target population Sample Sampled population Sampling unit Sampling frame.
Lecture 11: Kalman Filters CS 344R: Robotics Benjamin Kuipers.
Chapter 15 Modeling of Data. Statistics of Data Mean (or average): Variance: Median: a value x j such that half of the data are bigger than it, and half.
1 Guide to the PISA Data Analysis ManualPISA Data Analysis Manual Computation of Standard Errors for Multistage Samples.
Using Stata to asses the achievement of Latin American students in Mathematics, Reading and Science Roy Costilla Latin American Laboratory for Assessment.
18b. PROC SURVEY Procedures in SAS ®. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
Design Effects: What are they and how do they affect your analysis? David R. Johnson Population Research Institute & Department of Sociology The Pennsylvania.
Engineering Statistics ENGR 592 Prepared by: Mariam El-Maghraby Date: 26/05/04 Design of Experiments Plackett-Burman Box-Behnken.
Modern Navigation Thomas Herring
Part III Gathering Data.
Yaomin Jin Design of Experiments Morris Method.
1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.
Multilevel Data in Outcomes Research Types of multilevel data common in outcomes research Random versus fixed effects Statistical Model Choices “Shrinkage.
Performance of Resampling Variance Estimation Techniques with Imputed Survey data.
Resampling techniques
A Comparison of Variance Estimates for Schools and Students Using Taylor Series and Replicate Weighting Ellen Scheib, Peter H. Siegel, and James R. Chromy.
Limits to Statistical Theory Bootstrap analysis ESM April 2006.
ICCS 2009 IDB Workshop, 18 th February 2010, Madrid 1 Training Workshop on the ICCS 2009 database Weighting and Variance Estimation picture.
1 Workshop on Labour Force Survey Methodology, Paris, April 2010 Reviewing the LFS precision requirements Nicola Massarelli – Eurostat
Statistics Canada Citizenship and Immigration Canada Methodological issues.
Chapter 8. Process and Measurement System Capability Analysis
ICCS 2009 IDB Seminar – Nov 24-26, 2010 – IEA DPC, Hamburg, Germany Training Workshop on the ICCS 2009 database Weights and Variance Estimation picture.
Variance Stabilizing Transformations. Variance is Related to Mean Usual Assumption in ANOVA and Regression is that the variance of each observation is.
Chapter 12 Vocabulary. Matching: any attempt to force a sample to resemble specified attributed of the population Population Parameter: a numerically.
Stratified Sampling Lecture 8 Section 2.6 Wed, Jan 28, 2004.
Replication methods for analysis of complex survey data in Stata Nicholas Winter Cornell University
Sample Design of the National Health Interview Survey (NHIS) Linda Tompkins Data Users Conference July 12, 2006 Centers for Disease Control and Prevention.
Computacion Inteligente Least-Square Methods for System Identification.
Peter Linde, Interviewservice Statistics Denmark
Multiple Imputation using SOLAS for Missing Data Analysis
Introduction to secondary analysis of complex survey data
Presented: 2009 Canadian Users Stata Group Meeting
Two-Phase Sampling (Double Sampling)
Estimates of Bias & The Jackknife
Nonlinear Regression KNNL – Chapter 13.
Analysis of Complex Sample Data
BOOTSTRAPPING: LEARNING FROM THE SAMPLE
Nested Designs and Repeated Measures with Treatment and Time Effects
Today (2/23/16) Learning objectives:
Bootstrapping Jackknifing
Cross-validation Brenda Thomson/ Peter Fox Data Analytics
Accuracy assessment & non–response in sample surveys June Jorge M
Presentation transcript:

Arun Srivastava

Variance Estimation in Complex Surveys Linearization (Taylor’s series) Random Group Methods Balanced Repeated Replication (BRR) Re-sampling techniques Jackknife, Bootstrap

Taylor’s Series Linearization Method Non-linear statistics are approximated to linear form using Taylor’s series expansion. Variance of the first-order or linear part of the Taylor series expansion retained. This method requires the assumption that all higher- order terms are of negligible size. We are already familiar with this approach in a simple form in case of ratio estimator.

Random Group Methods Concept of replicating the survey design. Interpenetrating samples. Survey can also be divided into R groups so that each group forms a miniature version of the survey. Based on each of the R groups estimates can be developed for the parameter of interest. Let be the estimate based on rth sample.

BRR method Consider that there are H strata with two units selected per stratum. There are ways to pick 1 from each stratum. Each combination could be treated as a sample. Pick R samples. Which samples should we include?

BRR method (Contd.) Assign each value either 1 or –1 within the stratum Select samples that are orthogonal to one another to create balance One can use the design matrix for a fraction factorial Specify a vector of 1,-1 values for each stratum

BRR method (Contd.) An estimator of variance based on BRR method is given by where

Jack-knife Method Let be the estimator of  after omitting the i-th observation. Define

Soft-wares for Variance estimation OSIRIS – BRR, Jackknife SAS – Linearization Stata – Linearization SUDAAN – Linearization, Bootstrap, Jackknife WesVar – BRR, JackKnife, Bootstrap

THANKS