Design Effects: What are they and how do they affect your analysis? David R. Johnson Population Research Institute & Department of Sociology The Pennsylvania.

Slides:



Advertisements
Similar presentations
Session 1: Introduction to Complex Survey Design
Advertisements

9. Weighting and Weighted Standard Errors. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
Estimates and sampling errors for Establishment Surveys International Workshop on Industrial Statistics Beijing, China, 8-10 July 2013.
STATISTICS FOR MANAGERS LECTURE 2: SURVEY DESIGN.
Evaluating Methods of Standard Error Estimation for Use with the Current Population Survey’s Public Use Data The Hawaii Coverage For All Technical Workshop.
Business Statistics for Managerial Decision
Multiple Indicator Cluster Surveys Survey Design Workshop
QBM117 Business Statistics Statistical Inference Sampling 1.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Chapter 4 Multiple Regression.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Why sample? Diversity in populations Practicality and cost.
7-2 Estimating a Population Proportion
1 1 Slide © 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
A new sampling method: stratified sampling
Sampling Designs and Techniques
Sampling Methods.
Formalizing the Concepts: Simple Random Sampling.
CHAPTER 7, the logic of sampling
Scot Exec Course Nov/Dec 04 Ambitious title? Confidence intervals, design effects and significance tests for surveys. How to calculate sample numbers when.
How survey design affects analysis Susan Purdon Head of Survey Methods Unit National Centre for Social Research.
Complexities of Complex Survey Design Analysis. Why worry about this? Many government studies use these designs – CDC National Health Interview Survey.
Hypothesis Testing in Linear Regression Analysis
Determining Sample Size
Copyright 2010, The World Bank Group. All Rights Reserved. Agricultural Census Sampling Frames and Sampling Section A 1.
COLLECTING QUANTITATIVE DATA: Sampling and Data collection
Chapter 1: Introduction to Statistics
Sampling. Concerns 1)Representativeness of the Sample: Does the sample accurately portray the population from which it is drawn 2)Time and Change: Was.
Definitions Observation unit Target population Sample Sampled population Sampling unit Sampling frame.
From Sample to Population Often we want to understand the attitudes, beliefs, opinions or behaviour of some population, but only have data on a sample.
Foundations of Sociological Inquiry The Logic of Sampling.
Multiple Indicator Cluster Surveys Survey Design Workshop Sampling: Overview MICS Survey Design Workshop.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
18b. PROC SURVEY Procedures in SAS ®. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Secondary Data Analysis Linda K. Owens, PhD Assistant Director for Sampling and Analysis Survey Research Laboratory University of Illinois.
Scot Exec Course Nov/Dec 04 Survey design overview Gillian Raab Professor of Applied Statistics Napier University.
1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.
Population and Sampling
CHAPTER 12 DETERMINING THE SAMPLE PLAN. Important Topics of This Chapter Differences between population and sample. Sampling frame and frame error. Developing.
Sampling Design and Analysis MTH 494 Lecture-30 Ossam Chohan Assistant Professor CIIT Abbottabad.
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.
Chapter 13 Multiple Regression
ICCS 2009 IDB Workshop, 18 th February 2010, Madrid 1 Training Workshop on the ICCS 2009 database Weighting and Variance Estimation picture.
Introduction to Secondary Data Analysis Young Ik Cho, PhD Research Associate Professor Survey Research Laboratory University of Illinois at Chicago Fall,
Part III – Gathering Data
Chapter 6: 1 Sampling. Introduction Sampling - the process of selecting observations Often not possible to collect information from all persons or other.
Data Collection & Sampling Dr. Guerette. Gathering Data Three ways a researcher collects data: Three ways a researcher collects data: By asking questions.
1 Data Collection and Sampling Chapter Methods of Collecting Data The reliability and accuracy of the data affect the validity of the results.
Bangor Transfer Abroad Programme Marketing Research SAMPLING (Zikmund, Chapter 12)
1 Introduction to Statistics. 2 What is Statistics? The gathering, organization, analysis, and presentation of numerical information.
Statistics Canada Citizenship and Immigration Canada Methodological issues.
ICCS 2009 IDB Seminar – Nov 24-26, 2010 – IEA DPC, Hamburg, Germany Training Workshop on the ICCS 2009 database Weights and Variance Estimation picture.
CASE STUDY: NATIONAL SURVEY OF FAMILY GROWTH Karen E. Davis National Center for Health Statistics Coordinating Center for Health Information and Service.
CHAPTER 7, THE LOGIC OF SAMPLING. Chapter Outline  A Brief History of Sampling  Nonprobability Sampling  The Theory and Logic of Probability Sampling.
1 Data Collection and Sampling ST Methods of Collecting Data The reliability and accuracy of the data affect the validity of the results of a statistical.
Arun Srivastava. Variance Estimation in Complex Surveys Linearization (Taylor’s series) Random Group Methods Balanced Repeated Replication (BRR) Re-sampling.
Replication methods for analysis of complex survey data in Stata Nicholas Winter Cornell University
Topics Semester I Descriptive statistics Time series Semester II Sampling Statistical Inference: Estimation, Hypothesis testing Relationships, casual models.
RESEARCH METHODS Lecture 28. TYPES OF PROBABILITY SAMPLING Requires more work than nonrandom sampling. Researcher must identify sampling elements. Necessary.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Addis.
Sampling Design and Procedure
Sampling Why use sampling? Terms and definitions
RESEARCH METHODS Lecture 28
Using Weights in the Analysis of Survey Data
Random sampling Carlo Azzarri IFPRI Datathon APSU, Dhaka
Using Weights in the Analysis of Survey Data
Presentation transcript:

Design Effects: What are they and how do they affect your analysis? David R. Johnson Population Research Institute & Department of Sociology The Pennsylvania State University

What are Design Effects? Applies to the analysis of data gathered in a sample from a population. For Social Science folks, this is survey data. Design effects are the ways departures of the sampling frame from a simple random sample (SRS) impact statistical estimates from the sample. These departures from a SRS can affect: –Standard errors and significance tests –Estimates of coefficients

Simple Random Sampling Much of statistical theory used to develop inferential statistics assumes a simple random sample. SRS assumptions include: –Equal probability of selection for all elements –Each element selected at random independently from other elements in the sample. If these assumptions are not met the estimates are likely to be in error (biased) Yet most sample surveys depart from a SRS design.

Why Depart from a Simple Random Sample? To reduce data collection costs (increase the efficiency of sample). –Cluster sampling –Stratification –Disproportionate sampling To adjust for bias in the sample. –Design weights: (adjust for disproportionate sampling) –Post-estimation weights: (adjust for non-response and coverage)

Example of a cluster sampling design in a multistage area probability sample. Would include in sample several (5 – 10) housing units in the final segment. Violates the SRS assumption that are elements are sampled independently Reduces cost by greatly decreasing listing and interviewer costs. Source: Use of Clustering in Sampling Designs

Other common clustered designs Students in Schools. Where schools are randomly sampled but multiple students are surveys in each selected school. –Example: Add Health (80 schools; many students in each school) Members in Organizations. –Example: A random sample of long term care providers in which all employees were surveyed in each organization.

The Impact of Clustering Because two random elements sampled within the same cluster may be more similar than two random element selected between clusters the information gained by adding more elements within clusters is less than that gained by adding more clusters. This can results in higher standard errors than would be found in a simple random sample.

A measure of Design Effect (deff) deff is a measure of how much the sampling variability in a sample differs from the sampling variability in a simple random sample. deff = 1 + rho (n – 1) Where rho is the interclass correlation and n is the number of elements in the cluster. rho measures the similarity two randomly selected elements within a cluster compared to two randomly selected elements between clusters. The higher the value the more similar elements are within clusters. A deff of 2, for example, would mean that it the sample would have to be twice as large to yield the same sampling variability (standard errors) that would have been found with a simple random sample.

Example A study of rent rates in large apartment complexes. Draw a random sample of 50 apartment complexes in the population. Randomly sample 10 apartments in each complex (n = 10). If the rent of each apartment were the same within each apartment and different between each of the complexes then rho = 1 and deff = 1 + 1(10 -1) = 10 In this extreme case, each additional apartment surveyed within a cluster adds no new information about the rental cost. Only surveying one apartment in each complex would give us the same information (with the same standard error) about level of rent as we get from surveying 10.

Example Another extreme example… If we studying a variable like “shoe size” of residents of apartments the estimate of the design effect might be quite different. We would not expect “shoe size” to be clustered by apartment complex, so we expect rho = 0. deff = 1 + 0(10 – 1) = 1 The sampling variability in our cluster sample would be the same as found in a simple random sample

An important point!!! The design effect is not a fixed characteristic of the sample but one that differs from variable to variable. Shown here for the clustering effect but this is also true of design effects from stratification and weighting. When design effects are present our estimates and standard errors are likely to be wrong unless we adjust for the sampling design in calculating our estimates.

Stratification Stratification can make our sample more accurate than a simple random sample. We use prior knowledge about the distribution in the sample to reduce variability. For example, let’s say we have 1000 students in a school and we want to draw a representative sample of 100 of them. Assume we know the gender of each student in the school and 50% are male and 50% are female. If we randomly sample 50 from among the males and 50 from among the females the distribution by gender in our sample will be exactly the same as in the population. With a SRS this might not have been the case. Will improve the estimates for other variables only if they are also related to gender.

Stratification The most widely used stratification variables in large national probability samples are geographical. –Census Region –Metropolitan areas –Population sizes of geographical subareas Census data and census estimates are often used to define the strata.

What estimates do clustering and stratification affect? These do not affect the point estimates –Means –Regression coefficients They only affect the standard errors, confidence intervals, and significance tests. Weights, however, can bias both the point estimates and the standard errors, confidence intervals, and significance tests. The impact of weights on point estimates is widely know, but the effects on inferential statistics less so.

Weights – The Good and the Bad The Good –Weights are designed to increase the representativeness of our sample. –e.g. if the percent male in our sample is 40% but 50% in the population, we assign weights so each male is worth more than one male and each female is worth less than one female to yield the population percent. –Weights can adjust for design decisions as well, e.g., most surveys randomly select only one adult to interview per household so adults in households with several adults are underrepresented. –These can reduce the bias in our sample.

Weights – The Good and the Bad The Bad –Weights always yield a deff > 1 –The size of the design effect will be impacted by the variability in the weights. Large differences in the size of the weights for the cases will result in larger deff Very large weights appear to have more effect on the deff than very small weights. –Although weights decrease bias they do it at the cost of increasing the variability of our estimates.

What to do… More Bad News: Most datasets used in the social science have at least one of these features that affect the estimates. Most standard statistical software does not adjust the estimates for these design factors. More journals and granting agencies are requiring that the statistical findings are adjusted for design effects.

What to do… But the Good News is: The major statistical packages now have relatively easy to use procedures for most types of statistical analysis that adjust for them. Design effects appear to have substantially less impact on the standard errors of coefficients from multivariate analysis (e.g. regression coefficients) than they do on descriptive statistics (means, percentages) Previous published analytic research findings are not likely to be affected very much by failing to adjust for such effects (especially the effects of clustering and stratification)

How can we adjust for the design effects? Documentation for most large datasets contain information on the variables included in the data that can be used adjust for the design. The design data can take several forms which require different adjustment procedures. The most common are: –Variables identifying the primary sampling units (psu), the strata, and the weight –A set of replicates (e.g. 40 – 80) variables that give the structure for a resampling (replication) method for adjusting standard errors and replace the need for information on the psu and strata. –A set of replicate weights (e.g ) that replace psu, strata and weight information. (The replicate methods are used to hide the psu information for confidentiality reasons.)

Software to adjust for Design Effects Until recently, specialized software, not an integrated part of standard packages was required to include design information in the estimates. –Sudaan: A separate program later included in SAS –WesVar: A program using replicate methods available to some degree in SPSS but also stand alone –IVEware: A public domain software package from the University of Michigan Flexible procedures for design effects now available in: –SAS: A set of survey analysis procedures separate from Sudaan –Stata: A comprehensive set of SVY: procedures –R: A set of survey analysis procedures –SPSS: A survey analysis module available for extra cost (not part of SPSS site license at Penn State)

Computational procedures used to create the adjusted estimates. Taylor series expansion method. Considered the “gold standard” method. – A computational method involving estimating non-linear equations. Equations are different for different statistics. –Requires information on the psu and strata to compute. Re-sampling or Replication methods. –Uses techniques such as the Jackknife and Bootstrap to draw multiple replicate samples which convey information on the dispersion in the sample. –These methods need either a set of replicates or can generate these (in some software) if the psu and strata are available.

The National Survey of Families and Households (NSFH) A large national personal interview survey with a complex sampling design employing a multistage area probability sampling design with clusters and strata. Over 13,000 respondents. There were 100 primary sampling units and 1,700 clusters with an average of 7.1 respondents per cluster. Provides design information and weights to adjust for design effects in two ways: –Variables for the strata and psu’s –A set of replicate variables

Replicates in the NSFH includes a set of 52 balanced, half-sample, random replicates instead of case-level information on the sampling units and strata. Balanced half-sample replicates require two or more primary sampling units in each stratum. For each replicate, one of the two primary sampling units in each stratum is assigned a value of zero, and the other is assigned a value of 1. The primary sampling units assigned zero are excluded from that replicate. Programs such as Stata or WesVar can use these to adjust for the design effects.

Design Information also available for the NSFH study idstratumpsunewla Listing Area or cluster The Stratum and psu variables can be used to convey design information to many software packages

Stata svyset command for NSFH svyset psu [pweight=weight], strata(stratum) To use the replicates in Stata you might want to consult a PRI programmer.

Design Information in the American Community Survey (ACS) Conducted by the Census Bureau as a substitute for the long form of the Census A large mail survey with telephone and personal interview follow-ups of non-respondents. Considered a complex survey design but it is not an area probability sample or a SRS. Available as a public use dataset. Presents design effects in a set of 80 replicate weights that include both design and weight information.

Examples of replicate weights for ACS rw1 rw2 rw3 rw4 rw5 rw

Using the ACS weights The documentations suggests the following: –Conduct your analysis 80 times, substituting in each weight respectively. –Save your parameter estimate in a file. –The standard deviation of your estimate over the 80 runs is your correct standard error. It may also be possible to do this with a setting in the svyset command in Stata.

Setting the design parameters for a dataset. Consult the documentation. –Examples for setting the design for some software packages is often provided. May need to consult with a PRI programmer if in doubt. Set the design and forget it!! You only need to do this once…

Thank You!!! This PowerPoint will be available on the PRI web site. There is also a list of references on the web site to sources that discuss and explain design effect issues.