Examining Best Practices for Sampling and Weighting Dual Frame Surveys Liz Kantor Advised by Dr. David Redlawsk Rutgers, The State University of New Jersey.

Slides:



Advertisements
Similar presentations
©2007, Harris Interactive Inc. All rights reserved. Anheuser-Busch Support For Education vs. Restrictions December 13, 2007 Revised.
Advertisements

Multiple Indicator Cluster Surveys Survey Design Workshop
Estimating the Level of Underreporting of Expenditures among Expenditure Reporters: A Further Micro-Level Latent Class Analysis Clyde Tucker Bureau of.
Online Privacy Survey Results Conducted: December 2011.
Recent Improvements for Dual-Frame RDD Sampling Methodology 29 th Annual Meeting March 26, 2012.
How are we doing? Sort the types of error into sampling and non-sampling errors, then match the situations to the types of error.
1 Sampling Telephone Numbers and Adults, and Interview Length, and Weighting in the California Health Survey Cell Phone Pilot Study J. Michael Brick, Westat.
SAMPLING.
Analysis of frequency counts with Chi square
Dr. Michael R. Hyman, NMSU Differences Between Group Means.
Why sample? Diversity in populations Practicality and cost.
PSY 1950 Confidence and Power December, Requisite Quote “The picturing of data allows us to be sensitive not only to the multiple hypotheses that.
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
Course Content Introduction to the Research Process
~ Draft version ~ 1 HOW TO CHOOSE THE NUMBER OF CALL ATTEMPTS IN A TELEPHONE SURVEY IN THE PRESENCE OF NONRESPONSE AND MEASUREMENT ERRORS Annica Isaksson.
Impact Evaluation Session VII Sampling and Power Jishnu Das November 2006.
Estimating Phone Service and Usage Percentages: How to Weight the Data from a Local, Dual-Frame Sample Survey of Cellphone and Landline Telephone Users.
Cell phone problem and alternative approaches Chris McCarty University of Florida PHC 6716 May,
Chapter 5: Descriptive Research Describe patterns of behavior, thoughts, and emotions among a group of individuals. Provide information about characteristics.
Trends in Chronic Diseases by Demographic Variables, Hawaii’s Older Population, Hawaii Health Survey (HHS) K. Kromer Baker 1, A. T. Onaka 1, B. Horiuchi.
Is Public Opinion Really Anti- Tax and Pro-Spending Cuts?
Learning Objective Chapter 13 Data Processing, Basic Data Analysis, and Statistical Testing of Differences CHAPTER thirteen Data Processing, Basic Data.
Joint Canada/U.S. Health Survey Catherine Simile, National Center for Health Statistics Patrice Mathieu, Statistics Canada Ed Rama, Statistics Canada NCHS.
Sampling: Theory and Methods
Addressed Based Sampling as an Alternative to Traditional Sampling Approaches: An Exploration May 6, 2013.
1 Tests with two+ groups We have examined tests of means for a single group, and for a difference if we have a matched sample (as in husbands and wives)
©2007, Harris Interactive Inc. All rights reserved. Anheuser-Busch Support For Education vs. Restrictions January 27, 2009 Revised.
Chapter Thirteen Validation & Editing Coding Machine Cleaning of Data Tabulation & Statistical Analysis Data Entry Overview of the Data Analysis.
Sociology 5811: Lecture 14: ANOVA 2
Associate Professor Arthur Dryver, PhD School of Business Administration, NIDA url:
Why Draw A Sample? Why not just the get the whole enchilada? – Pragmatic reasons – The true population is typically “unknowable” When done right, a small.
Sampling Class 7. Goals of Sampling Representation of a population Representation of a population Representation of a specific phenomenon or behavior.
Who Needs RDD? Combining Directory Listings with Cell Phone Exchanges for an Alternative Sampling Frame Presented at AAPOR 2008 New Orleans, LA May 16,
Chapter Fourteen Data Preparation 14-1 Copyright © 2010 Pearson Education, Inc.
Handling Attrition and Non- response in the 1970 British Cohort Study Tarek Mostafa Institute of Education – University of London.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
Sampling Presentation on workshop in Luxembourg 10.April 2008 Johan Heldal.
Weighting opt-in samples Some preliminary results from Sweden.
Chi- square test x 2. Chi Square test Symbolized by Greek x 2 pronounced “Ki square” A Test of STATISTICAL SIGNIFICANCE for TABLE data.
A Theoretical Framework for Adaptive Collection Designs Jean-François Beaumont, Statistics Canada David Haziza, Université de Montréal International Total.
Elementary statistics for foresters Lecture 5 Socrates/Erasmus WAU Spring semester 2005/2006.
Testing for Coverage Bias when Combining Directory-Listed And Cellphone Samples T. M. Guterbock, A. Diop, J. M. Ellis, J. L. P. Holmes and K. T. Le, Center.
I271B The t distribution and the independent sample t-test.
Introduction to Secondary Data Analysis Young Ik Cho, PhD Research Associate Professor Survey Research Laboratory University of Illinois at Chicago Fall,
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
META-ANALYSIS, RESEARCH SYNTHESES AND SYSTEMATIC REVIEWS © LOUIS COHEN, LAWRENCE MANION & KEITH MORRISON.
Weighting Household Surveys By David F. Pearson, Ph.D., P.E. April 2007.
Improving of Household Sample Surveys Data Quality on Base of Statistical Matching Approaches Ganna Tereshchenko Institute for Demography and Social Research,
Retain H o Refute hypothesis and model MODELS Explanations or Theories OBSERVATIONS Pattern in Space or Time HYPOTHESIS Predictions based on model NULL.
1 G Lect 7a G Lecture 7a Comparing proportions from independent samples Analysis of matched samples Small samples and 2  2 Tables Strength.
The Importance of Sample Size and Its Varying Effects on Precision in Large-Scale Surveys Dipankar Roy, PhD Bangladesh Bureau of Statistics
Effects of Sampling and Screening Strategies in an RDD Survey Anthony M. Roman, Elizabeth Eggleston, Charles F. Turner, Susan M. Rogers, Rebecca Crow,
Bangor Transfer Abroad Programme Marketing Research SAMPLING (Zikmund, Chapter 12)
Castleton Poll September Presidential Politics in Vermont 80 percent of Vermonters say that they’ve given some thought to whom they favor in the.
1 of 22 INTRODUCTION TO SURVEY SAMPLING October 6, 2010 Linda Owens Survey Research Laboratory University of Illinois at Chicago
Presented by: Khaleel S. Hussaini PhD Bureau Chief, Public Health Statistics Division of Public Health Preparedness Judy Bass Arizona’s BRFSS Coordinator.
E-Reading Rises as Device Ownership Jumps BY: KATHERINE ZICKUHR AND LEE RAINIE.
Washington State Department of Health Center for Health Statistics December 19, 2013 If you haven’t registered for this GoToWebinar, use this link: JoinWebinar.com.
BUS 308 Entire Course (Ash Course) For more course tutorials visit BUS 308 Week 1 Assignment Problems 1.2, 1.17, 3.3 & 3.22 BUS 308.
BUS 308 Entire Course (Ash Course) FOR MORE CLASSES VISIT BUS 308 Week 1 Assignment Problems 1.2, 1.17, 3.3 & 3.22 BUS 308 Week 1.
Dr Grant Blank Prepared for the General Online Research conference, Cologne, Germany, 5 March 2014 Who uses Twitter? Representativeness of Twitter Users.
Chapter Fourteen Data Preparation 14-1 Copyright © 2010 Pearson Education, Inc.
Pairwise comparisons: Confidence intervals Multiple comparisons Marina Bogomolov and Gili Baumer.
Weighting and imputation PHC 6716 July 13, 2011 Chris McCarty.
Sampling Why use sampling? Terms and definitions
SASU manual: sampling issues
Mean, Median, Mode The Mean is the simple average of the data values. Most appropriate for symmetric data. The Median is the middle value. It’s best.
New Techniques and Technologies for Statistics 2017  Estimation of Response Propensities and Indicators of Representative Response Using Population-Level.
Sampling and estimation
Presentation transcript:

Examining Best Practices for Sampling and Weighting Dual Frame Surveys Liz Kantor Advised by Dr. David Redlawsk Rutgers, The State University of New Jersey PANJAAPOR 2015 Spring Event

We know we need to include cell phones in telephone RDD samples and weight accordingly… …but we don’t agree on how

PANJAAPOR 2015 Spring Event Considerations Sampling  What % of the sample should be cell phones? Landlines? Weighting  Base weight?  Frame overlap adjustment?  Calibration adjustments?  Trimming?  Low variance!

PANJAAPOR 2015 Spring Event Data and Methods Data  Rutgers-Eagleton Poll Methods  Demographics: Chi-square  Sampling analysis: weight C & LL samples to make up varying proportions of overall sample  Weighting: 17 weighting & trimming combinations

PANJAAPOR 2015 Spring Event Weight Calculations Base weight: sample selection probability multiplied by “household” selection probability Frame adjustment: ½ Compositing, Effective Sample Size From Kennedy (2012)

PANJAAPOR 2015 Spring Event Weight Calculations Calibration  “Basic Sample Balancing” – one manual iteration of each demographic variable  Raking – multiple iterations of each demographic variable; SPSSINC RAKE Extension Trimming  Hard cap:.2 and 5  Percentile cap: 2 nd and 98 th percentiles

PANJAAPOR 2015 Spring Event Weighting Combinations Taking a cue from Kennedy (2012): * 0 = Rutgers-Eagleton Poll practices at the time of research

PANJAAPOR 2015 Spring Event Hypotheses I. Cell-only respondents will follow patterns in the literature of demographic distinctiveness and geographic mobility II. Dual users from the cell frame will differ demographically from dual users from the landline frame III. A higher % of cell phones than used in the Rutgers-Eagleton Poll will be required in the sample to match NHIS estimates

PANJAAPOR 2015 Spring Event Hypotheses IV. Half compositing will be less efficient (higher DEFF) than effective sample size V. No difference in efficiency between hard cap and percentile cap; can’t know a priori which will increase variance more

PANJAAPOR 2015 Spring Event Demographics Statistically significant differences at α =.001 * α =.01 C Only vs. Dual vs. LL only Dual from cell vs. Dual from LL Gender ✔✔ Age ✔✔ Race ✔✔ Party ID ✔ Education ✔✔*✔* Income ✔

PANJAAPOR 2015 Spring Event Demographics Differences in gender composition… Rutgers-Eagleton Poll

PANJAAPOR 2015 Spring Event Geographic Mobility Match between FIPS code (county) from sample provider and from respondent Rutgers-Eagleton Poll

PANJAAPOR 2015 Spring Event Simulated Sample Composition CELL- ONLY CELL- MOSTLY DUAL USE LANDLINE -MOSTLY LANDLINE -ONLY NHIS (NJ) 19.4%25.7%31.1%15.2%6.9% 30% C 70% LL 10.4%33.7%26.6%23.3%6.0% 40% C 60% LL 13.9%34.1%25.8%21.1%5.1% 50% C 50% LL 17.4%34.4%24.9%19.0%4.3% 60% C 40% LL 20.8%34.8%24.1%16.9%3.4% 70% C 30% LL 24.3%35.2%23.2%14.7%2.6% Rutgers-Eagleton Poll

PANJAAPOR 2015 Spring Event Design Effects Rutgers-Eagleton Poll

PANJAAPOR 2015 Spring Event Margins of Error *Calculated using Langer Research Associate’s MoE Machine Rutgers-Eagleton Poll

PANJAAPOR 2015 Spring Event Average deviation from NHIS estimates CELL- ONLY CELL- MOSTLYDUAL USE LANDLINE- MOSTLY LANDLINE- ONLY NHIS (NJ) 19.4%25.7%31.1%15.2%6.9% * indicates that the weighting combination came within 1% of the cell-only estimate Rutgers-Eagleton Poll

PANJAAPOR 2015 Spring Event Discussion Support for Hypotheses I, II, and III:  Significant demographic differences among cell only, dual users, and landline only Estimated geographic mobility much greater for Rs from cell sample vs. Rs from landline sample  Significant demographic differences between dual users from cell and landline samples  More cell phone interviews (~50%) are needed to match NHIS estimates of cell only

PANJAAPOR 2015 Spring Event Discussion No support for Hypothesis IV  ½ Compositing was not less efficient than Effective Sample Size Little support for Hypothesis V  Percentage cap was more efficient than the hard cap

PANJAAPOR 2015 Spring Event Weighting Recommendations Ultimately, no obvious best practices Criteria for choosing weighting and trimming combinations:  Includes a frame adjustment (avoid biased estimators)  Inclusion of base weight Brings percentage of cell-only respondents within 1% of NHIS estimates  Percentile cap – more efficient

PANJAAPOR 2015 Spring Event Weighting Recommendations Under these criteria, good weighting and trimming combinations are:  B: Base Weight, ½ Compositing, Basic Sample Balancing, Percentile Cap  D: Base Weight, ½ Compositing, Raking, Percentile Cap  F: Base Weight, Effective Sample Size, Basic Sample Balancing, Percentile Cap

PANJAAPOR 2015 Spring Event Future Research Replicate using other data Strengthen the theoretical basis of the criteria used to recommend weighting combinations Experiment with more aggressive percentile caps (e.g. 5% and 95%)

PANJAAPOR 2015 Spring Event Thank You