Paradata and Survey Management in the National Survey of Family Growth (NSFG), 2006-2012 William D. Mosher, NCHS; James Wagner, Mick Couper, & Nicole Kirgis,

Slides:



Advertisements
Similar presentations
1 Non-Response Bias Analyses of the Survey of Workplace Violence Prevention Andrew Kato, Kathy Downey, William McCarthy, and Samantha Cruz U.S. Bureau.
Advertisements

Correcting for Common Causes of Nonresponse and Measurement Error Andy Peytchev International Total Survey Error Workshop Stowe, June 15, 2010.
Cross Sectional Designs
Brian A. Harris-Kojetin, Ph.D. Statistical and Science Policy
The English Longitudinal Study of Ageing (ELSA) Sample design & response Shaun Scholes NatCen.
Challenges to Surveys Non-response error –Falling response rates over time Sample coverage –Those without landline telephones excluded –Growing.
1 William Mosher, PhD, NCHS NCHS Data User’s Conference, Session 38, July 11, 2006, 3:30 pm K: 2006DUC-Relig-V1 Using data on Religious affiliation and.
Documentation Tools in the Survey Lifecycle. Outline What is NSFG Webdoc? Instrument documentation != Survey documentation Data Cleaning/Processing in.
Agenda: Block Watch: Random Assignment, Outcomes, and indicators Issues in Impact and Random Assignment: Youth Transition Demonstration –Who is randomized?
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
08/08/2015 Statistics Canada Statistique Canada Paradata Collection Research for Social Surveys at Statistics Canada François Laflamme International Total.
18/08/2015 Statistics Canada Statistique Canada Responsive Collection Design (RCD) for CATI Surveys and Total Survey Error (TSE) François Laflamme International.
Aspects of the National Health Interview Survey (NHIS) Chris Moriarity National Conference on Health Statistics August 16, 2010
Are your C4 data reflective of the families you serve? Joy Markowitz, Director Jean Dauphinee, TA Specialist Measuring Child and Family Outcomes Conference,
1 Joyce Abma and Gladys Martinez Adolescent Sexual Risk Behaviors and Reproductive Health: Data from the National Survey of Family Growth, 2002 National.
Responsive Design for Household Surveys: Illustration of Management Interventions Based on Survey Paradata Robert M. Groves, Emilia Peytcheva, Nicole Kirgis,
FFT in California: Evaluation Outcomes Cricket Mitchell, PhD CIMH Consultant April 3, 2008.
Nonresponse issues in ICT surveys Vasja Vehovar, Univerza v Ljubljani, FDV Bled, June 5, 2006.
Introduction Theoretical Perspectives Research.  Sampling : Identifying the appropriate population of people to be studied.  Random Sample : Each member.
Multiple Indicator Cluster Surveys Survey Design Workshop Sampling: Overview MICS Survey Design Workshop.
A Latent Class Call-back Model for Survey Nonresponse Paul P. Biemer RTI International and UNC-CH Michael W. Link Centers for Disease Control and Prevention.
EVALUATING THE IMPACT OF ADDING THE RECLAIMING FUTURES APPROACH TO JUVENILE TREATMENT DRUG COURTS: RECLAIMING FUTURES/JUVENILE DRUG COURT EVALUATION Josephine.
The 2006 National Health Interview Survey (NHIS) Paradata File: Overview And Applications Beth L. Taylor 2008 NCHS Data User’s Conference August 13 th,
 Collecting Quantitative  Data  By: Zainab Aidroos.
Evaluating a Research Report
DRAFT - not for publication Nonresponse Bias Analysis in a Survey of Banks Carl Ramirez U.S. Government Accountability Office
Consumer behavior studies1 CONSUMER BEHAVIOR STUDIES STATISTICAL ISSUES Ralph B. D’Agostino, Sr. Boston University Harvard Clinical Research Institute.
Measuring Disability in school-aged children: Findings from the National Health Interview Survey PH Pastor, CA Reuben, ME Loeb NCHS Washington,
1 Renee M. Gindi NCHS Federal Conference on Statistical Methodology Statistical Policy Seminar December 4, 2012 Responsive Design on the National Health.
A Strategy for Prioritising Non-response Follow-up to Reduce Costs Without Reducing Output Quality Gareth James Methodology Directorate UK Office for National.
American Community Survey ACS Content Review Webinar State Data Centers and Census Information Centers James Treat, ACSO Division Chief December 4, 2013.
Panel Study of Entrepreneurial Dynamics Richard Curtin University of Michigan.
National Center for Health Statistics DCC CENTERS FOR DISEASE CONTROL AND PREVENTION Women’s Health Data in the National Survey of Family Growth (NSFG)
Healthy People 2010 Focus Area 9: Family Planning Richard J. Klein Progress Review November 6, 2008.
Handling Attrition and Non- response in the 1970 British Cohort Study Tarek Mostafa Institute of Education – University of London.
Copyright 2010, The World Bank Group. All Rights Reserved. Reducing Non-Response Section A 1.
VALIDITY AND VALIDATION: AN INTRODUCTION Note: I have included explanatory notes for each slide. To access these, you will probably have to save the file.
Copyright 2010, The World Bank Group. All Rights Reserved. Reducing Non-Response Section B 1.
Measurement of Income in NCHS Surveys Diane M. Makuc NCHS Data Users Conference July 12, 2006 Centers for Disease Control and Prevention National Center.
A Theoretical Framework for Adaptive Collection Designs Jean-François Beaumont, Statistics Canada David Haziza, Université de Montréal International Total.
Building Wave Response Rates in a Longitudinal Survey: Essential for Nonsampling Error Reduction or Last In - First Out? Steven B. Cohen Fred Rohde and.
VerdierView Graph # 1 OVERVIEW Problems With State-Level Estimates in National Surveys of the Uninsured Statistically Enhancing the Current Population.
Research.  Random Sample : Each member of the population has an equal chance of being included in the sample.  Problem of Refusal (or nonresponse) :
FCS681 Types of Research Design: Non Experimental Hira Cho.
3.14 X AXIS 6.65 BASE MARGIN 5.95 TOP MARGIN 4.52 CHART TOP LEFT MARGIN RIGHT MARGIN ©TNS 2013 Are ‘better’ interviewers more successful at.
Interviewer Effects on Paradata Predictors of Nonresponse Rachael Walsh, US Census Bureau James Dahlhamer, NCHS European Survey Research Association, 2015.
11 How Much of Interviewer Variance is Really Nonresponse Error Variance? Brady T. West Michigan Program in Survey Methodology University of Michigan-Ann.
The Use of Random Digit Dialing in Household Surveys: Challenges and Changes Chris Chapman 2008 IES Research Conference Washington, DC June 11, 2008
1 Responsive Design and Survey Management in the National Survey of Family Growth (NSFG) William D. Mosher, NCHS FCSM Statistical Policy Seminar Washington,
National Health Interview Survey Early Release Program: Overview and Key Health Indicators Report Jeannine S. Schiller, M.P.H. Division of Health Interview.
CASE STUDY: NATIONAL SURVEY OF FAMILY GROWTH Karen E. Davis National Center for Health Statistics Coordinating Center for Health Information and Service.
Effectiveness of Selected Supplemental Reading Comprehension Interventions: Impacts on a First Cohort of Fifth-Grade Students June 8, 2009 IES Annual Research.
1 European Conference on Quality in Official Statistics - Helsinki. Finland 3-6 May 2010 The use of R-indicators in responsive survey design – Some Norwegian.
5 A Day Behavior and Knowledge of Recommendations in Relation to Health Communication in the Health Information National Trends Survey (HINTS) Jennifer.
1 Assessment of Potential Bias in the National Immunization Survey (NIS) from the Increasing Prevalence of Households Without Landline Telephones Meena.
(Slides not created solely by me – the internet is a wonderful tool) SW388R7 Data Analysis & Compute rs II Slide 1.
Chapter Three Research Design. 3-2 Chapter Outline 1) Overview 2) Research Design: Definition 3) Research Design: Classification 4) Exploratory Research.
IMPACT EVALUATION PBAF 526 Class 5, October 31, 2011.
RTI International is a registered trademark and a trade name of Research Triangle Institute. Using Paradata for Monitoring Interviewer Performance.
STA248 week 121 Bootstrap Test for Pairs of Means of a Non-Normal Population – small samples Suppose X 1, …, X n are iid from some distribution independent.
Interviewer Observations in the National Survey of Family Growth: Lessons Learned and Unanswered Questions Brady T. West Survey Research Center, Institute.
SESRI Workshop on Survey-based Experiments
Context for the experiment?
SESRI Workshop on Survey-based Experiments
Data-Based Instructional Decision Making
SESRI Workshop on Survey-based Experiments
The European Statistical Training Programme (ESTP)
Chapter 6: Measures of representativity
Determining Subsampling Rates for Nonrespondents
Presentation transcript:

Paradata and Survey Management in the National Survey of Family Growth (NSFG), William D. Mosher, NCHS; James Wagner, Mick Couper, & Nicole Kirgis, U of Michigan ISR National Conference on Health Statistics Washington, DC, August 7, 2012

Outline Goals & Features of the NSFG Using Paradata for Responsive Design in the NSFG The NSFG has used Paradata to manage: – survey costs & response rates. – data quality & interviewer performance Summary Future work on paradata in the NSFG 2

Main Features of the National Survey of Family Growth, National multi-stage area probability sample Topics: Pregnancy, contraception, infertility, marriage, parenting Interview men & women age All interviews in person. No phone, no internet. Continuous design, using four 12-week “quarters” per year. 14,000 addresses selected each year, to yield at least 5,000 interviews per year. Release data files with every 2-4 years of data. 3

NSFG (continued) Two phases of data collection in each 12-week quarter: – Phase 1: 10 week data collection – Phase 2: Subsample remaining cases  Select a sample of about 1/3 of the remaining cases;  That reduces each interviewer’s case load by 2/3  Oversample cases with large weights (adults, whites)  Change data collection model:  added interview token of appreciation,  interviewer behavior change --(triple the number of hours per case) – Combine data and response rates from two phases using weights 4

Goals of Responsive Design in the NSFG The major expense in an in-person area probability sample is interviewer time, including labor and local travel costs. So the NSFG’s initial goals in using responsive design were to use paradata to ration and allocate the interviewers’ time to manage costs and response rates. The NSFG is trying to balance the following goals: – Control costs & prevent cost increases; – keep response rates at acceptable levels (70- 80% overall & for sub-groups) – get at least 5,000 interviews per year. 5

Key Elements of Responsive Design Monitoring key indicators using paradata – The use of statistical process control methods to track interviewer effort Statistically-based decisions to alter the design – Targeted at one or more measurable outcomes like response rates, sample yield, hours of interviewer labor per interview, or data quality. Interventions targeted at particular subsets of the sample – As opposed to changing a strategy for the entire sample Documentation of the decision process Evaluation of the success of the intervention (some work, some don’t; you have to drop things that don’t work.) 6

NSFG Responsive Design Interventions Weeks 1-10: Phase 1 interventions – Targeting interviewer effort at screener versus main calls at selected points in the quarter – Identifying and targeting “high priority” cases, often with embedded randomization Weeks 11-12: Phase 2 intervention – Subselection of high priority cases targeted at improving response rates and minimizing bias All interventions informed by – Response propensity models run daily (overnight) – Dashboard: – Tracking of key indicators on daily basis, – comparisons to survey goals and prior quarters 7

Effort The NSFG Dashboard hours % production Active Sample nonworked interviews Productivity calls/day % peak calls scrn’r/main calls mean calls 8+ calls locked bldgs resistant hard appt. occupied eligible propensity calls/interview hours/interview % with kids response rate I’rs working calls/hour noncontacts cum. interviews Data Set Balance % sexually active group rates CV group rates X 8

NSFG Intervention: Screener vs Main Extra effort directed to screening 9

NSFG Intervention: Subgroup Response Rates Response rates for Hispanic adult males lag before intervention; higher after it. 10

NSFG Intervention: Response Rates for sub-groups varied more in Quarter 1 than in Quarter 16 (Male & Female, & 20-44; Hispanic & Black & other race) 11

“Interventions” used in the 16 quarters of NSFG fieldwork in At least 16 experimental (randomized) “interventions” were used, to increase response rates or sample yield Most interventions increased rates for the group they were aimed at, but not always significantly. Some will be continued in the new NSFG, others won’t be. Paper on the interventions: J Wagner et al, “Use of Paradata in a Responsive Design Framework to Manage a Field Data Collection,” Journal of Official Statistics, forthcoming. 12

Using Paradata to predict the survey’s outcome variables We can compare means of key statistics for subgroups defined by the paradata. We tracked 19 key statistics by several paradata indicators. For example, we can look at means of key statistics by an interviewer observation as to whether the interviewer thinks that the adults living here are in a marriage or cohabiting relationship. The percentage of these women who had ever been pregnant was: Example: Proportion of women ever pregnant Married or cohabiting: 69% ever been pregnant Not married or cohab: 27% ever pregnant 13

How are we using paradata now, in the NSFG? Continuing development and use of dashboard to guide responsive design decisions – Identifying and improving key indicators – Identifying and documenting interventions (some successful, some not) Challenges of dealing with the federal security requirements (we can only access paradata in secure locations). Development and evaluation of data quality indicators (see next slides) 14

Paradata for Data Quality Evaluation Many surveys now use Computer audio-recorded interviewing (CARI) to evaluate data quality and interviewer performance. We don’t, for two reasons: – sensitive nature of NSFG content raises privacy & data security issues. – Budget reasons. Exploring alternatives using item-level paradata (keystroke files) 15

Indirect Data Quality Indicators (R = Respondent) Example indicators derived from keystroke data – How long does it take R’s to answer the question? – how many times did an interviewer back up and change an answer? – Error messages and actions taken by interviewer. – Item missing data (DK, RF) rates. – How often does the interviewer access the Help function? – How often does the interviewer enter a comment on R’s answer? Used factor analysis to combine indicators Goal= identify outlier interviewers on multiple factors 16

Use of Data Quality Indicators by interviewer, and by question By Interviewer: We are currently using these indicators to identify interviewers for special attention, including re-training in the weekly calls with supervisor. By Question: But showing these indicators for specific QUESTIONS (instead of INTERVIEWERS) can help identify potentially problematic items or questions. 17

How much do interviewers access Help and enter comments? Standard scores, by interviewer interviewers access help & enter comments more than others !

Summary: Use of Paradata in the NSFG In the NSFG, paradata were used in to (a) limit costs, (b) get adequate sample sizes, and (c) equalize response rates across subgroups by age-race-gender that are strongly correlated with NSFG outcomes Uses included targeting groups with lagging response rates, encouraging adequate screening effort, and designing the non-response follow-up. In , we have also been using paradata to monitor data quality and interviewer performance. Intent is to improve questionnaire design and interviewer training. Thinking about a future paradata data file: Which paradata elements would be useful for a limited release paradata file? 19

References and further information 1.NSFG web site: 2. ISR/U of Michigan methodology research on NSFG: 3.Groves et al, “Planning & Development of the Continuous National Survey of Family Growth.” NCHS, Vital & Health Statistics, Series 1, No. 48, Sept 2009, pages Lepkowski et al, The National Survey of Family Growth: Sample Design & Analysis of a Continuous Survey. Vital & Health Statistics, Series 2, No. 150, June 2010, pages