1 Total Survey Error: Past, Present, and Future Robert M. Groves University of Michigan and Joint Program in Survey Methodology.

Slides:



Advertisements
Similar presentations
Educational Research: Causal-Comparative Studies
Advertisements

ASSESSING RESPONSIVENESS OF HEALTH MEASUREMENTS. Link validity & reliability testing to purpose of the measure Some examples: In a diagnostic instrument,
Making the Case for Metadata at SRS-NSF National Science Foundation Division of Science Resources Statistics Jeri Mulrow, Geetha Srinivasarao, and John.
Issues of Technical Adequacy in Measuring Student Growth for Educator Effectiveness Stanley Rabinowitz, Ph.D. Director, Assessment & Standards Development.
Enhancing Data Quality of Distributive Trade Statistics Workshop for African countries on the Implementation of International Recommendations for Distributive.
Introduction to the Quantitative Research Process
1 Editing Administrative Data and Combined Data Sources Introduction.
Validity, Sampling & Experimental Control Psych 231: Research Methods in Psychology.
MEASUREMENT. Measurement “If you can’t measure it, you can’t manage it.” Bob Donath, Consultant.
Concept of Measurement
Beginning the Research Design
Documentation and survey quality. Introduction.
Sampling and Experimental Control Goals of clinical research is to make generalizations beyond the individual studied to others with similar conditions.
1 Measurement Measurement Rules. 2 Measurement Components CONCEPTUALIZATION CONCEPTUALIZATION NOMINAL DEFINITION NOMINAL DEFINITION OPERATIONAL DEFINITION.
Personality, 9e Jerry M. Burger
Chapter Three Research Design.
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
Validity Lecture Overview Overview of the concept Different types of validity Threats to validity and strategies for handling them Examples of validity.
CORRELATIO NAL RESEARCH METHOD. The researcher wanted to determine if there is a significant relationship between the nursing personnel characteristics.
Arun Srivastava. Types of Non-sampling Errors Specification errors, Coverage errors, Measurement or response errors, Non-response errors and Processing.
Do We Still Need Probability Sampling in Surveys? Robert M. Groves University of Michigan and Joint Program in Survey Methodology, USA.
Chapter 4 Hypothesis Testing, Power, and Control: A Review of the Basics.
Dr. Engr. Sami ur Rahman Assistant Professor Department of Computer Science University of Malakand Research Methods in Computer Science Lecture: Research.
Variation, Validity, & Variables Lesson 3. Research Methods & Statistics n Integral relationship l Must consider both during planning n Research Methods.
Ch 6 Validity of Instrument
● Midterm exam next Monday in class ● Bring your own blue books ● Closed book. One page cheat sheet and calculators allowed. ● Exam emphasizes understanding.
MICS4 Survey Design Workshop Multiple Indicator Cluster Surveys Survey Design Workshop Data Analysis and Reporting.
Measurement Error.
Statistical Methods, part 1 Module 2: Latent Class Analysis of Survey Error Models for measurement errors Dan Hedlin Stockholm University November 2012.
Analyzing Reliability and Validity in Outcomes Assessment (Part 1) Robert W. Lingard and Deborah K. van Alphen California State University, Northridge.
Volunteer Angler Data Collection and Methods of Inference Kristen Olson University of Nebraska-Lincoln February 2,
Final Study Guide Research Design. Experimental Research.
Q2010, Helsinki Development and implementation of quality and performance indicators for frame creation and imputation Kornélia Mag László Kajdi Q2010,
Descriptive and Causal Research Designs Copyright © 2010 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
HOW TO WRITE RESEARCH PROPOSAL BY DR. NIK MAHERAN NIK MUHAMMAD.
Slides to accompany Weathington, Cunningham & Pittenger (2010), Chapter 3: The Foundations of Research 1.
Methodology Matters: Doing Research in the Behavioral and Social Sciences ICS 205 Ha Nguyen Chad Ata.
1 A Review of (Total) Survey Error Models William D. Kalsbeek Survey Research Unit University of North Carolina.
Nonresponse Rates and Nonresponse Bias In Surveys Robert M. Groves University of Michigan and Joint Program in Survey Methodology, USA Emilia Peytcheva.
Measurement Models: Exploratory and Confirmatory Factor Analysis James G. Anderson, Ph.D. Purdue University.
Research Seminars in IT in Education (MIT6003) Quantitative Educational Research Design 2 Dr Jacky Pow.
Copyright  2003 by Dr. Gallimore, Wright State University Department of Biomedical, Industrial Engineering & Human Factors Engineering Human Factors Research.
VALIDITY AND VALIDATION: AN INTRODUCTION Note: I have included explanatory notes for each slide. To access these, you will probably have to save the file.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 1.1 Chapter Five Data Collection and Sampling.
1 Total Survey Error: Past, Present, and Future Robert M. Groves U.S. Census Bureau Lars Lyberg Statistics Sweden and Stockholm University.
Academic Research Academic Research Dr Kishor Bhanushali M
Paul P. Biemer RTI International Lars E. Lyberg Statistics Sweden I ntroduction to S urvey Q uality.
Question paper 1997.
Copyright © Allyn & Bacon 2008 Intelligent Consumer Chapter 14 This multimedia product and its contents are protected under copyright law. The following.
Research Methodology and Methods of Social Inquiry Nov 8, 2011 Assessing Measurement Reliability & Validity.
The Theory of Sampling and Measurement. Sampling First step in implementing any research design is to create a sample. First step in implementing any.
MEASUREMENT. MeasurementThe assignment of numbers to observed phenomena according to certain rules. Rules of CorrespondenceDefines measurement in a given.
A modeler’s perspective on Total Survey Error
Assessing Responsiveness of Health Measurements Ian McDowell, INTA, Santiago, March 20, 2001.
3-1 Copyright © 2010 Pearson Education, Inc. Chapter Three Research Design.
Reliability: Introduction. Reliability Session 1.Definitions & Basic Concepts of Reliability 2.Theoretical Approaches 3.Empirical Assessments of Reliability.
Chapter Eight: Quantitative Methods
Measurement Experiment - effect of IV on DV. Independent Variable (2 or more levels) MANIPULATED a) situational - features in the environment b) task.
1 Chapter 13 Collecting the Data: Field Procedures and Nonsampling Error © 2005 Thomson/South-Western.
Research Methodology Lecture No :32 (Revision Chapters 8,9,10,11,SPSS)
Workshop on MDG, Bangkok, Jan.2009 MDG 3.2: Share of women in wage employment in the non-agricultural sector National and global data.
Small area estimation combining information from several sources Jae-Kwang Kim, Iowa State University Seo-Young Kim, Statistical Research Institute July.
Implementation of Quality indicators for administrative data
WRITING AND PUBLISHING RESEARCH ARTICLES
Chapter Eight: Quantitative Methods
Analyzing Reliability and Validity in Outcomes Assessment Part 1
Lesson 1 Foundations of measurement in Psychology
What do Quantitative Researchers Want from Qualitative Research
Errors in Surveys Training Course «Quality Management and
Presentation transcript:

1 Total Survey Error: Past, Present, and Future Robert M. Groves University of Michigan and Joint Program in Survey Methodology

2 Outline 1.Evolution of “total survey error” (TSE) 2.Weaknesses of the TSE framework 3.Strengths of the TSE framework 4.Future of surveys and research agenda ideas

3 Deming (1944) “On Errors in Surveys” American Sociological Review! First listing of sources of problems, beyond sampling, facing surveys

4

5 Comments on Deming (1944) Does include nonresponse, sampling, interviewer effects, mode effects, various other measurement errors, and processing errors Omits coverage errors Includes nonstatistical notions (auspices) Includes estimation step errors (wrong weighting) “total survey error” not used as a term

6 Sampling Text Treatments of Total Survey Error Deming, Some Theory of Sampling, 1950 –repeats set of comments in 1944 article Hansen, Hurwitz, Madow, Sample Survey Methods and Theory, 1953 –9 of 638 pages on response and other nonsampling errors –“errors due to faulty planning,” “coverage errors,” “classification errors,” “publication errors”

7 Sampling Text Treatment of Total Survey Error Kish, Survey Sampling, 1965 –65 of 643 pages on various errors, with specified relationship among errors –Graphic on biases

8 Sampling Biases Frame biases “Consistent” Sampling Bias Constant Statistical Bias Nonsampling Biases Noncoverage NonresponseNonobservation Field: data collection Office: processing Observation

9 Sampling Text Treatment of Total Survey Error Särndal, Swensson, Wretman, Model Assisted Survey Sampling, 1992 –Part IV, 124 pp. of 694, coverage, nonresponse, measurement error; omits processing error Lohr, Sampling Design and Analysis, 1999 –34 of 500 pages, nonresponse, randomized response

10 Health Survey Methods Conferences Mid 1970’s, “data error archive” idea of Horvitz 1977 key paper, “Total Survey Design: Effect of Nonresponse Bias and Procedures for Controlling Measurement Errors,” Kalsbeek and Lessler

11 Total Survey Error (1979) Anderson, Kasper, Frankel, and Associates Empirical studies on nonresponse, measurement, and processing errors for health survey data Initial total survey error framework in more elaborated nested structure

12 Total Error Variable Error Sampling Nonsampling Field Processing Bias Nonsampling Observation Field Processing Sampling Frame Consistent Nonobservation Noncoverage Nonresponse

13 Survey Errors and Survey Costs (1989), Groves Attempts conceptual linkages between total survey error framework and –psychometric true score theories –econometric measurement error and selection bias notions Ignores processing error Highest conceptual break on variance vs. bias Second conceptual break on errors of nonobservation vs. errors of observation

14 CoverageNonresponseSamplingInterviewerRespondentInstrumentMode CoverageNonresponseSamplingInterviewerRespondentInstrumentMode Errors of Nonobservation Observational Errors Bias Errors of Nonobservation Observational Errors Variance Mean Square Error construct validity theoretical validity empirical validity reliability criterion validity - predictive validity - concurrent validity

15 Nonsampling Error in Surveys (1992), Lessler and Kalsbeek Evokes “total survey design” more than total survey error Omits processing error

16 Components of ErrorTopics Frame errors Missing elements Nonpopulation elements Unrecognized multiplicities Improper use of clustered frames Sampling errors Nonresponse errors Deterministic vs. stochastic view of nonresponse Unit nonresponse Item nonresponse Measurement errors Error models of numeric and categorical data Studies with and without special data collections

17 Introduction to Survey Quality, (2003), Biemer and Lyberg Major division of sampling and nonsampling error Adds “specification error” (a la “construct validity”) Formally discusses process quality Discusses “fitness for use” as quality definition

18 Sources of ErrorTypes of Error Specification error Concepts Objectives Data element Frame error Omissions Erroneous inclusions Duplications Nonresponse error Whole unit Within unit Item Incomplete Information Measurement error Information system Setting Mode of data collection Respondent Interview Instrument Processing error Editing Data entry Coding Weighting Tabulation

19 Survey Methodology, (2004) Groves, Fowler, Couper, Lepkowski, Singer, Tourangeau Notes twin inferential processes in surveys –from a datum reported to the given construct of a sampled unit –from estimate based on respondents to the target population parameter Links inferential steps to error sources

20 Construct Inferential Population Measurement Response Target Population Sampling Frame Sample Validity Measurement Error Coverage Error Sampling Error Measurement Representation Respondents Nonresponse Error Edited Data Processing Error Survey Statistic

21 Key Statistical Developments in Total Survey Error 1 Criteria for true values (1951) 1.uniquely defined 2.defined in a manner that purposes of survey are met 3.when possible, defined in terms of operations that can be carried through Essential survey conditions, correlated response variance (1959) –conditioning factors for variance estimate –factors that affect value of variance –often ignore by users of

22 Correlated response variance (1959) –assume no covariance between true values and response deviations –assume no correlation of response deviations across interviewers –correlated response variance independent of workload –use of interpenetration Simple response variance via reinterviews (1964) –assumes no covariance between trials Relaxed assumptions of zero covariance of true values and response deviations (1964) –combine use of interpenetration and reinterview “Error Profile” of Current Population Survey (1978) Multi-method multi-trait models on survey measures (1984) –within-construct, among item covariance as tool for “simple response variance –imported from psychometrics Key Statistical Developments in Total Survey Error 2

23 Measurement of imputation error variance through multiple imputation (1987) Total error model for PES (1991) Measurement Errors in Surveys (1992) –attempt to juxtapose psychometric notions with survey statistical notions of measurement error Latent class model applications to survey errors (late 1990’s) Key Statistical Developments in Total Survey Error 3

24 Understudied components –coverage error variance –nonresponse error variance –all processing errors –biases Key Statistical Developments in Total Survey Error 4

25 Variance Bias

26 Summary of the Evolution of “Total Survey Error” Roots in cautioning against sole attention to sampling error Framework contains statistical and nonstatistical notions Most statistical attention on variance components, most on measurement error variance Late 1970’s attention to “total survey design” 1980’s-1990’s attempt to import psychometric notions Key omissions in research

27 Weaknesses of the Common Usage of “Total Survey Error” 1 1.Exclusions of key quality concepts –notably a user perspective is missing

28 Key National Indicator Initiative Quality Workshop

29 Credibility

30 Relevance

31 Estimator Quality

32 Data Quality

33 Weaknesses of the Total Survey Error Paradigm 2 2. Lack of routine measurements - error/quality profiles are useful but rare 3. Ineffective influence on professional standards - little expansion beyond sampling error in practice - press releases on Federal statistics rarely contain even sampling errors 4. Large burden on design of some estimators - interpenetration, reinterviews for variance estimation not routine 5. Patently wrong assumptions - correlation of true values with errors (drug use reports) - intraclass correlation independent of workload (interviewer experience effects)

34 Strengths of the Total Survey Error Framework 1.Taxonomic decomposition of errors nomenclature for different components 2.Separation of phenomena affecting statistics in different ways variance vs. bias; observation vs. nonobservation; respondent/interviewer/measurement task; processing 3.Conceptual foundation of the field of survey methodology subfields defined by errors 4.Tool to identifying lacunae in the research literature e.g., where are the error evaluation papers on processing?

35 A Vision of the Future Why “total survey errorists” have permanent employment opportunities: 1.new areas of study using surveys 2.populations being studied change their behavior 3.external world offers new measurement tools

36 Needed Steps in a Research Agenda for Total Survey Error 1 1.Integrating causal models of survey errors cognitive psychological mechanisms (anchoring, recall decay) 2.Research on interplay of two or more error sources jointly e.g., nonresponse and measurement error 3.Research on the interplay of biases and variances e.g., does simple response variance increase accompany some response bias reductions (self-administration effects) 4.Development of diagnostic tools to study model-error progress is slowed with just-identified models without sensitivity analysis

37 5. Guidance on tradeoffs between quality measurement and quality maximization - how much should we spend on quality enhancement vs. measurement of quality (Spencer, 1985)? 6. Integrating other notions of quality into the total survey error paradigm - if “fitness for use” predominates as a conceptual base, how can we launch research that incorporates error variation associated with different uses? 7. Exploiting a multiple-mode, multiple frame, multiple phase survey world - how can we build models that exploit non-randomized design variations to give insight into various survey errors? Needed Steps in a Research Agenda for Total Survey Error 2

38 A Summary Pitch for the Union of Design and Estimation “To err is human, to forgive divine – but to include errors in your design is statistical. Kish, 1977