Some type of major TSE effort TSE for an “important” statistic Form a group to design a TSE evaluation for existing survey “exemplary TSE estimation plan”

Slides:



Advertisements
Similar presentations
Paul Smith Office for National Statistics
Advertisements

Nonresponse Bias Correction in Telephone Surveys Using Census Geocoding: An Evaluation of Error Properties Paul Biemer RTI International and University.
Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.
Evaluation Research Pierre-Auguste Renoir: Le Grenouillere, 1869.
The estimation strategy of the National Household Survey (NHS) François Verret, Mike Bankier, Wesley Benjamin & Lisa Hayden Statistics Canada Presentation.
SBSE Course 3. EA applications to SE Analysis Design Implementation Testing Reference: Evolutionary Computing in Search-Based Software Engineering Leo.
NLSCY – Non-response. Non-response There are various reasons why there is non-response to a survey  Some related to the survey process Timing Poor frame.
AP Statistics Chapter 5 Notes.
SE 450 Software Processes & Product Metrics Reliability: An Introduction.
Non-Experimental designs: Developmental designs & Small-N designs
Swami NatarajanJune 17, 2015 RIT Software Engineering Reliability Engineering.
SE 450 Software Processes & Product Metrics Reliability Engineering.
Bridging the Gaps: Dealing with Major Survey Changes in Data Set Harmonization Joint Statistical Meetings Minneapolis, MN August 9, 2005 Presented by:
Retrieval Evaluation. Brief Review Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
Meta Analysis An Introduction. What… is… it? A “study of studies,” i.e., averaging results across studies in a given domain to get a better estimate of.
Single-subject experimental designs
Sampling Methods.
1 Developing A Quality Assurance Program For The US Census Bureau’s Business Register David. D. Chapman US CENSUS BUREAU 1.
OECD Short-Term Economic Statistics Working PartyJune Analysis of revisions for short-term economic statistics Richard McKenzie OECD OECD Short.
12 Autocorrelation Serial Correlation exists when errors are correlated across periods -One source of serial correlation is misspecification of the model.
Key terms in Sampling Sample: A fraction or portion of the population of interest e.g. consumers, brands, companies, products, etc Population: All the.
18/08/2015 Statistics Canada Statistique Canada Responsive Collection Design (RCD) for CATI Surveys and Total Survey Error (TSE) François Laflamme International.
Role of editing and imputation in integration of sources for structural business statistics Svein Gåsemyr, Statistics Norway Svein Nordbotten, University.
Arun Srivastava. Types of Non-sampling Errors Specification errors, Coverage errors, Measurement or response errors, Non-response errors and Processing.
Tobacco Control Interventions – Design Trade-Offs K. S. (Steve) Brown Department of Statistics and Actuarial Science Health Behaviour Research Group University.
Achieving Better Reliability With Software Reliability Engineering Russel D’Souza Russel D’Souza.
One-Factor Experiments Andy Wang CIS 5930 Computer Systems Performance Analysis.
The Campbell Collaborationwww.campbellcollaboration.org Introduction to Robust Standard Errors Emily E. Tanner-Smith Associate Editor, Methods Coordinating.
Presentation: Techniques for user involvement ITAPC1.
Use of survey (LFS) to evaluate the quality of census final data Expert Group Meeting on Censuses Using Registers Geneva, May 2012 Jari Nieminen.
Nonresponse issues in ICT surveys Vasja Vehovar, Univerza v Ljubljani, FDV Bled, June 5, 2006.
12th Meeting of the Group of Experts on Business Registers
Data Mining Process A manifestation of best practices A systematic way to conduct DM projects Different groups has different versions Most common standard.
Prof Max Munday The E4G Toolkit. What is an E4G project expected to do/collect in terms of visitor numbers and related information? When you need to deliver.
Eurostat Overall design. Presented by Eva Elvers Statistics Sweden.
DEVELOPMENT OF SURVEY FROM AN ITEM BANK For Counselors Motivational Interviewing Performance Management and Vocational Rehabilitation Program Evaluation.
Generic Approaches to Model Validation Presented at Growth Model User’s Group August 10, 2005 David K. Walters.
Scot Exec Course Nov/Dec 04 Survey design overview Gillian Raab Professor of Applied Statistics Napier University.
Brown, Suter, and Churchill Basic Marketing Research (8 th Edition) © 2014 CENGAGE Learning Basic Marketing Research Customer Insights and Managerial Action.
Statistics Section 2-3 Day 1 - Sampling in the Real World.
Topic (vi): New and Emerging Methods Topic organizer: Maria Garcia (USA) UNECE Work Session on Statistical Data Editing Oslo, Norway, September 2012.
Potential Errors In Epidemiologic Studies Bias Dr. Sherine Shawky III.
CHAPTER 12 Descriptive, Program Evaluation, and Advanced Methods.
Use of Administrative Data Seminar on Developing a Programme on Integrated Statistics in support of the Implementation of the SNA for CARICOM countries.
Statistical Surveys 1 Sampling Errors. Survey In a survey data are collected from the members of a population. Eg if we want to collect data on the money.
MDG data at the sub-national level: relevance, challenges and IAEG recommendations Workshop on MDG Monitoring United Nations Statistics Division Kampala,
Testing, Experiments, and Media Planning. Market Tests and Experiments Test –Simple field test of advertising Experiment –Carefully designed study of.
Instrumental Variables: Introduction Methods of Economic Investigation Lecture 14.
META-ANALYSIS, RESEARCH SYNTHESES AND SYSTEMATIC REVIEWS © LOUIS COHEN, LAWRENCE MANION & KEITH MORRISON.
© John M. Abowd 2007, all rights reserved General Methods for Missing Data John M. Abowd March 2007.
Marketing Research Approaches. Research Approaches Observational Research Ethnographic Research Survey Research Experimental Research.
Survey Research.
Workshop LFS, April Estimating the non-response bias using exogenous data on employment Etienne Debauche Corinne Prost.
Practical Survey Design Strategies for Minimizing MSE Lars Lyberg and Bo Sundgren Statistics Sweden
The effect of variable sampling efficiency on reliability of the observation error as a measure of uncertainty in abundance indices from scientific surveys.
1 Chapter 13 Collecting the Data: Field Procedures and Nonsampling Error © 2005 Thomson/South-Western.
Methodology: How Social Psychologists Do Research
LECTURE 02: EVALUATING MODELS January 27, 2016 SDS 293 Machine Learning.
E-Mission + Team of undergraduates = ??? Background and motivation.
1 1 Using administrative registers to evaluate the effects of proxy interviews in the Norwegian Labour Force Survey Øyvin Kleven, Ib Thomsen and Ole Villund.
Synthetic Approaches to Data Linkage Mark Elliot, University of Manchester Jerry Reiter Duke University Cathie Marsh Centre.
Adjusting for coverage error in administrative sources in population estimation Owen Abbott Research, Development and Infrastructure Directorate.
Review Measure testosterone level in rats; test whether it predicts aggressive behavior. What would make this an experiment? Randomly choose which rats.
Sampling Distributions and Hypothesis Testing
Chapter 10 Verification and Validation of Simulation Models
Statistics and Research Desgin
CHAPTER 29: Multiple Regression*
Experimental Design.
Experimental Design.
Meta-analysis, systematic reviews and research syntheses
Presentation transcript:

Some type of major TSE effort TSE for an “important” statistic Form a group to design a TSE evaluation for existing survey “exemplary TSE estimation plan” Develop group to design reliability and validity measures – US version of EU database.

Predictive Models Predictive models for response, other types of TSE components Interdisciplinary teams to motivate models for TSE (paradata, process data): –fixed effects to reduce errors by design IV tenure –Random effects for predictive value Probing behavior Model validation: gold standard vs. model fit.

Resource allocation Allocating resources to deal with effect of improving response rates: tradeoff b/w errors and costs. How do we understand components of error? How do we handle more than one component of error at a time? –Manipulating response propensities –Gold standard for estimating measurement error –Errors not independent

Resource allocation Resource allocation is extremely important –Convincing need for tradeoff: what should be the priority for sample size vs. follow-up. –Non-response error is currently considered critical; measurement error usually ignored because of lack of gold standard. –What could serve as GS? Intensive effort to get high response rate in small sample; try to obtain same info via multiple questions (across surveys, possibly).

Meta-analytic work on individual error sources NR bias vs. NR rates Comparable measures across surveys (e.g. why unemployment rate differs b/w CES and CPS – unmeasured error…) –Studies to match employers w/ employees –Using Census as gold standard –Small area estimates for CPS, integrating in 2000 Census data –Could use SIPP vs. CPS to compare, e.g, income measures, look at discrepancies using SIPP as “gold-standard” to improve Census measures of income –Also matches of administrative data: crosswalk between CPS, tax returns: multi-trait, multivariate method: not assuming admin record correct Meta analysis within each TSE “box,” then conduct “meta- meta” analysis

Processing Error Processing error: coding, editing. Improve or introduce error? Are we spending too much (35% in Sweden, 20% in US)? How do me model? Computer science handles this? –W. Winkler has compilation of lit.

Using TSE to Improve Meta- Analysis Bring in study characteristics. This is done, but maybe survey analysts can inform what the metrics should be. Unless you know a lot about the topic, DK what the metrics should be. Finding information is difficult. –Nonresponse error. –TSE informs design so that we can get info (reinterview)? Currently contact is validated, but rarely is response variance determined.

Simulation Nonresponse, response error, interviewer-respondent interaction.

What is the “quality space”? How do our measures affect what the user does with the data? SE gives sample size that is important; what about TSE? Timeliness: need it if future trader, not if historian. Similarly, short-term trends (ignore response bias) vs. long term means (need to account for response bias) Could create datasets that account for TSE vs. “real data” Need to talk to agencies, etc. to learn what is needed? Remember the ultimate goal is to use the research to improve the survey design – otherwise the managers might just view this as info they don’t want to have.