The Use of Test Scores in Secondary Analysis

Presentation transcript:

The Use of Test Scores in Secondary Analysis; The Use of Survey Weights in Regression Analysis (Wooldridge): Discussion. PIAAC Methodological Seminar, June 2019, Paris. Dr. Sabine Meinck

“Primary” vs “Secondary” Analysis? Concept note: “… primary analysis is mainly about … efficient estimation of the main parameters … (the cognitive skills) in the population of interest. Secondary analysis … is mainly focused on the … parameters of a statistical model that aims at uncovering the causal relationship between different variables.” Is there really a significant difference between “primary” and “secondary” analysis?

“Primary” vs “Secondary” Analysis? I don’t fully agree with the arguments made in the concept note. The reason “primary” reports mostly use simple statistical indicators is the wealth of the data. “Primary” analysis is still broader than just measuring some domains. Most of the “secondary analysis” papers I have read or reviewed actually aim to make population inferences, even when they use more advanced types of analysis.

“Primary” vs “Secondary” Analysis? In which circumstances would we actually NOT be interested in making inferences about the population? I have a hard time thinking of any. Hence, this difference alone is clearly no argument for altering recommendations on using weights.

Different Weighting Schemes Needed? Perhaps. Perhaps not. The use of sampling weights for design-unbiased estimation of simple statistical indicators is beyond question. Weighting in simple regression analysis? Weighting for propensity score modelling? Weighting in MLM? Weighting in SEM? More evidence is needed to develop efficient weighting schemes for advanced analysis methods. Technical documentation and user guides should cover related topics more comprehensively.

Treatment effect estimation? Randomized treatments can hardly ever be observed in LSA; LSA are never RCTs. If external variables are imposed as “treatments” (e.g., a specific reform in an education sub-system), bias may arise because this variable was not considered in the conditioning model used for scaling. Is the related discussion really relevant?

Estimating Causal Effects of LSA? A large body of literature takes a very critical view of the possibilities of uncovering causal effects with LSA data, e.g., the special issue of Large-scale Assessments in Education on “Quasi-causal methods” (May 2016). Is there a need to encourage further discussion among economists and educational researchers?

Weighting Reduces Precision of Estimates? Using sampling weights can affect sampling errors. The effect can go in either direction (increasing or decreasing sampling errors), BUT the weighted estimate reflects the true sampling variance resulting from unequal sample allocation.
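One common rule of thumb for the precision cost of unequal weights is Kish's approximate design effect, n * sum(w^2) / (sum(w))^2. It is not taken from the slides, and it only describes the case where the weights are unrelated to the outcome, so it cannot show the cases in which weighting makes estimates more precise. A minimal sketch, with a hypothetical function name and made-up weights:

```python
def kish_design_effect(weights):
    """Kish's approximation: n * sum(w^2) / (sum(w))^2.

    Equals 1.0 for equal weights and grows as the weights become more variable.
    Assumes the weights are unrelated to the outcome variable.
    """
    n = len(weights)
    total = sum(weights)
    return n * sum(w * w for w in weights) / (total * total)


# Made-up weights for illustration only.
equal_weights = [2.0, 2.0, 2.0, 2.0]
unequal_weights = [1.0, 1.0, 1.0, 3.0]

print("equal weights:  ", kish_design_effect(equal_weights))
print("unequal weights:", kish_design_effect(unequal_weights))
```

A value above 1 suggests that the effective sample size is smaller than the nominal one by roughly that factor.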

Weighting Reduces Precision of Estimates? Think of a country in which all public schools follow the same curriculum, while each private school follows a different curriculum. If private schools are over-sampled, we overestimate the variety in the school system, and also the sampling variance, unless we use weights.
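The school example can be made concrete with a small deterministic sketch. All numbers are invented for illustration: 90 public schools share one value of a hypothetical curriculum measure, 10 private schools each have their own, and the sample takes 9 public schools (selection probability 0.1) and all 10 private schools (selection probability 1.0).

```python
def weighted_mean(values, weights):
    return sum(w * x for x, w in zip(values, weights)) / sum(weights)


def weighted_variance(values, weights):
    """Weighted spread of the values around the weighted mean."""
    m = weighted_mean(values, weights)
    return sum(w * (x - m) ** 2 for x, w in zip(values, weights)) / sum(weights)


# Hypothetical population: identical public schools, heterogeneous private schools.
public = [50.0] * 90
private = [30.0, 35.0, 40.0, 45.0, 50.0, 55.0, 60.0, 65.0, 70.0, 75.0]
population = public + private

# Over-sampled private schools: 9 of 90 public, 10 of 10 private.
sample_values = public[:9] + private
sample_weights = [10.0] * 9 + [1.0] * 10  # inverse selection probabilities

pop_var = weighted_variance(population, [1.0] * len(population))
unweighted_var = weighted_variance(sample_values, [1.0] * len(sample_values))
weighted_var = weighted_variance(sample_values, sample_weights)

print("population variance:       ", pop_var)
print("unweighted sample variance:", unweighted_var)
print("weighted sample variance:  ", weighted_var)
```

In this constructed case the unweighted sample overstates the variety among schools, while the weighted figure matches the population value.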

Weighting Reduces Precision of Estimates? Neglecting weights will lead to biased sampling error estimates. In other words: WLS may produce higher S.E.s than OLS, but WLS produces unbiased S.E. estimates.

Trusting Sampling Weights? “To ensure consistency use the sampling weights – if they can be trusted.” Generally, all procedures to derive sampling weights are methodologically sound and well established. There is no doubt that design weights are correctly reflected in the final weights: sampling frames are highly reliable, and selection probabilities are tracked for each sampling stage.
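As a rough illustration of how tracked selection probabilities turn into a design weight, the sketch below takes the inverse of the product of the stage-wise probabilities. The function name and the two-stage probabilities are hypothetical, and actual LSA weighting procedures involve further steps and adjustments.

```python
import math


def design_weight(stage_probabilities):
    """Inverse of the overall selection probability across all sampling stages."""
    if not stage_probabilities:
        raise ValueError("at least one sampling stage is required")
    overall = math.prod(stage_probabilities)
    if not 0.0 < overall <= 1.0:
        raise ValueError("selection probabilities must multiply to a value in (0, 1]")
    return 1.0 / overall


# Hypothetical two-stage design: school selected with probability 0.05,
# then a student within the school with probability 0.25.
w = design_weight([0.05, 0.25])
print("design weight:", w)
```

Under these assumed probabilities, each sampled student stands for roughly 80 students in the population.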

Trusting Sampling Weights? Critical issue: weight adjustments. Assumption: a non-informative response model. It can very well be argued that this assumption is often violated; there is even evidence that nonresponse does not occur at random. Why don’t we do something about it? Information on the mechanisms of nonresponse is often not available (not at all, or not in time). Even if a thorough nonresponse bias analysis (NRBA) is possible, it is still no proof of unbiasedness. This is why LSA require such high participation rates.
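A weight adjustment of the kind discussed here can be sketched as a weighting-class adjustment: within each adjustment cell, respondents' weights are scaled up so that they also carry the weight of the nonrespondents. This is a generic textbook form, not necessarily the procedure used in any particular LSA; the cell labels, field names, and numbers are hypothetical. The sketch relies on exactly the assumption the slide questions, namely that nonresponse is non-informative within a cell.

```python
from collections import defaultdict


def adjust_for_nonresponse(units):
    """Scale respondents' weights so each cell keeps its full sampled weight total.

    Assumes nonresponse is unrelated to the outcome within a cell.
    Returns new records for respondents only, with an added 'final_weight'.
    """
    sampled_total = defaultdict(float)
    respondent_total = defaultdict(float)
    for u in units:
        sampled_total[u["cell"]] += u["weight"]
        if u["responded"]:
            respondent_total[u["cell"]] += u["weight"]

    adjusted = []
    for u in units:
        if u["responded"]:
            factor = sampled_total[u["cell"]] / respondent_total[u["cell"]]
            adjusted.append({**u, "final_weight": u["weight"] * factor})
    return adjusted


# Hypothetical sample: cell A has 3 of 4 respondents, cell B has 1 of 2.
units = [
    {"cell": "A", "weight": 10.0, "responded": True},
    {"cell": "A", "weight": 10.0, "responded": True},
    {"cell": "A", "weight": 10.0, "responded": False},
    {"cell": "A", "weight": 10.0, "responded": True},
    {"cell": "B", "weight": 5.0, "responded": True},
    {"cell": "B", "weight": 5.0, "responded": False},
]

result = adjust_for_nonresponse(units)
for r in result:
    print(r["cell"], r["final_weight"])
```

The adjustment restores the weight totals, but it cannot remove bias if respondents and nonrespondents in the same cell differ on the outcome, which is why high participation rates still matter.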

Summary: Reinforce statements about using weights (e.g., Carstens and Hastedt, 2010; Braun and von Davier, 2017; von Davier, Gonzalez and Mislevy, 2009; and now Wooldridge): all recommend using sampling weights for most if not all statistical analyses. More research is needed.

Thank you! Sabine Meinck, sabine.meinck@iea-hamburg.de