Nonequivalent Groups: Linear Methods Kolen, M. J., & Brennan, R. L. (2004). Test equating, scaling, and linking: Methods and practices (2 nd ed.). New.

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Managerial Economics in a Global Economy
Lesson 10: Linear Regression and Correlation
Kin 304 Regression Linear Regression Least Sum of Squares
Statistical Techniques I EXST7005 Simple Linear Regression.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
IRT Equating Kolen & Brennan, IRT If data used fit the assumptions of the IRT model and good parameter estimates are obtained, we can estimate person.
Sections 7-1 and 7-2 Review and Preview and Estimating a Population Proportion.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Part 1 Cross Sectional Data
Multiple Linear Regression Model
Chapter 10 Simple Regression.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
By Dr. Mohammad H. Omar Department of Mathematical Sciences May 16, 2006 Presented at Statistic Research (STAR) colloquium, King Fahd University of Petroleum.
Chapter 11 Multiple Regression.
Topic 3: Regression.
6.4 Prediction -We have already seen how to make predictions about our dependent variable using our OLS estimates and values for our independent variables.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Simple Linear Regression and Correlation
Simple Linear Regression Analysis
Correlation & Regression
EPSY 8223: Test Score Equating
Introduction to Linear Regression and Correlation Analysis
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Inference for regression - Simple linear regression
The Accuracy of Small Sample Equating: An Investigative/ Comparative study of small sample Equating Methods.  Kinge Mbella Liz Burton Rob Keller Nambury.
Linking & Equating Psych 818 DeShon. Why needed? ● Large-scale testing programs often require multiple forms to maintain test security over time or to.
1 Chapter 6. Section 6-1 and 6-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
1 Conceptual Issues in Observed-Score Equating Wim J. van der Linden CTB/McGraw-Hill.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Simple Linear Regression. The term linear regression implies that  Y|x is linearly related to x by the population regression equation  Y|x =  +  x.
Sections 7-1 and 7-2 Review and Preview and Estimating a Population Proportion.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Regression Analysis © 2007 Prentice Hall17-1. © 2007 Prentice Hall17-2 Chapter Outline 1) Correlations 2) Bivariate Regression 3) Statistics Associated.
Chapter 10 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 A perfect correlation implies the ability to predict one score from another perfectly.
The Simple Linear Regression Model: Specification and Estimation ECON 4550 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s.
1 Chapter 6. Section 6-1 and 6-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Testing Differences between Means, continued Statistics for Political Science Levin and Fox Chapter Seven.
University of Ostrava Czech republic 26-31, March, 2012.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
4 basic analytical tasks in statistics: 1)Comparing scores across groups  look for differences in means 2)Cross-tabulating categoric variables  look.
1 HETEROSCEDASTICITY: WEIGHTED AND LOGARITHMIC REGRESSIONS This sequence presents two methods for dealing with the problem of heteroscedasticity. We will.
More on regression Petter Mostad More on indicator variables If an independent variable is an indicator variable, cases where it is 1 will.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 7: Regression.
Hypothesis Testing and Statistical Significance
Regression Analysis Presentation 13. Regression In Chapter 15, we looked at associations between two categorical variables. We will now focus on relationships.
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
Classical Test Theory Psych DeShon. Big Picture To make good decisions, you must know how much error is in the data upon which the decisions are.
IRT Equating Kolen & Brennan, 2004 & 2014 EPSY
STA302/1001 week 11 Regression Models - Introduction In regression models, two types of variables that are studied:  A dependent variable, Y, also called.
Regression Analysis AGEC 784.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Virtual COMSATS Inferential Statistics Lecture-26
12 Inferential Analysis.
Correlation and Regression
Inference about the Slope and Intercept
6-1 Introduction To Empirical Models
Inference about the Slope and Intercept
Simple Linear Regression
12 Inferential Analysis.
Simple Linear Regression
Statistics II: An Overview of Statistics
Item & Test Statistics Psych DeShon.
CHAPTER – 1.2 UNCERTAINTIES IN MEASUREMENTS.
Created by Erin Hodgess, Houston, Texas
CHAPTER – 1.2 UNCERTAINTIES IN MEASUREMENTS.
Presentation transcript:

Nonequivalent Groups: Linear Methods Kolen, M. J., & Brennan, R. L. (2004). Test equating, scaling, and linking: Methods and practices (2 nd ed.). New York, NY: Springer. EPSY 82251

Equating Common Population Observed Score True Score: CTT/IRT Anchor Test Observed Score True Score: CTT/IRT EPSY 82252

Nonequivalent Groups Only one test form is administered at a time. The different groups cannot be assumed to be randomly equivalent. Each group comes from a different population. To allow the linking of the two forms, a set of common items is included on each form. Common items could be internal (scored) or external (not part of the score on the form). EPSY 82253

Notation Y = score on the old form X = score on the new form V = score on the common item set EPSY 82254

The Synthetic Population The design involves two populations. But the equating function is defined for a single population. We define a synthetic population, where each of the independent populations is weighted by w 1 and w 2 where w 1 + w 2 = 1 and w 1, w 2  0 EPSY 82255

Linear Equating The linear conversion is defined by setting standardized deviation scores equal EPSY 82256

Linear Equating Function Where s indicates the synthetic population EPSY 82257

Synthetic Population Parameters  s (X) = w 1  1 (X) + w 2  2 (X)  s (Y) = w 1  1 (Y) + w 2  2 (Y) EPSY 82258

Synthetic Population Parameters In population 1, Y is not administered In population 2, X is not administered The following parameters cannot be estimated directly: EPSY 82259

Parameters Estimated from Data Form X administered to population 1 Form Y administered to population 2 EPSY

Parameters Estimated from Assumptions Form X moments in Population 2 Form Y moments in Population 1 EPSY

Nonequivalent Group Equating What sets apart these equating methods is the set of statistical assumptions used. Assumptions must be introduced to estimate the parameters not observed. Different methods employ different assumptions. EPSY

Tucker Method Assumption 1: regression of total scores on common-item scores – The regression of X on V is the same linear function for both populations 1 and 2 – The same assumption is made for the regression of Y on V The slope and regression intercept are assumed to be the same for the observed data with each population and the unobserved parameters in the other population EPSY

Tucker Method Assumption 2: conditional variances of total scores given common-item scores – Conditional variance of X given V is the same for population 1 and 2 – The same assumption is made for the conditional variance of Y given V EPSY

Tucker Method The conditional mean score on the new form increases linearly with scores on the anchor – Use a simple formula to estimate the conditional mean in the synthetic population The conditional standard error is the same at all levels of the anchor score – Estimate a single value for the conditional SE Need: Mean and SD of anchor scores in synthetic population EPSY

Tucker Method Result: the synthetic population means and variances for X and Y are adjusted to directly observable quantities. The adjustment is a function of the differences in means and variances for the common items across the two populations. If  1 (V) =  2 (V) and the synthetic parameters would equal observable means and variances. EPSY

Estimating Synthetic Pop Values  s (X) =  1 (X) - w 2 γ 1 [  1 (V) -  2 (V)]  s (Y) =  2 (Y) - w 1 γ 2 [  1 (V) -  2 (V)] Where γ 1 and γ 2 are the regression slopes from the regression of X on V for the two populations. Similarly, we can estimate EPSY

Weighting Options w 1 + w 2 = 1 and w 1, w 2  0 One could conceive of the synthetic population to be the new population: w 1 = 1 and w 2 = 0 Weights can be proportional to population sizes: w 1 = N 1 /(N 1 + N 2 ) and w 1 = N 2 /(N 1 + N 2 ) Weights can be equal, combining both populations equally: w 1 + w 2 =.5 EPSY

Tucker Issues Problems in the equating can occur if – the ability distributions between those who take the different forms differ a great deal – When the anchor is not strongly correlated with the test scores – The test scores and the anchor scores do not yield near perfect reliabilities EPSY

Levine Method Assumes that X, Y, and V are all measuring the same thing so that T X and T V as well as T Y and T V are perfectly correlated in both populations. Assumptions about true scores are made in terms of the linear regression of X on V and Y on V Assumptions about the error variances (measurement error) are made similarly EPSY

Levine The method is one that relates observed scores from form X to the scale of observed scores on form Y. However, the assumptions underlying Levine method are about the true scores, T X, T Y, and T V. These are related to observed scores as in CTT, where the error has E[ε] = 0 and ρ εT = 0. EPSY

Levine The assumptions are that X, Y, and V all measure the same thing, which implies True scores of X and V as well as True scores of Y and V are perfectly correlated in both populations 1 and 2. ρ 1 = (T X, T V ) = ρ 2 (T X, T V ) = 1 ρ 1 = (T Y, T V ) = ρ 2 (T Y, T V ) = 1 EPSY

Levine True Score Method Similar assumptions about true scores are made in this method Instead of equating observed scores, true scores are equated Although the derivations employ true scores, the equating is actually done on observed scores. EPSY

Levine True Score Method In CTT, observed score means = true score means And based on previous derivation results: Observed scores are used in place of true scores. The results do not rely on the synthetic population. EPSY

Levine True-Score Property Turns out, using observed scores in Levine’s true score equating function for the common- item nonequivalent groups design results in first-order equity, under the congeneric model. For the population with a given true score on Y, the expected value for the linearly transformed scores on X equals the expected value of the scores on Y, for all true scores on Y. EPSY

Nonequivalent Groups: Equipercentile Methods EPSY

Frequency Estimation f (x, v ) is the joint distribution; the probability that X = x and V = v. f (x) is the marginal distribution of scores on X; the probability of obtaining a score of x on X. h( v ) is the marginal distribution of scores on V. f (x| v ) is the conditional distribution of scores on Form X for examinees with a particular score on V. EPSY

The conditional expectation is It follows that EPSY

Synthetic Populations f s (x) = w 1 f 1 (x) + w 2 f 2 (x) g s (y) = w 1 g 1 (y) + w 2 g 2 (y) Because form X is not administered to population 2, f 2 (x) is not directly estimable Because form Y is not administered to population 1, g 1 (y) is not directly estimable EPSY

Assumptions For both Form X and Form Y, the conditional distribution of total score given each score, V = v, is the same in both populations. EPSY

Estimation These assumptions can be used to find expressions for f 2 (x) and g 1 (y) using quantities for which direct estimates are available. EPSY

Chained Equipercentile Equating Angoff (1971) Form X scores are converted to scores on the common items using examinees from population 1 Scores on the common items are equated to form Y scores using examinees from Population 2. These conversions are chained together to produce a conversion of Form X to Form Y EPSY