Lesson 2 Main Test Theories: The Classical Test Theory (CTT)

Test theories
Test theories establish a functional relationship between:
– Observable variables (the empirical scores that subjects obtain on tests or on individual items); and
– Unobservable variables (true scores, or the subjects' skill level in the construct being measured).

Main test theories:
1. Classical Test Theory (CTT).
2. Item Response Theory (IRT).
3. Generalizability Theory (GT).

1. Classical Test Theory (CTT)
Spearman (1904, 1907, 1910, 1913). CTT posits a functional relationship between empirical or observed scores (X), true scores (T), and error scores (E). Linear model: $X = T + E$

The performance of a subject responding to a test at a particular time is affected by many factors that are difficult to control. As a result, the obtained (empirical) score does not match the subject's true score, so the true score has to be estimated from the assumptions of the model.

The error term includes all the random errors that affect empirical scores. They can come from several sources:
– The subject (emotional state, fatigue, stress, etc.).
– The test (its items and format).
– Characteristics of the test administrators.
– Environmental conditions.
– Instructions.
– Etc.
We should try to control them through the study of reliability (lesson 5).

1.1. Classical Test Theory (CTT) Model assumptions
A. The true score (T) is the mathematical expectation of the empirical score (X): $T = E(X)$, where $E(\cdot)$ denotes expectation rather than the error term.
– If we administered the same test to a person an infinite number of times (assuming the administrations are independent, so that the score obtained in one does not influence the others), the mean of all observed scores (X) would be the subject's true score.
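Assumption A can be illustrated with a small simulation. This is only a sketch under stated assumptions: the true score of 50, the normally distributed error with standard deviation 5, and the function name `simulate_observed_scores` are hypothetical choices for the example and do not come from the slides.

```python
import random
import statistics


def simulate_observed_scores(true_score, error_sd, n_administrations, seed=0):
    """Return observed scores X = T + E for repeated, independent administrations.

    The error E is drawn from a normal distribution with mean 0, which is an
    assumption of this sketch (CTT itself only requires random error).
    """
    rng = random.Random(seed)
    return [true_score + rng.gauss(0.0, error_sd) for _ in range(n_administrations)]


# Hypothetical subject with true score T = 50 and error SD = 5.
scores = simulate_observed_scores(true_score=50.0, error_sd=5.0, n_administrations=100_000)

# With many administrations the mean observed score approaches T.
mean_observed = statistics.fmean(scores)
print(f"mean of X over {len(scores)} administrations: {mean_observed:.2f}")
```

Any single observed score can be several points away from 50, but the mean over many independent administrations settles close to the true score, which is what $T = E(X)$ expresses.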

B. The correlation between the true scores of the n participants in a test and their measurement errors is equal to 0: $r_{TE} = 0$.
– There is no relationship between measurement errors and true scores.

C. The correlation between the measurement errors that affect scores in two different tests ($X_1$ and $X_2$) is equal to 0: $r_{E_1 E_2} = 0$.
– There is no reason to assume that the measurement errors made in one test will influence, positively or negatively, the measurement errors in another test, provided the tests are administered correctly.
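The two zero-correlation assumptions can be sketched in code. The sample size, the score distributions, and the helper `pearson_r` below are illustrative assumptions, not part of the slides. Because the simulated errors are generated independently of the true scores and of each other, the example shows what the assumptions look like in data rather than testing them.

```python
import math
import random


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


rng = random.Random(42)
n_participants = 20_000  # hypothetical sample size

# True scores and the errors on two tests are drawn independently (assumption).
true_scores = [rng.gauss(50.0, 10.0) for _ in range(n_participants)]
errors_test1 = [rng.gauss(0.0, 5.0) for _ in range(n_participants)]
errors_test2 = [rng.gauss(0.0, 5.0) for _ in range(n_participants)]

r_te = pearson_r(true_scores, errors_test1)     # assumption B: r_TE = 0
r_e1e2 = pearson_r(errors_test1, errors_test2)  # assumption C: r_E1E2 = 0
print(f"r_TE = {r_te:.3f}, r_E1E2 = {r_e1e2:.3f}")
```

In a finite sample both correlations are only approximately zero; they shrink toward zero as the number of participants grows.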

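1.2. Classical Test Theory (CTT) Model deductions
1. $E = X - T$
5. $\operatorname{cov}(T, E) = 0$
6. $\operatorname{cov}(X, T) = \operatorname{var}(T)$
Deductions 2 to 4, 7 and 8 appeared as equations on the slide and are not preserved in this transcript.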
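Deduction 6 follows from the linear model together with deduction 5. The short derivation below is a standard one and is offered as a sketch, not as a reconstruction of the slide. The decomposition of $\operatorname{var}(X)$ in the last line is a well-known consequence of the same two facts; whether it matches one of the deductions missing from the transcript is an assumption.

```latex
\begin{align*}
\operatorname{cov}(X, T)
  &= \operatorname{cov}(T + E,\, T)
   = \operatorname{var}(T) + \operatorname{cov}(E, T)
   = \operatorname{var}(T), \\
\operatorname{var}(X)
  &= \operatorname{var}(T + E)
   = \operatorname{var}(T) + \operatorname{var}(E) + 2\operatorname{cov}(T, E)
   = \operatorname{var}(T) + \operatorname{var}(E).
\end{align*}
```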

2. Item Response Theory (IRT)
Lord (1952, 1953). The probability that a person gives a specific response to an item depends on the person's skill level in the construct and on the item's characteristics (difficulty, discrimination, pseudo-chance). IRT provides a number of models that assume a functional relationship between the values of the variable that the items measure (the subjects' skill level in the measured construct) and the probability that subjects answer each item correctly. This function is called the Item Characteristic Curve.
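One common way to write an Item Characteristic Curve with the three item characteristics named above is the three-parameter logistic (3PL) model. The slides do not say which model they have in mind, so the logistic form, the function name `icc_3pl`, and the parameter values below are assumptions made for illustration.

```python
import math


def icc_3pl(theta, a, b, c):
    """Probability of a correct response under a three-parameter logistic model.

    theta: skill level of the person
    a: item discrimination (slope of the curve around b)
    b: item difficulty (location of the curve on the skill scale)
    c: pseudo-chance parameter (lower asymptote, e.g. success by guessing)
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


# Hypothetical item: discrimination 1.5, difficulty 0.0, pseudo-chance 0.2.
for theta in (-3.0, -1.0, 0.0, 1.0, 3.0):
    print(f"theta = {theta:+.1f}  P(correct) = {icc_3pl(theta, 1.5, 0.0, 0.2):.3f}")
```

In this form the probability never falls below `c`, rises with skill level, and equals `c + (1 - c) / 2` when the person's skill level matches the item difficulty.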

3. Generalizability Theory (GT)
Cronbach, Gleser, Nanda & Rajaratnam (1972). GT is a way of systematizing and classifying measurement error according to the sources that cause it. It takes into account all possible sources of error (individual factors, situational characteristics of the evaluator, and instrumental variables) and tries to separate them by applying the classical procedures of analysis of variance (ANOVA).
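As a rough sketch of how ANOVA separates sources of error, the example below estimates variance components for the simplest crossed design, in which every person is scored once by every rater. The design, the function name `variance_components`, and the toy ratings are hypothetical; the slides do not describe a specific design.

```python
def variance_components(scores):
    """Estimate variance components for a crossed persons x raters design.

    scores[p][r] is the single score that person p received from rater r.
    With one observation per cell, the residual mixes the person-by-rater
    interaction with random error.
    """
    n_p = len(scores)
    n_r = len(scores[0])
    grand = sum(sum(row) for row in scores) / (n_p * n_r)
    person_means = [sum(row) / n_r for row in scores]
    rater_means = [sum(scores[p][r] for p in range(n_p)) / n_p for r in range(n_r)]

    # Sums of squares from the two-way ANOVA without replication.
    ss_person = n_r * sum((m - grand) ** 2 for m in person_means)
    ss_rater = n_p * sum((m - grand) ** 2 for m in rater_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_resid = ss_total - ss_person - ss_rater

    # Mean squares.
    ms_person = ss_person / (n_p - 1)
    ms_rater = ss_rater / (n_r - 1)
    ms_resid = ss_resid / ((n_p - 1) * (n_r - 1))

    # Variance components obtained from the expected mean squares.
    return {
        "person": (ms_person - ms_resid) / n_r,
        "rater": (ms_rater - ms_resid) / n_p,
        "residual": ms_resid,
    }


# Toy data: 3 persons (rows) scored by 2 raters (columns).
ratings = [[1, 3], [2, 4], [3, 5]]
components = variance_components(ratings)
print(components)
```

In the toy data the second rater scores every person exactly two points higher, so all the variation is attributed to persons and raters and the residual component is zero. With real data, estimates can come out negative because of sampling error and are usually set to zero.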