Psychological Measurement: Reliability and the Properties of Random Errors The last two lectures were concerned with some basics of psychological measurement:

Slides:



Advertisements
Similar presentations
Chapter 8 Flashcards.
Advertisements

Measurement Concepts Operational Definition: is the definition of a variable in terms of the actual procedures used by the researcher to measure and/or.
Reliability IOP 301-T Mr. Rajesh Gunesh Reliability  Reliability means repeatability or consistency  A measure is considered reliable if it would give.
How good are our measurements? The last three lectures were concerned with some basics of psychological measurement: What does it mean to quantify a psychological.
Conceptualization and Measurement
Taking Stock Of Measurement. Basics Of Measurement Measurement: Assignment of number to objects or events according to specific rules. Conceptual variables:
Some (Simplified) Steps for Creating a Personality Questionnaire Generate an item pool Administer the items to a sample of people Assess the uni-dimensionality.
Chapter 4 – Reliability Observed Scores and True Scores Error
1 Reliability in Scales Reliability is a question of consistency do we get the same numbers on repeated measurements? Low reliability: reaction time High.
Lesson Six Reliability.
1Reliability Introduction to Communication Research School of Communication Studies James Madison University Dr. Michael Smilowitz.
 A description of the ways a research will observe and measure a variable, so called because it specifies the operations that will be taken into account.
Reliability Analysis. Overview of Reliability What is Reliability? Ways to Measure Reliability Interpreting Test-Retest and Parallel Forms Measuring and.
Measurement Reliability and Validity
Quiz Do random errors accumulate? Name 2 ways to minimize the effect of random error in your data set.
Validity In our last class, we began to discuss some of the ways in which we can assess the quality of our measurements. We discussed the concept of reliability.
Reliability, the Properties of Random Errors, and Composite Scores.
-生醫統計期末報告- Reliability 學生 : 劉佩昀 學號 : 授課老師 : 蔡章仁.
Reliability Analysis. Overview of Reliability What is Reliability? Ways to Measure Reliability Interpreting Test-Retest and Parallel Forms Measuring and.
When Measurement Models and Factor Models Conflict: Maximizing Internal Consistency James M. Graham, Ph.D. Western Washington University ABSTRACT: The.
MGTO 231 Human Resources Management Personnel selection I Dr. Kin Fai Ellick WONG.
Measurement: Reliability and Validity For a measure to be useful, it must be both reliable and valid Reliable = consistent in producing the same results.
Lesson Seven Reliability. Contents  Definition of reliability Definition of reliability  Indication of reliability: Reliability coefficient Reliability.
Psych 231: Research Methods in Psychology
Variables cont. Psych 231: Research Methods in Psychology.
Validity, Reliability, & Sampling
Reliability, Validity, & Scaling
Defining and Measuring Variables Slides Prepared by Alison L. O’Malley Passer Chapter 4.
Goals for Today Review the basics of an experiment Learn how to create a unit-weighted composite variable and how/why it is used in psychology. Learn how.
Chapter 1: Research Methods
1 Chapter 4 – Reliability 1. Observed Scores and True Scores 2. Error 3. How We Deal with Sources of Error: A. Domain sampling – test items B. Time sampling.
Validity In our last class, we began to discuss some of the ways in which we can assess the quality of our measurements. We discussed the concept of reliability.
Reliability, the Properties of Random Errors, and Composite Scores Week 7, Psych R. Chris Fraley
Chapter 2: Behavioral Variability and Research Variability and Research 1. Behavioral science involves the study of variability in behavior how and why.
Research Methodology and Methods of Social Inquiry Nov 8, 2011 Assessing Measurement Reliability & Validity.
Reliability: The degree to which a measurement can be successfully repeated.
Experimental Research Methods in Language Learning Chapter 12 Reliability and Reliability Analysis.
MEASUREMENT: PART 1. Overview  Background  Scales of Measurement  Reliability  Validity (next time)
Reliability: Introduction. Reliability Session 1.Definitions & Basic Concepts of Reliability 2.Theoretical Approaches 3.Empirical Assessments of Reliability.
Measurement Experiment - effect of IV on DV. Independent Variable (2 or more levels) MANIPULATED a) situational - features in the environment b) task.
Chapter 6 - Standardized Measurement and Assessment
Reliability a measure is reliable if it gives the same information every time it is used. reliability is assessed by a number – typically a correlation.
Reliability When a Measurement Procedure yields consistent scores when the phenomenon being measured is not changing. Degree to which scores are free of.
Reliability EDUC 307. Reliability  How consistent is our measurement?  the reliability of assessments tells the consistency of observations.  Two or.
Copyright © 2014 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 11 Measurement and Data Quality.
Measurement and Scaling Concepts
MGMT 588 Research Methods for Business Studies
Reliability Analysis.
Ch. 5 Measurement Concepts.
Product Reliability Measuring
Catching Up: Review.
Measurement: Part 1.
CHAPTER 5 MEASUREMENT CONCEPTS © 2007 The McGraw-Hill Companies, Inc.
assessing scale reliability
Classical Test Theory Margaret Wu.
Measurement with Numbers Scaling: What is a Number?
پرسشنامه کارگاه.
5. Reliability and Validity
Reliability and Validity of Measurement
Chapter 5 Conceptualization, Operationalization, and Measurement
Measurement: Part 1.
Evaluation of measuring tools: reliability
By ____________________
Reliability Analysis.
The first test of validity
Reliability, the Properties of Random Errors, and Composite Scores
Measurement Concepts and scale evaluation
Measurement: Part 1.
Reliability and Validity
Presentation transcript:

Psychological Measurement: Reliability and the Properties of Random Errors The last two lectures were concerned with some basics of psychological measurement: What does it mean to quantify a psychological variable? How do we operationally define both observable and latent variables? The next important issue concerns the quality of our measurements How can we help make our measurements precise? How can we determine whether we’re measuring what we think we’re measuring?

Reliability Reliability: the extent to which measurements are free of random errors. Random error: nonsystematic mistakes in measurement misreading a questionnaire item observer looks away when coding behavior nonsystematic misinterpretations of a behavior

Reliability What are the implications of random measurement errors for the quality of our measurements?

Reliability O = T + E + S O = T + E O = a measured score (e.g., performance on an exam) T = true score (e.g., the value we want) E = random error S = systematic error O = T + E (we’ll ignore S for now, but we’ll return to it later)

Reliability O = T + E The error becomes a part of what we’re measuring This is a problem if we’re operationally defining our variables using equivalence definitions because part of our measurement is based on the true value that we want and part is based on error. Once we’ve taken a measurement, we have an equation with two unknowns. We can’t separate the relative contribution of T and E. 10 = T + E

Reliability: Do random errors accumulate? Question: If we sum or average multiple observations, will random errors accumulate?

Reliability: Do random errors accumulate? Answer: No. If E is truly random, we are just as likely to overestimate T as we are to underestimate T. Height example

5’2 5’3 5’4 5’5 5’6 5’7 5’8 5’9 5’10 5’11 6 6’1 6’2 6’3 6’4 6’5 6’6 6’7 6’8 8’9

Reliability: Do random errors accumulate? Note: The average of the seven O’s is equal to T

Reliability: Implications These demonstrations suggest that one important way to help eliminate the influence of random errors of measurement is to use multiple measurements. operationally define latent variables via multiple indicators use more than one observer when quantifying behaviors

Reliability: Estimating reliability Question: How can we estimate the reliability of our measurements? Answer: Two common ways: (a) test-retest reliability (b) internal consistency reliability

Reliability: Estimating reliability Test-retest reliability: Reliability assessed by measuring something at least twice at different time points. The logic is as follows: If the errors of measurement are truly random, then the same errors are unlikely to be made more than once. Thus, to the degree that two measurements of the same thing agree, it is unlikely that those measurements contain random error.

Reliability: Estimating reliability Internal consistency: Reliability assessed by measuring something at least twice within the same broad slice of time. Split-half: based on an arbitrary split (e.g, comparing odd and even, first half and second half) Cronbach’s alpha (): based on the average of all possible split-halves

Item A 4 3 Item B 5 5 Item C 6 7 Item D 5 5 Item E 4 3 Item F 5 5 Less error More error Item A 4 3 Item B 5 5 Item C 6 7 Item D 5 5 Item E 4 3 Item F 5 5 Items A, B, & C yield an average score of (3+5+7)/3 = 5. Items A, B, & C yield an average score of (4+5+6)/3 = 5. Items D, E, & F yield an average scores of (5, 3, 5)/3 = 4.3. Items D, E, & F yield an average scores of (5, 4, 5)/3 = 4.6. These two estimates are off by only .4 of a point. These two estimates are off by .7 of a point.

Reliability: Final notes An important implication: As you increase the number of indicators, the amount of random error in the averaged measurement decreases. An important assumption: The entity being measured is not changing. An important note: Common indices of reliability range from 0 to 1; higher numbers indicate better reliability (i.e., less random error).