Download presentation
Presentation is loading. Please wait.
Published byDylan Arthur Fletcher Modified over 9 years ago
1
Reliability Chapter 3
2
Every observed score is a combination of true score and error Obs. = T + E Reliability = Classical Test Theory
3
Systematic versus unsystematic error Reliability only takes unsystematic error into account Reliability
4
Reliability & Correlation Reliability often based on consistency between two sets of scores Correlation: Statistical technique used to examine consistency
5
Positive Correlation
6
Negative Correlation
7
Correlation coefficient: a numerical indicator of the relationship between two sets of data Pearson-Product Moment correlation coefficient is most common Pearson-Product Moment Correlation Coefficient
8
The percentage of shared variance between two sets of data Coefficient of Determination
9
Test-Retest Alternate/Parallel Forms Internal Consistency Measures Types of Reliability
10
Correlating performance on first administration with performance on the second Co-efficient of stability Test-Retest
11
Two forms of instrument, administered to same individuals Alternate/Parallel Forms
12
Split-half reliability Spearman-Brown formula Kuder-Richardson formulas KR 20 KR 21 Coefficient Alpha Internal Consistency Measures
13
Typical methods for determining reliability may not be suitable for: Speed tests Criterion-referenced tests Subjectively-scored instruments Interrater reliability Nontypical Situations
14
Examine purpose for using instrument Be knowledgeable about reliability coefficients of other instruments in that area Examine characteristics of particular clients against reliability coefficients Coefficients may vary based on SES, age, culture/ethnicity, etc. Evaluating Reliability Coefficients
15
Standard Error of Measurement Provides estimate of range of scores if someone were to take instrument repeatedly Based on premise that when individuals take a test multiple times, scores fall into normal distribution
16
Sam’s SAT Verbal = 550 r =.91; s = 100 SEM 68% of the time, Sam’s true score would fall between 520 and 580 95% of the time, Sam’s true score would fall between 490 and 610 99.5% of the time, Sam’s true score would fall between 460 and 640 SEM: Example
17
Determining Range of Scores Using SEM
18
Method to determine if difference between two scores is significant Takes into account SEM of both scores Standard Error of Difference
19
Generalizability or Domain Sampling Theory Focus is on estimating the extent to which specific sources of variation under defined conditions are contributing to the score on the instrument Alternative Theoretical Model
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.