Reliability Chapter 3.  Every observed score is a combination of true score and error Obs. = T + E  Reliability = Classical Test Theory.

Slides:



Advertisements
Similar presentations
Consistency in testing
Advertisements

Topics: Quality of Measurements
Reliability Definition: The stability or consistency of a test. Assumption: True score = obtained score +/- error Domain Sampling Model Item Domain Test.
Chapter 5 Understanding, Calculating, and Evaluating Reliability and Objectivity.
© McGraw-Hill Higher Education. All rights reserved. Chapter 3 Reliability and Objectivity.
Chapter 5 Reliability Robert J. Drummond and Karyn Dayle Jones Assessment Procedures for Counselors and Helping Professionals, 6 th edition Copyright ©2006.
The Department of Psychology
© 2006 The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Validity and Reliability Chapter Eight.
Psychometrics William P. Wattles, Ph.D. Francis Marion University.
Chapter 4 – Reliability Observed Scores and True Scores Error
Assessment Procedures for Counselors and Helping Professionals, 7e © 2010 Pearson Education, Inc. All rights reserved. Chapter 5 Reliability.
VALIDITY AND RELIABILITY
Lesson Six Reliability.
1Reliability Introduction to Communication Research School of Communication Studies James Madison University Dr. Michael Smilowitz.
 A description of the ways a research will observe and measure a variable, so called because it specifies the operations that will be taken into account.
Reliability for Teachers Kansas State Department of Education ASSESSMENT LITERACY PROJECT1 Reliability = Consistency.
Methods for Estimating Reliability
CORRELATIONAL ANALYSES EDRS 5305 EDUCATIONAL RESEARCH & STATISTICS.
Chapter 17 Making Sense of Advanced Statistical Procedures in Research Articles.
Testing 05 Reliability.
Reliability and Validity of Research Instruments
Reliability n Consistent n Dependable n Replicable n Stable.
Reliability n Consistent n Dependable n Replicable n Stable.
Reliability and Validity
Lecture 7 Psyc 300A. Measurement Operational definitions should accurately reflect underlying variables and constructs When scores are influenced by other.
Lesson Seven Reliability. Contents  Definition of reliability Definition of reliability  Indication of reliability: Reliability coefficient Reliability.
Session 3 Normal Distribution Scores Reliability.
Research Methods in MIS
Classroom Assessment A Practical Guide for Educators by Craig A
Reliability of Selection Measures. Reliability Defined The degree of dependability, consistency, or stability of scores on measures used in selection.
Classical Test Theory By ____________________. What is CCT?
Classroom Assessment Reliability. Classroom Assessment Reliability Reliability = Assessment Consistency. –Consistency within teachers across students.
Measurement and Data Quality
Validity and Reliability
Reliability, Validity, & Scaling
Data Analysis. Quantitative data: Reliability & Validity Reliability: the degree of consistency with which it measures the attribute it is supposed to.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 14 Measurement and Data Quality.
Psychometrics William P. Wattles, Ph.D. Francis Marion University.
Instrumentation (cont.) February 28 Note: Measurement Plan Due Next Week.
Reliability Chapter 3. Classical Test Theory Every observed score is a combination of true score plus error. Obs. = T + E.
Creating Assessments AKA how to write a test. Creating Assessments All good assessments have three key features: All good assessments have three key features:
Reliability & Validity
1 Chapter 4 – Reliability 1. Observed Scores and True Scores 2. Error 3. How We Deal with Sources of Error: A. Domain sampling – test items B. Time sampling.
Counseling Research: Quantitative, Qualitative, and Mixed Methods, 1e © 2010 Pearson Education, Inc. All rights reserved. Basic Statistical Concepts Sang.
Tests and Measurements Intersession 2006.
Assessing Learners with Special Needs: An Applied Approach, 6e © 2009 Pearson Education, Inc. All rights reserved. Chapter 4:Reliability and Validity.
EDU 8603 Day 6. What do the following numbers mean?
Appraisal and Its Application to Counseling COUN 550 Saint Joseph College For Class # 3 Copyright © 2005 by R. Halstead. All rights reserved.
Chapter 2: Behavioral Variability and Research Variability and Research 1. Behavioral science involves the study of variability in behavior how and why.
RELIABILITY Prepared by Marina Gvozdeva, Elena Onoprienko, Yulia Polshina, Nadezhda Shablikova.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Assessing Measurement Quality in Quantitative Studies.
Psychometrics. Goals of statistics Describe what is happening now –DESCRIPTIVE STATISTICS Determine what is probably happening or what might happen in.
Measurement MANA 4328 Dr. Jeanne Michalski
Experimental Research Methods in Language Learning Chapter 12 Reliability and Reliability Analysis.
Reliability n Consistent n Dependable n Replicable n Stable.
©2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Reliability performance on language tests is also affected by factors other than communicative language ability. (1) test method facets They are systematic.
Reliability: Introduction. Reliability Session 1.Definitions & Basic Concepts of Reliability 2.Theoretical Approaches 3.Empirical Assessments of Reliability.
Reliability: Introduction. Reliability Session Definitions & Basic Concepts of Reliability Theoretical Approaches Empirical Assessments of Reliability.
Reliability a measure is reliable if it gives the same information every time it is used. reliability is assessed by a number – typically a correlation.
Classroom Assessment Chapters 4 and 5 ELED 4050 Summer 2007.
Measuring Research Variables
Dr. Jeffrey Oescher 27 January 2014 Technical Issues  Two technical issues  Validity  Reliability.
Copyright © 2014 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 11 Measurement and Data Quality.
Reliability. Basics of test score theory Each person has a true score that would be obtained if there were no errors in measurement. However, measuring.
Chapter 2 Norms and Reliability. The essential objective of test standardization is to determine the distribution of raw scores in the norm group so that.
Copyright © 2009 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 47 Critiquing Assessments.
Reliability & Validity
By ____________________
The first test of validity
Presentation transcript:

Reliability Chapter 3

 Every observed score is a combination of true score and error Obs. = T + E  Reliability = Classical Test Theory

 Systematic versus unsystematic error  Reliability only takes unsystematic error into account Reliability

Reliability & Correlation  Reliability often based on consistency between two sets of scores  Correlation: Statistical technique used to examine consistency

Positive Correlation

Negative Correlation

 Correlation coefficient: a numerical indicator of the relationship between two sets of data  Pearson-Product Moment correlation coefficient is most common Pearson-Product Moment Correlation Coefficient

 The percentage of shared variance between two sets of data Coefficient of Determination

 Test-Retest  Alternate/Parallel Forms  Internal Consistency Measures Types of Reliability

 Correlating performance on first administration with performance on the second  Co-efficient of stability Test-Retest

 Two forms of instrument, administered to same individuals Alternate/Parallel Forms

 Split-half reliability  Spearman-Brown formula  Kuder-Richardson formulas  KR 20  KR 21  Coefficient Alpha Internal Consistency Measures

 Typical methods for determining reliability may not be suitable for:  Speed tests  Criterion-referenced tests  Subjectively-scored instruments  Interrater reliability Nontypical Situations

 Examine purpose for using instrument  Be knowledgeable about reliability coefficients of other instruments in that area  Examine characteristics of particular clients against reliability coefficients  Coefficients may vary based on SES, age, culture/ethnicity, etc. Evaluating Reliability Coefficients

Standard Error of Measurement  Provides estimate of range of scores if someone were to take instrument repeatedly  Based on premise that when individuals take a test multiple times, scores fall into normal distribution

 Sam’s SAT Verbal = 550  r =.91; s = 100  SEM  68% of the time, Sam’s true score would fall between 520 and 580  95% of the time, Sam’s true score would fall between 490 and 610  99.5% of the time, Sam’s true score would fall between 460 and 640 SEM: Example

Determining Range of Scores Using SEM

 Method to determine if difference between two scores is significant  Takes into account SEM of both scores Standard Error of Difference

 Generalizability or Domain Sampling Theory  Focus is on estimating the extent to which specific sources of variation under defined conditions are contributing to the score on the instrument Alternative Theoretical Model