Types of Reliability.

Topics: Quality of Measurements

Types of Reliability

Reliability Is Consistency of What? Observers or raters; tests over time; different versions of the same test; a test at one point in time.

Inter-Rater or Inter-Observer Reliability [Diagram: Observer 1 and Observer 2 each rate the same object or phenomenon. Do their ratings agree (Observer 1 = Observer 2)?]

Inter-Rater or Inter-Observer Reliability: Are different observers consistent? Can establish this outside of your study in a pilot study. Can look at percent agreement (especially with category ratings). Can use correlation (with continuous ratings).
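
The two checks named on this slide can be sketched in a few lines of Python. This is only an illustration: the function names and all of the ratings are hypothetical pilot-study data invented here, not taken from the presentation.

```python
from math import sqrt

def percent_agreement(rater_a, rater_b):
    """Share of cases (0 to 1) on which two raters assign the same category."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(rater_a, rater_b))
    return matches / len(rater_a)

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

# Hypothetical category ratings: two observers code the same 8 events.
observer_1_codes = ["on", "off", "on", "on", "off", "on", "off", "on"]
observer_2_codes = ["on", "off", "on", "off", "off", "on", "off", "on"]
agreement = percent_agreement(observer_1_codes, observer_2_codes)  # 7 of 8 match

# Hypothetical continuous ratings (1-10 scale) of the same 6 objects.
observer_1_scores = [2, 4, 5, 7, 8, 9]
observer_2_scores = [3, 4, 6, 6, 9, 9]
r = pearson_r(observer_1_scores, observer_2_scores)

print(f"percent agreement = {agreement:.1%}, r = {r:.2f}")
```

Percent agreement suits category codes because it only asks whether the two labels match; correlation suits continuous ratings because it rewards observers who order the objects the same way even when their numbers differ slightly.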

Test-Retest Reliability: Stability over time [Diagram: the same test is given at Time 1 and again at Time 2. Does Test (Time 1) = Test (Time 2)?]

Test-Retest Reliability: Administer the instrument at two times to multiple persons. Compute the correlation between the two sets of scores. Assumes there is no change in the underlying trait between Time 1 and Time 2.
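
A minimal sketch of the test-retest computation, assuming each person's Time 1 and Time 2 scores sit at the same position in two lists. The scores are hypothetical and the helper name is an assumption made for this example.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

# Hypothetical scores for 6 persons on the same instrument, given twice.
time_1 = [12, 15, 9, 20, 17, 11]
time_2 = [13, 14, 10, 19, 18, 10]

test_retest_r = pearson_r(time_1, time_2)
print(f"test-retest reliability = {test_retest_r:.2f}")
```

The lists must be paired by person: position 0 in both lists is the same individual, and so on.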

Parallel-Forms Reliability: Stability across forms [Diagram: Form A is given at Time 1 and Form B at Time 2. Does Form A = Form B?]

Parallel-Forms Reliability: Administer both forms to the same people. Compute the correlation between the two forms. Usually done in educational contexts where you need alternative forms because of the frequency of retesting and where you can sample from lots of equivalent questions.
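
One way to picture "sampling from lots of equivalent questions" is to divide a question pool at random into Form A and Form B, score the same people on both, and correlate the two sets of totals. The sketch below assumes a tiny pool of 8 right/wrong questions; the function names, the seed, and the response data are all hypothetical.

```python
import random
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

def build_parallel_forms(n_items, seed=0):
    """Randomly divide item indexes 0..n_items-1 into two equal-sized forms."""
    pool = list(range(n_items))
    random.Random(seed).shuffle(pool)
    half = n_items // 2
    return sorted(pool[:half]), sorted(pool[half:2 * half])

def form_scores(responses, form_items):
    """Total score of each person on the items belonging to one form."""
    return [sum(row[i] for i in form_items) for row in responses]

# Hypothetical responses: 5 persons (rows) by 8 pooled questions (1 = correct).
responses = [
    [1, 1, 1, 1, 1, 1, 1, 1],
    [1, 1, 1, 0, 1, 1, 1, 1],
    [1, 0, 1, 0, 1, 0, 1, 1],
    [0, 1, 0, 0, 1, 0, 0, 1],
    [0, 0, 0, 0, 0, 0, 0, 0],
]

form_a, form_b = build_parallel_forms(n_items=8)
parallel_forms_r = pearson_r(form_scores(responses, form_a),
                             form_scores(responses, form_b))
print(f"Form A items {form_a}, Form B items {form_b}, r = {parallel_forms_r:.2f}")
```

In practice both forms would be far longer than four items, and the pool would need to be large enough that a random split really does yield equivalent forms.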

Internal Consistency Reliability: Average inter-item correlation
[Diagram: a test made up of Item 1 through Item 6, with the correlation computed between every pair of items.]

       I1    I2    I3    I4    I5    I6
I1   1.00
I2    .89  1.00
I3    .91   .92  1.00
I4    .88   .93   .95  1.00
I5    .84   .86   .92   .85  1.00
I6    .88   .91   .95   .87   .85  1.00

Average inter-item correlation = .90
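
As a sketch of the arithmetic behind this slide, the average inter-item correlation is the mean of the correlations below the diagonal, counting each item pair once. The matrix values are the ones shown on the slide; the function name is an assumption made for this example.

```python
# Lower triangle of the inter-item correlation matrix shown on the slide
# (rows and columns are items I1..I6; the diagonal is 1.00).
lower_triangle = [
    [1.00],
    [0.89, 1.00],
    [0.91, 0.92, 1.00],
    [0.88, 0.93, 0.95, 1.00],
    [0.84, 0.86, 0.92, 0.85, 1.00],
    [0.88, 0.91, 0.95, 0.87, 0.85, 1.00],
]

def average_inter_item_correlation(triangle):
    """Mean of the below-diagonal correlations (each item pair counted once)."""
    pairs = [r for row in triangle for r in row[:-1]]  # drop the diagonal entry
    return sum(pairs) / len(pairs)

avg_r = average_inter_item_correlation(lower_triangle)
print(f"average inter-item r = {avg_r:.3f}")
```

With the values as transcribed, the 15 pairwise correlations average to roughly .89, slightly below the .90 shown on the slide; the difference may simply reflect rounding in the original figure.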

Internal Consistency Reliability: Average item-total correlation
[Diagram: a test made up of Item 1 through Item 6, with each item also correlated with the total score.]

        I1    I2    I3    I4    I5    I6   Total
I1     1.00
I2      .89  1.00
I3      .91   .92  1.00
I4      .88   .93   .95  1.00
I5      .84   .86   .92   .85  1.00
I6      .88   .91   .95   .87   .85  1.00
Total   .84   .88   .86   .87   .83   .82  1.00

Average item-total correlation = .85
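
To show where the Total row could come from, the sketch below computes item-total correlations from raw scores: sum each person's items into a total, then correlate every item with that total. The 1-5 ratings are hypothetical, and the function names are assumptions made for this example.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

def item_total_correlations(scores):
    """Correlate each item (column) with the per-person total score.

    scores: one row per person, one column per item. The total here
    includes the item itself, matching the Total row on the slide.
    """
    totals = [sum(row) for row in scores]
    n_items = len(scores[0])
    return [pearson_r([row[i] for row in scores], totals) for i in range(n_items)]

# Hypothetical 1-5 ratings: 6 persons (rows) by 6 items (columns).
scores = [
    [5, 4, 5, 4, 5, 4],
    [4, 4, 4, 3, 4, 4],
    [3, 3, 4, 3, 3, 2],
    [2, 3, 2, 2, 3, 2],
    [2, 1, 2, 1, 2, 1],
    [1, 1, 1, 2, 1, 1],
]

item_total = item_total_correlations(scores)
average_item_total = sum(item_total) / len(item_total)
print([round(r, 2) for r in item_total], round(average_item_total, 2))
```

An item whose correlation with the total is much lower than the others is a candidate for review, since it may not be measuring the same thing as the rest of the test.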

Internal Consistency Reliability: Split-half correlations [Diagram: the six test items are divided into two halves, Items 1, 3, 4 and Items 2, 5, 6. The correlation between the two halves is .87.]
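
A minimal sketch of one split-half correlation, using the same split as the slide (Items 1, 3, 4 versus Items 2, 5, 6). The ratings are hypothetical, so the result will not match the .87 on the slide; the function names are assumptions made for this example.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

def split_half_correlation(scores, half_a, half_b):
    """Correlate person totals on two halves of a test (0-based item indexes)."""
    totals_a = [sum(row[i] for i in half_a) for row in scores]
    totals_b = [sum(row[i] for i in half_b) for row in scores]
    return pearson_r(totals_a, totals_b)

# Hypothetical 1-5 ratings: 6 persons (rows) by 6 items (columns).
scores = [
    [5, 4, 5, 4, 5, 4],
    [4, 4, 4, 3, 4, 4],
    [3, 3, 4, 3, 3, 2],
    [2, 3, 2, 2, 3, 2],
    [2, 1, 2, 1, 2, 1],
    [1, 1, 1, 2, 1, 1],
]

# Items 1, 3, 4 form one half and Items 2, 5, 6 the other (0-based indexes).
split_r = split_half_correlation(scores, half_a=[0, 2, 3], half_b=[1, 4, 5])
print(f"split-half correlation = {split_r:.2f}")
```

A different division of the items would generally give a somewhat different correlation.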

Internal Consistency Reliability: Cronbach’s alpha (α)
[Diagram: the six test items are split into halves in many different ways (for example, Items 1, 3, 4 versus Items 2, 5, 6), and each split yields a split-half correlation.]
SH1 = .87, SH2 = .85, SH3 = .91, SH4 = .83, SH5 = .86, ..., SHn = .85
α = .85
Like the average of all possible split-half correlations.
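
The slide describes alpha by analogy with split-half correlations. In practice alpha is usually computed from item and total-score variances rather than by enumerating splits, and the sketch below uses that standard variance form, α = k/(k−1) × (1 − Σ item variances / variance of totals). The data and the function name are hypothetical.

```python
from statistics import variance

def cronbach_alpha(scores):
    """Cronbach's alpha from a persons-by-items score table.

    Uses alpha = k/(k-1) * (1 - sum(item variances) / variance(total scores)),
    with sample variances throughout.
    """
    k = len(scores[0])
    item_variances = [variance([row[i] for row in scores]) for i in range(k)]
    total_variance = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)

# Hypothetical 1-5 ratings: 6 persons (rows) by 6 items (columns).
scores = [
    [5, 4, 5, 4, 5, 4],
    [4, 4, 4, 3, 4, 4],
    [3, 3, 4, 3, 3, 2],
    [2, 3, 2, 2, 3, 2],
    [2, 1, 2, 1, 2, 1],
    [1, 1, 1, 2, 1, 1],
]

alpha = cronbach_alpha(scores)
print(f"Cronbach's alpha = {alpha:.2f}")
```

Because it needs only one pass over the data, the variance form avoids having to compute every possible split.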

Internal Consistency Reliability - Summary: Average inter-item correlation; average item-total correlation; split-half reliability; Cronbach’s alpha (α).