Reliability Presentation Test-Retest James Blackwood – AED 615 Fall Semester 2006.


Test-Retest Reliability
- The test-retest method of determining reliability is accomplished by administering a test to a group.
- After a period of time has passed, the same test is re-administered to the same group.
- It is also known as stability reliability.
- It is used in both qualitative and quantitative research (qualitative research requires a different technique of analysis).

Reliability Coefficient
1. After the two tests have been administered, a reliability coefficient is calculated to determine the relationship between the two sets of scores obtained.
2. If the same results are obtained from the two tests, then the coefficient = 1.00.
3. The coefficient is influenced by the amount of time that has passed between the administration of the two tests.
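As a minimal sketch, assuming the reliability coefficient is computed as a Pearson correlation between the two sets of scores (the slides do not name the statistic), the calculation could look like the following. The function name `pearson_r` and the score lists are hypothetical and serve only as illustration.

```python
from math import sqrt


def pearson_r(first, second):
    """Pearson correlation between paired scores from two administrations."""
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need two equal-length score lists with at least 2 pairs")
    n = len(first)
    mean_x = sum(first) / n
    mean_y = sum(second) / n
    # Sum of cross-products and sums of squared deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(first, second))
    sxx = sum((x - mean_x) ** 2 for x in first)
    syy = sum((y - mean_y) ** 2 for y in second)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when all scores in a list are equal")
    return sxy / sqrt(sxx * syy)


# Hypothetical scores for five test takers (first test, then retest).
test_scores = [70, 82, 91, 65, 77]
retest_scores = [72, 80, 93, 68, 75]

identical = pearson_r(test_scores, test_scores)    # same results on both tests
observed = pearson_r(test_scores, retest_scores)   # small changes between tests

print(f"identical results: r = {identical:.2f}")
print(f"hypothetical retest: r = {observed:.2f}")
```

With identical results the coefficient comes out as 1.00; the hypothetical retest scores differ slightly, so the coefficient falls a little below that.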

Reliability Coefficient Measurement
1. The reliability coefficient is expected to be lower the longer the time interval between the tests, because the population taking the test may change in the meantime.
2. The shorter the time gap, the higher the correlation; the longer the time gap, the lower the correlation.

Test-Retest Issues
- There cannot be any measurable change in the construct being measured between the two tests.
- This method will not work when measuring a variable that is not stable in an individual.
- Unless the instrument is reliable, relationships with other variables in the study will not be identified.

Testing
- You can obtain considerably different estimates of reliability depending on the interval between tests.
- For educational research, examination of scores over a two- to three-month period is sufficient for test-retest reliability verification.
- The time interval between the two tests should always be reported when using test-retest as a measure of reliability.

Test-Retest Equation
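
The equation itself is not reproduced in this transcript. A common way to compute a test-retest coefficient, assumed here rather than taken from the slide, is the Pearson correlation between the first-test scores $x_i$ and the retest scores $y_i$ of the same $n$ people, where $\bar{x}$ and $\bar{y}$ are the two mean scores:

```latex
r_{xy} = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}
              {\sqrt{\sum_{i=1}^{n} (x_i - \bar{x})^2}\,
               \sqrt{\sum_{i=1}^{n} (y_i - \bar{y})^2}}
```

Under this assumption, $r_{xy} = 1$ when every person's retest score equals their first score, and the value drops as the two sets of scores diverge.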

Test-Retest Issues
1. Requires twice the data collection.
2. The population for the test would need to be willing to repeat the test (often impractical).
3. Higher costs due to multiple tests being administered.
4. Only works well when practical (better for a smaller population than a large one).
5. The researcher may not be able to retest the population.

Examples of Test-Retest
- Educational assessment
- Drug testing
- Testing measurement equipment
- Medical evaluations

Research Literature Example
American Journal of Agricultural Economics, Volume 71, Number 1 (Feb 1989)
Test-Retest Reliability of the Contingent Valuation Method: A Comparison of General Population and Visitor Responses
John B. Loomis
Abstract: The reliability of the contingent valuation method is evaluated by resurveying the same general households and visitors nine months after their original survey. Test-retest correlations on willingness to pay are statistically significant and ranged from .422 for the general population sample to .782 for the visitor sample. Using a paired t-test, there was no statistical difference between an individual's first and second reported willingness to pay. Chow tests comparing the original and resurvey willingness-to-pay functions showed no statistical difference at the .01 level. Reported willingness to pay is reasonably stable over the time period surveyed.
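
The paired t-test mentioned in the abstract can be sketched as follows. This is an illustration only: the willingness-to-pay values are invented, the function name `paired_t_statistic` is hypothetical, and nothing here reproduces the data or procedure of the Loomis study.

```python
from math import sqrt


def paired_t_statistic(first, second):
    """Return (t, degrees_of_freedom) for paired first and second responses."""
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need two equal-length lists with at least 2 pairs")
    # Work on the within-person differences (second response minus first).
    diffs = [b - a for a, b in zip(first, second)]
    n = len(diffs)
    mean_diff = sum(diffs) / n
    # Sample variance of the differences (n - 1 in the denominator).
    var_diff = sum((d - mean_diff) ** 2 for d in diffs) / (n - 1)
    if var_diff == 0:
        raise ValueError("t is undefined when all differences are identical")
    return mean_diff / sqrt(var_diff / n), n - 1


# Invented willingness-to-pay answers (dollars) from six respondents.
first_survey = [20, 35, 15, 50, 40, 25]
resurvey = [22, 30, 18, 48, 45, 24]

t_value, df = paired_t_statistic(first_survey, resurvey)
print(f"t = {t_value:.3f} with {df} degrees of freedom")
```

For these invented numbers, |t| is well below 2.571, the two-tailed critical value at the .05 level for 5 degrees of freedom, so the first and second responses would not differ significantly.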

References
Fraenkel, J. R., & Wallen, N. E. (2006). How to design and evaluate research in education (6th ed.). New York: McGraw-Hill.
Guttman, L. (1946). The test-retest reliability of qualitative data. Psychometrika, 11(2). Abstract retrieved October 15, 2006 from Springer Link database.
Loomis, J. B. (1989). Test-retest reliability of the contingent valuation method: A comparison of general population and visitor responses. American Journal of Agricultural Economics, 71(1).
Trochim, W. M. (2006). Types of reliability. Research Methods Knowledge Base. Retrieved from