Reliability, validity, and scaling

Basics
What is reliability? What is validity?
Can you have one without the other? Why or why not?

Reliability
Observed score = True score + random error + systematic error. Explain.
Why is this important?
How can you decrease these errors?
What does a reliability estimate of .85 tell you?
What do we want our reliability estimates to be?
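
As a minimal sketch of the decomposition above, the following Python simulation treats reliability as the share of observed-score variance that is true-score variance, and estimates it as the correlation between two parallel forms. The variances (85 and 15), the sample size, the seed, the +5 bias, and all function names are illustrative assumptions, not values from the lecture.

```python
import random


def pearson(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def theoretical_reliability(true_var, error_var):
    """Reliability as true-score variance over total observed variance."""
    return true_var / (true_var + error_var)


def simulate_two_forms(n=20000, true_var=85.0, error_var=15.0, bias=0.0, seed=0):
    """Simulate two parallel forms: observed = true + random error + bias."""
    rng = random.Random(seed)
    true = [rng.gauss(50.0, true_var ** 0.5) for _ in range(n)]
    form_a = [t + rng.gauss(0.0, error_var ** 0.5) + bias for t in true]
    form_b = [t + rng.gauss(0.0, error_var ** 0.5) + bias for t in true]
    return form_a, form_b


# Random error only: the correlation between forms estimates reliability.
a, b = simulate_two_forms()
r_random_only = pearson(a, b)

# Add a constant systematic error (hypothetical +5 bias on every score).
a_biased, b_biased = simulate_two_forms(bias=5.0)
r_with_bias = pearson(a_biased, b_biased)
mean_shift = sum(a_biased) / len(a_biased) - sum(a) / len(a)
```

Under these assumptions a reliability of .85 means that about 85% of the observed-score variance is true-score variance. The constant bias leaves the correlation between forms unchanged while shifting every score, which is one way a measure can be reliable without being valid.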

Types of reliability
For each: what is it, how do you calculate it, when would you use it, and how can you increase it?
Inter-rater reliability
Test-retest reliability
Parallel-forms reliability
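
The calculations asked about here can be sketched briefly. Test-retest and parallel-forms reliability are usually reported as a correlation between two sets of scores, and inter-rater agreement on categorical codes is often summarized with Cohen's kappa. The data and function names below are hypothetical, and kappa is one common choice rather than the only inter-rater statistic.

```python
from collections import Counter


def pearson(x, y):
    """Pearson correlation, used here for test-retest or parallel forms."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def cohens_kappa(rater_1, rater_2):
    """Chance-corrected agreement between two raters on categorical codes."""
    n = len(rater_1)
    observed = sum(a == b for a, b in zip(rater_1, rater_2)) / n
    counts_1, counts_2 = Counter(rater_1), Counter(rater_2)
    # Agreement expected by chance, from each rater's marginal proportions.
    expected = sum(
        (counts_1[c] / n) * (counts_2[c] / n)
        for c in set(rater_1) | set(rater_2)
    )
    return (observed - expected) / (1 - expected)


# Hypothetical scores for five people measured on two occasions.
time_1 = [10, 12, 15, 18, 20]
time_2 = [11, 12, 16, 17, 21]
test_retest_r = pearson(time_1, time_2)

# Hypothetical yes/no codes from two raters on ten observations.
rater_1 = ["y", "y", "y", "y", "y", "n", "n", "n", "n", "n"]
rater_2 = ["y", "y", "y", "y", "n", "n", "n", "n", "n", "y"]
kappa = cohens_kappa(rater_1, rater_2)
```

In this made-up example the raters agree on 8 of 10 observations, but half of that agreement would be expected by chance, so kappa is lower than the raw agreement rate.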

Internal consistency reliability
For each: what is it, how do you calculate it, what does it tell you, where do you want values to be, and how can you increase it?
Average inter-item correlation
Average item-total correlation
Split-half reliability
Cronbach’s alpha
Kuder-Richardson Formula 20 (KR-20)
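
A minimal sketch of three of these calculations, assuming complete data with one row of item scores per respondent. The function names and the small data sets are hypothetical; population variances are used throughout, which gives the same alpha as sample variances because the ratio is unchanged.

```python
from statistics import pvariance


def cronbach_alpha(rows):
    """Cronbach's alpha. rows: one list of item scores per respondent."""
    k = len(rows[0])
    item_vars = [pvariance([row[i] for row in rows]) for i in range(k)]
    total_var = pvariance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def kr20(rows):
    """KR-20 for items scored 0/1; p is the proportion scoring 1 on an item."""
    n, k = len(rows), len(rows[0])
    p = [sum(row[i] for row in rows) / n for i in range(k)]
    total_var = pvariance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(pi * (1 - pi) for pi in p) / total_var)


def spearman_brown(half_r):
    """Step a split-half correlation up to the full test length."""
    return 2 * half_r / (1 + half_r)


# Hypothetical right/wrong (1/0) answers: six respondents, four items.
binary_rows = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
    [1, 1, 1, 1],
]
alpha_binary = cronbach_alpha(binary_rows)
kr20_binary = kr20(binary_rows)

# Three identical items: a limiting case where alpha reaches 1.
identical_rows = [[1, 1, 1], [3, 3, 3], [5, 5, 5], [2, 2, 2]]
alpha_identical = cronbach_alpha(identical_rows)
```

On dichotomous items KR-20 and alpha give the same value, since the variance of a 0/1 item is p(1 - p). The Spearman-Brown step is needed for split-half reliability because each half is only half as long as the full test.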

Cronbach’s alpha (α)
Schmitt, 1996
Write down what you knew about alpha before reading the article.
List 2 things you learned from the article.

Uses and abuses of alpha
What is the difference between internal consistency and homogeneity? Which does alpha tell you?
What increases alpha?
What is a problem with using alpha to correct correlations for reliability?
Why is a high alpha not necessarily a good thing?
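
Two of these questions can be explored numerically. The sketch below uses the standardized form of alpha, which depends only on the number of items and the average inter-item correlation, and the classical correction for attenuation. The function names and input values are hypothetical and are not taken from Schmitt (1996).

```python
def standardized_alpha(k, mean_r):
    """Standardized alpha from k items with average inter-item correlation mean_r."""
    return k * mean_r / (1 + (k - 1) * mean_r)


def correct_for_attenuation(r_xy, rel_x, rel_y):
    """Estimated correlation between true scores, given two reliabilities."""
    return r_xy / (rel_x * rel_y) ** 0.5


# Same modest inter-item correlation, increasing test length.
alphas_by_length = {k: standardized_alpha(k, 0.2) for k in (5, 10, 20, 40)}

# Hypothetical observed correlation of .40 with reliabilities of .70 and .80.
corrected_r = correct_for_attenuation(0.40, 0.70, 0.80)
```

With the average inter-item correlation held at .20, alpha climbs simply because items are added, so a high alpha alone does not show that the items are homogeneous. One commonly cited concern with the correction is that alpha tends to be a lower-bound estimate of reliability, so dividing by it can overcorrect.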

Sample SPSS analyses

Construct validity
How does construct validity relate to internal and external validity?
For each of these: what is it, how would you calculate it, and what does it tell you?
Translational validity
  Face validity
  Content validity
Criterion-related validity
  Predictive validity
  Concurrent validity (aka known groups)
  Convergent validity
  Discriminant validity/Divergent validity
How high/low should your correlations be?

Multitrait-multimethod matrix (MTMM)
Nomological network: Cronbach & Meehl, 1955
MTMM: Campbell & Fiske, 1959
Look at example p. 69
What information does this give you?
Pattern matching
Advantages/disadvantages
How can SEM be used to show this?
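
To make the logic of reading an MTMM matrix concrete, here is a minimal sketch that labels each correlation by whether the two measures share a trait, a method, both, or neither, and then checks the basic pattern that same-trait correlations exceed different-trait ones. The traits, methods, and correlation values are invented for illustration and are not the example on p. 69.

```python
from collections import defaultdict


def classify_pair(measure_a, measure_b):
    """Label an MTMM correlation. Each measure is a (trait, method) tuple."""
    same_trait = measure_a[0] == measure_b[0]
    same_method = measure_a[1] == measure_b[1]
    if same_trait and same_method:
        return "reliability diagonal"
    if same_trait:
        return "monotrait-heteromethod"  # evidence of convergent validity
    if same_method:
        return "heterotrait-monomethod"  # may be inflated by shared method
    return "heterotrait-heteromethod"


# Hypothetical correlations between (trait, method) measures.
correlations = {
    (("anxiety", "self"), ("anxiety", "peer")): 0.60,
    (("extraversion", "self"), ("extraversion", "peer")): 0.55,
    (("anxiety", "self"), ("extraversion", "self")): 0.30,
    (("anxiety", "peer"), ("extraversion", "peer")): 0.25,
    (("anxiety", "self"), ("extraversion", "peer")): 0.10,
    (("extraversion", "self"), ("anxiety", "peer")): 0.12,
}

grouped = defaultdict(list)
for (a, b), r in correlations.items():
    grouped[classify_pair(a, b)].append(r)

convergent = grouped["monotrait-heteromethod"]
discriminant = grouped["heterotrait-monomethod"] + grouped["heterotrait-heteromethod"]

# Basic pattern check: every same-trait correlation beats every different-trait one.
pattern_supported = min(convergent) > max(discriminant)
```

In this invented matrix the heterotrait-monomethod values are higher than the heterotrait-heteromethod values, which is the kind of gap that would suggest shared method variance.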

Design threats to construct validity
What are these, and how can the problem be decreased?
Inadequate preoperational explication of constructs
Mono-operation bias
Mono-method bias
Interaction of different treatments
Interaction of testing and treatment
Restricted generalizability across constructs
Confounding constructs and levels of constructs

Social threats to construct validity (CV)
What are these, and how can the problem be decreased?
Hypothesis guessing
Evaluation apprehension
Experimenter expectancies
Other threats
  Social desirability
  Response styles
  Demand characteristics

Method variance/Method bias
Podsakoff, MacKenzie, & Podsakoff, 2012
What is it?
What are some types and causes?
What effects does it have? Why does it have these effects?

How can you deal with method bias?
Use more than one method: get predictor and criterion from different sources
Separate measures temporally, proximally, or psychologically
Use different types and points on scales
Label all points on scales, and make items less ambiguous
Decrease tendencies for socially desirable responses
Reverse score items
Try to increase motivation and ability of participants
Control for bias statistically (several options)
Make it a manipulation or look for interaction effects
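
One item from the list above, reverse-scored items, has a mechanical consequence at scoring time: the reversed items must be recoded before items are summed or reliability is computed. A minimal sketch, assuming a 1-to-5 response scale and hypothetical item positions:

```python
def reverse_score(score, scale_min=1, scale_max=5):
    """Flip a response so that high becomes low on a bounded response scale."""
    return scale_min + scale_max - score


def score_scale(responses, reversed_items, scale_min=1, scale_max=5):
    """Sum one respondent's items after recoding the reverse-keyed ones.

    reversed_items holds the zero-based positions of reverse-keyed items.
    """
    return sum(
        reverse_score(r, scale_min, scale_max) if i in reversed_items else r
        for i, r in enumerate(responses)
    )


# Hypothetical respondent: items at positions 1 and 3 are reverse-keyed.
total = score_scale([5, 1, 4, 2], reversed_items={1, 3})
```

Forgetting this recoding step makes the reverse-keyed items correlate negatively with the rest of the scale and can badly distort an internal consistency estimate.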

Controlling for other variables
Westfall & Yarkoni, 2016
What is the main point of this article?

Statistical control
Ways to test:
Regression methods for control
Incremental validity
If 2 measures are distinct
If 1 measure is “better” than another
Regression methods for control
What is the problem with using this method?
When will it be more of a problem?
How can you correct this problem?
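
The problem with regression-based control can be illustrated with a deliberately simplified case, which is an assumption of this sketch and not the simulation design in Westfall and Yarkoni (2016): two measures, x1 and x2, both tap the same construct with independent random error, so x1 has no unique effect on the outcome y. The partial correlation of x1 with y, controlling for x2, should then be zero, but it is zero only when x2 is perfectly reliable. The function names and the values of rho and the reliabilities are hypothetical.

```python
import math


def partial_r(r_y1, r_y2, r_12):
    """Correlation of y with x1 after controlling for x2."""
    return (r_y1 - r_y2 * r_12) / math.sqrt((1 - r_y2 ** 2) * (1 - r_12 ** 2))


def spurious_partial(rho, rel_1, rel_2):
    """Population partial r of x1 with y, controlling for x2, when x1 and x2
    are error-prone measures of the SAME construct.

    rho: correlation between the construct and the outcome y.
    rel_1, rel_2: reliabilities of x1 and x2 (errors assumed independent).
    """
    r_y1 = math.sqrt(rel_1) * rho
    r_y2 = math.sqrt(rel_2) * rho
    r_12 = math.sqrt(rel_1 * rel_2)
    return partial_r(r_y1, r_y2, r_12)


# Hold x1 at reliability .80 and vary the reliability of the control variable.
results = {rel_2: spurious_partial(0.5, 0.8, rel_2) for rel_2 in (1.0, 0.9, 0.7, 0.5)}
```

As the reliability of the control variable drops, more of the construct's effect "leaks" into x1, and with a large enough sample that leftover association will be statistically significant even though x1 adds nothing beyond the construct.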

Table 1. Type 1 error rates for a few parameter combinations. Westfall J, Yarkoni T (2016) Statistically Controlling for Confounding Constructs Is Harder than You Think. PLOS ONE 11(3): e0152719. https://doi.org/10.1371/journal.pone.0152719

Fig 12. Power to detect incremental validity using SEM. Westfall J, Yarkoni T (2016) Statistically Controlling for Confounding Constructs Is Harder than You Think. PLOS ONE 11(3): e0152719. https://doi.org/10.1371/journal.pone.0152719

Levels of measurement
Why do they matter?
What are the four types? Examples? Stats that can be run on them?
Nominal
Ordinal
Interval
Ratio
What types of scales do we typically use in psychology? Is that a problem?
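
As a rough aid for the "stats that can be run" question, the lookup below encodes the conventional (and debated) view associated with Stevens that each level permits the statistics of the levels below it plus some of its own. The particular statistics listed are a simplified, illustrative selection, and the names are hypothetical.

```python
LEVELS = ["nominal", "ordinal", "interval", "ratio"]

# Statistics that first become meaningful at each level (illustrative subset).
NEW_STATISTICS = {
    "nominal": ["mode", "frequency counts"],
    "ordinal": ["median", "percentiles"],
    "interval": ["mean", "standard deviation"],
    "ratio": ["geometric mean", "coefficient of variation"],
}


def permissible_statistics(level):
    """Statistics meaningful at this level: its own plus those of lower levels."""
    idx = LEVELS.index(level)
    stats = []
    for lower in LEVELS[: idx + 1]:
        stats.extend(NEW_STATISTICS[lower])
    return stats


ordinal_stats = permissible_statistics("ordinal")
```

Under this strict view a mean is not listed for ordinal data, which is why the common practice of averaging ordinal rating-scale responses is worth discussing.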

Basics of scaling
Scales vs. response scales vs. index vs. questionnaire (not every set of questions is a scale)
What are unidimensional vs. multidimensional scales?
When should you use one vs. the other?

Types of (unidimensional) scales
Thurstone (method of equal-appearing intervals)
  Generate items
  Have judges rate them
  Choose ones that represent the whole scale
Guttman (cumulative scale)
  Coefficient of reproducibility
Likert (summative rating scale)
Semantic differential scale
What are the advantages and disadvantages of each? How do you score each?

Thurstone
I believe the church is the greatest institution in America today. (0.2)
I believe in religion, but I seldom go to church. (5.4)
I believe in sincerity and goodness without any church ceremonies. (6.7)
I believe the church is a hindrance to religion for it still depends on magic, superstition, and myth. (9.6)
I think the church is a parasite on society. (11.0)
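
A Thurstone scale is typically scored by taking the median (or mean) of the scale values of the items a respondent endorses. The sketch below uses the five scale values shown above; the short item labels and the example respondent are hypothetical.

```python
from statistics import median

# Scale values from the five example items; labels are shorthand for this sketch.
SCALE_VALUES = {
    "greatest institution": 0.2,
    "seldom go to church": 5.4,
    "sincerity without ceremonies": 6.7,
    "hindrance to religion": 9.6,
    "parasite on society": 11.0,
}


def thurstone_score(endorsed_items):
    """Median scale value of the endorsed items, or None if none were endorsed."""
    values = [SCALE_VALUES[item] for item in endorsed_items]
    return median(values) if values else None


# Hypothetical respondent who agrees with the two middle statements.
score = thurstone_score(["seldom go to church", "sincerity without ceremonies"])
```

On these items lower values go with more favorable statements about the church, so a score near 6 falls in the middle of the range.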

Guttman
I am more than 54 inches tall.
I am more than 56 inches tall.
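
In a cumulative scale, anyone who endorses a harder item should also endorse every easier one, as with the height items above. A minimal sketch of the coefficient of reproducibility follows, counting errors as deviations from the ideal pattern implied by each respondent's total score. This is one of several error-counting conventions; the third item ("more than 58 inches") and the response data are hypothetical.

```python
def ideal_pattern(total, k):
    """Perfect cumulative pattern for a total score: 1s first, then 0s."""
    return [1] * total + [0] * (k - total)


def coefficient_of_reproducibility(rows):
    """rows: 0/1 responses, items ordered from easiest to hardest to endorse."""
    k = len(rows[0])
    errors = sum(
        sum(obs != exp for obs, exp in zip(row, ideal_pattern(sum(row), k)))
        for row in rows
    )
    return 1 - errors / (len(rows) * k)


# Items: taller than 54, 56, and (hypothetically) 58 inches.
responses = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
    [0, 1, 0],  # inconsistent: endorses 56 inches but not 54
]
cr = coefficient_of_reproducibility(responses)
```

A value of about .90 or higher is the conventional benchmark for treating a set of items as forming a Guttman scale; the single inconsistent respondent here pulls the coefficient below that.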

Other issues with scaling
Standardization
Norms

Steps
To choosing a scale to use?
To creating a scale?

Steps to creating an index
Conceptualize the index
Operationalize and measure the components
Develop the rules for calculating the index score (weighting?)
Validate it!
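
The third step, the calculation rule, can be sketched as a weighted sum of standardized components. Standardizing first is one possible design choice that keeps components measured in different units from dominating the index. The components, values, and weights below are all hypothetical.

```python
from statistics import mean, pstdev


def standardize(values):
    """Convert raw values to z-scores using the population standard deviation."""
    m, s = mean(values), pstdev(values)
    return [(v - m) / s for v in values]


def index_scores(components, weights):
    """Weighted sum of z-scored components, one index score per case.

    components: dict mapping component name to a list of raw values.
    weights: dict mapping component name to its weight.
    """
    z = {name: standardize(vals) for name, vals in components.items()}
    n_cases = len(next(iter(components.values())))
    return [
        sum(weights[name] * z[name][i] for name in components)
        for i in range(n_cases)
    ]


# Hypothetical socioeconomic index for four people.
components = {
    "education_years": [12, 16, 18, 10],
    "income_thousands": [30, 55, 80, 25],
    "occupational_prestige": [40, 60, 75, 35],
}
weights = {"education_years": 0.4, "income_thousands": 0.4, "occupational_prestige": 0.2}
scores = index_scores(components, weights)
```

Unlike the items of a scale, index components need not correlate with one another, so validation usually rests on how the index relates to outside criteria and not on internal consistency.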

Methods assignment #1

Next week
Surveys
Chapter
Articles on data cleaning
Short articles on survey design
ESM presentation
Scale development assignment due
Decision about Feb. 28