Validity
– Does the test measure what it says it does?
– Is the test useful?
– Can a test be reliable, but not valid?
– Can a test be valid, but not reliable?

Types of validity
Face validity
– Important only insofar as it doesn’t interfere with an examinee’s willingness to cooperate.
Content validity
– How well does the test cover areas of content that it should?
– How adequately does it sample the universe of behavior it was designed to assess?

Content validity (cont.)
Panel of “experts”
– Is the item/content essential?
– Lawshe (1975): more than 50% of experts see the skill as essential
Important for:
– Achievement/classroom tests
– Training program exams
– Professional exams
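The "more than 50% of experts" rule can be expressed with Lawshe's content validity ratio, CVR = (n_e - N/2) / (N/2), where n_e is the number of panelists rating an item essential and N is the panel size. A minimal sketch follows; the item names and counts are hypothetical, not data from the slides.

```python
def content_validity_ratio(n_essential: int, n_experts: int) -> float:
    """Lawshe's CVR: ranges from -1 (nobody says essential) to +1 (everybody does)."""
    if n_experts <= 0:
        raise ValueError("panel must have at least one expert")
    if not 0 <= n_essential <= n_experts:
        raise ValueError("n_essential must be between 0 and n_experts")
    half = n_experts / 2
    return (n_essential - half) / half


# Hypothetical panel of 10 experts: how many rated each item "essential"
essential_votes = {"item_1": 9, "item_2": 5, "item_3": 3}

for item, votes in essential_votes.items():
    cvr = content_validity_ratio(votes, 10)
    # CVR > 0 means more than half of the panel rated the item essential
    print(item, round(cvr, 2), "keep" if cvr > 0 else "review")
```

A positive CVR corresponds exactly to the ">50%" criterion on the slide; in practice a stricter minimum CVR that depends on panel size is often applied.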

Criterion-Related Validity
How well does a test score relate to another score/variable of interest?
– Correlate test with criterion
– Criterion: the standard against which the test is evaluated
Two types:
– Concurrent
– Predictive
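"Correlate test with criterion" usually means computing a Pearson correlation, often called the validity coefficient. The sketch below is illustrative only; the test scores and criterion values (e.g., GPA) are made up.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length sequences of numbers."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable has no variance")
    return sxy / sqrt(sxx * syy)


# Hypothetical data: test scores and a criterion measured on the same people
test_scores = [52, 61, 70, 48, 85, 77]
criterion = [2.1, 2.8, 3.0, 2.0, 3.9, 3.4]

validity_coefficient = pearson_r(test_scores, criterion)
print(round(validity_coefficient, 2))
```

Whether the result counts as concurrent or predictive evidence depends on when the criterion was collected, not on the computation.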

Criterion-Related Validity (cont.)
Criterion should be
– Reliable: reliability limits validity; a measure can’t be valid if it is not reliable.
– Relevant
– Valid
– Uncontaminated: contamination occurs when the criterion measure has been based in part on the predictor measure
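"Reliability limits validity" has a standard form in classical test theory: the observed validity coefficient cannot exceed the square root of the product of the test's and the criterion's reliabilities. A small sketch, with hypothetical reliability values:

```python
from math import sqrt


def max_validity(test_reliability: float, criterion_reliability: float) -> float:
    """Upper bound on the test-criterion correlation under classical test theory."""
    for r in (test_reliability, criterion_reliability):
        if not 0.0 <= r <= 1.0:
            raise ValueError("reliabilities must lie in [0, 1]")
    return sqrt(test_reliability * criterion_reliability)


# Hypothetical reliabilities: test = 0.81, criterion = 0.64
ceiling = max_validity(0.81, 0.64)
print(round(ceiling, 2))  # 0.72
```

This is why an unreliable criterion drags down the validity coefficient even when the test itself is sound.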

Criterion-Related Validity (cont.)
Concurrent validity
– Criterion immediately available
– Present standing on a criterion (e.g., diagnosis, score on another test)
– Used to predict the performance of new test takers or of people for whom the criterion isn’t available.

Criterion-Related Validity (cont.)
Predictive validity
– Test given, criterion measured later
– Ex.: ACT & college GPA; employment test & job performance
Incremental validity
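Incremental validity is commonly quantified as the gain in R² when a new predictor is added to one already in use. With two predictors, R² can be computed from the three pairwise correlations. The sketch below assumes that framing; all correlation values are hypothetical.

```python
def r_squared_two_predictors(r_y1: float, r_y2: float, r_12: float) -> float:
    """R^2 for criterion y regressed on predictors 1 and 2, from correlations."""
    if abs(r_12) >= 1.0:
        raise ValueError("predictors must not be perfectly correlated")
    return (r_y1 ** 2 + r_y2 ** 2 - 2 * r_y1 * r_y2 * r_12) / (1 - r_12 ** 2)


def incremental_validity(r_y1: float, r_y2: float, r_12: float) -> float:
    """Gain in R^2 from adding predictor 2 to a model that already has predictor 1."""
    return r_squared_two_predictors(r_y1, r_y2, r_12) - r_y1 ** 2


# Hypothetical values: existing test correlates .50 with the criterion,
# new test correlates .40, and the two tests correlate .30 with each other
gain = incremental_validity(0.50, 0.40, 0.30)
print(round(gain, 3))
```

The more the new test overlaps with the existing predictor (larger r_12), the less it usually adds, even if its own correlation with the criterion is respectable.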

Base Rate & Decision Theory
Base rate: proportion of the population who possess a certain trait, characteristic, or attribute
– % of EIU undergrads who graduate
– % of African Americans with sickle cell anemia
Base rate affects the usefulness of tests
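One way to see why base rate affects usefulness is to compute, via Bayes' rule, the chance that a person flagged by the test actually has the attribute. The sketch below is an illustration under assumed accuracy figures (sensitivity and specificity of 0.90), which do not come from the slides.

```python
def positive_predictive_value(base_rate: float, sensitivity: float,
                              specificity: float) -> float:
    """P(has attribute | test says yes), computed with Bayes' rule."""
    true_positives = sensitivity * base_rate
    false_positives = (1 - specificity) * (1 - base_rate)
    flagged = true_positives + false_positives
    if flagged == 0:
        raise ValueError("the test never flags anyone under these inputs")
    return true_positives / flagged


# Same hypothetical test, two different base rates
for base_rate in (0.50, 0.05):
    ppv = positive_predictive_value(base_rate, sensitivity=0.90, specificity=0.90)
    print(base_rate, round(ppv, 2))
```

With a 50% base rate about 9 in 10 flagged people truly have the attribute; with a 5% base rate the same test is right for only about a third of those it flags.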

Decision Theory
4 outcomes:
– False rejections (false negatives)
– Valid acceptances (valid positives)
– Valid rejections (valid negatives)
– False acceptances (false positives)

Cut scores & Hit rates
[Figure: cut score dividing test takers into four outcomes]
– False rejections (false negatives)
– Valid acceptances (valid positives)
– Valid rejections (valid negatives)
– False acceptances (false positives)
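The four outcomes and the hit rate can be tallied directly once a cut score is chosen. The following sketch assumes the usual definition of hit rate (proportion of correct decisions); the scores and success flags are invented for illustration.

```python
from collections import Counter


def classify(score: float, succeeded: bool, cut: float) -> str:
    """Label one person's outcome given a cut score and later success on the criterion."""
    accepted = score >= cut
    if accepted:
        return "valid acceptance" if succeeded else "false acceptance"
    return "false rejection" if succeeded else "valid rejection"


def outcome_counts(people, cut: float) -> Counter:
    """Count the four decision outcomes for (score, succeeded) pairs."""
    return Counter(classify(score, ok, cut) for score, ok in people)


def hit_rate(counts: Counter) -> float:
    """Proportion of decisions that were correct (valid acceptances + valid rejections)."""
    total = sum(counts.values())
    return (counts["valid acceptance"] + counts["valid rejection"]) / total


# Hypothetical (test score, succeeded on criterion) pairs
people = [(35, False), (42, False), (44, True), (48, True),
          (51, False), (55, True), (63, True), (70, True)]

counts = outcome_counts(people, cut=50)
print(dict(counts))
print(hit_rate(counts))  # 0.625
```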

Cut scores & Hit rates (cont.)
Reciprocal relationship between the number of false rejections and the number of false acceptances
Which is more acceptable: to limit the number accepted who shouldn’t be, or to minimize the number rejected who could be successful?
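The trade-off can be seen by sweeping the cut score over the same group of people: raising it removes false acceptances but adds false rejections. A minimal sketch with invented data:

```python
def error_counts(people, cut: float):
    """Return (false_rejections, false_acceptances) for a given cut score."""
    false_rejections = sum(1 for score, ok in people if score < cut and ok)
    false_acceptances = sum(1 for score, ok in people if score >= cut and not ok)
    return false_rejections, false_acceptances


# Hypothetical (test score, succeeded on criterion) pairs
people = [(35, False), (42, False), (44, True), (48, True),
          (51, False), (55, True), (63, True), (70, True)]

# One row per candidate cut score: (cut, false rejections, false acceptances)
table = [(cut, *error_counts(people, cut)) for cut in (40, 45, 50, 55, 60)]
for row in table:
    print(row)
```

Which row is "best" is a judgment about costs: a strict cut suits settings where a false acceptance is expensive, a lenient one suits settings where turning away capable people is the bigger loss.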

Construct Validity
Construct:
– Scientific idea hypothesized to explain behavior
– Postulated attribute of people, assumed to be reflected in test score
– Ex.: intelligence, self-esteem, motivation
Construct validity: Does the test measure the construct?
– Gives theoretical meaning to scores
– Subsumes all other types of validity

Construct Validity (cont.)
Convergent evidence/validity
Divergent/discriminant evidence
Factor analysis
– Data reduction/simplification of complex correlational matrices … to reveal major dimensions that underlie a set of items
– A factor is considered to be the construct that best represents relationships among variables

Factor Analysis (cont.)
Methods of factor analysis
– Exploratory
1. Correlation matrix
2. Factor matrix with loadings
3. Label factors
Used to develop or eliminate items or scales from composite scores
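Steps 1 and 2 can be illustrated in miniature: start from a correlation matrix and extract loadings on the first factor. The sketch below uses power iteration to get the first principal component, which is a simplification; real exploratory factor analysis typically estimates common factors, extracts several of them, and rotates the solution. The correlation matrix is hypothetical.

```python
from math import sqrt


def first_factor_loadings(corr, iterations: int = 200):
    """Loadings on the first principal component of a correlation matrix.

    Uses power iteration; loading_i = eigenvector_i * sqrt(largest eigenvalue).
    """
    n = len(corr)
    vector = [1.0 / sqrt(n)] * n  # unit-length starting vector
    eigenvalue = 0.0
    for _ in range(iterations):
        product = [sum(corr[i][j] * vector[j] for j in range(n)) for i in range(n)]
        eigenvalue = sqrt(sum(value * value for value in product))
        vector = [value / eigenvalue for value in product]
    loadings = [value * sqrt(eigenvalue) for value in vector]
    return loadings, eigenvalue


# Step 1: hypothetical correlation matrix for four items
corr = [
    [1.0, 0.7, 0.2, 0.2],
    [0.7, 1.0, 0.2, 0.2],
    [0.2, 0.2, 1.0, 0.6],
    [0.2, 0.2, 0.6, 1.0],
]

# Step 2: loadings of each item on the first extracted component
loadings, eigenvalue = first_factor_loadings(corr)
print([round(value, 2) for value in loadings], round(eigenvalue, 2))
```

Step 3, labeling, remains a judgment call: the analyst inspects which items load together and names the factor accordingly.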

Factor Analysis (cont.)
Confirmatory factor analysis
– Goodness of fit
– After test has been developed

Validity & Bias
Bias: a factor inherent within a test that systematically prevents accurate, impartial measurement
– Bias implies systematic, not random, variation
Can you make equally valid predictions for different groups?

Bias in Predictions
Questions of regression
– Slope
– Intercept
– Error of estimate
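These three quantities can be compared by fitting a separate regression line for each group. The sketch below uses ordinary least squares with made-up data in which the two groups share a slope but differ in intercept, so a single common line would systematically mispredict one group.

```python
from math import sqrt


def fit_line(x, y):
    """Least-squares fit of y on x: returns (slope, intercept, standard error of estimate)."""
    n = len(x)
    if n != len(y) or n < 3:
        raise ValueError("need equal-length sequences with at least 3 points")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    residual_ss = sum((b - (intercept + slope * a)) ** 2 for a, b in zip(x, y))
    error_of_estimate = sqrt(residual_ss / (n - 2))
    return slope, intercept, error_of_estimate


# Hypothetical test scores (x) and criterion scores (y) for two groups
scores = [1, 2, 3, 4, 5]
group_a = [2.1, 2.9, 4.2, 4.8, 6.0]
group_b = [1.1, 1.9, 3.2, 3.8, 5.0]  # same pattern, one point lower throughout

slope_a, intercept_a, see_a = fit_line(scores, group_a)
slope_b, intercept_b, see_b = fit_line(scores, group_b)
print(round(slope_a, 2), round(intercept_a, 2), round(see_a, 2))
print(round(slope_b, 2), round(intercept_b, 2), round(see_b, 2))
```

Equal slopes with unequal intercepts correspond to intercept bias; unequal slopes would indicate slope bias; a larger error of estimate in one group means predictions are less precise for that group.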

Slope Bias

Intercept Bias

Rating error
– Leniency error
– Severity error
– Central tendency error
– Halo effect

Test Fairness
Is the test used in an impartial, just, and equitable manner?
Good tests discriminate among individuals
– Are group differences due to inadequate tests?
– Is the test being used fairly?