
Lab 6 Validity

Picking a Topic for Your Paper
Were you able to come up with 3 ideas?
Let’s chat about some of the ideas to make sure we’re all on the same page about paper topics.

Outline
Content validity
Criterion validity
– Restriction of range
Multitrait-multimethod matrix
Homework #5

Content Validity
Do the items adequately represent all the domains of the construct?
Measure contamination: the measure includes items that are irrelevant to the construct of interest.
Measure deficiency: the measure fails to include items relevant to all aspects of the construct.

A well-defined construct
Let’s say that the construct of narcissism consists of:
– “a grandiose, yet fragile, sense of self and entitlement”
– “a preoccupation with success”
– “demands of admiration”
Ames, D. R., Rose, P., & Anderson, C. P. (2006). The NPI-16 as a short measure of narcissism. Journal of Research in Personality, 40,

Content valid items?
1. I am an extraordinary person.
2. I insist upon getting the respect that is due me.
3. It's fun to see how far you can push people before they get upset.
4. I often act without thinking through all the alternatives.

Criterion Validity
Criterion-related evidence examines correlations between the measure of interest and outcomes or behaviors (criteria) that should be predictable from the measure.
The strength of the observed correlation is used to support the validity of the measure.

Criterion Validity
For example, a person’s Agreeableness ought to be related (negatively) to counterproductive school behaviors.
Agreeableness measures interpersonal orientation. Higher scorers are seen as helpful, trusting, and kind.
We have such data in the Big Survey! What is this validity coefficient?

Agreeableness Items
I feel little concern for others. (Reverse)
I sympathize with others' feelings.
I am not interested in other people's problems. (Reverse)
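Items marked (Reverse) have to be flipped before they are averaged into a scale score, so that a high number always means high Agreeableness. The sketch below assumes a 1 to 5 response scale and uses made-up responses and item names; neither comes from the Big Survey.

```python
def reverse_score(response, scale_min=1, scale_max=5):
    """Flip a Likert response so that high values mean high Agreeableness."""
    return scale_min + scale_max - response


# Hypothetical responses from one participant (1 = disagree, 5 = agree).
responses = {
    "little_concern": 2,   # "I feel little concern for others." (Reverse)
    "sympathize": 4,       # "I sympathize with others' feelings."
    "not_interested": 1,   # "I am not interested in other people's problems." (Reverse)
}
reverse_keyed = {"little_concern", "not_interested"}

# Flip only the reverse-keyed items, then average into a scale score.
scored = {
    item: reverse_score(value) if item in reverse_keyed else value
    for item, value in responses.items()
}
agreeableness = sum(scored.values()) / len(scored)

print(scored)
print(round(agreeableness, 2))
```

With these made-up responses, the two reverse-keyed items become 4 and 5, and the scale score is the mean of 4, 4, and 5.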

Counterproductive School Behaviors – Interpersonal Domain
Made fun of someone at school.
Cursed at someone at school.
Publicly embarrassed someone at school.
Made a derogatory ethnic, religious, or racial remark at school.

Criterion Validity
Examine the distributions of Agreeableness scores (IPIPA) and Counterproductive School Behaviors (CSBInter).
Calculate the means, medians, and SDs for these variables.
Look at the scatterplot.
Calculate the correlation coefficient (IPIPA with CSBInter). This is the validity coefficient.
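The steps above can be sketched in a few lines of Python. The scores here are invented purely for illustration (they are not the Big Survey data), and `pearson_r` is a small helper written for this example, not a function from any course software.

```python
import math
import statistics


def pearson_r(x, y):
    """Pearson correlation between two equally long lists of scores."""
    mean_x, mean_y = statistics.mean(x), statistics.mean(y)
    sum_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sum_xx = sum((a - mean_x) ** 2 for a in x)
    sum_yy = sum((b - mean_y) ** 2 for b in y)
    return sum_xy / math.sqrt(sum_xx * sum_yy)


# Made-up scale scores for eight hypothetical students.
ipipa = [4.2, 3.8, 2.5, 4.6, 3.1, 3.9, 2.8, 4.4]     # Agreeableness
csbinter = [1.2, 1.5, 2.8, 1.0, 2.1, 1.4, 2.6, 1.1]  # Counterproductive behaviors

# Descriptive statistics: mean, median, and sample SD for each variable.
for name, scores in (("IPIPA", ipipa), ("CSBInter", csbinter)):
    print(
        name,
        round(statistics.mean(scores), 2),
        round(statistics.median(scores), 2),
        round(statistics.stdev(scores), 2),
    )

# The correlation between predictor and criterion is the validity coefficient.
validity_coefficient = pearson_r(ipipa, csbinter)
print(round(validity_coefficient, 2))
```

A negative coefficient is what the theory predicts here: students who score higher on Agreeableness should report fewer counterproductive interpersonal behaviors.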

Restriction of Range
What happens when the variability of the predictor or the criterion is limited?
For example, the Graduate Record Exam (GRE) is designed to predict success in graduate school.
What happens when we look for evidence of criterion validity of the GRE?

Restriction of Range
How does restriction of range affect criterion validity estimates?
Restrict the sample on Agreeableness (0.5 SD above the mean or higher) and recalculate the descriptive statistics, scatterplot, and validity coefficient.
What happened?
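A small simulation can show what to expect from this exercise. The sketch below assumes a population correlation of -0.40 between Agreeableness and counterproductive behaviors and standard normal scores; both are assumptions made for illustration, not values from the Big Survey.

```python
import math
import random
import statistics


def pearson_r(x, y):
    """Pearson correlation between two equally long lists of scores."""
    mean_x, mean_y = statistics.mean(x), statistics.mean(y)
    sum_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sum_xx = sum((a - mean_x) ** 2 for a in x)
    sum_yy = sum((b - mean_y) ** 2 for b in y)
    return sum_xy / math.sqrt(sum_xx * sum_yy)


random.seed(427)   # fixed seed so the run is repeatable
n = 5000
true_r = -0.40     # assumed population correlation (illustrative only)

# Simulate standardized predictor and criterion scores with correlation true_r.
agree = [random.gauss(0, 1) for _ in range(n)]
csb = [true_r * a + math.sqrt(1 - true_r ** 2) * random.gauss(0, 1) for a in agree]

# Keep only cases at least 0.5 SD above the mean on Agreeableness.
cutoff = statistics.mean(agree) + 0.5 * statistics.stdev(agree)
kept = [(a, c) for a, c in zip(agree, csb) if a >= cutoff]
agree_restricted = [a for a, _ in kept]
csb_restricted = [c for _, c in kept]

r_full = pearson_r(agree, csb)
r_restricted = pearson_r(agree_restricted, csb_restricted)

print("SD full:", round(statistics.stdev(agree), 2))
print("SD restricted:", round(statistics.stdev(agree_restricted), 2))
print("r full:", round(r_full, 2))
print("r restricted:", round(r_restricted, 2))
```

Selecting only high scorers shrinks the predictor's standard deviation, and the validity coefficient in the restricted group moves toward zero even though the underlying relationship has not changed.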

Convergent Validity
Convergent validity: the measure is related to other measures of the same construct.
– Different types of thermometers all give similar estimates of air temperature.

Discriminant Validity
Discriminant validity: the measure is unrelated to measures of different constructs in theoretically predicted ways.
– Measures of barometric pressure are not related to measures of temperature.

Multitrait Multimethod Matrix
The multitrait-multimethod (MTMM) matrix is a table of correlation coefficients that enables us to simultaneously evaluate the convergent and discriminant validity of a construct.
If our measures are valid, we should see a pattern emerge among the correlations of our measures. (Hoyle, Harris, & Judd, 2002)

Multitrait Multimethod Matrix
Assume you measure three constructs (or traits) using three different methods.
– Traits: Anger, Guilt, Depression
– Methods: Test, Clinical Rating, Self-report

                         Psychological Test    Clinical Rating       Self Rating
                         A1     G1     D1      A2     G2     D2      A3     G3     D3
Psychological Test  A1   1.0
                    G1          1.0
                    D1                 1.0
Clinical Rating     A2                         1.0
                    G2                                1.0
                    D2                                       1.0
Self Rating         A3                                               1.0
                    G3                                                      1.0
                    D3                                                             1.0

Multitrait Multimethod Matrix
The main diagonal is the correlation of each variable with itself and is by definition 1.0.
It is common practice to replace the main diagonal entries with estimates of reliability, such as alpha or other estimates as appropriate.
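As one example of a reliability estimate that could go on the main diagonal, coefficient alpha can be computed from item-level data. The sketch below uses the standard formula, alpha = k/(k-1) × (1 − sum of item variances / variance of total scores), with made-up responses to three hypothetical items.

```python
import statistics


def cronbach_alpha(items):
    """Coefficient alpha.

    items: one list of scores per item, with respondents in the same order.
    """
    k = len(items)
    item_variances = [statistics.variance(scores) for scores in items]
    # Total score per respondent: sum across items.
    totals = [sum(person) for person in zip(*items)]
    total_variance = statistics.variance(totals)
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Made-up responses from six people to three items of one scale.
items = [
    [4, 3, 5, 2, 4, 1],
    [5, 3, 4, 2, 4, 2],
    [4, 2, 5, 1, 3, 2],
]

alpha = cronbach_alpha(items)
print(round(alpha, 2))
```

An alpha computed this way for each trait-method combination would replace the corresponding 1.0 on the main diagonal.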

Multitrait Multimethod Matrix
The blue diagonals are heteromethod-monotrait correlations (trait effects).
These are the convergent validity diagonals: different measures of the same construct should converge and provide the same measurement.
They should be high (~0.5).

Multitrait Multimethod Matrix
The green blocks are monomethod-heterotrait correlations (method effects).
Discriminant validity: measures of different traits should not be highly correlated.
They should be low (~0.2).

Multitrait Multimethod Matrix
The purple blocks are heteromethod-heterotrait correlations (nonsense correlations).
They should be very close to zero.
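The three kinds of off-diagonal entries can be told apart from the variable labels alone. The sketch below assumes the labeling used in the matrix (a trait letter A, G, or D followed by a method number 1, 2, or 3); the `classify` helper is written for this example only.

```python
from itertools import combinations

TRAITS = "AGD"    # Anger, Guilt, Depression
METHODS = "123"   # 1 = Psychological Test, 2 = Clinical Rating, 3 = Self Rating


def classify(var_a, var_b):
    """Name the kind of correlation between two labels such as 'A1' and 'G2'."""
    same_trait = var_a[0] == var_b[0]
    same_method = var_a[1] == var_b[1]
    if same_trait and same_method:
        return "reliability (main diagonal)"
    if same_trait:
        return "convergent validity (heteromethod-monotrait)"
    if same_method:
        return "method effect (monomethod-heterotrait)"
    return "nonsense correlation (heteromethod-heterotrait)"


# Build the nine variables in matrix order: A1, G1, D1, A2, ..., D3.
variables = [trait + method for method in METHODS for trait in TRAITS]

# Count how many cells of each kind appear below the main diagonal.
counts = {}
for var_a, var_b in combinations(variables, 2):
    kind = classify(var_a, var_b)
    counts[kind] = counts.get(kind, 0) + 1

print(classify("A1", "A2"))
print(classify("A1", "G1"))
print(classify("A1", "G2"))
print(counts)
```

For three traits and three methods, the 36 off-diagonal cells split into 9 convergent validities, 9 method-effect correlations, and 18 nonsense correlations.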

Legend: reliabilities (in parentheses), convergent validities, method effects, nonsense validities

                         Psychological Test    Clinical Rating       Self Rating
                         A1     G1     D1      A2     G2     D2      A3     G3     D3
Psychological Test  A1   (.91)
                    G1   .21    (.93)
                    D1                 (.90)
Clinical Rating     A2                         (.96)
                    G2                                (.93)
                    D2                                       (.89)
Self Rating         A3                                               (.92)
                    G3                                                      (.91)
                    D3                                                             (.93)

[The remaining off-diagonal correlations were not preserved in this transcript.]

MTMM for Homework
Published in Campbell & Fiske (1959), from this original paper:
Carroll, J. B. (1952). Ratings on traits measured by a factored personality inventory. Journal of Abnormal and Social Psychology, 47,

What you need to turn in for HW #5
1. Report descriptive statistics (mean, median, and standard deviation) for Agreeableness and Counterproductive Behaviors. Calculate the correlation between these variables. Interpret this correlation in terms of the criterion validity of the Agreeableness measure. (2 points)
2. Restrict scores on Agreeableness to 0.5 SD above the mean and higher. Find the correlation between Agreeableness and Counterproductive Behaviors. What happened and why? (2 points)
3. Examine the attached MTMM. Identify the 3 convergent validity correlations for each of the 3 traits. Make a table of those correlations. (3 points)
4. Identify the patterns of discriminant validity for self-ratings and the psychological test. Make a table for both sets. Which set appears to have a higher degree of discriminant validity? (3 points)

Legend: reliabilities (in parentheses), convergent validities, discriminant validities

                         Psychological Test    Clinical Rating       Self Rating
                         A1     G1     D1      A2     G2     D2      A3     G3     D3
Psychological Test  A1   (.91)
                    G1          (.93)
                    D1                 (.90)
Clinical Rating     A2                         (.96)
                    G2                                (.93)
                    D2                                       (.89)
Self Rating         A3                                               (.92)
                    G3                                                      (.91)
                    D3                                                             (.93)

[The off-diagonal correlations were not preserved in this transcript.]