MEQ Analysis. Outline Validity Validity Reliability Reliability Difficulty Index Difficulty Index Power of Discrimination Power of Discrimination.

Slides:

Advertisements

Similar presentations

Questionnaire Development

Advertisements

Topics: Quality of Measurements

Reliability and Validity checks S-005. Checking on reliability of the data we collect  Compare over time (test-retest)  Item analysis  Internal consistency.

Types of Reliability.

MEASUREMENT CONCEPTS © 2012 The McGraw-Hill Companies, Inc.

© 2006 The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill Validity and Reliability Chapter Eight.

Psychometrics William P. Wattles, Ph.D. Francis Marion University.

Chapter 4 – Reliability Observed Scores and True Scores Error

Assessment Procedures for Counselors and Helping Professionals, 7e © 2010 Pearson Education, Inc. All rights reserved. Chapter 5 Reliability.

VALIDITY AND RELIABILITY

Part II Sigma Freud & Descriptive Statistics

Part II Sigma Freud & Descriptive Statistics

Methods for Estimating Reliability

Measurement. Scales of Measurement Stanley S. Stevens’ Five Criteria for Four Scales Nominal Scales –1. numbers are assigned to objects according to rules.

Reliability and Validity of Research Instruments

Reliability n Consistent n Dependable n Replicable n Stable.

Reliability Analysis. Overview of Reliability What is Reliability? Ways to Measure Reliability Interpreting Test-Retest and Parallel Forms Measuring and.

Definition & Measurement “measurement is the beginning of science, … until you can measure something, your knowledge is meager and unsatisfactory” Lord.

Reliability n Consistent n Dependable n Replicable n Stable.

Lecture 7 Psyc 300A. Measurement Operational definitions should accurately reflect underlying variables and constructs When scores are influenced by other.

Conny’s Office Hours will now be by APPOINTMENT ONLY. Please her at if you would like to meet with.

Measurement: Reliability and Validity For a measure to be useful, it must be both reliable and valid Reliable = consistent in producing the same results.

Research Methods in MIS

Classroom Assessment A Practical Guide for Educators by Craig A

Reliability and Validity. Criteria of Measurement Quality How do we judge the relative success (or failure) in measuring various concepts? How do we judge.

Classical Test Theory By ____________________. What is CCT?

Test Validity S-005. Validity of measurement Reliability refers to consistency –Are we getting something stable over time? –Internally consistent? Validity.

Psychometrics Timothy A. Steenbergh and Christopher J. Devers Indiana Wesleyan University.

Technical Issues Two concerns Validity Reliability

Validity and Reliability

Validity and Reliability of Research and the Instruments

Reliability and Validity what is measured and how well.

Instrumentation.

Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.

MEASUREMENT CHARACTERISTICS Error & Confidence Reliability, Validity, & Usability.

Data Analysis. Quantitative data: Reliability & Validity Reliability: the degree of consistency with which it measures the attribute it is supposed to.

Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 14 Measurement and Data Quality.

Unanswered Questions in Typical Literature Review 1. Thoroughness – How thorough was the literature search? – Did it include a computer search and a hand.

LECTURE 06B BEGINS HERE THIS IS WHERE MATERIAL FOR EXAM 3 BEGINS.

Chapter Five Measurement Concepts. Terms Reliability True Score Measurement Error.

1 Chapter 4 – Reliability 1. Observed Scores and True Scores 2. Error 3. How We Deal with Sources of Error: A. Domain sampling – test items B. Time sampling.

Tests and Measurements Intersession 2006.

Assessing Learners with Special Needs: An Applied Approach, 6e © 2009 Pearson Education, Inc. All rights reserved. Chapter 4:Reliability and Validity.

Research methods in clinical psychology: An introduction for students and practitioners Chris Barker, Nancy Pistrang, and Robert Elliott CHAPTER 4 Foundations.

Chapter 8 Validity and Reliability. Validity How well can you defend the measure? –Face V –Content V –Criterion-related V –Construct V.

1 Measurement and Data Collection  What and How?  Types of Scales Nominal Nominal Ordinal Ordinal Interval Interval Ratio Ratio.

Validity and Reliability Neither Valid nor Reliable Reliable but not Valid Valid & Reliable Fairly Valid but not very Reliable Think in terms of ‘the purpose.

Validity Validity: A generic term used to define the degree to which the test measures what it claims to measure.

Evaluating Survey Items and Scales Bonnie L. Halpern-Felsher, Ph.D. Professor University of California, San Francisco.

Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Assessing Measurement Quality in Quantitative Studies.

Reliability: The degree to which a measurement can be successfully repeated.

Reliability n Consistent n Dependable n Replicable n Stable.

MEASUREMENT: PART 1. Overview  Background  Scales of Measurement  Reliability  Validity (next time)

Reliability and Validity Themes in Psychology. Reliability Reliability of measurement instrument: the extent to which it gives consistent measurements.

Measurement Experiment - effect of IV on DV. Independent Variable (2 or more levels) MANIPULATED a) situational - features in the environment b) task.

Reliability a measure is reliable if it gives the same information every time it is used. reliability is assessed by a number – typically a correlation.

Dr. Jeffrey Oescher 27 January 2014 Technical Issues  Two technical issues  Validity  Reliability.

1 Measurement Error All systematic effects acting to bias recorded results: -- Unclear Questions -- Ambiguous Questions -- Unclear Instructions -- Socially-acceptable.

Professor Jim Tognolini

Ch. 5 Measurement Concepts.

Lecture 5 Validity and Reliability

Product Reliability Measuring

Tests and Measurements: Reliability

Classical Test Theory Margaret Wu.

Journalism 614: Reliability and Validity

مركز مطالعات و توسعه آموزش دانشگاه علوم پزشكي كرمان

By ____________________

The first test of validity

Presentation transcript:

MEQ Analysis

Outline Validity Validity Reliability Reliability Difficulty Index Difficulty Index Power of Discrimination Power of Discrimination

Validity “evidence present to support or refute the meaning assigned to assessment results” face validity face validity content validity content validity criterion-related validity criterion-related validity construct validity construct validity

Face Validity high face validity high face validity  seemed valid (only)  ~ person knee jerk & nervous system knee jerk & nervous system  for doctor => high face validity  for lay people => low face validity MEQ => high face validity MEQ => high face validity

Content Validity ~ sample & population ~ sample & population MCQ => high content validity MCQ => high content validity MEQ => low content validity MEQ => low content validity

Validity “evidence present to support or refute the meaning assigned to assessment results” face validity face validity content validity content validity criterion-related validity criterion-related validity construct validity construct validity - don’t need the score - before using the test - score needed - after using the test

Criterion-Related Validity Predictive validity Predictive validity  MEQ score & close observation score of real practice at ER Concurrent Validity Concurrent Validity  MEQ score & VIVA score in the same topic Statistic = correlation coefficient Statistic = correlation coefficient

Construct Validity based on theory based on theory  communication skill ~ leadership correlation OSCE : communication VS questionnaire : leadership correlation OSCE : communication VS questionnaire : leadership  good ethics ~ beloved doctor MEQ : medical ethics VS questionnaire : beloved doctor? MEQ : medical ethics VS questionnaire : beloved doctor?

Reliability stability stability internal consistency internal consistency equivalent equivalent

Stability test-retest reliability test-retest reliability parallel form reliability parallel form reliability intra-rater reliability intra-rater reliability  ~ scoring key statistics : correlation coeff. statistics : correlation coeff.  0-1

Internal Consistency [homogeneity] [homogeneity] item - item correlation item - item correlation item - total correlation item - total correlation split half correlation split half correlation

Item-Item Correlation each item each item  Dichotomous Phi Correlation Phi Correlation  Interval Pearson’s Product Moment Correlation Pearson’s Product Moment Correlation whole test whole test  Mean of...

Item-Total Correlation each item each item  Dichotomous Point Biserial Correlation Point Biserial Correlation  Interval Pearson’s Product Moment Correlation Pearson’s Product Moment Correlation whole test whole test  Mean of...

Spilt Half Reliability Dichotomous Dichotomous  Kuder-Richardson 20 (KR 20) Interval Interval  Kuder-Richardson 21 (KR 21)  Cronbach’s alpha coefficient

Equivalent parallel item on alternate form reliability parallel item on alternate form reliability inter-rater reliability inter-rater reliability  agreement  kappa

Difficulty Index [p] [p] (mean H + mean L)/2(full score) (mean H + mean L)/2(full score) p = 1 => very easy p = 1 => very easy p = 0 => very difficult p = 0 => very difficult expecting p = expecting p =  must => 0.7  should => 0.5

Power of Discrimination [r] [r] (mean H - mean L)/(full score) (mean H - mean L)/(full score) > 0.40 => very good item > 0.40 => very good item => good item => good item => borderline => borderline poor item poor item

Conclusion Validity Validity Reliability Reliability Difficulty Index Difficulty Index Power of Discrimination Power of Discrimination