Examining Psychometric Properties of the Kutcher Adolescent Depression Scale (KADS-11) Using Multidimensional Item Response Theory (MIRT) M. Shojaee, O.

Slides:



Advertisements
Similar presentations
Measurement Concepts Operational Definition: is the definition of a variable in terms of the actual procedures used by the researcher to measure and/or.
Advertisements

Item Response Theory in the Secondary Classroom: What Rasch Modeling Can Reveal About Teachers, Students, and Tests. T. Jared Robinson tjaredrobinson.com.
Some (Simplified) Steps for Creating a Personality Questionnaire Generate an item pool Administer the items to a sample of people Assess the uni-dimensionality.
Part II Sigma Freud & Descriptive Statistics
Item Response Theory in Health Measurement
The Narcissism of Personal Uniqueness: A New Measure of the Vulnerability Phenotype Paul C. Stey, Daniel K. Lapsley, & Ashleigh A. Renteria Available at:
ITEM RESPONSE MODELING OF PRESENCE-SEVERITY ITEMS: APPLICATION TO MEASUREMENT OF PATIENT-REPORTED OUTCOMES Ying Liu and Jay Verkuilen.
Part II Knowing How to Assess Chapter 5 Minimizing Error p115 Review of Appl 644 – Measurement Theory – Reliability – Validity Assessment is broader term.
Patricia C. Post, Psy.D., Licensed Psychologist
Concept of Measurement
When Measurement Models and Factor Models Conflict: Maximizing Internal Consistency James M. Graham, Ph.D. Western Washington University ABSTRACT: The.
Item Response Theory. Shortcomings of Classical True Score Model Sample dependence Limitation to the specific test situation. Dependence on the parallel.
In the name of Allah. Development and psychometric Testing of a new Instrument to Measure Affecting Factors on Women’s Behaviors to Breast Cancer Prevention:
Education 795 Class Notes Factor Analysis II Note set 7.
Classical Test Theory By ____________________. What is CCT?
Item Analysis: Classical and Beyond SCROLLA Symposium Measurement Theory and Item Analysis Modified for EPE/EDP 711 by Kelly Bradley on January 8, 2013.
Multivariate Methods EPSY 5245 Michael C. Rodriguez.
Measurement and Data Quality
DIFFERENCES BETWEEN WOMEN IN SHORT- AND LONG-TERM RELATIONSHIPS IN CUES FOR SEXUAL DESIRE Ana Carvalheira, PhD 1, Lori Brotto, Ph.D 2 & Isabel Leal, PhD.
Reliability and factorial structure of a Portuguese version of the Children’s Hope Scale José Tomás da Silva Maria Paula Paixão Catarina Carvalho dos Santos.
Educational level relative to gender (Ν=28)
Instrumentation.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 14 Measurement and Data Quality.
LECTURE 06B BEGINS HERE THIS IS WHERE MATERIAL FOR EXAM 3 BEGINS.
The emotional distress of children with cancer in China: An Item Response Analysis of C-Ped-PROMIS Anxiety and Depression Short Forms Yanyan Liu 1, Changrong.
Introduction Neuropsychological Symptoms Scale The Neuropsychological Symptoms Scale (NSS; Dean, 2010) was designed for use in the clinical interview to.
The ABC’s of Pattern Scoring Dr. Cornelia Orr. Slide 2 Vocabulary Measurement – Psychometrics is a type of measurement Classical test theory Item Response.
Measuring Mathematical Knowledge for Teaching: Measurement and Modeling Issues in Constructing and Using Teacher Assessments DeAnn Huinker, Daniel A. Sass,
Purpose The present study examined the psychometric properties of the SCARED in order to begin establishing an evidence base for using the SCARED in pediatric.
EVIDENCE ABOUT DIAGNOSTIC TESTS Min H. Huang, PT, PhD, NCS.
EDU 8603 Day 6. What do the following numbers mean?
6. Evaluation of measuring tools: validity Psychometrics. 2012/13. Group A (English)
Confirmatory Factor Analysis Psych 818 DeShon. Construct Validity: MTMM ● Assessed via convergent and divergent evidence ● Convergent – Measures of the.
Measurement Models: Exploratory and Confirmatory Factor Analysis James G. Anderson, Ph.D. Purdue University.
Appraisal and Its Application to Counseling COUN 550 Saint Joseph College For Class # 3 Copyright © 2005 by R. Halstead. All rights reserved.
VALIDITY AND VALIDATION: AN INTRODUCTION Note: I have included explanatory notes for each slide. To access these, you will probably have to save the file.
Reliability Analysis Based on the results of the PAF, a reliability analysis was run on the 16 items retained in the Task Value subscale. The Cronbach’s.
Neural test theory model for graded response data SHOJIMA Kojiro The National Center for University Entrance Examinations, Japan 1.
Research Methodology and Methods of Social Inquiry Nov 8, 2011 Assessing Measurement Reliability & Validity.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Assessing Measurement Quality in Quantitative Studies.
Validity and Item Analysis Chapter 4. Validity Concerns what the instrument measures and how well it does that task Not something an instrument has or.
Validity and Item Analysis Chapter 4.  Concerns what instrument measures and how well it does so  Not something instrument “has” or “does not have”
NATIONAL CONFERENCE ON STUDENT ASSESSMENT JUNE 22, 2011 ORLANDO, FL.
Item Response Theory in Health Measurement
Chapter 6 - Standardized Measurement and Assessment
Reliability a measure is reliable if it gives the same information every time it is used. reliability is assessed by a number – typically a correlation.
2. Main Test Theories: The Classical Test Theory (CTT) Psychometrics. 2011/12. Group A (English)
The Process of Psychometric Validation of an Instrument across Language and Culture Halfway around the World Huey-Shys Chen PhD, RN, CHES Assistant Professor,
Chapter 14 EXPLORATORY FACTOR ANALYSIS. Exploratory Factor Analysis  Statistical technique for dealing with multiple variables  Many variables are reduced.
Copyright © 2014 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 11 Measurement and Data Quality.
Chapter 2 Norms and Reliability. The essential objective of test standardization is to determine the distribution of raw scores in the norm group so that.
Classical Test Theory Psych DeShon. Big Picture To make good decisions, you must know how much error is in the data upon which the decisions are.
Copyright © 2014 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 25 Critiquing Assessments Sherrilene Classen, Craig A. Velozo.
Looking at the both ‘ends’ of the social aptitude dimension
Department of Psychology, University of Cincinnati
Further Validation of the Personal Growth Initiative Scale – II: Gender Measurement Invariance Harmon, K. A., Shigemoto, Y., Borowa, D., Robitschek, C.,
METHODS AND PARTICIPANTS ANALYSES AND STUDY QUESTIONS
The 12-Item General Health Questionnaire (GHQ-12):
Evaluation of measuring tools: validity
Item Analysis: Classical and Beyond
Reliability & Validity
Reliability and Validity of Measurement
First study published in JOGS.
Validation of the Portuguese DSM-IV-MR-J
National Conference on Student Assessment
By ____________________
EPSY 5245 EPSY 5245 Michael C. Rodriguez
Item Analysis: Classical and Beyond
Multitrait Scaling and IRT: Part I
Item Analysis: Classical and Beyond
Presentation transcript:

Examining Psychometric Properties of the Kutcher Adolescent Depression Scale (KADS-11) Using Multidimensional Item Response Theory (MIRT) M. Shojaee, O. Bulut, M. Shahidi Centre for Research in Applied Measurement and Evaluation, University of Alberta Method Abstract This study involved 300 students who were randomly selected by using the systematic multistage random sampling method from Islamic Azad University-Tehran Central in Iran. Since the KADS-11 was demonstrated to be a multidimensional scale measuring Core Depressive Symptomatic Factor and Suicidal and Physical Factor (Shahidi & Shojaee, 2014), and also because the IRT models were not used to analyze its items before, the multidimensional version of GRM (Samejima, 1969) was used to address the research questions in this study. The goal of the GRM is to model the relationship between item responses and the latent trait when the ratings include two or more ordered categories using item parameters (Muraki, & Carlson, 1993). Based on the rationales and suitability of the GRM for KADS-11, the researchers analyzed the item difficulty, item discrimination, test information function, the standard error of measurement and the item information function of the scale through using R software. Additionally, the most psychometric properties such as the construct validity and the reliability of KADS-11 were also analyzed based on CTT through using SPSS to provide a pathway for comparing both IRT and CTT. Using the principal component analysis with the Varimax rotation for the factorial structure of instrument, the analysis was restricted by the following criterion to select the items for a factor: ‘an item must have loaded at least 0.40 on its own factor but less than 0.40 on any other factor’. To estimate the reliability of the data, alpha coefficient was computed as a measure of internal consistency. Model fit. Inspecting whether GRM is an appropriate model for this type of data, a few matters were checked: First, the accuracy of the parameters; item difficulties or boundaries, item discriminations and person parameter or level of the participants’ ability were estimated by IRT. Second, expected total score diagram and finally the standard error of measurement were examined. The last evidence for fitting GRM to this data is the Standard Error of Measurements (SEM). The diagram of ‘Test Standard Error’ displayed that when there are not enough responses in a specific level of theta, the level of error goes up. Discussion: One of the advantages in analyzing the data with IRT is that the researchers could have more technical information about each item of a test. This valuable information led to optimize developing KADS-11. In this study to visualize the amount of information that can be yielded from the each item was depicted in the ‘Item Information Trace Line’ (see Figure 5). As shown in the Figure 5, most of the items provide the maximum amount of information close to 1.00, which implies a reasonable amount of information about the latent trait (depression) in the participants. Particularly, items 8 and 5 gained the most amount of information indicating that these items are key items in KADS-11. The other notably fact was that most of the items were centralized on the middle range of the trait, and there were not much information about the two extreme levels of the trait. This characteristic is compatible with the nature of sample population which was a normal population. However, the items 3, 10 and 11 could provide some information about the higher level of the depression. Kutcher Adolescent Depression Scale-11 Items (KADS-11) is a diagnostic instrument measuring depression and suicidal thoughts in adolescents and young adults. Some characteristics of KADS-11 such as ease of administration, treatment sensitivity, and the ability to distinguish comorbid symptoms motivated the researchers to examine the psychometric properties of the scale by utilizing the multidimensional form of Graded Response Model in order to find the relationship between item responses and the latent trait. Results indicated that most of the items provided the maximum amount of information about examinees’ depression, and also two extracted factors (Core Depressive factor and Suicidal and Physical factor) could explain 55.20% of total variance of KADS-11. . Introduction Depression is one of the prevalent mental disorders across Canada (Centre for Addiction and Mental Health-CAMH, 2011; Canadian Mental Health Association-CMHA, 2013) and around the world (World Health Organization, 2015). Amongst various depressive tools, Hamilton Rating Scale for Depression (HAM-D or HRSD- Bech, Paykel, Sireling, & Yiend, 2015; Broen et al., 2015; Hamilton, 1960), Hospital Anxiety and Depression Scale (HADS-FC- Roberge, Dore, Menear, Chartrand, Ciampi, Duhoux, & Fournier, 2013) and Kutcher Adolescent Depression Scale-11 Items (KADS-11) (Brooks & Kutcher, 2001; Shahidi & Shojaee, 2014) are seemingly the popular scales that are used in most clinical settings. Based on the nature of diagnostic tools (e.g., unidimensional or multidimensional), it is important whether such tools fulfil certain psychometric requirements or not (Ackerman, Gierl & Walker, 2003; Sheng & Wikle, 2007). The psychometric requirements for diagnostic and screening scales have been argued in two major theories, Classical Test Theory (CTT) and Item Response Theory (IRT). Using both CTT and IRT in an integrative way to explore, develop and to maximize the applicability of scales is recently central to psychometric studies (Barthel, et al., 2014; Güler, et al., 2014; Rubio, et al., 2007; Zoghi, & Valipour, 2014). To explore the strengths and weaknesses of KADS-11, the current study was focused on the psychometric properties of KADS-11. Accordingly, the following questions were addressed: 1) what are the item parameters of KADS-11 based on CTT and IRT? 2) Are the estimated parameters in IRT comparable with estimated parameters in CTT for KADS-11? 3) Does the GRM in IRT have a better fit to the data compared to CTT for KADS-11? 4) Is there any advantage for the scale to be analyzed based on multidimensional item response theory (MIRT)? Conclusion In conclusion, since the parameterization is comparable in both measurement theories, and because IRT has many advantages in analyzing this data including extra technical information about each item, better treatment of the SEM and better model fit, it can be deduced that the IRT provides the researchers with a more useful and comprehensive information regarding the multidimensional instruments and the participants simultaneously. Next Step With regards to the dimensions of this scale, it is suggested that the two items in Suicidal and Physical Factor are not enough to measure individuals’ suicidal thoughts and the physical symptoms of depression. Although, this research showed that these items (e.g., 10 and 11) determined significantly a unique factor, these items should be increased to a reliable level in the next new version of scale. Author contact information: mshojaee@ualbeerta.ca