Purpose Dimensional versus Dichotomous Scoring: A Meta-analytical Review of Personality Disorder Stability James B. Hoelzle, B.S., Erin M. Guell, B.A.,

Slides:



Advertisements
Similar presentations
Test Development.
Advertisements

RELIABILITY Reliability refers to the consistency of a test or measurement. Reliability studies Test-retest reliability Equipment and/or procedures Intra-
Correlation Chapter 6. Assumptions for Pearson r X and Y should be interval or ratio. X and Y should be normally distributed. Each X should be independent.
 A description of the ways a research will observe and measure a variable, so called because it specifies the operations that will be taken into account.
Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 12 Measures of Association.
Effect Size and Meta-Analysis
Part II Knowing How to Assess Chapter 5 Minimizing Error p115 Review of Appl 644 – Measurement Theory – Reliability – Validity Assessment is broader term.
National Human Services Training Evaluation Symposium Fasih Ahmed, Ph.D.
1 Graphical Diagnostic Tools for Evaluating Latent Class Models: An Application to Depression in the ECA Study Elizabeth S. Garrett Department of Biostatistics.
15 de Abril de A Meta-Analysis is a review in which bias has been reduced by the systematic identification, appraisal, synthesis and statistical.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
FOUNDATIONS OF NURSING RESEARCH Sixth Edition CHAPTER Copyright ©2012 by Pearson Education, Inc. All rights reserved. Foundations of Nursing Research,
Correlational Designs
Chapter 7 Correlational Research Gay, Mills, and Airasian
Chapter 14 Inferential Data Analysis
Educational Research: Correlational Studies EDU 8603 Educational Research Richard M. Jacobs, OSA, Ph.D.
8/10/2015Slide 1 The relationship between two quantitative variables is pictured with a scatterplot. The dependent variable is plotted on the vertical.
Smith/Davis (c) 2005 Prentice Hall Chapter Eight Correlation and Prediction PowerPoint Presentation created by Dr. Susan R. Burns Morningside College.
BASIC STATISTICS WE MOST OFTEN USE Student Affairs Assessment Council Portland State University June 2012.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Measurement and Data Quality
Background Research consistently indicates that numerous factors from multiple domains (e.g., individual, family) are associated with heavy alcohol use.
Inference for regression - Simple linear regression
Chapter 13: Inference in Regression
Fundamentals of Statistical Analysis DR. SUREJ P JOHN.
@ 2012 Wadsworth, Cengage Learning Chapter 5 Description of Behavior Through Numerical 2012 Wadsworth, Cengage Learning.
Statistics for Education Research Lecture 3 Bivariate Correlations Coefficients Instructor: Dr. Tung-hsien He
Foundations of Educational Measurement
McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. Educational Research: Fundamentals.
ISSPD_The Hague_2007 Extending the Spectrum Idea: Child Personality, Parenting and Psychopathology Karla Van Leeuwen, Ivan Mervielde, Barbara De Clercq,
LECTURE 06B BEGINS HERE THIS IS WHERE MATERIAL FOR EXAM 3 BEGINS.
Analyzing Reliability and Validity in Outcomes Assessment (Part 1) Robert W. Lingard and Deborah K. van Alphen California State University, Northridge.
Self-Esteem and Problem Drinking Among Male & Female College Students William R. Corbin, Lily D. McNair, James Carter University of Georgia Journal of.
بسم الله الرحمن الرحیم.. Multivariate Analysis of Variance.
L 1 Chapter 12 Correlational Designs EDUC 640 Dr. William M. Bauer.
Statistical analysis Prepared and gathered by Alireza Yousefy(Ph.D)
HOW TO WRITE RESEARCH PROPOSAL BY DR. NIK MAHERAN NIK MUHAMMAD.
Diagnosing Mental Disorders- The Multiaxial Approach
The Examination of Residuals. Examination of Residuals The fitting of models to data is done using an iterative approach. The first step is to fit a simple.
Counseling Research: Quantitative, Qualitative, and Mixed Methods, 1e © 2010 Pearson Education, Inc. All rights reserved. Basic Statistical Concepts Sang.
Reliability & Agreement DeShon Internal Consistency Reliability Parallel forms reliability Parallel forms reliability Split-Half reliability Split-Half.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Discriminant Analysis Discriminant analysis is a technique for analyzing data when the criterion or dependent variable is categorical and the predictor.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
SW388R6 Data Analysis and Computers I Slide 1 Multiple Regression Key Points about Multiple Regression Sample Homework Problem Solving the Problem with.
Classification and Diagnosis Multiaxial System. Little Early Uniformity World Health Organization –In 1939 classified mental and physical disorders –Not.
Chapter 16 Data Analysis: Testing for Associations.
© (2015, 2012, 2008) by Pearson Education, Inc. All Rights Reserved Chapter 11: Correlational Designs Educational Research: Planning, Conducting, and Evaluating.
Statistics in Applied Science and Technology Chapter14. Nonparametric Methods.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Correlation They go together like salt and pepper… like oil and vinegar… like bread and butter… etc.
©2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Chapter 7 Measuring of data Reliability of measuring instruments The reliability* of instrument is the consistency with which it measures the target attribute.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Reliability: Introduction. Reliability Session 1.Definitions & Basic Concepts of Reliability 2.Theoretical Approaches 3.Empirical Assessments of Reliability.
Educational Research: Data analysis and interpretation – 1 Descriptive statistics EDU 8603 Educational Research Richard M. Jacobs, OSA, Ph.D.
M ENTAL HEALTH V S M ENTAL ILLNESS. N ORMALITY AND ABNORMALITY SOCIO-CULTURAL Behaviour that is accepted in a particular society or culture, but not in.
Beginners statistics Assoc Prof Terry Haines. 5 simple steps 1.Understand the type of measurement you are dealing with 2.Understand the type of question.
Data Analysis. Qualitative vs. Quantitative Data collection methods can be roughly divided into two groups. It is essential to understand the difference.
DISCRIMINANT ANALYSIS. Discriminant Analysis  Discriminant analysis builds a predictive model for group membership. The model is composed of a discriminant.
Educational Research Descriptive Statistics Chapter th edition Chapter th edition Gay and Airasian.
Copyright © 2014 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 11 Measurement and Data Quality.
PSY 325 AID Education Expert/psy325aid.com FOR MORE CLASSES VISIT
Choosing and using your statistic. Steps of hypothesis testing 1. Establish the null hypothesis, H 0. 2.Establish the alternate hypothesis: H 1. 3.Decide.
Statistics & Evidence-Based Practice
Logic of Hypothesis Testing
Measures of Agreement Dundee Epidemiology and Biostatistics Unit
Reliability and Validity of Measurement
Analyzing Reliability and Validity in Outcomes Assessment Part 1
Statistics II: An Overview of Statistics
Presentation transcript:

Purpose Dimensional versus Dichotomous Scoring: A Meta-analytical Review of Personality Disorder Stability James B. Hoelzle, B.S., Erin M. Guell, B.A., & Gregory J. Meyer, Ph.D. The University of Toledo We conducted a meta-analysis to examine dimensional or dichotomous measurement as a potential moderator of personality disorder (PD) stability over time. Relevant personality disorders were those in the body or appendix of DSM version III, III-R, IV, or IV-TR, or ICD version 9 or 10. Psychopathy was also included because of its close link to antisocial PD. Sixty-nine articles met inclusion criteria. From these we coded seventy-five independent or dependent samples. An independent sample presented one set of findings per group of subjects whereas a dependent sample presented different types of data for the same group of subjects (e.g., PD assessed at 6 month and 1 year retest intervals). Findings for instruments in the same method family (i.e., semi-structured interview, questionnaire) were averaged. Table 1. Descriptive Statistics The average retest interval was positively skewed (skew = 4.1) so it was re-expressed using the natural log transformation, which normalized the distribution (skew = -.01). The average Log retest interval of 4.7 is equal to days. Methods continued Introduction By definition, personality disorders are long-standing, stable dispositions that are inflexible and maladaptive. Since 1980 when the 3rd edition of the Diagnostic and Statistical Manual for Mental Disorders (DSM-III) was published and personality disorders began to be systematically classified on a distinct DSM axis, an extensive research literature developed examining the reliability and validity of personality disorders. However, at present, a systematic and quantitative summary indicating the extent to which personality disorders remain stable over time does not exist. Because stability is a defining feature of personality disorders, the current study fills this important gap in the evidence base. Furthermore, we focus on dichotomous versus dimensional- measurement of personality disorders as a potential moderator of stability. The relative merits of dichotomous and dimensional measurements is an important one in regards to the creation of DSM-V. Results continued Table 2. Sample Size Weighted Average Stability Coefficients For each cell, k ranges from 6 to 38 and N ranges from 836 to To predict expected levels of stability over different time intervals, we used weighted least squares regression analyses, with weights determined by sample size. Length of retest interval was the predictor and separate equations were generated for kappa- based averages across all PDs and the r-based average across all PDs. Because only 11 samples contributed ICC results, a prediction equation was not developed for the ICC findings. Below are the predicted stability values over time. Table 3. Predicted Stability For Any Specific PD Over Various Time Intervals Category Mean SD N Age Retest days Log retest Male 49.4% Type of Measurement 1 Week 1 Year 10 Years Pearson/ Spearman’s r Kappa DichotomousDimensional Average Across PDsKappaICC r or r S All Disorders Cluster A Cluster B Cluster C

Overall, the magnitude of the stability coefficients is surprisingly low. These findings call into question a key assumption in the DSM and ICD regarding the stability of PD characteristics. Nonetheless, we expected dimensional variables to be more stable over time than dichotomous diagnoses. The kappa and ICC coefficients in Table 2 show this hypothesis was clearly supported. In line with psychometric expectations, stability values are ordered such that kappa values are lower than ICC values, which in turn are lower than r values. This pattern is evident for the average across all PDs and in each of the three Clusters. Comparing the two dimensional stability coefficients, mean level changes in PD symptoms likely occurred at follow-up assessment causing ICC results to be less than r results. Table 2 also suggests that PD stability may vary as a function of the disorders under consideration. Disorders within Cluster B are generally more stable than those in Cluster C, which in turn appear somewhat more stable than those in Cluster A. Table 3 shows stability clearly decreases as the retest interval lengthens. Regardless of predicted retest interval, however, dimensional stability is higher than dichotomous stability. A growing amount of empirical research favors the utility of system-cluster dimensions over categorical diagnoses. As a result, the future structure of Axis II in the DSM is in question. Currently, the DSM requires a certain number of criteria to be met in order for a personality disorder diagnosis to be assigned. The diagnosis requires one to make a dichotomous or categorical decision on the presence of a personality disorder even though the underlying criteria form a dimensional measure. However, several authors (Clark & Watson, 1999; Widiger, 2000; Widiger & Clark, 2000) have proposed that for DSM-V it would be more appropriate to quantify Axis II characteristics dimensionally. There are strong psychometric rationales for preferring dimensional over dichotomous scores. When dimensional constructs are artificially dichotomized, it creates a loss of variance (Cohen, 1983) and lower reliability (MacCallum, Zhang, Preacher, and Rucker, 2002). Due to these reasons we expected personality disorders to be more stable over time when they are treated dimensionally rather than dichotomously. Three types of stability coefficients were examined. All studies examining dichotomous diagnoses reported kappa coefficients, while all studies examining dimensional scores reported either the intraclass correlation (ICC) or Pearson’s/Spearman’s r. Kappa and the ICC correct for chance agreement and are asymptotically equivalent (i.e., differences are increasingly trivial as N increases). Because of their statistical equivalence, when kappa and ICC results are compared in this meta-analysis it reveals the difference between dichotomous and dimensional PD classifications. Although both ICC and r can be computed on dimensional scores, r differs from the ICC (and kappa) because it is not lowered by mean differences, only by changes in rank ordering within a distribution. Thus, the ICC (and kappa) provides a more conservative estimate of stability than r. When considering all three types of coefficients, we expected kappa to be lower than ICC or r because it was computed from an artificially dichotomized dimensional construct. We also expected ICC to be lower than r because it is reduced by differences in both means and rank ordering, while r is reduced only by differences in rank ordering. Table 2 provides stability values for dimensional and dichotomous PD judgments organized by type of coefficient. Results are presented for any specific PD averaged across all disorders and within Clusters A, B, or C. Method To locate relevant studies we conducted a literature search on PsycINFO and PubMed to locate articles with the words "personality disorder" or "Axis II" in conjunction with "stability" or "retest." The initial search yielded 403 articles. To be included in the meta-analysis each article had to meet the following criteria: 1) average age  18-years, 2) sample size  20, 3) retest interval  1 week, 4) published in or after 1980, and 5) examine  1 specific ICD/DSM disorder or psychopathy. Clark, L. A., & Watson, D. (1999). Personality, disorder, and personality disorder: Towards a more rational conceptualization. Journal of Personality Disorders, 13, Cohen, J. (1983). The cost of dichotomization. Applied Psychological Measurement, 7, MacCallum, R. C., Zhang, S., Preacher, K. J., & Rucker, D. D. (2002). On the practice of dichotomization of quantitative variables. Psychological Methods, 7, Widiger, T. A. (2000). Personality disorders in the 21 st century. Journal of Personality Disorders, 14, Widiger, T. A., & Clark, L. A. (2000). Toward DSM-V and the classification of psychopathology. Psychological Bulletin, 126, ResultsDiscussion References