Statistical Methods for Multicenter Inter-rater Reliability Study

Slides:



Advertisements
Similar presentations
ANOVA and Linear Models. Data Data is from the University of York project on variation in British liquids. Data is from the University of York project.
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
RELIABILITY Reliability refers to the consistency of a test or measurement. Reliability studies Test-retest reliability Equipment and/or procedures Intra-
Correlation Chapter 6. Assumptions for Pearson r X and Y should be interval or ratio. X and Y should be normally distributed. Each X should be independent.
Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 12 Measures of Association.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Methods for Estimating Reliability
Chapter Seventeen HYPOTHESIS TESTING
Concept of Measurement
Chapter Eighteen MEASURES OF ASSOCIATION
Overview of STAT 270 Ch 1-9 of Devore + Various Applications.
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE © 2012 The McGraw-Hill Companies, Inc.
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.
Measures of Association Deepak Khazanchi Chapter 18.
Statistics 350 Lecture 17. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
Chapter 14 Inferential Data Analysis
13 Design and Analysis of Single-Factor Experiments:
Week 9: QUANTITATIVE RESEARCH (3)
AM Recitation 2/10/11.
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Chapter 11 Simple Regression
Simple Linear Regression
Learning Objective Chapter 14 Correlation and Regression Analysis CHAPTER fourteen Correlation and Regression Analysis Copyright © 2000 by John Wiley &
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-1 Review and Preview.
Descriptive Statistics e.g.,frequencies, percentiles, mean, median, mode, ranges, inter-quartile ranges, sds, Zs Describe data Inferential Statistics e.g.,
Instrumentation.
McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. Educational Research: Fundamentals.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Which Test Do I Use? Statistics for Two Group Experiments The Chi Square Test The t Test Analyzing Multiple Groups and Factorial Experiments Analysis of.
Statistics 11 Correlations Definitions: A correlation is measure of association between two quantitative variables with respect to a single individual.
CHAPTER 14: Nonparametric Methods to accompany Introduction to Business Statistics seventh edition, by Ronald M. Weiers Presentation by Priscilla Chaffe-Stengel.
1 Chapter 4 – Reliability 1. Observed Scores and True Scores 2. Error 3. How We Deal with Sources of Error: A. Domain sampling – test items B. Time sampling.
Reliability & Agreement DeShon Internal Consistency Reliability Parallel forms reliability Parallel forms reliability Split-Half reliability Split-Half.
Chapter 16 Data Analysis: Testing for Associations.
CHI SQUARE TESTS.
Academic Research Academic Research Dr Kishor Bhanushali M
METHODS IN BEHAVIORAL RESEARCH NINTH EDITION PAUL C. COZBY Copyright © 2007 The McGraw-Hill Companies, Inc.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Angela Hebel Department of Natural Sciences
Correlation and Linear Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
Reading Report: A unified approach for assessing agreement for continuous and categorical data Yingdong Feng.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Nonparametric Statistics
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Biostatistics Nonparametric Statistics Class 8 March 14, 2000.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Chapter 21prepared by Elizabeth Bauer, Ph.D. 1 Ranking Data –Sometimes your data is ordinal level –We can put people in order and assign them ranks Common.
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Pearson Product-Moment Correlation Test PowerPoint.
STATISTICAL TEST.
Chapter 13 Understanding research results: statistical inference.
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Spearman Rank-Order Correlation Test PowerPoint Prepared.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Statistics Correlation and regression. 2 Introduction Some methods involve one variable is Treatment A as effective in relieving arthritic pain as Treatment.
Chapter 81Introduction to Statistical Quality Control, 7th Edition by Douglas C. Montgomery. Copyright (c) 2012 John Wiley & Sons, Inc.
STATISTICAL TESTS USING SPSS Dimitrios Tselios/ Example tests “Discovering statistics using SPSS”, Andy Field.
1 Measuring Agreement. 2 Introduction Different types of agreement Diagnosis by different methods  Do both methods give the same results? Disease absent.
Appendix I A Refresher on some Statistical Terms and Tests.
Correlation & Simple Linear Regression Chung-Yi Li, PhD Dept. of Public Health, College of Med. NCKU 1.
Inference about the slope parameter and correlation
Copyright (c) 2005 John Wiley & Sons, Inc.
Chapter 10 CORRELATION.
Understanding Results
Introduction to Statistics
Natalie Robinson Centre for Evidence-based Veterinary Medicine
Non – Parametric Test Dr. Anshul Singh Thapa.
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE
15.1 The Role of Statistics in the Research Process
Presentation transcript:

Statistical Methods for Multicenter Inter-rater Reliability Study Jingyu Liu Allergan, Inc., USA July 12, 2006

Overview Introduction Multicenter inter-rater reliability Statistical methods References Acknowledgement July 12, 2006

Introduction Can BOTOX® improve arm or hand function of a post stroke spasticity patient? July 12, 2006

Function Assessments and Validation Identify function items Identify the measurement scale Are the function assessments reliable, sensitive, and clinically meaningful? What is the minimum important change (MIC) of the assessment? Assessment (scale) validation: reliability and validity Inter-rater reliability: One of the key steps in clinical outcome assessment scale validation. July 12, 2006

Multicenter inter-rater reliability Inter-rater reliability study: In a clinical setting, an inter-rater reliability study is to evaluate the agreement among different raters (or physicians) by using the same assessment scale on each of the subjects (or patients) enrolled in the study. Multicenter inter-rater reliability study: It is an inter-rater reliability study conducted in multiple clinical study centers (or sites). Why conduct multicenter inter-rater reliability study? It is difficult to conduct a large inter-rater reliability study in a single clinical study center. It is closer to an actual multicenter clinical study setting (preferred by some regulatory agency). July 12, 2006

Statistical Methods Will focus on the following: Inter-class correlation coefficient (ICC) based on the ANOVA method under a random effect model Introduce a new approach to evaluate the inter-rater reliability Nonparametric methods Discuss the other methods July 12, 2006

Statistical Methods Assessments at the i-th study center (i=1, 2, …,a): Subject Rater 1 2 … … 1 … 2 : July 12, 2006

Statistical Methods ANOVA method to evaluate the inter-rater reliability by an inter-class correlation coefficient (ICC) under a random effect model Statistical inference on is “straightforward”. July 12, 2006

Statistical Methods The ICC is the Pearson correlation coefficient of the assessments made by two raters on a same subject, i.e. where The Pearson correlation coefficient measures a linear relationship of two variables. It does not necessarily measure the “absolute” agreement. As noted by Lin [1] in a special case of two raters (or two assays), a good agreement between the two raters requires that the plot of the assessments of the two raters falls closely on a line through the origin. The ICC is influenced by the subject variation. In an inter-rater reliability study, the subject variation itself is not of interest and the subjects are usually not randomly selected. July 12, 2006

Statistical Methods For an interval variable with a domain [a, b], Liu [2] introduce a new agreement coefficient to evaluate the inter-rater reliability as follows or where is the theoretical range of the variable (i.e., = b – a), is the measurement variance of the raters, and is a pre-defined reliability scale parameter. is invariant under a linear transformation. With this property we can compare the reliabilities of the outcome assessment scales under different domains. Most clinical outcome assessments are either interval variable or ordinal numerical variable. is applicable to these variables. July 12, 2006

Statistical Methods where and For a multicenter inter-rater reliability study, can be estimated by where and When there are only two raters within each center, we have July 12, 2006

Statistical Methods Statistical inferences such as hypothesis test, confidence interval, and power calculation are obtained based on the results provided in Liu [2]. As a special case of two raters, the can be applied in evaluating the test-retest reliability or the intra-rater reliability. In clinical research, we select in calculating can be applied in more complicated situations such as imbalanced data as well as data with missing values. July 12, 2006

Statistical Methods Nonparametric approaches Other Statistical methods When the distribution assumption or the model assumption is not appropriate, we can use the nonparametric approach to evaluating the inter-rater reliability. For a single center study, we can directly use Kendall’s W For a multicenter study, Liu[3] introduced a new statistic which can be considered as an extension of Kendall’s W to evaluate the inter-rater reliability. The distribution property, hypothesis test, confidence interval, etc. are provided. Other weighted approaches Other Statistical methods July 12, 2006

References Lin LI. A concordance correlation coefficient to evaluate reproducibility. Biometrics 1989: 45, 255-268. Liu J. Measures of inter-rater reliability for interval data. (submitted for publication). Liu J. A nonparametric approach to evaluating multicenter inter-rater reliability. (submitted for publication). Shoukri M. Measures of Interobserver Agreement. Chapman & Hall/CRC. 2004 Fleiss JL. The Design and Analysis of Clinical Experiments. John Wiley & Sons. 1986. Schuck P. Assessing reproducibility for interval data in health-related quality of life questionnaires: Which coefficient should be used? Quality of Life Research 2004: 13, 571-586. Searle SR. Linear Model. John Wiley & Sons. 1971. Kendall MG. A new measure of rank correlation. Biometrika 1938 30:81-93. Mehta C, Patel N, Proc-StatXact 4, CYTEL Software Corporation, 1999. Friedman M. The use of ranks to avoid the assumption of normality implicit in the analysis of variance. Journal of American Statistical Association, 1937 32:675-701. July 12, 2006

Acknowledgement Thanks to Allergan Biostatistics and BOTOX-Neurology for the supports on this research. I also wish to thank the supports from the sponsors of the 2006 International Conference on Design of Experiments and Its Application. July 12, 2006