Limitations of Statistical Measures of Error in Assessing the Accuracy of Glucose Sensors Craig Kollman1, Darrell Wilson2, Tim Wysocki3, Rosanna Fiallo-Scharer4,

Slides:



Advertisements
Similar presentations
Running a model's adjoint to obtain derivatives, while more efficient and accurate than other methods, such as the finite difference method, is a computationally.
Advertisements

ASSESSING RESPONSIVENESS OF HEALTH MEASUREMENTS. Link validity & reliability testing to purpose of the measure Some examples: In a diagnostic instrument,
Down Alert Function of the GlucoWatch® G2 TM Biographer (GW2B) During Insulin- induced Hypoglycemia in Children Eva Tsalikian 1, Craig Kollman 2, Rosanna.
Learning Algorithm Evaluation
Diabetes Self-Management Profile- Revised for Conventional and Flexible Insulin Regimens Wysocki, T. 1, Xing, D. 2, Fiallo-Scharer, R. 3, Doyle, E. 4,
Descriptive statistics. Statistics Many studies generate large numbers of data points, and to make sense of all that data, researchers use statistics.
MARE 250 Dr. Jason Turner Hypothesis Testing II. To ASSUME is to make an… Four assumptions for t-test hypothesis testing:
Supervised classification performance (prediction) assessment Dr. Huiru Zheng Dr. Franscisco Azuaje School of Computing and Mathematics Faculty of Engineering.
Quality Control Procedures put into place to monitor the performance of a laboratory test with regard to accuracy and precision.
Evaluating Hypotheses
Quality Assurance.
Statistics 800: Quantitative Business Analysis for Decision Making Measures of Locations and Variability.
Lehrstuhl für Informatik 2 Gabriella Kókai: Maschine Learning 1 Evaluating Hypotheses.
Measurement Concepts & Interpretation. Scores on tests can be interpreted: By comparing a client to a peer in the norm group to determine how different.
Chemometrics Method comparison
Accuracy of the FreeStyle Navigator ™ Continuous Glucose Monitor Diabetes Research in Children Network Larry Fox, 1 Roy Beck, 2 Stuart Weinzimer, 3 Katrina.
Diabetes Research in Children Network (DirecNet) Outpatient Pilot Study to Evaluate the GlucoWatch® G2 TM Biographer in the Management of Type 1 Diabetes.
Accuracy of the A1cNow ® in Children with T1D. Diabetes Research in Children Network Larry Fox, 1 Dongyuan Xing, 2 Katrina Ruedy, 2 Roy Beck, 2 Craig Kollman,
Today Evaluation Measures Accuracy Significance Testing
Respected Professor Kihyeon Cho
Answering questions about life with statistics ! The results of many investigations in biology are collected as numbers known as _____________________.
There are no two things in the world that are exactly the same… And if there was, we would say they’re different. - unknown.
Diabetes Research in Children Network (DirecNet) Outpatient Pilot Study to Evaluate the Feasibility of Computer-Based Data Acquisition and Transmission.
A 5-center CRC-based Study of the Accuracy of the GlucoWatch® G2 TM Biographer in Children and Adolescents with Type 1 Diabetes Darrell Wilson 1, Bruce.
Topic 6.1 Statistical Analysis. Lesson 1: Mean and Range.
Mealtime Glycemic Excursions in Pediatric Subjects with Type 1 Diabetes: Results of the Diabetes Research in Children (DirecNet) Accuracy Study Study Group.
Abstract Background: As part of a study to evaluate the accuracy of the GlucoWatch ® G2 TM Biographer and the Continuous Glucose Monitoring System (CGMS.
Diabetes Control and Complications Trial (DCCT) Results indicate that most youth with T1DM should be treated intensively in order to reduce the risk of.
Background: DirecNet Diabetes Research in Children Network NIH funded collaborative study group 5 clinical centers, central laboratory, coordinating center,
Biostatistics: Measures of Central Tendency and Variance in Medical Laboratory Settings Module 5 1.
Diabetes Research in Children Network Pilot Study of the Navigator TM Continuous Glucose Monitoring System in Children with Type 1 Diabetes: Safety, Tolerability,
Data Analysis 1 Mark Stamp. Topics  Experimental design o Training set, test set, n-fold cross validation, thresholding, imbalance, etc.  Accuracy o.
Event-Based Assessments of Sensor Accuracy during Exercise-Induced Reductions in Glucose in Children with Type 1 Diabetes Craig Kollman, 1 Darrell Wilson,
Impact of Exercise on Overnight Glycemic Control in Children with Type 1 Diabetes (T1DM) Eva Tsalikian 1 ; Roy Beck 2 ; Peter Chase 3 ; Tim Wysocki 4 ;
Thinking About Psychology: The Science of Mind and Behavior 2e Charles T. Blair-Broeker Randal M. Ernst.
Doing Data Science – Chapter 12: Epidemiology Vast amounts of individual patient medical data is available – Detailed – visits, prescriptions, outcomes,
Internet-based pilot study comparing low fat with high fat evening snacks in children and adolescents with Type 1 Diabetes using continuous glucose monitoring.
Use of The FreeStyle Navigator TM Continuous Glucose Monitoring System in Children on Glargine- based Multiple Daily Injection Therapy Stuart Weinzimer.
Use of the FreeStyle Navigator ™ Continuous Glucose Monitoring System in Children with Type 1 Diabetes Diabetes Research in Children Network L. A. Fox,
Biostatistics Case Studies 2008 Peter D. Christenson Biostatistician Session 5: Choices for Longitudinal Data Analysis.
Lack of Effectiveness of the GlucoWatch Biographer (GW2B) in Altering Glycemic Control in Children with Type 1 Diabetes H. Peter Chase 1, Roy Beck 2, William.
A Comparison of the Original vs. Modified Continuous Glucose Monitoring System (CGMS™) Sensor During Hypoglycemia in the Diabetes Research in Children.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
MEASURES OF TEST ACCURACY AND ASSOCIATIONS DR ODIFE, U.B SR, EDM DIVISION.
Utility of CGMS as a Measure of Glycemic Control in Children with Type 1 Diabetes (T1DM) Rosanna Fiallo-Scharer, MD for.
Reliability of two indices of the biologic variability in glycosylation among children and adolescents with T1DM Darrell M Wilson 1, Rosanna Fiallo-Scharer.
Effectiveness of Early Intensive Therapy On β-Cell Preservation in Type 1 Diabetes Featured Article: Bruce Buckingham, M.D., Roy W. Beck, M.D., P.H.D.,
Evaluation of Factors Affecting CGMS Calibration Bruce Buckingham, 1 Craig Kollman, 2 Roy W Beck, 2 Andrea Kalajian, 2 Rosanna Fiallo-Scharer, 3 Michael.
Accuracy Study of the Medtronic Minimed Continuous Glucose Monitoring System (CGMS) and GlucoWatch® G2TM Biographer (GW2B) in Children with Type 1 Diabetes.
Alterations in White Matter Structure in Young Children With Type 1 Diabetes Featured Article: Naama Barnea-Goraly, Mira Raman, Paul Mazaika, Matthew Marzelli,
Evaluating Risk Adjustment Models Andy Bindman MD Department of Medicine, Epidemiology and Biostatistics.
Performance of the DCA 2000 ® + Analyzer for Measurement of Hemoglobin A1 c Levels in Children with T1DM in a DirecNet Outpatient Clinical Trial.
C82MST Statistical Methods 2 - Lecture 1 1 Overview of Course Lecturers Dr Peter Bibby Prof Eamonn Ferguson Course Part I - Anova and related methods (Semester.
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 2: Correlation of Time Courses of Simultaneous.
Machine Learning Tutorial-2. Recall, Precision, F-measure, Accuracy Ch. 5.
Assessing Responsiveness of Health Measurements Ian McDowell, INTA, Santiago, March 20, 2001.
Quality Control: Analysis Of Data Pawan Angra MS Division of Laboratory Systems Public Health Practice Program Office Centers for Disease Control and.
DirecNet Study of the Accuracy of the Navigator Continuous Glucose Monitoring System in Children and Adolescents with Type 1 Diabetes Darrell Wilson 1,
The Physiological Variations of Plasma Glucose Concentrations in Healthy, Non-Diabetic Children: Use of Continuous Glucose Sensors Nelly Mauras, Roy Beck,
Psychology’s Statistics Appendix. Statistics Are a means to make data more meaningful Provide a method of organizing information so that it can be understood.
Daniel A. Baur, Fernanda S. Vargas, Jordan A. Garvey, Christopher W. Bach, and Michael J Ormsbee, FACSM Institute of Sports Sciences and Medicine, Department.
Statistical analysis.
Statistical analysis.
Statistical Methods for Model Evaluation – Moving Beyond the Comparison of Matched Observations and Output for Model Grid Cells Kristen M. Foley1, Jenise.
Quantitative Data Analysis P6 M4
Introduction To Medical Technology
Estimating Population Parameters Based on a Sample
Quality Assessment The goal of laboratory analysis is to provide the accurate, reliable and timeliness result Quality assurance The overall program that.
Presentation transcript:

Limitations of Statistical Measures of Error in Assessing the Accuracy of Glucose Sensors Craig Kollman1, Darrell Wilson2, Tim Wysocki3, Rosanna Fiallo-Scharer4, Eva Tsalikian5, William Tamborlane6, Roy Beck1, Katrina Ruedy1, and the Diabetes Research In Children Network (DirecNet) Study Group. 1Jaeb Center for Health Research, Tampa, FL; 2Division of Pediatric Endocrinology and Diabetes, Stanford University, Stanford, CA; 3Nemours Children’s Clinic, Jacksonville, FL; 4Barbara Davis Center for Childhood Diabetes, University of Colorado, Denver, CO;5Department of Pediatrics, University of Iowa, Carver College of Medicine, Iowa City, IA; 6Department of Pediatrics, Yale University School of Medicine, New Haven, CT.

Abstract Limitations of Statistical Measures of Error in Assessing the Accuracy of Glucose Sensors. The appropriate set of accuracy measures to evaluate near continuous glucose monitoring remains to be developed. Traditional methods applied to glucose meters do not adequately capture the time dimension of glucose sensor data. Moreover, some of these methods have substantial limitations which should be understood in order to place analysis results in proper context. We highlight these limitations using data from an inpatient study conducted by the DirecNet Study Group. Error grid analyses and the area under the curve (AUC) for the detection of hypoglycemia are commonly cited statistics. The percentage of values within error grids A+B are usually quite high, even for inaccurate sensors, potentially giving a false sense of accuracy. When we simulated artificially inaccurate sensors by randomly shuffling paired sensor readings with laboratory reference glucose values, 76% and 78% of pairs still fell within zones A+B for the Clarke and modified error grids, respectively. The mean AUC value was 62%. Correlation is also frequently used to quantify the accuracy of glucose sensors. This measure, however, is sensitive to the variation in true glucose levels. Simulations were run distributing the “true” glucose levels uniformly over the ranges indicated below (N=10,000 per sensor) and adding a normally distributed error (standard deviation 25 mg/dL in each case) for the sensor value. These simulated sensors all have identical levels of accuracy, but their correlation values vary considerably.   In summary, the use of zone A+B percentage in error grid analysis and the AUC statistic can give misleading notions of sensor accuracy. The correlation coefficient is not a consistent measure of sensor accuracy. Novel statistical approaches are needed to better characterize the near continuous nature of these sensors.

Statistical Methods for Assessing Glucose Accuracy Originally developed for glucose meters. Do not capture the near continuous nature of glucose sensors. Difficult to assess trends. How well do sensors characterize acute changes in glucose?

Traditional Measures of Accuracy Error Grid Analysis Receiver Operating Characteristics (ROC) Area Under the Curve (AUC) Correlation Differences between Reference and Sensor Glucoses Difference Absolute difference Relative difference Relative absolute difference (RAD)

Goals of Error Grid Analysis Want to distinguish clinically meaningful vs. less important errors in glucose measurements. When would an erroneous value lead to an incorrect treatment decision? Sensors not approved for real time treatment decisions. Divide measurement errors into zones to distinguish increasing clinical significance of errors.

Potential Problems with Error Grid Analysis Zones A and B are narrower from 50-70 mg/dL. Many studies include few or no glucose values in this range. Sensor accuracy often measured by the percentage of points falling in zones A+B. This percentage can appear high, even for very inaccurate sensors. Can give a misleading notion of sensor accuracy through chance agreement.

Area Under the Curve (AUC) Often used to measure how accurately hypoglycemia (or hyperglycemia) is detected. Receiver Operating Characteristics (ROC) analysis. Look at different alarm levels that could be used and assess the sensitivity/specificity trade-off

Simulation Experiment Use data from the DirecNet Inpatient Accuracy Study. Make sensors artificially inaccurate by randomly shuffling the parings with the reference glucose. Resulting “sensors” still have realistic glucose distribution. Look at error grid and AUC analyses on resulting simulated data set. Repeat 10,000 times.

Results for Shuffled Sensors Clarke Error Grid 76% Zone A+B Modified Error Grid 78% AUC 62% mean value

Remarks Zones A and B on error grids are large enough that even inaccurate sensors will hit them the majority of the time. Much of the ROC curve involves alarm levels that would not realistically be used in practice. Resulting AUC value puts too much weight on high alarm settings.

Correlation Statistical measure of association. Number between –1 and +1. Often used as a measure of sensor accuracy. Sensitive to the amount of variation in the true glucose.

Another Simulation Create 4 simulated sensors so that each has identical accuracy. Do this by taking the sensor value to be the “true” value plus a normally distributed error with standard deviation = 25 mg/dL. Average value of the “true” glucose is 200 mg/dL for all 4 simulated sensors. Vary the range of true glucose values for each sensor.

Simulated Sensors with Identical Accuracy (N = 10,000 data pairs per sensor) Range of Pearson Intraclass Sensor # True Glucose Correlation Correlation 1 175-225 0.50 0.40 2 150-250 0.76 0.73 3 100-300 0.92 0.91 4 50-350 0.96 0.96

Summary Error grid analysis and AUC values can give inflated notions of sensor accuracy. Important to understand that “baseline” values of these statistics from inaccurate sensors are already high. Correlation is not a consistent measure of sensor accuracy.

Further Research Develop statistical measures that can incorporate the near continuous nature of sensor values. Assess sensors’ ability to detect acute changes in glucose. Measure any time lag in glucose readings.

Stanford University Bruce Buckingham Darrell Wilson Jennifer Block Paula Clinton Yale University William Tamborlane Stuart Weinzimer Elizabeth Boland Kristen Sikes Amy Steffen Jaeb Center for Health Research Roy Beck Katrina Ruedy Craig Kollman Dongyuan Xing Cynthia Silvester Barbara Davis Center H. Peter Chase Rosanna Fiallo-Scharer Jennifer Fisher Barbara Tallant University of Iowa Eva Tsalikian Michael Tansey Linda Larson Julie Coffey Amy Sheehan Nemours Children’s Clinic Tim Wysocki Nelly Mauras Keisha Bird Kelly Lofton