O Nearly all test-retest SUV measurements on the same scanner were within approximately 20% and 1.0 SUV units of each other. On different scanners SUV.

Slides:



Advertisements
Similar presentations
Imaging to Guide Early Drug Trials David A. Mankoff, MD, PhD Seattle Cancer Care Alliance University of Washington Seattle, WA work supported by NIH Grants.
Advertisements

Departments of Medicine and Biostatistics
Establishing the Integrity of Data:
Data Acquisition Work Group Co-Chairs Paul E. Kinahan and Thomas L. Chenevert DAWG Milestone Chart Diffusion MRI Demonstration Project: Gradient.
Propagation of Error Ch En 475 Unit Operations. Quantifying variables (i.e. answering a question with a number) 1. Directly measure the variable. - referred.
FDG PET/CT. FDG-PET/CT Technical Committee Breakout Regular meetings of working groups with RSNA administrative support and milestones for delivery.
ACRIN 6687 A Phase 2, Multicenter Evaluation of 18 F-Fluoride PET as a Pharmacodynamic Biomarker for Dasatinib, a Src Kinase Inhibitor, in Men With Castration-Resistant.
Laboratory Quality Control
Chapter 10 Simple Regression.
10-1 Introduction 10-2 Inference for a Difference in Means of Two Normal Distributions, Variances Known Figure 10-1 Two independent populations.
Concept of Measurement
PROGNOSTIC SIGNIFICANCE OF PRIMARY TUMORAL FDG UPTAKE MEASURED BY PET: Systematic Review and Meta-analysis Ben A. Dwamena, MD.
Statistical Methods in Computer Science Hypothesis Testing I: Treatment experiment designs Ido Dagan.
Data Handling l Classification of Errors v Systematic v Random.
Archiving Experiments Based on Radiological Images Fred Prior, Ph.D. Mallinckrodt Institute of Radiology Washington University School of Medicine.
Statistical Methods in Computer Science Hypothesis Testing I: Treatment experiment designs Ido Dagan.
Reliability of Selection Measures. Reliability Defined The degree of dependability, consistency, or stability of scores on measures used in selection.
Measurement System Analysis Kevin B. Craner Boise State University October 6, 2003.
Quality Assurance in the clinical laboratory
Validity and Reliability Dr. Voranuch Wangsuphachart Dept. of Social & Environmental Medicine Faculty of Tropical Medicine Mahodil University 420/6 Rajvithi.
ISSSMA June 3 rd Advanced quantification in oncology PET Irène Buvat IMNC – UMR 8165 CNRS – Paris 11 University Orsay, France
AMERICAN COLLEGE OF RADIOLOGY March 23, OUR MISSION To foster the ongoing development of widely acceptable consistent imaging.
A small clinical trial has been designed between QIN site JHU, the U of Pittsburgh QIN, and the Adult Brain Tumor Consortium. In this trial, QIN methods.
We are studying the factors affecting reproducibility of SPECT images using phantom, simulation, and patient studies. Absolute quantification in SPECT.
International Atomic Energy Agency L 4 PROTECTION ISSUES IN CLINICAL METHODOLOGY.
Education 793 Class Notes T-tests 29 October 2003.
QIBA CT Volumetrics - Cross-Platform Study (Group 1C) December 23, 2009 CT Cross-platform Sizing of Phantom Lesions Quantitative Imaging Biomarker Alliance.
Analyzing Reliability and Validity in Outcomes Assessment (Part 1) Robert W. Lingard and Deborah K. van Alphen California State University, Northridge.
Comparing Two Population Means
Introduction Stomatal conductance regulates the rates of several necessary processes in plants including transpiration, carbon dioxide assimilation, and.
May 19-20, Quantitation, both single point and longitudinal*, of tumor metabolism via FDG-PET/CT that can be used practically and efficiently as.
1 Inter-observer Agreement of The Response To Therapy AssessmentInter-observer Agreement of The Response To Therapy Assessment in Advanced Lung Cancer.
Quality Control Lecture 5
Research Project Statistical Analysis. What type of statistical analysis will I use to analyze my data? SEM (does not tell you level of significance)
EVIDENCE ABOUT DIAGNOSTIC TESTS Min H. Huang, PT, PhD, NCS.
L Berkley Davis Copyright 2009 MER301: Engineering Reliability Lecture 16 1 MER301: Engineering Reliability LECTURE 16: Measurement System Analysis and.
Corinne Introduction/Overview & Examples (behavioral) Giorgia functional Brain Imaging Examples, Fixed Effects Analysis vs. Random Effects Analysis Models.
QIBA CT Volumetrics - Cross-Platform Study (Group 1C) September 2, 2009 Interclinical Comparison of CT Volumetry Quantitative Imaging Biomarker Alliance.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Scanner Validation via the SNM Clinical Trials Network Phantom Program Paul E. Christian Molecular Imaging Program Huntsman Cancer Institute University.
Propagation of Error Ch En 475 Unit Operations. Quantifying variables (i.e. answering a question with a number) 1. Directly measure the variable. - referred.
FDG-PET/CT Technical Committee Purpose: to foster adoption of practical [I deleted cost-effective because I do not see us doing any cost-effective analyses.
1 Module One: Measurements and Uncertainties No measurement can perfectly determine the value of the quantity being measured. The uncertainty of a measurement.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Sample Size Determination
Statistics Presentation Ch En 475 Unit Operations.
Albert J. Chang, MD, PhD 1 Farrokh Dehdashti, MD 2 Perry W. Grigsby, MD, MS 1 Department of Radiation Oncology 1 Department of Radiology and Nuclear Medicine.
T tests comparing two means t tests comparing two means.
L ABORATORY Q UALITY C ONTROL. INTRODUCTION _A major role of the clinical laboratory is the measurement of substances in body fluids or tissues for the.
4D XCAT Phantom for Multimodality Imaging Research W. Paul Segars, PhD Carl E. Ravin Advanced Imaging Labs Duke University.
Quality Assurance and Quality Control in Longitudinal Studies Coralyn W. Whitney, Bonnie K. Lind, and Patricia W. Wahl Epidemiologic Reviews, 1998, Vol.
Thursday Case of the Day The arrow indicates a region of high apparent FDG uptake posterior to the bladder. What is the cause of this artifact? a) attenuation.
JESÚS SILVA-RODRÍGUEZ, PABLO AGUIAR, INÉS DOMÍNGUEZ-PRADO, MICHEL HERRANZ, ÁLVARO RUIBAL 18F-Choline: Is shine-through effect an issue for prostate SUV.
Statistical hypothesis Statistical hypothesis is a method for testing a claim or hypothesis about a parameter in a papulation The statement H 0 is called.
CRITICALLY APPRAISING EVIDENCE Lisa Broughton, PhD, RN, CCRN.
Clinical practice involves measuring quantities for a variety of purposes, such as: aiding diagnosis, predicting future patient outcomes, serving as endpoints.
The Role of PET Imaging in the Treatment of Metastatic Breast Cancer
WONIX: Training and Scanner Harmonization
Sample Size Determination
Journal of Nuclear Cardiology | Official Journal of the American Society of Nuclear Cardiology 18Fluorine Sodium Fluoride Positron Emission Tomography,
Monday Physics Case of the Day Answers (1 of 3) Paul E
Tuesday Case of the Day Physics History: A B MIP
WONIX: Training and Scanner Harmonization
Transaxial PET without attenuation correction.
This teaching material has been made freely available by the KEMRI-Wellcome Trust (Kilifi, Kenya). You can freely download,
Analyzing Reliability and Validity in Outcomes Assessment Part 1
Gerald Dyer, Jr., MPH October 20, 2016
Analyzing Reliability and Validity in Outcomes Assessment
What makes a good experimental design?
Presentation transcript:

o Nearly all test-retest SUV measurements on the same scanner were within approximately 20% and 1.0 SUV units of each other. On different scanners SUV measurements were within 24% and 1.0 SUV units. Careful instrument calibration and strict implementation of patient handling procedures contribute to optimizing reproducibility o Some variables, such as uptake time, can be easily standardized, controlled, and optimized, contributing minimally to FDG SUVmax variability. o Other variables, such as patient blood glucose, are not as easily controlled and may contribute by a greater degree to FDG SUVmax variability. o Factors contributing to SUVmax which have greater variability can reduce study power significantly o If PET/CT systems are carefully calibrated and monitored, and imaging protocols are consistent, then variability associated with FDG SUVmax between scans is similar to prior test/retest studies o Clinical trials that utilize quantitative PET/CT imaging throughout a network of calibrated PET/CT scanners could increase patient recruitment and improve confidence in trial results. o Phantom measurements suggest that instrumentation-related variability is on the order of 5% assuming proper calibration, in accord with previous studies 4 o Expand data set to include more sites and test-retest data acquired at two different sites o Develop PET study guidelines that incorporate instrument performance, patient variability, and protocol adherence into study design. Acknowledgments o This work was supported by o NIH grant U01-CA o NCI-SAIC Contract 24XS References 1. Boellaard, R. Standards for PET Image Acquisition and Quantitative Data Analysis. J. Nucl. Med., Vol 50 no. Suppl Beaulieu, S, Kinahan, P, Tseng, J, Dunnwald, L, Schubert, E, Pham, P, Lewellen, B, Mankoff, D. SUV Varies with Time After Injection in 18 F-FDG PET of Breast Cancer. J Nuc Med 2003; 44: Doot, RK, Kurland, BF, Kinahan, PE, Mankoff, DA. Design Considerations for using PET as a Response Measure in Single Site and Multicenter Clinical Trials, Acad. Rad., 19(2), February 2012:184– Doot, RK, Scheuermann, JS, Christian, PE, Karp, JS, Kinahan, PE. Instrumentation factors affecting variance and bias of quantifying tracer uptake with PET/CT. Med Phys Nov; 37(11): 6035–6046. Figure 3. Select lesions from test-retest studies. Pt #7 (left) was studied on the same scanner. Pt #8 (right) was studied on different scanners. Table 1: Patient characteristics, including scan location, time between scans and the number of lesions evaluated. Repeat Patient Scans 1. University of Washington, Seattle, WA 2. University of Pittsburgh, Pittsburgh, PA 3. Seattle Cancer Care Alliance, Seattle, WA 4. University of Pennsylvania, Philadelphia, PA Contact o Lanell M Peterson Figure 1. Dependence of SUV on uptake time 1 Figure 2. Impact of measurement error on required sample sizes from the two-sample t-test (80% power, type I error rate [alpha] = 0.05) 3 Calibration kits developed by RadQual, LLC are implicitly NIST traceable and long-lived, allowing very accurate bias measurements o For clinical trials using quantitative PET/CT, knowledge of SUV reproducibility is important for proper study design, clinical decision- making, and patient management o Potential sources of variability include inconsistent patient handling, inconsistent protocol adherence, and suboptimal instrument performance o Reducing variability can increase confidence in trial design, accelerate patient access to trials, increase accrual, and optimize clinical trial power o Understanding the influence of patient handling and instrument performance on quantitative PET/CT measures is a first step in developing protocols that minimize bias and variance. o Factors such as uptake time (Figure 1), patient physiology, and instrument calibration are known to be sources of variability in standardized uptake values (SUVs) 1, 2 o Reducing SUV variability leads to greater statistical power in clinical trials (Figure 2) 3 Materials and Methods Reproducibility of FDG SUVmax for metastatic breast cancer lesions in the same or different PET/CT scanners in a local network Lanell M Peterson 1, Brenda F. Kurland 2, Andrew T Shields 1, Darrin Byrd 1, Alena Novakova 3, Rebecca Christopel 1, Mark Muzi 1, David A. Mankoff 4, Hannah M. Linden 3, and Paul Kinahan 1 Accurate reproducibility of PET SUVs is important for patient management and for clinical trial design. Measuring and reducing SUV variability in PET scanners throughout a local area network can aid in monitoring patient response to therapy and may increase patient accrual to clinical trials o Ten female patients with metastatic breast cancer o Each underwent identical-protocol paired test-retest FDG PET/CT studies o No interim change in therapy or management o Seven patients were studied in the same scanner and 3 patients were studied in 2 different scanners o Each PET/CT scanner’s quantitative performance was monitored with NIST-traceable reference sources to ensure proper calibration o Images were interpreted and SUV metrics were estimated at a central lab o Linear mixed models with a random intercept were fitted to compare test-retest differences in multiple lesions per patient Background Table 2: Statistics for all test-retest lesions (10 patients, 68 lesions). 95% repeatability limits = 1.3 for difference, 21.8% for % difference. Results Future Directions Methods: Patients Materials: Calibration Process for Each Scanner Objective Discussion Conclusion VariableNMea n Std DevMinimumMedianMaximum SUV_avg SUV_absdiff SUV_pctdiff SequenceN Obs VariableNMeanStd Dev MinimumMedianMaximum Same scanner 35SUV_avg SUV_absdiff SUV_pctdiff Different scanners 33SUV_avg SUV_absdiff SUV_pctdiff Table 4: Statistics for all lesions on the same and different scanners. 95% repeatability limits = 1.3 for difference, 19.7% for % difference on the same scanner and 1.3 for difference and 23.5% for % difference in different scanners. Lesion Type N Obs VariableNMeanStd Dev MinimumMedianMaximum Bone52SUV_avg SUV_absdiff SUV_pctdiff Other16SUV_avg SUV_absdiff SUV_pctdiff Table 3: Statistics for bone and other lesions. 95% repeatability limits for bone lesions = 1.1 for difference, 18.8% for % difference. 95% repeatability limits for other lesions = 2.0 for difference and 30.5% for % difference. FDG testRetest, 1 day later Pt#AgeScan 1 location Scan 2 location Days - Scans # lesions 151SCCA SCCA SCCA SCCA SCCA SCCA SCCA SCCAUW HMCUW HMCUW143 Figure 4: Bland-Altman plots of SUVmax for repeated scans: a) 7 patients with repeat scans using the same scanner; b) 3 patients with repeat scans using different scanners. Plotting characters are the same for multiple lesions in a single patient. Most differences are less than 1 SUVmax unit and do not appear to differ by magnitude of SUVmax or by whether repeat scan occurred in the same or different scanner. FDG testRetest, 15 days later