Practical Issues on Clinical Validation of Digital Imaging Applications in Routine Surgical Pathology FDA Hematology and Pathology Devices Panel Meeting.

Slides:



Advertisements
Similar presentations
 Is extremely important  Need to use specific methods to identify and define target behavior  Also need to identify relevant factors that may inform.
Advertisements

The Impact of Gynecologic Pathology Diagnostic Errors on Patient Care Dana Marie Grzybicki MD, PhD Colleen M. Vrbin, BA Danielle Pirain, BS Stephen S.
Digital Pathology for Improved Patient Care
ACR and SBI Statement Margarita Zuley, MD Associate Professor, Radiology Medical Director, Breast Imaging University of Pittsburgh.
Reading the Dental Literature
Estimation and Reporting of Heterogeneity of Treatment Effects in Observational Comparative Effectiveness Research Prepared for: Agency for Healthcare.
Measuring diagnostic accuracy of using digital slides in routine histopathology and analyzing sources of diagnostic errors László FÓNYAD 1st. Dept. of.
The ICH E5 Question and Answer Document Status and Content Robert T. O’Neill, Ph.D. Director, Office of Biostatistics, CDER, FDA Presented at the 4th Kitasato-Harvard.
Statistics for Health Care
Unit 4: Monitoring Data Quality For HIV Case Surveillance Systems #6-0-1.
1 Historical overview of FDA regulation of digital pathology imaging applications: the safety and effectiveness issues Tremel A. Faison, MS, RAC, SCT(ASCP)
Thoughts on Biomarker Discovery and Validation Karla Ballman, Ph.D. Division of Biostatistics October 29, 2007.
Diagnostic Assays to Plan Specific Drug Treatment Elizabeth Hammond MD.
High Volume Slide Scanning Architecture and Applications
12/10/02Harry Bushar1 Computerized Thermal Imaging Breast Cancer System 2100 (CTI BCS2100) Radiological Devices Advisory Panel December 10, 2002 Statistical.
Endoscopic Ultrasound in Chronic Pancreatitis
1 History and Lessons from FDA Regulation of Digital Radiology Kyle J. Myers, Ph.D. Division of Imaging and Applied Mathematics OSEL/CDRH/FDA October 22,
Chapter 1: Introduction to Statistics
Performance Measurement and Analysis for Health Organizations
Tissue Bank Challenges Repository and Pathologist View Elizabeth H. Hammond M.D.
Image Quality in Digital Pathology (from a pathologist’s perspective) Jonhan Ho, MD, MS.
1 R2 ImageChecker CT CAD PMA: Clinical Results Nicholas Petrick, Ph.D. Office of Science and Technology Center for Devices and Radiological Health U.S.
1 Preclinical-Bench Testing II Using Human Observers to Objectively Measure and Evaluate Imaging Performance of Digital WSI Systems Max Robinowitz, MD.
Real-Time Telepathology for Subspecialty Consultation Nikolaj Lagwinski, MD September 10 th, 2007.
Classroom Assessments Checklists, Rating Scales, and Rubrics
Designing a Random Assignment Social Experiment In the U.K.; The Employment Retention and Advancement Demonstration (ERA)
EDRN Approaches to Biomarker Validation DMCC Statisticians Fred Hutchinson Cancer Research Center Margaret Pepe Ziding Feng, Mark Thornquist, Yingye Zheng,
Developing a Validated Tool For Evaluating Whole Slide Images.
Validating A Currently Available Measuring Tool for Melanoma Gabor Hertz, M.D.
Laboratory QA/QC An Overview.
Pathology Reports Nicole Draper, MD.
Laying the Foundation for Scaling Up During Development.
Challenges of Cancer Diagnosis in Resource Limited Settings Optimizing Pathology Support Ann Marie Nelson, M.D. AIDS and Infectious Disease Pathology Joint.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
Regulatory Affairs and Adaptive Designs Greg Enas, PhD, RAC Director, Endocrinology/Metabolism US Regulatory Affairs Eli Lilly and Company.
Proposal for Storing Whole Slide Images for Pathology in DICOM
Proposed Studies to Support the Approval of Over-the-Counter (OTC) Home-Use HIV Test Kits Blood Products Advisory Committee March 10, 2006 Elliot P. Cowan,
EBM --- Journal Reading Presenter :呂宥達 Date : 2005/10/27.
Digital Imaging in Education and Distributed Pathology Practice
Issues in Validation of Whole Slide Digital Images for Frozen Section Interpretation Lewis A. Hassell, MD Pathology Visions October 2010.
Cap.org v. 1 Gynecologic Consensus Conference Working Group 2: Prospective and Retrospective Review June 4, 2011.
Onsite Quarterly Meeting SIPP PIPs June 13, 2012 Presenter: Christy Hormann, LMSW, CPHQ Project Leader-PIP Team.
Course: Research in Biomedicine and Health III Seminar 5: Critical assessment of evidence.
W. Scott Campbell, Ph.D., MBA University of Nebraska Medical Center
12/10/02Sacks - Clinical Assessment1 Clinical Assessment – Part II William Sacks, PhD, MD Clinical Assessment – Part II William Sacks, PhD, MD COMPUTERIZED.
Radiology Advisory Panel Meeting Radiology Advisory Panel Meeting Computer-Assisted Detection (CADe) Devices Joyce M. Whang Deputy Division Director Radiological.
Methodological Issues in Implantable Medical Device(IMDs) Studies Abdallah ABOUIHIA Senior Statistician, Medtronic.
Impact of Comorbidity on Chemotherapy Use and Outcomes in Solid Tumors: A Systematic Review Linda Lee, Winson Y. Cheung, Esther Atkinson, and Monika K.
Uses of Diagnostic Tests Screen (mammography for breast cancer) Diagnose (electrocardiogram for acute myocardial infarction) Grade (stage of cancer) Monitor.
Purpose of Epi Studies Discover factors associated with diseases, physical conditions and behaviors Identify the causal factors Show the efficacy of intervening.
Digital Pathology, An Introduction From glass slides to digital files and there after.
External Quality Assessment (Proficiency testing) in Diagnostic (Renal) Histopathology Professor Peter Furness Leicester UK.
Diagnostic Test Studies
W. Scott Campbell, MBA, PhD James R. Campbell, MD
Hematology and Pathology Devices Panel Meeting October 22-23, 2009
FDA Hematology and Pathology Devices Panel Meeting October 22-23, 2009
1 2 3 INDIAN INSTITUTE OF TECHNOLOGY ROORKEE PROJECT REPORT 2016
W. Scott Campbell, MBA, PhD University of Nebraska Medical Center
W. Scott Campbell, MBA, PhD University of Nebraska Medical Center
Medical Device Regulatory Essentials: An FDA Division of Cardiovascular Devices Perspective Bram Zuckerman, MD, FACC Director, FDA Division of Cardiovascular.
WSI-FS II: Validation of Squamous Cell Carcinoma Frozen Section Whole Slide Image Diagnosis in Surgical Pathology Vamsi Parimi MD1; Ryba Dominika, Ewa.
SPECIMEN SONOGRAM - Procedure
Crucial Statistical Caveats for Percutaneous Valve Trials
Digital Pathology Devices Panel Meeting
Presenters: Joel Saltz, Biomedical Informatics, Stony Brook University
Measuring Accuracy in Digital Pathology Ole Eichhorn Chief Technology Officer, Aperio FDA panel discussion, Oct 22-23, 2009.
Digital Pathology Devices Panel Meeting
Project Title Subtitle: make sure you specify it is a research project
Digital Pathology Devices Panel Meeting
Evidence Based Diagnosis
Presentation transcript:

Practical Issues on Clinical Validation of Digital Imaging Applications in Routine Surgical Pathology FDA Hematology and Pathology Devices Panel Meeting October 22-23, 2009 Tan Nguyen, MD, PhD, RAC FDA/CDRH/OIVD/DIHD-DCTD

2 Digitalization Not a Barrier to Pathologic Diagnosis Image-based telepathology having been in place for a number of years Availability of capable automated high-speed, high- resolution whole slide imaging technology (WSI) At issue: How can we demonstrate that pathologists can safely and effectively sign out routine surgical cases via WSI of H&E glass slides? –Compare with diagnoses made by light microscopy

3 Presentation Outline Quality of images –Image acquisition, image display Clinical performance study –Possible study designs –Selection of study participants –Case (specimen) selection –Establishing “reference” diagnosis –Evaluating diagnosis agreement –Other issues

4 Image Acquisition Optimal objective lens power for image scanning? –Digital magnification or magnification by interchangeable objective lenses? Single-focus plane or 3-D image enhancement? –Z-stacks needed for certain examinations (e.g., surgical margin, H. pylori, microcalcifications, nucleoli) Compression algorithm, user-selectable ratio? –Diagnosis made on uncompressed image or image retrieved from prior compressed image data file?

5 Image Display Viewing monitor –Standardized size, aspect ratio, display resolution (low, medium, high)? Viewing software –Image storage, retrieval, annotation Viewer functionality –“Thumbnail” view –Panning, zooming, side-by-side viewing of multiple images

6 Types of Possible Clinical Study Prospective study (“field study”)? –Replicating real-world surgical pathology practice –Minimizing case selection bias –Introducing multiple new sources of variation e.g., non-uniform specimen selection/suitability, variable quality of glass slides –Impractical? Resource constraints at each study site Possibly longer overall study duration

7 Types of Possible Clinical Study Retrospective study? –Ability to select archival cases to challenge (“stress test”) the competing diagnostic modalities –Possible to incorporate more case variation –Inherent case selection bias –Often employed in MRMC ROC studies* to assess diagnostic accuracy of radiologic imaging interpretations Large study to detect small differences in accuracy possible * Multiple-reader multiple-case receiver operating characteristic studies

8 MRMC ROC Paradigm Possible to adopt MRMC ROC paradigm? –Frequently used tool in diagnostic radiology More information per case, smaller sample sizes –Ability to compare accuracy of diagnostic modalities that rely on wide range of subjective interpretations by readers of varying skill degrees –Generalizable to similar readers and similar cases –Potentially complicated by multiple observations (diagnoses) in the same specimen

9 Selecting Study Participants Study participants –Spectrum of pathologists without formal specialty training to specialty experts or more homogeneous population? –Prior exposure to digital pathology Study locations –Community/academic practices, commercial laboratories Number of study participants? –Traditional MRMC ROC studies: readers; cases

10 Selecting a Balanced Set of Cases Adequate mix of biopsies to radical excisions –Broad spectrum of diagnostic complexity Not based on ease of diagnosis, typicality of appearance –Randomly or sequentially selected specimens –Anonymized archival or prospectively collected cases –Use of enriched samples for low-prevalence diseases? –Including all or only representative diagnostic part(s)? –How many cases? Statistical power against reader’s burden

11 Observer Variation Inherent subjectivity in interpretation thresholds –e.g., “atypia,” tumor grading, borderline or uncommon lesions –Paucity of lesional area; intra-lesional variation –Lack of clear diagnostic criteria –Non-quantitative nature of scoring (e.g., pleomorphism) –Subjective distinctions on a histologic continuum Broad spectrum of experience and confidence –Diagnostic “aggressiveness” or hedging in uncertainty

12 Reducing Observer Variation Strict adherence to diagnostic criteria and guidelines Use of pro forma histopathology reporting form Use of checklist standardized diagnostic lines Free-text diagnosis for diagnostic uncertainty? –Accommodating personal reporting style, judgment –Statistically problematic to evaluate Collapsed 2-tiered versus 3-tiered grading system? Circulating an annotated training set prior to study?

13 Establishing Light Microscopy “Reference” Diagnosis Diagnosis by expert or consensus panel? –Number of experts Consensus diagnosis by study participants? –Unanimous agreement or majority agreement? Allowing “acceptable” diagnosis? –Disagreement in opinion, but not “error” (i.e., no amendment necessary)? “Reference” diagnosis be abstraction of the primary diagnosis or all diagnostic lines?

14 Evaluating Diagnosis Agreement Primary diagnosis agreement only? –2 o diagnoses often posing no clinical impact –Unacceptable for a pathologist simply to make accurate diagnosis of malignancy! Line-by-line agreement (1 o and 2 o diagnoses)? –Ideal for collecting performance testing data –Unrealistic to expect high agreements without clearly defined diagnostic criteria for all lesions under inquiry –Incomplete agreement on 2 o diagnoses?

15 Evaluating Diagnosis Agreement “Major” versus “minor” discrepancy –Determined based on clinical impact or flat-out histopathologic error? Compound nevus versus junctional nevus; CIN II versus CIN III → flat-out error, but no difference in treatment Tumor on inked margin or within 1 mm of inked margin in breast biopsy → often a subjective call if specimen not adequately inked, but greatly affecting treatment decision –False-positive versus false-negative diagnosis? –Treated differently or equally in statistical evaluation?

16 Evaluating Diagnosis Agreement Panel’s “reference” diagnoses (light microscopy) Participants’ diagnoses by digital pathology Participants’ diagnoses by light microscopy R1R1 R2R2 R3R3

17 “Wash-out” Period E.g., a study involving the same pathologist reading: –½ cases: digital imaging followed by light microscopy –½ cases: light microscopy followed by digital imaging “Wash-out” period between digital imaging reading and light microscopy reading? –Easier said than done! –Not necessary, if desirable to know whether one modality, when seen first, resulting in improved agreement rate of the subsequent one?

18 Evaluating Diagnosis Agreement If significant disagreement between R1, R 2, and R 3 : –Case-sample variation –Intra- and interobserver variations –Variation intrinsic to each diagnostic modality Possible or need to tease out all variations? Or, account for effects of case and reader variations on accuracy of competing diagnostic modalities? –e.g., by MRMC statistical models; then comparing the overall accuracy

19 Other Issues Assuming valid performance data exist for one tissue type (e.g., breast pathology): –Can the test system be generalized and labeled for all other surgical pathology tissue types without the need for further validations ? –Can it be generalized and labeled for intraoperative (frozen section) diagnosis and telepathology? –If not, how should the label explicitly state the test system’s limitations?

20 Other Issues Generalizing performance of WSI of H&E glass slides to non-H&E-stained glass slides? Required training of pathologists prior to using WSI? –What type of training? Need for post-marketing study for additional safety and effectiveness data? –How to conduct such study? –What data to collect?