Applying the GRADE approach to diagnostic technologies Holger Schünemann for the GRADE working group.

Slides:



Advertisements
Similar presentations
Agency for Healthcare Research and Quality (AHRQ)
Advertisements

Grading the Strength of a Body of Evidence on Diagnostic Tests Prepared for: The Agency for Healthcare Research and Quality (AHRQ) Training Modules for.
Critically Evaluating the Evidence: diagnosis, prognosis, and screening Elizabeth Crabtree, MPH, PhD (c) Director of Evidence-Based Practice, Quality Management.
Azita Kheiltash Social Medicine Specialist Tehran University of Medical Sciences Diagnostic Tests Evaluation.
Critically Evaluating the Evidence: Tools for Appraisal Elizabeth A. Crabtree, MPH, PhD (c) Director of Evidence-Based Practice, Quality Management Assistant.
Summarising findings about the likely impacts of options Judgements about the quality of evidence Preparing summary of findings tables Plain language summaries.
Evaluation of Diagnostic Test Studies
Grading of Recommendations Assessment, Development and Evaluation (GRADE) Methodology.
CRITICAL APPRAISAL Dr. Cristina Ana Stoian Resident Journal Club
1 Risks and Benefits of Home-Use HIV Test Kits Richard Forshee, Ph.D. U.S. Food and Drug Administration Center for Biologics Evaluation and Research Office.
Journal Club Alcohol, Other Drugs, and Health: Current Evidence January–February 2011.
Journal Club Alcohol and Health: Current Evidence September–October 2004.
Clinical Trials Hanyan Yang
Journal Club Alcohol and Health: Current Evidence January–February 2007.
Statistics for Health Care
By Dr. Ahmed Mostafa Assist. Prof. of anesthesia & I.C.U. Evidence-based medicine.
Chronic diseases 1.Chronic diseases have long and variable preclinical phases. 2.The preclinical phase is that portion of the disease natural history during.
P H Y S I C I A N S ’ A C A D E M Y F O R C A R D I O V A S C U L A R E D U C A T I O N Oral drugs for type 2 diabetes and all cause mortality in General.
The Bahrain Branch of the UK Cochrane Centre In Collaboration with Reyada Training & Management Consultancy, Dubai-UAE Cochrane Collaboration and Systematic.
Exercise Echocardiography Cardiac Issues 2011 Douglass A Morrison, MD, PhD.
Grading of Recommendations Assessment, Development, and Evaluation (GRADE) Working Group
BASIC STATISTICS: AN OXYMORON? (With a little EPI thrown in…) URVASHI VAID MD, MS AUG 2012.
Multiple Choice Questions for discussion
Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 14 Screening and Prevention of Illnesses and Injuries: Research Methods.
EPIB-591 Screening Jean-François Boivin 29 September
DEB BYNUM, MD AUGUST 2010 Evidence Based Medicine: Review of the basics.
FRAMING RESEARCH QUESTIONS The PICO Strategy. PICO P: Population of interest I: Intervention C: Control O: Outcome.
Statistics for Health Care Biostatistics. Phases of a Full Clinical Trial Phase I – the trial takes place after the development of a therapy and is designed.
GRADE example application of Jan Brożek. My potential conflicts of interest GRADE working group Cochrane Collaboration.
Placebo-Controls in Short-Term Clinical Trials of Hypertension Sana Al-Khatib, MD, MHS Assistant Professor of Medicine Division of Cardiology Duke University.
Hypertension: Blood Pressure Measurement and the new NICE guideline Prof Richard McManus BHS Annual Meeting Cambridge 2011 NICE clinical guideline 127.
EBCP. Random vs Systemic error Random error: errors in measurement that lead to measured values being inconsistent when repeated measures are taken. Ie:
EVIDENCE ABOUT DIAGNOSTIC TESTS Min H. Huang, PT, PhD, NCS.
Plan GRADE backgroundGRADE background confidence in estimates (quality of evidence)confidence in estimates (quality of evidence) evidence profilesevidence.
1 SCREENING. 2 Why screen? Who wants to screen? n Doctors n Labs n Hospitals n Drug companies n Public n Who doesn’t ?
Critical Appraisal Did the study address a clearly focused question? Did the study address a clearly focused question? Was the assignment of patients.
CHP400: Community Health Program-lI Mohamed M. B. Alnoor Muna M H Diab SCREENING.
Appraising A Diagnostic Test
Evidence-Based Medicine Diagnosis Component 2 / Unit 5 1 Health IT Workforce Curriculum Version 1.0 /Fall 2010.
Screening of diseases Dr Zhian S Ramzi Screening 1 Dr. Zhian S Ramzi.
Screening and its Useful Tools Thomas Songer, PhD Basic Epidemiology South Asian Cardiovascular Research Methodology Workshop.
WHO GUIDANCE FOR THE DEVELOPMENT OF EVIDENCE-BASED VACCINE RELATED RECOMMENDATIONS August 2011.
Diagnostic Tests Studies 87/3/2 “How to read a paper” workshop Kamran Yazdani, MD MPH.
SCH Journal Club Use of time from fever onset improves the diagnostic accuracy of C-reactive protein in identifying bacterial infections Wednesday 13 th.
Unit 15: Screening. Unit 15 Learning Objectives: 1.Understand the role of screening in the secondary prevention of disease. 2.Recognize the characteristics.
EBM --- Journal Reading Presenter :呂宥達 Date : 2005/10/27.
Is the conscientious explicit and judicious use of current best evidence in making decision about the care of the individual patient (Dr. David Sackett)
EVALUATING u After retrieving the literature, you have to evaluate or critically appraise the evidence for its validity and applicability to your patient.
European Patients’ Academy on Therapeutic Innovation The Purpose and Fundamentals of Statistics in Clinical Trials.
GDG Meeting Wednesday November 9, :30 – 11:30 am.
بسم الله الرحمن الرحیم.
EBM --- Journal Reading Presenter :林禹君 Date : 2005/10/26.
EBM --- Journal Reading Presenter :黃美琴 Date : 2005/10/27.
Developing your research question Fiona Alderdice and Mike Clarke.
Article Title Resident Name, MD SVCH6/13/2016 Journal Club.
© 2010 Jones and Bartlett Publishers, LLC. Chapter 12 Clinical Epidemiology.
Screening Tests: A Review. Learning Objectives: 1.Understand the role of screening in the secondary prevention of disease. 2.Recognize the characteristics.
Clinical Epidemiology
Sensitivity and Specificity
for Overall Prognosis Workshop Cochrane Colloquium, Seoul
Why this talk? you will be seeing a lot of GRADE
Conflicts of interest Major role in development of GRADE
The European Society of Cardiology Presented by Dr. Bo Lagerqvist
Overview of the GRADE approach – selected slides
Plan GRADE background two steps evidence profiles
Statistical significance using p-value
Interpreting Epidemiologic Results.
Evidence Based Diagnosis
Medical Statistics Exam Technique and Coaching, Part 2 Richard Kay Statistical Consultant RK Statistics Ltd 22/09/2019.
Presentation transcript:

Applying the GRADE approach to diagnostic technologies Holger Schünemann for the GRADE working group

Today’s talk 1) Focus on patient important outcomes 2) Factors influencing the quality of evidence in diagnostic recommendations

GRADE for diagnosis Shares the fundamental logic of assessment for treatment However, assessments present unique challenges Examples and solutions for how to deal with challenges Focus on importance to patients and consumers!

Summary Quality of evidence reflects our confidence that estimates of benefits and downsides from a diagnostic strategy generated from research are correct. Consideration of the directness of evidence is based on how confident we are of the relation between being classified correctly (as a true positive or negative) or incorrectly (as a false positive or negative) and patient- important consequences. Recommendation depends on the balance between desirable and undesirable effects of the diagnostic test or strategy in terms of patient-important outcomes.

Testing makes a variety of contributions to patient care Clinicians use tests that are usually referred to as “diagnostic” signs and symptoms, imaging, biochemistry, pathology, and psychological Some tests naturally report positive and negative results (pregnancy) Other tests report their results in categories (e.g. imaging) Today we assume a diagnostic approach that ultimately categorizes test results as positive or negative

Purpose of a test Triage to minimize use of an invasive or expensive test Add-on to improve diagnosis beyond what is already done Replacement to replace test that is harmful or costly Bossuyt et al. BMJ 2006

What is the sensitivity, specificity (accuracy), or likelihood ratios of multislice spiral computed tomography (CT) of coronary arteries compared with conventional invasive angiography?

Test accuracy is a surrogate for patient important outcomes When clinicians think about diagnostic tests, they focus on their accuracy Underlying assumption: obtaining a better idea of whether a target condition is present or absent will result in superior patient management and improved outcome.

Test and treatment threshold Diagnostic tests are used in patients suspected of disease to exclude or confirm a diagnosis.

Purpose of the test  Identify the limitations for which the (new or) alternative test offers a putative remedy eliminating a high proportion of false positive or negative results, enhancing availability, decreasing invasiveness, or decreasing cost  Identification of sensible clinical questions: patients, diagnostic intervention, comparison, management, and outcomes of interest Bossuyt et al. BMJ 2006

Sensible clinical question Population: In patients suspected of coronary artery disease Intervention: does multislice spiral computed tomography (CT) of coronary arteries Comparison: compared with conventional invasive coronary angiography Outcomes: lower complications with acceptable rates of false negatives (associated with coronary events) and false positives (leading to unnecessary treatment and complications)?

Study designs for diagnosis If a test fails to improve important outcomes: no reason to use it, whatever its accuracy Best way to assess diagnostic strategy: randomized controlled trial in which investigators randomize patients to different diagnostic strategies

Study designs Focus on: mortality, morbidity, symptoms, and quality of life GRADE approach for treatment or intervention

Study designs II

Patient benefit required Inference from accuracy data that a diagnostic test or strategy improves patient-important outcome requires availability of effective management strategies Others: Reduction in test-related adverse effects Exclusion of a disease and reduction in anxiety Confirming a diagnosis improves patient well-being from the prognostic information it imparts

Patient benefit required Genetic testing for Huntington’s chorea reassurance that a patient will not suffer from the condition ability to plan for future knowing that patient will sadly fall victim Ability to plan is analogous to an effective treatment Almqvist et al Clin Gen 2003

Balance between presumed patient outcomes, complications and cost: Less complications and downsides compared to IVP would support the new test’s usefulness, but the balance between desirable and undesirable effect not clear in view of the uncertain consequences of identifying smaller stones.

Today’s talk 1) Focus on patient important outcomes 2) Factors influencing the quality of evidence in diagnostic recommendations

Factors that decrease the quality of evidence (and how they differ from treatment approach) Study design Different quality criteria for accuracy studies Valid accuracy studies: Diagnostic uncertainty Consecutive patients Evaluators should be blinded

Factors that decrease the quality of evidence Indirectness Population, Test and Comparison Outcomes Similar quality criteria Usual absence of direct evidence about impact on patient-important outcomes Accuracy studies typically provide low quality evidence

Factors that decrease the quality of evidence Inconsistency in study results Similar quality criteria & judgments but: other measures Imprecise evidence Similar quality criteria & judgments, but: Wide confidence intervals for estimates of test accuracy, true and false positives and negatives Reporting bias Similar quality criteria & judgments

Balance between presumed patient outcomes, complications and cost: Avoiding the undesirable consequences of more false positives with CT is preferable to avoiding the higher rate of complications (infarction and death) and higher cost with angiography

Evaluating tests – when can comparative evidence of test accuracy and other intermediate outcomes be used as an alternative to randomized trials (Lord, Irwig, Bossuyt)

Terminology Directness = PICO Synonyms: Generalizability, Applicability Directness of outcomes: Linkage assumptions Explicit assumptions about management of TP, TN, FP, FN, complications, indeterminates Treatment effects on: Mortality, morbidity, HRQL (including psychological well being) 25

Summary of findings

28 Outcomes Illustrative Risks (95% CI) Number of participants (studies) Quality of the Evidence 1 Comments Assumed outcome with CT – prevalence of 20% True positives (Patients correctly classified as having coronary artery disease) 192 per (21)   Moderate 2 Benefit from treatment and fewer complications.* Some patients will have to undergo angiography. True negatives (Patients correctly classified as not having coronary artery disease) 592 per (21)   Moderate 2 Benefit from reassurance and fewer complications False positives (Patients incorrectly classified as having coronary artery disease) 208 per (21)   Moderate 2 Harm from unnecessary treatment False negatives (Patients incorrectly classified as not having coronary artery disease) 8 per (21)   Low 2, 3 Detriment from delayed diagnosis or myocardial insult Complications (MI, allergic reactions, renal failure) 99 per (21)   Low 2 There is a higher rate of rare complications (infarction and death) and higher cost with angiography – a full profile would be required. Resource use* (cost of CT and Angiography) See comment Cost are higher for angiography, 1- Quality rated from 1 (very low quality) to 4 (high quality), 2- Cross sectional studies. Indirectness of outcomes in a wide spectrum of patients and indirect comparison of tests, 3– there is greater uncertainty whether these patients will have negative outcomes. *Assumed efficacy of: 1) aspirin daily = 20% RRR; 2) beta-blockage = 18% RRR.

Summary (1) Quality of evidence reflects our confidence that estimates of benefits and downsides from a diagnostic strategy generated from research are correct.

Summary (2) Consideration of the directness of evidence is based on how confident we are of the relation between being classified correctly (as a true positive or negative) or incorrectly (as a false positive or negative) and patient-important consequences.

Summary Recommendation depends on the balance between desirable and undesirable effects of the diagnostic test or strategy in terms of patient-important outcomes.

Thanks to GRADE group Contributors: Andrew D Oxman, Jan Brozek, Paul Glasziou, Roman Jaeschke, Gunn E Vist, Regina Kunz, John Williams, Jonathan Craig, Victor Montori, Patrick Bossuyt, Gordon H Guyatt

Eskerrik asko A USTEDES!

GRADE and diagnostic tests Diagnostic tests are also performed in patients already known to have the disease. Purpose may be to classify (e.g. stage or grade) or monitor the disease to guide treatment or inform prognosis. Being clear and specific about the purpose of a diagnostic test or strategy is crucial to making a proper recommendation. Diagnosis alone does not improve patient important outcomes.

Balance between presumed patient outcomes, complications and cost: Less complications and downsides compared to IVP would support the new test’s usefulness, but the balance between desirable and undesirable effect not clear in view of the uncertain consequences of identifying smaller stones.