Timothy Wiemken, PhD MPH Assistant Professor Division of Infectious Diseases Diagnostic Tests.

Slides:



Advertisements
Similar presentations
DiseaseNo disease 60 people with disease 40 people without disease Total population = 100.
Advertisements

Validity and Reliability of Analytical Tests. Analytical Tests include both: Screening Tests Diagnostic Tests.
Power and sample size.
Lecture 3 Validity of screening and diagnostic tests
Chapter 4 Pattern Recognition Concepts: Introduction & ROC Analysis.
Evaluation of segmentation. Example Reference standard & segmentation.
Improved Reflexive Testing Algorithm for Hepatitis C Infection Using Signal-to-Cutoff Ratios of a Hepatitis C Virus Antibody Assay K.K.Y. Lai, M. Jin,
Receiver Operating Characteristic (ROC) Curves
College of Medicine, KSU Medical education Department Pathology Department Medical Biochemistry Unit GIT Block (2 nd Year) Integrated Practical (Biochemistry.
Sensitivity, Specificity and ROC Curve Analysis.
What Happens to the Performance of a Diagnostic Test when the Disease Prevalence and the Cut-Point Change? Pathological scores Healthy scores Healthy population.
STUDY DESIGN FOR DIAGNOSTIC STUDY FOR MELIOIDOSIS Dr Direk Limmathurotsakul, MD MSc PhD.
Tim Wiemken, PhD MPH CIC Assistant Professor Division of Infectious Diseases University of Louisville, Kentucky Planning Your Study Statistical Issues.
CRITICAL APPRAISAL Dr. Cristina Ana Stoian Resident Journal Club
GerstmanChapter 41 Epidemiology Kept Simple Chapter 4 Screening for Disease.
Epidemiology in Medicine Sandra Rodriguez Internal Medicine TTUHSC.
Performance measures Morten Nielsen, CBS, BioCentrum, DTU.
Statistical Fridays J C Horrow, MD, MSSTAT
Jeremy Wyatt Thanks to Gavin Brown
ROC & AUC, LIFT ד"ר אבי רוזנפלד.
Lucila Ohno-Machado An introduction to calibration and discrimination methods HST951 Medical Decision Support Harvard Medical School Massachusetts Institute.
Evidence Based Medicine for the Athletic Trainer: What is It?
Screening revision! By Ilona Blee. What are some UK Screening programmes?  Antenatal & newborn screening  Newborn Blood Spot  Newborn Hearing Screening.
(Medical) Diagnostic Testing. The situation Patient presents with symptoms, and is suspected of having some disease. Patient either has the disease or.
What is Screening? Basic Public Health Concepts Sheila West, Ph.D. El Maghraby Professor of Ophthalmology Wilmer Eye Institute Johns Hopkins University.
DATASET INTRODUCTION 1. Dataset: Urine 2 From Cleveland Clinic
Statistics in Screening/Diagnosis
BASIC STATISTICS: AN OXYMORON? (With a little EPI thrown in…) URVASHI VAID MD, MS AUG 2012.
Chapter 4 Pattern Recognition Concepts continued.
Basic statistics 11/09/13.
Diagnostic Testing Ethan Cowan, MD, MS Department of Emergency Medicine Jacobi Medical Center Department of Epidemiology and Population Health Albert Einstein.
Division of Population Health Sciences Royal College of Surgeons in Ireland Coláiste Ríoga na Máinleá in Éirinn Indices of Performances of CPRs Nicola.
Sensitivity Sensitivity answers the following question: If a person has a disease, how often will the test be positive (true positive rate)? i.e.: if the.
Statistics. Statistics = everything you need to know, but continually want to forget!!!!
Causation ? Tim Wiemken, PhD MPH CIC Assistant Professor Division of Infectious Diseases University of Louisville, Kentucky.
Sensitivity & Specificity Sam Thomson 8/12/10. Sensitivity Proportion of people with the condition who have a positive test result Proportion of people.
MEASURES OF TEST ACCURACY AND ASSOCIATIONS DR ODIFE, U.B SR, EDM DIVISION.
Appraising A Diagnostic Test
Statistical test for Non continuous variables. Dr L.M.M. Nunn.
Likelihood 2005/5/22. Likelihood  probability I am likelihood I am probability.
Evaluating Results of Learning Blaž Zupan
Screening of diseases Dr Zhian S Ramzi Screening 1 Dr. Zhian S Ramzi.
Screening and its Useful Tools Thomas Songer, PhD Basic Epidemiology South Asian Cardiovascular Research Methodology Workshop.
Positive Predictive Value and Negative Predictive Value
1 Wrap up SCREENING TESTS. 2 Screening test The basic tool of a screening program easy to use, rapid and inexpensive. 1.2.
Predictive values prevalence CK and acute myocardial infarction –sensitivity 70% –specificity 80% –prevalence - 40% –prevalence - 20% –PPV and NPV.
Diagnostic Tests Studies 87/3/2 “How to read a paper” workshop Kamran Yazdani, MD MPH.
Diagnostic Test Characteristics: What does this result mean
10 May Understanding diagnostic tests Evan Sergeant AusVet Animal Health Services.
Evaluating Classification Performance
Professor William H. Press, Department of Computer Science, the University of Texas at Austin1 Opinionated in Statistics by Bill Press Lessons #50 Binary.
Evaluation of Diagnostic Tests & ROC Curve Analysis PhD Özgür Tosun.
ROC curve estimation. Index Introduction to ROC ROC curve Area under ROC curve Visualization using ROC curve.
SCREENING FOR DISEASE. Learning Objectives Definition of screening; Principles of Screening.
Biostatistics Board Review Parul Chaudhri, DO Family Medicine Faculty Development Fellow, UPMC St Margaret March 5, 2016.
CHAPTER 3 Key Principles of Statistical Inference.
Critical Appraisal Course for Emergency Medicine Trainees Module 5 Evaluation of a Diagnostic Test.
GP ST2 Group, 28/9/11 Tom Gamble
Performance of a diagnostic test Tunisia, 31 Oct 2014
Sensitivity and Specificity
Class session 7 Screening, validity, reliability
بسم الله الرحمن الرحيم Clinical Epidemiology
What is Screening? Basic Public Health Concepts Sheila West, Ph.D.
کاربرد آمار در آزمایشگاه
Machine Learning Week 10.
Diagnosis II Dr. Brent E. Faught, Ph.D. Assistant Professor
What is Screening? Basic Public Health Concepts Sheila West, Ph.D.
Figure 1. Table for calculating the accuracy of a diagnostic test.
Patricia Butterfield & Naomi Chaytor October 18th, 2017
Investigation of the ability of the Pneumonia Severity Index to accurately predict clinically relevant outcomes: a European study  B. Renaud, E. Coma,
Presentation transcript:

Timothy Wiemken, PhD MPH Assistant Professor Division of Infectious Diseases Diagnostic Tests

Overview 1. Understand the calculation of measures of diagnostic accuracy 2. Understand the clinical utility of measures of diagnostic accuracy 3. Review concepts related to receiver operating characteristic curves

Question If you ask, “how many patients do I need for my study?” and your grant is due tomorrow, what am I going to tell you? a.3 b.Leave me alone c.Make stuff up d.%^$%# e.B and D

Overview 1. Understand the calculation of measures of diagnostic accuracy 2. Understand the clinical utility of measures of diagnostic accuracy 3. Review concepts related to receiver operating characteristic curves

Calculations Example Researchers develop a new test to detect HIV-1 from the blood based on nanotechnology.

Calculations Example How good is this new test? Researchers develop a new test to detect HIV-1 from the blood based on nanotechnology.

Calculations Any new test is compared to a gold standard test to calculate diagnostic accuracy statistics

Calculations Any new test is compared to a gold standard test to calculate diagnostic accuracy statistics 1.Run both tests on a set number of specimens (e.g. gold standard and new test)

Calculations Example What is the gold standard for HIV-1 detection?

Calculations Example What is the gold standard for HIV-1 detection? PCR HIV RNA detection

Calculations Any new test is compared to a gold standard test to calculate diagnostic accuracy statistics 1.Run both tests on a set number of specimens (e.g. gold standard and new test) 2. Compile results into a contingency table

Calculations Any new test is compared to a gold standard test to calculate diagnostic accuracy statistics 1.Run both tests on a set number of specimens (e.g. gold standard and new test) 2. Compile results into a contingency table What the heck is a contingency table?

Calculations Test +Test - Test + Test - 2x2 contingency tables are the backbone of statistics

Calculations Gold Standard Test Result Test +Test - New Test Result Test + Test - This is the layout for calculating diagnostic accuracy statistics

Calculations Gold Standard Test Result Test +Test - New Test Result Test + Test - Each cell represents a specific result.

Calculations Gold Standard Test Result Test +Test - New Test Result Test +TP Test - Each cell represents a specific result. TP: True Positive Both tests agree that the result is positive

Calculations Gold Standard Test Result Test +Test - New Test Result Test +TP Test -TN Each cell represents a specific result. TN: True Negative Both tests agree that the result is negative

Calculations Gold Standard Test Result Test +Test - New Test Result Test +TP Test -FNTN Each cell represents a specific result. FN: False Negative Gold standard says positive, new test says negative

Calculations Gold Standard Test Result Test +Test - New Test Result Test +TPFP Test -FNTN Each cell represents a specific result. FP: False Positive Gold standard says negative, new test says positive

Calculations Gold Standard Test Result Test +Test - New Test Result Test +TPFP Test -FNTN Each cell represents a specific result. Always believe the gold standard. However, it may be wrong. This is a limitation of all diagnostic accuracy statistics.

Calculations Gold Standard Test Result Test +Test - New Test Result Test +TPFP Test -FNTN Each cell represents a specific result. Obviously you always want a lot of true results and very few false results (if any).

Calculations Gold Standard Test Result Test +Test - New Test Result Test +TPFP Test -FNTN Each cell represents a specific result. If this is the case, the new test works just as well as the gold standard.

Calculations Gold Standard Test Result Test +Test - New Test Result Test Test -525 Say these are the data we get after running the two HIV-1 detection tests

Calculations Gold Standard Test Result Test +Test - New Test Result Test Test : Sensitivity Sensitivity is the proportion of people with the disease (as measured by the gold standard) that test positive on the new test.

Calculations Gold Standard Test Result Test +Test - New Test Result Test +25 (TP)45 (FP) Test -5 (FN)25 (TN) 1: Sensitivity Formula is (TP / TP+FN) x 100

Calculations Gold Standard Test Result Test +Test - New Test Result Test +25 (TP)45 (FP) Test -5 (FN)25 (TN) 1: Sensitivity Sensitivity = 25 / (5+25) x 100 = 83%

Calculations 1: Sensitivity Sensitivity = 25 / (5+25) x 100 = 83% What does this mean?

Calculations 1: Sensitivity Sensitivity = 25 / (5+25) x 100 = 83% Of all patients with HIV-1, 83% of them will test positive with the new test. 17% will be false negative (100% - 83%)

Calculations Gold Standard Test Result Test +Test - New Test Result Test Test : Specificity Specificity is the proportion of people without the disease (as measured by the gold standard) that test negative on the new test.

Calculations Gold Standard Test Result Test +Test - New Test Result Test +25 (TP)45 (FP) Test -5 (FN)25 (TN) 2: Specificity Formula is (TN / TN+FP) x 100

Calculations Gold Standard Test Result Test +Test - New Test Result Test +25 (TP)45 (FP) Test -5 (FN)25 (TN) 2: Specificity Specificity = 25 / (25+45) x 100 = 36%

Calculations 2: Specificity What does this mean? Specificity = 25 / (25+45) x 100 = 36%

Calculations 2: Specificity Of all patients without HIV-1, 36% will test negative with the new test. Specificity = 25 / (25+45) x 100 = 36% 64% will be False Positive (100% - 36%)

Calculations Gold Standard Test Result Test +Test - New Test Result Test +25 (TP)45 (FP) Test -5 (FN)25 (TN) 3: Positive Predictive Value (PPV) PPV is the proportion of people that test positive with the new test of everyone that tests positive.

Calculations Gold Standard Test Result Test +Test - New Test Result Test +25 (TP)45 (FP) Test -5 (FN)25 (TN) 3: Positive Predictive Value (PPV) Formula is (TP / TP+FP) x 100

Calculations Gold Standard Test Result Test +Test - New Test Result Test +25 (TP)45 (FP) Test -5 (FN)25 (TN) 3: Positive Predictive Value (PPV) PPV = 25 / (25+45) x 100 = 36%

Calculations 3: PPV Of everyone who tests positive with the new test, 36% will actually have HIV-1. PPV = 25 / (25+45) x 100 = 36% 64% (100%-36%) Will be False Positives (test positive but do not have the disease)

Calculations Gold Standard Test Result Test +Test - New Test Result Test +25 (TP)45 (FP) Test -5 (FN)25 (TN) 4: Negative Predictive Value (NPV) NPV is the proportion of people that test negative with the new test of everyone that tests negative.

Calculations Gold Standard Test Result Test +Test - New Test Result Test +25 (TP)45 (FP) Test -5 (FN)25 (TN) 4: Negative Predictive Value (NPV) Formula is (TN / TN+FN) x 100

Calculations Gold Standard Test Result Test +Test - New Test Result Test +25 (TP)45 (FP) Test -5 (FN)25 (TN) 4: Negative Predictive Value (NPV) PPV = 25 / (25+5) x 100 = 83%

Calculations 4: NPV Of everyone who tests negative with the new test, 83% will not have HIV-1. NPV = 25 / (25+5) x 100 = 83% 13% (100%-83%) Will be False Negative (test negative but have the disease)

Overview 1. Understand the calculation of measures of diagnostic accuracy 2. Understand the clinical utility of measures of diagnostic accuracy 3. Review concepts related to receiver operating characteristic curves

False Positives and Negatives FP and FN are critically important No test is perfect. Must understand the ramifications of each

False Positives and Negatives FP and FN are critically important Unnecessary treatment (costs, side effects) Anxiety Social stigma Outbreak investigations Unnecessary interventions False Positives

False Positives and Negatives FP and FN are critically important Disease transmission (between and within) Delayed treatment initiation Poor outcomes False Negatives

False Positives and Negatives FP and FN are critically important Which one is more important depends on the situation and disease

Using Statistics in Real Life Sn, Sp, PPV, NPV Remember Sn, Sp tell you the proportion of people who test positive with the new test out of everyone with the disease.

Using Statistics in Real Life Sn, Sp, PPV, NPV Remember Sn, Sp tell you the proportion of people who test positive with the new test out of everyone with the disease. When do you know everyone who has the disease in your practice?

Using Statistics in Real Life SnOut and SpIn Sensitivity and Specificity can still be clinically useful. vs

Using Statistics in Real Life It all comes back to false positives and false negatives Gold Standard Test Result Test +Test - New Test Result Test +25 (TP)45 (FP) Test -5 (FN)25 (TN) Sensitivity (Sn) is concerned with FN TP / (TP + FN) Specificity (Sp) is concerned with FP TN / (TN + FP) SnOut and SpIn

Using Statistics in Real Life SnOut (Sn=sensitivity) How does one get a HIGH SENSITIVITY? Gold Standard Test Result Test +Test - New Test Result Test +25 (TP)45 (FP) Test -5 (FN)25 (TN) Must have VERY FEW False Negatives Few false negatives provides a number closer to 1 (100%) TP / (TP + FN)

Using Statistics in Real Life SnOut A test with a high sensitivity (sn) must have very few false negatives.

Using Statistics in Real Life SnOut A test with a high sensitivity (sn) must have very few false negatives. Therefore, if a highly sensitive test gives you a negative result, it must be a TRUE NEGATIVE (there are few false negatives).

Using Statistics in Real Life SnOut A negative result for a highly sensitive test helps you rule out the disease (SnOut) Oink!

Using Statistics in Real Life SpIn How does one get a HIGH SPECIFICITY? Gold Standard Test Result Test +Test - New Test Result Test +25 (TP)45 (FP) Test -5 (FN)25 (TN) Must have VERY FEW False Positives Few false positives provides a number closer to 1 (100%) TN / (TN + FP)

Using Statistics in Real Life SpIn A test with a high sensitivity (sp) must have very few false positives. Therefore, if a highly specific test gives you a positive result, it must be a TRUE POSITIVE (there are few false positives).

Using Statistics in Real Life SpIn A positive result for a highly specific test helps you rule in the disease (SpIn) I have nothing clever to say!

Using Statistics in Real Life Issues with PPV, NPV These statistics vary with the prevalence of the disease in your population

Using Statistics in Real Life Issues with PPV, NPV For example, if you look at PPV and NPV of a tuberculosis test, it will vary significantly if you are testing people in a homeless shelter versus employees at the University of Louisville.

Using Statistics in Real Life Issues with PPV, NPV Sensitivity and Specificity do not suffer from this fault – these values are “inherent to the test” and do not change.

Using Statistics in Real Life HIV-1 Rapid Test Gold Standard Test Result Test +Test - New Test Result Test +20 (TP)33 (FP) Test -10 (FN)37 (TN) Bathhouse: 30% Prevalence Gold Standard Test Result Test +Test - New Test Result Test +10 (TP)40 (FP) Test -5 (FN)45 (TN) CTRSC Course: 15% Prevalence Calculate Sensitivity and Specificity

Using Statistics in Real Life HIV-1 Rapid Test Gold Standard Test Result Test +Test - New Test Result Test +20 (TP)33 (FP) Test -10 (FN)37 (TN) Bathhouse: 30% Prevalence Gold Standard Test Result Test +Test - New Test Result Test +10 (TP)40 (FP) Test -5 (FN)45 (TN) CTRSC Course: 15% Prevalence Sensitivity = 67%, Specificity = 53%

Using Statistics in Real Life HIV-1 Rapid Test Gold Standard Test Result Test +Test - New Test Result Test +20 (TP)33 (FP) Test -10 (FN)37 (TN) Bathhouse: 30% Prevalence Gold Standard Test Result Test +Test - New Test Result Test +10 (TP)40 (FP) Test -5 (FN)45 (TN) CTRSC Course: 15% Prevalence Calculate PPV and NPV

Using Statistics in Real Life HIV-1 Rapid Test Gold Standard Test Result Test +Test - New Test Result Test +20 (TP)33 (FP) Test -10 (FN)37 (TN) Bathhouse: 30% Prevalence Gold Standard Test Result Test +Test - New Test Result Test +10 (TP)40 (FP) Test -5 (FN)45 (TN) CTRSC Course: 15% Prevalence PPV= 38%, NPV= 79%PPV= 20%, NPV= 90%

Using Statistics in Real Life Relationship between NPV, PPV, and Prevalence

Overview 1. Understand the calculation of measures of diagnostic accuracy 2. Understand the clinical utility of measures of diagnostic accuracy 3. Review concepts related to receiver operating characteristic curves

Receiver-Operating Characteristic Some tests have continuous data instead of yes/no Examples Pneumonia Severity Index (PSI) APACHE Scores Glasgow Coma Scale Sequential Organ Failure Assessment Score (SOFA) Simplified Acute Physiology Score (SAPS) Clinical Pulmonary Infection Score (CPIS)

Receiver-Operating Characteristic Some tests have continuous data instead of yes/no How do we identify cutoffs of values that give us the best sensitivity and specificity? E.g. what level of the PSI is best to use as a cutoff to predict mortality in a pneumonia patient?

Receiver-Operating Characteristic ROC Curves

Receiver-Operating Characteristic ROC Curves: Evaluation At a Sensitivity= 90% Expect 55% false positives e.g. Specificity = 45% (False Positive Rate)

Receiver-Operating Characteristic ROC Curves Area Under the Curve (AUC) = how good the test is More area = High Sensitivity and Specificity

Receiver-Operating Characteristic ROC Curves 45 degree angle = 50/50 guess

Receiver-Operating Characteristic Comparing Tests via ROC Curves

Receiver-Operating Characteristic Comparing Tests via ROC Curves

Receiver-Operating Characteristic Comparing Tests via ROC Curves

Overview 1. Understand the calculation of measures of diagnostic accuracy 2. Understand the clinical utility of measures of diagnostic accuracy 3. Review concepts related to receiver operating characteristic curves