Assessing Information from Multilevel and Continuous Tests Likelihood Ratios for results other than “+” or “-” Tom Newman (based on previous lectures by.

Slides:



Advertisements
Similar presentations
2) Multilevel Tests (Michael) Likelihood ratios for results other than + or -
Advertisements

Studying a Study and Testing a Test: Sensitivity Training, “Don’t Make a Good Test Bad”, and “Analyze This” Borrowed Liberally from Riegelman and Hirsch,
Logistic Regression.
Assessing Information from Multilevel (Ordinal) and Continuous Tests ROC curves and Likelihood Ratios for results other than “+” or “-” Michael A. Kohn,
“Diagnostic value of procalcitonin in well appearing young febrile infants” Pediatrics 2012; 130:
Probability and Statistics Dr. Saeid Moloudzadeh Frequency table and graphs 1 Contents Descriptive Statistics Axioms of Probability Combinatorial.
Estimation of Sample Size
CS 8751 ML & KDDEvaluating Hypotheses1 Sample error, true error Confidence intervals for observed hypothesis error Estimators Binomial distribution, Normal.
Statistics for Health Care
Lucila Ohno-Machado An introduction to calibration and discrimination methods HST951 Medical Decision Support Harvard Medical School Massachusetts Institute.
Today Concepts underlying inferential statistics
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview of Lecture Between Group & Within Subjects Designs Mann-Whitney Test.
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Regression and Correlation
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
AP Statistics Section 13.1 A. Which of two popular drugs, Lipitor or Pravachol, helps lower bad cholesterol more? 4000 people with heart disease were.
David Yens, Ph.D. NYCOM PASW-SPSS STATISTICS David P. Yens, Ph.D. New York College of Osteopathic Medicine, NYIT l PRESENTATION.
Statistics in Screening/Diagnosis
Multiple Choice Questions for discussion
Medical decision making. 2 Predictive values 57-years old, Weight loss, Numbness, Mild fewer What is the probability of low back cancer? Base on demographic.
Statistics for clinical research An introductory course.
Stats Tutorial. Is My Coin Fair? Assume it is no different from others (null hypothesis) When will you no longer accept this assumption?
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Studies of Medical Tests Thomas B. Newman, MD, MPH September 9, 2008.
Division of Population Health Sciences Royal College of Surgeons in Ireland Coláiste Ríoga na Máinleá in Éirinn Indices of Performances of CPRs Nicola.
Review of Statistical Models and Linear Regression Concepts STAT E-150 Statistical Methods.
Representing Motion 2 In this chapter you will:
9 Mar 2007 EMBnet Course – Introduction to Statistics for Biologists Nonparametric tests, Bootstrapping
Dichotomous Tests Thomas B. Newman, MD, MPH September 27, 2012 Thanks to Josh Galanter and Michael Shlipak 1.
Chapter 6 Random Variables
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
AP Statistics Section 13.1 A. Which of two popular drugs, Lipitor or Pravachol, helps lower bad cholesterol more? 4000 people with heart disease were.
. Ruling in or out a disease Tests to rule out a disease  You want very few false negatives  High sensitivity 
Diagnosis: EBM Approach Michael Brown MD Grand Rapids MERC/ Michigan State University.
Limited Dependent Variables Ciaran S. Phibbs May 30, 2012.
MEASURES OF TEST ACCURACY AND ASSOCIATIONS DR ODIFE, U.B SR, EDM DIVISION.
Appraising A Diagnostic Test
Week 5: Logistic regression analysis Overview Questions from last week What is logistic regression analysis? The mathematical model Interpreting the β.
Assessing Information from Multilevel (Ordinal) and Continuous Tests ROC curves and Likelihood Ratios for results other than “+” or “-” Michael A. Kohn,
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
1 Risk Assessment Tests Marina Kondratovich, Ph.D. OIVD/CDRH/FDA March 9, 2011 Molecular and Clinical Genetics Panel for Direct-to-Consumer (DTC) Genetic.
1. Statistics Objectives: 1.Try to differentiate between the P value and alpha value 2.When to perform a test 3.Limitations of different tests and how.
A Novel Score to Estimate the Risk of Pneumonia After Cardiac Surgery
Assessing Information from Multilevel (Ordinal) Tests ROC curves and Likelihood Ratios for results other than “+” or “-” Michael A. Kohn, MD, MPP 10/4/2007.
Evaluating Results of Learning Blaž Zupan
Assessing Information from Multilevel and Continuous Tests Likelihood Ratios for results other than “+” or “-” Michael A. Kohn, MD, MPP 10/2/2008.
Assessing Information from Multilevel and Continuous Tests Likelihood Ratios for results other than “+” or “-” Michael A. Kohn, MD, MPP 10/13/2011.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 3: Testing Hypotheses.
Prediction statistics Prediction generally True and false, positives and negatives Quality of a prediction Usefulness of a prediction Prediction goes Bayesian.
Evaluating Risk Adjustment Models Andy Bindman MD Department of Medicine, Epidemiology and Biostatistics.
Copyright restrictions may apply JAMA Pediatrics Journal Club Slides: Procalcitonin Use to Predict Bacterial Infection in Febrile Infants Milcent K, Faesch.
Diagnostic Tests Studies 87/3/2 “How to read a paper” workshop Kamran Yazdani, MD MPH.
SCH Journal Club Use of time from fever onset improves the diagnostic accuracy of C-reactive protein in identifying bacterial infections Wednesday 13 th.
Afebrile Infants With UTI and the Risk for Bacteraemia Journal Club Sheffield Children’s Hospital Naheed Maher 7 th January 2015.
Diagnostic Test Characteristics: What does this result mean
Common Errors by Teachers and Proponents of EBM
Heart Disease Example Male residents age Two models examined A) independence 1)logit(╥) = α B) linear logit 1)logit(╥) = α + βx¡
BIOSTATISTICS Lecture 2. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and creating methods.
PTP 560 Research Methods Week 12 Thomas Ruediger, PT.
Diagnostic Likelihood Ratio Presented by Juan Wang.
BPS - 5th Ed. Chapter 231 Inference for Regression.
Review Design of experiments, histograms, average and standard deviation, normal approximation, measurement error, and probability.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
Diagnostic Test Studies
Chapter 8: Inference for Proportions
12 Inferential Analysis.
Lecture Slides Elementary Statistics Thirteenth Edition
Chapter 5 Normal Distribution
12 Inferential Analysis.
9/27/ A Least-Squares Regression.
Presentation transcript:

Assessing Information from Multilevel and Continuous Tests Likelihood Ratios for results other than “+” or “-” Tom Newman (based on previous lectures by Michael Kohn) 10/4/2012 1

Overview More on Chapter 3, “X-graphs” Septic arthritis example Interval LR for multi-level tests Why not dichotomize? ROC Curves and C-statistic (AUROC) “Walking Man” approach to ROC curves Additional examples WBC for meningitis Absolute Neutrophul Count (ANC) for sepsis in newborns 2

Take-home points ΣProbability ✕ cost = Expected cost Avoid dichotomizing multi-level tests Do not calculate one or more LR(+) or LR(-) for a multilevel test! The area under the ROC curve summarizes the ability of the test to discriminate between D+ and D- individuals. ROC curves tell you more than just the area Slope of ROC curve = LR(result) Length of segments relates to numbers of subjects 3

Recall from Chapter 3…a graph of expected cost 4

What do we mean by expected cost? Expected cost is the sum of costs of different possible outcomes, each weigted by its probability “Costs” include everything bad about the outcome, not just money Example: 30% probability of outcome with cost $100; 70% probability of outcome with cost $0. What is “expected cost”? 5

What is expected cost… …of “Do Not Treat” strategy when P(D+) = 0? When P(D+) is 100%? When P(D+) is 50%? Equation for line: Expected cost = P x B 6 0.5B

What is expected cost… …of “Treat” strategy when P(D+) = 0? When P(D+) is 100%? When P(D+) is 60%? Equation for line: Expected cost = C  (1–P) = C – CP 7 0.4C

Treatment threshold probability (P TT ) is the probability of disease above which you should treat because expected cost at P>P TT is less 8

Now consider a (costless) test with Sensitivity=0.9 and Specificity=0.8. What is expected cost in D+ and D-? 0.1B 0.2C Test 9

The way human beings actually make decisions is a bit more complicated! 10

Questions? 11

Septic Arthritis Bacterial infection in a joint. 12

Clinical Scenario Does this Adult Patient Have Septic Arthritis? 13

Clinical Scenario Does this Adult Patient Have Septic Arthritis?* A 48-year-old woman with a history of rheumatoid arthritis who has been treated with long-term, low- dose steroids presents to the emergency department with a 2-day history of a red, swollen, tender right knee. On examination, she is afebrile and has a fluid in her right knee joint. The white blood cell count in her blood is /µL (normal). An arthrocentesis (needle in the joint) is done to obtain some joint fluid for analysis *Margaretten, M. E., J. Kohlwes, et al. (2007). JAMA 297(13): You have the synovial fluid white blood cell (WBC) count. 14

Clinical Scenario Does this Adult Patient Have Septic Arthritis? The authors estimated the pre-test probability of septic arthritis is How do you use the synovial WBC result to revise the probability of septic arthritis? Margaretten, M. E., J. Kohlwes, et al. (2007). JAMA 297(13):

Chapter 3 method: Dichotomize at 25,000 Synovial Septic Arthritis WBC Count YesNo >25,000 77% 27% ≤ 25,000 23% 73% TOTAL100% 100% Margaretten, M. E., J. Kohlwes, et al. (2007). Jama 297(13):

Why Not Make It a Dichotomous Test? Sensitivity = 77% Specificity = 73% LR(+) = 0.77/( ) = 2.9 LR(-) = ( )/0.73 = 0.32 “+” = > 25,000/uL “-” = ≤ 25,000/uL

Clinical Scenario Synovial WBC = 48,000/mL Pre-test prob: 0.38 Pre-test odds: 0.38/0.62 = 0.61 LR(+) = 2.9 Post-Test Odds = Pre-Test Odds x LR(+) = 0.61 x 2.9 = 1.75 Post-Test prob = 1.75/(1.75+1) = 0.64

Clinical Scenario Synovial WBC = 128,000/mL Pre-test prob: 0.38 LR = ? Post-Test prob =?

Clinical Scenario Synovial WBC = 128,000/mL Pre-test prob: 0.38 Pre-test odds: 0.38/0.62 = 0.61 LR = 2.9 (same as for WBC=48,000!) Post-Test Odds = Pre-Test Odds x LR(+) = 0.61 x 2.9 = 1.75 Post-Test prob = 1.75/(1.75+1) =.64

Why Not Make It a Dichotomous Test? Because you lose information. The risk associated with a synovial WBC=48,000 is equated with the risk associated with WBC=128,000. Choosing a fixed cutpoint to dichotomize a multi- level or continuous test throws away information and (usually) reduces the value of the test.

Summary Sensitivity, Specificity, LR(+), and LR(-) of the Synovial Fluid WBC Count for Septic Arthritis at 3 Different Cutoffs (As presented) WBC (/µL)SensitivitySpecificityLR+LR- >100,00029%99% >50,00062%92% >25,00077%73% Synovial WBC Count = 48,000/uL LR = Margaretten, M. E., J. Kohlwes, et al. (2007). JAMA 297(13):

23 WBC (/µL)SensitivitySpecificityLR+LR- >100,00029%99% >50,00062%92% >25,00077%73% Synovial WBC Count = 48,000/uL Which LR should we use? NONE of THESE! Summary Sensitivity, Specificity, LR(+), and LR(- ) of the Synovial Fluid WBC Count for Septic Arthritis at 3 Different Cutoffs (As presented)

Interval Likelihood Ratios: a Better Way WBC (/uL) Interval % of D+ % of D- Interval LR >100,00029%1% , ,00033%7%4.7 25,001-50,00015%19% ,00023%73%0.3 24

Synovial Fluid WBC Count 25 LR Histogram* * Does not reflect prior probability. D+ and D- bars both sum to 100%.

ROC Curves from Histogram Trade-off between sensitivity and specificity depends on the cutpoint chosen to separate “positives” from “negatives.” The ROC curve is drawn by serially moving the cutpoint from most abnormal to least abnormal. True positives (sensitivity) are plotted against false positives (1-specificity). 26

27 Septic arthirtis No Septic arthirtis

28

29

30

31

WBC Count (x1000/uL) Sensitivity1 - Specificity > highest 0% > 10029%1% > 5062%8% > 2577%27% ≥ 0100% Margaretten, M. E., J. Kohlwes, et al. (2007). Jama 297(13): ROC Table 32

Cutoff > top value Cutoff > 100k Cutoff > 50k Cutoff > 25k Cutoff ≥ 0 33 WBC Count (K/uL ) Sensi tivity 1 - Specifi city > top 0% > 10029%1% > 5062%8% > 2577%27% ≥ 0100%

Cutoff > top Cutoff > 100k Cutoff > 50k Cutoff > 25k Cutoff ≥ 0 Area Under Curve = Area Under ROC Curve 34

Area Under ROC Curve Summary measure of test’s discriminatory ability Probability that a randomly chosen D+ individual will have a more positive test result than a randomly chosen D- individual 35

Area Under ROC Curve Also called the “c statistic” Measures discrimination for logistic regression models (“lroc” command) Statistical significance tested with the Mann-Whitney U or Wilcoxon Rank Sum Tests, non-parametric equivalents of Student’s t test based on ranks. 36

ROC Curve Describes the Test, Not the Patient Describes the test’s ability to discriminate between D+ and D- individuals Not particularly useful in interpreting a test result for a given patient 37

Evaluating the Test --Test Characteristics For dichotomous tests, we discussed sensitivity P(+|D+) and specificity P(–|D–) For multi-level and continuous tests, we can use a Receiver Operating Characteristic (ROC) curve 38

Using the Test Result to Make Decisions about a Patient For dichotomous tests, we use the LR(+) if the test is positive and the LR(–) if the test is negative For multilevel and continuous tests, we use the LR(r), where r is the result of the test 39

Likelihood Ratios LR(result) = P(result|D+)/P(result|D-) P(Result) in patients WITH disease P(Result) in patients WITHOUT disease 40

WOWO With Over WithOut 41

Likelihood Ratios The ratio of the height of the D+ bar to the height of the D- bar for any result (or result interval) 15% 19% LR = 15%/19% =

> 50k > 25k 15% 19% Slope = 15%/19% =0.8 43

Common Mistake When given an “ROC Table,” it is tempting to calculate an LR(+) or LR(-) as if the test were “dichotomized” at a particular cutoff. Example: LR(+,25,000) = 77%/27% = 2.9 This is NOT the LR of a particular result (e.g. WBC >25,000 and ≤ 50,000); it is the LR(+) if you divide “+” from “-” at 25,

WBC (/uL)SensitivitySpecificityLR+LR- >100,00029%99% >50,00062%92% >25,00077%73% Common Mistake 45

Common Mistake From JAMA paper: “Her synovial WBC count of 48,000/µL increases the probability from 38% to 64%.” (Used LR = 2.9) Correct calculation: Her synovial WBC count of 48,000/µL decreases the probability from 38% to 33%.” (Used LR = 0.8) 46

27% 77% > 25,000 Common Mistake 47

WBC (/uL)SensitivitySpecificity 1- Specificity >100,00029%99%1% >50,00062%92%8% >25,00077%73%27% 48 Obtain interval LR from ROC table by subtracting adjacent rows 77% of D+ > 25K 62% of D+ > 50 K Therefore 77%-62% =15% of D+ must be >25 K and < 50K

WBC (/uL)SensitivitySpecificity 1- Specificity >100,00029%99%1% >50,00062%92%8% >25,00077%73%27% 49 Obtain interval LR from ROC table by subtracting adjacent rows 27% of D- > 25K 8% of D- > 50 K Therefore 27%-8% =19% of D- must be >25 K and < 50K

> 50k > 25k 15% 19% Slope = 15%/19% =0.8 50

THE WALKING MAN OR … 51

… WHAT CAN YOU LEARN FROM ROC CURVES LIKE THESE? Bonsu, B. K. and M. B. Harper (2003). "Utility of the peripheral blood white blood cell count for identifying sick young infants who need lumbar puncture." Ann Emerg Med 41(2):

“Walking Man” Approach to ROC Curves Divide vertical axis into d steps, where d is the number of D+ individuals Divide horizontal axis into n steps, where n is the number of D- individuals Sort individuals from most to least abnormal test result Moving from the first individual (with the most abnormal test result) to the last (with the least abnormal test result)… 53

“Walking Man” (continued) …call out “D” if the individual is D+ and “N” if the individual is D- Let the walking man know when you reach a new value of the test The walking man takes a step up every time he hears “D” and a step to the right every time he hears “N” When you reach a new value of the test, he drops a stone. 54

DDNDN(DN)DN (NN)NNNN 55 Septic ArthritisNo Septic Arthritis

56

57 DDNDN(DN)DN (NN)NNNN

… WHAT CAN YOU LEARN FROM ROC CURVES LIKE THESE? 58

Peripheral WBC Count as Test for Bacterial Meningitis Cross sectional study of test accuracy Index test: peripheral WBC count Reference Standard: CSF culture Population: Infants 3-89 days old evaluated in a pediatric ED for serious bacterial infection. Ann Emerg Med. 2003;41:

Ann Emerg Med. 2003;41: More kids had periperhal WBC than CSF WBC Only 21 kids had bacterial meningitis Some peripheral WBC results strongly suggest meningitis What can you learn besides areas under curves?

KP/BWH Neonatal Infection Study - CBC aim* Retrospective cross-sectional study N = 67,623 newborns ≥ 34 weeks GA < 72 h old with paired CBC and BC (within 1 h), born Predictor variables: WBC, absolute neutrophil count (ANC) Outcome variable: culture positive for a bacterial pathogen from a sterile site (N=244) *Newman TB et al. Pediatrics 2010; 126:903–909 61

ANC 99% 90% 50% 10% 1% 62

ANC ROC 63

Likelihood Ratios for ANC Example: 6 hr old infant with ANC = 13 Pre-test probability 4/1000 Post-test probability 0.3 x 4/1000 = 1.2/

Stanaway et al. How fast does the Grim Reaper walk? Receiver operating characteristics curve analysis in healthy men aged 70 and over. BMJ 2011;343:d7679

Questions? 66