Professor of Clinical Biostatistics and Medical Decision Making Nov-19 Why Most Statistical Predictions Cannot Reliably Support Decision-Making:

Professor of Clinical Biostatistics and Medical Decision Making
Nov-19 Why Most Statistical Predictions Cannot Reliably Support Decision-Making: Problems Caused By Common Regression Modeling Approaches Ewout Steyerberg Professor of Clinical Biostatistics and Medical Decision Making Leiden, October 2017

Why most prediction models are false
Methods at development Rigorous validation

Validation

Three competitive models
MMRPredict (NEJM, 2006) - Common regression modeling approach; small data set MMRPro (JAMA 2006) - Bayesian modeling approach; moderate size data set PREMM (JAMA 2006; Gastroenterology 2011; JCO 2016) - Sensible regression modeling approach; large data set Which model wins? Which may do harm?

6 clinic-based, 5 pop-based cohorts

Discrimination Clinic-based Population-based

Calibration plots: obs vs predicted
Calibration slope as a measure of overfitting

Calibration

Clinical usefulness Statistical performance: Discrimination and calibration Consider full range of predictions Decision-analytic performance: Define a decision threshold: act if risk > threshold TP and FP classifications Net Benefit as a summary measure: NB = (TP – w FP) / n, with w = harm/benefit (Vickers & Elkin, MDM 2006)

Decision curve analysis
Clinic-based Population-based

Overview Clinical context: testing for Lynch syndrome
Statistical and decision-analytic performance Could poor performance have been foreseen? Prevented?

Example of “barbarian modeling strategy”

Selection based on statistical significance

Many predictors, >37 df; dichotomized

Exaggerated effects

Sample size issues Robust: strong, vigorous, sturdy, tough, powerful, powerfully built, solidly built, as strong as a horse/ox, muscular, sinewy, rugged, hardy, strapping, brawny, burly, husky

Poor performance foreseeable?
Simulate modeling strategy Small sample size 38 events at development 35 events vs >2000 at validation Stepwise selection Univariate and multivariable statistical testing Dichotomization New cohort: n=19,866; 2,051 mutations

Poor discrimination Poor calibration

Poor decision-making Illustration with 10 random samples

Could poor performance be prevented?
PREMM modeling strategy Coding of family history Continuous age

SiM 2007

Could poor performance be prevented?
PREMM modeling strategy Coding of family history Continuous age Larger sample size

Better discrimination and calibration if a) more sensible modeling and b) larger sample size

Substantially better decision-making if a) more sensible modeling and b) larger sample size

Discussion Avoid stepwise selection Avoid dichotomization
Prespecification with summary variables Advanced estimation Avoid dichotomization Keep continuous Increase sample size Combining development and validation sets Collaborative efforts Rigorous validation Statistical and decision-analytic perspective

Evaluation of decision-making
Net Benefit: “utility of the method” Peirce, Science 1884 Youden index: sens + spec – 1 Net Benefit Vickers, MDM 2006 Weight FP:TP = H:B = odds(threshold) (Vergouwe 2003) Decision Curve Analysis

Youden index and Net Benefit

Avoid miscalibration by overfitting
Shrinkage Reduce coefficients by multiplying by s, s<1 E.g.: multiply by 0.8 Penalization Ridge regression: shrink during fitting LASSO: shrink to zero; implicit selection Elastic Net: combination of Ridge and LASSO Machine learning ?

Professor of Clinical Biostatistics and Medical Decision Making Nov-19 Why Most Statistical Predictions Cannot Reliably Support Decision-Making:

Similar presentations

Presentation on theme: "Professor of Clinical Biostatistics and Medical Decision Making Nov-19 Why Most Statistical Predictions Cannot Reliably Support Decision-Making:"— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Professor of Clinical Biostatistics and Medical Decision Making Nov-19 Why Most Statistical Predictions Cannot Reliably Support Decision-Making:

Similar presentations

Presentation on theme: "Professor of Clinical Biostatistics and Medical Decision Making Nov-19 Why Most Statistical Predictions Cannot Reliably Support Decision-Making:"— Presentation transcript:

Similar presentations

About project

Feedback