Prognostic modelling: General modelling strategy issues Ewout Steyerberg Professor of Medical Decision Making Dept of Public Health, Erasmus MC, Rotterdam,

Slides:



Advertisements
Similar presentations
Controlling for Time Dependent Confounding Using Marginal Structural Models in the Case of a Continuous Treatment O Wang 1, T McMullan 2 1 Amgen, Thousand.
Advertisements

1 Arlene Ash QMC - Third Tuesday September 21, 2010 (as amended, Sept 23) Analyzing Observational Data: Focus on Propensity Scores.
1 QOL in oncology clinical trials: Now that we have the data what do we do?
V.: 9/7/2007 AC Submit1 Statistical Review of the Observational Studies of Aprotinin Safety Part I: Methods, Mangano and Karkouti Studies CRDAC and DSaRM.
Simulation of “forwards-backwards” multiple imputation technique in a longitudinal, clinical dataset Catherine Welch 1, Irene Petersen 1, James Carpenter.
1 Statistical Modeling  To develop predictive Models by using sophisticated statistical techniques on large databases.
Clinical Trial Designs for the Evaluation of Prognostic & Predictive Classifiers Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer.
Departments of Medicine and Biostatistics
Comparison of the New Mayo Clinic Risk Scores and Clinical SYNTAX Score in Predicting Adverse Cardiovascular Outcomes following Percutaneous Coronary Intervention.
ODAC May 3, Subgroup Analyses in Clinical Trials Stephen L George, PhD Department of Biostatistics and Bioinformatics Duke University Medical Center.
Confounding And Interaction Dr. L. Jeyaseelan Department Of Biostatistics CMC, Vellore.
Prediction Models in Medicine Clinical Decision Support The Road Ahead Chapter 10.
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
Model and Variable Selections for Personalized Medicine Lu Tian (Northwestern University) Hajime Uno (Kitasato University) Tianxi Cai, Els Goetghebeur,
Flexible modeling of dose-risk relationships with fractional polynomials Willi Sauerbrei Institut of Medical Biometry and Informatics University Medical.
Multiple Regression Models Advantages of multiple regression Important preliminary analyses Parts of a multiple regression model & interpretation Differences.
Introduction to Logistic Regression. Simple linear regression Table 1 Age and systolic blood pressure (SBP) among 33 adult women.
Darlene Goldstein 29 January 2003 Receiver Operating Characteristic Methodology.
BIOST 536 Lecture 9 1 Lecture 9 – Prediction and Association example Low birth weight dataset Consider a prediction model for low birth weight (< 2500.
How do we know whether a marker or model is any good? A discussion of some simple decision analytic methods Carrie Bennette on behalf of Andrew Vickers.
Long-term predictive value of assessment of coronary atherosclerosis by contrast- enhanced coronary computed tomography angiography: meta- analysis and.
EVIDENCE BASED MEDICINE
Validation of predictive regression models Ewout W. Steyerberg, PhD Clinical epidemiologist Frank E. Harrell, PhD Biostatistician.
CORRELATIO NAL RESEARCH METHOD. The researcher wanted to determine if there is a significant relationship between the nursing personnel characteristics.
Decision Tree Models in Data Mining
P H Y S I C I A N S ’ A C A D E M Y F O R C A R D I O V A S C U L A R E D U C A T I O N Oral drugs for type 2 diabetes and all cause mortality in General.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Clinical Trial Results. org Pexelizumab for Acute ST-Elevation Myocardial Infarction in Patients Undergoing Primary Percutaneous Coronary Intervention.
Are the results valid? Was the validity of the included studies appraised?
HSTAT1101: 27. oktober 2004 Odd Aalen
Performance Reports Andy Bindman MD Department of Medicine, Epidemiology and Biostatistics UCSF.
بسم الله الرحمن الرحیم.. Multivariate Analysis of Variance.
EDRN Approaches to Biomarker Validation DMCC Statisticians Fred Hutchinson Cancer Research Center Margaret Pepe Ziding Feng, Mark Thornquist, Yingye Zheng,
Improving the utility of comorbidity records Retha Steenkamp UK Renal Registry.
How do we know whether a marker or model is any good? A discussion of some simple decision analytic methods Carrie Bennette (on behalf of Andrew Vickers)
Leapfrog’s “Survival Predictor”: Composite Measures for Predicting Hospital Surgical Mortality May 7, 2008.
Excepted from HSRP 734: Advanced Statistical Methods June 5, 2008.
Article Review Cara Carty 09-Mar-06. “Confounding by indication in non-experimental evaluation of vaccine effectiveness: the example of prevention of.
Discriminant Analysis Discriminant analysis is a technique for analyzing data when the criterion or dependent variable is categorical and the predictor.
Introduction to Multivariate Analysis Epidemiological Applications in Health Services Research Dr. Ibrahim Awad Ibrahim.
Lecture 12: Cox Proportional Hazards Model
Evaluating Risk Adjustment Models Andy Bindman MD Department of Medicine, Epidemiology and Biostatistics.
1 Multivariable Modeling. 2 nAdjustment by statistical model for the relationships of predictors to the outcome. nRepresents the frequency or magnitude.
Stats 845 Applied Statistics. This Course will cover: 1.Regression –Non Linear Regression –Multiple Regression 2.Analysis of Variance and Experimental.
Organization of statistical research. The role of Biostatisticians Biostatisticians play essential roles in designing studies, analyzing data and.
Atypical Presentations Patients older than 75: frequently no chest pain ECG in evolution (nonspecific ECG changes) Diabetic patients: commonly no chest.
Association between Systolic Blood Pressure and Congestive Heart Failure in Hypertensive Patients Mrs. Sutheera Intajarurnsan Doctor of Public Health Student.
1 Statistical Review of the Observational Studies of Aprotinin Safety Part II: The i3 Drug Safety Study CRDAC and DSaRM Meeting September 12, 2007 P. Chris.
1 Introduction to Modeling Beyond the Basics (Chapter 7)
DISCRIMINANT ANALYSIS. Discriminant Analysis  Discriminant analysis builds a predictive model for group membership. The model is composed of a discriminant.
Carina Signori, DO Journal Club August 2010 Macdonald, M. et al. Diabetes Care; Jun 2010; 33,
Potential usefulness of a framework of 7 steps for prediction models Ewout Steyerberg Professor of Medical Decision Making Dept of Public Health, Erasmus.
Kelsey Vonderheide, PA1.  Heart Failure—a large number of conditions affecting the structure and function of the heart that make it difficult for the.
Ten Year Outcome of Coronary Artery Bypass Graft Surgery Versus Medical Therapy in Patients with Ischemic Cardiomyopathy Results of the Surgical Treatment.
Meta-analysis of observational studies Nicole Vogelzangs Department of Psychiatry & EMGO + institute.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
CoRPS Center of Research on Psychology in Somatic diseases Brief Depression Screening with the PHQ-2 Predicts Poor Prognosis following PCI with Drug-Eluting.
Matching methods for estimating causal effects Danilo Fusco Rome, October 15, 2012.
Bootstrap and Model Validation
Presenter: Wen-Ching Lan Date: 2018/03/28
VALIDATION AND UPDATING OF MODELS WITH BIOMARKERS
Selecting the Right Predictors
China PEACE risk estimation tool for in-hospital death from acute myocardial infarction: an early risk classification tree for decisions about fibrinolytic.
Regression and Clinical prediction models
Regression and Clinical prediction models
Clinical prediction models
Professor of Clinical Biostatistics and Medical Decision Making Nov-19 Why Most Statistical Predictions Cannot Reliably Support Decision-Making:
Alcohol, Other Drugs, and Health: Current Evidence May–June 2019
Advisor: Dr.vahidipour Zahra salimian Shaghayegh jalali Dec 2017
Presentation transcript:

Prognostic modelling: General modelling strategy issues Ewout Steyerberg Professor of Medical Decision Making Dept of Public Health, Erasmus MC, Rotterdam, the Netherlands Dijon, Nov 12, 2009

Erasmus MC – University Medical Center Rotterdam

Dijon

Workshop « Prognostic Modelling »  Important: prognostic modelling holds the promise to improve medical practice  Timely: prognostic modeling is a vivid area of research many papers every month on prognostic factors and prognostic models recent books

Prognosis and prediction models  Physicians do not master the art of prognostication (Baatenburg de Jong) Solution?  Data bases  New prognostic factors  Quantitative analyses  Individual predictions instead of average predictions

Individualized decision making  Individualization requires predictions  Diagnosis: Probability of disease  Therapy: Probability of outcomes (prognosis)  Prognostic factors (natural cause)  Predictive factors (response to therapy)  Predictions from multivariable models  Pragmatic: combination of prognostic factors  Knowledge: incremental (‘independent’) value of a prognostic factor

Prognostic models  Combine multiple patient or disease related characteristics to predict an outcome (‘prognostic factors’ / ‘predictors’)  Useful for medical practice  Inform patients, relatives, realistic expectations  Decision making: physicians + patients  Useful for research  Risk adjustment in observational data (comparison of series, hospitals)  Design and analysis of RCTs (selection, adjustment of treatment effects)

Prognosis in oncology

Applications in oncology: examples  TNM classification  Prognostic classifications  IPI lymphoma  Individual predictions  Nomograms / score charts  Prostate cancer  Electronic: spreadsheet / internet  Lynch syndrome

Example of an advanced prediction model  Mismatch repair (MMR) mutations cause Lynch syndrome  CRC; endometrial cancer; various other cancers  Diagnostic work-up  Simple: Amsterdam / Bethesda criteria  Advanced: prediction model, e.g. PREMM

Prediction of MLH1 and MSH2 Mutations in Lynch syndrome  PREMM 1,2 Model: Equation Log Odds (Pr/(1-Pr)) = V V V V V V V V V /V 10 – 0.358V 11 /10 – 0.293V12/10 V 1 = CRC in the proband; V 2 = 2 or more CRC in the proband; V 3 = endometrial cancer in the proband; V 4 =other HNPCC cancer in the proband; V 5 = adenoma in the proband; V 6 = 1 for presence of CRC in 1 st degree relative for presence of CRC in 2 nd degree relative; V 7 = ≥ 2 1 st degree relatives with CRC; V 8 = 1 for presence of endometrial cancer in 1 st degree relative for presence of endometrial cancer in 2 nd degree relative; V 9 = ≥ 2 1 st degree relatives with endometrial cancer; V 10 = presence of 1 st or 2 nd degree relative with other HNPCC cancer; V 11 = sum ages of CRC/adenoma; V 12= sum ages of endometrial cancer Balmana et al, JAMA 2006;296:

PREMM p(mutation): 61% CRC dx 45 CRC dx 31 Endo dx 47 CRC dx 46 Urinary tract dx 58 Endo dx 40

Conclusions I  Workshop « Prognostic Modelling » is timely and important

Modelling strategy What is the aim: knowledge on predictors, or provide predictions?  Predictors (Prognostic factors)  Traditional (demographics, patient, disease related)  Modern (‘omics’, biomarkers, imaging)  Challenges:  Testing: independent effect, adjusted for confounders  Estimation: correct functional form  Predictions  Pragmatic combination of predictors  Recent book: General considerations and 7 modeling steps  Many challenges  Biostatistical  Epidemiological / Decision-analytic

Prognostic modelling checklist

Prognostic modeling checklist: general considerations

Prognostic modeling checklist: 7 steps

Prognostic modeling checklist: validity

Example: prediction of myocardial infarction outcome

Aim: predictors or predictions?  Title vs text; additional publication focuses at clinicians, using the 5 strongest predictors only

Predictors

General considerations in GUSTO-I model

1. Data inspection, specifically: missing values Among the array of clinical characteristics considered potential predictor variables in the modeling analyses were occasional patients with missing values. Although a full set of analyses was performed in patients with complete data for all the important predictor variables (92% of the study patients), the subset of patients with one or more missing predictor variables had a higher mortality rate than the other patients, and excluding those patients could lead to biased estimates of risk. To circumvent this, a method for simultaneous imputation and transformation of predictor variables based on the concepts of maximum generalized variance and canonical variables was used to estimate missing predictor variables and allow analysis of all patients The iterative imputation technique conceptually involved estimating a given predictor variable on the basis of multiple regression on (possibly) transformed values of all the other predictor variables. End-point data were not explicitly used in the imputation process. The computations for these analyses were performed with S-PLUS statistical software (version 3.2 for UNIX 32 ), using a modification of an existing algorithm The imputation software is available electronically in the public domain

2. Coding of predictors  continuous predictors  linear and restricted cubic spline functions  truncation of values (for example for systolic blood pressure)  categorical variables  Detailed categorization for location of infarction: anterior (39%), inferior (58%), or other (3%)  Ordinality ignored for Killip class (I – IV) class III and class IV each contained only 1% of the patients

3. Model specification  Main effects: “.. which variables were most strongly related to short- term mortality”:  hypothesis testing rather than prediction question  specific technique not explicitly stated, but likely p<0.05  Interactions: many tested, one included: Age*Killip  Linearity of predictors:  transformations chosen at univariate analysis were also used in multivariable analysis

4. Model estimation  Standard ML  No shrinkage / penalization  No external information

5. Model performance  Discrimination  Area under the receiver operating characteristic curve (AUC, equivalent to the c statistic)  Calibration: observed vs predicted  Graphically, including deciles (similar to Hosmer-Lemeshow goodness of fit test)  Specific subgroups of patients

Calibration

6. Model validation “First, 10-fold cross validation was performed: the model was fitted on a randomly selected subset of 90% of the study patients, and the resulting fit was tested on the remaining 10%. This process was repeated 10 times to estimate the extent to which the predictive accuracy of the model (based on the entire sample) was overoptimistic. Second, for each of 100 bootstrap samples (samples of the same size as the original population but with patients drawn randomly, with replacement, from the full study population), the model was refitted and then tested on the original sample, again to estimate the degree to which the predictive accuracy of the model would be expected to deteriorate when applied to an independent sample of patients.”

7. Model presentation  Predictor effects:  Relative importance: Chi-square statistics  Relative effects: Odds ratios graphically  Predictions  Formula

 Risk Model for 30-Day Mortality Probability of death within 30 days=1/[1+exp (-L)], where L= age minimum (SBP, 120) [Killip class II] [Killip class III] [Killip class IV] heart rate (heart rate-50) [inferior MI] [other MI location] [previous MI] height (height-154.9) (height-165.1) (height-172.0) (height ) (height-185.4) time to treatment [current smoker] [former smoker] [diabetes] weight [previous CABG] [treatment with SK and intravenous heparin] [treatment with SK and subcutaneous heparin] [treatment with combination TPA and SK plus intravenous heparin] [hx of hypertension] [hx of cerebrovascular disease] age · [Killip class II] age · [Killip class III] age · [Killip class IV].  Explanatory notes.  1. Brackets are interpreted as [c]=1 if the patient falls into category c, [c]=0 otherwise.  2. (x)+=x if x>0, (x)+=0 otherwise.  3. For systolic blood pressure (SBP), values >120 mm Hg are truncated at 120.  4. For time to treatment, values <2 hours are truncated at 2.  5. The measurement units for age are years; for blood pressure, millimeters of mercury; for heart rate, beats per minute; for height, centimeters; for time to treatment, hours; and for weight, kilograms.  6. "Other" MI location refers to posterior, lateral, or apical but not anterior or inferior.  7. CABG indicates coronary artery bypass grafting; SK, streptokinase; and hx, history.

Conclusion II  GUSTO-I makes for an interesting case-study on  General modeling considerations  Illustration of 7 modeling steps  Internal vs external validity (early 1990s  2009?)  Debate possible on some choices  1. Missing values: multiple imputation, including the outcome  2. Coding: fractional polynomials? Lump categories?  3. Selection: stepwise works because of large N  4. Estimation: standard ML works because of large N; penalization?  5. Performance: nothing on usefulness  6. Validation: CV and bootstrap, not necessary because of large N?  7. Presentation: predictor effects: nice! Predictions: score chart / nomogram

Challenges in developing a valid prognostic model  Theoretical: biostatistical research  New analysis techniques, e.g.  Neural networks / Support vector machines / …  Fractional polynomials / splines for continuous predictors  Performance measures  Simulations: what makes sense as a strategy?  Applications: epidemiological and decision-analytic research  Subject matter knowledge  Clinical experts  Literature: review / meta-analysis  Balance research questions vs effective sample size  Incremental value new markers  Transportability and external validity  Clinical impact of using a model

Which performance measure when? 1.Discrimination: if poor, usefulness unlikely, but >= 0 2.Calibration: if poor in new setting: Prediction model may harm rather than improve decision-making

Phases of model development (Reilly Ann Intern Med 2006;144(3):201-9)