OHDSI Method Evaluation

Slides:



Advertisements
Similar presentations
THE USE OF HISTORICAL CONTROLS IN DEVICE STUDIES Vic Hasselblad Duke Clinical Research Institute.
Advertisements

Agency for Healthcare Research and Quality (AHRQ)
How would you explain the smoking paradox. Smokers fair better after an infarction in hospital than non-smokers. This apparently disagrees with the view.
Study Size Planning for Observational Comparative Effectiveness Research Prepared for: Agency for Healthcare Research and Quality (AHRQ)
If we use a logistic model, we do not have the problem of suggesting risks greater than 1 or less than 0 for some values of X: E[1{outcome = 1} ] = exp(a+bX)/
BIOST 536 Lecture 3 1 Lecture 3 – Overview of study designs Prospective/retrospective  Prospective cohort study: Subjects followed; data collection in.
OBSERVATIONAL MEDICAL OUTCOMES PARTNERSHIP 1 OBSERVATIONAL MEDICAL OUTCOMES PARTNERSHIP Are Observational Studies Any Good? David Madigan, Columbia University.
Statistics for clinical research An introductory course.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
Doing Data Science – Chapter 12: Epidemiology Vast amounts of individual patient medical data is available – Detailed – visits, prescriptions, outcomes,
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
HSRP 734: Advanced Statistical Methods July 31, 2008.
BIOST 536 Lecture 11 1 Lecture 11 – Additional topics in Logistic Regression C-statistic (“concordance statistic”)  Same as Area under the curve (AUC)
Lecture 9: Analysis of intervention studies Randomized trial - categorical outcome Measures of risk: –incidence rate of an adverse event (death, etc) It.
Three Statistical Issues (1) Observational Study (2) Multiple Comparisons (3) Censoring Definitions.
Describing the risk of an event and identifying risk factors Caroline Sabin Professor of Medical Statistics and Epidemiology, Research Department of Infection.
The Diabetic Retinopathy Clinical Research Network Effect of Diabetes Education During Retinal Ophthalmology Visits on Diabetes Control (Protocol M) 11.
A Claims Database Approach to Evaluating Cardiovascular Safety of ADHD Medications A. J. Allen, M.D., Ph.D. Child Psychiatrist, Pharmacologist Global Medical.
T tests comparing two means t tests comparing two means.
1 EPI235: Epi Methods in HSR April 5, 2005 L3 Evaluating Health Services using administrative data 2: Advanced Topics in Risk Adjustment (Dr. Schneeweiss)
European Patients’ Academy on Therapeutic Innovation The Purpose and Fundamentals of Statistics in Clinical Trials.
Transparency in the Use of Propensity Score Methods
1 Study Design Imre Janszky Faculty of Medicine, ISM NTNU.
Date of download: 6/24/2016 Copyright © The American College of Cardiology. All rights reserved. From: The Year in Cardiovascular Surgery J Am Coll Cardiol.
Population level estimation workgroup Martijn Schuemie Marc Suchard.
Biostatistics Class 2 Probability 2/1/2000.
Jose M. de la Torre Hernández … in behalf of the 3D investigators
Martijn Schuemie, Marc Suchard, Patrick Ryan, Tomas Bergvall
AP Statistics Final Exam Review!!!!!.
Associations of Maternal Antidepressant Use During the First Trimester of Pregnancy With Preterm Birth, Small for Gestational Age, Autism Spectrum Disorder,
The comparative self-controlled case series (CSCCS)
FeatureExtraction v2.0 Martijn Schuemie.
Harvard T.H. Chan School of Public Health
Population level estimation best practices
Cost effectiveness of COX 2 selective inhibitors and traditional NSAIDs alone or in combination with a proton pump inhibitor for people with osteoarthritis.
April 18 Intro to survival analysis Le 11.1 – 11.2
A critical assessment of the utility of the case-control design in population-based databases Martijn Schuemie.
Population level estimation workgroup
Reproducing Graham et al using the CohortMethod package
Martijn Schuemie, PhD Janssen Research and Development
Martijn Schuemie, PhD Janssen Research and Development
Biostatistics Case Studies 2016
Designing and executing OHDSI studies
Method benchmark update
OHDSI population-level estimation workgroup meeting
Supplementary Table 1. PRISMA checklist
Table 1 Characteristics of study population, by pneumococcal vaccination status. From: Prior Pneumococcal Vaccination Is Associated with Reduced Death,
Method evaluation Martijn Schuemie.
OHDSI estimation-methods papers in progress
FAQs from researchers and the answers of mine for OHDSI study pipeline
Presenter: Wen-Ching Lan Date: 2018/05/09
Presenter: Wen-Ching Lan Date: 2018/08/01
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
S1316 analysis details Garnet Anderson Katie Arnold
on behalf of the LEADER Trial Steering Committee and Investigators
Presenter: Wen-Ching Lan Date: 2018/03/28
Do higher-risk patients benefit from off-pump coronary artery bypass grafting? Evidence from an ecologic analysis of randomized trials  Oliver Kuss, PhD,
Annals of Internal Medicine • Vol. 167 No. 12 • 19 December 2017
ERRORS, CONFOUNDING, and INTERACTION
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Interpreting Basic Statistics
Sampling and Power Slides by Jishnu Das.
Biostatistics Core Members Joyce Chang Kirsha Gordon Joseph Goulet
The objective of this lecture is to know the role of random error (chance) in factor-outcome relation and the types of systematic errors (Bias)
Interpreting Epidemiologic Results.
New Techniques and Technologies for Statistics 2017  Estimation of Response Propensities and Indicators of Representative Response Using Population-Level.
DRCR Retina Network Treatment for Center-Involved DME in Eyes with Good Visual Acuity (Protocol V)
Basic statistics.
Kaplan-Meier survival curves and the log rank test
Medical Statistics Exam Technique and Coaching, Part 2 Richard Kay Statistical Consultant RK Statistics Ltd 22/09/2019.
Presentation transcript:

OHDSI Method Evaluation Martijn Schuemie

Quick recap of previous meeting We discussed the SelfControlledCaseSeries package Adjust for age and season using splines Add pre-exposure windows Add other drugs to model (all drugs) Correct for event-dependent censoring Using negative controls, we’re still biased even when applying extensive adjustments. Sensitive to confounding by indication Would like to hear about “Identifying the important questions that can be answered using observational research”

Method evaluation How well does a method perform on known problems? Is the effect estimate close to the real estimate? Mean Squared Error (MSE) Do positive controls have higher estimates than negative controls? Area Under the receiver-operator Curve (AUC) Does the 95% confidence interval contain the truth in 95% of the times? Coverage

Lessons learned from previous experiments Issues with the OMOP experiment(s): Problem with contra-indication (relevant for known positive controls). Also applies to OSIM2 Limitations in methods library Forgot to censor in SCC MSCCS applied shrinkage to exposure of interest CohortMethod did not correct for differences in length of follow up Uncertainty around positive and negative status Don’t know true RR if RR != 1

Objectives Understand performance of methods Investigate differences between databases Are some DBs better at population level estimation? (for specific exposures or outcomes) Develop a process that will inform the choice of methods used to answer a particular question First step: pilot experiment

Experiment as a package https://github.com/OHDSI/StudyProtocolSandbox/tree/master/PopEstMethodEvaluation

Experiment overview Exposure-Outcome pairs Methods Data

Data

Exposure-outcome pairs RR = 1 Exposure: Diclofenac Outcomes: 35 negative controls (in inpatient setting) >= 100 events during diclofenac exposure 19 negative controls left

Injected outcomes are random! (No confounding) Signal injection Injected outcomes are random! (No confounding) Solution: Build an outcome model, and inject outcomes based on ‘baseline’ probability of outcome Assume relative risk = 1 (negative control) Count outcomes during exposure For relative risk = 2: randomly insert just as many outcomes during exposure Neg. control outcome X Patient A Example: Outcome is myocardial infarction (MI) Diabetes is a risk factor for MI People with diabetes are more likely to have extra outcome injected Diclofenac Neg. control outcome X Patient B Diclofenac Diclofenac Neg. control outcome X Patient C Diclofenac

Exposure-outcome pairs RR = (1.25, 1.5, 2, 4) Fit Poisson model for outcome during any exposure to diclofenac (when prior obs > 365 days) Inject additional outcomes to exposures

Signal injection limitations No very small event counts No time-varying confounding, including no contra-indication

Methods Self-Controlled Case Series Self-Controlled Cohort Cohort Method using celecoxib as comparator (IC Temporal Pattern Discovery) (Case-Control)

Self-Controlled Case Series Is the outcome more likely during exposed time compared to non-exposed time? Angioedema Patient A Diclofenac Angioedema Patient B Ke Diclofenac ppra Diclofenac Angioedema Patient C Diclofenac

Self-Controlled Case Series Given that the subject had an outcome, is the outcome more likely during exposed time compared to non-exposed time? Outcome Patient A Diclofenac Outcome Patient B Ke Diclofenac ppra Diclofenac Outcome Patient C Diclofenac

Self-Controlled Case Series By design, adjusted for Patient characteristics constant over time Additionally adjust for Age Season Contra-indication (All) other exposures (MSCCS) Outcome-dependent censoring

Self-Controlled Cohort Is the outcome more likely during exposed time compared to time immediately prior to exposure? Outcome Patient A Control Diclofenac Truncated control period Outcome Patient B Diclofenac Control Diclofenac Outcome Patient C Control Diclofenac

New-user cohort design Total population Initiation of treatment Diclofenac Celecoxib

Randomized controlled trial Total population Initiation of treatment Treatment arm Randomization Control arm

New-user cohort design Total population Initiation of treatment Treatment assignment is not random! Doctors have reasons why they prescribe diclofenac to some patients and celecoxib to others Diclofenac Celecoxib

Propensity model Statistical model (logistic regression) of why patients get one treatment or the other Using (all) information prior to initiation of treatment Used to adjust for differences through Trimming Matching Stratification

Outcome model After trimming / matching / stratification on the propensity score Estimate effect of treatment on outcome using Cox Poisson Logistic Include same information prior to initiation of treatment

Pilot experiment overview Exposure-Outcome pairs Methods (analyses) 19 negative controls 4*19 positive controls SCCS (5) SCC (8) CohortMethod (5) Data Truven MDCD

Results SCCS Simple SCCS Using pre-exposure window Analysis True RR AUC Coverage MSE 1 NA 0.42 0.09 2 0.37 0.11 3 4 5 0.68 0.07 1.25 0.62 0.08 0.61 0.32 0.10 0.64 0.60 0.67 0.06 1.5 0.79 0.77 0.80 0.87 0.85 0.88 1.00 Simple SCCS Using pre-exposure window Using age and season Using event-dependent observation MSCCS

Forest plot Carpal Tunnel Syndrome Analysis 5: MSCCS

Results SCC Length of exposure, index date in exposure window Analysis True RR AUC Coverage MSE 1 NA 0.74 0.02 2 0.84 0.05 3 0.03 4 5 0.79 6 0.07 7 8 1.25 0.64 0.89 0.71 0.63 0.06 0.62 0.61 1.5 1.00 0.87 0.68 0.85 0.86 0.82 0.83 0.08 0.98 0.95 0.97 0.04 0.09 0.11 0.53 0.12 0.47 0.13 Length of exposure, index date in exposure window 30 days of each exposure, index date in exposure window Length of exposure, index date in exposure window, require full obs 30 days of each exposure, index date in exposure window, require full obs Length of exposure, index date ignored 30 days of each exposure, index date ignored Length of exposure, index date ignored, require full obs 30 days of each exposure, index date ignored, require full obs

Forest plot Analysis 3: Length of exposure, index date in exposure window, require full obs

Results CohortMethod No matching, simple outcome model Analysis True RR AUC Coverage MSE 1 NA 0.63 0.19 2 1.00 0.05 3 0.03 4 0.13 5 0.31 1.25 0.44 0.74 0.16 0.56 0.06 0.62 0.95 0.04 0.54 0.84 0.14 0.61 0.88 1.5 0.55 0.15 0.72 0.89 0.08 0.77 0.66 0.17 0.81 0.42 0.69 0.91 0.92 0.79 0.18 0.67 0.47 0.93 0.68 0.36 No matching, simple outcome model Matching plus simple outcome model Stratification plus stratified outcome model Matching plus stratified outcome model Matching plus full outcome model

Forest plot Analysis 3: Stratification plus stratified outcome model

Discussion MSCCS best SCCS performer, but still vulnerable to confounding by indication Self-Controlled Cohort performs really well, but vulnerable to contra-indication (not injected) CohortMethod least powerful but high coverage

Discussion Not sure if results translate to other types of exposures and outcomes Use such an experiment to inform analysis choices for a particular hypthesis of interest …

Next topic Identifying the important questions that can be answered using observational research

Next workgroup meeting June 15st 3pm Hong Kong / Taiwan 4pm South Korea 4:30pm Adelaide 9am Central European time http://www.ohdsi.org/web/wiki/doku.php?id=projects:workgroups:est-methods