How informative priors can help model heterogeneity Network meta-analyses in a sparse evidence bases April 11, 2016.

Slides:



Advertisements
Similar presentations
Bayes rule, priors and maximum a posteriori
Advertisements

Lecture 11 (Chapter 9).
Probabilistic models Haixu Tang School of Informatics.
Week 11 Review: Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution.
Chi Squared Tests. Introduction Two statistical techniques are presented. Both are used to analyze nominal data. –A goodness-of-fit test for a multinomial.
Bayesian inference “Very much lies in the posterior distribution” Bayesian definition of sufficiency: A statistic T (x 1, …, x n ) is sufficient for 
Psychology 290 Special Topics Study Course: Advanced Meta-analysis April 7, 2014.
Evaluating Diagnostic Accuracy of Prostate Cancer Using Bayesian Analysis Part of an Undergraduate Research course Chantal D. Larose.
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #21.
2005 Hopkins Epi-Biostat Summer Institute1 Module 2: Bayesian Hierarchical Models Francesca Dominici Michael Griswold The Johns Hopkins University Bloomberg.
Bayesian inference Gil McVean, Department of Statistics Monday 17 th November 2008.
1 1 The Scientist Game Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics.
The Cost-Effectiveness and Value of Information Associated with Biologic Drugs for the Treatment of Psoriatic Arthritis Y Bravo Vergel, N Hawkins, C Asseburg,
Lecture Inference for a population mean when the stdev is unknown; one more example 12.3 Testing a population variance 12.4 Testing a population.
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
Using ranking and DCE data to value health states on the QALY scale using conventional and Bayesian methods Theresa Cain.
BS704 Class 7 Hypothesis Testing Procedures
Heterogeneity in Hedges. Fixed Effects Borenstein et al., 2009, pp
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 7 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.
Probability, Bayes’ Theorem and the Monty Hall Problem
Standard Error of the Mean
1 Level of Significance α is a predetermined value by convention usually 0.05 α = 0.05 corresponds to the 95% confidence level We are accepting the risk.
Bayesian Inference, Basics Professor Wei Zhu 1. Bayes Theorem Bayesian statistics named after Thomas Bayes ( ) -- an English statistician, philosopher.
Making decisions about distributions: Introduction to the Null Hypothesis 47:269: Research Methods I Dr. Leonard April 14, 2010.
Estimating parameters in a statistical model Likelihood and Maximum likelihood estimation Bayesian point estimates Maximum a posteriori point.
MAS2317 Presentation Jake McGlen. Assuming vague prior knowledge, obtain the posterior distribution for µ and hence construct a 95% Bayesian confidence.
Incorporating heterogeneity in meta-analyses: A case study Liz Stojanovski University of Newcastle Presentation at IBS Taupo, New Zealand, 2009.
Nguyen D. Nguyen, John A. Eisman and Tuan V. Nguyen Garvan Institute of Medical Research, Sydney, Australia Indirect comparison of anti-vertebral fracture.
Therapeutic Equivalence & Active Control Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute.
Maximum Likelihood - "Frequentist" inference x 1,x 2,....,x n ~ iid N( ,  2 ) Joint pdf for the whole random sample Maximum likelihood estimates.
Patricia Guyot1,2, Nicky J Welton1, AE Ades1
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Not in FPP Bayesian Statistics. The Frequentist paradigm Defines probability as a long-run frequency independent, identical trials Looks at parameters.
Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.
Statistics What is the probability that 7 heads will be observed in 10 tosses of a fair coin? This is a ________ problem. Have probabilities on a fundamental.
INTRODUCTION TO CLINICAL RESEARCH Introduction to Statistical Inference Karen Bandeen-Roche, Ph.D. July 12, 2010.
Sampling and Statistical Analysis for Decision Making A. A. Elimam College of Business San Francisco State University.
G. Cowan Lectures on Statistical Data Analysis Lecture 8 page 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem 2Random variables and.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
Stat 31, Section 1, Last Time Distribution of Sample Means –Expected Value  same –Variance  less, Law of Averages, I –Dist’n  Normal, Law of Averages,
LIGO- G Z August 17, 2005August 2005 LSC Meeting 1 Bayesian Statistics for Burst Search Results LIGO-T Keith Thorne and Sam Finn Penn State.
In Bayesian theory, a test statistics can be defined by taking the ratio of the Bayes factors for the two hypotheses: The ratio measures the probability.
Hypothesis Testing. Statistical Inference – dealing with parameter and model uncertainty  Confidence Intervals (credible intervals)  Hypothesis Tests.
1 Spatial assessment of deprivation and mortality risk in Nova Scotia: Comparison between Bayesian and non-Bayesian approaches Prepared for 2008 CPHA Conference,
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Outline Historical note about Bayes’ rule Bayesian updating for probability density functions –Salary offer estimate Coin trials example Reading material:
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
CHAPTER 6: SAMPLING, SAMPLING DISTRIBUTIONS, AND ESTIMATION Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Prediction and Missing Data. Summarising Distributions ● Models are often large and complex ● Often only interested in some parameters – e.g. not so interested.
Confidence Intervals.
Continuous Probability Distributions
Step 1: Specify a null hypothesis
BINARY LOGISTIC REGRESSION
Module 2: Bayesian Hierarchical Models
ECO 173 Chapter 10: Introduction to Estimation Lecture 5a
Bayesian data analysis
Bayes for Beginners Stephanie Azzopardi & Hrvoje Stojic
Elementary Statistics
Session II: Reserve Ranges Who Does What
ECO 173 Chapter 10: Introduction to Estimation Lecture 5a
Bayesian Inference, Basics
Inferences about Population Means
PSY 626: Bayesian Statistics for Psychological Science
Clinical and patient-reported outcomes for patients randomised to CZP 200 mg Q2W and CZP 400 mg Q4W to week 96. Clinical and patient-reported outcomes.
CS639: Data Management for Data Science
OR for baseline predictors of MDA at weeks 12, 24, 48, 96 and 144 by univariate analysis of observed data. OR for baseline predictors of MDA at weeks 12,
Mathematical Foundations of BME Reza Shadmehr
Presentation transcript:

How informative priors can help model heterogeneity Network meta-analyses in a sparse evidence bases April 11, 2016

Background Sparse networks are relatively common when dealing with network meta-analyses (NMA) For example, biologics trials can be very expensive leading to few trials describing each comparison When a network is too sparse it may become infeasible to conduct random-effects (RE) NMA If conducted, credible intervals can “explode” Random effects estimates not significant when the pairwise meta-analysis is 2

Psoriatic arthritis example As a motivating example, consider a network of biologics for the treatment of psoriatic arthritis (PsA) The principal outcome was the American College of Rheumatology responder criteria (ACR) 20 (20 percent improvement in tender or swollen joint counts as well as 20 percent improvement in three of the other five criteria) Also of interest was Psoriasis Area and Severity Index (PASI) 50 (reduction in PASI score of at least 50%) 3

placebo Etanercept 25mg biw ACR 20 at 12 weeks – Biologics-naïve Ustekinumab 90mg RAPID-PSA ADEPT Genovese 2007 PSUMMIT1 PSUMMIT2 PSUMMIT1 PSUMMIT2 PSUMMIT1 PSUMMIT2 Mease 2004 GO-REVEAL Golimumab 100mg Golimumab 50mg Ustekinumab 45mg Adalimumab 40mg Certolizumab 400 mg

Relative risk of ACR 20 at 12 weeks 5 Placebo 0.29 (0.22, 0.53) 0.40 (0.23, 1.04) 0.29 (0.20, 0.64) 0.26 (0.19, 0.52) 0.49 (0.30, 1.12) 0.51 (0.30, 1.18) 3.41 (1.88, 4.64) ADA (0.63, 3.43) 0.99 (0.52, 2.13) 0.90 (0.48, 1.73) 0.89 (0.49, 1.78) 1.68 (0.78, 3.79) 1.74 (0.82, 3.95) 2.51 (0.96, 4.26) 0.74 (0.29, 1.60) CZP 0.73 (0.27, 1.83) 0.66 (0.25, 1.52) 0.66 (0.25, 1.49) 1.24 (0.43, 3.16) 1.28 (0.45, 3.26) 3.45 (1.56, 5.05) 1.01 (0.47, 1.92) 1.37 (0.55, 3.69) ETN25BIW 0.91 (0.42, 1.84) 0.90 (0.41, 1.81) 1.69 (0.71, 3.87) 1.75 (0.73, 4.18) 3.80 (1.92, 5.29) 1.11 (0.58, 2.08) 1.51 (0.66, 3.95) 1.10 (0.54, 2.38) GOL (0.62, 1.56) 1.87 (0.82, 4.35) 1.93 (0.85, 4.44) 3.84 (1.94, 5.32) 1.12 (0.56, 2.05) 1.52 (0.67, 3.93) 1.11 (0.55, 2.42) 1.01 (0.64, 1.60) GOL (0.85, 4.41) 1.94 (0.86, 4.45) 2.03 (0.89, 3.36) 0.59 (0.26, 1.28) 0.81 (0.32, 2.31) 0.59 (0.26, 1.42) 0.53 (0.23, 1.22) 0.53 (0.23, 1.18) UST (0.54, 1.99) 1.96 (0.85, 3.28) 0.58 (0.25, 1.22) 0.78 (0.31, 2.22) 0.57 (0.24, 1.36) 0.52 (0.23, 1.18) 0.51 (0.22, 1.16) 0.97 (0.50, 1.84) UST90 Pair wise MA RRs: 2.03 ( ) and 1.97 ( )

PASI 50 at 12 weeks – Biologics-naïve placebo Adalimumab 40mg ADEPT SPIRIT-P1 Infliximab 5mg/kg IMPACT2 GO-REVEAL Golimumab 50mg Golimumab 100mg

Relative risk of PASI 50 at 12 weeks 7 Placebo 0.18 (0.09, ) 0.15 (0.08, ) 0.17 (0.09, ) 0.13 (0.08, ) 5.51 (0.01, 11.14) ADA (0.00, ) 0.98 (0.00, ) 0.73 (0.00, ) 6.84 (0.00, 11.87) 1.18 (0.00, ) GOL (0.00, ) 0.89 (0.00, ) 5.75 (0.00, 11.62) 1.02 (0.00, ) 0.89 (0.00, ) GOL (0.00, ) 7.91 (0.00, 12.32) 1.38 (0.00, ) 1.12 (0.00, ) 1.32 (0.00, ) INF5

QUICK REVIEW 8

Bayesian parameter estimation Pr(  |Y)  Pr(Y|  ) x Pr(  ) “Likelihood” Suppose we wish to draw inference on , a parameter or set of parameters of interest (e.g., treatment effects), therefore  = parameter of interest Y represents the observed data We begin with a “prior” probability distribution for the parameters Typically a non-informative prior (blank slate) Full Bayesian framework allows for external information to inform our prior belief “Prior” Then use the data to determine the likelihood “How likely parameter values are given the observed data” The basis of frequentist statistics Parameter estimation is based on the posterior distribution Bayesian thinking: given my prior knowledge and data likelihood, what is the probability of a parameter estimate being true “Posterior” NMA can be conducted in either the frequentist of Bayesian framework Majority of NMA are conducted in the Bayesian framework

Arm-based fixed & random effects network meta-analysis model Random effects Fixed effects 10

PROBLEM AND SOLUTION 11

Likelihood Probability Prior Posterior Can result in unrealistically wide 95% credible intervals Between-trial variance (  2 ) Posterior heterogeneity distributions

Informative priors If there is too little information to overcome vague, non- informative priors, what are we to do? Informative priors can be used to integrate scientific knowledge external to the evidence base Turner et al reviewed 14,886 meta-analyses to help inform the distribution of τ when working with binary data This information can be used to construct informative priors on the heterogeneity variance parameter How it is done: Use a log-Normal prior on the heterogeneity variance τ 2 with mean and precision based on decades of scientific work 13

Informative heterogeneity priors 10

15

Changes to the BUGS/JAGS model Effectively, this implies changing only two lines to most NMA code From To Why a log-Normal distribution? Best fitting distribution among a variety of candidates 16

PsA example revisited 17 Placebo 0.18 (0.11, 0.33) 0.14 (0.09, 0.30) 0.17 (0.10, 0.42) 0.12 (0.08, 0.20) 5.60 (2.99, 8.98) ADA (0.43, 1.68) 0.97 (0.49, 2.36) 0.69 (0.38, 1.16) 6.93 (3.39, 10.71) 1.23 (0.60, 2.33) GOL (0.74, 2.25) 0.85 (0.44, 1.40) 5.76 (2.40, 9.58) 1.03 (0.42, 2.05) 0.84 (0.44, 1.36) GOL (0.31, 1.24) 8.18 (4.98, 12.00) 1.46 (0.86, 2.62) 1.17 (0.71, 2.29) 1.41 (0.80, 3.25) INF5 PlaceboADA40CZPETN25BIWGOL100GOL50 UST (1.20, 3.05) 0.59 (0.34, 1.02) 0.81 (0.42, 1.63) 0.59 (0.33, 1.09) 0.54 (0.30, 0.96) 0.53 (0.30, 0.94) UST (1.12, 2.98) 0.58 (0.32, 0.99) 0.78 (0.40, 1.58) 0.57 (0.31, 1.06) 0.52 (0.28, 0.93) 0.52 (0.28, 0.92) Relative risk of ACR 20 Relative risk of PASI 50

Application to other data Dichotomous data are very popular A more recent study has conducted the same exercise for continuous data modeled with a Normal likelihood It requires that the data first be transformed as standardized mean differences And be back-transformed after completing the NMA As of yet, there is no such evidence for models based on Poisson or Multinomial likelihoods 18

Placebo LAMALABA+LAMA ICS+LABA+LAM A LAMA+PDE-4 LABA PDE-4LABA+PDE-4 ICS+LA BA ICS 3 trials Pair wise MA RR: 0.84 ( ) Network MA RR: 0.85 ( ) Example - COPD

A B C A B C Scenario #1 Scenario #2 I 2 =25% I 2 =35%I 2 =45% I 2 =5% I 2 =35%I 2 =65% How else might this be used? Most NMA assumes the degree of heterogeneity is equal in each comparison k=3 k=4k=5 k=12 If we relax this assumption, we may have to borrow strength estimation power from somewhere else

Conclusions Sparse networks and heterogeneity are two distinct concepts Informative priors can be used to ensure the correct model is used (i.e., random-effects) when it is otherwise infeasible Use of informative heterogeneity priors are becoming widely endorsed (used within NICE, CADTH, and academic experts) There are limitations to the use of informative priors: It can only be used with Binomial and Normal likelihoods It can only be used when all treatment comparisons fall within the same category 21

22 Thank you!