Examination of Analysis Methods for Positive Continuous Dependent Variables: Model Fit and Cost Saving Implications Brian P SmithMaria De Yoreo Biostatistics.

Slides:



Advertisements
Similar presentations
Chapter 2 The Process of Experimentation
Advertisements

A Spreadsheet for Analysis of Straightforward Controlled Trials
COMPUTER INTENSIVE AND RE-RANDOMIZATION TESTS IN CLINICAL TRIALS Thomas Hammerstrom, Ph.D. USFDA, Division of Biometrics The opinions expressed are those.
Assumptions underlying regression analysis
Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
AP Statistics Course Review.
Multiple Analysis of Variance – MANOVA
Fall 2013Biostat 5110 (Biostatistics 511) Discussion Section Week 4 Sandrine Moutou Medical Biometry I.
Likelihood Ratio, Wald, and Lagrange Multiplier (Score) Tests
Analysis of variance (ANOVA)-the General Linear Model (GLM)
Inference for a population mean BPS chapter 18 © 2006 W. H. Freeman and Company.
Introduction to Statistics
Sample size optimization in BA and BE trials using a Bayesian decision theoretic framework Paul Meyvisch – An Vandebosch BAYES London 13 June 2014.
What role should probabilistic sensitivity analysis play in SMC decision making? Andrew Briggs, DPhil University of Oxford.
Kyiv, TRAINING WORKSHOP ON PHARMACEUTICAL QUALITY, GOOD MANUFACTURING PRACTICE & BIOEQUIVALENCE Statistical Considerations for Bioequivalence.
Confidence Intervals © Scott Evans, Ph.D..
Design of Engineering Experiments - Experiments with Random Factors
Part I – MULTIVARIATE ANALYSIS
Journal Club Alcohol and Health: Current Evidence July-August 2006.
Elementary hypothesis testing Purpose of hypothesis testing Type of hypotheses Type of errors Critical regions Significant levels Hypothesis vs intervals.
Clustered or Multilevel Data
Today Concepts underlying inferential statistics
Modelling health care costs: practical examples and applications Andrew Briggs Philip Clarke University of Oxford & Daniel Polsky Henry Glick University.
Sample Size Determination
Overview of Robust Methods Analysis Jinxia Ma November 7, 2013.
Bootstrap spatobotp ttaoospbr Hesterberger & Moore, chapter 16 1.
1 Advances in Statistics Or, what you might find if you picked up a current issue of a Biological Journal.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 6 – Multiple comparisons, non-normality, outliers Marshall.
Copyright © Cengage Learning. All rights reserved. 8 Tests of Hypotheses Based on a Single Sample.
Inference in practice BPS chapter 16 © 2006 W.H. Freeman and Company.
Chapter 1: Introduction to Statistics
Gil Harari Statistical considerations in clinical trials
CORRELATION & REGRESSION
Biostatistics Case Studies 2015 Youngju Pak, PhD. Biostatistician Session 4: Regression Models and Multivariate Analyses.
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
Probability theory 2 Tron Anders Moger September 13th 2006.
1 Desipramine is an antidepressant affecting the brain chemicals that may become unbalanced and cause depression. It was tested for recovery from cocaine.
Simplicity, Complexity and Modelling Stephen Senn (c) Stephen Senn1.
Lecture 4 Basic Statistics Dr. A.K.M. Shafiqul Islam School of Bioprocess Engineering University Malaysia Perlis
CHAPTER 16: Inference in Practice ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
1 G Lect 7M Statistical power for regression Statistical interaction G Multiple Regression Week 7 (Monday)
Randomized Trial of Preoperative Chemoradiation Versus Surgery Alone in Patients with Locoregional Esophageal Carcinoma, Ursa et al. Statistical Methods:
Confidence intervals and hypothesis testing Petter Mostad
1 THE ROLE OF COVARIATES IN CLINICAL TRIALS ANALYSES Ralph B. D’Agostino, Sr., PhD Boston University FDA ODAC March 13, 2006.
Extreme values and risk Adam Butler Biomathematics & Statistics Scotland CCTC meeting, September 2007.
Statistical Analysis of IC50s Nick Andrews, Statistics Unit CFI, HPA.
1 Mohamed Alosh, Ph.D. Kathleen Fritsch, Ph.D. Shiowjen Lee, Ph.D. DBIII, OB, CDER, FDA Efficacy Evaluation in Acne Clinical Trials.
Overview and Common Pitfalls in Statistics and How to Avoid Them
Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.
CHAPTER 16: Inference in Practice ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Sampling and estimation Petter Mostad
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 1 – Slide 1 of 26 Chapter 11 Section 1 Inference about Two Means: Dependent Samples.
Sample Size Determination
Effect of the Reference Set on Frequency Inference Donald A. Pierce Radiation Effects Research Foundation, Japan Ruggero Bellio Udine University, Italy.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Statistical inference Statistical inference Its application for health science research Bandit Thinkhamrop, Ph.D.(Statistics) Department of Biostatistics.
Lesson Use and Abuse of Tests. Knowledge Objectives Distinguish between statistical significance and practical importance Identify the advantages.
Enhancements to IIIG LTMS By: Todd Dvorak
1 Bandit Thinkhamrop, PhD.(Statistics) Dept. of Biostatistics & Demography Faculty of Public Health Khon Kaen University Overview and Common Pitfalls in.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Hypothesis Testing.
CWR 6536 Stochastic Subsurface Hydrology
Rerandomization to Improve Covariate Balance in Randomized Experiments Kari Lock Harvard Statistics Advisor: Don Rubin 4/28/11.
Introduction to Biostatistics, Harvard Extension School, Fall, 2005 © Scott Evans, Ph.D.1 Sample Size and Power Considerations.
Sample Size Determination
Igor V. Cadez, Padhraic Smyth, Geoff J. Mclachlan, Christine and E
Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
Statistical Reasoning December 8, 2015 Chapter 6.2
Biological Science Applications in Agriculture
Students Opportunities: Conferences:
Presentation transcript:

Examination of Analysis Methods for Positive Continuous Dependent Variables: Model Fit and Cost Saving Implications Brian P SmithMaria De Yoreo Biostatistics DirectorDepartment of Applied Mathematics UC Santa Cruz May 22, 2013 Midwest Biostatistics Workshop; Muncie, IN

2 Personal Motivation Compositional Data Analysis Using Liouville Distributions … - Forgettable Ph.D. Dissertation by BP Smith Compositional Data – Multivariate Data That Sum to 1 Clay – 0.2, Silt , Sand John Aitchison – The Statistical Analysis of Compositional Data ln odds – ln (x1/x3), ln(x2/x3) – Bivariate Normal

3 Basic principle Underlying distribution should match the sample space of the data If using multivariate normal, then must transform compositional data from Simplex  Multivariate Reals Could use Dirichlet or Liouville

4 How to follow principle with positive valued data? log transformation – Positive reals to reals Yet, colleagues were using natural scale or percent change from baseline Why? –That was what had always been done –Central limit theorem protection for type 1 error Easy to show with simulation if true distribution is log-normal and use normal distribution to analyze then there is a power loss

5 What do the critics think? Real data is not log-normal or normal So what factor Arguing a theoretical argument for a real world problem

6 Personal Motivation Part 2 It is generally accepted among statisticians that in a clinical trials the simple use of baseline as a covariate provides more power More than once with scientist – “What is this analysis of covariance, we should just do percent change from baseline.” “That is the analysis Jennings did in their paper...” Or “this is what Goodguy Pharmaceuticals did in their NDA” Me – “But you will lose power” but I have already lost this argument There appears to me to be a higher appreciation that good design can affect power than good analysis.

7 What Do I (and Maybe Some of You, if you are like minded) need? Research that not only suggests that log-transformation is better for positive data But also quantifies how much better Research that not only suggests analysis of covariance is better But also quantifies how much better This should exist, right? Not that I can find

8 What Did We Do? 70 Continuous Endpoints Analyzed 10 Analyses Endpoints Each –4 Phase 1 Studies –1 Phase 2 Study –1 Phase 3 Study 10 Endpoints Chosen from 3 Preclinical Studies

9 What Did We Do? (cont) Chose primary or secondary endpoints if continuous 1-3 per study Remaining 7-9 randomly selected from –ECGs – Vitals –Laboratory Measurements Variety of endpoints from range of studies chosen in non-subjective manner

10 The Analyses All endpoints had repeated observations over time Used Mixed Effect Model –Random subject effect –Fixed Effects Treatment Time Treatment by Time Interaction –If Cross-over study, additional random effects added 8 models examined for each endpoint

11 Eight Models IdentifierResponseCovariate for BL? UNYno LNLn(y)no URy-BLno LRLn(y/BL)no PR100∙(y-BL)/BLno UCyYes; BL LCLn(y)Yes; ln(BL) PC100∙(y-BL)/BLYes; BL

12 Three Means of Comparison For ANCOVA Only –P-value of Covariate For Log Scale –Compare Likelihoods For All Analyses –Compare Costs

13 How to Compare Costs? Compare Standard Errors of Estimates for Treatment Effect Determine change in sample size that would be needed under one model to obtain a standard error equivalent to that of another model Scaling Issue due to log-transformation If no scaling issue and two models (se1/se2) 2 is how many fold more subjects that analysis 1 would need to have the same standard error as analysis 2

14 Dealing with the Scaling Issue Natural Scale Log Scale – Consider If start with log scale and work towards natural scale

15 Which to use? If data is skewed right then Geometric Mean < Mean Use of the mean favors the natural scale (most conservative) Use of geometric mean more consistent with data We do both but Prefer Geometric Mean

16 Back to comparing cost Is the fold increase in subjects needed for the natural scale to be equivalent to the log-scale Similar argument for scaling for percent change from baseline

17 The Case for ANCOVA Comparison% p-value < 0.05 ANCOVA versus No Baseline Adjustment Natural Scale90 Log Scale90 ANCOVA versus “Change from Baseline” Natural Scale60 Log Scale65 % Change from Baseline57.5

18 The Case for ANCOVA Cont. ComparisonAverage Fold-Increase In Sample Size ANCOVA versus No Baseline Adjustment Natural Scale3.32 Log Scale3.72 ANCOVA versus “Change from Baseline” Natural Scale1.25 Log Scale1.48 % Change from Baseline1.29

19 The Case for Log Ratio over Percent Change from Baseline Comparison% Likelihood Log Ratio > Likelihood Percent Change from Baseline No Covariate80 Covariate80

20 Likelihood Plots

21 The Case for Log Ratio over Percent Change from Baseline (Cont) ComparisonAverage Fold-Increase In Sample Size With MeanWith Geometric Mean No Covariate Covariate

22 The Case for Log over Natural Scale Comparison% Likelihood Log Ratio > Likelihood Percent Change from Baseline No Baseline Adjustiment80 “Change from Baseline”79 ANCOVA82

23 Likelihood Plots

24 The Case for Log Ratio over Natural Scale (Cont) ComparisonAverage Fold-Increase In Sample Size With MeanWith Geometric Mean No Baseline Adjustiment “Change from Baseline” ANCOVA

25 Conclusions Don’t just trust us, do it yourself If these results continue to replicate can conclude –If a baseline is available, use of baseline as a covariate should always be undertaken –Although we recommend exploration of data from previous studies, percent change from baseline analyses should not be undertaken unless there is strong empirical evidence that for that endpoint it is preferred –Again with the caveat that nothing replaces exploration of data from previous studies, log-transformation ought to be the default analysis of positive data unless exploration of previous data provides convincing evidence that the natural scale is preferred.