Statistics 103 Monday, July 10, 2017.

Slides:



Advertisements
Similar presentations
The analysis of survival data in nephrology. Basic concepts and methods of Cox regression Paul C. van Dijk 1-2, Kitty J. Jager 1, Aeilko H. Zwinderman.
Advertisements

Survival Analysis. Key variable = time until some event time from treatment to death time for a fracture to heal time from surgery to relapse.
X Treatment population Control population 0 Examples: Drug vs. Placebo, Drugs vs. Surgery, New Tx vs. Standard Tx Let X = decrease (–) in cholesterol.
Survival Analysis In many medical studies, the primary endpoint is time until an event occurs (e.g. death, remission) Data are typically subject to censoring.
Survival Analysis-1 In Survival Analysis the outcome of interest is time to an event In Survival Analysis the outcome of interest is time to an event The.
Simple Logistic Regression
Survival Analysis. Statistical methods for analyzing longitudinal data on the occurrence of events. Events may include death, injury, onset of illness,
Introduction to Survival Analysis October 19, 2004 Brian F. Gage, MD, MSc with thanks to Bing Ho, MD, MPH Division of General Medical Sciences.
Departments of Medicine and Biostatistics
Objectives (BPS chapter 24)
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Logistic Regression Multivariate Analysis. What is a log and an exponent? Log is the power to which a base of 10 must be raised to produce a given number.
بسم الله الرحمن الرحیم. Generally,survival analysis is a collection of statistical procedures for data analysis for which the outcome variable of.
Lecture 23: Tues., Dec. 2 Today: Thursday:
Biostatistics in Research Practice Time to event data Martin Bland Professor of Health Statistics University of York
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
Today Concepts underlying inferential statistics
Measures of disease frequency (I). MEASURES OF DISEASE FREQUENCY Absolute measures of disease frequency: –Incidence –Prevalence –Odds Measures of association:
Richard M. Jacobs, OSA, Ph.D.
Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Survival analysis Brian Healy, PhD. Previous classes Regression Regression –Linear regression –Multiple regression –Logistic regression.
Inference for regression - Simple linear regression
Simple Linear Regression
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
Biostatistics Breakdown Common Statistical tests Special thanks to: Christyn Mullen, Pharm.D. Clinical Pharmacy Specialist John Peter Smith Hospital 1.
HSRP 734: Advanced Statistical Methods July 10, 2008.
G Lecture 121 Analysis of Time to Event Survival Analysis Language Example of time to high anxiety Discrete survival analysis through logistic regression.
Dr Laura Bonnett Department of Biostatistics. UNDERSTANDING SURVIVAL ANALYSIS.
1 Introduction to medical survival analysis John Pearson Biostatistics consultant University of Otago Canterbury 7 October 2008.
Assessing Survival: Cox Proportional Hazards Model
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
The Scientific Method Probability and Inferential Statistics.
LOGISTIC REGRESSION A statistical procedure to relate the probability of an event to explanatory variables Used in epidemiology to describe and evaluate.
Introduction to Survival Analysis Utah State University January 28, 2008 Bill Welbourn.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Advanced Statistics for Interventional Cardiologists.
1 Multivariable Modeling. 2 nAdjustment by statistical model for the relationships of predictors to the outcome. nRepresents the frequency or magnitude.
01/20151 EPI 5344: Survival Analysis in Epidemiology Actuarial and Kaplan-Meier methods February 24, 2015 Dr. N. Birkett, School of Epidemiology, Public.
Nonparametric Statistics
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
SURVIVAL ANALYSIS PRESENTED BY: DR SANJAYA KUMAR SAHOO PGT,AIIH&PH,KOLKATA.
Carolinas Medical Center, Charlotte, NC Website:
Nonparametric Statistics
BINARY LOGISTIC REGRESSION
BIOST 513 Discussion Section - Week 10
An introduction to Survival analysis and Applications to Predicting Recidivism Rebecca S. Frazier, PhD JBS International.
Logistic Regression APKC – STATS AFAC (2016).
April 18 Intro to survival analysis Le 11.1 – 11.2
CHAPTER 7 Linear Correlation & Regression Methods
Survival curves We know how to compute survival curves if everyone reaches the endpoint so there is no “censored” data. Survival at t = S(t) = number still.
Notes on Logistic Regression
Chapter 13 Nonlinear and Multiple Regression
The binomial applied: absolute and relative risks, chi-square
Statistical Inference for more than two groups
Medical Statistics Dr. Gholamreza Khalili
Multiple logistic regression
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Nonparametric Statistics
Categorical Data Analysis Review for Final
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Introduction to Logistic Regression
Where are we?.
Sample Size and Power Part II
Nazmus Saquib, PhD Head of Research Sulaiman AlRajhi Colleges
Case-control studies: statistics
Presentation transcript:

Statistics 103 Monday, July 10, 2017

Survival Analysis Survival analysis takes into account the time to the outcome and has several advantages over “snapshot”-in-time methods (e.g., odds ratios, relative risk ratios) It incorporates data from the entire survival curve and not just one arbitrary point (e.g., after 5 years, or the time to median survival) It incorporates data from patients that are lost to follow up or who die of unrelated causes before the study ends It incorporates data from patients who have not had the outcome yet at the end of the study because they enrolled in the study late or the study ended early

(covariates, as in the Cox model)

Could reduce inaccuracy by doing weekly or even daily tally.

K-M: Same data, but Weekly tally Wk1 2 3 4 5

(YEARLY TALLY) SPE 95/100= .95 .95x(82/92)=.85 .85x(64/79)=.69 95/100= .95 .95x(82/92)=.85 .85x(64/79)=.69 .69x(41/61)=.46 .46x(13/38)=.16 (YEARLY TALLY)

Comparing Two Survival Curves (K-M) Kaplan-Meier method: explores effect of one variable on survival using the log rank test or the Mantel-Haenszel approach, with nearly identical results. Log rank test: Compares Observed vs. Expected Outcomes in 2 X 2 table and a chi square test is done Null hypothesis (H0): Observed Outcomes = Expected Outcomes, or Observed – Expected=0 If chi-square statistic > critical value, then null hypothesis is rejected

K-M PLOT N’

Time-to-outcome analysis using log-rank test Chan FKL et al Time-to-outcome analysis using log-rank test Chan FKL et al. Lancet 2017: 389: 2375-82 N’

KAPLAN-MEIER METHOD USING LOG RANK TEST TO OBTAIN P-VALUE WEEKLY TALLY Patients were followed until censored, death, for 219 weeks

Use log rank test to determine this

Log rank test: Observed (O) vs Expected (E) Outcomes over time as basis for Kaplan-Meier Ogroup 1 =a Egroup 1 =b Ogroup 2 =c Egroup 2 =d TIME (WK) N'1 N'2 O 1 O 2 E 1 E2 START 40 60   1 0.4 0.6 2 59 0.8 1.2 3 38 N’ 2X2 Table for chi square test a b c d etc. and then do  for each column to determine a,b,c,d 1-i , sum of the outcomes over the time periods 1 to i

SEE HANDOUT A LOG RANK TEST FORMULA: (O1-E1)2  E1 + (O2- E2)2  E2 = CHI-SQUARED VALUE (a-b)2/b + (c-d)2/d If (a-b) and (c-d) are close to zero, the chi squared value will beclose to 0 The higher the chi square value, the more likely it is to be significant

Degrees of freedom = N-1 With 2 groups, =1.

Cox proportional Hazards Regression A method for investigating the effects of several variables upon the time a specific outcome takes to happen These variables are assumed to have a multiplicative effect on risk

Comparison Two Survival Curves (Cox model) Cox proportional hazards model: explores multiple variables using the hazards ratio (HR) Compares the ratio of Observed to Expected outcomes in one group to another, taking multiple confounding variables into consideration Null hypothesis (H0): that the HR of Observed Outcomes to Expected Outcomes in the first group to those of the second group = 1 If 95% CI of the HR does not overlap a ratio of 1, the null hypothesis is rejected

Distribution of time-to-event measure is exponential (i. e Distribution of time-to-event measure is exponential (i.e., the event rate is constant over time, with fluctuations due to random variability only)

Hazard rate The risk of failure (i.e., the risk of suffering the event of interest), given that the participant has survived up to a specific time. Hazard rate for a time period t is generally expressed as h(t) and can range from 0 to 1 e.g., for patients surviving the first 2 months, hazard rate for month 3 may be 0.05

Hazard ratio (HR) Conceptually, hazard rateexposed / hazard rateunexposed Mathematically: HR= h(t) /ho(t) = exp (b1x1+b2x2+…+bpXp), where h0(t) is the baseline risk of the endpoint with no predictors present (i.e., unexposed) x1, x2, etc. are predictors of risk (independent variables predicting outcome) b1, b2, etc. are Cox coefficients (parameter estimates); For example, b1 represents the log of the hazard ratio relative to one (1) unit of change in x1, holding all other predictors constant + b1  increased risk of outcome (e.g, b1= +.2) - b1  decreased risk of outcome (e.g., b1= -.05) Ln HR= ln [h(t)/h0(t)]= b1x1+b2x2+…+bpXp

b (parameter estimate) Cox proportional hazards regression model 5,180 Framingham participants, age 45-82 , followed until death or for 10 years. Baseline age, sex, SBP, smoking, cholesterol, and DM recorded. There were 402 deaths (events, outcomes). RISK FACTOR b (parameter estimate) P value HR (95% CI) X1= age, years 0.11691 (b1) 0.0001 1.124 (1.111-1.138) X2= male sex 0.40359 (b2) 0.0002 1.497 (1.215-1.845) X3=SBP 0.01645 (b3) 1.017 (1.012-1.021) X4=current smoker 0.76798 (b4) 2.155 (1.758-2.643) X5= total serum cholesterol -0.00209 (b5) 0.0963 0.998 (.995-1.002) X6= diabetes mellitus -0.02366 (b6) 0.1585 0.816 (.615-1.083) If x2=x3=x4=x5=x6=0, and x1=1 (age 46), then ln (h(t)/h0)=b11 and h(t)/h0= eb1 =e.11691 = 1.124

Cox regression model if there is only one independent variable of interest (e.g., treatment or cancer stage) Ln HR= ln [h(t)/h0(t)]= b1x1 Becomes similar to the Kaplan-Meier method except that it uses Observed- Expected ratios in the two groups rather than differences between Observed and Expected in the two groups O1/E1 O2/ E2 Often referred to as the crude hazard ration, as there are no covariates other than the outcome in question

Cox regression model if there is only one independent variable of interest (e.g., treatment or cancer stage) 80 patients with DH Lymphoma were followed until death (EVENT) or for up to 1 year without therapy to assess survival differences in cancer stage 3 vs stage 4 54 EVENTS occurred during the follow up period SEE HANDOUT B

Final take home point It is impossible to ever know the true nature of the population from which we sample. We can only take a sample from it, carefully measure the sample, and then calculate the probability that the sample was derived from a THEORETICAL population distribution For example, if p<.0001 vs. a theoretical normal distribution, it is VERY unlikely that the sample was derived from a normal (Gaussian) distribution. Very low probability  Certainty ABIM seat charting example

Survival Ratios Odds ratio: compares outcome at one point in time Risk ratio: compares outcome at one point in time Median time ratio: compares outcome at one point in time Hazards ratio: uses entire survival curve to assess outcomes Log rank test Cox proportional model

Comparison of hazard ratio and median time ratio % CURE RATE OVER TIME group 1 1 WEEK 4 WEEKS 8 WEEKS A 50% 75% 100% B 25% 37.5% HAZARD RATIO (B/A)= .5 (FAVORING group 1 A) MEDIAN TIMES TO CURE: group 1 A: 1 WEEK group 1 B: 8 WEEKS MEDIAN TIME RATIO (B/A)= .125 (FAVORING group 1 A)

Comparison of hazard ratio and median time ratio group 1 3 MONTHS 6 MONTHS 1 YEAR 2 YEARS 3 YEARS NEW 90% 60% 20% 5% 1% STANDARD 45% 30% 10% 2.5% 0.5% HAZARD RATIO FOR SURVIVAL: 2 WITH STANDARD group 1 VS. NEW group 1 MEDIAN SURVIVAL TIMES: NEW: 7 MONTHS B: 2 MONTHS DIFFERENCE: 5 MONTHS FOR A 3 YEAR STUDY MEDIAN TIME RATIO: 3.5 FAVORING NEW group 1 % MONTHS

Common multivariate regression models OUTCOME VARIABES INDEPENDENT VARIABLES REGRESSION MODEL OUTPUT (ASSOCIATION) CONTINUOUS CATEGORICAL OR CONTINUOUS LINEAR REGRESSION AVE.  IN OUTCOME PER UNIT  IN INDEPENDENT VARIABLE BINARY CATEGORICAL (YES OR NO) LOGISTIC REGRESSION ODDS RATIO TIME TO EVENT (e.g., time to death) COX PROPORTIONAL HAZARDS MODEL HAZARDS RATIO

(HRs)

* * i.e., Observed Outcomes= Expected

2 = [(O1-E1)]2/E1 + [(O2-E2)2/E2 i, where I = 1 to n weekly intervals 2 = [(O1-E1)]2/E1 + [(O2-E2)2/E2

(of sizes n=20 and n=31) 2 = [(O1-E1)]2/E1 + [(O2-E2)2/E2

“Shortcut” using the median survival ratios “Shortcut” using the median survival ratios* (avoids long computations or need for SPSS or SAS programs) Median survivals: Astrocytoma, 80 weeks (14 deaths) Glioblastoma, 34 weeks (28 deaths) Median hazard ratio (HR) = 80/34=2.35, and ln HR=0.85 95% CI for HR SE for ln HR = 1/14 + 1/28 = .3273 95% CI for ln HR= ln HR 1.96SE= .85.6415= .2085,1.325 exp .2085, exp 1.335= 1.23. 3.76 Thus, HR = 2.35 (1.23,3.76), with P = .01 * estimates p-value, but not quite as accurate

COX MODEL Survival (%)

Observed (O) vs Expected (E) Outcomes over time as the Basis for the Cox Model Ogroup 1/Egroup 1 Ogroup 2/Ogroup 2 a/b c/d HR= = 1-i , sum of the outcomes over the time periods 1 to i