Survival analysis. First example of the day Small cell lungcanser Meadian survival time: 8-10 months 2-year survival is 10% New treatment showed median.

Slides:



Advertisements
Similar presentations
Logistic Regression Psy 524 Ainsworth.
Advertisements

Regression analysis Linear regression Logistic regression.
Week 3. Logistic Regression Overview and applications Additional issues Select Inputs Optimize complexity Transforming Inputs.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Introduction to Survival Analysis October 19, 2004 Brian F. Gage, MD, MSc with thanks to Bing Ho, MD, MPH Division of General Medical Sciences.
Departments of Medicine and Biostatistics
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Multipe and non-linear regression. What is what? Regression: One variable is considered dependent on the other(s) Correlation: No variables are considered.
N-way ANOVA. 3-way ANOVA 2 H 0 : The mean respiratory rate is the same for all species H 0 : The mean respiratory rate is the same for all temperatures.
Lecture 3 Survival analysis. Problem Do patients survive longer after treatment A than after treatment B? Possible solutions: –ANOVA on mean survival.
PH6415 Review Questions. 2 Question 1 A journal article reports a 95%CI for the relative risk (RR) of an event (treatment versus control as (0.55, 0.97).
Statistics: Data Analysis and Presentation Fr Clinic II.
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Logistic regression Who survived Titanic?.
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
Notes on Logistic Regression STAT 4330/8330. Introduction Previously, you learned about odds ratios (OR’s). We now transition and begin discussion of.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Linear Regression and Linear Prediction Predicting the score on one variable.
Measures of disease frequency (I). MEASURES OF DISEASE FREQUENCY Absolute measures of disease frequency: –Incidence –Prevalence –Odds Measures of association:
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Analysis of Complex Survey Data
STAT E-150 Statistical Methods
Survival analysis Brian Healy, PhD. Previous classes Regression Regression –Linear regression –Multiple regression –Logistic regression.
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
1 Survival Analysis Biomedical Applications Halifax SAS User Group April 29/2011.
© 2002 Prentice-Hall, Inc.Chap 14-1 Introduction to Multiple Regression Model.
Dr Laura Bonnett Department of Biostatistics. UNDERSTANDING SURVIVAL ANALYSIS.
Lecture 3 Survival analysis.
7.1 - Motivation Motivation Correlation / Simple Linear Regression Correlation / Simple Linear Regression Extensions of Simple.
Chapter 3: Generalized Linear Models 3.1 The Generalization 3.2 Logistic Regression Revisited 3.3 Poisson Regression 1.
Logit model, logistic regression, and log-linear model A comparison.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 12.4.
● Final exam Wednesday, 6/10, 11:30-2:30. ● Bring your own blue books ● Closed book. Calculators and 2-page cheat sheet allowed. No cell phone/computer.
Repeated measures ANOVA in SPSS Cross tabulations Survival analysis.
Repeated Measures  The term repeated measures refers to data sets with multiple measurements of a response variable on the same experimental unit or subject.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
Chap 14-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 14 Additional Topics in Regression Analysis Statistics for Business.
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 7 Logistic Regression I.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
1 היחידה לייעוץ סטטיסטי אוניברסיטת חיפה פרופ’ בנימין רייזר פרופ’ דוד פרג’י גב’ אפרת ישכיל.
Linear correlation and linear regression + summary of tests
APPLIED DATA ANALYSIS IN CRIMINAL JUSTICE CJ 525 MONMOUTH UNIVERSITY Juan P. Rodriguez.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
MBP1010 – Lecture 8: March 1, Odds Ratio/Relative Risk Logistic Regression Survival Analysis Reading: papers on OR and survival analysis (Resources)
Section 9-1: Inference for Slope and Correlation Section 9-3: Confidence and Prediction Intervals Visit the Maths Study Centre.
Statistics: Unlocking the Power of Data Lock 5 Exam 2 Review STAT 101 Dr. Kari Lock Morgan 11/13/12 Review of Chapters 5-9.
Multiple Logistic Regression STAT E-150 Statistical Methods.
Inferential Statistics Introduction. If both variables are categorical, build tables... Convention: Each value of the independent (causal) variable has.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Statistics 2: generalized linear models. General linear model: Y ~ a + b 1 * x 1 + … + b n * x n + ε There are many cases when general linear models are.
Logistic Regression Analysis Gerrit Rooks
Nonparametric Statistics
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
Bivariate analysis. * Bivariate analysis studies the relation between 2 variables while assuming that other factors (other associated variables) would.
LOGISTIC REGRESSION. Purpose  Logistical regression is regularly used when there are only two categories of the dependent variable and there is a mixture.
Methods of Presenting and Interpreting Information Class 9.
BINARY LOGISTIC REGRESSION
Logistic Regression APKC – STATS AFAC (2016).
April 18 Intro to survival analysis Le 11.1 – 11.2
CHAPTER 7 Linear Correlation & Regression Methods
Notes on Logistic Regression
Statistics 103 Monday, July 10, 2017.
Multiple logistic regression
CHAPTER 29: Multiple Regression*
Association, correlation and regression in biomedical research
Application of Logistic Regression Model to Titanic Data
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
15.1 The Role of Statistics in the Research Process
Where are we?.
Presentation transcript:

Survival analysis

First example of the day Small cell lungcanser Meadian survival time: 8-10 months 2-year survival is 10% New treatment showed median survival of 13.2months

Progressively censored observations Current life table Completed dataset Cohort life table Analysis “on the fly”

Problem Do patients survive longer after treatment 1 than after treatment 2? Possible solutions: ANOVA on mean survival time? ANOVA on median survival time? 100 person years of observation: How long has the average person been in the study. 10 persons being observed for 10 years 100 persons being observed for 100 years

Life table analysis A sub-set of 13 patients undergoing the same treatment

Life table analysis Time interval chosen to be 3 months n i number of patients starting a given period

Life table analysis d i number of terminal events, in this example; progression/response w i number of patients that have not yet been in the study long enough to finish this period

Life table analysis Number exposed to risk: n i – w i /2 Assuming that patients withdraw in the middle of the period on average.

Life table analysis q i = d i /(n i – w i /2) Proportion of patients terminating in the period

Life table analysis p i = 1 - q i Proportion of patients surviving

Life table analysis S i = p i p i-1...p i-N Cumulative proportion of surviving Conditional probability

Survival curves How long will a lung canser patient keep having canser on this particular treatment?

Kaplan-Meier Simple example with only 2 ”terminal-events”.

Confidence interval of the Kaplan-Meier method Fx after 32 months

Confidence interval of the Kaplan-Meier method Survival plot for all data on treatment 1 Are there differences between the treatments?

Comparing Two Survival Curves One could use the confidence intervals… But what if the confidence intervals are not overlapping only at some points? Logrank-stats Hazard ratio Mantel-Haenszel methods

Comparing Two Survival Curves The logrank statistics Aka Mantel-logrank statistics Aka Cox-Mantel-logrank statistics

Comparing Two Survival Curves Five steps to the logrank statistics table 1.Divide the data into intervals (eg. 10 months) 2.Count the number of patients at risk in the groups and in total 3.Count the number of terminal events in the groups and in total 4.Calculate the expected numbers of terminal events e.g. (31-40) 44 in grp1 and 46 in grp2, 4 terminal events. expected terminal events 4x(44/90) and 4x(46/90) 5.Calculate the total

Comparing Two Survival Curves Smells like Chi-Square statistics

Comparing Two Survival Curves Hazard ratio

Comparing Two Survival Curves Mantel Haenszel test Is the OR significant different from 1? Look at cell (1,1) Estimated value, E(a i ) Variance, V(a i ) row total * column total grand total

Comparing Two Survival Curves Mantel Haenszel test df = 1; p>0.05

Hazard function d is the number of terminal events  f is the sum of failure times  c is the sum of censured times

Logistic regression Who survived Titanic?

25 The sinking of Titanic Titanic sank April 14th 1912 with 2228 souls 705 survived. A dataset of 1309 passengers survived. Who survived?

26 The data Sibsp is the number of siblings and/or spouses accompanying Parsc is the number of parents and/or children accompanying Some values are missing Can we predict who will survive titanic II? pclasssurvivednamesexagesibspparch 11Allen, Miss. Elisabeth Waltonfemale Allison, Master. Hudson Trevormale Allison, Miss. Helen Lorainefemale212 10Allison, Mr. Hudson Joshua Creightonmale Allison, Mrs. Hudson J C (Bessie Waldo Daniels)female Anderson, Mr. Harrymale Andrews, Miss. Kornelia Theodosiafemale Andrews, Mr. Thomas Jrmale Appleton, Mrs. Edward Dale (Charlotte Lamson)female5320

27 Analyzing the data in a (too) simple manner Associations between factors without considering interactions

28 Analyzing the data in a (too) simple manner Associations between factors without considering interactions

29 Analyzing the data in a (too) simple manner Associations between factors without considering interactions

30 Could we use multiple linear regression to predict survival? multiple linear regressionLogistic regression Response variable is defined between –inf and +inf Response variable is defined between 0 and 1 Normal distributedBernoulli distributed

31 Logit transformation is modeled linearly The logistic function

32 The sigmodal curve

33 The sigmodal curve The intercept basically just ‘scale’ the input variable

34 The sigmodal curve The intercept basically just ‘scale’ the input variable Large regression coefficient → risk factor strongly influences the probability

35 The sigmodal curve The intercept basically just ‘scale’ the input variable Large regression coefficient → risk factor strongly influences the probability Positive regression coefficient → risk factor increases the probability

36 Logistic regression of the Titanic data

37 Logistic regression of the Titanic data – passenger class 1.Summary of data 2.Coding of the dependent variable 3.Coding of the categorical explanatory variable: First class: 1 Second class: 2 Third class: reference

38 Logistic regression of the Titanic data – passenger class A fit of the null-model, basically just the intercept. Usually not interesting The total probability of survival is 500/1309 = Cutoff is 0.5 so all are classified as non- survivers. Basically tests if the null-model is sufficient. It almost certainly is not. Shows that survival is related to pclass (which is not in the null- model)

39 Logistic regression of the Titanic data – passenger class 1.Omnibus test: Uses LR to describe if the adding the pclass variable to the model makes it better. It did! But better than the null-model, so no surprise. 2.Model Summary. Other measures of the goodness of fit. 3.Classification table: By including pclass 67.7 passengers were correctly categorized. 4.Variables in the equation: first line repeats that pclass has a significant effect on survival. B is the logistic fittet parameter. Exp(B) is the odds rations, so the odds of survival is 4.7 ( ) times higher than passengers on third class (reference class)

40 Logistic regression of the Titanic data – Adding age to the model Ups… Some data points are missing And the null model is poorer

41 Logistic regression of the Titanic data – Adding age to the model Cox and Senll’s R-square increased from to 0.141, indicating a better model By this model we can classify 69.1% passenger class only classified 67.7%

42 Logistic regression of the Titanic data – Adding age to the model Age has a significant influence on survival. The odds ratio of age is So the odds of a 31 year old is times the odds of a 30 year old. Or the odds for a 30 year old to survive is 1/0.963 = times larger than that of a 31 year old

43 Logistic regression of the Titanic data – Age alone The model is extremely poor Consequently age appear to be insignificant in estimating survival.

44 Logistic regression of the Titanic data – Adding family and sex The model is becoming better

45 Logistic regression of the Titanic data – Using the model as to predict What is the probability that a 25 year old woman accompanied only by her husband holding a second class ticket would survive Titanic? z = * =

46 Using the model to predict survival What is the probability that a 25 year old woman accompanied only by her husband holding a second class ticket would survive Titanic? z = *(-5)/ =

47 Is it realistic that Leonardo survives and the chick dies?