Regression models in bio-medical research RNDr. Karel Hrach, Ph.D.

Slides:



Advertisements
Similar presentations
The analysis of survival data in nephrology. Basic concepts and methods of Cox regression Paul C. van Dijk 1-2, Kitty J. Jager 1, Aeilko H. Zwinderman.
Advertisements

Survival Analysis. Key variable = time until some event time from treatment to death time for a fracture to heal time from surgery to relapse.
Lecture 28 Categorical variables: –Review of slides from lecture 27 (reprint of lecture 27 categorical variables slides with typos corrected) –Practice.
12-1 Multiple Linear Regression Models Introduction Many applications of regression analysis involve situations in which there are more than.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
12 Multiple Linear Regression CHAPTER OUTLINE
Departments of Medicine and Biostatistics
© 2009, KAISER PERMANENTE CENTER FOR HEALTH RESEARCH Getting to Know Your Data Basic Data Cleaning Principles.
Regression with a Binary Dependent Variable
Multiple Linear Regression Model
Linear statistical models 2008 Binary and binomial responses The response probabilities are modelled as functions of the predictors Link functions: the.
Lecture 3 Survival analysis. Problem Do patients survive longer after treatment A than after treatment B? Possible solutions: –ANOVA on mean survival.
GRA 6020 Multivariate Statistics; The Linear Probability model and The Logit Model (Probit) Ulf H. Olsson Professor of Statistics.
Introduction to Logistic Regression. Simple linear regression Table 1 Age and systolic blood pressure (SBP) among 33 adult women.
Statistics 350 Lecture 25. Today Last Day: Start Chapter 9 ( )…please read 9.1 and 9.2 thoroughly Today: More Chapter 9…stepwise regression.
Chapter 9 Assessing Studies Based on Multiple Regression.
Statistical Fridays J C Horrow, MD, MS STAT Clinical Professor, Anesthesiology Drexel University College of Medicine.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Survival analysis Brian Healy, PhD. Previous classes Regression Regression –Linear regression –Multiple regression –Logistic regression.
1 G Lect 11W Logistic Regression Review Maximum Likelihood Estimates Probit Regression and Example Model Fit G Multiple Regression Week 11.
Simple Linear Regression
Dr Laura Bonnett Department of Biostatistics. UNDERSTANDING SURVIVAL ANALYSIS.
Biostatistics Case Studies 2015 Youngju Pak, PhD. Biostatistician Session 4: Regression Models and Multivariate Analyses.
1 Introduction to medical survival analysis John Pearson Biostatistics consultant University of Otago Canterbury 7 October 2008.
CS 478 – Tools for Machine Learning and Data Mining Linear and Logistic Regression (Adapted from various sources) (e.g., Luiz Pessoa PY 206 class at Brown.
When and why to use Logistic Regression?  The response variable has to be binary or ordinal.  Predictors can be continuous, discrete, or combinations.
HSRP 734: Advanced Statistical Methods July 17, 2008.
Multiple Regression Petter Mostad Review: Simple linear regression We define a model where are independent (normally distributed) with equal.
Three Statistical Issues (1) Observational Study (2) Multiple Comparisons (3) Censoring Definitions.
Limited Dependent Variables Ciaran S. Phibbs. Limited Dependent Variables 0-1, small number of options, small counts, etc. 0-1, small number of options,
Multiple Regression  Similar to simple regression, but with more than one independent variable R 2 has same interpretation R 2 has same interpretation.
Logistic regression (when you have a binary response variable)
Binary logistic regression. Characteristic Regression model for target categorized variable explanatory variables – continuous and categorical Estimate.
1 Introduction to Modeling Beyond the Basics (Chapter 7)
The General Linear Model. Estimation -- The General Linear Model Formula for a straight line y = b 0 + b 1 x x y.
Additional Regression techniques Scott Harris October 2009.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
SURVIVAL ANALYSIS PRESENTED BY: DR SANJAYA KUMAR SAHOO PGT,AIIH&PH,KOLKATA.
Instructor: R. Makoto 1richard makoto UZ Econ313 Lecture notes.
Methods and Statistical analysis. A brief presentation. Markos Kashiouris, M.D.
1 BUSI 6220 By Dr. Nick Evangelopoulos, © 2012 Brief overview of Linear Regression Models (Pre-MBA level)
Biostatistics Class 2 Probability 2/1/2000.
Namath s. Hussain, M.D. Penn state Hershey neurosurgery
Chapter 13 Logistic regression.
Lecture Eleven Probability Models.
BINARY LOGISTIC REGRESSION
Selective laser trabeculoplasty in Korean eyes with medically uncontrolled pseudoexfoliation glaucoma Su Chan Lee1, Jung Hyun Lee1,
Logistic Regression APKC – STATS AFAC (2016).
Survival curves We know how to compute survival curves if everyone reaches the endpoint so there is no “censored” data. Survival at t = S(t) = number still.
Logistic Regression.
Statistics Sweden September 2004 Dan Hedlin
THE LOGIT AND PROBIT MODELS
Copyright © 2004 American Medical Association. All rights reserved.
Program Evaluation Models
Business Statistics, 4e by Ken Black
Introduction to logistic regression a.k.a. Varbrul
Statistics 103 Monday, July 10, 2017.
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Log Linear Modeling of Independence
Linear Model Selection and regularization
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Introduction to Logistic Regression
Interpreting Basic Statistics
Statistics Sweden September 2004 Dan Hedlin
Additional Regression techniques
Business Statistics, 4e by Ken Black
Where are we?.
Logistic Regression.
Nazmus Saquib, PhD Head of Research Sulaiman AlRajhi Colleges
Progression-free (a) and overall (b) survival by age subgroup, Kaplan-Meier plots. Progression-free (a) and overall (b) survival by age subgroup, Kaplan-Meier.
Presentation transcript:

Regression models in bio-medical research RNDr. Karel Hrach, Ph.D. Biomedicínský výzkum s podporou evropských zdrojů v nemocnicích (12.4.2012) Regression models in bio-medical research RNDr. Karel Hrach, Ph.D.

Multiple regression („classical“)

Multiple regression (in Excel)

Multiple regression (in Excel)

Reduction of the model? Question: What if we omit the variable VĚK (Age)? Is the resulting model really BETTER than the previous one? There exists a model-building approach.

Stepwise regression (NOT in Excel) type forward: adds the best candidating regressor so, that the new model is significantly better than the sub-model type backward: removes the weekest regressor so, that the sub-model remains significant

Other types of regressors? e.g. Y=blood pressure decrease (BPD) X (X1,…) might be nominal, e.g. „treatment“: X=1 … standard medication X=2 … experimantal medication X=3 … no medication (life-style change)

Other types of regressors? Y=α+βX+ε interpretation of β? It should express the change of BPD, corresponding to the unit change of X (???)

Other types of regressors? Solution = dummy regressors e.g. (BPD example): let’s define X1=1 (if X=1), X1=0 otherwise (it is an indicator of the treatment n.1) X2=1 (if X=2), X2=0 otherwise (it is an indicator of the treatment n.2)

Other types of regressors? Re-definition of the model Y=α+βX+ε : Y=β0+β1X1+β2X2+ε with only these possible situations: X=1 … Y=β0+β1∙1+β2∙0+ε =β0+β1+ε X=2 … Y=β0+β1∙0+β2∙1+ε =β0+β2+ε X=3 … Y=β0+β1∙0+β2∙0+ε =β0+ε

Other types of regression? „classical“=model for dependency of continuous variable(s) Y (Y1,…) dependent variable Y = binary (outcome yes/no) … logistic regr. dependent variable Y = „survival“ … Cox regression (prop.hazards) and other types (e.g. Poisson regr.)

Logistic (Logit) Regression Simple case (i.e. one regressor X) model: =β0+β1X+ε (i.e.as before) BUT Logit = β0+β1X+ε Logit = ln(π/(1- π)) π =probability of Y=1 (event) 1-π =probability of Y=0

Logistic Regression Application: Data from the project „The use of diffusion tensor imaging in preoperative planning and intraoperative neuronavigation“ (Masaryk Hospital, dpt. of neurosurgery)

Logistic Regression The model found: logit = 4,05–0,68∙(TTD+TR) logit… motor response stimulated? TTD… tumor-to-tract distance TR… thickness of the remnant

Probabilities of positive stimulation for each value of TTD+TR:

„Time-to-event“ variable and censoring SURVIVAL ANALYSIS „Time-to-event“ variable and censoring

SURVIVAL FUNCTION S: S (t ) = P (T ≥ t) KAPLAN-MEIER ESTIMATE: SKM (t ) … cumulative relative frequency of surviving at the time t = time of event LIFE-TABLE ESTIMATE : SLT (t ) … cumulative relative frequency of surviving inside given time-interval

Data (FZS UJEP, dpt.of physioth.): group … =0 (mamma ablation) =1 (tumorectomy) 70surv … time (months 1-6) until the angle returns to the value of 70° 70event … censoring (did it happen within 6 months or not? … 1/0)

The difference seems to be clear …

FW „R-project“ coef e(coef) p group 1.33 3.79 0.00068

The offer for co-operation with clinicians: Application of statistical methods (even „less-traditional“) SW available (Excel, R, STATISTICA)