Using Ordinary Regression for Logit Transformed Data

Slides:



Advertisements
Similar presentations
Assistive Devices for BADL 張志仲 副教授 Jyh-Jong Chang Associate Professor TEL:2644 Office: CS505.
Advertisements

Logistic Regression I Outline Introduction to maximum likelihood estimation (MLE) Introduction to Generalized Linear Models The simplest logistic regression.
732G21/732G28/732A35 Lecture computer programmers with different experience have performed a test. For each programmer we have recorded whether.
Simple Logistic Regression
Logistic Regression Example: Horseshoe Crab Data
SLIDE 1IS 240 – Spring 2010 Logistic Regression The logistic function: The logistic function is useful because it can take as an input any.
Multinomial Logistic Regression
Nemours Biomedical Research Statistics April 23, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Introduction to Logistic Regression Analysis Dr Tuan V. Nguyen Garvan Institute of Medical Research Sydney, Australia.
Logistic regression for binary response variables.
Logistic Regression II Simple 2x2 Table (courtesy Hosmer and Lemeshow) Exposure=1Exposure=0 Disease = 1 Disease = 0.
© Department of Statistics 2012 STATS 330 Lecture 28: Slide 1 Stats 330: Lecture 28.
Categorical Data Analysis School of Nursing “Categorical Data Analysis 2x2 Chi-Square Tests and Beyond (Multiple Categorical Variable Models)” Melinda.
Understanding Multivariate Research Berry & Sanders.
Logit model, logistic regression, and log-linear model A comparison.
Business Intelligence and Decision Modeling Week 11 Predictive Modeling (2) Logistic Regression.
AN INTRODUCTION TO LOGISTIC REGRESSION ENI SUMARMININGSIH, SSI, MM PROGRAM STUDI STATISTIKA JURUSAN MATEMATIKA UNIVERSITAS BRAWIJAYA.
1 היחידה לייעוץ סטטיסטי אוניברסיטת חיפה פרופ’ בנימין רייזר פרופ’ דוד פרג’י גב’ אפרת ישכיל.
When and why to use Logistic Regression?  The response variable has to be binary or ordinal.  Predictors can be continuous, discrete, or combinations.
Linear vs. Logistic Regression Log has a slightly better ability to represent the data Dichotomous Prefer Don’t Prefer Linear vs. Logistic Regression.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Copyright © 2006 The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Dummy Variable Regression Models chapter ten.
The Chicago Guide to Writing about Multivariate Analysis, 2 nd edition. Interpreting multivariate OLS and logit coefficients Jane E. Miller, PhD.
Logistic Regression. Linear Regression Purchases vs. Income.
Multiple Logistic Regression STAT E-150 Statistical Methods.
1 G Lect 7a G Lecture 7a Comparing proportions from independent samples Analysis of matched samples Small samples and 2  2 Tables Strength.
© Department of Statistics 2012 STATS 330 Lecture 22: Slide 1 Stats 330: Lecture 22.
CHAPTER 10: Logistic Regression. Binary classification Two classes Y = {0,1} Goal is to learn how to correctly classify the input into one of these two.
© Department of Statistics 2012 STATS 330 Lecture 24: Slide 1 Stats 330: Lecture 24.
Dates Presentations Wed / Fri Ex. 4, logistic regression, Monday Dec 7 th Final Tues. Dec 8 th, 3:30.
Logistic regression (when you have a binary response variable)
Chapter 9 Minitab Recipe Cards. Contingency tests Enter the data from Example 9.1 in C1, C2 and C3.
Roger B. Hammer Assistant Professor Department of Sociology Oregon State University Conducting Social Research Logistic Regression Categorical Data Analysis.
Logistic Regression and Odds Ratios Psych DeShon.
Analysis of matched data Analysis of matched data.
Instructor: R. Makoto 1richard makoto UZ Econ313 Lecture notes.
::What Was This About?::  This was a survey asking students how much they spent on their personal appearance, while asking other such important data.
Logistic Regression Jeff Witmer 30 March Categorical Response Variables Examples: Whether or not a person smokes Success of a medical treatment.
Chapter 13 Logistic regression.
A priori violations In the following cases, your data violates the normality and homoskedasticity assumption on a priori grounds: (1) count data  Poisson.
Advanced Quantitative Techniques
Binary Logistic Regression
Logistic Regression.
Notes on Logistic Regression
Advanced Quantitative Techniques
Correlation – Regression
LOGISTIC REGRESSION 1.
Regression Statistics
Logistic regression One of the most common types of modeling in the biomedical literature Especially case-control studies Used when the outcome is binary.
Introduction to logistic regression a.k.a. Varbrul
Stratified Covariate Balancing Using R
Wildlife Population Analysis What are those βs anyway?
SQL for Calculating Likelihood Ratios
SQL for Cleaning Data Farrokh Alemi, Ph.D.
Odds Ratio Interpretation Confounder  Diabetes
Farrokh Alemi, Ph.D. Revised by Akhil Anto
Logistic Regression.
Has Rate of Hospitals Exceeding National Payment for AMI Changed?
females males Analyses with discrete variables
Simple Linear Regression
Dead Patients Visiting
Introduction to Logistic Regression
Convert from Variable Character to Float
Performing a regression analysis
Kim A. Brock, PhD, Patricia A. Goldie, PhD, Kenneth M. Greenwood, PhD 
Improving Overlap Farrokh Alemi, Ph.D.
A Summary by Justin McCullick
Table 2. Regression statistics for independent and dependent variables
Logits: Odds of 1’s vs. 0’s in each Combination of Age, Education, Membership In the chapter, we shorted age to “A,” education to “E,” membership to “M,”
Presentation transcript:

Using Ordinary Regression for Logit Transformed Data Farrokh Alemi, PhD.

Binary MFH bathing_365 bladder_365 bowelincontinence_365 dressing_365 eating_365 grooming_365 toileting_365 transferring_365 walking_365 ID gender race age 0.5 0.23 0.17 0.43 0.21 0.48 0.44 0.47 0.64 1 M NULL 70 0.27 0.16 0.45 0.29 0.51 0.63 2 3 4 5 6 7 0.6 0.24 0.26 0.67 0.32 0.59 0.36 0.49 0.79 8 B 91 0.66 0.4 0.58 0.54 0.57 0.55 0.69 9 10

Probability Odds= π 1−π

Odds= π 1−π Zero to infinity Response Variable= log 𝑒 𝜋 1−𝜋

Odds= π 1−π Response Variable= log 𝑒 𝜋 1−𝜋

Ordinary Regression Logit ln 𝜋 1−𝜋 =𝑙𝑜𝑔𝑖𝑡(𝜋)= 𝛽 0 + 𝛽 1 𝑋 1 + 𝛽 2 𝑋 2 +…+ 𝛽 𝑛 𝑋 𝑛

Undo 𝛾= 𝛽 0 + 𝛽 1 𝑋 1 + 𝛽 2 𝑋 2 +…+ 𝛽 𝑛 𝑋 𝑛 𝜋 = 𝑒 𝛾 1+ 𝑒 𝛾 = 𝑒 𝛽 0 + 𝛽 1 𝑋 1 + 𝛽 2 𝑋 2 +…+ 𝛽 𝑛 𝑋 𝑛 1+ 𝑒 𝛽 0 + 𝛽 1 𝑋 1 + 𝛽 2 𝑋 2 +…+ 𝛽 𝑛 𝑋 𝑛

Drop Table #Data Declare @AvgAge as Float Set @AvgAge = (SELECT avg(age) FROM [MFH].[dbo].[DailyCost$]) SELECT distinct [MFH] , iif([bathing_365]>.5,1,0) AS Bathing ,iif([bladder_365]>.5, 1,0) as Bladder ,iif([bowelincontinence_365]>.5, 1,0) as Bowel ,iif([dressing_365]>.5, 1, 0) as Dressing ,iif([eating_365]>.5, 1, 0) as Eating ,iif([grooming_365]>.5, 1, 0) as Grooming ,iif([toileting_365]>.5, 1, 0) as Toileting ,iif([transferring_365]>.5, 1, 0) as Transferring ,iif([walking_365]>.5, 1, 0) as Walking ,iif([gender] ='F', 0, 1) AS Male ,iif([race] ='B', 1,0) as Black ,iif(race='W', 1, 0) AS White ,iif(race='NULL', 1,0) AS RaceNull ,iif(age is null, @avgAge, age) as Age , ID INTO #Data FROM [MFH].[dbo].[DailyCost$] -- (39139 row(s) affected) Prepare Data

MFH Bathing Bladder Bowel Dressing Eating Grooming Toileting Transferring Walking Male Black White RaceNull Age ID 1 72.15989 35572 35573 35574 35575 35618

Probabilities in Groups DROP TABLE #Prob SELECT CAST(Sum(MFH) as Float)/Cast(Count(distinct ID) as float) AS Prob , count(distinct id) as n , Bathing, Bladder, Bowel, Dressing, Eating, Grooming, Toileting , Transferring, Walking, Male, Black, White, RaceNull, Floor([age]/10)*10 AS Decade INTO #Prob FROM #DATA GROUP BY Bathing, Bladder, Bowel, Dressing, Eating, Grooming, Toileting , Transferring, Walking, Male, Black, White, RaceNull, Floor([age]/10)*10 Having Count(distinct ID)>9 -- (405 row(s) affected) Probabilities in Groups

Prob n Bathing Bladder Bowel Dressing Eating Grooming Toileting Transferring Walking Male Black White RaceNull Decade 0.00 10 1 40 0.06 16 50 0.42 19 60 13

Logit Prob n Bathing Bladder Bowel Dressing Eating Grooming Toileting Transferring Walking Male Black White Race Null Decade -2.3026 10 1 40 -2.7081 0.0625 16 50 -0.3185 0.42105 19 60 -2.5649 13 -2.3979 11

  Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0% Intercept -1.977 0.562 -3.517 0.000 -3.083 -0.872 Bathing -0.691 0.228 -3.031 0.003 -1.139 -0.243 Bladder 0.341 0.178 1.917 0.056 -0.009 0.691 Bowel 0.046 0.188 0.245 0.806 -0.324 0.416 Dressing -0.239 0.186 -1.285 0.200 -0.605 0.127 Eating 0.097 0.153 0.637 0.525 -0.203 0.398 Grooming 0.090 0.164 0.546 0.586 -0.233 0.412 Toileting 0.070 0.168 0.418 0.676 -0.260 0.401 Transferring 0.646 0.148 4.364 0.355 0.937 Walking -0.183 -0.917 0.360 -0.575 0.209 Male -0.230 0.326 -0.707 0.480 -0.871 0.410 Black 0.325 0.351 0.928 0.354 -0.364 1.015 White 0.157 0.334 0.470 0.638 -0.499 0.813 RaceNull -0.095 0.362 -0.263 0.793 -0.807 0.617 Decade 0.005 0.977 0.329 -0.005 0.016