Optimal scaling for a logistic regression model with ordinal covariates Sanne JW Willems, Marta Fiocco, and Jacqueline J Meulman Leiden University & Stanford.

Slides:



Advertisements
Similar presentations
The Software Infrastructure for Electronic Commerce Databases and Data Mining Lecture 4: An Introduction To Data Mining (II) Johannes Gehrke
Advertisements

Workshop in R & GLMs: #3 Diane Srivastava University of British Columbia
Additive Models, Trees, etc. Based in part on Chapter 9 of Hastie, Tibshirani, and Friedman David Madigan.
Exploring the Shape of the Dose-Response Function.
- Word counts - Speech error counts - Metaphor counts - Active construction counts Moving further Categorical count data.
Extension The General Linear Model with Categorical Predictors.
Logistic Regression.
Week 3. Logistic Regression Overview and applications Additional issues Select Inputs Optimize complexity Transforming Inputs.
Chapter 17 Overview of Multivariate Analysis Methods
CES 514 – Data Mining Lecture 8 classification (contd…)
Introduction to Logistic Regression. Simple linear regression Table 1 Age and systolic blood pressure (SBP) among 33 adult women.
Kernel Methods and SVM’s. Predictive Modeling Goal: learn a mapping: y = f(x;  ) Need: 1. A model structure 2. A score function 3. An optimization strategy.
Lecture outline Support vector machines. Support Vector Machines Find a linear hyperplane (decision boundary) that will separate the data.
Jul-15H.S.1 Linear Regression Hein Stigum Presentation, data and programs at:
The Design and Analysis of Algorithms
1 Linear Classification Problem Two approaches: -Fisher’s Linear Discriminant Analysis -Logistic regression model.
Review Rong Jin. Comparison of Different Classification Models  The goal of all classifiers Predicating class label y for an input x Estimate p(y|x)
Review of Lecture Two Linear Regression Normal Equation
Midterm Review. 1-Intro Data Mining vs. Statistics –Predictive v. experimental; hypotheses vs data-driven Different types of data Data Mining pitfalls.
MODELS OF QUALITATIVE CHOICE by Bambang Juanda.  Models in which the dependent variable involves two ore more qualitative choices.  Valuable for the.
Ch. Eick: Support Vector Machines: The Main Ideas Reading Material Support Vector Machines: 1.Textbook 2. First 3 columns of Smola/Schönkopf article on.
Simple Linear Regression
Biostatistics Case Studies 2015 Youngju Pak, PhD. Biostatistician Session 4: Regression Models and Multivariate Analyses.
Logistic Regression L1, L2 Norm Summary and addition to Andrew Ng’s lectures on machine learning.
ALISON BOWLING THE GENERAL LINEAR MODEL. ALTERNATIVE EXPRESSION OF THE MODEL.
Data Mining – Algorithms: Linear Models Chapter 4, Section 4.6.
Generalized Linear Models All the regression models treated so far have common structure. This structure can be split up into two parts: The random part:
The Group Lasso for Logistic Regression Lukas Meier, Sara van de Geer and Peter Bühlmann Presenter: Lu Ren ECE Dept., Duke University Sept. 19, 2008.
Danila Filipponi Simonetta Cozzi ISTAT, Italy Outlier Identification Procedures for Contingency Tables in Longitudinal Data Roma,8-11 July 2008.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Regression Models for Quantitative (Numeric) and Qualitative (Categorical) Predictors KNNL – Chapter 8.
Chapter 9 Correlational Research Designs. Correlation Acceptable terminology for the pattern of data in a correlation: *Correlation between variables.
Categorical Independent Variables STA302 Fall 2013.
In Stat-I, we described data by three different ways. Qualitative vs Quantitative Discrete vs Continuous Measurement Scales Describing Data Types.
Linear Discriminant Analysis and Logistic Regression.
© Copyright 2000, Julia Hartman 1 An Interactive Tutorial for SPSS 10.0 for Windows © Analysis of Covariance (Regression Approach) by Julia Hartman Next.
Analysis of Covariance KNNL – Chapter 22. Analysis of Covariance Goal: To Compare treatments (1-Factor or Multiple Factors) after Controlling for Numeric.
RECITATION 4 MAY 23 DPMM Splines with multiple predictors Classification and regression trees.
Correlations: Linear Relationships Data What kind of measures are used? interval, ratio nominal Correlation Analysis: Pearson’s r (ordinal scales use Spearman’s.
Chapter 11 – Neural Nets © Galit Shmueli and Peter Bruce 2010 Data Mining for Business Intelligence Shmueli, Patel & Bruce.
PREDICT 422: Practical Machine Learning
BINARY LOGISTIC REGRESSION
Chapter 7. Classification and Prediction
Logistic Regression APKC – STATS AFAC (2016).
Artificial Neural Networks
CHAPTER 7 Linear Correlation & Regression Methods
Chapter 13 Nonlinear and Multiple Regression
A Fast Trust Region Newton Method for Logistic Regression
Boosting and Additive Trees (2)
Table 1. Advantages and Disadvantages of Traditional DM/ML Methods
The Elements of Statistical Learning
Generalized Linear Models
Introduction to logistic regression a.k.a. Varbrul
Roberto Battiti, Mauro Brunato
Optimal Dynamic Treatment Regimes
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Classification of Variables
What is Regression Analysis?
Multivariate Statistics
COSC 4335: Other Classification Techniques
Soc 3306a Lecture 11: Multivariate 4
Introduction to Logistic Regression
Wrap-up and Course Review
Bayesian Linear Regression
Generally Discriminant Analysis
Logistic Regression.
Analysis of Covariance
Regression and Clinical prediction models
Memory-Based Learning Instance-Based Learning K-Nearest Neighbor
Chapter 4 –Dimension Reduction
Presentation transcript:

Optimal scaling for a logistic regression model with ordinal covariates Sanne JW Willems, Marta Fiocco, and Jacqueline J Meulman Leiden University & Stanford University s.j.w.willems@math.leidenuniv.nl www.SanneJWWillems.nl

Optimal scaling for generalized linear models with nonlinear covariates Sanne JW Willems, Marta Fiocco, and Jacqueline J Meulman Leiden University & Stanford University s.j.w.willems@math.leidenuniv.nl www.SanneJWWillems.nl

Goal Reducing linearity in Generalized Linear Models using Optimal Scaling Transformations

Generalized Linear Models Linear predictor: Link function - (nonlinear) relation between the linear predictor and the outcome:

Generalized Linear Models Nonlinear predictor: Link function - (nonlinear) relation between the linear predictor and the outcome:

Why?

Data types

Data types – Nominal Categorical Grouping

Data types – Nominal Categorical Grouping Dummy Coding

Data types – Ordinal Categorical Grouping Ordering

Data types – Ordinal Categorical Grouping Ordering Dummy Coding

Data types – Ordinal Categorical Grouping Ordering Continuous variable via integer Coding

Data types – Numeric Grouping Ordering Equal relative spacing

Data types – Numeric Grouping Ordering Equal relative spacing Continuous variable Grouping Ordering Equal relative spacing

What if the linear predictor should be nonlinear?

What if the linear predictor should be nonlinear? Keep ordinal property, but do not introduce equal relative spacing

What if the linear predictor should be nonlinear? Keep ordinal property, but do not introduce equal relative spacing Remove property of equal relative spacing

Solution: Optimal Scaling transformations Transform variables:

Solution: Optimal Scaling transformations Transform variables: Scaling levels: Nominal spline Numeric Nominal Ordinal Ordinal spline

How?

Optimal Scaling Generalized Linear Models Nonlinear predictor: Link function - (nonlinear) relation between the linear predictor and the outcome:

Algorithm

Algorithm

Optimal Scaling step Apply restrictions according to the chosen scaling level

Algorithm

Example: logistic regression

Example: logistic regression Inpatient treatment or ? Day clinic treatment

Result nominal scaling level

Result ordinal scaling level

Predictions for training data nominal vs ordinal Nominal: Ordinal: Sensitivity = 0.924 Specificity = 0.829 Efficiency (correct classification rate) = 0.880 Sensitivity = 0.918 Specificity = 0.823 Efficiency (correct classification rate) = 0.874

Predictions for training data ordinal vs numeric Ordinal: Numeric: Sensitivity = 0.918 Specificity = 0.823 Efficiency (correct classification rate) = 0.874 Sensitivity = 0.864 Specificity = 0.810 Efficiency (correct classification rate) = 0.839

Summary Optimal Scaling GLMs More flexibility by transforming variables Can be helpful when linear predictor should be nonlinear is nonlinear