Binary logistic regression. Characteristic Regression model for target categorized variable explanatory variables – continuous and categorical Estimate.

Binary logistic regression

Characteristic Regression model for target categorized variable explanatory variables – continuous and categorical Estimate of probability to categorize the dependent variable Enable to interpret the solution Sensitive to multicollinearity Exacting to data preparation

Applications In general: a response model to predict the probability of response To predict the probability to lose the certain type of client To predict fraud To predict the purchase of certain goods …….

Logistic regression model I Binary dependent variable  1…event occurs  0…event does not occur P(Y=1) how depends on values of independent variables?

Logistic regression model II Formula In classic linear regression model is within (-∞; ∞) In case of binary variable than indicate probability Y=1 Probability is within 0 and 1 To express probability can not be used simple linear combination of inputs Chance P/(1-P)…interval (0;∞) Logit ln(P/(1-P)…interval (-∞; ∞) and ln(P/(1-P)…the same interval

Logistic regression model III Logit of the P value is expressed as weighted sum of values of independent variable values.

Regression relation probability chance logit Logistic function

Categorical input variable – contrasts X1X2X3 Category 1100 Category 2010 Category 3001 Category 4000 reference category Contrast type Indicator

Contrasts I Convert categorical variables to several numerical variables (for example 0-1) Create contrasts with respect to interpretation  Ordinal/nominal variables  Reference catogories are not nedeed in all cases Contrast specification does not influence prediction

Contrasts II a) Indicator – each category is 0-1 variables, last or first category is skipped b) Simple – each category (except reference category) is compared with reference category c) Repeated – each category (except first) is compared with previous category d) Difference – each category (except first) is compared with average effect of previous categories c), d) Ordinal variables

Data preparation LR is sensitive to multicollinearity  Necessary to reduce the number of variables Pay special attention to  missing values  extremes In practice are often (all) input variables categorized

Categorization of variables Possible way how to smooth extremes Categorization  Based on experts  Based on quantiles  Optimal categorization with respect to target variable Categorized variable would not be based on many categories – it causes mutual relation of variables  Merging of categories

Binary logistic regression. Characteristic Regression model for target categorized variable explanatory variables – continuous and categorical Estimate.

Similar presentations

Presentation on theme: "Binary logistic regression. Characteristic Regression model for target categorized variable explanatory variables – continuous and categorical Estimate."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Binary logistic regression. Characteristic Regression model for target categorized variable explanatory variables – continuous and categorical Estimate.

Similar presentations

Presentation on theme: "Binary logistic regression. Characteristic Regression model for target categorized variable explanatory variables – continuous and categorical Estimate."— Presentation transcript:

Similar presentations

About project

Feedback