Generalized Linear Models

Slides:



Advertisements
Similar presentations
Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Advertisements

Copula Regression By Rahul A. Parsa Drake University &
© Department of Statistics 2012 STATS 330 Lecture 32: Slide 1 Stats 330: Lecture 32.
Brief introduction on Logistic Regression
Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
Use Macro to Enter TP Logo March 11-12, 1999, Nashville, Tennessee Mark Scully, Tillinghast-Towers Perrin The Use of Multivariate Analysis Techniques to.
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
Log-linear and logistic models
Linear and generalised linear models
Inferences About Process Quality
Regression Model Building Setting: Possibly a large set of predictor variables (including interactions). Goal: Fit a parsimonious model that explains variation.
Severity Distributions for GLMs: Gamma or Lognormal? Presented by Luyang Fu, Grange Mutual Richard Moncher, Bristol West 2004 CAS Spring Meeting Colorado.
Classification and Prediction: Regression Analysis
Generalized Linear Models
A Primer on the Exponential Family of Distributions David Clark & Charles Thayer American Re-Insurance GLM Call Paper
Inference for regression - Simple linear regression
Discriminant Function Analysis Basics Psy524 Andrew Ainsworth.
Chapter 3: Generalized Linear Models 3.1 The Generalization 3.2 Logistic Regression Revisited 3.3 Poisson Regression 1.
Statistics & Biology Shelly’s Super Happy Fun Times February 7, 2012 Will Herrick.
Introduction to Generalized Linear Models Prepared by Louise Francis Francis Analytics and Actuarial Data Mining, Inc. October 3, 2004.
Name: Angelica F. White WEMBA10. Teach students how to make sound decisions and recommendations that are based on reliable quantitative information During.
Practical GLM Modeling of Deductibles
Excepted from HSRP 734: Advanced Statistical Methods June 5, 2008.
VI. Evaluate Model Fit Basic questions that modelers must address are: How well does the model fit the data? Do changes to a model, such as reparameterization,
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
© 2012 Towers Watson. All rights reserved. GLM II Basic Modeling Strategy 2012 CAS Ratemaking and Product Management Seminar by Len Llaguno March 20, 2012.
America CAS Seminar on Ratemaking March 2005 Presented by: Serhat Guven An Introduction to GLM Theory Refinements.
Practical GLM Analysis of Homeowners David Cummings State Farm Insurance Companies.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Tutorial I: Missing Value Analysis
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.
Quantitative Methods Residual Analysis Multiple Linear Regression C.W. Jackson/B. K. Gordor.
Yandell – Econ 216 Chap 15-1 Chapter 15 Multiple Regression Model Building.
Stats Methods at IC Lecture 3: Regression.
Estimating standard error using bootstrap
Predicting Energy Consumption in Buildings using Multiple Linear Regression Introduction Linear regression is used to model energy consumption in buildings.
Exploring Group Differences
Chapter 15 Multiple Regression Model Building
GS/PPAL Section N Research Methods and Information Systems
BINARY LOGISTIC REGRESSION
Psych 231: Research Methods in Psychology
Chapter 7. Classification and Prediction
Regression Analysis AGEC 784.
Deep Feedforward Networks
Statistics made simple Dr. Jennifer Capers
Structural Equation Modeling using MPlus
Notes on Logistic Regression
Eastern Michigan University
Statistics in MSmcDESPOT
Generalized Linear Models
Log Linear Modeling of Independence
Statistical Methods For Engineers
10701 / Machine Learning Today: - Cross validation,
Pattern Recognition and Machine Learning
Psych 231: Research Methods in Psychology
OVERVIEW OF LINEAR MODELS
Principles of the Global Positioning System Lecture 11
Statistics II: An Overview of Statistics
Product moment correlation
Regression Assumptions
Inferential Statistics
Model generalization Brief summary of methods
Psych 231: Research Methods in Psychology
Psych 231: Research Methods in Psychology
Chapter 13 Additional Topics in Regression Analysis
Longitudinal Data & Mixed Effects Models
Regression Assumptions
Testing Causal Hypotheses
Presentation transcript:

Generalized Linear Models Theory vs. Practice Hannah Kaufmann Patryk Wiech Nathan Schuele March 28, 2019

Presentation Outline Why use Generalized Linear Models? Introduction to Simple Linear Regression Data Processing What is a Generalized Linear Model? Choosing a Model Validation Interpreting Results

Why do Insurers use GLMs? Efficiency Account for the relationship between variables Flexibility based on overall purpose Error statistics Accessible software Widely used in ratemaking 1. Simplifies process 2. 3. 4. Instead of analyzing only the relationship to the predictor 5. Choose distributions

Simple Linear Regression Explores the relationship between a quantitative response variable (Y) and one explanatory variable (X) Limited to a single pair of a response and an explanatory variable μ represents the true mean of the response variable y given the data from the explanatory variable x 𝜇 𝑦|𝑥 = 𝛽 0 + 𝛽 1 𝑥+ε

Linear Regression Assumptions Random Components: each component of the response vector (𝑌) is normally distributed and all share a common constant variance 𝜎 2 Systematic Components: p covariates combined to give the linear predictor 𝜂 such that: 𝜂 =𝐗∙ 𝛽 Link Function: the identity function such that: 𝐸 𝑌 ≡ 𝜇 = 𝜂

Simple Linear Regression Example

One-Way Charts One-Way charts are a good way to begin exploring your data Analyze reliability of data Loss ratios Exposure distribution Correlation issues are difficult to detect Helps with selecting a reference level and mapping data Patryk

One-Way Graph

Mapping Your Data Data can either be continuous or categorical Continuous data can be transformed to categorical data In general we want to group similar data levels within the same variable Group miles driven, Ages, etc. If we map data to too small of a group, results can be unreliable and non predictive Too many degrees of freedom may cause model to not converge If we map data to too large of a group, may miss an important factor

Selecting A Reference Level Choose a reference level that helps with the interpretability of your model In general we select the level with the most exposure to be the reference level This is done so that the significance statistics produce meaningful p values Reference level with too little data will produce less significant p values than one with more Significance statistics compare how different the level is to the base level The model needs to have a good read on what the average value is for the base level so there has to be enough data, if you use a level with only a couple values, the model will not have a good understanding of what the “average” of that level should be There needs to be confidence in both levels being reviewed

Why not Linear Regression? Ignores any interdependencies the variables have Assumes the response variable is normally distributed, has a constant variance, and that all predictors are entered additively Why Generalized Linear Models? Considers interdependencies Allows for multivariate analysis

GLM Assumptions Random Component: each component of the response variable vector ( 𝑌 ) is independent and is from one of the distributions in the exponential family Systematic Components: p covariates combined to give the linear predictor 𝜂 such that: 𝜂 =𝐗∙ 𝛽 Note: unchanged from assumption 2 on simple regression Link Function: the relationship between the random and systematic components via a link function g, that is differentiable and monotonic such that 𝐸 𝑌 ≡ 𝜇 = 𝑔 −1 ( 𝜂 ) Change

Basics of a GLM The standard form of a Generalized Linear Model = 𝑔 −1 ( 𝑗 𝑋 𝑖𝑗 ∙ 𝛽 𝑗 + 𝜉 𝑖 ) With 𝑉𝑎𝑟 𝑌 𝑖 = 𝜙∙𝑉( 𝜇 𝑖 ) ω 𝑖 𝑌 𝑖 is the response vector 𝑔(𝑥) is the link function 𝑋 𝑖𝑗 is the “design matrix” 𝛽 𝑗 is the vector of parameters 𝜉 𝑖 is the vector of offsets 𝜙 is the parameter of 𝑉(𝑋) 𝑉(𝑋) is the variance function ( 𝜎 2 ) ω 𝑖 is the prior weight

Exponential Family of Distributions Includes: Normal Poisson Binomial Gamma Inverse Gaussian Variance function depends on the distribution chosen Most distributions have a strictly increasing variance More risky policyholders are expected to have higher variance The characteristics of the raw distributions lead to choosing a specific distribution for the specific type of model; in practice gamma for severity and poisson for frequency are the most commonly used distributions Gamma – used for severity models; depending on the parameters, the results tend to have a large spike and then a long tail to the right which match the empirical distribution of claim severity Poisson – used for frequency models, to model claim counts Binomial – used for retention models, when you are trying to predict a probability, the probability that a policyholder will renew their policy When it comes to actually creating the GLM, this decision is generally made at the beginning and then assumed for the rest of the process. Since there has been so much research done, these assumptions tend to be safe to make and then focus on the individual variables and the quality of the data in order to improve the model

Choosing a Model Selecting a model is more of an art than a science As models are built and results are analyzed, the model is likely to evolve Choosing target and predictor variables Choosing distribution for the target variable Best form of predictor variables Which variables to include

Choosing a Model Compare Measures of Fit Analyzing Residuals Non Penalized Log-Likelihood Deviance Penalized AIC BIC Analyzing Residuals They follow no predictable pattern Normally distributed with constant variance (Homoscedastic) Any deviation can indicate underlying distribution is incorrect

Bias vs. Variance Trade-Off Bias: Expected Prediction – Correct Value Pay little attention to the training data, which leads to high error in both the training data and test data Variance: variability within the data Pays lots of attention to the training data which leads to overfitting in the test data

P-Values An estimate of the probability of a value of which the magnitude (or higher) arises by pure chance Example: 𝑃 𝛽 0 ≥1.5 = 𝑝 𝑣 =0.0012 Leads to the result that 𝛽 0 is significant, or that it is likely that 𝛽 0 ≠0 Example: 𝑃 𝛽 0 ≥1.5 = 𝑝 𝑣 =0.52 Leads to the result that 𝛽 0 is insignificant, or that it is likely that 𝛽 0 =0 The effect of 𝛽 0 may be present in the data set, but would need to be seen from a macro-level

P-Value Example Output directly from SAS Pr > Chi Sq (in yellow) gives the p-values The top chart

Simple Quantile Plots Validation technique to compare the actual results with the predicted results Judgment to determine “best” model Predictive accuracy, monotonicity, vertical separation Simple quantile plots are a widely used technique to visualize how “well” your model is doing by comparing the actual results to the predicted values. There are specific step by step instructions to create these graphs but the output needs to be analyzed Blue curve is the model output and the orange curve is the validation (actual) values

Analyzing Results Simplifying models Smoothing results post model Outside of model Rerunning model Communicating results to clients Business considerations Reliability of results How feasible is it to implement the rates? Smoothing results – adjust for categorical variables Business considerations Competitor variables, do you want to use it even if there is not enough data available Multi policy discount - market tends to show a much bigger discount than the model suggests. Decide between having a more spread out portfolio between mono line and mult policy. If decide not to give the bigger discount, you risk losing all the multi policies Some variables arent usuable because you can’t verify the data Ex. mileage, Customer and agent control can be ruined if incentives are offered By offering full pay, the people who wouldn’t normal chose full pay would not pick it. This changes the environment of the discount

Thank You for Your Attention Hannah Kaufmann (309) 807 2304 hkaufmann@pinnacleactuaries.com Patryk Wiech (678) 894 7263 pwiech@pinnacleactuaries.com Nathan Schuele (217) 278 9132 npschu1@ilstu.edu