Practical GLM Analysis of Homeowners David Cummings State Farm Insurance Companies.

Slides:



Advertisements
Similar presentations
Workshop in R & GLMs: #3 Diane Srivastava University of British Columbia
Advertisements

1 General Iteration Algorithms by Luyang Fu, Ph. D., State Auto Insurance Company Cheng-sheng Peter Wu, FCAS, ASA, MAAA, Deloitte Consulting LLP 2007 CAS.
Copula Regression By Rahul A. Parsa Drake University &
1 1 Chapter 5: Multiple Regression 5.1 Fitting a Multiple Regression Model 5.2 Fitting a Multiple Regression Model with Interactions 5.3 Generating and.
Non-life insurance mathematics Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring.
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Non-life insurance mathematics Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring.
Departments of Medicine and Biostatistics
2001 CAS Seminar on Ratemaking - Las Vegas, Nevada1 CAS Seminar on Ratemaking Las Vegas, Nevada March 11-13, 2001 Fitting to Loss Distributions with Emphasis.
SOME LESSONS FROM CAPITAL MARKET HISTORY Chapter 12 1.
1 Modelling mortgage insurance as a multi-state process Greg Taylor Taylor Fry Consulting Actuaries University of Melbourne University of New South Wales.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Linear and generalised linear models
Considerations in P&C Pricing Segmentation February 25, 2015 Bob Weishaar, Ph.D., FCAS, MAAA.
7-1 Introduction The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These.
Severity Distributions for GLMs: Gamma or Lognormal? Presented by Luyang Fu, Grange Mutual Richard Moncher, Bristol West 2004 CAS Spring Meeting Colorado.
Generalized Linear Models
Commercial Property Size of Loss Distributions Glenn Meyers Insurance Services Office, Inc. Casualty Actuaries in Reinsurance June 15, 2000 Boston, Massachusetts.
Lecture 11 Implementation Issues – Part 2. Monte Carlo Simulation An alternative approach to valuing embedded options is simulation Underlying model “simulates”
Review of Lecture Two Linear Regression Normal Equation
A Primer on the Exponential Family of Distributions David Clark & Charles Thayer American Re-Insurance GLM Call Paper
Risk Modeling of Multi-year, Multi-line Reinsurance Using Copulas
A New Exposure Base for Vehicle Service Contracts – Miles Driven CAS Ratemaking Seminar – Atlanta 2007 March 8, 2007Slide 1 Discussion Paper Presentation.
Incorporating Catastrophe Models in Property Ratemaking Prop-8 Jeffrey F. McCarty, FCAS, MAAA State Farm Fire and Casualty Company 2000 Seminar on Ratemaking.
Random Sampling, Point Estimation and Maximum Likelihood.
Intensive Actuarial Training for Bulgaria January 2007 Lecture 5 – General Insurance Overview and Pricing By Michael Sze, PhD, FSA, CFA.
Generalized Minimum Bias Models
CAS Seminar on Ratemaking Introduction to Ratemaking Relativities March 17-18, 2008 Royal Sonesta Hotel Boston, Mass. Presented by: Michael J. Miller,
Basic Ratemaking Workshop: Intro to Increased Limit Factors Jared Smollik FCAS, MAAA, CPCU Increased Limits & Rating Plans Division, ISO March 19, 2012.
Practical GLM Modeling of Deductibles
Excepted from HSRP 734: Advanced Statistical Methods June 5, 2008.
Workers’ Compensation Managed Care Pricing Considerations Prepared By: Brian Z. Brown, F.C.A.S., M.A.A.A. Lori E. Stoeberl, A.C.A.S., M.A.A.A. SESSION:
7-1 Introduction The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These.
The Common Shock Model for Correlations Between Lines of Insurance
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Generalized Linear Models All the regression models treated so far have common structure. This structure can be split up into two parts: The random part:
2007 CAS Predictive Modeling Seminar Estimating Loss Costs at the Address Level Glenn Meyers ISO Innovative Analytics.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
1 Multiple Regression A single numerical response variable, Y. Multiple numerical explanatory variables, X 1, X 2,…, X k.
1 GLM I: Introduction to Generalized Linear Models By Curtis Gary Dean Distinguished Professor of Actuarial Science Ball State University By Curtis Gary.
Institute for Mathematics and Its Applications
WP 19 Assessment of Statistical Disclosure Control Methods for the 2001 UK Census Natalie Shlomo University of Southampton Office for National Statistics.
Negative Binomial Regression NASCAR Lead Changes
Comparing Counts.  A test of whether the distribution of counts in one categorical variable matches the distribution predicted by a model is called a.
© 2012 Towers Watson. All rights reserved. GLM II Basic Modeling Strategy 2012 CAS Ratemaking and Product Management Seminar by Len Llaguno March 20, 2012.
CAS Seminar on Ratemaking Introduction to Ratemaking Relativities (INT - 3) March 11, 2004 Wyndham Franklin Plaza Hotel Philadelphia, Pennsylvania Presented.
Modeling the Loss Process for Medical Malpractice Bill Faltas GE Insurance Solutions CAS Special Interest Seminar … Predictive Modeling “GLM and the Medical.
Introduction to logistic regression and Generalized Linear Models July 14, 2011 Introduction to Statistical Measurement and Modeling Karen Bandeen-Roche,
America CAS Seminar on Ratemaking March 2005 Presented by: Serhat Guven An Introduction to GLM Theory Refinements.
Privileged & Confidential Frequency and Severity vs. Loss Cost Modeling CAS 2012 Ratemaking and Product Management Seminar March 2012 Philadelphia, PA.
On Predictive Modeling for Claim Severity Paper in Spring 2005 CAS Forum Glenn Meyers ISO Innovative Analytics Predictive Modeling Seminar September 19,
Glenn Meyers ISO Innovative Analytics 2007 CAS Annual Meeting Estimating Loss Cost at the Address Level.
Personal Lines Actuarial Research Department Generalized Linear Models CAGNY Wednesday, November 28, 2001 Keith D. Holler Ph.D., FCAS, ASA, ARM, MAAA.
Tutorial I: Missing Value Analysis
Personal Lines Actuarial Research Department Generalized Linear Models CAS - Boston Monday, November 11, 2002 Keith D. Holler Ph.D., FCAS, ASA, ARM, MAAA.
Capital Allocation for Property-Casualty Insurers: A Catastrophe Reinsurance Application CAS Reinsurance Seminar June 6-8, 1999 Robert P. Butsic Fireman’s.
Week 7: General linear models Overview Questions from last week What are general linear models? Discussion of the 3 articles.
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
Missing data: Why you should care about it and what to do about it
Negative Binomial Regression
2003 Fall CAS Meeting November 11, 2003 Robert J. Walling
2000 CAS RATEMAKING SEMINAR
Cost of Capital Issues April 16, 2002 John J. Kollar.
Generalized Linear Models
Managing Underwriting Risk & Capital
Quantitative Methods What lies beyond?.
Quantitative Methods What lies beyond?.
Significant models of claim number Introduction
Statistical Thinking and Applications
Generalized Linear Models
Presentation transcript:

Practical GLM Analysis of Homeowners David Cummings State Farm Insurance Companies

Overview How are GLMs different? –Practical Implications Modeling Deductibles in GLMs

How are GLMs different? Multivariate Analysis Statistical Framework Flexible Modeling Tool

Multivariate Analysis Multivariate analyses reduce bias Practical implications –Requires analysis of all rate factors –Ensures consistency in analysis –May change your processes

Consistent Analysis Consistent Exposure Base

Statistical Framework Enhances the analysis Practical Implications –Re-learn hypothesis testing and analysis of standard errors –Different application of “Credibility”

Flexible Modeling Tool Allows for many analyses Practical Implications –Freq/Severity vs. Pure Premium vs. Loss Ratio –Design an analysis process –Easily accommodates new data –Fight the urge to overanalyze

Modeling Deductibles Traditional Deductible Analyses GLM Approaches to Deductibles Tests on simulated data

Empirical Method All losses at $500 deductible $1,000,000 Losses eliminated by $1000 deductible $ 100,000 Loss Elimination Ratio 10%

Empirical Method Pros –Simple Cons –Need credible data at low deductible –No $1000 deductible data is used to price the $1000 deductible

Loss Distribution Method Fit a severity distribution to data

Loss Distribution Method Fit a severity distribution to data Calculate expected value of truncated distribution

Loss Distribution Method Pros –Provides framework to relate data at different deductibles –Direct calculation for any deductible Cons –Need to reflect other rating factors –Framework may be too rigid

Complications Deductible truncation is not clean “Pseudo-deductible” effect –Due to claims awareness/self-selection –May be difficult to detect in severity distribution

GLM Modeling Approaches 1.Fit severity distribution using other rating variables 2.Use deductible as a variable in severity/frequency models 3.Use deductible as a variable in pure premium model

GLM Approach 1 – Fit Distribution w/ variables Fit a severity model Linear predictor relates to untruncated mean Maximum likelihood estimation adjusted for truncation Reference: –Guiahi, “Fitting Loss Distributions with Emphasis on Rating Variables”, CAS Winter Forum, 2001

GLM Approach 1 – Fit Distribution w/ variables X = untruncated random variable ~ Gamma Y = loss data, net of deductible d

GLM Approach 1 – Fit Distribution w/ variables Pros –Applies GLM within framework –Directly models truncation Cons –Non-standard GLM application –Difficult to adapt to rate plan –No frequency data used in model

Practical Issues No standard statistical software –Complicates analysis –Less computationally efficient Not a member of Exponential Family of distributions

Practical Issues No clear translation into a rate plan –Deductible effect depends on mean –Mean depends on all other variables –Deductible effect varies by other variables

Practical Issues No use of frequency information –Frequency effects derived from severity fit –Loss of information

GLM Approach 2 -- Frequency/Severity Model Standard GLM approach Fit separate frequency and severity models Use deductible as independent variable

Pros –Utilizes standard GLM packages –Incorporates deductible effects on frequency and severity –Allows model forms that fit rate plan Cons –Potential inconsistency of models –Specification of deductible effects GLM Approach 2 -- Frequency/Severity Model

Test Data Simulated Data –1,000,000 policies –80,000 claims Risk Characteristics –Amount of Insurance –Deductible –Construction –Alarm System Gamma Severity Distribution Poisson Frequency Distribution

Conclusions from Test Data – Frequency/Severity Models Deductible as categorical variable –Good overall fit –Highly variable estimates for higher or less common deductibles –When amount effect is incorrect, interaction term improves model fit

Severity Relativities Using Categorical Variable

Conclusions from Test Data – Frequency/Severity Models Deductible as continuous variable –Transformations with best likelihood Ratio of deductible to coverage amount Log of deductible –Interaction terms with amount improve model fit –Carefully examine the results for inconsistencies

Frequency Relativities

Severity Relativities

Pure Premium Relativities

GLM Approach 3 – Pure Premium Model Fit pure premium model using Tweedie distribution Use deductible as independent variable

GLM Approach 3 – Pure Premium Model Pros –Incorporates frequency and severity effects simultaneously –Ensures consistency –Analogous to Empirical LER Cons –Specification of deductible effects

Conclusions from Test Data – Pure Premium Models Deductible as categorical variable –Good overall fit –Some highly variable estimates Good fit with some continuous transforms –Can avoid inconsistencies with good choice of transform

Extension of GLM – Dispersion Modeling Double GLM Iteratively fit two models –Mean model fit to data –Dispersion model fit to residuals Reference Smyth, Jørgensen, “Fitting Tweedie’s Compound Poisson Model to Insurance Claims Data: Dispersion Modeling,” ASTIN Bulletin, 32:

Double GLM in Modeling Deductibles Gamma distribution assumes that variance is proportional to µ 2 Deductible effect on severity –Mean increases –Variance increases more gradually Double GLM significantly improves model fit on Test Data –More significant than interactions

Pure Premium Relativities Tweedie Model – $500,000 Coverage Amount

Conclusion Deductible modeling is difficult Tweedie model with Double GLM seems to be the best approach Categorical vs. Continuous –Need to compare various models Interaction terms may be important