Practical GLM Modeling of Deductibles

Presentation transcript:

Practical GLM Modeling of Deductibles David Cummings State Farm Insurance Companies

Overview: Traditional Deductible Analyses; GLM Approaches to Deductibles; Tests on Simulated Data

Empirical Method: All losses at $500 deductible: $1,000,000. Losses eliminated by $1000 deductible: $100,000. Loss Elimination Ratio: 10%.
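The arithmetic on this slide can be sketched in a few lines. The function name and the four claim amounts below are hypothetical illustrations, not data from the presentation; the sketch assumes each recorded loss is already net of the $500 deductible, so raising the deductible to $1000 removes at most $500 from each claim.

```python
def empirical_ler(net_losses, base_deductible, new_deductible):
    """Share of losses (net of base_deductible) removed by moving to new_deductible."""
    if new_deductible < base_deductible:
        raise ValueError("new_deductible must be at least base_deductible")
    extra = new_deductible - base_deductible
    total = sum(net_losses)
    # Each claim loses at most `extra`; claims smaller than `extra` vanish entirely.
    eliminated = sum(min(loss, extra) for loss in net_losses)
    return eliminated / total


# Hypothetical claims, each net of a $500 deductible.
claims_at_500 = [200.0, 800.0, 1500.0, 2500.0]
ler = empirical_ler(claims_at_500, 500, 1000)

# The slide's own totals: $100,000 eliminated out of $1,000,000.
slide_ler = 100_000 / 1_000_000
```

With the hypothetical claims, $1,700 of $5,000 is eliminated; the slide's totals give the 10% shown.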

Empirical Method. Pros: Simple. Cons: Need credible data at low deductible; no $1000 deductible data is used to price the $1000 deductible.

Loss Distribution Method: Fit a severity distribution to data; calculate the expected value of the truncated distribution.
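As a rough illustration of the second step, the sketch below computes a loss elimination ratio from a fitted gamma severity using the limited expected value E[min(X, d)]. The gamma choice follows the later slides; the parameter values and function names are hypothetical, and the power-series CDF is only meant for moderate values of d/scale (a library routine would normally be used instead).

```python
import math


def gamma_cdf(x, shape, scale):
    """Gamma CDF via the power series for the regularized incomplete gamma function."""
    if x <= 0:
        return 0.0
    z = x / scale
    term = 1.0 / shape
    total = term
    n = 1
    while abs(term) > 1e-15 * abs(total):
        term *= z / (shape + n)
        total += term
        n += 1
    return total * math.exp(-z + shape * math.log(z) - math.lgamma(shape))


def gamma_limited_expected_value(d, shape, scale):
    """E[min(X, d)] for X ~ Gamma(shape, scale)."""
    return (shape * scale * gamma_cdf(d, shape + 1.0, scale)
            + d * (1.0 - gamma_cdf(d, shape, scale)))


def gamma_ler(d, shape, scale):
    """Loss elimination ratio: E[min(X, d)] / E[X]."""
    return gamma_limited_expected_value(d, shape, scale) / (shape * scale)


shape, scale = 2.0, 1500.0  # hypothetical fitted parameters
ler_500 = gamma_ler(500.0, shape, scale)
ler_1000 = gamma_ler(1000.0, shape, scale)
# Indicated factor for a $1000 deductible relative to a $500 deductible.
relativity_1000_vs_500 = (1.0 - ler_1000) / (1.0 - ler_500)
```

Because the ratio is computed from the fitted curve, the same function prices any deductible, which is the "direct calculation" advantage noted on the next slide.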

Loss Distribution Method. Pros: Provides a framework to relate data at different deductibles; direct calculation for any deductible. Cons: Need to reflect other rating factors; framework may be too rigid.

Complications: Deductible truncation is not clean. “Pseudo-deductible” effect, due to claims awareness/self-selection; may be difficult to detect in the severity distribution.

GLM Modeling Approaches: (1) Fit severity distribution using other rating variables; (2) use deductible as a variable in severity/frequency models; (3) use deductible as a variable in pure premium model.

GLM Approach 1 – Fit Distribution w/ variables: Fit a severity model; linear predictor relates to untruncated mean; maximum likelihood estimation adjusted for truncation. Reference: Guiahi, “Fitting Loss Distributions with Emphasis on Rating Variables”, CAS Winter Forum, 2001

GLM Approach 1 – Fit Distribution w/ variables: X = untruncated random variable ~ Gamma; Y = loss data, net of deductible d
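The formula from this slide is not present in the transcript. Under the usual convention that a claim is recorded only when X exceeds d and is then recorded net of d (an assumption here, not something the transcript confirms), the density of Y and the truncation-adjusted log-likelihood would take the following form. The log link and the symbols \beta, \alpha, x_i are illustrative choices.

```latex
f_Y(y \mid d) = \frac{f_X(y + d)}{1 - F_X(d)}, \qquad y > 0

\ell(\beta, \alpha) = \sum_i \Bigl[ \log f_X(y_i + d_i;\, \mu_i, \alpha)
  - \log\bigl(1 - F_X(d_i;\, \mu_i, \alpha)\bigr) \Bigr],
\qquad \log \mu_i = x_i^{\top} \beta
```

Here \mu_i is the mean of the untruncated X for risk i and \alpha is the gamma shape parameter, so the linear predictor describes the untruncated mean, as stated on the previous slide.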

GLM Approach 1 – Fit Distribution w/ variables. Pros: Applies GLM within framework; directly models truncation. Cons: Non-standard GLM application; difficult to adapt to rate plan; no frequency data used in model.

Practical Issues: No standard statistical software. Complicates analysis; less computationally efficient; not a member of the Exponential Family of distributions.

Practical Issues: No clear translation into a rate plan. Deductible effect depends on mean; mean depends on all other variables; deductible effect varies by other variables.

Practical Issues: No use of frequency information. Frequency effects derived from severity fit; loss of information.

GLM Approach 2 -- Frequency/Severity Model: Standard GLM approach; fit separate frequency and severity models; use deductible as independent variable.

GLM Approach 2 -- Frequency/Severity Model. Pros: Utilizes standard GLM packages; incorporates deductible effects on frequency and severity; allows model forms that fit rate plan. Cons: Potential inconsistency of models; specification of deductible effects.
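A minimal sketch of what the deductible relativities look like in the simplest case. When deductible is the only (categorical) predictor, a log-link Poisson frequency model with an exposure offset and a log-link gamma severity model reproduce the observed frequency and average severity at each level, so the relativities can be computed directly. The function name and figures are hypothetical; a real analysis would include the other rating variables and use a GLM package, and the sketch assumes every level has at least one claim.

```python
from collections import defaultdict


def deductible_relativities(records, base_deductible):
    """records: (deductible, exposure, claim_count, paid_loss) tuples."""
    exposure = defaultdict(float)
    claims = defaultdict(float)
    losses = defaultdict(float)
    for deductible, expo, count, loss in records:
        exposure[deductible] += expo
        claims[deductible] += count
        losses[deductible] += loss

    # One-factor fitted values: observed frequency and mean severity per level.
    frequency = {d: claims[d] / exposure[d] for d in exposure}
    severity = {d: losses[d] / claims[d] for d in exposure}
    base_f, base_s = frequency[base_deductible], severity[base_deductible]
    return {
        d: {
            "frequency": frequency[d] / base_f,
            "severity": severity[d] / base_s,
            "pure_premium": (frequency[d] * severity[d]) / (base_f * base_s),
        }
        for d in exposure
    }


# Hypothetical summary data: (deductible, exposure, claims, paid losses).
data = [(500, 1000.0, 100, 200_000.0), (1000, 500.0, 40, 100_000.0)]
rel = deductible_relativities(data, base_deductible=500)
```

In this made-up data the higher deductible lowers frequency (0.8) and raises average paid severity (1.25), and the two effects offset in the pure premium. Fitting the two pieces separately gives no guarantee that their product behaves sensibly, which is the "potential inconsistency" listed among the cons.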

Test Data: Simulated data; 1,000,000 policies; 80,000 claims. Risk Characteristics: Amount of Insurance, Deductible, Construction, Alarm System. Gamma severity distribution; Poisson frequency distribution.
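The presentation does not give the simulation parameters, so the following is only a sketch of how such test data could be generated with Poisson claim counts and gamma ground-up losses. The deductible options and gamma parameters are hypothetical, the 8% frequency is taken from the 80,000 claims per 1,000,000 policies and treated as a ground-up frequency (an assumption), and the other risk characteristics are omitted for brevity.

```python
import math
import random

DEDUCTIBLES = (250, 500, 1000, 2500)    # hypothetical deductible options
CLAIM_FREQUENCY = 0.08                   # assumed ground-up claims per policy
GAMMA_SHAPE, GAMMA_SCALE = 0.5, 8000.0   # hypothetical severity parameters


def poisson_draw(rng, mean):
    """Poisson variate by Knuth's multiplication method (fine for small means)."""
    threshold = math.exp(-mean)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def simulate_policies(n_policies, seed=0):
    """Return one dict per policy with its deductible and net claim payments."""
    rng = random.Random(seed)
    policies = []
    for _ in range(n_policies):
        deductible = rng.choice(DEDUCTIBLES)
        payments = []
        for _ in range(poisson_draw(rng, CLAIM_FREQUENCY)):
            ground_up = rng.gammavariate(GAMMA_SHAPE, GAMMA_SCALE)
            if ground_up > deductible:  # smaller losses are never reported
                payments.append(ground_up - deductible)
        policies.append({"deductible": deductible, "payments": payments})
    return policies


sample = simulate_policies(10_000, seed=1)
```

Because losses below the deductible are dropped and the rest are recorded net, the simulated data shows deductible effects on both observed frequency and observed severity.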

Conclusions from Test Data – Frequency/Severity Models. Deductible as categorical variable: Good overall fit; highly variable estimates for higher or less common deductibles; when amount effect is incorrect, interaction term improves model fit.

Severity Relativities Using Categorical Variable

Conclusions from Test Data – Frequency/Severity Models. Deductible as continuous variable: Transformations with best likelihood: ratio of deductible to coverage amount; log of deductible. Interaction terms with amount improve model fit. Carefully examine the results for inconsistencies.
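The two transformations named on this slide are simple to construct as model covariates. The helper below is a hypothetical illustration, assuming a strictly positive deductible and coverage amount; the dictionary keys are invented names.

```python
import math


def deductible_transforms(deductible, coverage_amount):
    """Continuous deductible covariates: log of deductible and deductible/amount ratio."""
    if deductible <= 0 or coverage_amount <= 0:
        raise ValueError("deductible and coverage_amount must be positive")
    return {
        "log_deductible": math.log(deductible),
        "deductible_ratio": deductible / coverage_amount,
    }


features = deductible_transforms(1000.0, 200_000.0)
```

With a log link, a coefficient b on log_deductible implies a relativity proportional to deductible raised to the power b, so the fitted curve is smooth across deductible levels rather than estimated level by level.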

Frequency Relativities

Severity Relativities

Pure Premium Relativities

GLM Approach 3 – Pure Premium Model: Fit pure premium model using Tweedie distribution; use deductible as independent variable.
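For reference, a pure premium model of this kind can be written as follows. The log link, the exposure weights w_i, and the deductible term f(d_i) are illustrative assumptions; the transcript does not state the exact specification used.

```latex
\mathrm{E}[Y_i] = \mu_i, \qquad
\mathrm{Var}(Y_i) = \frac{\phi\, \mu_i^{\,p}}{w_i}, \quad 1 < p < 2, \qquad
\log \mu_i = x_i^{\top} \beta + f(d_i)
```

Y_i is the pure premium (loss per unit of exposure) for policy i. With 1 < p < 2 the Tweedie distribution is a compound Poisson-gamma, so it has a point mass at zero for claim-free policies. If f is a set of categorical level effects, the relativity of deductible d against a base deductible d_0 is \exp\bigl(f(d) - f(d_0)\bigr).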

GLM Approach 3 – Pure Premium Model. Pros: Incorporates frequency and severity effects simultaneously; ensures consistency; analogous to Empirical LER. Cons: Specification of deductible effects.

Conclusions from Test Data – Pure Premium Models. Deductible as categorical variable: Good overall fit; some highly variable estimates. Good fit with some continuous transforms; can avoid inconsistencies with good choice of transform.

Extension of GLM – Dispersion Modeling. Double GLM: Iteratively fit two models; mean model fit to data; dispersion model fit to residuals. Reference: Smyth, Jørgensen, “Fitting Tweedie’s Compound Poisson Model to Insurance Claims Data: Dispersion Modeling,” ASTIN Bulletin, 32:143-157
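The two linked models can be summarized as below. This is a sketch of the general double GLM structure, with notation chosen here for illustration (\delta_i for the unit deviance, z_i for the dispersion covariates) rather than taken from the slides.

```latex
\text{Mean model:} \quad g(\mu_i) = x_i^{\top} \beta, \qquad
\mathrm{Var}(y_i) = \phi_i\, V(\mu_i), \quad \text{prior weights } 1/\phi_i

\text{Dispersion model:} \quad h(\phi_i) = z_i^{\top} \gamma, \qquad
\text{response } \delta_i \text{ (unit deviance from the mean model)}, \quad
\mathrm{E}[\delta_i] \approx \phi_i
```

The dispersion model is commonly fitted as a gamma GLM with its dispersion fixed at 2, since \delta_i / \phi_i is approximately \chi^2_1. The two fits alternate: the current \phi_i weight the mean model, and its deviances update the dispersion model, until the estimates stabilize.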

Double GLM in Modeling Deductibles: Gamma distribution assumes that variance is proportional to µ². Deductible effect on severity: mean increases; variance increases more gradually. Double GLM significantly improves model fit on Test Data; more significant than interactions.

Pure Premium Relativities Tweedie Model – $500,000 Coverage Amount

Conclusion: Deductible modeling is difficult. Tweedie model with Double GLM seems to be the best approach. Categorical vs. continuous: need to compare various models. Interaction terms may be important.