Presentation is loading. Please wait.

Presentation is loading. Please wait.

2007 CAS Predictive Modeling Seminar Estimating Loss Costs at the Address Level Glenn Meyers ISO Innovative Analytics.

Similar presentations


Presentation on theme: "2007 CAS Predictive Modeling Seminar Estimating Loss Costs at the Address Level Glenn Meyers ISO Innovative Analytics."— Presentation transcript:

1 2007 CAS Predictive Modeling Seminar Estimating Loss Costs at the Address Level Glenn Meyers ISO Innovative Analytics

2 Territorial Ratemaking Territories should be big –Have a sufficient volume of business to make credible estimates of the losses. Territories should be small –“You live near that bad corner!” –Driving conditions vary within territory.

3 Some Environmental Features Related to Auto Accidents Proximity to Business Districts –Workplaces Busy at beginning and end of work day –Shopping Centers Always busy (especially on weekends) –Restaurants Busy at mealtimes –Schools Busy and beginning and end or school day

4 Weather –Rainfall –Temperature –Snowfall (especially in hilly areas) Traffic Density –More traffic sharing the same space increases odds of collision Others Some Environmental Features Related to Auto Accidents

5 Combining Environmental Variables at a Particular Garage Address Individually, the geographic variables have a predictable effect on accident rate and severity. Variables for a particular location could have a combination of positive and negative effects. ISO is building a model to calculate the combined effect of all variables. –Based on countrywide data – Actuarially credible

6 Environmental Model

7

8 Separate Models by Coverage –Bodily Injury Liability –No-Fault –Property Damage Liability –Collision –Comprehensive

9 Constructing the Components Frequency Model as Example

10 “Other Classifiers” reflect driver, vehicle, limits and deductibles. Model output is deployed to a base class, standard limits and deductibles.

11 Data Used in Building Model Additional Insurer Data – Development Partners –From leading insurers Third-Party Data –Traffic –Business Location –Demographic –Weather –etc Approximately 1,000 indicators

12 Environmental Module Examples Weather: –Measures of snowfall, rainfall, temperature, wind and elevation Traffic Density and Driving Patterns : –Commute patterns –Public transportation usage –Population density –Types of housing Traffic Composition –Demographic groups –Household size –Homeownership Traffic Generators –Transportation hubs –Shopping centers –Hospitals/medical centers –Entertainment districts Experience and trend: –ISO loss cost –State frequency and severity trends from ISO lost cost analysis  Comprised of over 1000 indicators

13 Modeling Techniques Employed Variable Selection – univariate analysis, transformations, known relationship to loss Sampling Sub models/data reduction – neural nets, splines, principal component analysis, variable clustering Spatial Smoothing – with parameters related to auto insurance loss patterns

14 In Depth for Weather Component Coverage Frequency Traffic Generators Traffic Composition Weather Neural Net Weather Model 1 Weather Severity Scale 1 Temperature Model Weather Summary Variables 35 Years of Weather Data Weather Severity Scale 2 Neural Net Weather Model 2 Traffic Density Experience and Trend Severity Environmental Model Loss Cost by Coverage Frequency × Severity Causes of Loss Frequency Sub Model Data Summary Variable Raw Data

15 Overall Model Diagnostics Results are preliminary Sort in order of increasing prediction –Frequency –Severity Group observations in buckets consisting of 1/100 th of the exposure Calculate bucket averages Invert the GLM link function for bucket averages and predicted value –logit for frequency –log for severity Plot predicted vs empirical

16 Overall Diagnostics - Frequency

17 Overall Diagnostics - Severity

18 Component Diagnostics Frequency Example Sort observations in order of C i Bucket as above and calculate –C ib = Average C i in bucket b –p ib = Average p i in bucket b –Partial Residuals Plot C ib vs R ib – Expect linear relationship

19 Component Diagnostics Experience and Trend

20 Component Diagnostics Traffic Composition

21 Component Diagnostics Traffic Density

22 Component Diagnostics Traffic Generators

23 Component Diagnostics Weather

24 Relativities to Current Loss Costs

25 Newark NJ Area Combined Relativity

26 Evaluating the Lift of the Environmental Model Demonstrate the ability to select the more profitable risks Demonstrate the adverse effect of competitors “skimming the cream” Calculate the “Value of Lift” statistic Once insurers see the value of lift other actions are possible –Change prices (etc)

27 Effect of Selecting Lower Relativities

28 Effect of Competitors Selecting Lower Relativities

29 Assumptions of The Formula Value of Lift (VoL) Assume a competitor comes in and takes away the business that is less than your class average. Because of adverse selection, the new loss ratio will be higher than the current loss ratio. What is the value of avoiding this fate? VoL is proportional to the difference between the new and the current loss ratio. Express the VoL as a $ per car year.

30 The VoL Formula L C = Current losses P C = Current Loss Cost L N = New losses of business remaining After adverse selection P N = New Loss Cost After adverse selection E C = Current exposure in car years

31 The VoL Formula The numerator represents $ value of the potential cost of competitors skimming the cream. Dividing by E C expresses this value as a $ value per car year.

32 Value of Lift Results

33 Customized Model  1 …  5 ≡ 1 in industry model Severity model customized similarly

34 Helpful Hints in Customization Sample records with no losses –Most records have no losses –Attach sample rate, s i, to retained records –Lore is to have equal number of loss records and no loss records in the sample. Policy exposure, t i, varies –Most are 6 month or 12 month policies Need to account for sampling and exposure in building model

35 Sampling and Exposure in Logistic Regression p i = annual probability n i = 1 if claim, 0 if not t i = policy term s i = sample rate For p i <<1

36 Sampling and Exposure in Logistic Regression Set w i = s i if n i = 1 Set w i = t i s i if n i = 0

37 Summary Model estimates loss cost as a function of business, demographic and weather conditions. Demonstrated model diagnostics Demonstrated lift Indicated how to customize the model


Download ppt "2007 CAS Predictive Modeling Seminar Estimating Loss Costs at the Address Level Glenn Meyers ISO Innovative Analytics."

Similar presentations


Ads by Google