Download presentation
Presentation is loading. Please wait.
Published byDina Reed Modified over 9 years ago
1
@ Hanover Insurance Group: Catherine Eska 1 FROM CLASS TO INDIVIDUAL RATING CAS Predictive Modeling Seminar October 4 th, 5 th 2006 Data Challenges and Considerations in Building a Modeling Dataset
2
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 2 Main Topics Data Sources Dealing with Product Changes How Detailed are your losses? Target Variable Considerations
3
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 3 Internal Data Sources: Considerations How much history do you need? What is the most Complete and Accurate Data Source? Where will the model obtain data once you implement it? Need to capture and store model results post implementation.
4
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 4 Internal Data Sources: Policy / Premium Policy Processing System – Generally only has Written Premium – Most Complete data source – Where real-time scoring would happen – Available at the time the policy is quoted / issued Statistical Record – Premium and Loss Data – Only updated at certain points in time (month-end) – Some codes converted from what is entered – Some data elements may be dropped – May include Manual Policy Data
5
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 5 Internal Data Sources: Claims / Losses Statistical Record – Premium and Loss have same coding structure (Major / Minor Line, Major Peril) – Assignment of Losses to Building / Location / Vehicle may be suspect Claim System – Codes most likely follow Policy Processing System – History may not be readily available – More accurate assignment of losses – Additional Data Elements may be available
6
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 6 Internal Data Sources: Balance and Verify Most likely will get data from multiple sources Make sure you balance the data: Premium $, Loss $, Counts (Policy and Claim) Statistical Records usually most complete and accurate, so balance any data from other sources to this.
7
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 7 Internal Data Sources: Leave No Stone Unturned Billing – May be a separate system – Some Billing attributes may be captured in the Policy Processing System Agency Data – Name and Address – Year Agent Appointed – Agent Status Fraud / SIU Litigation
8
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 8 External Data Sources Vendors (Experian, ChoicePoint, D&B …) NOAA – Weather Data Census – Demographic Data WCRI / HLDI State Rating Bureaus NCCI / ISO Considerations: Cost / Appropriateness / Regulatory
9
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 9 Product Changes: Coverage Examples Rental Reimbursement Medical Payments Limit Embedded Limits: – Jewelry, Watches and Furs – Accounts Receivable – Employee Dishonesty – Building and Ordinance Broadening Endorsements
10
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 10 Product Changes: Coverage Treatment Create an indicator to determine if the limit of coverage was selected by the insured or embedded in the base policy Create a variable to represent what version of a broadening endorsement was on the policy
11
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 11 Product Changes: Other Codes change – data dictionary – statistical manual Definition changes – Age of Building versus Year Built Indivisible Premium split to separate Coverage Premiums: Summarize at policy level (lowest common denominator)
12
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 12 How Detailed are your Losses? It is generally desirable to build the model at the lowest level of detail that is accurate. – Personal Automobile – by Vehicle – Business Owner – by Building / Location – Workers Compensation – by State / Class Heavily dependent on the quality of the individual company’s data. When the detail is missing, you can get creative in your variable definitions
13
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 13 Losses should be accurate at the Policy Level You can not tie losses accurately to location / building / vehicle Create Pseudo Variables: – Highest Building Value – Lowest Building Value – Number of Buildings – Deductible associated with Highest Building Value
14
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 14 Alternatives to Policy Level: State / Class Code Level Accuracy of loss assignment to state is typically very good Accuracy of loss assignment to class should be explored with Claims Very common for Commercial Policies to have multiple states and classes on them
15
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 15 Coverage Treatment in Modeling Pure Premium models are generally built by coverage. Loss Ratio models are restricted to the level at which premiums are calculated Can create variables that are specific to coverage within the modeling dataset: Liability Limit, General Liability Class Code, Industry (SIC or NAICS code), Dogs (Y/N), Toys (Jet Ski) etc …
16
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 16 Target Variable Considerations: PRODUCT CHANGES Adjust premiums to the new rate structure – Re-Rate historical policies using current rates – On-Level Factors If you adjust the historical premiums, you must also adjust historical losses Trend Losses in the History to reflect broader coverage Levels as well as inflation Cap Losses in the Past to reflect more restrictive coverage levels (caps on replacement cost)
17
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 17 Target Variable Considerations: TREND When do you Adjust for Trend? Loss Ratio Approach Using Premium at the current rate level Select appropriate trend for losses by coverage When don’t you Adjust for Trend? Frequency/Severity Approach Include Policy Year as a Dependent Variable Is the trend implied by model reasonable?
18
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 18 Target Variable Considerations: Loss Development Claim data should be of sufficient maturity: Can you Limit dataset to Closed Claims? Chose an Age where “Pure” IBNR Claims are no longer expected Desire that future development on known claims is Minimal Balance between responsiveness and stability
19
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 19 Target Variable Considerations: Loss Development Build a GLM to model Loss Development Consider using Duration of the Claim as a Dependent Variable Extrapolate duration of open claims using a survival model Investigate other data elements available in the Claims System … Claimant Age, Gender, Litigation Status
20
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 20 Target Variable Considerations: Loss Development Options Broad Loss Development Assumptions Not great at predicting actual ultimate loss at a policy level Try to build assumptions for homogeneous groups of claims: Coverage, Program, Industry, State … If claims emergence is not complete: Earned Premium * Expected Loss Ratio * % Unreported If claims are fully reported: Loss Development Factor * Reported Loss
21
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 21 Target Variable Considerations: Loss Development Implicit approach – No adjustment Include Open/Closed Indicator in Severity Model Include Policy Year in Model and Observe Implied Policy Year Trend Project trend coefficient for newer years in setting up final model for implementation
22
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 22 Target Variable Considerations: TREND
23
Data Challenges and Considerations @ Hanover Insurance Group: Catherine Eska 23 Contact Information Catherine E. Eska, FCAS, MAAA Vice President Underwriting Analytics – Corporate Actuarial The Hanover Insurance Group 440 Lincoln Street, S457 Worcester, MA 01653 ceska@Hanover.com (508) 855-2493
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.