America CAS Predictive Modeling Seminar September 2005 Presented by: Rich Moncher – Bristol West Tom Hettinger – EMB America Vehicle Ratemaking Vehicles.

Slides:



Advertisements
Similar presentations
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT
Advertisements

Dimension reduction (1)
Copyright © 2014 by the American Academy of Actuaries. All Rights Reserved. Michael E. Angelina, MAAA, ACAS Price Optimization Casualty Actuarial & Statistical.
Sensitivity Analysis for Observational Comparative Effectiveness Research Prepared for: Agency for Healthcare Research and Quality (AHRQ)
Assignment Four Underwriting. Definitions Underwriting – The process of selecting policyholders by recognizing and evaluation hazards, establishing prices.
BA 555 Practical Business Analysis
Who and How And How to Mess It up
Sampling.
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.
Statistics for Managers Using Microsoft® Excel 5th Edition
Methods and Philosophy of Statistical Process Control
Practical Application of Retention Modeling Chuck Boucek, FCAS e.
Severity Distributions for GLMs: Gamma or Lognormal? Presented by Luyang Fu, Grange Mutual Richard Moncher, Bristol West 2004 CAS Spring Meeting Colorado.
Slide 1 Testing Multivariate Assumptions The multivariate statistical techniques which we will cover in this class require one or more the following assumptions.
Commercial Property Size of Loss Distributions Glenn Meyers Insurance Services Office, Inc. Casualty Actuaries in Reinsurance June 15, 2000 Boston, Massachusetts.
1 CHAPTER M4 Cost Behavior © 2007 Pearson Custom Publishing.
CAS Predictive Modeling Seminar Evaluating Predictive Models Glenn Meyers ISO Innovative Analytics October 5, 2006.
Data Mining Techniques
Analysis of Real-World Data Static Stability Factor and the Risk of Rollover April 11, 2001.
THE SCIENCE OF RISK SM 1 Interaction Detection in GLM – a Case Study Chun Li, PhD ISO Innovative Analytics March 2012.
Chapter 10 Hypothesis Testing
Mining Insurance Data to Promote Traffic Safety and Better Match Rates to Risk 2002 CAS Seminar on Ratemaking Greg Hayward.
Sampling January 9, Cardinal Rule of Sampling Never sample on the dependent variable! –Example: if you are interested in studying factors that lead.
INT 506/706: Total Quality Management Introduction to Design of Experiments.
Analyzing Reliability and Validity in Outcomes Assessment (Part 1) Robert W. Lingard and Deborah K. van Alphen California State University, Northridge.
Incorporating Catastrophe Models in Property Ratemaking Prop-8 Jeffrey F. McCarty, FCAS, MAAA State Farm Fire and Casualty Company 2000 Seminar on Ratemaking.
Practical GLM Modeling of Deductibles
CAS Spring Meeting Commentary on the New Hazard Groups June 18, 2007 Jose Couret Orlando.
Experimental Design If a process is in statistical control but has poor capability it will often be necessary to reduce variability. Experimental design.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Integrating the Broad Range Applications of Predictive Modeling in a Competitive Market Environment Jun Yan Mo Mosud Cheng-sheng Peter Wu 2008 CAS Spring.
Estimating the Predictive Distribution for Loss Reserve Models Glenn Meyers Casualty Loss Reserve Seminar September 12, 2006.
2007 CAS Predictive Modeling Seminar Estimating Loss Costs at the Address Level Glenn Meyers ISO Innovative Analytics.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 16.
ECE 8443 – Pattern Recognition LECTURE 10: HETEROSCEDASTIC LINEAR DISCRIMINANT ANALYSIS AND INDEPENDENT COMPONENT ANALYSIS Objectives: Generalization of.
5-4-1 Unit 4: Sampling approaches After completing this unit you should be able to: Outline the purpose of sampling Understand key theoretical.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
CROSS-VALIDATION AND MODEL SELECTION Many Slides are from: Dr. Thomas Jensen -Expedia.com and Prof. Olga Veksler - CS Learning and Computer Vision.
Chapter 10 Verification and Validation of Simulation Models
© 2012 Towers Watson. All rights reserved. GLM II Basic Modeling Strategy 2012 CAS Ratemaking and Product Management Seminar by Len Llaguno March 20, 2012.
Pricing Excess Workers Compensation 2003 CAS Ratemaking Seminar Session REI-5 By Natalie J. Rekittke, FCAS, MAAA Midwest Employers Casualty Company.
Credit History Impact on Personal Lines Loss Experience Session CPP-49 James E. Monaghan Thurs. March 9, 2000 CAS Ratemaking Seminar.
America SWAF December 3, 2004 Presented by: Serhat Guven Dimension Reduction.
2004 CAS RATEMAKING SEMINAR INCORPORATING CATASTROPHE MODELS IN PROPERTY RATEMAKING (PL - 4) PRICING EARTHQUAKE INSURANCE DAVE BORDER, FCAS, MAAA.
America CAS Seminar on Ratemaking March 2005 Presented by: Serhat Guven An Introduction to GLM Theory Refinements.
HOUSEHOLD AVERAGING CAS Annual Meeting 2007 Alice Gannon November 2007.
Glenn Meyers ISO Innovative Analytics 2007 CAS Annual Meeting Estimating Loss Cost at the Address Level.
Chapter 14: Inference for Regression. A brief review of chapter 4... (Regression Analysis: Exploring Association BetweenVariables )  Bi-variate data.
Multivariate Analysis and Data Reduction. Multivariate Analysis Multivariate analysis tries to find patterns and relationships among multiple dependent.
1 Module One: Measurements and Uncertainties No measurement can perfectly determine the value of the quantity being measured. The uncertainty of a measurement.
Practical GLM Analysis of Homeowners David Cummings State Farm Insurance Companies.
Special Challenges With Large Data Mining Projects CAS PREDICTIVE MODELING SEMINAR Beth Fitzgerald ISO October 2006.
Tutorial I: Missing Value Analysis
The Mixed Effects Model - Introduction In many situations, one of the factors of interest will have its levels chosen because they are of specific interest.
Principal Component Analysis
Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 9 Hypothesis Testing: Single.
Building Valid, Credible & Appropriately Detailed Simulation Models
CPH Dr. Charnigo Chap. 11 Notes Figure 11.2 provides a diagram which shows, at a glance, what a neural network does. Inputs X 1, X 2,.., X P are.
Chapter 11 – With Woodruff Modications Demand Management and Forecasting Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
2000 CAS RATEMAKING SEMINAR
12 Inferential Analysis.
Similarity Modeling Chuck Boucek
Chapter 10 Verification and Validation of Simulation Models
12 Inferential Analysis.
Analytics – Statistical Approaches
Analyzing Reliability and Validity in Outcomes Assessment
Presentation transcript:

America CAS Predictive Modeling Seminar September 2005 Presented by: Rich Moncher – Bristol West Tom Hettinger – EMB America Vehicle Ratemaking Vehicles Need Class Too

2 Vehicle Ratemaking OUTLINE Background Vehicle Estimator ­ Initial Estimator ­ Diagnostics ­ Tools Vehicle Symbols Symbol Relativities Summary PURPOSE: To discuss techniques for performing vehicle symbol analysis within the context of multivariate framework, including proper tools and diagnostics Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

3

4

5

6 Potential for Adverse Selection? Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary

7 Potential for Adverse Selection? Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary

8 Potential for Adverse Selection? Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary

9 Potential for Adverse Selection? Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary

10 Vehicle Classification/Relativity Analysis Vehicle is critical as it is a major risk driver and accounts for much of the variation in rates Two elements: ­ Symbols ­ Relativities (Both in terms of Model Year and Symbol) Historically, focus on relativities and vehicle age BUT not symbols ­ Initial symbols based on limited data, competitors, bureaus, and judgment ­ Regular reviews of relativities Model Year Symbols Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary

11 Why a Symbol Review? Reduce your reliance on third parties. Produce assignments you can understand and explain internally. Remove potential bias due to inaccurate assignments. Customize to meet the experience of your book. ­ Why does one company up-charge a Ford Taurus after its initial assignment and another down-charge it? ­ Difference in underlying books? ­ Difference in methodologies? Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary

12 Why a Symbol Review? Develop better initial assignments. For example electronic stability control systems (ESCS): ­ “The safety agency credited much of the reduction in SUV rollover risk to the increasing availability of electronic stability control systems on SUVs.” – Wall Street Journal ­ “The systems are sometimes offered as standard equipment or, as an option, cost several hundred dollars.” – Wall Street Journal ­ Could two versions (one with and one without ESCS) of a new SUV that both fall into a original cost new symbol of 22 ($40K-$45K) be different risks? ­ If your experience showed ESCS enabled vehicles cost x% less to insure, how would you initially assign the symbol? Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary

13 The Analysis Use statistically credible techniques to develop the most appropriate symbol assignments and relativities. ­ Utilize GLM techniques. ­ Utilize Smoothing, Credibility Weighting, Clustering techniques. ­ Allow for User Interaction. Get the most out of the company’s own data. ­ How is the company’s vehicle experience different from other company’s or rating agencies underlying databases. ­ How can known cars’ characteristics help you understand the loss potential where data is thin. Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

14 The Analysis ????? Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

15 Issues with Classifying Vehicles High-dimensionality - Symbol analysis requires a large number of small vehicle units (VIN) as building blocks. - VINs are the building blocks of vehicle rating and have little to no experience. - Most companies only use two types of classifications for vehicles – model year and symbol. Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

16 Issues with Classifying Vehicles High correlation - Vehicles tends to be highly correlated with other rating variables (e.g., Deductible, location, age, and limit) - Multivariate framework required to handle highly correlated variables Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

17 Purpose of Predictive Modeling To predict a response variable using a series of explanatory variables (or rating factors). Dependent/Response Losses Claims Retention Independent/Predictors Age Symbols LimitModel Year TerritoryCredit Score Weights Claims Exposures Premium Statistical Model Model Results Parameters Validation Statistics Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

18 Predictive Modeling Response Variable Systematic Component Random Component = + Signal: Function of the Rating Factors/Predictors Noise: Reflects stochastic process Overall Mean “Best” Model 1 parameter for each observation Model Complexity (Number of Parameters) Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

19 Vehicle Symbol/Relativity Analysis Vehicle symbol/relativity analysis is a multi-stage process. ­ How do you isolate the signal from the data? ­ Many techniques available. Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America Vehicle Level Data - Problems - Noisy - Limited Data Initial Vehicle Estimator - Choices to be made - Raw - Standardized - Isolate the Signal - By coverage - Frequency and Severity determined separately. - Residual corrections. - Dimensionally smoothed. Final Vehicle Risk Estimator

20 Vehicle Symbol/Relativity Analysis Vehicle symbol/relativity analysis is a multi-stage process. ­ How do you group the data? ­ Many techniques available. Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America Vehicle Symbol Assignments Symbol Relativities - Estimators combined. - Overall estimators clustered to form symbols. - Relativities calculated for each symbol.

21 Determining the Vehicle Risk Estimator Variety of methods required to decipher different risk drivers by coverage. ­ For Example: Weight impacts BI/Med/Collision differently. This helps us better understand and explain differences. It can also help in creating better symbol assignments in the future. Recommend determining the estimator at the granular level (i.e., frequency/severity by coverage). Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America Initial Vehicle Estimator Final Vehicle Risk Estimator

22 Determining the Vehicle Risk Estimator Output of this stage is a risk estimator for each vehicle. Dimensional Smoothing Credibility Weighting Residual Correction TOOLS GLM Tests Hold-Out Samples Residual Analysis DIAGNOSTICS P-Values Determination of final risk estimator is an iterative process. GLM Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America Initial Vehicle Estimator Final Vehicle Risk Estimator

23 Determining the Vehicle Risk Estimator Dimensional Smoothing Credibility Weighting Residual Correction TOOLS GLM Tests Hold-Out Samples Residual Analysis DIAGNOSTICS P-Values Where do we start? Analyst has choices for initial estimator. GLM Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America Initial Vehicle Estimator Final Vehicle Risk Estimator

24

25 Observed Data Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary Given the following rating factors –Age (a) –Sex (s) –Limit (l) –Vehicle: VIN (v) Then Initial estimate for the i th VIN America

26 Standardized Observed Data Limit/Deductible Territory … Policyholder Sex Vehicle Factors Standard Policy Factors GLM Current Symbols Make Model Categories VIN Groupings Residuals Policyholder Age Data Final Vehicle Factors Include basic vehicle factors within GLM model. ­ Standard policy factors are captured correctly. Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

27

28 Data GLM Standard Policy Factors Policyholder Age Policyholder Sex … Vehicle Age Vehicle Group Vehicle Factors Body Data Performance Data Crash/Theft Data Residuals Standardized Fitted Data Final Vehicle Factors Directly model vehicle estimators within GLM. Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

29 Standardized Fitted Data with External Data Performance and body data differentiates among unique VINs. ­ Transmission, Curb Weight, Wheelbase, Power, Torque, Engine Size, speed, Braking Distance, Turning Circle, etc. As Curb Weight increases, Property Damage Severity increases. As Curb Weight increases, Collision Severity decreases. Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

30 Differentiating a VIN Origin Make Vehicle Series Body Style Engine Emission Check Figure Year Factory Code Serial Number Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

31 Differentiating a VIN Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

32 Differentiating a VIN Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

33 Standardized Fitted Data Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary Redefine the vehicle unit into meaningful concepts Age Sex Limit VIN Then Current Symbol VIN Cluster Body Data Performance Data Crash/Theft Data America

34 Determining the Vehicle Risk Estimator Variety of diagnostics can be used. Dimensional Smoothing Credibility Weighting Residual Correction TOOLS GLM Tests Hold-Out Samples Residual Analysis DIAGNOSTICS P-Values Need to determine how well the vehicle estimator is performing. GLM Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America Initial Vehicle Estimator Final Vehicle Risk Estimator

35 Hold-Out Samples Split data into “Training” and “Test”. ­ Create groupings/estimators with the “Training” data. ­ Examine “Test” data to see how well groupings perform. Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

36 P-Values p-value = probability that the modeled frequency is at least as extreme as that observed. Over-fitting Under- fitting Under the null hypothesis the p- values should be uniformly spread over [0,1]. Assume smoothed statistic is underlying frequency in each zip code. Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

37 Residual Analysis Standardize the data for all factors to see if there is any systematic residual variation. Derive the residuals for each VIN Apply Multi-dimensional smoothing methods to aid interpretation ­ Principle Components ­ Residual Scoring Looking for systematic patterns in the residuals ­ Multidimensional Residual Plots using the VIN characteristics Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

38 Determining the Vehicle Risk Estimator Tests will indicate which tools analyst should consider. Dimensional Smoothing Credibility Weighting Residual Correction TOOLS GLM Tests Hold-Out Samples Residual Analysis DIAGNOSTICS P-Values A variety of tools are needed to handle different situations. GLM Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America Initial Vehicle Estimator Final Vehicle Risk Estimator

39 Data GLM Standard Policy Factors Vehicle Factors Residuals Standardized Fitted Data Data GLM Standard Policy Factors Vehicle Factors Residuals Overall Mean “Best” Model 1 parameter for each observation Model Complexity (Number of Parameters) Underfit Overfit Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

40 Standardized Fitted Data Overall Mean “Best” Model 1 parameter for each observation Model Complexity (Number of Parameters) Underfit Enhance GLM Credibility-weight Residual Correction Overfit Revisit GLM Smoothing Residual Correction TOOLS Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

41 Dimensional Smoothing Uses knowledge of similar vehicles to enhance estimates of the underlying risk. ­ Similarity characteristics based on the parameters from the GLM Essentially applying dimension reduction techniques on the VIN characteristics to form a single continuous variable ­ Similar to scoring routines ­ Variates can then be applied to the scores to smooth the estimate Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

42 Residual Correction Factors Check residuals for underlying systematic patterns. Ideally, enhance underlying GLM to better explain data. Alternatively - Band the residuals via smoothing and clustering - Estimate a correction factor Effectively creating a new external factor to explain the vehicle residual effect Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

43 Credibility Weighting ­ May want to control the amount of credibility weighting via max/min credibility constraints. Can employ standard credibility weighting techniques. Z * Primary Estimator+ (1 – Z) * Secondary Estimator Data GLM Standard Policy Factors Vehicle Factors Residuals Data GLM Standard Policy Factors Vehicle Factors Residuals Standardized Fitted (Underfit) Standardized Observed Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

44 Determining the Vehicle Symbol Assignment Use techniques to identify similar risk estimators to be group to create a manageable number of symbol assignments. Many choices are available to do this. Let statistics help you choose. Not practical to do in a GLM/Tree/Other environment. It is impractical to have a symbol assignment for each and every vehicle. Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America Final Vehicle Risk Estimator Vehicle Symbol Assignments

45 Creating New Symbol BI Frequency BI Severity PD Frequency PD Severity Comp Frequency Comp Severity Coll Frequency Coll Severity BI Estimator PD Estimator Comp Estimator Coll Estimator Vehicle Risk Estimators clustered to form symbols. Combine component estimators to determine a risk measure for each vehicle for use in building symbols. ­ Coverage estimators can be further combined if desire 1 set of symbols for multiple coverages. Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America Vehicle Symbol Assignments

46 Clustering used to produce groupings that are predictive of the future: ­ Minimize within-group heterogeneity. ­ Maximize cross-group heterogeneity. Commonly-used clustering methods: ­ Quantiles ­ Equal Weight ­ Similarity Methods Average Linkage Centroid ­ Wards Clustering Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

47 Quantiles ­ Create groups with equal numbers of observations. Equal Weight ­ Create groups which have an equal amount of weight. Similarity Methods: ­ Rank the data set by the statistic you wish to cluster. ­ Decide on which pair of records are the ‘most similar.’ ­ Group these records. ­ Repeat until left with the desired number of groups. Wards Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary Clustering Methodologies America

48

49 Determining New Symbol Relativities GLM model fit using data grouped by new vehicle symbols. Test relativities using standard GLM tests. ­ Predictive in GLM model ­ Consistent over time in GLM model ­ Predictive when tested against other data Refine symbols/relativities as appropriate. ­ Incorporate rules-based restrictions. ­ Apply actuarial knowledge. ­ Investigate “neighbors” with very different relativities. Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America Vehicle Symbol Assignments Symbol Relativities

50 Accurate estimation of underlying risk associated with vehicle is a three stage process Vehicle Rating - Overview Step 1 Obtain a separate estimator by claim type and by frequency and severity for each VIN building block. Combine estimators, as appropriate. BI Frequency Severity Estimator PD Frequency Severity Estimator Comp Frequency Severity Estimator Coll Frequency Severity Estimator BI Estimator PD Estimator Comp Estimator Coll Estimator Vehicle Symbols Symbol Relativities Step 2 Cluster Vehicle building blocks to develop symbols separately by coverage or for several coverages combined Step 3 Determine by-coverage relativities for each symbol group Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America

51 Summary Vehicle is a major driver of risk, thus it is critical that companies review symbol assignments and relativities regularly. Issues exist that create special challenges with regards to symbol analysis. ­ High-dimensionality ­ Heavily correlated Vehicle symbol analysis requires a range of different approaches and tools (as there are different loss drivers by coverage). Diagnostics needed to ensure best model possible Background Symbol Relativities Vehicle Estimator –Initial EstimatorInitial Estimator Vehicle Symbols –DiagnosticsDiagnostics –ToolsTools Summary America