Predictive Modeling for Disability Pricing May 13, 2009 Claim Analytics Inc. Barry Senensky FSA FCIA MAAA Jonathan Polon FSA www.claimanalytics.com.

Slides:



Advertisements
Similar presentations
Barry Senensky FSA FCIA MAAA Overview of Claim Scoring November 6, 2008.
Advertisements

Disability Insurance Predictive Modeling Applications July 30, 2009 Claim Analytics Inc. Jonathan Polon FSA
Design of Experiments Lecture I
Quantitative Research and Analytics, Proprietary and Confidential1 Ryan Michaluk
Improving Disability Claims Management with Predictive Modeling May 15, 2008 Claim Analytics Inc. Barry Senensky FSA FCIA MAAA Jonathan Polon FSA
Predictive Data Modeling A CASE STUDY FOR DATA MODELING.
Evaluating Inforce Blocks Of Disability Business With Predictive Modeling SOA Spring Health Meeting May 28, 2008 Jonathan Polon FSA
Copyright © 2014 by the American Academy of Actuaries. All Rights Reserved. Michael E. Angelina, MAAA, ACAS Price Optimization Casualty Actuarial & Statistical.
© 2014 Fair Isaac Corporation. Confidential. This presentation is provided for the recipient only and cannot be reproduced or shared without Fair Isaac.
Disability Pricing and Dental Fraud Detection: Supervised and Unsupervised Learning October 11, 2007 Jonathan Polon FSA Claim Analytics Inc.
Dental Insurance Fraud Detection With Predictive Modeling SOA Spring Health Meeting May 29, 2008 Jonathan Polon FSA
Computer vision: models, learning and inference
Discovery Through Statistics Claim Analytics Group Disability Reserving Unleash the Power of the 21 st Century Canadian Institute of Actuaries June 29.
x – independent variable (input)
Integrated Assessment of Sustainability =Economic analysis +Non-market analysis.
1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.
Predictive Modeling and Dental Fraud Detection November 22, 2007 Jonathan Polon FSA Barry Senensky FSA FCIA MAAA Claim Analytics Inc.
Ensemble Learning (2), Tree and Forest
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
2006 CAS RATEMAKING SEMINAR CONSIDERATIONS FOR SMALL BUSINESSOWNERS POLICIES (COM-3) Beth Fitzgerald, FCAS, MAAA.
Walter Hop Web-shop Order Prediction Using Machine Learning Master’s Thesis Computational Economics.
ASSURATECH Applications of Advanced Science in the New Era of Risk Management Lee Smith, FCAS, MAAA Co-author, Presenter Lilli Segre Tossani, MA, BA Co-author.
1 Quantifying the Risk of Disability and Death using Medical Claims Data (US patent 7,249,040 and patents pending) Presented at: The National Predictive.
ACCT: 742-Advanced Auditing
2006 General Meeting Assemblée générale 2006 Chicago, Illinois 2006 General Meeting Assemblée générale 2006 Chicago, Illinois Canadian Institute of Actuaries.
EMIS 8381 – Spring Netflix and Your Next Movie Night Nonlinear Programming Ron Andrews EMIS 8381.
Introduction to machine learning and data mining 1 iCSC2014, Juan López González, University of Oviedo Introduction to machine learning Juan López González.
Solvency II Open Forum 4 th March 2008 Michael Aitchison.
Direct Message Passing for Hybrid Bayesian Networks Wei Sun, PhD Assistant Research Professor SFL, C4I Center, SEOR Dept. George Mason University, 2009.
Stochastic Subgradient Approach for Solving Linear Support Vector Machines Jan Rupnik Jozef Stefan Institute.
Integrating the Broad Range Applications of Predictive Modeling in a Competitive Market Environment Jun Yan Mo Mosud Cheng-sheng Peter Wu 2008 CAS Spring.
2007 CAS Predictive Modeling Seminar Estimating Loss Costs at the Address Level Glenn Meyers ISO Innovative Analytics.
Ensemble Methods: Bagging and Boosting
ECE 8443 – Pattern Recognition LECTURE 10: HETEROSCEDASTIC LINEAR DISCRIMINANT ANALYSIS AND INDEPENDENT COMPONENT ANALYSIS Objectives: Generalization of.
IMPROVING ACTUARIAL RESERVE ANALYSIS THROUGH CLAIM-LEVEL PREDICTIVE ANALYTICS 1 Presenter: Chris Gross.
1 Some more examples Client satisfaction Products sold Trusted advisor score Net growth TOP PERFORMERS Age diversity HIGH Credibility HIGH Absenteeism.
Big data Usman Roshan CS 675. Big data Typically refers to datasets with very large number of instances (rows) as opposed to attributes (columns). Data.
Final Exam Review CS479/679 Pattern Recognition Dr. George Bebis 1.
Classification Ensemble Methods 1
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 12: Advanced Discriminant Analysis Objectives:
Chapter 2-OPTIMIZATION
Classification and Prediction: Ensemble Methods Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
Special Challenges With Large Data Mining Projects CAS PREDICTIVE MODELING SEMINAR Beth Fitzgerald ISO October 2006.
Exploring Policyholder Behavior in the Extreme Tail Yuhong (Jason) Xue, FSA MAAA.
Evaluating scientific software. Choosing appropriate software. Options. 1. Create your own from scratch. 2. Create your own with key bits from well-developed,
Risk Solutions & Research © Copyright IBM Corporation 2005 Default Risk Modelling : Decision Tree Versus Logistic Regression Dr.Satchidananda S Sogala,Ph.D.,
SOLUCIA, INC. 1 An Actuarial Perspective on Disease Management ROI Measurement May 10, 2006.
LOAD FORECASTING. - ELECTRICAL LOAD FORECASTING IS THE ESTIMATION FOR FUTURE LOAD BY AN INDUSTRY OR UTILITY COMPANY - IT HAS MANY APPLICATIONS INCLUDING.
WEEK 2 SOFT COMPUTING & MACHINE LEARNING YOSI KRISTIAN Gradient Descent for Linear Regression.
GroupRe Plus Serving the Reinsurance Needs of Smaller Insurance Companies Robert Lumia, FSA, MAAA Assistant Vice President & Director Group Reinsurance.
Statistics in Insurance Business
Combining Models Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya.
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Transformation: Normalization
Introduction to Machine Learning and Tree Based Methods
LECTURE 11: Advanced Discriminant Analysis
Data Mining CAS 2004 Ratemaking Seminar Philadelphia, Pa.
Boosting and Additive Trees (2)
Table 1. Advantages and Disadvantages of Traditional DM/ML Methods
Machine Learning Basics
Jan Rupnik Jozef Stefan Institute
2006 General Meeting Assemblée générale 2006 Chicago, Illinois
Dr. Morgan C. Wang Department of Statistics
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Analysis for Predicting the Selling Price of Apartments Pratik Nikte
Multiple features Linear Regression with multiple variables
Multiple features Linear Regression with multiple variables
Presentation transcript:

Predictive Modeling for Disability Pricing May 13, 2009 Claim Analytics Inc. Barry Senensky FSA FCIA MAAA Jonathan Polon FSA

About Claim Analytics Introduction to Predictive Modeling Disability Pricing Discussion Agenda

About Us Founded in 2001 by two actuaries Objective: Apply predictive modeling technology to insurance questions Current Disability Products Claim scoring Claim reserving Pricing Benchmarking

Clients in Canada and U.S. o Blue Cross Life o ING Employee Benefits o Lincoln Financial o Mutual of Omaha o Principal Financial o Sun Life Canada o Sun Life US o UNUM Several new product initiatives About Us (Cont’d)

Introduction to Predictive Modeling

Computer Performance MeasureIBM 7094 c Laptop c Change Processor Speed (MIPS).253,00012,000-fold increase Main Memory 144 KB4,000,000 KB28,000-fold increase Approx. Cost ($2008) $11,000,000$2,0005,500-fold decrease

What is a Predictive Model A Predictive Model is a model which is created or chosen to try to best predict the probability of an outcome Have been around for 40+ years Harnesses power of modern computers to find hidden patterns in data Used extensively in industry Many possible uses in insurance:

About Predictive Models May be parametric… apply numerical methods to optimize parameters E.g., gradient descent, competitive learning Or non-parametric often have a decision tree form typically optimized using exhaustive search

Predictive Modeling Tools Some common techniques Generalized linear models Neural networks Genetic algorithms Random forests Stochastic gradient boosted trees Support vector machines

Why aren’t Actuaries building modern predictive models? Life Insurance Industry is conservative and slow to change Not a traditional actuarial tool The times are changing! –Especially P&C Actuaries Its only a matter of time! –It just makes too much sense! –Innumerable applications to help solve insurance problems

Disability Pricing

Traditional actuarial methods focus on one, maybe two risk factors at a time Solve for one factor, then move to the next Unable to account for correlations and interactions between rating variables Example: region and industry may be highly correlated and may interact Lots of uncertainty in rates Current Industry Approach

Quotes for Claim Analytics employee LTD benefits Uncertainty in Current Rates InsurerRate per $100 Manulife0.539 Great West Life1.110 Empire Life1.850 Sun Life1.986 No consistency between insurers Does anyone have confidence in their rates?

Ideally suited to multivariate analysis  Adjust for correlations between variables  Facilitate analysis of interaction effects  Uncover and quantify complex relationships between risk factors and claim experience  Maintain wholeness of data Improved accuracy vs traditional methods Greater confidence in rates Predictive Modeling Approach

Identified a key rating variable that was not priced for in current rates Identified and quantified two-way interaction effects between rating variables Better quantification of all effects Significant improvements compared to existing rates Recent Project Highlights

Exposure data: Census: age, gender, salary Plan features: EP, ben%, benefit period, etc Group info: SIC, region, size, etc Claim data: Policy #: link to plan features and group info Claimant info: age, gender, salary, benefit Cost estimate: PV benefits paid plus reserve Data Requirements

Apply predictive modeling to: Predict claim incidence rates Predict claim severity, conditional upon claim being made for each member of census data The Objective

Flexible, client-defined Can be the same as current Most common structure is base rates and multiplicative loadings Iterative process: Test multiple structures Test several rating variables Rate Structure

Base rates are typically a function of: age, gender, EP and max benefit period In low dimensions, with sufficient data, traditional graduation approach works well Or, predictive modeling can be used if data is sparse or heterogeneous Base Rate Approach

Generalized linear models (GLM) can be used to optimize loading factors Accurate, yet efficient  can test several combinations of rating factors Significant and insignificant rating factors are identified Interactions between rating factors can be quantified Multiplicative Loadings

Predictive modeling technique Industry standard for P&C insurance Generalization of classical linear regression Computationally efficient Performs very well for multiplicative models Generalized Linear Models

Similar to linear regression except that effects are additive on a transformed scale Transformation occurs through a link function, g(x) Log-link function results in multiplicative effects: g(x) = ln(x)  g -1 (x) = e x μ i = g -1 ( Β 1 x i1 + …+ Β p x ip ) = exp(Β 1 x i1 ) * … * exp(Β p x ip ) GLM At A Glance

Predictive modeling is a decision support tool Pure factors can be manually adjusted: Marketplace pressures Strategic considerations Cost of adjustments can be quantified The Final Rate Manual

Predictive modeling facilitates better decisions Identify rating variables that the market misprices  Where market overprices, can choose to be aggressive or to keep excess profits  Where market underprices, can choose to avoid unprofitable business or at least know the cost of writing Better business mix Greater confidence in rates supports decisions as to when and how much to discount A Marketplace Advantage

Improved accuracy and confidence Accurately account for correlations and interactions between rating variables Facilitates analysis of new rating variables General rate structure can remain the same Maintains flexibility in final rates Predictive Modeling Benefits

Discussion