1 Does Credit Score Really Help Explain Insurance Losses? Cheng-Sheng Peter Wu, FCAS, ASA, MAAA, Jim Guszcza, ACAS, MAAA, Ph. D.

Slides:



Advertisements
Similar presentations
1 General Iteration Algorithms by Luyang Fu, Ph. D., State Auto Insurance Company Cheng-sheng Peter Wu, FCAS, ASA, MAAA, Deloitte Consulting LLP 2007 CAS.
Advertisements

CAS Seminar on Ratemaking Introduction to Ratemaking Relativities March 13-14, 2006 Salt Lake City Marriott Salt Lake City, Utah Presented by: Brian M.
Reliability and Validity
PERSONAL LINES RATEMAKING – WHAT’S DOWN THE ROAD? Midwest Actuarial Forum – September 21, 2007 Jeffrey L. Kucera, FCAS, MAAA – Sr. Consultant EMB America.
Predictive Data Modeling A CASE STUDY FOR DATA MODELING.
Tim Rozar FSA, MAAA, CERA Derek Kueker FSA Lapse and Mortality of Post-Level Period Term Plans International Actuarial Association.
Structuring the argument of a theoretical paper For bachelor’s theses and master’s seminars in social sciences and humanities Richard Parncutt, Uni Graz.
Slides to accompany Weathington, Cunningham & Pittenger (2010), Chapter 4: An Overview of Empirical Methods 1.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.4 Cautions in Analyzing.
Good Research Questions. A paradigm consists of – a set of fundamental theoretical assumptions that the members of the scientific community accept as.
Statistical Methods Chichang Jou Tamkang University.
Considerations in P&C Pricing Segmentation February 25, 2015 Bob Weishaar, Ph.D., FCAS, MAAA.
1 BA 555 Practical Business Analysis Review of Statistics Confidence Interval Estimation Hypothesis Testing Linear Regression Analysis Introduction Case.
Brown, Suter, and Churchill Basic Marketing Research (8 th Edition) © 2014 CENGAGE Learning Basic Marketing Research Customer Insights and Managerial Action.
Practical Application of Retention Modeling Chuck Boucek, FCAS e.
10. Introduction to Multivariate Relationships Bivariate analyses are informative, but we usually need to take into account many variables. Many explanatory.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Credit Scoring Beyond the Numbers MASFAA Conference 2006.
Project Management For Class Plan Projects CAS Special Interest Seminar on Predictive Modeling October 11-12, 2007 Jonathan White.
Sapient Insurance Partners. Overview & Services We have almost four decades of combined experience in the property & casualty insurance and reinsurance.
Application of SAS®! Enterprise Miner™ in Credit Risk Analytics
Demand Modeling to Price Optimization
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 6 – Multiple comparisons, non-normality, outliers Marshall.
Proprietary & Confidential 1 Product Development Workshop Part 7: Product Monitoring/Risk Management 2012 CAS Ratemaking and Product Management Seminar.
 Several years ago, a major P&C insurer established key business goal Significantly enhance approach to writing Small Commercial  Product / process.
Sociological Research Methods and Techniques
Visual Analytics University of Texas – Pan American CSCI 6361, Spring 2014 From Stasko, 2013.
© Deloitte Consulting, 2005 Predictive Modeling – Panacea or Placebo? Cheng-Sheng Peter Wu, FCAS, ASA, MAAA CAS 2005 Spring Meeting Scottsdale, AZ May.
A View Inside the “Black Box”: A Review and Analysis of Personal Lines Insurance Credit Scoring Models Filed in the State of Virginia By Cheng-sheng Peter.
Philosophy of IR Evaluation Ellen Voorhees. NIST Evaluation: How well does system meet information need? System evaluation: how good are document rankings?
Travelers Analytics: U of M Stats 8053 Insurance Modeling Problem
© Deloitte Consulting, 2004 Introduction to Data Mining James Guszcza, FCAS, MAAA CAS 2004 Ratemaking Seminar Philadelphia March 11-12, 2004.
Generalized Minimum Bias Models
CHAPTER 6, INDEXES, SCALES, AND TYPOLOGIES
CHAPTER 14 MULTIPLE REGRESSION
Chapter Fourteen Statistical Analysis Procedures Statistical procedures that simultaneously analyze multiple measurements on each individual or.
@ Hanover Insurance Group: Catherine Eska 1 FROM CLASS TO INDIVIDUAL RATING CAS Predictive Modeling Seminar October 4 th, 5 th 2006 Data Challenges and.
Correlational Research Chapter Fifteen Bring Schraw et al.
Data Mining By : Tung, Sze Ming ( Leo ) CS 157B. Definition A class of database application that analyze data in a database using tools which look for.
Course on Professionalism Statement of Principles.
Data Quality & dissemination D. Sahoo Dy. Director General Central Statistical Organization, India.
The Actuary’s Role in the Audit Process The Actuary’s Responsibility to Auditors and Examiners CLRS – September 9 th, 2003 Matt Carrier, ACAS, MAAA Senior.
© Deloitte Consulting, 2004 Alternatives to Credit Scoring in Insurance James Guszcza, FCAS, MAAA Cheng-Sheng Peter Wu, FCAS, ASA, MAAA CAS 2004 Ratemaking.
Integrating the Broad Range Applications of Predictive Modeling in a Competitive Market Environment Jun Yan Mo Mosud Cheng-sheng Peter Wu 2008 CAS Spring.
SUVs and Automobile Insurance Costs SUV Drivers Have Different Underlying Liability Loss Costs Michael C. Dubin, FCAS, MAAA, MCA 1999 CAS Seminar on Ratemaking.
2007 CAS Predictive Modeling Seminar Estimating Loss Costs at the Address Level Glenn Meyers ISO Innovative Analytics.
© Deloitte Consulting, 2005 What To Do When You Cannot Use Credit? (Personal Lines) Cheng-Sheng Peter Wu, FCAS, ASA, MAAA CAS 2005 Special Interest Seminar.
May 18, 2004CAS Spring Meeting1 Demand Based Pricing: A Company Perspective CAS Spring Meeting May 18, 2004 Floyd M. Yager, FCAS, MAAA Allstate Insurance.
Software Product Line Material based on slides and chapter by Linda M. Northrop, SEI.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
The Practice of Statistics
Predictive Modeling for Small Commercial Risks CAS PREDICTIVE MODELING SEMINAR Beth Fitzgerald ISO October 2006.
CAS Seminar on Ratemaking Introduction to Ratemaking Relativities (INT - 3) March 11, 2004 Wyndham Franklin Plaza Hotel Philadelphia, Pennsylvania Presented.
Credit History Impact on Personal Lines Loss Experience Session CPP-49 James E. Monaghan Thurs. March 9, 2000 CAS Ratemaking Seminar.
Research Design. Selecting the Appropriate Research Design A research design is basically a plan or strategy for conducting one’s research. It serves.
Dimension Reduction in Workers Compensation CAS predictive Modeling Seminar Louise Francis, FCAS, MAAA Francis Analytics and Actuarial Data Mining, Inc.
1 Reserving Ranges and Acceptable Deviations CANE Fall 2005 Meeting Kevin Weathers FCAS, MAAA The Hartford This document is designed for discussion purposes.
1999 CAS RATEMAKING SEMINAR PRODUCT DEVELOPMENT (MIS - 32) BETH FITZGERALD, FCAS, MAAA.
Experimental Research Methods in Language Learning Chapter 5 Validity in Experimental Research.
Glenn Meyers ISO Innovative Analytics 2007 CAS Annual Meeting Estimating Loss Cost at the Address Level.
Module III Multivariate Analysis Techniques- Framework, Factor Analysis, Cluster Analysis and Conjoint Analysis Research Report.
Multivariate Statistics Psy 524 Andrew Ainsworth.
Practical GLM Analysis of Homeowners David Cummings State Farm Insurance Companies.
Accounting Implications of Finite Reinsurance Contracts 2003 Casualty Loss Reserve Seminar Chicago, IL Session 4 – Recent Developments in Finite Reinsurance.
1 Deloitte Consulting LLP Predictive Modeling for Commercial Risks Cheng-Sheng Peter Wu, FCAS, ASA, MAAA CAS 2005 Special Interest Seminar Chicago September.
Commercial Insurance Product Development Justin VanOpdorp ACAS, MAAA GE Commercial Insurance g.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Casualty Actuarial Society Ratemaking Seminar Shantelle Thomas March 17, 2008 Allocating the Cost of Multi-State Reinsurance Contracts to Individual States.
Ryan Purdy, FCAS, MAAA Merlinos & Associates
Social Research.
Presentation transcript:

1 Does Credit Score Really Help Explain Insurance Losses? Cheng-Sheng Peter Wu, FCAS, ASA, MAAA, Jim Guszcza, ACAS, MAAA, Ph. D.

2 Themes The History What Does the Question Mean? Simpson’s Paradox - Need for Multivariate Analysis What Has Been Done So Far? Our Large-Scale Data Mining Experience Going Beyond Credit Conclusions

3 The History Pricing/Class Plans Few factors before World War II Explosion of class plan factors after the War Current class plans (Auto) – territory, driver, vehicle, loss and violation, others, tiers/company, etc. Actuarial techniques – Minimum Bias & GLM

4 The History Credit First important factor identified over the past 2 decades Composite multivariate score vs. raw credit information Introduced in late 80’s and early 90’s Viewed at first as a “secret weapon” Currently almost everyone is using it Industry scores vs. proprietary scores Quiet, confidential, controversial, black-box, …etc

5 What Does the Question Mean? Can Credit Score Really “Explain” Ins Losses? “X explains Y” Weaker than claiming that X causes Y Stronger than merely reporting that X is correlated with Y

6 What Does the Question Mean? Working Definition We say that “X helps explain Y” if: – X is correlated with Y – The correlation does not go away when other available, measurable information is introduced

7 What Does the Question Mean? Intuition Behind the Definition It might be okay for X to be a proxy for a “true” cause of Y – Testosterone level might be a true cause of auto losses…. But it’s not available – Age/Gender is a reasonable proxy It might not be okay for X to be a proxy for other available predictive information

8 What Does the Question Mean? Applying the Definition Suppose we see that credit score plays an important role in a multivariate regression equation that predicts loss ratio Then it is fair to say the credit helps explain insurance losses A multivariate study is needed

9 Simpson’s Paradox – Need for Multivariate Analysis Statistics can lie Illustrates how a univariate association can lead to a spurious conclusion The “true” explanatory factor is masked by the spurious correlation Famous example: 1973 Berkeley admissions data

10 Simpson’s Paradox – Need for Multivariate Analysis The Berkeley Example (stylized) 2200 people applied for admission 1100 men; 1100 women 210 men, 120 women were accepted. Clear-cut case of gender discrimination… …. Or is it?

11 Simpson’s Paradox – Need for Multivariate Analysis

12 Simpson’s Paradox – Need for Multivariate Analysis

13 Simpson’s Paradox – Need for Multivariate Analysis

14 What Has Been Done So Far We (actuaries) have been quiet Few published actuarial studies/opinions – NAIC/Tillinghast (1997) – Monaghan’s Study (2000) Recent/related studies – Virginia State Study (1999) – CAS Sub-Committee (2002) – Washington State Study (2003) – University of Texas Study (2003)

15 What Has Been Done So Far Relevant Actuarial/Statistical Principles Pure premium vs. loss ratio – Loss ratio studies go beyond existing rating plans, and are implicitly multivariate Independence vs. correlation – Most insurance variables are correlated Univariate vs. multivariate – Correlated variables call for multivariate studies for true answers (Simpson’s Paradox) Credibility vs. homogeneity – Studies need to be credible and representative

16 What Has Been Done So Far The Tillinghast Study 9 companies’ data, seems representative Loss ratio study No other predictive variables included in the study No detailed information given about the data Strong correlation with loss ratio, seems credible This is true, but it doesn’t answer our question and doesn’t quiet the critics

17 What Has Been Done So Far

18 What Has Been Done So Far Monaghan’s Study Loss ratio study Large amount of data – credible analysis Analyze individual credit variables as well as score Multivariate analysis – limited to score + 1 traditional rating variable at a time Shows strong correlations with loss ratio do not go away in the presence of other variables Another good step, but we can go further

19 Our Large-Scale Data Mining Experience Our Work Loss ratio studies Multiple studies - representative Large amounts of data – credible Hundreds of variables tested along with credit – truly multivariate – Policy, driver, vehicle, coverages, billing, agency, external data, synthetic, …etc. Sound actuarial and statistical model design Disciplined data mining process

20 Our Large-Scale Data Mining Experience What Have We Found Out? Credit score is always one of top variables selected for the multivariate models Credit score has among the strongest parameters and statistical measurements (t-score) – Credit’s predictive power does not go away in the truly multivariate context Removing credit score dampens the predictive power of the models

21 Our Large-Scale Data Mining Experience What Do We Conclude? We conclude that credit score bears an unambiguous relationship to insurance losses, and is not a mere proxy for other kinds of information available to insurance companies. This does not mean that credit score is the “cause” of insurance losses

22 Our Large-Scale Data Mining Experience Why Is Credit Score Correlated with Ins Losses? Beyond the scope of our work – Emphasis is not causation Plausible speculations include – Stress/planning & organization – Risk-seeking behavior – ?? Analogy: Age/Gender might be a proxy for testosterone

23 Going Beyond Credit Can We Do Well Without Credit? YES: non-credit predictive models are – Valuable alternative to credit scores – Flexible – Tailored to individual companies – Comparable predictive power to credit scores Also possible to build mixed credit/non-credit models

24 Going Beyond Credit Keys to Building Successful Non-Credit Models: Fully utilize all sources of information – Leverage company’s internal data sources – Enriched with other external data sources Use large amount of data Employ disciplined analytical process Utilize state-of-the-art modeling tools Apply multivariate methodology

25 Going Beyond Credit Advantages of Going Beyond Credit Next generation of competitive advantage More variables, more predictive power Leverages company’s internal data sources More flexibility Address regulatory issues and public concerns Expense savings Everyone gets a score (less of a “no hit” problem) More customized – less “plain vanilla” than credit score

26 Conclusions Credit works… even in a fully multivariate setting But non-credit models can work well too! What it means to us – beginning of a new era – Advances in computer technology – Advances in predictive modeling techniques – Large scale multivariate studies now practical – More external and internal info, anything else out there? – Other ways to go beyond credit?

27 Conclusions Future works on this topic Multivariate pure premium analysis would provide more insights Further study of public policy issues – WA, VA came to opposite conclusions Comparison of various existing scoring models