Correlation Estimation for Property and Casualty Underwriting Losses Fred Klinker Insurance Services Office, Inc.

Slides:



Advertisements
Similar presentations
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Advertisements

Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Structural Equation Modeling
The Multiple Regression Model.
Uncertainty in fall time surrogate Prediction variance vs. data sensitivity – Non-uniform noise – Example Uncertainty in fall time data Bootstrapping.
Copyright © 2008 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics, 9e Managerial Economics Thomas Maurice.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
Point estimation, interval estimation
Topic 2: Statistical Concepts and Market Returns
BHS Methods in Behavioral Sciences I
Chapter 11 Multiple Regression.
Part II – TIME SERIES ANALYSIS C2 Simple Time Series Methods & Moving Averages © Angel A. Juan & Carles Serrat - UPC 2007/2008.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.
Lecture 17 Interaction Plots Simple Linear Regression (Chapter ) Homework 4 due Friday. JMP instructions for question are actually for.
Introduction to Regression Analysis, Chapter 13,
Slides 13b: Time-Series Models; Measuring Forecast Error
So are how the computer determines the size of the intercept and the slope respectively in an OLS regression The OLS equations give a nice, clear intuitive.
Constant process Separate signal & noise Smooth the data: Backward smoother: At any give T, replace the observation yt by a combination of observations.
Inference for regression - Simple linear regression
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Correlation and Linear Regression
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
Integrating Reserve Risk Models into Economic Capital Models Stuart White, Corporate Actuary Casualty Loss Reserve Seminar, Washington D.C September.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 6 Sampling Distributions.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
CHAPTER 14 MULTIPLE REGRESSION
Chapter 12 Examining Relationships in Quantitative Research Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Chapter Outline 3.1THE PERVASIVENESS OF RISK Risks Faced by an Automobile Manufacturer Risks Faced by Students 3.2BASIC CONCEPTS FROM PROBABILITY AND STATISTICS.
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
VI. Evaluate Model Fit Basic questions that modelers must address are: How well does the model fit the data? Do changes to a model, such as reparameterization,
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
The Common Shock Model for Correlations Between Lines of Insurance
Chap 14-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 14 Additional Topics in Regression Analysis Statistics for Business.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 19 Linear Patterns.
Copyright © 2011 Pearson Education, Inc. The Simple Regression Model Chapter 21.
Managerial Economics Demand Estimation & Forecasting.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Chapter 5 Parameter estimation. What is sample inference? Distinguish between managerial & financial accounting. Understand how managers can use accounting.
Reserve Variability – Session II: Who Is Doing What? Mark R. Shapland, FCAS, ASA, MAAA Casualty Actuarial Society Spring Meeting San Juan, Puerto Rico.
©2015 : OneBeacon Insurance Group LLC | 1 SUSAN WITCRAFT Building an Economic Capital Model
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Model Building and Model Diagnostics Chapter 15.
Ch 5-1 © 2004 Pearson Education, Inc. Pearson Prentice Hall, Pearson Education, Upper Saddle River, NJ Ostwald and McLaren / Cost Analysis and Estimating.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 10 th Edition.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Spencer M. Gluck, FCAS New York CAS Seminar on Reinsurance 2007 Hidden Risks in (Re)Insurance Systemic Risks and Accumulation: May 7, 2007.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Model Comparison. Assessing alternative models We don’t ask “Is the model right or wrong?” We ask “Do the data support a model more than a competing model?”
Chapter 15 Multiple Regression Model Building
Chapter 4: Basic Estimation Techniques
Chapter 4 Basic Estimation Techniques
Market-Risk Measurement
Chapter 14 Inference on the Least-Squares Regression Model and Multiple Regression.
Regression Analysis AGEC 784.
Basic Estimation Techniques
Chapter Outline 3.1 THE PERVASIVENESS OF RISK
Cost of Capital Issues April 16, 2002 John J. Kollar.
...Relax... 9/21/2018 ST3131, Lecture 3 ST5213 Semester II, 2000/2001
POSC 202A: Lecture Lecture: Substantive Significance, Relationship between Variables 1.
Basic Estimation Techniques
I. Statistical Tests: Why do we use them? What do they involve?
Product moment correlation
15.1 The Role of Statistics in the Research Process
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

Correlation Estimation for Property and Casualty Underwriting Losses Fred Klinker Insurance Services Office, Inc.

Mathematical vs. Physical Models for Correlation Mathematical models/ treatments: convenient and parsimonious ways of encoding what we know about correlation: simulation, Fast Fourier Transforms, copulas, etc. Physical models for the drivers of correlation that therefore capture the structure: parameter uncertainty, natural and man-made catastrophes, mass torts.

Estimation of Correlation For a number of lines of business, companies, and years, estimate expected losses or loss ratios Measure deviations of actual ultimates from these expectations Estimate correlations among these deviations as the correlations relevant to required capital

Issues Deviations about long-term means not the most relevant, because they probably include a predictable component driven by known rate and price indices, trends, knowledge of current industry competitiveness, losses emerged to date, etc. What is relevant are unpredictable deviations from expectations varying predictably over time.

Thought Experiment 1 Rose Colored Glasses Insurance Company—will probably estimate larger correlations than a company that estimates its expected losses more accurately. A cautionary conclusion—the correlations we estimate to some extent depend on how we estimate the expectations.

Thought Experiment 2: How We Might Like to Estimate Correlations Mimic P&C industry real-time forecasting: rolling one-year-ahead forecasts based on what industry would have known compared to estimated actual ultimates What we need: Multiple decade time series of loss ratios and predictors, one decade to calibrate the time series, plus more to check for time varying correlations We lack the requisite data

An Alternative Calculation One decade of data, no predictors By LOB, a generalized additive model with main effects for company and year Year effect captured by a non-parametric smoother Fitted values respond to both earlier and later years, as opposed to one-year-ahead forecasts

A Question Could the year smoother “forecast” even better than the best true one-year-ahead forecast, thereby understating deviations and covariances? Perhaps, but probably not vastly better.

A Correlation Model Based on Parameter Uncertainty From recent papers by Glenn Meyers, assuming frequency parameter uncertainty only:

where: L ijk is annual aggregate ultimate loss for LOB i, company j, and year k. δ ii´ is 1 if and only if i = i´ and 0 otherwise. Likewise for δ jj´. δ GiGi´ is 1 if and only if first and second LOBs are in the same covariance group, otherwise 0. μ i and σ i are the mean and standard deviation of the severity distribution associated with LOB i. E ijk = E[L ijk ] g i is the covariance generator associated with LOB i.

Recall the definition of covariance: Define the normalized deviation: Divide the original equation by E ijk E i’j’k to find:.

Model for Expected Losses Model loss ratios, then multiply by denominators. By LOB, a generalized additive model with main effects for company and year Year smoothing parameters chosen so that model responds to long term trends without responding much to individual year effects. Loss ratio volatility declines significantly with increasing company size; a weighted model strongly recommended.

Appearance of roughly parallel lines supports main effects model. At least for LOB 1, considerable correlated ups and downs from year to year. After visual inspection of these graphs, would not be surprised to find greater correlation for LOB 1 than for LOB 2.

Variance Model

Other Pairwise Products of Deviations

In each pairwise product, first and second deviations share common year and LOB 1, but different companies: cross-company, within-LOB correlation. Pairwise products are not independent; many share a common first or second factor. Regression line indicates modest positive correlation between first and second deviations, plus considerable noise. A visual aid only; actual inference not based on this line.

For illustrative purposes only, ignores year effects; measures deviations against decade average, separately by company. Ignoring long-term trends and patterns, probably predictable, inflates apparent correlations.

Bootstrap Estimates of Standard Errors Pairwise products of deviations not independent; can’t use the usual sqrt(n) rule. Don’t bootstrap on pairwise products directly; this destroys two-way structure of data on company and year. Bootstrap on year, take all companies. Then bootstrap on company, take all years. Combined standard error is square root of sum of squared standard errors due to year and company separately.

Representative Results

Correlation Parameter Estimates: LOB 1 Between companies: g Estimate: Standard error due to years: Standard error due to companies: Full standard error: Within company: c + g Estimate: Standard error due to years: Standard error due to companies: Full standard error:

With respect to g, standard errors due to years and companies are comparable. Estimate is more than twice the full standard error, so significant. g is the variance of a frequency multiplier acting in common across companies within LOB 1. Square root of about.05: common underlying effects have the potential to drive frequencies across companies within LOB 1up or down by 5 or 10%. Contagion is 0.02.

Correlation Parameter Estimates: LOB 2 Between companies: g Estimate: Standard error due to years: Standard error due to companies: Full standard error: Within company: c + g Estimate: Standard error due to years: Standard error due to companies: Full standard error:

g just barely significant at two standard errors. Both g and c smaller than for LOB 1, as expected from graphical evidence.

Correlation Parameter Estimates: LOB 1 vs. LOB 2 Between and within companies: g Estimate: Standard error due to years: Standard error due to companies: Full standard error:

What is here labeled g is actually geometric average of gs for LOBs 1 and 2, if in the same covariance group, or 0 otherwise. Parameter estimate not significantly different from 0: no statistical evidence that LOBs 1 and 2 are in the same covariance group.

Additional Observations Parameter estimates are pooled across companies, not separate by company size, stock/ mutual, etc. Correlation in the body of a multivariate distribution vs. “correlation in the tails”: correlation due to parameter uncertainty vs. correlation due to catastrophes.

What Else is in Appendix? Expected losses derived from expected loss ratio models. We tested several denominators: premium, PPR, exposures. Adjusted normalized deviations for degrees of freedom. More thorough treatment of weights in all models: loss ratio, variance, other pairwise products of deviations. Tested correlation model parameters for dependence on size of company: none found.

Bibliography Glenn Meyers, “Estimating Between Line Correlations Generated by Parameter Uncertainty,” CAS Forum, Summer pdf Glenn Meyers, Fred Klinker, and David Lalonde, “The Aggregation and Correlation of Insurance Exposure,” CAS Forum, Summer pdf