Modelling Multiple Lines of Business: Detecting and using correlations in reserve forecasting. Presenter: Dr David Odell Insureware, Australia.

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Probability & Statistical Inference Lecture 9
The General Linear Model Or, What the Hell’s Going on During Estimation?
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
Best Estimates for Reserves Glen Barnett and Ben Zehnwirth or find us on
Reserve Variability Modeling: Correlation 2007 Casualty Loss Reserve Seminar San Diego, California September 10-11, 2007 Mark R. Shapland, FCAS, ASA, MAAA.
Class 5: Thurs., Sep. 23 Example of using regression to make predictions and understand the likely errors in the predictions: salaries of teachers and.
Chapter 11 Multiple Regression.
Simple Linear Regression Analysis
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Correlation and Regression Analysis
Lorelei Howard and Nick Wright MfD 2008
Lecture II-2: Probability Review
Separate multivariate observations
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Regression and Correlation Methods Judy Zhong Ph.D.
Inference for regression - Simple linear regression
1 MULTI VARIATE VARIABLE n-th OBJECT m-th VARIABLE.
Alternative Measures of Risk. The Optimal Risk Measure Desirable Properties for Risk Measure A risk measure maps the whole distribution of one dollar.
1 FORECASTING Regression Analysis Aslı Sencer Graduate Program in Business Information Systems.
Integrating Reserve Risk Models into Economic Capital Models Stuart White, Corporate Actuary Casualty Loss Reserve Seminar, Washington D.C September.
Some matrix stuff.
Statistics for Business and Economics Dr. TANG Yu Department of Mathematics Soochow University May 28, 2007.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Two Approaches to Calculating Correlated Reserve Indications Across Multiple Lines of Business Gerald Kirschner Classic Solutions Casualty Loss Reserve.
Reinsurance of Long Tail Liabilities Dr Glen Barnett and Professor Ben Zehnwirth.
MULTIPLE TRIANGLE MODELLING ( or MPTF ) APPLICATIONS MULTIPLE LINES OF BUSINESS- DIVERSIFICATION? MULTIPLE SEGMENTS –MEDICAL VERSUS INDEMNITY –SAME LINE,
Module 1: Statistical Issues in Micro simulation Paul Sousa.
The Common Shock Model for Correlations Between Lines of Insurance
Examining Relationships in Quantitative Research
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Regression Regression relationship = trend + scatter
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
Linear Regression Analysis 5E Montgomery, Peck & Vining 1 Chapter 8 Indicator Variables.
Chapter 13 Multiple Regression
Reserve Variability – Session II: Who Is Doing What? Mark R. Shapland, FCAS, ASA, MAAA Casualty Actuarial Society Spring Meeting San Juan, Puerto Rico.
Spatial Analysis & Geostatistics Methods of Interpolation Linear interpolation using an equation to compute z at any point on a triangle.
IMPROVING ACTUARIAL RESERVE ANALYSIS THROUGH CLAIM-LEVEL PREDICTIVE ANALYTICS 1 Presenter: Chris Gross.
Simple Linear Regression In the previous lectures, we only focus on one random variable. In many applications, we often work with a pair of variables.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Model Building and Model Diagnostics Chapter 15.
1 B IVARIATE AND MULTIPLE REGRESSION Estratto dal Cap. 8 di: “Statistics for Marketing and Consumer Research”, M. Mazzocchi, ed. SAGE, LEZIONI IN.
U6-1 UNIT 6 Risk and Return and Stock Valuation Risk return tradeoff Diversifiable risk vs. market risk Risk and return: CAPM/SML Stock valuation: constant,
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Tutorial I: Missing Value Analysis
Nonparametric Statistics
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Lecture 7: Bivariate Statistics. 2 Properties of Standard Deviation Variance is just the square of the S.D. If a constant is added to all scores, it has.
Stats Methods at IC Lecture 3: Regression.
The simple linear regression model and parameter estimation
Statistical Data Analysis - Lecture /04/03
Correlation and Simple Linear Regression
BPK 304W Correlation.
Stochastic Hydrology Hydrological Frequency Analysis (II) LMRD-based GOF tests Prof. Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering.
CHAPTER 29: Multiple Regression*
Inference about the Slope and Intercept
6-1 Introduction To Empirical Models
Inference about the Slope and Intercept
Hypothesis testing and Estimation
Undergraduated Econometrics
MGS 3100 Business Analysis Regression Feb 18, 2016
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Modelling Multiple Lines of Business: Detecting and using correlations in reserve forecasting. Presenter: Dr David Odell Insureware, Australia

MULTIPLE TRIANGLE MODELLING ( or MPTF ) APPLICATIONS MULTIPLE LINES OF BUSINESS- DIVERSIFICATION? MULTIPLE SEGMENTS –MEDICAL VERSUS INDEMNITY –SAME LINE, DIFFERENT STATES –GROSS VERSUS NET OF REINSURANCE –LAYERS CREDIBILITY MODELLING –ONLY A FEW YEARS OF DATA AVAILABLE –HIGH PROCESS VARIABILITY MAKES IT DIFFICULT TO ESTIMATE TRENDS

BENEFITS Level of Diversification- optimal capital allocation by LOB Mergers/Acquisitions Writing new business- how is it related to what is already on the books?

Model Displays for LOB1 and LOB3

LOB1 and LOB3 Weighted Residual Plots for Calendar Year

Pictures shown above correspond to two linear models, which described by the following equations Without loss of sense and generality two models in (1) could be considered as one linear model:

For illustration of the most simple case we suppose that size of vectors y in models (1) are the same and equal to n, also we suppose that In this case Which could be rewritten as

For example, when n = 3

There is a difference between linear models in (1) and linear model (2). In (1) we model separately and do not use additional information from related trends, which we can do in model (2). To extract this additional information we need to use proper methods to estimate the vector of parameters . The general least squares (GLS) estimation equation enables us to achieve this.

However, it is necessary immediately to underline that we do not know elements of the matrix  and we have to estimate them as well. So, practically, we should build iterative process of estimations and this process will stop, when we reach estimations with satisfactory statistical properties.

There are some cases, when model (2) provides the same results as models in (1). They are: 1.Design matrices in (1) have the same structure ( they are the same or proportional to each other ). 2.Models in (1) are non-correlated, another words However in situation when two models in (1) have common regressors model (2) again will have advantages in spite of the same structure of design matrices.

Correlation and Linearity The idea of correlation arises naturally for two random variables that have a joint distribution that is bivariate normal. For each individual variable, two parameters a mean and standard deviation are sufficient to fully describe its probability distribution. For the joint distribution, a single additional parameter is required – the correlation. If X and Y have a bivariate normal distribution, the relationship between them is linear: the mean of Y, given X, is a linear function of X ie:

How is the distribution of Y affected by this new information?

Y|X = x has a normal distribution or and

The slope  is determined by the correlation , and the standard deviations and : The correlation between Y and X is zero if and only if the slope  is zero. Also note that, when Y and X have a bivariate normal distribution, the conditional variance of Y, given X, is constant ie not a function of X: where

This is why, in the usual linear regression model Y =  +  X +  the variance of the "error" term  does not depend on X. However, not all variables are linearly related. Suppose we have two random variables related by the equation where T is normally distributed with mean zero and variance 1. What is the correlation between S and T ?

Linear correlation is a measure of how close two random variables are to being linearly related. In fact, if we know that the linear correlation is +1 or -1, then there must be a deterministic linear relationship Y =  +  X between Y and X (and vice versa). If Y and X are linearly related, and f and g are functions, the relationship between f( Y ) and g( X ) is not necessarily linear, so we should not expect the linear correlation between f( Y ) and g( X ) to be the same as between Y and X.

A common misconception with correlated lognormals Actuaries frequently need to find covariances or correlations between variables such as when finding the variance of a sum of forecasts (for example in P&C reserving, when combining territories or lines of business, or computing the benefit from diversification). Correlated normal random variables are well understood. The usual multivariate distribution used for analysis of related normals is the multivariate normal, where correlated variables are linearly related. In this circumstance, the usual linear correlation ( the Pearson correlation ) makes sense.

However, when dealing with lognormal random variables (whose logs are normally distributed), if the underlying normal variables are linearly correlated, then the correlation of lognormals changes as the variance parameters change, even though the correlation of the underlying normal does not.

All three lognormals below are based on normal variables with correlation 0.78, as shown left, but with different standard deviations.

We cannot measure the correlation on the log-scale and apply that correlation directly to the dollar scale, because the correlation is not the same on that scale. Additionally, if the relationship is linear on the log scale (the normal variables are multivariate normal) the relationship is no longer linear on the original scale, so the correlation is no longer linear correlation. The relationship between the variables in general becomes a curve:

Note that the correlation of Y 1 and Y 2 does not depend on the  ’s When the standard deviations are close to zero it is just below  but decreases further as s.d.’s increase.

Weighted Residual Plots for LOB1 and LOB3 versus Calendar Years What does correlation mean?

Model Displays for LOB1 and LOB3 for Calendar Years

Model for individual iota parameters

There are two types of correlations involved in calculations of reserve distributions. Weighted Residual Correlations between datasets: 0.35 – is weighted residual correlation between datasets LOB1 and LOB3; Correlations in parameter estimates: 0.32 – is correlation between iota parameters in LOB1 and LOB3. These two types of correlations induce correlations between triangle cells and within triangle cells.

Common iota parameter in both triangles

Two effects: Same parameter for each LOB increases correlations and CV of aggregates Single parameter for each line reduces CV of aggregates

Forecasted reserves by accident year, calendar year and total are correlated Indicates dependency through residuals’ and parameters’ correlations Indicates dependency through parameter estimate correlations only

Dependency of aggregates in aggregate table Var(Aggregate) >> Var(LOB1) + Var(LOB3). In each forecast cell and in aggregates by accident year and calendar year (and total) Correlation between reserve distributions is 0.82

Payment Stream Illustration Year01…t Payment Stream *exp(  ) … 100*exp(  *t) Then

Quantiles for aggregate of both lines according to model which accounts for the correlation. Quantiles for aggregate based on summing two independent models. (Note that these values are lower.)

Simulations from lognormals correlated within LOB and between LOBs Diagnostic for validity of using parametric distributions for forecast total.

GROSS VERSUS NET Model Display 1 Is your outward reinsurance program optimal? (eg. Excess of Loss Reinsurance) 6% stable trend in calendar year

Model Display 2 Zero trend in calendar year. Therefore ceding part that is growing!

Weighted Residual Covariances Between Datasets Weighted Residual Correlations Between Datasets

Weighted Residuals versus Accident Year Gross

Net Weighted Residuals versus Accident Year

Weighted Residual Normality Plot

Accident Year Summary CV Net > CV Gross – Why? It is not good for cedant?

Example of Risk Capital Allocation as a function of correlation

Simple example with 2 LOBs LOB1 Mean = 250 SD = 147 LOB2 Mean = 450 SD = 285 Total of means = Mean Reserve = 700 Loss Probability Loss Probability

Total losses for combined LOBs LOB LOB (200) (200) 1000 (300) 1250 (550) The yellow cells are those in which the mean reserve has been exceeded. The red numbers show the amount of this excess.

Probabilities in matrix of outcomes depend on the correlation between the two LOBs LOB LOB2 100 P=1/ P=1/ P=1/ LOB LOB2 100 P=1/9 200 P=1/9 300 P=1/ P=1/9 550 P=1/9 650 P=1/ P=1/9 900 P=1/ P=1/ “Correlation” =1.0“Correlation” =0.0 Very poor diversification Good diversification

Probabilities in matrix of outcomes depend on the correlation between the two LOBs LOB LOB2 100 P=1/ P=1/ P=1/ LOB LOB2 100 P=1/9 200 P=1/9 300 P=1/ P=1/9 550 P=1/9 650 P=1/ P=1/9 900 P=1/ P=1/ “Correlation” =1.0“Correlation” =0.0 Probability (Loss > Mean Reserve) = 1/3 Amount of excess loss = 550 LOB1 loss = 450 = mean(LOB1) LOB2 loss = 800 = mean(LOB2) Contribution to loss. LOB1:LOB2 = 1:1.75

Probabilities in matrix of outcomes depend on the correlation between the two LOBs LOB LOB2 100 P=1/ P=1/ P=1/ LOB LOB2 100 P=1/9 200 P=1/9 300 P=1/ P=1/9 550 P=1/9 650 P=1/ P=1/9 900 P=1/ P=1/ “Correlation” =1.0“Correlation” =0.0 Probability (Loss > Mean Reserve) = 1/3 Mean Excess loss = 550 (E.S. at 67%) LOB1 loss = 450 = mean(LOB1) LOB2 loss = 800 = mean(LOB2) Contributions to loss: LOB1:LOB2 = 1:1.75 Probability (Loss > Mean reserve)=4/9 Mean Excess loss = (E.S. at 67%) LOB1 Cond. Exp. Shortfall = 50 LOB2 Cond. Exp. Shortfall = Contributions to mean loss: LOB1:LOB2 = 1:5.25

Concept of Conditional Expected Shortfall Loss for LOB i = l i i= 1,2,3.. Expected Shortfall at n th percentile = E( l i -K| l i > K), where Pr( l i < K)=n/100. Assume LOB i is reserved at r i. Conditional Expected Shortfall at n th percentile = E( l i - r i | l 1 +l 2 +l 3.. > K), where Pr( l 1 +l 2 +l 3.. < K)=n/100.

Risk Capital allocation should depend on the expected shortfalls (ES) of the LOBs under a given scenario. Percentile of loss ES(LOB2)/ES(LOB1) when corr = ES(LOB2)/ES(LOB1) when corr = SD(LOB2)/SD(LOB1) 1.94 The contribution of the two lines depends on their correlation as well as the percentile at which we are calculating the ES.

Example 2. Risk Capital Allocation ratio for two LOBs with lognormal distributed losses. LOB1 has  =100,  = 50 LOB2 has  2= 150,  2= 100  2  1  2.0 Correlation of the underlying normals = rho Each LOB is reserved at the mean. Results based on simulations.

MODELING LAYERS Similar Structure Highly Correlated Surprise Finding! CV of reserves limited to $1M is the same as CV of reserves limited to $2M !

Model Display for All 1M: PL(I)

Model Display for All 2M: PL(I)

Model Display for All 1Mxs1M: PL(I) Note that All 1Mxs1M has zero inflation, and All 2M has lower inflation than All 1M, and distributions of parameters going forward are correlated

Residual displays vs calendar years show high correlations between three triangles.

Weighted Residual Covariances Between Datasets Weighted Residual Correlations Between Datasets

Compare Accident Year Summary Consistent forecasts based on composite model.

If we compare forecast by accident year for Limited 1M and limited 2M it is easy to see that CV is the same.

= + data 0 0 Breaking up a triangle 1.Change of mix of business 2.Different development and/or inflation 3.Different process variability

Layer 1: Limit 100k Layer 2: Limit 200k Composite (Layer 1 & Layer 2) Layer … Layer 2: Limit 200k Layer 1: Limit 100k Layer 2: Limit 200k Layer 1: Limit 100k

More information will be available at the Warner Center 3 rd floor on Monday, September 10, pm-12.00am ICRFS-ELRF will be available for FREE!