Modelling Multiple Lines of Business: Detecting and using correlations in reserve forecasting. Presenter: Dr David Odell Insureware, Australia
MULTIPLE TRIANGLE MODELLING ( or MPTF ) APPLICATIONS MULTIPLE LINES OF BUSINESS- DIVERSIFICATION? MULTIPLE SEGMENTS –MEDICAL VERSUS INDEMNITY –SAME LINE, DIFFERENT STATES –GROSS VERSUS NET OF REINSURANCE –LAYERS CREDIBILITY MODELLING –ONLY A FEW YEARS OF DATA AVAILABLE –HIGH PROCESS VARIABILITY MAKES IT DIFFICULT TO ESTIMATE TRENDS
BENEFITS Level of Diversification- optimal capital allocation by LOB Mergers/Acquisitions Writing new business- how is it related to what is already on the books?
Model Displays for LOB1 and LOB3
LOB1 and LOB3 Weighted Residual Plots for Calendar Year
Pictures shown above correspond to two linear models, which described by the following equations Without loss of sense and generality two models in (1) could be considered as one linear model:
For illustration of the most simple case we suppose that size of vectors y in models (1) are the same and equal to n, also we suppose that In this case Which could be rewritten as
For example, when n = 3
There is a difference between linear models in (1) and linear model (2). In (1) we model separately and do not use additional information from related trends, which we can do in model (2). To extract this additional information we need to use proper methods to estimate the vector of parameters . The general least squares (GLS) estimation equation enables us to achieve this.
However, it is necessary immediately to underline that we do not know elements of the matrix and we have to estimate them as well. So, practically, we should build iterative process of estimations and this process will stop, when we reach estimations with satisfactory statistical properties.
There are some cases, when model (2) provides the same results as models in (1). They are: 1.Design matrices in (1) have the same structure ( they are the same or proportional to each other ). 2.Models in (1) are non-correlated, another words However in situation when two models in (1) have common regressors model (2) again will have advantages in spite of the same structure of design matrices.
Correlation and Linearity The idea of correlation arises naturally for two random variables that have a joint distribution that is bivariate normal. For each individual variable, two parameters a mean and standard deviation are sufficient to fully describe its probability distribution. For the joint distribution, a single additional parameter is required – the correlation. If X and Y have a bivariate normal distribution, the relationship between them is linear: the mean of Y, given X, is a linear function of X ie:
How is the distribution of Y affected by this new information?
Y|X = x has a normal distribution or and
The slope is determined by the correlation , and the standard deviations and : The correlation between Y and X is zero if and only if the slope is zero. Also note that, when Y and X have a bivariate normal distribution, the conditional variance of Y, given X, is constant ie not a function of X: where
This is why, in the usual linear regression model Y = + X + the variance of the "error" term does not depend on X. However, not all variables are linearly related. Suppose we have two random variables related by the equation where T is normally distributed with mean zero and variance 1. What is the correlation between S and T ?
Linear correlation is a measure of how close two random variables are to being linearly related. In fact, if we know that the linear correlation is +1 or -1, then there must be a deterministic linear relationship Y = + X between Y and X (and vice versa). If Y and X are linearly related, and f and g are functions, the relationship between f( Y ) and g( X ) is not necessarily linear, so we should not expect the linear correlation between f( Y ) and g( X ) to be the same as between Y and X.
A common misconception with correlated lognormals Actuaries frequently need to find covariances or correlations between variables such as when finding the variance of a sum of forecasts (for example in P&C reserving, when combining territories or lines of business, or computing the benefit from diversification). Correlated normal random variables are well understood. The usual multivariate distribution used for analysis of related normals is the multivariate normal, where correlated variables are linearly related. In this circumstance, the usual linear correlation ( the Pearson correlation ) makes sense.
However, when dealing with lognormal random variables (whose logs are normally distributed), if the underlying normal variables are linearly correlated, then the correlation of lognormals changes as the variance parameters change, even though the correlation of the underlying normal does not.
All three lognormals below are based on normal variables with correlation 0.78, as shown left, but with different standard deviations.
We cannot measure the correlation on the log-scale and apply that correlation directly to the dollar scale, because the correlation is not the same on that scale. Additionally, if the relationship is linear on the log scale (the normal variables are multivariate normal) the relationship is no longer linear on the original scale, so the correlation is no longer linear correlation. The relationship between the variables in general becomes a curve:
Note that the correlation of Y 1 and Y 2 does not depend on the ’s When the standard deviations are close to zero it is just below but decreases further as s.d.’s increase.
Weighted Residual Plots for LOB1 and LOB3 versus Calendar Years What does correlation mean?
Model Displays for LOB1 and LOB3 for Calendar Years
Model for individual iota parameters
There are two types of correlations involved in calculations of reserve distributions. Weighted Residual Correlations between datasets: 0.35 – is weighted residual correlation between datasets LOB1 and LOB3; Correlations in parameter estimates: 0.32 – is correlation between iota parameters in LOB1 and LOB3. These two types of correlations induce correlations between triangle cells and within triangle cells.
Common iota parameter in both triangles
Two effects: Same parameter for each LOB increases correlations and CV of aggregates Single parameter for each line reduces CV of aggregates
Forecasted reserves by accident year, calendar year and total are correlated Indicates dependency through residuals’ and parameters’ correlations Indicates dependency through parameter estimate correlations only
Dependency of aggregates in aggregate table Var(Aggregate) >> Var(LOB1) + Var(LOB3). In each forecast cell and in aggregates by accident year and calendar year (and total) Correlation between reserve distributions is 0.82
Payment Stream Illustration Year01…t Payment Stream *exp( ) … 100*exp( *t) Then
Quantiles for aggregate of both lines according to model which accounts for the correlation. Quantiles for aggregate based on summing two independent models. (Note that these values are lower.)
Simulations from lognormals correlated within LOB and between LOBs Diagnostic for validity of using parametric distributions for forecast total.
GROSS VERSUS NET Model Display 1 Is your outward reinsurance program optimal? (eg. Excess of Loss Reinsurance) 6% stable trend in calendar year
Model Display 2 Zero trend in calendar year. Therefore ceding part that is growing!
Weighted Residual Covariances Between Datasets Weighted Residual Correlations Between Datasets
Weighted Residuals versus Accident Year Gross
Net Weighted Residuals versus Accident Year
Weighted Residual Normality Plot
Accident Year Summary CV Net > CV Gross – Why? It is not good for cedant?
Example of Risk Capital Allocation as a function of correlation
Simple example with 2 LOBs LOB1 Mean = 250 SD = 147 LOB2 Mean = 450 SD = 285 Total of means = Mean Reserve = 700 Loss Probability Loss Probability
Total losses for combined LOBs LOB LOB (200) (200) 1000 (300) 1250 (550) The yellow cells are those in which the mean reserve has been exceeded. The red numbers show the amount of this excess.
Probabilities in matrix of outcomes depend on the correlation between the two LOBs LOB LOB2 100 P=1/ P=1/ P=1/ LOB LOB2 100 P=1/9 200 P=1/9 300 P=1/ P=1/9 550 P=1/9 650 P=1/ P=1/9 900 P=1/ P=1/ “Correlation” =1.0“Correlation” =0.0 Very poor diversification Good diversification
Probabilities in matrix of outcomes depend on the correlation between the two LOBs LOB LOB2 100 P=1/ P=1/ P=1/ LOB LOB2 100 P=1/9 200 P=1/9 300 P=1/ P=1/9 550 P=1/9 650 P=1/ P=1/9 900 P=1/ P=1/ “Correlation” =1.0“Correlation” =0.0 Probability (Loss > Mean Reserve) = 1/3 Amount of excess loss = 550 LOB1 loss = 450 = mean(LOB1) LOB2 loss = 800 = mean(LOB2) Contribution to loss. LOB1:LOB2 = 1:1.75
Probabilities in matrix of outcomes depend on the correlation between the two LOBs LOB LOB2 100 P=1/ P=1/ P=1/ LOB LOB2 100 P=1/9 200 P=1/9 300 P=1/ P=1/9 550 P=1/9 650 P=1/ P=1/9 900 P=1/ P=1/ “Correlation” =1.0“Correlation” =0.0 Probability (Loss > Mean Reserve) = 1/3 Mean Excess loss = 550 (E.S. at 67%) LOB1 loss = 450 = mean(LOB1) LOB2 loss = 800 = mean(LOB2) Contributions to loss: LOB1:LOB2 = 1:1.75 Probability (Loss > Mean reserve)=4/9 Mean Excess loss = (E.S. at 67%) LOB1 Cond. Exp. Shortfall = 50 LOB2 Cond. Exp. Shortfall = Contributions to mean loss: LOB1:LOB2 = 1:5.25
Concept of Conditional Expected Shortfall Loss for LOB i = l i i= 1,2,3.. Expected Shortfall at n th percentile = E( l i -K| l i > K), where Pr( l i < K)=n/100. Assume LOB i is reserved at r i. Conditional Expected Shortfall at n th percentile = E( l i - r i | l 1 +l 2 +l 3.. > K), where Pr( l 1 +l 2 +l 3.. < K)=n/100.
Risk Capital allocation should depend on the expected shortfalls (ES) of the LOBs under a given scenario. Percentile of loss ES(LOB2)/ES(LOB1) when corr = ES(LOB2)/ES(LOB1) when corr = SD(LOB2)/SD(LOB1) 1.94 The contribution of the two lines depends on their correlation as well as the percentile at which we are calculating the ES.
Example 2. Risk Capital Allocation ratio for two LOBs with lognormal distributed losses. LOB1 has =100, = 50 LOB2 has 2= 150, 2= 100 2 1 2.0 Correlation of the underlying normals = rho Each LOB is reserved at the mean. Results based on simulations.
MODELING LAYERS Similar Structure Highly Correlated Surprise Finding! CV of reserves limited to $1M is the same as CV of reserves limited to $2M !
Model Display for All 1M: PL(I)
Model Display for All 2M: PL(I)
Model Display for All 1Mxs1M: PL(I) Note that All 1Mxs1M has zero inflation, and All 2M has lower inflation than All 1M, and distributions of parameters going forward are correlated
Residual displays vs calendar years show high correlations between three triangles.
Weighted Residual Covariances Between Datasets Weighted Residual Correlations Between Datasets
Compare Accident Year Summary Consistent forecasts based on composite model.
If we compare forecast by accident year for Limited 1M and limited 2M it is easy to see that CV is the same.
= + data 0 0 Breaking up a triangle 1.Change of mix of business 2.Different development and/or inflation 3.Different process variability
Layer 1: Limit 100k Layer 2: Limit 200k Composite (Layer 1 & Layer 2) Layer … Layer 2: Limit 200k Layer 1: Limit 100k Layer 2: Limit 200k Layer 1: Limit 100k
More information will be available at the Warner Center 3 rd floor on Monday, September 10, pm-12.00am ICRFS-ELRF will be available for FREE!