THE REINSURANCE ACTUARY - Blog

The Pareto Distribution and Method of Moments

Mon, 15 Apr 2024 00:00:00 GMT

On why it doesn’t really make sense to fit a Pareto distribution with a method of moments.

I was sent some large loss modelling recently by another actuary for a UK motor book. In the modelling, they had taken the historic large losses, and fit a Pareto distribution using a method of moments. I thought about it for a while and realized that it didn't really like the approach for a couple of reasons which I'll go into in more detail below, but then when I thought about it some more I realised I'd actually seen the exact approach before ... in an IFoA exam paper. So even though the method has some shortcomings, it is actually a taught technique. [1]

Following the theme from last time, of London's old vs new side by side. Here's a cool photo which shows the old royal naval college in Greenwich, with Canary Wharf in the background. Photo by Fas Khan

Problem 1 – existence of moments

The first thing to note is that we are looking at the 2 parameter (or type 2) Pareto, and we are going to be following Klugman in using alpha and theta to represent our parameters. This is not universal usage though and Wikipedia for example uses alpha and sigma. The alpha parameter determines the tail weight, and a lower value of alpha gives us a heavier tailed distribution. Theta just determines the rest of the shape of the curve, but generally for a Pareto it's the alpha which is the most important value, particularly if we are projecting out into a part of the curve which is beyond our previous largest loss.

Klugman [2] gives us the domain on which the moments of a Pareto Type 2 distribution are defined, and looking at the formulas in the table below gives us the clue to the first problem :

Based on this table, we can see that the mean is only defined when alpha > 1, and the variance is only defined when alpha > 2. (which we can see when we insert k=1 or k=2 into the third formula)

Now why is this important? You might reason that for any given data, the sample mean and sample variance always exist and are always finite, so we will always be able to fit a well defined pareto to our data, no problem.

The issue is that for many situations, we expect an alpha value below 2, and we will never produce such a fitted distribution when using the method of moments. In fact, in certain situations we expect an alpha below 1, which we will definitely not produce. Therefore, even though we are using a heavy tailed distribution, we are limiting ourselves to only be able to generate alpha values which are likely to be too light.

What is a reasonable prior range for our alpha value to fall in? Fackler [3], for example talks about MTPL curves often having alpha values less than 2, and property cat severity curves having alpha values which are often below 1. And just to note, this is consistent with datasets I've modeled. So when fitting a Pareto using a method of moments to property cat data, we are almost guaranteeing ourselves a fit which is too light, and which is likely to lose us money if we were to rely on it.

Problem 2 – is the sample representative?

There’s another more subtle problem with a method of moment approach. If we think a heavy tailed distribution like a Pareto is appropriate for type of situation we are modelling, then the mean and variance of any sample is likely to be unrepresentative, and specifically lower, than the mean or variance of the distribution generating the data.

We can actually model this process ourselves, lets set up a numerical simulation where we repeatedly generate 50 losses from a Pareto distribution with a given mean and variance. The value of 50 could be varied, and the result is more extreme the fewer losses we include, but 50 is not unusually small compared to a standard large loss history. We then examine the distribution of the sample mean and sample variances to get a sense of how a typical sample will present itself for a given Pareto mean and Pareto variance.

In [1]:

import numpy as npfrom scipy.stats import lomaxfrom scipy.optimize import fsolveimport pandas as pd

In [2]:

def lomax_moments_test(mean ,var):     # Solve the system of equations for the shape and scale parameters    def equations(params):        shape, scale = params        eq1 = mean - scale / (shape - 1)        eq2 = var - shape * scale**2 / ((shape - 1)**2 * (shape - 2))        return [eq1, eq2]        shape, scale = fsolve(equations, (3, 3))        return shape, scaledef lomax_moments(data):    mean = np.mean(data)    var = np.var(data)        # Solve the system of equations for the shape and scale parameters    def equations(params):        shape, scale = params        eq1 = mean - scale / (shape - 1)        eq2 = var - shape * scale**2 / ((shape - 1)**2 * (shape - 2))        return [eq1, eq2]        shape, scale = fsolve(equations, (3, 3))        return shape, scale

In [3]:

c= 1.3scale = 10scale / (c - 1) #mean = c* scale**2 / ((c- 1)**2 * (c - 2)) #var = sample_means = []sample_variances = []for _ in range(50000):    # Generate 1000 samples from the Lomax distribution    samples = lomax.rvs(c, scale=scale, size=50)    sample_mean = np.mean(samples)    sample_means.append(sample_mean )        sample_variance = np.var(samples)    sample_variances.append(sample_variance)    print("mean of simulated means = " + str(np.mean(sample_means)))print("median of simulated means = " + str(np.median(sample_means)))print("mean of simulated variances = " + str(np.mean(sample_variances)))print("median of simulated variances = " + str(np.median(sample_variances)))

mean of simulated means = 35.32232097822492median of simulated means = 22.199454165491623mean of simulated variances = 20133683.651214894median of simulated variances = 1999.904985311406

In [ ]:

We see in the final output that the median of the means is about 40% lower than the mean of the means. So 50% of samples are going to have a mean which is 40% or more lower than the mean for the underlying loss generation process. This is a big deal, and we are going to be massively underestimating our loss cost in these cases.

And then the variance is even more extreme, for our c value of 1.3, we should actually have an infinite variance, the mean is coming in around 20 million, so definitely exhibiting traits of being extremely big compared to the other values, the median on the other hand is a paltry 2k, much much smaller, and we are potentially going to be tricking ourselves into using a much too small alpha value here.

Solution

So what should we do instead? We can avoid most of the problems by just using a maximum likelihood estimator instead. Even though this does nothing to change the fact that the sample mean and sample variance are likely to be unrepresentative, the method is much more forgiving in terms of producing alpha and theta parameters which are appropriate. The second thing to do, is attempt to compare the alpha parameter that we have generated with external datasets. In my opinion I would much rather see someone ignore an inappropriately low alpha value which has been generated by a dataset and just use a made up value than stick slavishly to the values generated by the data. The key point to remember is that for small sample sizes and heavy tailed distributions your data is unlikely to be representative of the properties of the full distributions in lots of subtle ways.

IFoA Exam

Here is the extract from the exam. I think the reason that its a popular exam question is that the algebra works quite nicely, and it requires you to show some understanding of what the moments of the distribution are. Plus in a written exam it would be way too fiddly to ask someone to attempt a maximum likelihood method instead.

[1] https://actuaries.org.uk/qualify/prepare-for-your-exams/past-exam-papers-and-examiners-reports
[2] Loss models - from data to decisions – Klugman et al.
[3] Inflation and Excess insurance, Michael Fackler (2011)

Why do brokers always seem to think an ADC should cost 20% on line?

Thu, 29 Feb 2024 00:00:00 GMT

Okay, that's a bit of an exaggeration, but there’s a quirky mathematical result related to these deals which means the target loss cost can often end up clustering in a certain range. Let’s set up a dummy deal and I’ll show you what I mean.

Source: Jim Linwood, Petticoat Lane Market, https://www.flickr.com/photos/brighton/4765025392

I found this photo online, and I think it's a cool combo - it's got the modern City of London (the Gherkhin), a 60s brutalist-style estate (which when I looked it up online has been described as "a poor man's Barbican Estate"), and a street market which dates back to Tudor times (Petticoat lane).

ADC pricing

Suppose we are looking at an at-the-money ADC where carried reserves are \$500m. To keep the maths simple, let’s suppose the reinsurer has completed their reserve review, and the best estimate reserve is also equal \$500m. If there’s a shortfall or surplus, the maths doesn’t change too much anyway. We’re also going to ignore investment returns for the time being.

We are going to model the distribution of ultimate reserves with a lognormal distribution, so we need to determine two parameters – $\mu$ and $\sigma$. We already set the mean of the distribution to \$500m, which gives us $\mu$, so we just need to determine the volatility. Let’s stick in 15% as our value for the CV for the time being, and revisit this part later (the CV pick ends up having an interesting effect, so let’s save that discussion for later)

Next up, we need to think about what a sensible ADC structure would be. This is an at-the-money, so we know the attachment, but we don’t know the limit. Ideally the cedant would like maximum capital relief, so one sensible limit is to buy up to the 1-in-200 percentile of the ultimate reserve. i.e. limit = (99.5th percentile – attach) & attach = mean

Here’s our summary table so far.

And that’s all we need to put an expected loss cost against the contract. I wrote a quick Python script to do just that, and then ran it for a grid with mean running from \$500m – 3bn, in 500m increments, and CV running in 5% increments from 5% - 25%. The final table then outputs the expected loss, expressed as a % of limit.

In [2]:

import numpy as npfrom scipy.stats import lognormfrom scipy.integrate import quadimport pandas as pdfrom math import expfrom math import logfrom math import sqrt

In [6]:

def integrand(x, excess, limit):    return min(limit, max(x - excess, 0)) * lognorm.pdf(x, s=sigma, scale=np.exp(mu))means = []cvs = []outputs = []for mean in [500,1000,1500,2000,2500,3000]:    for cv in [0.05,0.1,0.15,0.2,0.25]:                means.append(mean)        cvs.append(cv)                stddev = mean*cv        mu = log(mean/(sqrt(1+stddev **2/mean**2)))        sigma = sqrt(log(1+stddev**2/mean**2))        excess = mean        limit = lognorm.ppf(0.995, s=sigma, scale=np.exp(mu)) - mean        result, _ = quad(integrand, excess, excess+limit, args=(excess, limit))        outputs.append(result/limit)# Create DataFramedf = pd.DataFrame({    'Mean': means,    'Coefficient of Variation': cvs,    'Output': outputs})df_pivot = df.pivot(index='Mean', columns='Coefficient of Variation', values='Output')# Print the DataFrameprint(df_pivot)

Coefficient of Variation      0.05      0.10      0.15      0.20      0.25Mean                                                                      500                       0.140955  0.133088  0.125675  0.118723  0.1122291000                      0.140955  0.133088  0.125675  0.118723  0.1122291500                      0.140955  0.133088  0.125675  0.118723  0.1122292000                      0.140955  0.133088  0.125675  0.118723  0.1122292500                      0.140955  0.133088  0.125675  0.118723  0.1122293000                      0.140955  0.133088  0.125675  0.118723  0.112229

In [ ]:

Observation 1 - mean invariance

The pricing is invariant to changes in the mean – i.e. assuming we update all the other values in line with changes to the mean, for a given CV, the loss on line does not change at all. We can see that from the table by the fact that the columns all have identical values for a given CV. Perhaps not too surprising? Effectively we are saying that we would charge twice as much for a transaction which is twice the size.

Observation 2 - cv invariance?

The more interesting observation, is that actually the loss on line does not really change too much as the coefficient of variation changes, and that it actually reduces as the CV increases... this was more surprising to me. When the CV increases from 5% to 25%, which is a factor of 5, and represents the difference between a fairly stable book at 5% CV, to a fairly volatile book at 25% CV, the loss on line only changes by less then 3 percentage points.

Now why is this surprising, and why does it happen?

The reason we might expect the loss on line to increase rather than decrease as the CV goes up is that we are offering non-linear protection on a more volatile book, so surely this should cost more? But actually what happens is that as the CV increases, the limit that a client is likely buy increases due to the 99.5th percentile moving out, and this increase in limit more than offsets the increase in loss cost due to increased volatility.

Analytical approach

We figured out the above using a numerical integration package in python, but we could have also attempted to approach it analytically, by showing that the expected loss on line for the ADC, as expressed by the following integral, is a (slowly) decreasing function of the CV.

$$ \frac{1}{Q_{0.995} - \mu} \int_{\mu}^{Q_{0.995}} \frac{1}{x\sigma \sqrt{2\pi}} e^{-\frac{(\ln x - \mu)^2}{2\sigma^2}} \, dx $$

Elon Musk's pay deal

Mon, 26 Feb 2024 00:00:00 GMT

As a rule of thumb, news outlets like the Guardian [1] or BBC News [2] don't typically report on the decisions of the Delaware Court of Chancery, a fairly niche 'court of equity' which decides matters of corporate law in the state of Delaware. That is of course, unless those decisions involve Elon Musk. Recently, the Delaware court handed down a judgement which voided a /$56bn pay-out which was due to Musk for his role as Tesla’s CEO. The reasoning behind striking it down is quite legal and technical, and not really my area of expertise but Matt Levine has a good write up for those interested. [3]

What I am interested in is thinking about how we would assess the fairness of the pay-out. Now fairness is a slippery concept, but I'm going to present one angle, which I've haven't seen discussed elsewhere yet, which I think is one possible way of framing the situation.

Source: https://en.m.wikipedia.org/wiki/File:Roadster_2.5_charging.jpg

Assessing fairness

My contention is that it is instructive to look at how reasonable the remuneration package was at *the date it was awarded*. Musk's options ended up being worth /$56bn, but that's only because Tesla’s share price performed incredibly well, had Tesla's shares performed poorly, then Musk would be due much less than /$56bn, and could even potentially have been due nothing at all. I don't think its appropriate to analyse the situation solely in terms of the /$56bn given the /$56bn was not a guaranteed outcome and could very well have come out at a different number.

To put it in different terms, if someone offered an employee the option of a /$100 bonus in cash, or /$100 of lottery tickets, then when deciding whether the bonus was fair, the relevant perspective would be 'is /$100 a reasonable amount to spend on bonuses'. It would be unfair to wait to see if they win, and then judge the fairness of the bonus based on whether they had a winning or losing lottery ticket.

So in order to get a sense of whether the package was fair when awarded, I thought I’d try to do the calcs to see how much the stock options were worth to Musk in 2018.

The proxy statement

First step, we need to figure out the precise details of the package. Various news articles give the basic outline as 'Musk would be awarded 1% of the outstanding shares if the market cap got above /$100bn, and then an additional 1% of the shares for each additional of /$50bn market cap, up to a max of 12% of the shares'. i.e. If the market cap only grew to /$110bn, he would have just got 1%, but if it grew to /$650bn, then he gets the full 12%.

The proxy statement [4] explains in full detail, and the specific mechanics are that Musk was to be awarded vanilla call options with a exercise price set to be the current stock price in 2018. Based on the informal description in various news sources, I initially thought that he was going to just be awarded that number of shares, rather than being given a call option (i.e. an asset-or-nothing call option). If he has just been awarded the shares, in the most optimistic scenario he would effectively be paid 12%*/$650bn = /$78bn, whereas with the vanilla call options he would effectively be paid 12%*(/$650bn - /$57bn) = /$63bn instead. So it's important we value these as vanilla call options, not as asset-or-nothing call options.

Another adjustment then needs to account for the effect of dilution, even though Musk could be awarded up to 12% of shares, this would be closer to an effective 10% of the company once dilution is accounted for. The post-trial opinion from Judge McCormick [5] contains values for these (as far as I’m aware I don’t think you could calculate them just based on public info). I’ve pasted the relevant section below.

To sense check this, the proxy statement gives a value of /$56bn if the market cap hits /$650bn. We can recreate this using the following calc:

9.6%*(/$650bn-/$57bn) = /$57bn, which is close enough.

No upper limit

Another point to make is that there is no upper limit to what these options might be worth. If Tesla’s share price had hit /$2trillion, then the options would have been worth /$186bn! I found some articles from the time, for example this one in the Guardian [6], which seem to imply that the maximum possible pay out was /$56bn, which is incorrect, this is the payout when the final tranche of options are released, which is the highest value mentioned in the proxy statement, but the sky is the limit with these things.

Okay, so now let’s actually value the stock options, I used a Black-Scholes model in a Spreadsheet to price the 12 tranches of options (and I know Black-Scholes doesn’t really work, and caused the 2008 financial crisis, but I’m not going to be trading trillions of dollars of derivatives based on it, and it should work reasonably well as a first approximation.)

Here is a screenshot of the model, but you can also download it from the following link:
github.com/Lewis-Walsh/Valuation_of_2018_Musk_pay_deal

Based on my model, the options were worth around /$1.8bn at the time they were awarded. Given this might have been the only compensation that Musk would have received in 10 years, we should then spread the cost of the options across the 10 years, so we could say they were worth /$180m pa. There’s probably an argument for using a period shorter than 10 years, as really this should be the average duration of the options given they might be exercised early, but let’s just keep the maths simple and use 10 years.

So this means, that Tesla could have purchased call options in the open market which replicated the payout pattern of those granted to Musk for /$1.8bn plus some sort of margin, and Musk's mean expected compensation per year is around /$180m.

Benchmarking

Now let’s benchmark the /$180m against what other CEOs are being paid. The highest paid CEOs in 2022 were the following (I used 2022 as it's approx. 50% through the contract length, and we want to recognize the impact of some inflation across the 10 years)

/$180m puts Musk basically joint 3rd highest paid CEO when compared to 2022 remuneration. This definitely benchmarks as a high package, but certainly not anything unprecedented in generosity.

Conclusions

Just to preempt possible comments on the above analysis, I'm acutely aware that there are many considerations I haven't incorporated, e.g. should anyway ever get paid /$56bn? Was this package really needed to incentivise Musk given he already have a material equity stake, did Tesla appropriately inform shareholders of the process which was used to arrive at the pay deal, etc. etc. But I do think the above perspective on the deal, thinking about what we would have thought before we knew how things turned out, is an important perspective when thinking about fairness.

[1] https://www.theguardian.com/technology/2024/jan/30/elon-musk-tesla-pay-package-too-much-judge-rules
[2] https://www.bbc.co.uk/news/business-68150306
[3] https://www.bloomberg.com/opinion/articles/2024-01-31/elon-musk-is-overpaid
[4] https://www.sec.gov/Archives/edgar/data//1318605/000119312518035345/d524719ddef14a.htm
[5] https://courts.delaware.gov/Opinions/Download.aspx?id=359340
[6] https://www.theguardian.com/technology/2018/jan/23/elon-musk-aiming-for-worlds-biggest-bonus-40bn

When chain ladders goes wrong

Thu, 08 Feb 2024 10:10:21 GMT

I received a chain ladder analysis a few days ago that looked roughly like the triangle below, but there's actually a bit of a problem with how the method is dealing with this particular triangle, have a look at see if you can spot the issue.

Maybe it's obvious to you straight away, maybe not. If we look at age-to-age factors though I think the issue jumps out in a much clearer way.

When looking at these factors, we can see that something weird is happening in the 2005 year. It looks like maybe a case reserve was put up and then taken down in the next development period. Ideally we don't want to incorporate this up and down effect when projecting the more recent years to ultimate. You might think to yourself - given the chain ladder is multiplicative, it doesn't much matter having an up and then a down as the two will cancel each other out when the two factors are multiplied together, i.e. 1.16 *0.86 = 1.

A problem does creep in here though because when we use the chain ladder method, we don't just multiply the individual factors. The down factor (0.86), is given a greater weighting in the weighted average calc when determining the incremental ldf than the up factor (1.16). This is partly due to there being an additional year of data in the upper calc, but also particularly acute in this example, as the additional year that drops off is outsized compared to the other years - it's about 4 times bigger than the average of the other years. So whereas the 1.16 is given 17% weighting in the incremental ldf calc, the 0.86 is given 34% weighting, which means the two definitely do not cancel each other out.

In fact, a 'standard' chain ladder is nothing more than a weighted average of the individual age-to-age factors, weighted by incurred loss. There's some reasons to prefer this approach as opposed to using a straight average or something else, but there's certainly nothing wrong with deviating from the weighted average if the method is not performing correctly.

By including this feature of our data, the cml ldf for 2010 ends up being a factor of 0.95. By correcting for this feature, the ldf to ult for 2010 ends up only being 0.98.

Let the data speak

I think there's an increased awareness in actuarial modelling, possibly driven by the influence of machine learning, of not making arbitrary adjustments to our data in an attempt to correct perceived issues, but instead just letting the data 'speak for itself'. I think the above is a clear example of a case where this approach of just letting the data speak for itself is probably not serving us well, as our model is simply not working as intended.

Modelling Extremal Events - Cramer-Lundberg theorem under LogN

Fri, 03 Nov 2023 15:30:09 GMT

I’ve had the textbook 'Modelling Extremal Events: For Insurance and Finance’ sat on my shelf for a while, and last week I finally got around to working through a couple of chapters. One thing I found interesting, just around how my own approach has developed over the years, is that even though it’s quite a maths heavy book my instinct was to immediately build some toy models and play around with the results. I recall earlier in my career, when I had just got out of a 4-year maths course, I was much more inclined to understand new topics via working through proofs step-by-step in long hand, pen to paper.

In case it’s of interest to others, I thought I’d upload my Excel version I built of the classic ruin process. In particular I was interested in how the Cramer-Lundberg theorem fails for sub-exponential distributions (which includes the very common Lognormal distribution). Therefore the Spreadsheet contains a comparison of this theorem against the correct answer, derived from monte carlo simulation.

The Speadsheet can be found here:
https://github.com/Lewis-Walsh/RuinTheoryModel

The first tab uses an exponential distribution, and the second uses a Lognormal distribution. Screenshot below.

I also coded a similar model in Python via Jupyter Notebook, which you can read about below.

Here I have specifically used a LogN distribution, and I compare the ruin probability I derive using a monte carlo simulation, with the value from the Cramer-Lundberg theorem, to understand the extent to which it under-estimates the ruin probability.

In [1]:

import numpy as npimport timeimport matplotlib.pyplot as pltimport math

In [2]:

# Set parametersmean_poisson = 1mean_lognormal = 1std_dev_lognormal = 1.5u = 15c = 1.1num_simulations = 10000max_poisson_samples = 500# Calculate parameters mu and sigmamu = np.log(mean_lognormal**2 / np.sqrt(std_dev_lognormal**2 + mean_lognormal**2))sigma = np.sqrt(np.log(1 + std_dev_lognormal**2 / mean_lognormal**2))

In [3]:

# Initialize arrays to record resultsresults = []poisson_sims_before_stop = []# Start timing for simulationsstart_time_simulations = time.time()for _ in range(num_simulations):    t = 0    S_t = 0    poisson_sims = 0        while t < max_poisson_samples:        poisson_sample = np.random.poisson(mean_poisson)        lognormal_samples = np.random.lognormal(mu, sigma, poisson_sample)        S_t += np.sum(lognormal_samples)                f_t = u + c*t - S_t                if f_t < 0:            results.append(1)            poisson_sims_before_stop.append(poisson_sims)            break                t += 1        poisson_sims += 1        if t == max_poisson_samples:        results.append(0)        poisson_sims_before_stop.append(poisson_sims)

In [4]:

# Filter out values less than max_poisson_samples (500)filtered_poisson_sims = [sims for sims in poisson_sims_before_stop if sims < max_poisson_samples]# Calculate average number of Poisson simulations before stoppingavg_poisson_sims = np.mean(filtered_poisson_sims)#Calculate the ruin probabilityruin_prob = len(filtered_poisson_sims) / len(poisson_sims_before_stop)# Calculate total execution time in minutes and secondsend_time_total = time.time()execution_time_total = end_time_total - start_time_simulationsminutes_total = int(execution_time_total // 60)seconds_total = int(execution_time_total % 60)

In [5]:

print(f"Ruin probability - monte carlo")print(ruin_prob)print(f"Ruin probability - Cramer-Lundberg")theta = c/(mean_poisson*mean_lognormal)-1print(1/(1+theta)*math.exp((-1*theta*u)/(1+theta)))# Create a histogram to visualize the distributionplt.hist(filtered_poisson_sims, bins=30, density=True, alpha=0.6, color='g', edgecolor='black')plt.xlabel('Number of Poisson simulations before stopping')plt.ylabel('Frequency')plt.title('Distribution of Poisson Simulations Before Stopping')plt.show()

Ruin probability - monte carlo0.3703Ruin probability - Cramer-Lundberg0.23248105446645487

In [ ]:

We see the output from the Monte Carlo and Cramer-Lundberg (CL) at the bottom. The monte carlo is our 'correct' value and gives us a 37% chance of ruin, whereas the CL method only thinks there is a 23% chance of ruin. So, by using the CL method, we'd be quite materially underestimating the risk of ruin.

Nuclear Verdicts and shenanigans with graphs part 2

Thu, 26 Oct 2023 13:12:20 GMT

I was thinking more about the post I made last week, and I realised there’s another feature of the graphs that is kind of interesting. None of the graphs adequately isolates what we in insurance would term ‘severity’ inflation. That is, the increase in the average individual verdict over time.

You might think that the bottom graph of the three, tracking the ‘Median Corporate Nuclear Verdict’ does this. If verdicts are increasing on average year by year due to social inflation, then surely the median nuclear verdict should increase as well right?!?

Actually, the answer to this is no. Let's see why.

Let’s start by looking at exactly what the chart is showing - in this context, a ‘nuclear verdict’ is defined as a US corporate civil verdict, which exceeds a fixed dollar value of $10m. You should hopefully be aware (for example see the paper by Brazauskas et al. (2009)) [1] that as a measure of trend, examining the increase in average claim excess of a fixed dollar threshold does not accurately reflect the underlying trend. It can be geared so as to cause a ‘greater than underlying’ trend, and it can also be less than the underlying trend (or even give a value of 0) depending on the exact shape of the distribution. For example, if our underlying trend is 8%, then the increase in average claim excess $10m may present itself in the data as 12%, as 4%, or even as 0% depending on the exact shape of the data. And this is not a ‘lack of data’ issue, i.e. the real answer will converge over time with more data, there is an inherent bias in the methodology.

The intuition behind the phenomenon is that we might have a whole bunch of losses just under the threshold, which when inflated are pushed to just above the threshold, and thereby adding a whole load of new small claims, bringing down the average of the losses above the threshold.

To be exact, the paper by Brazauskas et al. referenced above, shows that analysing the *mean* excess value does not accurately reflect trend, but our chart uses the *median* excess value. The obvious follow up question is does the median also have the same issue as the mean? Intuitively, the same arguments relating to verdicts just under the threshold seems to apply, but let’s run a quick python script to check.

The following script runs 50k monte carlo simulations, each simulation generates values from a lognormal distribution. We then inflate the simulated values by 5% pa, and then restrict ourselves to just the 'nuclear' verdicts, i.e. those above the threshold of 10m. Finally we analyse the median of these nuclear verdicts and assess the trend in the median over time. Here's our Python code:

In [1]:

import numpy as npimport pandas as pdimport scipy.stats as scipyfrom math import expfrom math import logfrom math import sqrtfrom scipy.stats import lognormfrom scipy.stats import poissonfrom scipy.stats import linregress

In [2]:

Distmean = 10000000.0DistStdDev = Distmean*1.5AverageFreq = 100years = 10ExposureGrowth = 0.0Mu = log(Distmean/(sqrt(1+DistStdDev**2/Distmean**2)))Sigma = sqrt(log(1+DistStdDev**2/Distmean**2))LLThreshold = 1e7Inflation = 0.05s = Sigmascale= exp(Mu)

In [3]:

MedianLL = []AllLnOutput = []for sim in range(50000):    SimOutputFGU = []    SimOutputLL = []    year = 0    Frequency= []    for year in range(years):        FrequencyInc = poisson.rvs(AverageFreq*(1+ExposureGrowth)**year,size = 1)        Frequency.append(FrequencyInc)        r = lognorm.rvs(s,scale = scale, size = FrequencyInc[0])        r = np.multiply(r,(1+Inflation)**year)        r = np.sort(r)[::-1]        r_LLOnly = r[(r>= LLThreshold)]        SimOutputFGU.append(np.transpose(r))        SimOutputLL.append(np.transpose(r_LLOnly))            SimOutputFGU = pd.DataFrame(SimOutputFGU).transpose()    SimOutputLL = pd.DataFrame(SimOutputLL).transpose()        a = np.log(SimOutputLL.median())    AllLnOutput.append(a)    b = linregress(a.index,a).slope    MedianLL.append(b)AllLnOutputdf = pd.DataFrame(AllLnOutput)dfMedianLL = pd.DataFrame(MedianLL)dfMedianLL['Exp-1'] = np.exp(dfMedianLL[0]) -1print(np.mean(dfMedianLL['Exp-1']))print(np.std(dfMedianLL['Exp-1']))

0.014141991399513090.013860637023388517

In [ ]:

The value of 1.4% we have outputted at the bottom is the average trend in the median of the nuclear verdicts. We can see that based on this analysis, even though we know that the underlying data has a trend of 5%, the median of the values greater than 10m, only increases by 1.4% pa.

What could we have done instead? We could have analysed the median of the top X verdicts, where X could for example be 10, 20, 50, 100. For a write up of this method, see the following:
www.lewiswalsh.net/blog/backtesting-inflation-modelling-median-of-top-x-losses