<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="4.1.1">Jekyll</generator><link href="https://portfoliooptimizer.io/feed.xml" rel="self" type="application/atom+xml" /><link href="https://portfoliooptimizer.io/" rel="alternate" type="text/html" /><updated>2026-03-04T06:09:30-06:00</updated><id>https://portfoliooptimizer.io/feed.xml</id><title type="html">Portfolio Optimizer</title><subtitle>Portfolio Optimizer is a Web API democratizing the access to the Nobel Prize-winning science of portfolio optimization.</subtitle><author><name>Roman R.</name></author><entry><title type="html">The Market Rank Indicator: Measuring Financial Risk, Part 3</title><link href="https://portfoliooptimizer.io/blog/the-market-rank-indicator-measuring-financial-risk-part-3/" rel="alternate" type="text/html" title="The Market Rank Indicator: Measuring Financial Risk, Part 3" /><published>2026-03-04T00:00:00-06:00</published><updated>2026-03-04T00:00:00-06:00</updated><id>https://portfoliooptimizer.io/blog/the-market-rank-indicator-measuring-financial-risk-part-3</id><content type="html" xml:base="https://portfoliooptimizer.io/blog/the-market-rank-indicator-measuring-financial-risk-part-3/">&lt;p&gt;In the &lt;a href=&quot;/blog/the-absorption-ratio-measuring-financial-risk/&quot;&gt;previous post&lt;/a&gt; of this series on measuring financial risk, I described the &lt;em&gt;absorption ratio&lt;/em&gt;, a measure of financial market fragility based on 
&lt;a href=&quot;https://en.wikipedia.org/wiki/Principal_component_analysis&quot;&gt;principal components analysis&lt;/a&gt;, introduced in Kritzman et al.&lt;sup id=&quot;fnref:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In this new blog post, I will describe another measure of financial distress called the &lt;em&gt;market rank indicator (MRI)&lt;/em&gt;, this time &lt;em&gt;related to the notion of &lt;a href=&quot;https://en.wikipedia.org/wiki/Condition_number&quot;&gt;condition number&lt;/a&gt;&lt;/em&gt;&lt;sup id=&quot;fnref:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; of a matrix, 
introduced in Figini et al.&lt;sup id=&quot;fnref:1:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;As an example of usage, I will show how to use the market rank indicator to dynamically scale the market exposure of a portfolio of U.S. equities.&lt;/p&gt;

&lt;h2 id=&quot;the-market-rank-indicator&quot;&gt;The market rank indicator&lt;/h2&gt;

&lt;h3 id=&quot;definition&quot;&gt;Definition&lt;/h3&gt;

&lt;p&gt;Let:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$n$, the number of assets&lt;/li&gt;
  &lt;li&gt;$T$, the number of time periods, with $T &amp;gt; n$&lt;/li&gt;
  &lt;li&gt;$X \in \mathbb{R}^{T \times n}$, the matrix of the asset arithmetic or logarithmic returns for each of the $T$ time periods&lt;/li&gt;
  &lt;li&gt;$\sigma_1,…,\sigma_n$, the singular values of $X$ ordered such that $0 &amp;lt; \sigma_1 \leq … \leq \sigma_n$&lt;/li&gt;
  &lt;li&gt;$1\leq k \leq n$, the number of singular values $\sigma_1,…,\sigma_k$ to retain in the computation of the market rank indicator&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The market rank indicator $\text{MRI}$ of the assets is defined&lt;sup id=&quot;fnref:1:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; as &lt;em&gt;the ratio between the largest singular value and the geometric mean of the $k$ smallest singular values of the matrix $X$&lt;/em&gt;&lt;sup id=&quot;fnref:1:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, that is&lt;/p&gt;

\[\text{MRI}_k = \frac{ \sigma_n }{ \left( \prod_{i=1}^k \sigma_i \right)^{1/k} }\]
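As a concrete illustration, the formula above can be computed directly from the singular values of the returns matrix. The following minimal sketch (using NumPy, with synthetic returns standing in for real asset data) is mine, not from the original paper:

```python
import numpy as np

def market_rank_indicator(X, k):
    # Singular values of X, reordered so that sigma_1 <= ... <= sigma_n
    s = np.sort(np.linalg.svd(X, compute_uv=False))
    # Geometric mean of the k smallest singular values, computed in
    # log space for numerical stability
    geometric_mean = np.exp(np.mean(np.log(s[:k])))
    return s[-1] / geometric_mean

# Synthetic example: T = 252 daily returns for n = 11 assets
X = np.random.default_rng(0).standard_normal((252, 11))
print(market_rank_indicator(X, k=3))
```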

&lt;h3 id=&quot;alternative-definition&quot;&gt;Alternative definition&lt;/h3&gt;

&lt;p&gt;Let:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$n$, the number of assets&lt;/li&gt;
  &lt;li&gt;$T$, the number of time periods, with $T &amp;gt; n$&lt;/li&gt;
  &lt;li&gt;$X \in \mathbb{R}^{T \times n}$, the matrix of the asset arithmetic or logarithmic returns for each of the $T$ time periods&lt;/li&gt;
  &lt;li&gt;$\Sigma = \frac{1}{T} {}^t X X \in \mathbb{R}^{n \times n}$, the asset returns covariance matrix&lt;/li&gt;
  &lt;li&gt;$\lambda_1,…,\lambda_n$, the eigenvalues of $\Sigma$ ordered such that $0 &amp;lt; \lambda_1 \leq … \leq \lambda_n$&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Thanks to the relationship $\lambda_i = \frac{\sigma_i^2}{T}$, $i = 1, \ldots, n$, between the eigenvalues of $\Sigma$ and the singular values of $X$, the market rank indicator can alternatively be defined as the square root of 
the ratio between the largest eigenvalue and the geometric mean of the $k$ smallest eigenvalues of the matrix $\Sigma$, that is&lt;/p&gt;

\[\text{MRI}_k = \sqrt{ \frac{ \lambda_n  }{ \left( \prod_{i=1}^k \lambda_i \right)^{1/k} } }\]
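Since the two definitions only differ by the change of variable $\lambda_i = \sigma_i^2 / T$, they can be checked against each other numerically. A quick sanity check on synthetic data (my sketch, not from the paper):

```python
import numpy as np

T, n, k = 252, 11, 3
X = np.random.default_rng(1).standard_normal((T, n))

# Definition via the singular values of X
s = np.sort(np.linalg.svd(X, compute_uv=False))
mri_svd = s[-1] / np.exp(np.mean(np.log(s[:k])))

# Alternative definition via the eigenvalues of Sigma = (1/T) tX X
lam = np.sort(np.linalg.eigvalsh(X.T @ X / T))
mri_eig = np.sqrt(lam[-1] / np.exp(np.mean(np.log(lam[:k]))))

# The 1/T factors cancel in the ratio, so the two values coincide
print(np.isclose(mri_svd, mri_eig))  # → True
```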

&lt;h3 id=&quot;generalized-definition&quot;&gt;Generalized definition&lt;/h3&gt;

&lt;p&gt;In the two previous sub-sections, it is assumed that $T &amp;gt; n$.&lt;/p&gt;

&lt;p&gt;This assumption guarantees that the singular values of the matrix $X$, and equivalently the eigenvalues of the matrix $\Sigma$, are all strictly positive.&lt;/p&gt;

&lt;p&gt;In case $T \leq n$, the definition of the market rank indicator needs to be adapted, which is simply done by &lt;em&gt;redefining [it] on the first $T$ [non zero] singular values&lt;/em&gt;&lt;sup id=&quot;fnref:1:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
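Under that adapted definition, a sketch simply discards the numerically null singular values before applying the original formula; the tolerance below is my assumption, not a detail from the paper:

```python
import numpy as np

def mri_generalized(X, k, tol=1e-12):
    s = np.linalg.svd(X, compute_uv=False)  # min(T, n) singular values
    s = np.sort(s[s > tol * s.max()])       # keep the nonzero ones, ascending
    k = min(k, len(s))                      # k cannot exceed what remains
    return s[-1] / np.exp(np.mean(np.log(s[:k])))

# T = 5 observations of n = 11 assets: at most 5 nonzero singular values
X = np.random.default_rng(2).standard_normal((5, 11))
print(mri_generalized(X, k=3))
```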

&lt;h3 id=&quot;rationale&quot;&gt;Rationale&lt;/h3&gt;

&lt;p&gt;Figini et al.&lt;sup id=&quot;fnref:1:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; highlights that &lt;em&gt;what contains valuable information on market synchronization in a principal components analysis on asset returns is not only the strength of the 
first principal component, but also the weakness of the last components&lt;/em&gt;&lt;sup id=&quot;fnref:1:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;This observation leads to the proposal of the market rank indicator as &lt;em&gt;a generalization of the condition number $\kappa(X)$ of the matrix $X$&lt;/em&gt;&lt;sup id=&quot;fnref:1:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, with $\text{MRI}_1 = \kappa(X)$ as a special case.&lt;/p&gt;
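The special case $\text{MRI}_1 = \kappa(X)$ is easy to verify numerically, since the 2-norm condition number is exactly the ratio of the largest to the smallest singular value (a sketch on synthetic data):

```python
import numpy as np

X = np.random.default_rng(3).standard_normal((252, 11))
s = np.sort(np.linalg.svd(X, compute_uv=False))
mri_1 = s[-1] / s[0]  # MRI with k = 1
# np.linalg.cond defaults to the 2-norm condition number
print(np.isclose(mri_1, np.linalg.cond(X)))  # → True
```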

&lt;h3 id=&quot;interpretation&quot;&gt;Interpretation&lt;/h3&gt;

&lt;p&gt;Figini et al.&lt;sup id=&quot;fnref:1:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; proposes both a mathematical and a financial interpretation of the market rank indicator:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;Mathematically, the market rank indicator &lt;em&gt;measures the difficulty to span $\mathbb{R}^n$ with the $n$ columns of $X$&lt;/em&gt;&lt;sup id=&quot;fnref:1:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

    &lt;p&gt;Indeed, for a given $1\leq k \leq n$, a large value of $\text{MRI}_k$ &lt;em&gt;indicates that […] the columns of $X$ are a basis for an $(n − k)$-dimensional space&lt;/em&gt;&lt;sup id=&quot;fnref:1:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Financially, the market rank indicator &lt;em&gt;may be interpreted as a measure of distance of the asset return series from a market where the number of independent assets is reduced&lt;/em&gt;&lt;sup id=&quot;fnref:1:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

    &lt;p&gt;Indeed, again for a given $1\leq k \leq n$, a large value of $\text{MRI}_k$ &lt;em&gt;indicates that $k$ dimensions/assets out of $n$ are not very useful in diversifying a portfolio&lt;/em&gt;&lt;sup id=&quot;fnref:1:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;theoretical-and-empirical-properties&quot;&gt;Theoretical and empirical properties&lt;/h3&gt;

&lt;p&gt;Figini et al.&lt;sup id=&quot;fnref:1:13&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; establishes several theoretical properties of the market rank indicator, among which:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Property 1&lt;/strong&gt;: For any $1 \leq k \leq n$, $ 1 \leq \text{MRI}_k \leq \text{MRI}_1 $.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Property 2&lt;/strong&gt;: For a given matrix $X$, $\text{MRI}_k$ is non-increasing in $1 \leq k \leq n$.&lt;/p&gt;
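Both properties follow from the fact that the geometric mean of the $k$ smallest singular values can only grow with $k$; they are easy to spot-check on random data (my sketch, not from the paper):

```python
import numpy as np

X = np.random.default_rng(4).standard_normal((252, 11))
s = np.sort(np.linalg.svd(X, compute_uv=False))
mri = [s[-1] / np.exp(np.mean(np.log(s[:k]))) for k in range(1, 12)]

# Property 1: 1 <= MRI_k <= MRI_1 for every k
assert all(1.0 <= m <= mri[0] for m in mri)
# Property 2: MRI_k is non-increasing in k
assert all(a >= b for a, b in zip(mri, mri[1:]))
```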

&lt;p&gt;In addition, Figini et al.&lt;sup id=&quot;fnref:1:14&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; empirically demonstrates that the market rank indicator is &lt;em&gt;a suitable early warning system for future turbulence on the market&lt;/em&gt;&lt;sup id=&quot;fnref:1:15&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; by showing&lt;sup id=&quot;fnref:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; that 
&lt;em&gt;a large value of [that indicator] signals that the market will experience worse performances in the near future and that the probability of a large and negative return increases&lt;/em&gt;&lt;sup id=&quot;fnref:1:16&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;h3 id=&quot;how-to-determine-the-number-of-singular-values-to-retain&quot;&gt;How to determine the number of singular values to retain?&lt;/h3&gt;

&lt;p&gt;When computing the market rank indicator, Figini et al.&lt;sup id=&quot;fnref:1:17&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; suggests retaining a number of singular values equal to about one third of the number of assets&lt;sup id=&quot;fnref:1:18&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;The value for $k$ is set after a careful sensitivity analysis: an extensive comparison of the results obtained under different parameter settings showed that the best choice for $k$ is around one third of $n$.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Note, though, that depending on the specific context at hand, the number of singular values to retain &lt;em&gt;could also be chosen using the numerical optimization method, assuming for example $k$ stochastic&lt;/em&gt;&lt;sup id=&quot;fnref:1:19&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;h2 id=&quot;comparison-with-other-measures-of-financial-risk&quot;&gt;Comparison with other measures of financial risk&lt;/h2&gt;

&lt;h3 id=&quot;comparison-with-the-absorption-ratio&quot;&gt;Comparison with the absorption ratio&lt;/h3&gt;

&lt;p&gt;Figini et al.&lt;sup id=&quot;fnref:1:20&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; notes that &lt;em&gt;critical market conditions&lt;/em&gt;&lt;sup id=&quot;fnref:1:21&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; are characterized by two connected but &lt;em&gt;not perfectly equivalent&lt;/em&gt;&lt;sup id=&quot;fnref:1:22&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; phenomena:&lt;/p&gt;
&lt;ol&gt;
  &lt;li&gt;&lt;em&gt;An increase in the weight of the first principal components&lt;/em&gt;&lt;sup id=&quot;fnref:1:23&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
  &lt;li&gt;&lt;em&gt;A reduction in the weight of the last components&lt;/em&gt;&lt;sup id=&quot;fnref:1:24&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;The absorption ratio &lt;em&gt;focuses on the quantity of variability explained by the largest components&lt;/em&gt;&lt;sup id=&quot;fnref:1:25&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, that is, it focuses on the first phenomenon.&lt;/p&gt;

&lt;p&gt;The market rank indicator, on the other hand, focuses on the &lt;em&gt;second phenomenon&lt;/em&gt;&lt;sup id=&quot;fnref:1:26&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, which in theory makes it a natural complement to the absorption ratio as a measure of financial risk.&lt;/p&gt;

&lt;h3 id=&quot;comparison-with-the-covariance-matrix-effective-rank&quot;&gt;Comparison with the covariance matrix effective rank&lt;/h3&gt;

&lt;p&gt;Regular readers might remember that &lt;a href=&quot;/blog/the-matrix-effective-rank-measuring-the-dimensionality-of-a-universe-of-assets/&quot;&gt;in a previous blog post&lt;/a&gt;, I discussed the &lt;em&gt;effective rank&lt;/em&gt;&lt;sup id=&quot;fnref:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt; of a covariance matrix, 
which is a real-valued extension of &lt;a href=&quot;https://en.wikipedia.org/wiki/Rank_(linear_algebra)&quot;&gt;its rank&lt;/a&gt; that makes it possible to measure the dimensionality of the associated universe of assets.&lt;/p&gt;

&lt;p&gt;It turns out - as its name suggests - that the market rank indicator is very similar to the effective rank.&lt;/p&gt;

&lt;p&gt;Indeed, when there is &lt;em&gt;an increase in the co-movement of the asset returns&lt;/em&gt;&lt;sup id=&quot;fnref:1:27&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, &lt;em&gt;the vectors of the return time series become closer&lt;/em&gt;&lt;sup id=&quot;fnref:1:28&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, which leads to 
&lt;em&gt;a reduction in the diversification opportunities or, from a numerical point of view, a reduction in the market “dimension”&lt;/em&gt;&lt;sup id=&quot;fnref:1:29&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, which is also captured by the effective rank.&lt;/p&gt;

&lt;p&gt;As an illustration, Figure 1 compares both indicators when applied to the 11 Vanguard U.S. sector ETFs&lt;sup id=&quot;fnref:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt; over the period 31st October 2005 - 30th January 2026, using the same methodology as described in &lt;a href=&quot;#example-of-usage---scaling-the-market-exposure-of-a-portfolio-of-us-equities&quot;&gt;the Example of usage section&lt;/a&gt;.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/market-rank-indicator-comparison-with-effective-rank.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/market-rank-indicator-comparison-with-effective-rank-small.png&quot; alt=&quot;Figure 1. Market rank indicator vs. effective rank, 11 Vanguard U.S. sector ETFs, 31st October 2005 - 30th January 2026.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 1. Market rank indicator vs. effective rank, 11 Vanguard U.S. sector ETFs, 31st October 2005 - 30th January 2026.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;On Figure 1, it is clearly visible that the market rank indicator and the effective rank move in nearly opposite directions, which is confirmed numerically by a -84% correlation between these two indicators.&lt;/p&gt;

&lt;p&gt;Still, the inverse relationship between the market rank indicator and the effective rank is not perfect&lt;sup id=&quot;fnref:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:10&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;, so that they might be considered as two similar-but-different indicators.&lt;/p&gt;
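The relationship can be made concrete with a small experiment: raising the common correlation between the assets should push the market rank indicator up and the effective rank down. The sketch below uses the entropy-based effective rank from the earlier post; the correlation levels and sample size are arbitrary choices of mine:

```python
import numpy as np

def mri_from_cov(Sigma, k):
    lam = np.sort(np.linalg.eigvalsh(Sigma))
    return np.sqrt(lam[-1] / np.exp(np.mean(np.log(lam[:k]))))

def effective_rank(Sigma):
    # exp of the Shannon entropy of the normalized eigenvalues
    p = np.linalg.eigvalsh(Sigma)
    p = p / p.sum()
    return np.exp(-np.sum(p * np.log(p)))

T, n = 252, 11
rng = np.random.default_rng(5)
for rho in (0.1, 0.9):  # low vs. high co-movement
    C = np.full((n, n), rho) + (1.0 - rho) * np.eye(n)
    X = rng.standard_normal((T, n)) @ np.linalg.cholesky(C).T
    Sigma = X.T @ X / T
    print(rho, mri_from_cov(Sigma, 3), effective_rank(Sigma))
```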

&lt;h3 id=&quot;horse-race-for-value-at-risk-forecasting&quot;&gt;Horse race for Value-at-Risk forecasting&lt;/h3&gt;

&lt;p&gt;Figini et al.&lt;sup id=&quot;fnref:1:30&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; compares the empirical performances of the market rank indicator to those of three other measures of financial risk (the absorption ratio, &lt;a href=&quot;/blog/the-turbulence-index-measuring-financial-risk/&quot;&gt;the turbulence index&lt;/a&gt; and 
the average correlation) when forecasting the level of the 20-day ahead Value-at-Risk, and shows that it is &lt;em&gt;the indicator which displays the best performances&lt;/em&gt;&lt;sup id=&quot;fnref:1:31&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In more detail, Figini et al.&lt;sup id=&quot;fnref:1:32&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;First notes that &lt;em&gt;in general, a good systemic risk indicator would allow to well discriminate between the regular behavior of the market and periods of distress&lt;/em&gt;&lt;sup id=&quot;fnref:1:33&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; because 
&lt;em&gt;intuitively, the VaR (or other risk indicators) should be low when the indicator is low and large when the indicator is large&lt;/em&gt;&lt;sup id=&quot;fnref:1:34&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Then proceeds to empirically test this assumption in the case of the S&amp;amp;P 500 index, &lt;a href=&quot;https://en.wikipedia.org/wiki/STOXX_Europe_600&quot;&gt;the STOXX Europe 600 index&lt;/a&gt; and &lt;a href=&quot;https://en.wikipedia.org/wiki/DAX&quot;&gt;the DAX index&lt;/a&gt; and concludes that:&lt;/p&gt;

    &lt;blockquote&gt;
      &lt;p&gt;Although each indicator presents a degree of forecasting power, the MRI is the indicator which displays the best performances for the three markets.&lt;/p&gt;

      &lt;p&gt;In fact, [as can be seen in Figure 2] not only the VaR associated to the highest percentile interval is the largest, but also the difference between the VaR associated to the highest interval and the one associated to the lowest interval present the largest outcomes as well […].&lt;/p&gt;

      &lt;p&gt;This shows that the MRI can best discriminate between regular and turbulent periods, with respect to other systemic risk indicators.&lt;/p&gt;
    &lt;/blockquote&gt;

    &lt;figure&gt;
      &lt;a href=&quot;/assets/images/blog/market-rank-indicator-comparison-with-other-measures-figini.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/market-rank-indicator-comparison-with-other-measures-figini-small.png&quot; alt=&quot;Figure 2. Future 20-day S&amp;amp;P 500 Value-at-Risk at 5% level v.s. quintiles of various systemic risk indicators, 02nd January 1992 - 01st July 2011. Source: Figini et al.&quot; /&gt;&lt;/a&gt;
      &lt;figcaption&gt;Figure 2. Future 20-day S&amp;amp;P 500 Value-at-Risk at 5% level v.s. quintiles of various systemic risk indicators, 02nd January 1992 - 01st July 2011. Source: Figini et al.&lt;/figcaption&gt;
  &lt;/figure&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;h2 id=&quot;implementation-in-portfolio-optimizer&quot;&gt;Implementation in Portfolio Optimizer&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Portfolio Optimizer&lt;/strong&gt; implements the computation of the market rank indicator through the endpoint &lt;a href=&quot;https://docs.portfoliooptimizer.io/&quot;&gt;&lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;/assets/analysis/market-rank-indicator&lt;/code&gt;&lt;/a&gt;, with the number of singular values to retain 
defaulting to the value suggested in Figini et al.&lt;sup id=&quot;fnref:1:35&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;h2 id=&quot;example-of-usage---scaling-the-market-exposure-of-a-portfolio-of-us-equities&quot;&gt;Example of usage - Scaling the market exposure of a portfolio of U.S. equities&lt;/h2&gt;

&lt;p&gt;I will now illustrate the performances of the market rank indicator in a trading strategy designed to capture the idea that exposure to stocks should be decreased when the market rank indicator is increasing.&lt;/p&gt;

&lt;p&gt;For this:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;I will work within the universe of U.S. equities represented by the SPY ETF and the 11 Vanguard U.S. sector ETFs&lt;sup id=&quot;fnref:5:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;I will rely on daily return data&lt;sup id=&quot;fnref:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;7&lt;/a&gt;&lt;/sup&gt; over the period 31st October 2005 - 30th January 2026.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;I will use the following trading strategy at the end of each month:&lt;/p&gt;
    &lt;ul&gt;
      &lt;li&gt;Compute the sample covariance matrix of the 11 Vanguard U.S. sector ETFs over the past month, using the ETF daily arithmetic returns.&lt;/li&gt;
      &lt;li&gt;Compute the market rank indicator associated to that covariance matrix, retaining 3 eigenvalues.&lt;/li&gt;
      &lt;li&gt;Determine how elevated this market rank indicator is relative to its past 12-month history on a scale $s$ from 0% to 100%, thanks to &lt;a href=&quot;https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.percentileofscore.html&quot;&gt;a percentile rank&lt;/a&gt;.&lt;/li&gt;
      &lt;li&gt;If $s$ is greater than 75% (i.e., relatively elevated recent market rank indicator), allocate the portfolio to cash (with 0% interest) else allocate the portfolio to U.S. equities (SPY ETF); in both cases, hold that portfolio for the next month.&lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;
&lt;/ul&gt;
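The monthly rules above can be sketched as follows, with synthetic daily returns standing in for the actual ETF data. The helper names, and the choice to include the current month in the 12-month percentile window, are my assumptions rather than details from the post:

```python
import numpy as np
from scipy.stats import percentileofscore

def mri_from_cov(Sigma, k=3):
    # Eigenvalue-based MRI, retaining the k smallest eigenvalues
    lam = np.sort(np.linalg.eigvalsh(Sigma))
    return np.sqrt(lam[-1] / np.exp(np.mean(np.log(lam[:k]))))

def monthly_exposures(monthly_returns, lookback=12, threshold=75.0):
    """monthly_returns: list of (days x n) arrays of daily sector returns.
    Returns, for each month after the lookback, 1.0 (hold SPY) or 0.0 (cash)."""
    mris = [mri_from_cov(np.cov(R, rowvar=False)) for R in monthly_returns]
    exposures = []
    for t in range(lookback, len(mris)):
        # Percentile rank of the current MRI within its recent history
        score = percentileofscore(mris[t - lookback:t + 1], mris[t])
        exposures.append(0.0 if score > threshold else 1.0)
    return exposures

# Synthetic example: 36 months of 21 daily returns for 11 sector ETFs
rng = np.random.default_rng(6)
months = [0.01 * rng.standard_normal((21, 11)) for _ in range(36)]
print(monthly_exposures(months))
```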

&lt;p&gt;The equity curve associated to that market rank indicator-based trading strategy is depicted in Figure 3.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/market-rank-indicator-spy-strategy.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/market-rank-indicator-spy-strategy-small.png&quot; alt=&quot;Figure 3. MRI-based trading strategy vs. buy and hold, SPY ETF, 31st October 2005 - 30th January 2026.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 3. MRI-based trading strategy vs. buy and hold, SPY ETF, 31st October 2005 - 30th January 2026.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;From Figure 3, the market rank indicator-based trading strategy does a good job of avoiding some serious drawdowns when compared to the buy and hold strategy, but unfortunately lags in terms of total return.&lt;/p&gt;

&lt;p&gt;In figures:&lt;/p&gt;

&lt;table&gt;
  &lt;thead&gt;
    &lt;tr&gt;
      &lt;th&gt;Portfolio Management Strategy&lt;/th&gt;
      &lt;th&gt;Average Exposure&lt;/th&gt;
      &lt;th&gt;CAGR&lt;/th&gt;
      &lt;th&gt;Annualized Sharpe Ratio&lt;/th&gt;
      &lt;th&gt;Maximum (Monthly) Drawdown&lt;/th&gt;
    &lt;/tr&gt;
  &lt;/thead&gt;
  &lt;tbody&gt;
    &lt;tr&gt;
      &lt;td&gt;Buy and hold&lt;/td&gt;
      &lt;td&gt;100%&lt;/td&gt;
      &lt;td&gt;10.9%&lt;/td&gt;
      &lt;td&gt;0.78&lt;/td&gt;
      &lt;td&gt;51%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;MRI-based&lt;/td&gt;
      &lt;td&gt;68%&lt;/td&gt;
      &lt;td&gt;8.2%&lt;/td&gt;
      &lt;td&gt;0.79&lt;/td&gt;
      &lt;td&gt;33%&lt;/td&gt;
    &lt;/tr&gt;
  &lt;/tbody&gt;
&lt;/table&gt;

&lt;p&gt;For comparison, Figure 4 additionally depicts the equity curve associated to a similar strategy, this time based on the absorption ratio of the 11 Vanguard U.S. sector ETFs&lt;sup id=&quot;fnref:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/market-rank-indicator-spy-strategy-extended.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/market-rank-indicator-spy-strategy-extended-small.png&quot; alt=&quot;Figure 4. MRI-based and AR-based trading strategies vs. buy and hold, SPY ETF, 31st October 2005 - 30th January 2026.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 4. MRI-based and AR-based trading strategies vs. buy and hold, SPY ETF, 31st October 2005 - 30th January 2026.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;With figures:&lt;/p&gt;

&lt;table&gt;
  &lt;thead&gt;
    &lt;tr&gt;
      &lt;th&gt;Portfolio Management Strategy&lt;/th&gt;
      &lt;th&gt;Average Exposure&lt;/th&gt;
      &lt;th&gt;CAGR&lt;/th&gt;
      &lt;th&gt;Annualized Sharpe Ratio&lt;/th&gt;
      &lt;th&gt;Maximum (Monthly) Drawdown&lt;/th&gt;
    &lt;/tr&gt;
  &lt;/thead&gt;
  &lt;tbody&gt;
    &lt;tr&gt;
      &lt;td&gt;Buy and hold&lt;/td&gt;
      &lt;td&gt;100%&lt;/td&gt;
      &lt;td&gt;10.9%&lt;/td&gt;
      &lt;td&gt;0.78&lt;/td&gt;
      &lt;td&gt;51%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;MRI-based&lt;/td&gt;
      &lt;td&gt;68%&lt;/td&gt;
      &lt;td&gt;8.2%&lt;/td&gt;
      &lt;td&gt;0.79&lt;/td&gt;
      &lt;td&gt;33%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;AR-based&lt;/td&gt;
      &lt;td&gt;69%&lt;/td&gt;
      &lt;td&gt;10.6%&lt;/td&gt;
      &lt;td&gt;1.03&lt;/td&gt;
      &lt;td&gt;21%&lt;/td&gt;
    &lt;/tr&gt;
  &lt;/tbody&gt;
&lt;/table&gt;

&lt;p&gt;From Figure 4, it seems that the absorption ratio is superior&lt;sup id=&quot;fnref:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt; to the market rank indicator in terms of predictive performances for the specific universe of assets and over the specific period considered here.&lt;/p&gt;

&lt;p&gt;Incidentally, that result is at odds with the findings of Figini et al.&lt;sup id=&quot;fnref:1:36&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, so that the interested reader might want to dig deeper…&lt;/p&gt;

&lt;h2 id=&quot;conclusion&quot;&gt;Conclusion&lt;/h2&gt;

&lt;p&gt;Together with the turbulence index and the absorption ratio&lt;sup id=&quot;fnref:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:9&quot; class=&quot;footnote&quot;&gt;10&lt;/a&gt;&lt;/sup&gt;, the market rank indicator is a simple &lt;em&gt;early warning indicator&lt;/em&gt;&lt;sup id=&quot;fnref:1:37&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; for dangerous periods on financial markets.&lt;/p&gt;

&lt;p&gt;Although it did not particularly shine versus the absorption ratio in the specific example studied in this blog post, the market rank indicator is definitely an &lt;em&gt;additional tool to integrate&lt;/em&gt;&lt;sup id=&quot;fnref:1:38&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; so as to &lt;em&gt;improve [one’s] warning system&lt;/em&gt;&lt;sup id=&quot;fnref:1:39&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Waiting for the analysis of the other measures of financial risk, feel free to &lt;a href=&quot;https://www.linkedin.com/in/roman-rubsamen/&quot;&gt;connect with me on LinkedIn&lt;/a&gt; or to &lt;a href=&quot;https://twitter.com/portfoliooptim&quot;&gt;follow me on Twitter&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;–&lt;/p&gt;

&lt;div class=&quot;footnotes&quot; role=&quot;doc-endnotes&quot;&gt;
  &lt;ol&gt;
    &lt;li id=&quot;fn:3&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://jpm.pm-research.com/content/37/4/112&quot;&gt;Mark Kritzman, Yuanzhen Li, Sebastien Page and Roberto Rigobon, Principal Components as a Measure of Systemic Risk, The Journal of Portfolio Management Summer 2011, 37 (4) 112-126&lt;/a&gt;. &lt;a href=&quot;#fnref:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:1&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://doi.org/10.1016/j.ecosta.2017.12.001&quot;&gt;Silvia Figini, Mario Maggi, Pierpaolo Uberti, The market rank indicator to detect financial distress, Econometrics and Statistics, Volume 14, 2020, Pages 63-73&lt;/a&gt;. &lt;a href=&quot;#fnref:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:1:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;10&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;11&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:11&quot; class=&quot;reversefootnote&quot; 
role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;12&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:12&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;13&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:13&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;14&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:14&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;15&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:15&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;16&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:16&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;17&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:17&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;18&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:18&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;19&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:19&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;20&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:20&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;21&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:21&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;22&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:22&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;23&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:23&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;24&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:24&quot; class=&quot;reversefootnote&quot; 
role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;25&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:25&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;26&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:26&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;27&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:27&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;28&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:28&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;29&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:29&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;30&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:30&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;31&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:31&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;32&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:32&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;33&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:33&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;34&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:34&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;35&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:35&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;36&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:36&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;37&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:37&quot; class=&quot;reversefootnote&quot; 
role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;38&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:38&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;39&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:39&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;40&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:2&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Using returns of the 10 sectors composing the &lt;a href=&quot;https://en.wikipedia.org/wiki/S%26P_500&quot;&gt;S&amp;amp;P 500 index&lt;/a&gt;. &lt;a href=&quot;#fnref:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:4&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://ieeexplore.ieee.org/document/7098875&quot;&gt;Olivier Roy and Martin Vetterli, The effective rank: A measure of effective dimensionality, 15th European Signal Processing Conference, 2007&lt;/a&gt;. &lt;a href=&quot;#fnref:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:5&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;These are “VOX”, “VCR”, “VDC”, “VDE”, “VFH”, “VHT”, “VIS”, “VGT”, “VAW”, “VNQ”, “VPU”, cf. &lt;a href=&quot;https://investor.vanguard.com/investment-products/etfs/sector-etfs&quot;&gt;https://investor.vanguard.com/investment-products/etfs/sector-etfs&lt;/a&gt;. &lt;a href=&quot;#fnref:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:5:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:10&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Most probably because each indicator uses a different number of eigenvalues. &lt;a href=&quot;#fnref:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:6&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;(Adjusted) prices of the ETFs have been retrieved using &lt;a href=&quot;https://api.tiingo.com/&quot;&gt;Tiingo&lt;/a&gt;. &lt;a href=&quot;#fnref:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:8&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The absorption ratio-based trading strategy is identical to the market rank indicator-based trading strategy, except that the absorption ratio is computed in place of the market rank indicator. &lt;a href=&quot;#fnref:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:7&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The two indicators agree most of the time - their correlation is 85%; as a side note, it could be interesting to investigate what happens when they disagree… &lt;a href=&quot;#fnref:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:9&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;As well as the effective rank; I will not insist too much on that one, though, because - to date - I have not analyzed its usefulness as a measure of financial risk on the blog. &lt;a href=&quot;#fnref:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
  &lt;/ol&gt;
&lt;/div&gt;</content><author><name>Roman R.</name></author><category term="covariance matrix" /><category term="systemic risk" /><category term="absorption ratio" /><category term="market rank indicator" /><category term="principal component analysis" /><summary type="html">In the previous post of this series on measuring financial risk, I described the absorption ratio, a measure of financial market fragility based on principal components analysis, introduced in Kritzman et al.1. In this new blog post, I will describe another measure of financial distress called the market rank indicator (MRI), this time related to the notion of condition number2 of a matrix, introduced in Figini et al.2. As an example of usage, I will show how to use the market rank indicator to dynamically scale the market exposure of a portfolio of U.S. equities. The market rank indicator Definition Let be: $n$, the number of assets $T$, the number of time periods, with $T &amp;gt; n$ $X \in \mathbb{R}^{T \times n}$, the matrix of the asset arithmetic or logarithmic returns for each of the $T$ time periods $\sigma_1,…,\sigma_n$, the singular values of $X$ ordered such that $0 &amp;lt; \sigma_1 \leq … \leq \sigma_n$ $1\leq k \leq n$, the number of singular values $\lambda_1,…,\lambda_k$ to retain in the computation of the market rank indicator The market rank indicator $\text{MRI}$ of the assets is defined2 as the ratio between the largest singular value and the geometric mean of the $k$ smallest singular values of the matrix $X$2, that is \[\text{MRI}_k = \frac{ \sigma_n }{ \left( \prod_{i=1}^k \sigma_i \right)^{1/k} }\] Alternative definition Let be: $n$, the number of assets $T$, the number of time periods, with $T &amp;gt; n$ $X \in \mathbb{R}^{T \times n}$, the matrix of the asset arithmetic or logarithmic returns for each of the $T$ time periods $\Sigma = \frac{1}{T} X {}^t X \in \mathcal{M}(\mathbb{R}^{n \times n})$, the asset returns covariance matrix $\lambda_1,…,\lambda_n$, the 
eigenvalues of $\Sigma$ ordered such that $0 &amp;lt; \lambda_1 \leq … \leq \lambda_n$ Thanks to the relationship $\lambda_i = \frac{\sigma_i^2}{T}$, $i=1..n$, between the eigenvalues of $\Sigma$ and the singular values of $X$, the market rank indicator can alternatively be defined as the square root of the ratio between the largest eigenvalue and the geometric mean of the $k$ smallest eigenvalues of the matrix $\Sigma$, that is \[\text{MRI}_k = \sqrt{ \frac{ \lambda_n }{ \left( \prod_{i=1}^k \lambda_i \right)^{1/k} } }\] Generalized definition In the two previous sub-sections, it is assumed that $T &amp;gt; n$. This is to guarantee that the singular values of the matrix $X$ or that the eigenvalues of the matrix $\Sigma$ are not null. In case $T \leq n$, the definition of the market rank indicator needs to be adapted, which is simply done by redefining [it] on the first $T$ [non zero] singular values2. Rationale Figini et al.2 highlights that what contains valuable information on market synchronization in a principal components analysis on asset returns is not only the strength of the first principal component, but also the weakness of the last components2. This observation leads to the proposal of the market rank indicator as a generalization of the condition number $\kappa(X)$ of the matrix $X$2, with $\text{MRI}_1 = \kappa(X)$ as a special case. Interpretation Figini et al.2 proposes both a mathematical and a financial interpretation of the market rank indicator: Mathematically, the market rank indicator measures the difficulty to span $\mathbb{R}^n$ with the $n$ columns of $X$2. Indeed, for a given $1\leq k \leq n$, a large value of $\text{MRI}_k$ indicates that […] the columns of $X$ are a basis for an $(n − k)$-dimensional space2. Financially, the market rank indicator may be interpreted as a measure of distance of the asset return series from a market where the number of independent assets is reduced2. 
Indeed, again for a given $1\leq k \leq n$, a large value of $\text{MRI}_k$ indicates that $k$ dimensions/assets out of $n$ are not very useful in diversifying a portfolio2. Theoretical and empirical properties Figini et al.2 establishes several theoretical properties of the market rank indicator, among which: Property 1: For any $1 \leq k \leq n$, $ 1 \leq \text{MRI}_k \leq \text{MRI}_1 $. Property 2: For a given matrix $X$, $\text{MRI}_k$ is non-increasing in $1 \leq k \leq n$. In addition, Figini et al.2 empirically demonstrates that the market rank indicator is a suitable early warning system for future turbulence on the market2 by showing3 that a large value of [that indicator] signals that the market will experience worse performances in the near future and that the probability of a large and negative return increases2. How to determine the number of singular values to retain? When computing the market rank indicator, Figini et al.2 suggests to use a number of singular values of about 1/3th the number of assets2: The value for $k$ is set after a careful sensitivity analysis: an extensive comparison of the results obtained under different parameter settings showed that the best choice for $k$ is around one third of $n$. To be noted, though, that depending on the specific context at hand, the number of singular values to retain could also be chosen using the numerical optimization method, assuming for example $k$ stochastic2. Comparison with other measures of financial risk Comparison with the absorption ratio Figini et al.2 notes that critical market conditions2 are characterized by two connected but not perfectly equivalent2 phenomena: An increase in the weight of the first principal components2 A reduction in the weight of the last components2 The absorption ration focuses on the quantity of variability explained by the largest components2, that is, it focuses on the first phenomenon. 
The market rank indicator, on the other hand, focuses on the second phenomenon2, which theoretically makes it a perfect complement to the absorption ratio as a measure of financial risk. Comparison with the covariance matrix effective rank Familiar readers might remember that in a previous blog post, I discussed the effective rank4 of a covariance matrix, which is a real-valued extension of its rank allowing to measure the dimensionality of the associated universe of assets. It turns out - as its name suggests - that the market rank indicator is very similar to the effective rank. Indeed, when there is an increase in the co-movement of the asset returns2, the vectors of the return time series become closer2, which leads to a reduction in the diversification opportunities or, from a numerical point of view, a reduction in the market “dimension”2 which is also captured by the effective rank. As an illustration, Figure 1 compares both indicators when applied to the 11 Vanguard U.S. sector ETFs5 over the period 31th October 2005 - 30th January 2026, using the same methodology as described in the Exemple of usage section. Figure 1. Market rank indicator v.s. effective rank, 11 Vanguard U.S. sector ETFs, 31th October 2005 - 30th January 2026. On Figure 1, it is clearly visible that the market rank indicator and the effective rank move in a near perfect opposite direction to each other, which is confirmed numerically by a -84% correlation between these two indicators. Still, the inverse relationship between the market rank indicator and the effective rank is not perfect6, so that they might be considered as two similar-but-different indicators. 
Horse race for Value-at-Risk forecasting Figini et al.2 compares the empirical performances of the market rank indicator to those of three other measures of financial risk (the absorption ratio, the turbulence index and the average correlation) when forecasting the level of the 20-day ahead Value-at-Risk, and shows that it is the indicator which displays the best performances2. In more details, Figini et al.2: First notes that in general, a good systemic risk indicator would allow to well discriminate between the regular behavior of the market and periods of distress2 because intuitively, the VaR (or other risk indicators) should be low when the indicator is low and large when the indicator is large2. Then proceeds to empirically test this assumption in the case of the S&amp;amp;P 500 index, the STOXX Europe 600 index and the DAX index and concludes that: Although each indicator presents a degree of forecasting power, the MRI is the indicator which displays the best performances for the three markets. In fact, [as can be seen in Figure 2] not only the VaR associated to the highest percentile interval is the largest, but also the difference between the VaR associated to the highest interval and the one associated to the lowest interval present the largest outcomes as well […]. This shows that the MRI can best discriminate between regular and turbulent periods, with respect to other systemic risk indicators. Figure 2. Future 20-day S&amp;amp;P 500 Value-at-Risk at 5% level v.s. quintiles of various systemic risk indicators, 02nd January 1992 - 01st July 2011. Source: Figini et al. Implementation in Portfolio Optimizer Portfolio Optimizer implements the computation of the market rank indicator through the endpoint /assets/analysis/market-rank-indicator, with the default number of singular values to retain suggested in Figini et al.2. Example of usage - Scaling the market exposure of a portfolio of U.S. 
equities I will now illustrate the performances of the market rank indicator in a trading strategy desgined to capture the idea that exposure to stocks should be decreased when the market rank indicator is increasing. For this: I will work within the universe of U.S. equities represented by the SPY ETF and the 11 Vanguard U.S. sector ETFs5. I will rely on daily return data7 over the period 31th October 2005 - 30th January 2026. I will use the following trading strategy at the end of each month: Compute the sample covariance matrix of the 11 Vanguard U.S. sector ETFs over the past month, using the ETF daily arithmetic returns. Compute the market rank indicator associated to that covariance matrix, retaining 3 eigenvalues. Determine how elevated is this market rank indicator relative to its past 12-month history on a scale $s$ from 0% to 100%, thanks to a percentile rank. If $s$ is greater than 75% (i.e., relatively elevated recent market rank indicator), allocate the portfolio to cash (with 0% interest) else allocate the portfolio to U.S. equities (SPY ETF); in both cases, hold that portfolio for the next month. The equity curve associated to that market rank indicator-based trading stategy is depicted in Figure 3. Figure 3. MRI-based trading strategy v.s. buy and hold, SPY ETF, 31th October 2005 - 30th January 2026. From Figure 3, the market rank indicator-based trading stategy does a good job in avoiding some serious drawdowns when compared to the buy and hold strategy, but unfortunately lags in terms of total return. Figures: Portfolio Management Strategy Average Exposure CAGR Annualized Sharpe Ratio Maximum (Monthly) Drawdown Buy and hold 100% 10.9% 0.78 51% MRI-based 68% 8.2% 0.79 33% For comparison, Figure 4 additionally depicts the equity curve associated to a similar strategy, this time based on the absorption ratio of the 11 Vanguard U.S. sector ETFs8. Figure 4. MRI-based and AR-based trading strategies v.s. 
buy and hold, SPY ETF, 31th October 2005 - 30th January 2026. With figures: Portfolio Management Strategy Average Exposure CAGR Annualized Sharpe Ratio Maximum (Monthly) Drawdown Buy and hold 100% 10.9% 0.78 51% MRI-based 68% 8.2% 0.79 33% AR-based 69% 10.6% 1.03 21% From Figure 4, it seems that that the absorption ratio is superior9 to the market rank indicator in terms of predictive performances for the specific universe of assets and over the specific period considered here. Incidentally, that result is at odds with the findings of Figini et al.2, so that the interested reader might want to dig deeper… Conclusion Together with the turbulence index and the absorption ratio10, the market rank indicator is a simple early warning indicator2 for dangerous periods on financial markets. Although it did not particularly shine v.s. the absorption ratio in the specific example studied in this blog post, the market rank indicator is definitely an additional tool to integrate2 so as to improve [one’s] warning system2. Waiting for the analysis of the other measures of financial risk, feel free to connect with me on LinkedIn or to follow me on Twitter. – See Mark Kritzman, Yuanzhen Li, Sebastien Page and Roberto Rigobon, Principal Components as a Measure of Systemic Risk, The Journal of Portfolio Management Summer 2011, 37 (4) 112-126. &amp;#8617; See Silvia Figini, Mario Maggi, Pierpaolo Uberti, The market rank indicator to detect financial distress, Econometrics and Statistics, Volume 14, 2020, Pages 63-73. 
&amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 &amp;#8617;7 &amp;#8617;8 &amp;#8617;9 &amp;#8617;10 &amp;#8617;11 &amp;#8617;12 &amp;#8617;13 &amp;#8617;14 &amp;#8617;15 &amp;#8617;16 &amp;#8617;17 &amp;#8617;18 &amp;#8617;19 &amp;#8617;20 &amp;#8617;21 &amp;#8617;22 &amp;#8617;23 &amp;#8617;24 &amp;#8617;25 &amp;#8617;26 &amp;#8617;27 &amp;#8617;28 &amp;#8617;29 &amp;#8617;30 &amp;#8617;31 &amp;#8617;32 &amp;#8617;33 &amp;#8617;34 &amp;#8617;35 &amp;#8617;36 &amp;#8617;37 &amp;#8617;38 &amp;#8617;39 &amp;#8617;40 Using returns of the 10 sectors composing the S&amp;amp;P 500 index. &amp;#8617; See Olivier Roy and Martin Vetterli, The effective rank: A measure of effective dimensionality, 15th European Signal Processing Conference, 2007. &amp;#8617; These are “VOX”, “VCR”, “VDC”, “VDE”, “VFH”, “VHT”, “VIS”, “VGT”, “VAW”, “VNQ”, “VPU”, c.f. https://investor.vanguard.com/investment-products/etfs/sector-etfs. &amp;#8617; &amp;#8617;2 Most probably due to the fact that the number of eigenvalues used by each indicator is different. &amp;#8617; (Adjusted) prices of the ETFs have have been retrieved using Tiingo. &amp;#8617; The absorption ratio-based trading strategy is exactly the same as the market rank indicator-based trading strategy, except that the absorption ratio is computed instead of the market rank indicator; that’s the only change. &amp;#8617; The two indicators are agreeing most of the time - their correlation is 85%; as a side note, it could be interesting to understand what happens when they are disagreeing… &amp;#8617; As well as the effective rank; I cannot insist too much on that one, though, because - to this date - I did not analyze its usefulness as a measure of financial risk on the blog. 
&amp;#8617;</summary></entry><entry><title type="html">More Bootstrap Simulations with Portfolio Optimizer: the Autoregressive Online Bootstrap</title><link href="https://portfoliooptimizer.io/blog/more-bootstrap-simulations-with-portfolio-optimizer-the-autoregressive-online-bootstrap/" rel="alternate" type="text/html" title="More Bootstrap Simulations with Portfolio Optimizer: the Autoregressive Online Bootstrap" /><published>2026-02-01T00:00:00-06:00</published><updated>2026-02-01T00:00:00-06:00</updated><id>https://portfoliooptimizer.io/blog/more-bootstrap-simulations-with-portfolio-optimizer-the-autoregressive-online-bootstrap</id><content type="html" xml:base="https://portfoliooptimizer.io/blog/more-bootstrap-simulations-with-portfolio-optimizer-the-autoregressive-online-bootstrap/">&lt;p&gt;In &lt;a href=&quot;/blog/bootstrap-simulation-with-portfolio-optimizer-usage-for-financial-planning&quot;&gt;a previous article&lt;/a&gt;, I described several classical bootstrap techniques — i.i.d. bootstrap, circular block bootstrap, and stationary block bootstrap — and 
showed how the stationary block bootstrap could be used to simulate future price paths for financial assets by following the methodology of Anarkulova et al.&lt;sup id=&quot;fnref:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In this blog post, I will detail another bootstrap technique called &lt;em&gt;the autoregressive online bootstrap&lt;/em&gt;&lt;sup id=&quot;fnref:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, introduced in Palm and Nagler&lt;sup id=&quot;fnref:2:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, which is best described as a multiplier bootstrap&lt;sup id=&quot;fnref:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; coupled 
with an autoregressive sequence of weights specifically chosen to make it usable with streaming time series data.&lt;/p&gt;

&lt;p&gt;As an example of usage, I will simulate alternative price histories for the SPY and TLT ETFs, which are representative of the U.S. stock market and of long-term U.S. Treasury bonds, respectively.&lt;/p&gt;

&lt;h2 id=&quot;mathematical-preliminaries&quot;&gt;Mathematical preliminaries&lt;/h2&gt;

&lt;p&gt;Let $X_1, …, X_n$&lt;sup id=&quot;fnref:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt;, $n \geq 1$ be a sample of data observed from a population.&lt;/p&gt;

&lt;h3 id=&quot;limitations-of-classical-bootstrap-techniques-in-an-online-setting&quot;&gt;Limitations of classical bootstrap techniques in an online setting&lt;/h3&gt;

&lt;p&gt;In a typical&lt;sup id=&quot;fnref:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt; online setting, the length $n$ of the sample of data $X_1, …, X_n$ grows without bound, with new &lt;em&gt;events [that] are observed at the moment they occur&lt;/em&gt;&lt;sup id=&quot;fnref:2:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In such a setting, the classical bootstrap techniques described in &lt;a href=&quot;/blog/bootstrap-simulation-with-portfolio-optimizer-usage-for-financial-planning&quot;&gt;the previous blog post on bootstrap simulations&lt;/a&gt; require &lt;em&gt;all data observed so far&lt;/em&gt;&lt;sup id=&quot;fnref:2:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; at any given point in time.&lt;/p&gt;

&lt;p&gt;Indeed:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;The i.i.d. bootstrap, by definition, &lt;em&gt;requires keeping track of the entire observed sample $ {X_1, …, X_n} $&lt;/em&gt;&lt;sup id=&quot;fnref:2:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/li&gt;
  &lt;li&gt;The circular and the stationary block bootstrap require &lt;em&gt;all blocks […] to increase in size with $n$&lt;/em&gt;&lt;sup id=&quot;fnref:2:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, so that &lt;em&gt;to compute the bootstrap in practice, the entire data set $ {X_1, …, X_n} $ needs to be kept in memory and processed fully, every time the block size changes&lt;/em&gt;&lt;sup id=&quot;fnref:2:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Depending on the domain of application, this can be a serious computational limitation when:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;&lt;em&gt;n&lt;/em&gt; is large&lt;/li&gt;
  &lt;li&gt;&lt;em&gt;n&lt;/em&gt; is moderately large but the underlying $n$ observations require a lot of computer memory to be stored&lt;/li&gt;
  &lt;li&gt;&lt;em&gt;n&lt;/em&gt; is moderately large but the computation time is limited (e.g. real-time applications)&lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;the-multiplier-bootstrap&quot;&gt;The multiplier bootstrap&lt;/h3&gt;

&lt;p&gt;The multiplier bootstrap&lt;sup id=&quot;fnref:3:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; is &lt;em&gt;a general class of bootstrapping schemes based on perturbations of the original observations with suitable weights&lt;/em&gt;&lt;sup id=&quot;fnref:2:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In other words, compared to classical bootstrap techniques, the multiplier bootstrap replaces the idea of “randomly resampling observations” with the idea of “randomly reweighting observations”, 
which enables it to be applied in an online setting.&lt;/p&gt;

&lt;p&gt;For i.i.d. data $X_1, …, X_n$, it is defined as follows:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Let $V_1, …, V_n$ be $n$ i.i.d. random variables with unit mean and unit variance.&lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Let $ \bar V_i = \frac{1}{i} \sum_{j=1}^{i} V_j $ be the running mean of these random variables, which can be computed recursively through the formula&lt;/p&gt;

\[\bar V_i = \frac{(i-1) \bar V_{i-1} + V_i }{i}\]
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The multiplier bootstrap samples $ X_1^*,…,X_n^* $ are then defined as&lt;/p&gt;

\[X_i^* = \frac{V_i}{\bar V_i} X_i\]
  &lt;/li&gt;
&lt;/ul&gt;
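&lt;p&gt;To make this definition concrete, here is a minimal Python sketch of the multiplier bootstrap - a hypothetical illustration of mine, not code from Portfolio Optimizer - using i.i.d. Exponential(1) weights, one convenient choice with unit mean and unit variance:&lt;/p&gt;

```python
import numpy as np

def multiplier_bootstrap(X, rng=None):
    """One multiplier bootstrap sample of the data X (shape (n,) or (n, d)).

    The weights V_i are i.i.d. Exponential(1), which has unit mean and
    unit variance; each observation X_i is rescaled by V_i / bar(V)_i.
    """
    rng = np.random.default_rng() if rng is None else rng
    n = X.shape[0]
    V = rng.exponential(scale=1.0, size=n)      # i.i.d. weights, unit mean and variance
    V_bar = np.cumsum(V) / np.arange(1, n + 1)  # running means bar(V)_i
    return (X.T * (V / V_bar)).T                # X_i^* = V_i / bar(V)_i * X_i
```

&lt;p&gt;Note that $\bar V_1 = V_1$, so that the first bootstrap sample always coincides with the first observation.&lt;/p&gt;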

&lt;h2 id=&quot;the-autoregressive-online-bootstrap&quot;&gt;The autoregressive online bootstrap&lt;/h2&gt;

&lt;h3 id=&quot;methodology&quot;&gt;Methodology&lt;/h3&gt;

&lt;p&gt;The autoregressive online bootstrap&lt;sup id=&quot;fnref:2:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; is a specific instance of the multiplier bootstrap that generates a sequence of random weights evolving according to an autoregressive process centered around 1:&lt;/p&gt;

\[V_i = 1 + \rho_i \left( V_{i-1} - 1 \right)  + \sqrt{1 - \rho_i^2} \zeta_i, i=1..n\]

&lt;p&gt;where:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$V_0 = 0$&lt;/li&gt;
  &lt;li&gt;$\rho_i = 1 - i^{-\beta}$, $ 0 &amp;lt; \beta &amp;lt; \frac{1}{2}$&lt;/li&gt;
  &lt;li&gt;$\zeta_i \sim \mathcal N(0,1), i=1..n$&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The associated autoregressive online bootstrap samples $ X_1^*,…,X_n^* $ are then defined as&lt;/p&gt;

\[X_i^* = \frac{V_i}{\bar V_i} X_i\]
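&lt;p&gt;Putting the weight recursion and the final rescaling together, the autoregressive online bootstrap can be sketched in Python as follows - again a hypothetical illustration of mine, using the value $\beta = \sqrt{2} - 1$ as a default:&lt;/p&gt;

```python
import numpy as np

def ar_online_bootstrap(X, beta=np.sqrt(2) - 1, rng=None):
    """One autoregressive online bootstrap sample of the data X (shape (n,) or (n, d)).

    The weights follow the autoregressive recursion centered around 1,
    with rho_i = 1 - i**(-beta) and standard Gaussian innovations zeta_i.
    """
    rng = np.random.default_rng() if rng is None else rng
    n = X.shape[0]
    V = np.empty(n)
    v = 0.0                                      # V_0 = 0
    for i in range(1, n + 1):
        rho = 1.0 - i ** (-beta)
        v = 1.0 + rho * (v - 1.0) + np.sqrt(1.0 - rho * rho) * rng.standard_normal()
        V[i - 1] = v
    V_bar = np.cumsum(V) / np.arange(1, n + 1)   # running means bar(V)_i
    return (X.T * (V / V_bar)).T                 # X_i^* = V_i / bar(V)_i * X_i
```

&lt;p&gt;Since $\rho_1 = 1 - 1^{-\beta} = 0$, the initial value $V_0$ has no influence on the generated weights.&lt;/p&gt;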

&lt;h3 id=&quot;rationale&quot;&gt;Rationale&lt;/h3&gt;

&lt;p&gt;Palm and Nagler&lt;sup id=&quot;fnref:2:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; notes that for the multiplier bootstrap to remain valid for time series:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;&lt;em&gt;The dependencies between weights $V_i$ and $V_j$ must increase with the sample size $n$, but at the same time remain almost independent when the time gap $|i-j|$ is sufficiently large compared to $n$&lt;/em&gt;&lt;sup id=&quot;fnref:2:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/li&gt;
  &lt;li&gt;&lt;em&gt;A scaling of the weights by their arithmetic mean is also necessary&lt;/em&gt;&lt;sup id=&quot;fnref:2:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This is what they propose with the autoregressive online bootstrap technique.&lt;/p&gt;

&lt;h3 id=&quot;properties&quot;&gt;Properties&lt;/h3&gt;

&lt;p&gt;The main result of Palm and Nagler&lt;sup id=&quot;fnref:2:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; is that under mild conditions on the original data $X_1, …, X_n$, the autoregressive online bootstrap is a &lt;a href=&quot;https://en.wikipedia.org/wiki/Consistency_(statistics)&quot;&gt;consistent&lt;/a&gt; resampling scheme 
for the mean and for any continuously differentiable transformation of the mean&lt;sup id=&quot;fnref:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt; of a univariate or multivariate time series.&lt;/p&gt;

&lt;h3 id=&quot;how-to-select-the-parameter-beta&quot;&gt;How to select the parameter $\beta$?&lt;/h3&gt;

&lt;p&gt;The parameter $\beta$ controls the behaviour of the autoregressive online bootstrap samples in a way similar to how the block length controls the behaviour of the block bootstrap samples in classical block bootstrap techniques.&lt;/p&gt;

&lt;p&gt;Intuitively:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;A small value of $\beta$ keeps the autocorrelations $\rho_i = 1 - i^{-\beta}$ away from 1, leading to quickly changing weights and thus to rapidly varying bootstrap samples.&lt;/p&gt;

    &lt;p&gt;This is similar in spirit to having a short block size in a block bootstrap technique.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;On the contrary, a large value of $\beta$ pushes $\rho_i$ closer to 1, leading to slowly changing weights and thus to long stretches of bootstrap samples that are consistent with the original observations, although up- or down-weighted.&lt;/p&gt;

    &lt;p&gt;This is similar in spirit to having a long block size in a block bootstrap technique.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Palm and Nagler&lt;sup id=&quot;fnref:2:13&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; demonstrates that $\beta_{opt} = \sqrt{2} - 1$ allows for &lt;em&gt;an optimal bias-variance trade-off&lt;/em&gt;&lt;sup id=&quot;fnref:2:14&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, so that unless there is a specific need to play with this parameter, using that value should be the default choice.&lt;/p&gt;
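&lt;p&gt;Since $\rho_i = 1 - i^{-\beta}$ approaches 1 faster when $\beta$ is larger, the persistence of the weights increases with $\beta$, which can be checked empirically. The following snippet - a hypothetical illustration, not part of Portfolio Optimizer - estimates the lag-1 autocorrelation of two weight sequences:&lt;/p&gt;

```python
import numpy as np

def ar_weights(n, beta, rng):
    """Generate n autoregressive online bootstrap weights for a given beta."""
    V = np.empty(n)
    v = 0.0                                   # V_0 = 0
    for i in range(1, n + 1):
        rho = 1.0 - i ** (-beta)
        v = 1.0 + rho * (v - 1.0) + np.sqrt(1.0 - rho * rho) * rng.standard_normal()
        V[i - 1] = v
    return V

def lag1_autocorr(x):
    """Sample lag-1 autocorrelation of a series."""
    x = x - x.mean()
    return float(np.dot(x[1:], x[:-1]) / np.dot(x, x))

rng = np.random.default_rng(123)
n = 5000
ac_small_beta = lag1_autocorr(ar_weights(n, beta=0.05, rng=rng))
ac_large_beta = lag1_autocorr(ar_weights(n, beta=0.45, rng=rng))
# The larger beta yields the more persistent, i.e. more slowly changing, weights.
```

&lt;p&gt;With these settings, the weight sequence generated with $\beta = 0.45$ is markedly more autocorrelated than the one generated with $\beta = 0.05$, mirroring the long v.s. short block size analogy.&lt;/p&gt;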

&lt;h3 id=&quot;practical-performances&quot;&gt;Practical performances&lt;/h3&gt;

&lt;p&gt;Palm and Nagler&lt;sup id=&quot;fnref:2:15&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; compares the practical performances of the autoregressive online bootstrap to an i.i.d. multiplier bootstrap and to a block bootstrap technique called &lt;em&gt;the moving average block bootstrap&lt;/em&gt;&lt;sup id=&quot;fnref:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;7&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In particular, Palm and Nagler&lt;sup id=&quot;fnref:2:16&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; show that:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;The two time series bootstraps &lt;em&gt;achieve approximately correct coverage in all [studied] scenarios, even in the presence of nonlinear [AR(2)-GARCH(1,1)] dependencies&lt;/em&gt;&lt;sup id=&quot;fnref:2:17&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; corresponding to a stochastic volatility process.&lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The autoregressive online bootstrap allows for cheap online updates - in constant time, as illustrated in Figure 1 - contrary to the moving average block bootstrap.&lt;/p&gt;

    &lt;figure&gt;
    &lt;a href=&quot;/assets/images/blog/more-bootstrap-simulations-computation-time-palm-nagler.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/more-bootstrap-simulations-computation-time-palm-nagler-small.png&quot; alt=&quot;Figure 1. Computation time per online update of 200 bootstrap samples as the bootstrap techniques progress through a stream of 2000 samples. Source: Palm and Nagler.&quot; /&gt;&lt;/a&gt;
    &lt;figcaption&gt;Figure 1. Computation time per online update of 200 bootstrap samples as the bootstrap techniques progress through a stream of 2000 samples. Source: Palm and Nagler.&lt;/figcaption&gt;
  &lt;/figure&gt;
  &lt;/li&gt;
  &lt;li&gt;The autoregressive online bootstrap has a slightly higher variance compared to the moving average block bootstrap, which is &lt;em&gt;the cost [to] pay for its computational advantage&lt;/em&gt;&lt;sup id=&quot;fnref:2:18&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/li&gt;
&lt;/ul&gt;
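&lt;p&gt;To make the constant-time claim concrete, the following sketch maintains bootstrap replicas of a running sample mean with a fixed amount of work per new observation, independent of the number of observations already processed (the class is my own illustrative construction, built on the per-sample definition $X_i^* = \frac{V_i}{\bar V_i} X_i$):&lt;/p&gt;

```python
import numpy as np

class OnlineARBootstrap:
    """Maintain B bootstrap replicas of a running sample mean with a
    constant amount of work per new observation (O(B), independent of n)."""

    def __init__(self, n_replicas, beta=np.sqrt(2) - 1, seed=None):
        self.rng = np.random.default_rng(seed)
        self.beta = beta
        self.i = 0
        self.v = np.zeros(n_replicas)      # current weights V_i, with V_0 = 0
        self.v_sum = np.zeros(n_replicas)  # running sums of the weights
        self.acc = np.zeros(n_replicas)    # running sums of X_j* = (V_j / Vbar_j) X_j

    def update(self, x):
        self.i += 1
        rho = 1.0 - self.i ** (-self.beta)
        noise = self.rng.standard_normal(self.v.shape)
        self.v = 1.0 + rho * (self.v - 1.0) + np.sqrt(1.0 - rho ** 2) * noise
        self.v_sum += self.v
        v_bar = self.v_sum / self.i        # running means of the weights
        self.acc += self.v / v_bar * x     # accumulate the bootstrap samples

    def bootstrap_means(self):
        return self.acc / self.i           # bootstrap replicas of the sample mean
```

&lt;p&gt;Feeding a stream of observations through &lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;update&lt;/code&gt; then provides, at any point in the stream, an approximate sampling distribution of the mean.&lt;/p&gt;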

&lt;h3 id=&quot;caveats&quot;&gt;Caveats&lt;/h3&gt;

&lt;p&gt;One important limitation of the autoregressive online bootstrap is that the weights $V_i$, $i=1..n$, can be negative.&lt;/p&gt;

&lt;p&gt;For this reason, in finance, logarithmic asset returns should be preferred to either asset prices (since bootstrapped prices could become negative) or arithmetic asset returns (since bootstrapped returns could become &amp;lt; -100%).&lt;/p&gt;
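&lt;p&gt;The following toy example, on made-up prices, illustrates the point: reweighted logarithmic returns always map back to strictly positive prices through the exponential, whereas a similarly reweighted arithmetic return can drop below -100%:&lt;/p&gt;

```python
import numpy as np

# Made-up daily prices, for illustration only
prices = np.array([100.0, 101.5, 99.8, 102.3, 103.0])
log_returns = np.diff(np.log(prices))

# However extreme the (possibly negative) reweighting of log returns,
# exponentiating their cumulative sum always yields positive prices
extreme = -25.0 * log_returns
rebuilt = prices[0] * np.exp(np.cumsum(extreme))
assert (rebuilt > 0.0).all()

# A similarly reweighted arithmetic return can fall below -100%,
# which would imply a negative price
arithmetic = np.diff(prices) / prices[:-1]
worst = (-60.0 * arithmetic).min()
assert -1.0 > worst   # i.e. worst is below -100%
```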

&lt;h2 id=&quot;implementations&quot;&gt;Implementations&lt;/h2&gt;

&lt;h3 id=&quot;implementation-in-portfolio-optimizer&quot;&gt;Implementation in Portfolio Optimizer&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;Portfolio Optimizer&lt;/strong&gt; implements the autoregressive online bootstrap through the endpoint &lt;a href=&quot;https://docs.portfoliooptimizer.io/&quot;&gt;&lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;/assets/returns/simulation/bootstrap/online&lt;/code&gt;&lt;/a&gt;.&lt;/p&gt;

&lt;h3 id=&quot;implementation-elsewhere&quot;&gt;Implementation elsewhere&lt;/h3&gt;

&lt;p&gt;A Python implementation of the autoregressive online bootstrap, by its authors, is available at &lt;a href=&quot;https://github.com/nicolaipalm/online-bootstrap-implementation&quot;&gt;https://github.com/nicolaipalm/online-bootstrap-implementation&lt;/a&gt;.&lt;/p&gt;

&lt;h2 id=&quot;example-of-usage---simulation-of-alternative-price-histories-for-etfs&quot;&gt;Example of usage - Simulation of alternative price histories for ETFs&lt;/h2&gt;

&lt;p&gt;As an example of usage, I propose to use the autoregressive online bootstrap to generate alternative price histories for the SPY and TLT ETFs and compute a couple of associated descriptive statistics&lt;sup id=&quot;fnref:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:9&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
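&lt;p&gt;In Python, such an experiment can be sketched as follows, here on a simulated price series standing in for an ETF’s adjusted closes (the figures below use actual SPY and TLT prices, which are not reproduced here):&lt;/p&gt;

```python
import numpy as np

def simulate_price_histories(prices, n_paths, beta=np.sqrt(2) - 1, seed=0):
    """Bootstrap the log returns of a price series and rebuild full price paths."""
    rng = np.random.default_rng(seed)
    prices = np.asarray(prices, dtype=float)
    log_r = np.diff(np.log(prices))
    n = len(log_r)
    idx = np.arange(1, n + 1)
    rho = 1.0 - idx ** (-beta)
    paths = np.empty((n_paths, n + 1))
    for p in range(n_paths):
        zeta = rng.standard_normal(n)
        v = np.empty(n)
        prev = 0.0                                 # V_0 = 0
        for i in range(n):
            prev = 1.0 + rho[i] * (prev - 1.0) + np.sqrt(1.0 - rho[i] ** 2) * zeta[i]
            v[i] = prev
        boot_r = v / (np.cumsum(v) / idx) * log_r  # X_i* = (V_i / Vbar_i) X_i
        paths[p] = prices[0] * np.exp(np.concatenate(([0.0], np.cumsum(boot_r))))
    return paths

# Simulated stand-in for one year of daily adjusted closes
rng = np.random.default_rng(42)
prices = 100.0 * np.exp(np.concatenate(([0.0], np.cumsum(rng.normal(0.0003, 0.01, 251)))))
paths = simulate_price_histories(prices, n_paths=10)
```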

&lt;h3 id=&quot;alternative-price-histories-for-the-spy-etf&quot;&gt;Alternative price histories for the SPY ETF&lt;/h3&gt;

&lt;p&gt;Figure 2 illustrates 10 synthetic price histories generated by applying the autoregressive online bootstrap to the logarithmic returns of the SPY ETF over the period 1st January 2025 - 31st December 2025&lt;sup id=&quot;fnref:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;figure&gt;
  &lt;a href=&quot;/assets/images/blog/more-bootstrap-simulations-spy-replicas.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/more-bootstrap-simulations-spy-replicas-small.png&quot; alt=&quot;Figure 2. Alternative price histories for the SPY ETF, autoregressive online bootstrap, 1st January 2025 - 31st December 2025.&quot; /&gt;&lt;/a&gt;
  &lt;figcaption&gt;Figure 2. Alternative price histories for the SPY ETF, autoregressive online bootstrap, 1st January 2025 - 31st December 2025.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;As is visible in Figure 2, the autoregressive online bootstrap allows a wide variety of scenarios to be generated.&lt;/p&gt;

&lt;p&gt;This is confirmed in Figure 3, which depicts the distributions of the first four moments of the logarithmic returns of 1000 synthetic price histories generated by applying the autoregressive online bootstrap to the logarithmic returns of the SPY ETF over the period 1st January 2025 - 31st December 2025&lt;sup id=&quot;fnref:8:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;figure&gt;
  &lt;a href=&quot;/assets/images/blog/more-bootstrap-simulations-spy-replicas-moments.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/more-bootstrap-simulations-spy-replicas-moments-small.png&quot; alt=&quot;Figure 3. Distributions of the first four moments of 1000 alternative price histories for the SPY ETF, autoregressive online bootstrap, 1st January 2025 - 31st December 2025.&quot; /&gt;&lt;/a&gt;
  &lt;figcaption&gt;Figure 3. Distributions of the first four moments of 1000 alternative price histories for the SPY ETF, autoregressive online bootstrap, 1st January 2025 - 31st December 2025.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Figure 3 empirically demonstrates two points:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;It confirms the mean-preservation property of the autoregressive online bootstrap.&lt;/p&gt;

    &lt;p&gt;As a side note, in case this is a problem for the usage at hand, it is always possible to alter the mean of the generated scenarios in a perfectly controlled way, cf. &lt;a href=&quot;/blog/bootstrap-simulations-with-exact-sample-mean-vector-and-sample-covariance-matrix&quot;&gt;a previous blog post&lt;/a&gt;.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;It shows that the autoregressive online bootstrap can generate scenarios with a wildly different standard deviation/skewness/kurtosis compared to the original one.&lt;/p&gt;

    &lt;p&gt;Note that some scenarios might seem implausible, with a kurtosis &amp;gt; 100 for example.&lt;/p&gt;

    &lt;p&gt;Nevertheless, implausible does not equal impossible&lt;sup id=&quot;fnref:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:10&quot; class=&quot;footnote&quot;&gt;10&lt;/a&gt;&lt;/sup&gt;, so that depending on the usage at hand, such scenarios could be eliminated or put aside for specific stress-testing.&lt;/p&gt;

    &lt;p&gt;Figure 4 illustrates the variety of scenarios in another way, by looking at the correlation between the logarithmic returns of the SPY ETF and those of its 1000 alternatives, which ranges approximately from -0.4 to 1.&lt;/p&gt;

    &lt;figure&gt;
    &lt;a href=&quot;/assets/images/blog/more-bootstrap-simulations-spy-replicas-correlation.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/more-bootstrap-simulations-spy-replicas-correlation-small.png&quot; alt=&quot;Figure 4. Distribution of the correlation between the SPY ETF price history and 1000 alternative price histories, autoregressive online bootstrap, 1st January 2025 - 31st December 2025.&quot; /&gt;&lt;/a&gt;
    &lt;figcaption&gt;Figure 4. Distribution of the correlation between the SPY ETF price history and 1000 alternative price histories, autoregressive online bootstrap, 1st January 2025 - 31st December 2025.&lt;/figcaption&gt;
  &lt;/figure&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;alternative-price-histories-for-the-spy-and-tlt-etfs&quot;&gt;Alternative price histories for the SPY and TLT ETFs&lt;/h3&gt;

&lt;p&gt;Figure 5 illustrates the behaviour of the autoregressive online bootstrap in terms of bivariate correlation when applied to the logarithmic returns of both the SPY and TLT ETFs over the period 1st January 2025 - 31st December 2025&lt;sup id=&quot;fnref:8:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;figure&gt;
  &lt;a href=&quot;/assets/images/blog/more-bootstrap-simulations-spy-tlt-replicas-correlation.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/more-bootstrap-simulations-spy-tlt-replicas-correlation-small.png&quot; alt=&quot;Figure 5. Distribution of the correlation between the two components of 1000 alternative price histories for the SPY and TLT ETFs, autoregressive online bootstrap, 1st January 2025 - 31st December 2025.&quot; /&gt;&lt;/a&gt;
  &lt;figcaption&gt;Figure 5. Distribution of the correlation between the two components of 1000 alternative price histories for the SPY and TLT ETFs, autoregressive online bootstrap, 1st January 2025 - 31st December 2025.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;From Figure 5, it appears that the whole range of admissible correlations $[-1,1]$ is achievable through the autoregressive online bootstrap, with the majority of scenarios falling in the interval $[-0.25, 0.50]$.&lt;/p&gt;
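&lt;p&gt;In the multivariate case, a single weight sequence is applied to the whole return vector $X_i$, so that each replica induces its own cross-correlation. The computation behind a figure like Figure 5 can be sketched as follows, on simulated bivariate returns (function name and data are mine, for illustration only):&lt;/p&gt;

```python
import numpy as np

def joint_bootstrap_correlations(returns, n_replicas, beta=np.sqrt(2) - 1, seed=0):
    """For each replica, apply one weight sequence to both columns of the
    bivariate returns (X_i is a vector) and record the cross-correlation."""
    rng = np.random.default_rng(seed)
    returns = np.asarray(returns, dtype=float)
    n = len(returns)
    idx = np.arange(1, n + 1)
    rho = 1.0 - idx ** (-beta)
    corrs = np.empty(n_replicas)
    for b in range(n_replicas):
        zeta = rng.standard_normal(n)
        v = np.empty(n)
        prev = 0.0                         # V_0 = 0
        for i in range(n):
            prev = 1.0 + rho[i] * (prev - 1.0) + np.sqrt(1.0 - rho[i] ** 2) * zeta[i]
            v[i] = prev
        w = v / (np.cumsum(v) / idx)       # V_i / Vbar_i
        boot = w[:, None] * returns        # same weights for both components
        corrs[b] = np.corrcoef(boot[:, 0], boot[:, 1])[0, 1]
    return corrs

# Simulated bivariate daily log returns with mild negative correlation
rng = np.random.default_rng(7)
cov = [[1e-4, -3e-5], [-3e-5, 1e-4]]
returns = rng.multivariate_normal([0.0, 0.0], cov, size=252)
corrs = joint_bootstrap_correlations(returns, n_replicas=200)
```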

&lt;h2 id=&quot;conclusion&quot;&gt;Conclusion&lt;/h2&gt;

&lt;p&gt;The autoregressive online bootstrap introduced in Palm and Nagler&lt;sup id=&quot;fnref:2:19&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; and detailed in this blog post provides an alternative to classical block bootstrap techniques for time series that is specifically tailored to streaming data.&lt;/p&gt;

&lt;p&gt;Don’t hesitate to experiment with it and evaluate whether it could replace your current bootstrap methodology!&lt;/p&gt;

&lt;p&gt;As usual, feel also free to &lt;a href=&quot;https://www.linkedin.com/in/roman-rubsamen/&quot;&gt;connect with me on LinkedIn&lt;/a&gt; or to &lt;a href=&quot;https://twitter.com/portfoliooptim&quot;&gt;follow me on Twitter&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;–&lt;/p&gt;

&lt;div class=&quot;footnotes&quot; role=&quot;doc-endnotes&quot;&gt;
  &lt;ol&gt;
    &lt;li id=&quot;fn:1&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3964908&quot;&gt;Anarkulova, Aizhan and Cederburg, Scott and O’Doherty, Michael S., The Long-Horizon Returns of Stocks, Bonds, and Bills: Evidence from a Broad Sample of Developed Markets (November 15, 2021)&lt;/a&gt;. &lt;a href=&quot;#fnref:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:2&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://proceedings.mlr.press/v238/palm24a/palm24a.pdf&quot;&gt;Nicolai Palm, Thomas Nagler; An Online Bootstrap for Time Series; Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, PMLR 238:190-198&lt;/a&gt;. &lt;a href=&quot;#fnref:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:2:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;10&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;11&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:11&quot; class=&quot;reversefootnote&quot; 
role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;12&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:12&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;13&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:13&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;14&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:14&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;15&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:15&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;16&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:16&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;17&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:17&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;18&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:18&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;19&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:19&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;20&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:3&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://link.springer.com/book/10.1007/978-1-4757-2545-2&quot;&gt;Van Der Vaart, A. W. and Wellner, J. A. Weak convergence. In Weak convergence and empirical processes, pp. 16–28. Springer, 1996&lt;/a&gt;. &lt;a href=&quot;#fnref:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:3:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:4&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The observations $X_1, …, X_n$ can be observations of random variables or of random vectors. &lt;a href=&quot;#fnref:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:5&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;It is also possible that for computational reasons - like a huge $n$ - &lt;em&gt;a complete data set is available from the start&lt;/em&gt;&lt;sup id=&quot;fnref:2:20&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, but is processed &lt;em&gt;sequentially or in batches&lt;/em&gt;&lt;sup id=&quot;fnref:2:21&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:6&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Like the variance, c.f. Palm and Nagler&lt;sup id=&quot;fnref:2:22&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:7&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.research-collection.ethz.ch/entities/publication/17842c19-7fae-4937-b759-6c13e857d8eb&quot;&gt;Bühlmann, P. L. (1993). The blockwise bootstrap in time series and empirical processes. PhD thesis, ETH Zurich&lt;/a&gt;. &lt;a href=&quot;#fnref:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:9&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;In practice, the generation of such alternative price histories would typically be integrated into a backtesting engine. &lt;a href=&quot;#fnref:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:8&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;(Adjusted) prices of the SPY and TLT ETFs have been retrieved using &lt;a href=&quot;https://api.tiingo.com/&quot;&gt;Tiingo&lt;/a&gt;. &lt;a href=&quot;#fnref:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:8:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:8:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:10&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Who could have predicted the price action of gold and silver on 30th January 2026? &lt;a href=&quot;#fnref:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
  &lt;/ol&gt;
&lt;/div&gt;</content><author><name>Roman R.</name></author><category term="bootstrap" /><category term="monte carlo" /><summary type="html">In a previous article, I described several classical bootstrap techniques — i.i.d. bootstrap, circular block bootstrap, and stationary block bootstrap — and showed how the stationary block bootstrap could be used to simulate future price paths for financial assets by following the methodology of Anarkulova et al.1. In this blog post, I will detail another bootstrap technique called the autoregressive online bootstrap2 and introduced in Palm and Nagler2, that is best described as a multiplier bootstrap3 coupled with an autoregressive sequence of weights specifically chosen to make it useable with streaming time series data. As an example of usage, I will simulate alternative price histories for the SPY and TLT ETFs, which are ETFs representative of the US stock market and of the long-term US Treasury bonds. Mathematical preliminaries Let $X_1, …, X_n$4, $n \geq 1$ be a sample of data observed from a population. Limitations of classical bootstrap techniques in an online setting In a typical5 online setting, the length $n$ of the sample of data $X_1, …, X_n$ is growing infinitely, with new events [that] are observed at the moment they occur2. In such a setting, the classical bootstrap techniques described in the previous blog post on bootstrap simulations require all data observed so far2 at any given point in time. Indeed: The i.i.d. bootstrap, by definition, requires keeping track of the entire observed sample $ {X_1, …, X_n} $2. The circular and the stationary block bootstrap require all blocks […] to increase in size with $n$2, so that to compute the bootstrap in practice, the entire data set $ {X_1, …, X_n} $ needs to be kept in memory and processed fully, every time the block size changes2. 
Depending on the domain of application, this can be a serious computational limitation when: n is large n is moderately large but the underlying $n$ observations require a lot of computer memory to be stored n is moderately large but the computation time is limited (e.g. real-time applications) The multiplier bootstrap The multiplier bootstrap3 is a general class of bootstrapping schemes based on perturbations of the original observations with suitable weights2. In other words, compared to classical bootstrap techniques, the multiplier bootstrap replaces the idea of “randomly resampling observations” with the idea of “randomly reweighting observations”, which enables it to be applied in an online setting. For i.i.d. data $X_1, …, X_n$, it is defined as follows: Let $V_1, …, V_n$ be $n$ i.i.d. random variables of unit mean/variance. Let $ \bar V_i = \frac{1}{i} \sum_{j=1}^{i} V_j $ the running mean of these random variables, which can be computed recursively through the formula \[\bar V_i = \frac{(i-1) \bar V_{i-1} + V_i }{i}\] The multiplier bootstrap samples $ X_1^*,…,X_n^* $ are then defined as \[X_i^* = \frac{V_i}{\bar V_i} X_i\] The autoregressive online bootstrap Methodology The autoregressive online bootstrap2 is a specific instance of the multiplier bootstrap that generates a sequence of random weights evolving according to an autoregressive process centered around 1: \[V_i = 1 + \rho_i \left( V_{i-1} - 1 \right) + \sqrt{1 - \rho_i^2} \zeta_i, i=1..n\] , with: $V_0 = 0$ $\rho_i = 1 - i^{-\beta}$, $ 0 &amp;lt; \beta &amp;lt; \frac{1}{2}$ $\zeta_i \sim \mathcal N(0,1), i=1..n$ The associated autoregressive online bootstrap samples $ X_1^*,…,X_n^* $ are then defined as \[X_i^* = \frac{V_i}{\bar V_i} X_i\] Rationale Palm and Nagler2 notes that for the multiplier bootstrap to remain valid for time series: The dependencies between weights $V_i$ and $V_j$ must increase with the sample size $n$, but at the same time remain almost independent when the time gap 
$|i-j|$ is sufficiently large compared to $n$2. A scaling of the weights by their arithmetic mean is also necessary2 This is what they propose with the autoregressive online bootstrap technique. Properties The main result of Palm and Nagler2 is that under mild conditions on the original data $X_1, …, X_n$, the autoregressive online bootstrap is a consistent resampling scheme for the mean and for any continously differentiable transformation of the mean6 of an univariate or a multivariate time series. How to select the parameter $\beta$? The parameter $\beta$ controls the behaviour of the autoregressive online bootstrap samples in a way similar to how the block length controls the behaviour of the block bootstrap samples in classical block bootstrap techniques. Intuitively: A small value of $\beta$ leads to slowly changing weights and thus to long stretches of bootstrap samples that are consistent with the original observations, although up- or down-weighted. This is similar in spirit to having a long block size in a block bootstrap technique. On the contrary, a large value of $\beta$ leads to quickly changing weights and thus to rapidly varying bootstrap samples. This is similar in spirit to having a short block size in a block bootstrap technique. Palm and Nagler2 demonstrates that $\beta_{opt} = \sqrt{2} - 1$ allows for an optimal bias-variance trade-off2, so that unless there is a specific need to play with this parameter, using that value should be the default choice. Practical performances Palm and Nagler2 compares the practical performances of the autoregressive online bootstrap to an i.i.d. multiplier bootstrap and to a block bootstrap technique called the moving average block bootstrap7. In particular, Palm and Nagler2 shows that: The two time series bootstraps achieve approximately correct coverage in all [studied] scenarios, even in the presence of nonlinear [AR(2)-GARCH(1,1)] dependencies2 corresponding to a stochastic volatility process. 
The autoregressive online bootstrap allows for cheap online updates - in constant time, as illustrated in Figure 1 - contrary to the moving average block bootstrap. Figure 1. Computation time per online update of 200 bootstrap samples as the bootstrap techniques progress through a stream of 2000 samples. Source: Palm and Nagler. The autoregressive online bootstrap has a slightly higher variance compared to the moving average block bootstrap, which is the cost [to] pay for its computational advantage2. Caveats One important limitation of the autoregressive online bootstrap is that the weights $V_i$,$i=1..n$ can be negative. For this reason, in finance, logarithmic asset returns should then be prefered to either asset prices (since they could become negative) or arithmetic asset returns (singe they could become &amp;lt; -100%). Implementations Implementation in Portfolio Optimizer Portfolio Optimizer implements the autoregressive online bootstrap through the endpoint /assets/returns/simulation/bootstrap/online. Implementation elsewhere An implementation of the autoregressive online bootstrap in Python is available at https://github.com/nicolaipalm/online-bootstrap-implementation by its authors. Example of usage - Simulation of alternative price histories for ETFs As an example of usage, I propose to use the autoregressive online bootstrap to generate alternative price histories for the SPY and TLT ETFs and compute a couple of associated descriptive statistics8. Alternative price histories for the SPY ETF Figure 2 illustrates 10 synthetic price histories generated by applying the autoregressive online bootstrap to the logarithmic returns of the SPY ETF over the period 1st January 2025 - 31th December 20259. Figure 2. Alternative price histories for the SPY ETF, autoregressive online bootstrap, 1st January 2025 - 31th December 2025. On Figure 2, it is visible that the autoregressive online bootstrap allows for a wide variety of scenarios to be generated. 
This is confirmed in Figure 3, which depicts the distributions of the first four moments of the logarithmic returns of 1000 synthetic price histories generated by applying the autoregressive online bootstrap to the logarithmic returns of the SPY ETF over the period 1st January 2025 - 31th December 20259. Figure 3. Distributions of the first four moments of 1000 alternative price histories for the SPY ETF, autoregressive online bootstrap, 1st January 2025 - 31th December 2025. Figure 3 empirically demonstrates two points: It confirms the mean-preservation property of the autoregressive online bootstrap. As a side note, in case this is a problem for the usage at hand, it is always possible to alter the mean of the generated scenarios in a perfectly controlled way, c.f. a previous blog post. It shows that the autoregressive online bootstrap can generate scenarios with a wildly different standard deviation/skewness/kurtosis compared to the original one. Here, to be noted that some scenarios might seem implausible, with a kurtosis &amp;gt; 100 for example. Nevertheless, implausible does not equal impossible10, so that depending on the usage at hand, such scenarios could be eliminated or put aside for specific stress-testing. Figure 4 illustrates the variety of scenarios in another way, by looking at the correlation between the logarithmic returns of the SPY ETF and of its 1000 alternatives, which approximatively ranges from -0.4 to 1. Figure 4. Distribution of the correlation between the SPY ETF price history and 1000 alternative price histories, autoregressive online bootstrap, 1st January 2025 - 31th December 2025. Alternative price histories for the SPY and TLT ETFs Figure 5 illustrates the behaviour of the autoregressive online boostrap in terms of bivariate correlation when applied to the logarithmic returns of both the SPY and TLT ETFs over the period 1st January 2025 - 31th December 20259. Figure 5. 
Distribution of the correlation between the two components of 1000 alternative price histories for the SPY and TLT ETFs, autoregressive online bootstrap, 1st January 2025 - 31th December 2025. From Figure 5, it appears that the whole range of admissible correlations $[-1,1]$ is achievable through the autoregressive online bootstrap, with the majority of scenarios falling in the interval $[-0.25, 0.50]$. Conclusion The autoregressive online bootstrap introduced in Palm and Nagler2 and detailled in this blog post provides an alternative to classical block bootstrap techniques for time series that is specifically taylored to streaming data. Don’t hesitate to experiment with it and evaluate whether it could replace your current boostrap methodology! As usual, feel also free to connect with me on LinkedIn or to follow me on Twitter. – See Anarkulova, Aizhan and Cederburg, Scott and O’Doherty, Michael S., The Long-Horizon Returns of Stocks, Bonds, and Bills: Evidence from a Broad Sample of Developed Markets (November 15, 2021). &amp;#8617; See Nicolai Palm, Thomas Nagler; An Online Bootstrap for Time Series; Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, PMLR 238:190-198. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 &amp;#8617;7 &amp;#8617;8 &amp;#8617;9 &amp;#8617;10 &amp;#8617;11 &amp;#8617;12 &amp;#8617;13 &amp;#8617;14 &amp;#8617;15 &amp;#8617;16 &amp;#8617;17 &amp;#8617;18 &amp;#8617;19 &amp;#8617;20 See Van Der Vaart, A. W. and Wellner, J. A. Weak convergence. In Weak convergence and empirical processes, pp. 16–28. Springer, 1996. &amp;#8617; &amp;#8617;2 The observations $X_1, …, X_n$ can be observations of random variables or of random vectors. &amp;#8617; It is also possible that for computational reasons - like a huge $n$ - a complete data set is available from the start2, but is processed sequentially or in batches2. &amp;#8617; Like the variance, c.f. Palm and Nagler2. 
&amp;#8617; See Buhlmann, P. L. (1993). The blockwise bootstrap in time series and empirical processes. PhD thesis, ETH Zurich. &amp;#8617; In practice, the generation of such alternative price histories would typically be integrated into a backtesting engine. &amp;#8617; (Adjusted) prices of the SPY and TLT ETFs have have been retrieved using Tiingo. &amp;#8617; &amp;#8617;2 &amp;#8617;3 Who could have predicted the price action of gold and silver on 30th January 2026? &amp;#8617;</summary></entry><entry><title type="html">Completing a Correlation Matrix: Another Problem from Finance</title><link href="https://portfoliooptimizer.io/blog/completing-a-correlation-matrix-another-problem-from-finance/" rel="alternate" type="text/html" title="Completing a Correlation Matrix: Another Problem from Finance" /><published>2026-01-06T00:00:00-06:00</published><updated>2026-01-06T00:00:00-06:00</updated><id>https://portfoliooptimizer.io/blog/completing-a-correlation-matrix-another-problem-from-finance</id><content type="html" xml:base="https://portfoliooptimizer.io/blog/completing-a-correlation-matrix-another-problem-from-finance/">&lt;p&gt;&lt;a href=&quot;/blog/computing-the-nearest-correlation-matrix-a-problem-from-finance/&quot;&gt;The previous post&lt;/a&gt; of this series on mathematical problems related to &lt;a href=&quot;https://en.wikipedia.org/wiki/Correlation_and_dependence#Correlation_matrices&quot;&gt;correlation matrices&lt;/a&gt; introduced 
&lt;em&gt;the nearest correlation matrix problem&lt;/em&gt;&lt;sup id=&quot;fnref:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;, which consists in determining the closest&lt;sup id=&quot;fnref:18&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:18&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; valid correlation matrix to an approximate correlation matrix.&lt;/p&gt;

&lt;p&gt;In this blog post, I will now describe &lt;em&gt;the correlation matrix completion problem&lt;/em&gt;, which consists in filling in the missing coefficients of a partially specified correlation matrix in order to produce a valid correlation matrix.&lt;/p&gt;

&lt;p&gt;As noted in Dreyer et al.&lt;sup id=&quot;fnref:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;, &lt;em&gt;such matrices often arise in financial applications when the number of stochastic variables becomes large or when several smaller models are combined in a larger model&lt;/em&gt;&lt;sup id=&quot;fnref:6:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;After a couple of reminders on correlation matrices, I will detail the mathematical formulation of the correlation matrix completion problem, discuss its exact and approximate solutions, and finally illustrate how this problem 
naturally appears when working with &lt;a href=&quot;/blog/capital-market-assumptions-combining-institutions-forecasts-for-improved-accuracy/&quot;&gt;capital market assumptions from financial institutions&lt;/a&gt;.&lt;/p&gt;

&lt;h2 id=&quot;mathematical-preliminaries&quot;&gt;Mathematical preliminaries&lt;/h2&gt;

&lt;p&gt;Let $n$ be a number of assets.&lt;/p&gt;

&lt;h3 id=&quot;correlation-matrices&quot;&gt;Correlation matrices&lt;/h3&gt;

&lt;p&gt;Let $C \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ be a square matrix.&lt;/p&gt;

&lt;p&gt;$C$ is a correlation matrix if and only if&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$C$ is symmetric: $C {}^t = C$&lt;/li&gt;
  &lt;li&gt;$C$ is unit diagonal: $C_{i,i} = 1$, $i = 1,…,n$&lt;/li&gt;
  &lt;li&gt;$C$ is &lt;a href=&quot;https://en.wikipedia.org/wiki/Positive_semidefinite_matrix&quot;&gt;positive semi-definite&lt;/a&gt;, that is, $x {}^t C x \geqslant 0, \forall x \in \mathbb{R}^n $&lt;/li&gt;
&lt;/ul&gt;
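&lt;p&gt;As a concrete illustration, the three defining properties above can be checked numerically; the following Python sketch (the function name is mine) tests positive semi-definiteness through the eigenvalues of the matrix:&lt;/p&gt;

```python
import numpy as np

def is_correlation_matrix(C, tol=1e-10):
    """Check the three defining properties of a correlation matrix."""
    C = np.asarray(C, dtype=float)
    symmetric = np.allclose(C, C.T)
    unit_diagonal = np.allclose(np.diag(C), 1.0)
    # Positive semi-definite iff all eigenvalues are non-negative, up to a tolerance
    psd = bool(np.greater_equal(np.linalg.eigvalsh((C + C.T) / 2), -tol).all())
    return symmetric and unit_diagonal and psd

C_valid = np.array([[1.0, 0.5],
                    [0.5, 1.0]])
C_invalid = np.array([[1.0, 2.0],
                      [2.0, 1.0]])  # a "correlation" of 2 violates positive semi-definiteness
```

&lt;p&gt;In floating-point arithmetic, a small tolerance is needed, because eigenvalues of perfectly valid correlation matrices are routinely computed as tiny negative numbers.&lt;/p&gt;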

&lt;h3 id=&quot;the-sets-of-correlation-matrices&quot;&gt;The sets of correlation matrices&lt;/h3&gt;

&lt;p&gt;Consider the following convex sets:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$\mathcal{S}^n_{+} = \{ X \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ such that $X {}^t = X$ and $X$ is positive semi-definite $\}$&lt;/li&gt;
  &lt;li&gt;$\mathcal{S}^n_{++} = \{ X \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ such that $X {}^t = X$ and $X$ is positive definite $\}$&lt;/li&gt;
  &lt;li&gt;$\mathcal{E}^n = \{ X \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ such that $X {}^t = X$ and $x_{ii} = 1, i = 1,…,n \}$&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Then:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;The set of correlation matrices is the convex compact set $\mathcal{S}^n_{+} \cap \mathcal{E}^n$.&lt;/li&gt;
  &lt;li&gt;The set of invertible correlation matrices is the open bounded convex set $\mathcal{S}^n_{++} \cap \mathcal{E}^n$.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;As a geometrical side note, the set of vectorized correlation matrices defines a convex subset of the hypercube $[-1,1]^{n(n-1)/2}$ called &lt;em&gt;the elliptope&lt;/em&gt;, 
c.f. for example &lt;a href=&quot;https://www.convexoptimization.com/dattorro/elliptope_and_fantope.html&quot;&gt;the website Convex Optimization&lt;/a&gt;.&lt;/p&gt;

&lt;h3 id=&quot;partial-correlation-matrices-completion&quot;&gt;Partial correlation matrices completion&lt;/h3&gt;

&lt;p&gt;A partial correlation matrix $C \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ is a symmetric unit diagonal matrix&lt;sup id=&quot;fnref:19&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:19&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt; whose coefficients $c_{ij} = c_{ji}$ are not all specified.&lt;/p&gt;

&lt;p&gt;A partial correlation matrix is said to be partial positive (semi-)definite when its fully specified &lt;a href=&quot;https://en.wikipedia.org/wiki/Minor_(linear_algebra)&quot;&gt;principal submatrices&lt;/a&gt; 
are positive (semi-)definite.&lt;/p&gt;

&lt;p&gt;A positive semi-definite completion - or simply a completion - of a partial correlation matrix $C \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ is a correlation matrix $C^* \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ 
such that $c_{ij}^{*} = c_{ij}$ whenever the coefficient $c_{ij}$ is specified in $C$.&lt;/p&gt;

&lt;p&gt;A positive definite completion of a partial correlation matrix $C \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ is a positive definite correlation matrix $C^* \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ 
such that $c_{ij}^{*} = c_{ij}$ whenever the coefficient $c_{ij}$ is specified in $C$.&lt;/p&gt;

&lt;h3 id=&quot;undirected-graph-associated-to-a-correlation-matrix&quot;&gt;Undirected graph associated to a correlation matrix&lt;/h3&gt;

&lt;p&gt;Let $C \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ be a partial correlation matrix.&lt;/p&gt;

&lt;p&gt;It is possible&lt;sup id=&quot;fnref:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt; to associate &lt;a href=&quot;https://en.wikipedia.org/wiki/Graph_(discrete_mathematics)#Graph&quot;&gt;an undirected graph&lt;/a&gt; $G = \left( V, E \right)$ to $C$, defined as follows:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$V = \{ 1, 2, …, n \}$ is the set of vertices&lt;/li&gt;
  &lt;li&gt;$E$ is the set of edges, with $(i,j) \in E$, $i \ne j$, whenever the coefficient $c_{ij}$ is specified.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The graph $G$ is then said to be &lt;em&gt;chordal&lt;/em&gt; if &lt;em&gt;every cycle of length greater than or equal to 4 has a chord, which is an edge that is not part of the cycle but connects two vertices of the cycle&lt;/em&gt;&lt;sup id=&quot;fnref:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;As a reminder, &lt;a href=&quot;https://en.wikipedia.org/wiki/Cycle_(graph_theory)&quot;&gt;a cycle&lt;/a&gt; in $G$ is a sequence of $k \geq 3$ pairwise distinct vertices $\left( v_1, …, v_k \right)$ such that 
$\left( v_1, v_2 \right)$, $\left( v_2, v_3 \right)$, …, $\left( v_{k-1}, v_k \right)$, $\left( v_k, v_1 \right)$ $\in E$, with $k$ called the length of the cycle.&lt;/p&gt;

&lt;h2 id=&quot;the-correlation-matrix-completion-problem&quot;&gt;The correlation matrix completion problem&lt;/h2&gt;

&lt;h3 id=&quot;problem-formulation&quot;&gt;Problem formulation&lt;/h3&gt;

&lt;p&gt;Consider:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$C \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ a partial correlation matrix.&lt;/li&gt;
  &lt;li&gt;$\mathcal{N}$ the index set of the specified off-diagonal elements of $C$.&lt;/li&gt;
  &lt;li&gt;$\mathcal{U}^n = \{ X \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ such that $x_{ij}=c_{ij}$ for $(i,j) \in \mathcal{N} \}$&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Then:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;The correlation matrix completion problem is the problem of finding a correlation matrix $C^* \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ that completes the matrix $C$, that is, finding $C^* \in \mathcal{S}^n_{+} \cap \mathcal{U}^n$.&lt;/li&gt;
  &lt;li&gt;The positive definite correlation matrix completion problem is the problem of finding a positive definite correlation matrix $C^* \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ that completes the matrix $C$, that is, finding $C^* \in \mathcal{S}^n_{++} \cap \mathcal{U}^n$.&lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;existence-of-solutions&quot;&gt;Existence of solutions&lt;/h3&gt;

&lt;p&gt;Contrary to &lt;a href=&quot;/blog/computing-the-nearest-correlation-matrix-a-problem-from-finance/&quot;&gt;the nearest correlation matrix problem&lt;/a&gt;, the correlation matrix completion problem does not always admit a solution.&lt;/p&gt;

&lt;p&gt;Indeed, a first necessary condition for a completion to exist is that the partial correlation matrix be partial positive semi-definite&lt;sup id=&quot;fnref:8:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;So, for example, the following partial correlation matrix does not admit any completion:&lt;/p&gt;

\[C_1 = 
\begin{pmatrix}
    1 &amp;amp; -1 &amp;amp; -1 &amp;amp; -1 \\
    -1 &amp;amp; 1 &amp;amp; -1 &amp;amp; -1 \\
    -1 &amp;amp; -1 &amp;amp; 1 &amp;amp; . \\
    -1 &amp;amp; -1 &amp;amp; . &amp;amp; 1 
\end{pmatrix}\]
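&lt;p&gt;This can be checked numerically: the fully specified $3 \times 3$ principal submatrix of $C_1$ on rows and columns 1, 2 and 3 has a negative eigenvalue, so that $C_1$ is not partial positive semi-definite. A quick numpy verification:&lt;/p&gt;

```python
import numpy as np

# Fully specified principal submatrix of C_1 on rows and columns 1, 2, 3
M = np.array([[ 1.0, -1.0, -1.0],
              [-1.0,  1.0, -1.0],
              [-1.0, -1.0,  1.0]])

eigenvalues = np.linalg.eigvalsh(M)  # in ascending order: -1.0, 2.0, 2.0
min_eig = eigenvalues.min()          # -1.0, so M is not positive semi-definite
```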

&lt;p&gt;Beyond this necessary condition, Fiedler&lt;sup id=&quot;fnref:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:9&quot; class=&quot;footnote&quot;&gt;7&lt;/a&gt;&lt;/sup&gt;, Grone et al.&lt;sup id=&quot;fnref:8:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt; and Smith&lt;sup id=&quot;fnref:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt; all establish the following result&lt;sup id=&quot;fnref:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:10&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;A partial positive semi-definite correlation matrix is completable regardless of the values of the specified correlations if and only if the undirected graph associated to these specified correlations is &lt;a href=&quot;https://en.wikipedia.org/wiki/Chordal_graph&quot;&gt;&lt;em&gt;chordal&lt;/em&gt;&lt;/a&gt;.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Note that when the graph is not chordal, nothing can be said in general, because the existence of a completion then depends on the exact values of the specified correlations.&lt;/p&gt;

&lt;p&gt;As an illustration:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;Let $C_2 \in \mathcal{M} \left( \mathbb{R}^{4 \times 4} \right)$ be the following partial positive semi-definite correlation matrix:&lt;/p&gt;

\[C_2 = 
  \begin{pmatrix}
      1 &amp;amp; 1 &amp;amp; . &amp;amp; 1 \\
      1 &amp;amp; 1 &amp;amp; 1 &amp;amp; . \\
      . &amp;amp; 1 &amp;amp; 1 &amp;amp; . \\
      1 &amp;amp; . &amp;amp; . &amp;amp; 1 
  \end{pmatrix}\]

    &lt;p&gt;The undirected graph associated to that partial correlation matrix is the graph with the set of vertices $V = \{ 1, 2, 3, 4 \}$ and the set of edges $E = \{ (1,2), (1,4), (2,3) \}$, depicted in Figure 1.&lt;/p&gt;

    &lt;figure&gt;
    &lt;a href=&quot;/assets/images/blog/correlation-matrix-completion-chordal-graph-example.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/correlation-matrix-completion-chordal-graph-example-small.png&quot; alt=&quot;Figure 1. Undirected graph associated to the partial correlation matrix C2.&quot; /&gt;&lt;/a&gt;
    &lt;figcaption&gt;Figure 1. Undirected graph associated to the partial correlation matrix C2.&lt;/figcaption&gt;
  &lt;/figure&gt;

    &lt;p&gt;From Figure 1, that graph is chordal, because it contains no cycle of length greater than or equal to 4 (indeed, being a tree, it contains no cycle at all).&lt;/p&gt;

    &lt;p&gt;Consequently, the partial correlation matrix $C_2$ admits a completion; moreover, the existence of a completion does not depend on the values of the specified correlations (here, all 1s), only on their pattern.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Let now $C_3 \in \mathcal{M} \left( \mathbb{R}^{4 \times 4} \right)$ be the following partial positive semi-definite correlation matrix:&lt;/p&gt;

\[C_3 = 
  \begin{pmatrix}
      1 &amp;amp; 1 &amp;amp; . &amp;amp; 0 \\
      1 &amp;amp; 1 &amp;amp; 1 &amp;amp; . \\
      . &amp;amp; 1 &amp;amp; 1 &amp;amp; 1 \\
      0 &amp;amp; . &amp;amp; 1 &amp;amp; 1 
  \end{pmatrix}\]

    &lt;p&gt;The undirected graph associated to that partial correlation matrix is the graph with the set of vertices $V = \{ 1, 2, 3, 4 \}$ and the set of edges $E = \{ (1,2), (1,4), (2,3), (3,4) \}$, depicted in Figure 2.&lt;/p&gt;

    &lt;figure&gt;
    &lt;a href=&quot;/assets/images/blog/correlation-matrix-completion-chordal-graph-example-2.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/correlation-matrix-completion-chordal-graph-example-2-small.png&quot; alt=&quot;Figure 2. Undirected graph associated to the partial correlation matrix C3.&quot; /&gt;&lt;/a&gt;
    &lt;figcaption&gt;Figure 2. Undirected graph associated to the partial correlation matrix C3.&lt;/figcaption&gt;
  &lt;/figure&gt;

    &lt;p&gt;From Figure 2, that graph is not chordal, because its only cycle, of length 4, does not have any chord.&lt;/p&gt;

    &lt;p&gt;Consequently, the partial correlation matrix $C_3$ might or might not admit a completion; nothing can be said at this stage without examining the values of the specified correlations.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;
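&lt;p&gt;The chordality check in both examples can be automated; assuming the networkx Python library is available, a short sketch confirms that the graph associated to $C_2$ is chordal while the graph associated to $C_3$ is not:&lt;/p&gt;

```python
import networkx as nx

# Undirected graphs associated to C_2 and C_3, as defined above
G2 = nx.Graph([(1, 2), (1, 4), (2, 3)])          # a tree: no cycle at all
G3 = nx.Graph([(1, 2), (1, 4), (2, 3), (3, 4)])  # a 4-cycle without any chord

chordal_2 = nx.is_chordal(G2)  # True
chordal_3 = nx.is_chordal(G3)  # False
```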

&lt;p&gt;For the interested reader, a couple of other theoretical results can be found in Fiedler&lt;sup id=&quot;fnref:9:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:9&quot; class=&quot;footnote&quot;&gt;7&lt;/a&gt;&lt;/sup&gt;, which also notes that &lt;em&gt;the general [completion] problem […] seems to be difficult&lt;/em&gt;&lt;sup id=&quot;fnref:9:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:9&quot; class=&quot;footnote&quot;&gt;7&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;h3 id=&quot;unicity-of-solutions&quot;&gt;Unicity of solutions&lt;/h3&gt;

&lt;p&gt;Again contrary to the nearest correlation matrix problem, the correlation matrix completion problem does not generally admit a unique solution when one exists.&lt;/p&gt;

&lt;p&gt;Indeed, Grone et al.&lt;sup id=&quot;fnref:8:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt; establish that the set of all positive semi-definite completions of a partial correlation matrix is in general a convex compact set and not a singleton.&lt;/p&gt;

&lt;p&gt;This result can be illustrated with one and two missing correlations:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;One missing correlation&lt;/p&gt;

    &lt;p&gt;The partial correlation matrix $C_4 = \begin{pmatrix}  1 &amp;amp; . \\ . &amp;amp; 1 \end{pmatrix} $ has one missing correlation.&lt;/p&gt;

    &lt;p&gt;By introducing the variable $x$ representing that missing correlation, a candidate completion is valid if and only if its determinant is positive or null, that is, $\det(x) = 1 - x^2 \geq 0$.&lt;/p&gt;

    &lt;p&gt;That condition being equivalent to the condition $x \in [-1,1]$, all correlation matrices of the form $ C^*_4(x) = \begin{pmatrix} 1 &amp;amp; x \\ x &amp;amp; 1 \end{pmatrix} $ with $x \in [-1,1]$ are completions of $C_4$.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Two missing correlations&lt;/p&gt;

    &lt;p&gt;The partial correlation matrix $C_5 \in \mathcal{M} \left( \mathbb{R}^{5 \times 5} \right)$ represented in Figure 3 has two missing correlations.&lt;/p&gt;

    &lt;figure&gt;
    &lt;a href=&quot;/assets/images/blog/correlation-matrix-completion-example-matrix-georgescu.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/correlation-matrix-completion-example-matrix-georgescu-small.png&quot; alt=&quot;Figure 3. Partial correlation matrix with 2 missing correlations. Source: Georgescu et al.&quot; /&gt;&lt;/a&gt;
    &lt;figcaption&gt;Figure 3. Partial correlation matrix with 2 missing correlations. Source: Georgescu et al.&lt;/figcaption&gt;
  &lt;/figure&gt;

    &lt;p&gt;Because all the fully specified principal submatrices of $C_5$ are positive semi-definite, a candidate completion is again valid if and only if its determinant is positive or null.&lt;/p&gt;

    &lt;p&gt;By introducing the variables $x$ and $y$ representing the two missing correlations, the determinant of $C_5$ can be factored as $\det(x,y) \approx -0.18 \left( 0.67x^2 + 0.75y^2 - xy - 0.32x - 0.24y + 0.17 \right)$.&lt;/p&gt;

    &lt;p&gt;Non-negativity of that determinant is then equivalent to the condition&lt;/p&gt;

\[0.67x^2 + 0.75y^2 - xy - 0.32x - 0.24y + 0.17 \leq 0\]

    &lt;p&gt;, which defines the 2-dimensional ellipse (and its interior) depicted in Figure 4.&lt;/p&gt;

    &lt;figure&gt;
    &lt;a href=&quot;/assets/images/blog/correlation-matrix-completion-example-matrix-georgescu-feasible-region.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/correlation-matrix-completion-example-matrix-georgescu-feasible-region-small.png&quot; alt=&quot;Figure 4. Feasible region for Georgescu et al.'s partial correlation matrix completions.&quot; /&gt;&lt;/a&gt;
    &lt;figcaption&gt;Figure 4. Feasible region for Georgescu et al.'s partial correlation matrix completions.&lt;/figcaption&gt;
  &lt;/figure&gt;

    &lt;p&gt;All correlation matrices $C_5^{*}(x,y) \in \mathcal{M} \left( \mathbb{R}^{5 \times 5} \right)$ with the same specified correlations as $C_5$, a correlation $c^{*}_{34} = x$ and a correlation $c^{*}_{35} = y$ - with $x,y$ satisfying the above relationship - are thus completions of $C_5$.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;
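&lt;p&gt;Using the approximate factorization above, the feasible region of Figure 4 can be reproduced with a simple numpy grid scan (a sketch relying on the approximate coefficients, not the method used to generate the figure). Incidentally, the origin $x = y = 0$ lies outside the ellipse, so that setting both missing correlations to zero would not produce a valid completion of $C_5$:&lt;/p&gt;

```python
import numpy as np

def q(x, y):
    # Quadratic form from the (approximate) factorization of det(x, y)
    return 0.67 * x**2 + 0.75 * y**2 - x * y - 0.32 * x - 0.24 * y + 0.17

# Scan the square [-1, 1]^2: the points with q(x, y) at most 0 form the feasible ellipse
grid = np.linspace(-1.0, 1.0, 201)
X, Y = np.meshgrid(grid, grid)
feasible = np.less_equal(q(X, Y), 0.0)

nonempty = bool(feasible.any())                          # True: completions of C_5 exist
origin_feasible = bool(np.less_equal(q(0.0, 0.0), 0.0))  # False: q(0, 0) = 0.17
```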

&lt;p&gt;The existence of infinitely many completions&lt;sup id=&quot;fnref:20&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:20&quot; class=&quot;footnote&quot;&gt;10&lt;/a&gt;&lt;/sup&gt; naturally leads to trying to find &lt;em&gt;a best-estimate completion in some sense&lt;/em&gt;&lt;sup id=&quot;fnref:2:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;, which does exist and is called the &lt;em&gt;maximum determinant completion&lt;/em&gt;&lt;sup id=&quot;fnref:2:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;h2 id=&quot;the-maximum-determinant-correlation-matrix-completion-problem&quot;&gt;The maximum determinant correlation matrix completion problem&lt;/h2&gt;

&lt;p&gt;Fiedler&lt;sup id=&quot;fnref:9:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:9&quot; class=&quot;footnote&quot;&gt;7&lt;/a&gt;&lt;/sup&gt; originally demonstrated that if a positive definite partial correlation matrix admits a completion, then, &lt;em&gt;there is a unique matrix in the (nonempty) class of all positive definite completions […] 
that has maximum determinant&lt;/em&gt;&lt;sup id=&quot;fnref:3:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;More recently, by introducing &lt;em&gt;a generalized determinant that gives the determinant of the nonsingular part of [a] matrix&lt;/em&gt;&lt;sup id=&quot;fnref:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;11&lt;/a&gt;&lt;/sup&gt;, Dreyer&lt;sup id=&quot;fnref:5:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;11&lt;/a&gt;&lt;/sup&gt; established a similar result for positive semi-definite partial correlation matrices.&lt;/p&gt;

&lt;p&gt;That completion, called &lt;em&gt;the maximum determinant completion&lt;/em&gt; - or &lt;em&gt;the Max-Det completion&lt;/em&gt; - has several interesting theoretical properties, which make it an ideal candidate for guaranteeing the unicity of the completion of a correlation matrix.&lt;/p&gt;

&lt;h3 id=&quot;problem-formulation-1&quot;&gt;Problem formulation&lt;/h3&gt;

&lt;p&gt;Consider:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$C \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ a partial correlation matrix.&lt;/li&gt;
  &lt;li&gt;$\mathcal{N}$ the index set of the specified off-diagonal elements of $C$.&lt;/li&gt;
  &lt;li&gt;$\mathcal{U}^n = \{ X \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ such that $x_{ij}=c_{ij}$ for $(i,j) \in \mathcal{N} \}$&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;em&gt;The maximum determinant correlation matrix completion problem&lt;/em&gt; can be cast as a generalization of &lt;a href=&quot;https://en.wikipedia.org/wiki/Semidefinite_programming&quot;&gt;a semidefinite programming problem&lt;/a&gt;:&lt;/p&gt;

\[C^* = \operatorname{argmax}_{X} \log ( \det (X) ) \text{ s.t. } X \in \mathcal{S}^n_{+} \cap \mathcal{E}^n \cap \mathcal{U}^n\]

&lt;p&gt;Assuming that a solution exists, c.f. the previous section, Grone et al.&lt;sup id=&quot;fnref:8:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt; and Olvera Astivia&lt;sup id=&quot;fnref:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt; show using standard results from &lt;a href=&quot;https://en.wikipedia.org/wiki/Mathematical_optimization&quot;&gt;mathematical optimization theory&lt;/a&gt; 
that it is necessarily unique.&lt;/p&gt;
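&lt;p&gt;On a toy instance, the solution can even be found by brute force. For a $3 \times 3$ partial correlation matrix with specified correlations $c_{12}$ and $c_{13}$ and a single missing correlation $x$, the determinant equals $1 - c_{12}^2 - c_{13}^2 - x^2 + 2 c_{12} c_{13} x$, which is maximized at $x = c_{12} c_{13}$, i.e. at a zero partial correlation between the two assets given the first one. The numpy sketch below, with illustrative values of my choosing, confirms this numerically:&lt;/p&gt;

```python
import numpy as np

c12, c13 = 0.6, -0.3  # specified correlations, illustrative values

def det_completion(x):
    C = np.array([[1.0, c12, c13],
                  [c12, 1.0,   x],
                  [c13,   x, 1.0]])
    return np.linalg.det(C)

# Brute-force scan of the single free coefficient over [-1, 1]
xs = np.linspace(-1.0, 1.0, 20001)
dets = np.array([det_completion(x) for x in xs])
x_star = xs[dets.argmax()]  # close to c12 * c13 = -0.18, the max-det completion
```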

&lt;h3 id=&quot;mathematical-properties-of-the-solution&quot;&gt;Mathematical properties of the solution&lt;/h3&gt;

&lt;p&gt;Georgescu et al.&lt;sup id=&quot;fnref:2:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt; summarize the main properties of the maximum determinant correlation matrix completion:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;For the multivariate normal model, it maximizes the entropy of the distribution described by the matrix&lt;/p&gt;

    &lt;p&gt;van der Schans and Boer&lt;sup id=&quot;fnref:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt; comment on this property as follows:&lt;/p&gt;

    &lt;blockquote&gt;
      &lt;p&gt;Since the already specified [correlations] imply a dependence between variables between which no dependence is specified and dependence reduces the amount of uncertainty in a system, the intuitive interpretation of entropy maximization is not introducing more dependence than is already implied by the already specified [correlations].&lt;/p&gt;
    &lt;/blockquote&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Again for the multivariate normal model, it maximizes the likelihood of the correlation matrix&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;It corresponds to &lt;em&gt;the analytic centre of the feasible region described by the positive semi-definiteness constraints&lt;/em&gt;&lt;sup id=&quot;fnref:2:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;&lt;/p&gt;

    &lt;p&gt;In other words, for a given partial correlation matrix, its maximum determinant completion lies as “deep” as possible&lt;sup id=&quot;fnref:15&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:15&quot; class=&quot;footnote&quot;&gt;14&lt;/a&gt;&lt;/sup&gt; inside the set of all its positive definite completions.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;computation-of-the-solution&quot;&gt;Computation of the solution&lt;/h3&gt;

&lt;p&gt;The original algorithmic approach to solving determinant maximization problems with matrix constraints has been to use interior point methods&lt;sup id=&quot;fnref:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:11&quot; class=&quot;footnote&quot;&gt;15&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;More recently, dual projected gradient methods have been developed that are better able to scale with the dimensionality of the problem&lt;sup id=&quot;fnref:13&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:13&quot; class=&quot;footnote&quot;&gt;16&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Nevertheless, &lt;em&gt;the drawbacks of these algorithms, from an application point of view, are that they are difficult to implement […], do not always converge if inconsistent starting [correlations] are specified and, finally, global optimization is too slow 
and memory consuming for large matrices&lt;/em&gt;&lt;sup id=&quot;fnref:12:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;More on “inconsistent starting correlations” later.&lt;/p&gt;

&lt;p&gt;For these reasons, it is &lt;em&gt;tempting to [simply] set the missing [correlations] to zero&lt;/em&gt;&lt;sup id=&quot;fnref:6:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;, but &lt;em&gt;this approach has several shortcomings&lt;/em&gt;&lt;sup id=&quot;fnref:6:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;, among which the fact that forcing unspecified correlations to zero&lt;sup id=&quot;fnref:14&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:14&quot; class=&quot;footnote&quot;&gt;17&lt;/a&gt;&lt;/sup&gt; might not be ideal for financial applications 
where most assets are correlated to each other.&lt;/p&gt;

&lt;p&gt;To illustrate this point, Figure 5 represents a partial correlation matrix with a block of missing correlations.&lt;/p&gt;

&lt;figure&gt;
  &lt;a href=&quot;/assets/images/blog/correlation-matrix-completion-example-matrix-georgescu-2.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/correlation-matrix-completion-example-matrix-georgescu-2-small.png&quot; alt=&quot;Figure 5. Partial correlation matrix with a block of missing correlations. Source: Georgescu et al.&quot; /&gt;&lt;/a&gt;
  &lt;figcaption&gt;Figure 5. Partial correlation matrix with a block of missing correlations. Source: Georgescu et al.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The maximum determinant completion of the block of missing correlations is&lt;sup id=&quot;fnref:2:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;&lt;/p&gt;

\[E_1 = 
\begin{pmatrix}
0.1000 &amp;amp; 0.1500 &amp;amp; 0.0500 &amp;amp; 0.0750 \\
0.2400 &amp;amp; 0.3600 &amp;amp; 0.1200 &amp;amp; 0.1800 \\
0.2200 &amp;amp; 0.3300 &amp;amp; 0.1100 &amp;amp; 0.1650 \\
0.2600 &amp;amp; 0.3900 &amp;amp; 0.1300 &amp;amp; 0.1950 \\
0 &amp;amp; 0 &amp;amp; 0 &amp;amp; 0
\end{pmatrix}\]

&lt;p&gt;, while the completion of the same block of missing correlations obtained after setting them to zero and computing the nearest correlation matrix to the resulting full approximate correlation matrix is&lt;sup id=&quot;fnref:2:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;&lt;/p&gt;

\[E_2 = 
\begin{pmatrix}
0.0022 &amp;amp; 0.0084 &amp;amp; 0.0004 &amp;amp; 0.0035 \\
0.0003 &amp;amp; 0.0011 &amp;amp; 0.0001 &amp;amp; 0.0005 \\
0.0025 &amp;amp; 0.0098 &amp;amp; 0.0005 &amp;amp; 0.0040 \\
0.0042 &amp;amp; 0.0164 &amp;amp; 0.0008 &amp;amp; 0.0067 \\
0 &amp;amp; 0 &amp;amp; 0 &amp;amp; 0
\end{pmatrix}\]

&lt;p&gt;Comparing the two sub-matrices $E_1$ and $E_2$, it is clear that the completed correlations of $E_2$ are much closer to zero than those of $E_1$, thus imposing an additional soft constraint 
of “closeness to zero” to the correlation matrix completion problem.&lt;/p&gt;

&lt;p&gt;Resorting to simple heuristics for the correlation matrix completion problem can thus be dangerous, depending on the context.&lt;/p&gt;

&lt;p&gt;Fortunately, van der Schans and Boer&lt;sup id=&quot;fnref:12:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt; propose a heuristic that does not suffer from these shortcomings and that:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Is fast, &lt;em&gt;which makes it suitable for applications in which the computation time is important&lt;/em&gt;&lt;sup id=&quot;fnref:12:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
  &lt;li&gt;&lt;em&gt;Tries to not introduce more dependence between variables than is implied by the initially specified [correlations]&lt;/em&gt;&lt;sup id=&quot;fnref:12:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;, contrary to the “setting missing correlations to zero” heuristic&lt;/li&gt;
  &lt;li&gt;Additionally &lt;em&gt;corrects for inconsistencies in the already specified [correlations]&lt;/em&gt;&lt;sup id=&quot;fnref:12:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;, but again, more on this later&lt;/li&gt;
  &lt;li&gt;Empirically produces completions with an &lt;em&gt;average correlation difference with the maxdet completion [that is] reasonable&lt;/em&gt;&lt;sup id=&quot;fnref:12:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;A drawback of this heuristic, though, &lt;em&gt;is that it depends on the ordering of the rows and columns of the [correlation] matrix, i.e. the heuristic will yield a different result 
if first rows and columns of [the matrix] are interchanged before starting the procedure&lt;/em&gt;&lt;sup id=&quot;fnref:12:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;, which might or might not be acceptable in practice.&lt;/p&gt;

&lt;p&gt;As a side note, explicit solutions of the maximum determinant correlation matrix completion problem are known in specific cases, for example for &lt;em&gt;correlation matrices that follow an L-shaped, 
block diagonal pattern which is a common structure in insurance problems&lt;/em&gt;&lt;sup id=&quot;fnref:4:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt;, c.f. Georgescu et al.&lt;sup id=&quot;fnref:2:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;&lt;/p&gt;

&lt;h2 id=&quot;the-infeasible-maximum-determinant-correlation-matrix-completion-problem&quot;&gt;The infeasible maximum determinant correlation matrix completion problem&lt;/h2&gt;

&lt;p&gt;In the previous section, a solution to the maximum determinant correlation matrix completion problem was assumed to exist.&lt;/p&gt;

&lt;p&gt;There are at least two practical problems with this assumption:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;A partial correlation matrix might theoretically admit a completion, but numerical round-off errors might prevent an algorithm from finding it&lt;/p&gt;

    &lt;p&gt;An example of such an ill-behaved matrix can be found in Glunt et al.&lt;sup id=&quot;fnref:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;18&lt;/a&gt;&lt;/sup&gt;, but it is a partial covariance matrix and not a partial correlation matrix…&lt;/p&gt;

    &lt;p&gt;Building a similar example using a partial correlation matrix is certainly doable, but I was not able to do so quickly, so no example is provided here.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;More typically, &lt;em&gt;the initially specified [correlations] can be inconsistent in the sense that no valid completion exists&lt;/em&gt;&lt;sup id=&quot;fnref:12:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;&lt;/p&gt;

    &lt;p&gt;Here, the partial correlation matrix $C_1$ perfectly illustrates this point, even though it is a rather extreme example.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;In both cases, simply failing to compute a solution might be unacceptable (e.g. for downstream pipelines), so that something must be done.&lt;/p&gt;

&lt;p&gt;Unfortunately, &lt;em&gt;the currently existing literature does not focus on algorithms that both complete partially specified matrices and also correct for inconsistencies&lt;/em&gt;&lt;sup id=&quot;fnref:12:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In other words, there is no well-established formulation of what could be called &lt;em&gt;the infeasible maximum determinant correlation matrix completion problem&lt;/em&gt;.&lt;/p&gt;

&lt;p&gt;At a minimum, any solution to this new problem must involve a trade-off between&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;The closeness to the initial partial correlation matrix, for example in terms of the Frobenius distance&lt;/li&gt;
  &lt;li&gt;The value of the determinant&lt;/li&gt;
&lt;/ul&gt;
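&lt;p&gt;As a toy illustration of this trade-off - not the algorithm of any of the references discussed in this post - the missing correlation of $C_1$ can be filled with -1, like the specified ones, and the resulting (invalid) matrix shrunk toward the identity matrix; the more it is shrunk, the larger its determinant becomes, but the further it moves away from the initially specified correlations:&lt;/p&gt;

```python
import numpy as np

# Partial correlation matrix C_1, with its single missing correlation
# (between assets 3 and 4) filled with -1 like all the specified ones.
C1_filled = np.full((4, 4), -1.0)
np.fill_diagonal(C1_filled, 1.0)

# Boolean mask of the initially specified off-diagonal entries of C_1.
specified = np.ones((4, 4), dtype=bool)
np.fill_diagonal(specified, False)
specified[2, 3] = specified[3, 2] = False

# Shrinkage toward the identity matrix: C(t) = (1 - t) * C1_filled + t * I,
# which is positive semi-definite for t large enough (here, t at least 2/3).
results = []
for t in (0.7, 0.8, 0.9):
    C_t = (1.0 - t) * C1_filled + t * np.eye(4)
    distance = np.sqrt(np.sum((C_t - C1_filled)[specified] ** 2))
    determinant = np.linalg.det(C_t)
    results.append((distance, determinant))
    print(f"t={t}: distance={distance:.3f}, determinant={determinant:.3f}")
```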

&lt;p&gt;Incidentally, this is exactly how the heuristic algorithm of van der Schans and Boer&lt;sup id=&quot;fnref:12:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt; works: &lt;em&gt;the specified [correlations] are adjusted as little as possible&lt;/em&gt;&lt;sup id=&quot;fnref:12:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt; and 
&lt;em&gt;the introduced extra (conditional) dependence between variables is as little as possible&lt;/em&gt;&lt;sup id=&quot;fnref:12:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;When applied to the partial correlation matrix $C_1$, van der Schans and Boer’s algorithm&lt;sup id=&quot;fnref:12:13&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt; gives&lt;/p&gt;

\[C^*_1 = 
\begin{pmatrix}
    1 &amp;amp; -0.99 &amp;amp; -0.07 &amp;amp; -0.07 \\
    -0.99 &amp;amp; 1 &amp;amp; -0.07 &amp;amp; -0.07 \\
    -0.07 &amp;amp; -0.07 &amp;amp; 1 &amp;amp; 0.98 \\
    -0.07 &amp;amp; -0.07 &amp;amp; 0.98 &amp;amp; 1 
\end{pmatrix}\]

&lt;p&gt;As expected, the completed correlation matrix $C^*_1$ is very far from the partial correlation matrix $C_1$, with only one correlation close to its initial value of -1 (-0.99).&lt;/p&gt;

&lt;p&gt;Again, this is an extreme infeasible example, but it empirically demonstrates that adjusting the initially specified correlations, even &lt;em&gt;as little as possible&lt;/em&gt;&lt;sup id=&quot;fnref:12:14&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;, can lead to a completed correlation matrix that bears little resemblance to the initial partial correlation matrix.&lt;/p&gt;
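&lt;p&gt;This can be checked numerically: the completed matrix $C^*_1$, as displayed above, is indeed a valid correlation matrix, and its Frobenius distance to $C_1$, computed over the initially specified off-diagonal entries, is $\approx 2.63$. A quick verification with NumPy:&lt;/p&gt;

```python
import numpy as np

# Completed correlation matrix C*_1 produced by van der Schans and Boer's
# heuristic, as displayed above (rounded to two decimals).
C1_star = np.array([
    [ 1.00, -0.99, -0.07, -0.07],
    [-0.99,  1.00, -0.07, -0.07],
    [-0.07, -0.07,  1.00,  0.98],
    [-0.07, -0.07,  0.98,  1.00],
])

# C*_1 is a valid correlation matrix: symmetric, unit diagonal and
# positive semi-definite (its smallest eigenvalue is about 1e-4).
assert np.allclose(C1_star, C1_star.T)
assert np.allclose(np.diag(C1_star), 1.0)
assert np.linalg.eigvalsh(C1_star).min() > -1e-10

# Frobenius distance to C_1 over the initially specified off-diagonal
# entries, all equal to -1; the (3,4)/(4,3) entry of C_1 was unspecified.
specified = np.ones((4, 4), dtype=bool)
np.fill_diagonal(specified, False)
specified[2, 3] = specified[3, 2] = False
distance = np.sqrt(np.sum((C1_star[specified] + 1.0) ** 2))
print(round(distance, 3))  # -> 2.63
```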

&lt;h2 id=&quot;implementation-in-portfolio-optimizer&quot;&gt;Implementation in Portfolio Optimizer&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Portfolio Optimizer&lt;/strong&gt; implements two methods to complete a partial correlation matrix through the endpoint &lt;a href=&quot;https://docs.portfoliooptimizer.io/&quot;&gt;&lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;/assets/correlation/matrix/completed&lt;/code&gt;&lt;/a&gt;:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;One proprietary method, which is guaranteed to find the maximum determinant completion when one exists, and to otherwise minimally adjust - in terms of Frobenius distance - the partially specified correlation matrix so that a maximum determinant completion exists&lt;/li&gt;
  &lt;li&gt;The heuristic method of van der Schans and Boer&lt;sup id=&quot;fnref:12:15&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;, with additional tweaks to improve its numerical robustness&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;For comparison with van der Schans and Boer’s algorithm&lt;sup id=&quot;fnref:12:16&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;, the proprietary method of &lt;strong&gt;Portfolio Optimizer&lt;/strong&gt; applied to the partial correlation matrix $C_1$ gives:&lt;/p&gt;

\[C^{**}_1 = 
\begin{pmatrix}
    1 &amp;amp; -0.30 &amp;amp; -0.59 &amp;amp; -0.59 \\
    -0.30 &amp;amp; 1 &amp;amp; -0.59 &amp;amp; -0.59 \\
    -0.59 &amp;amp; -0.59 &amp;amp; 1 &amp;amp; 1 \\
    -0.59 &amp;amp; -0.59 &amp;amp; 1 &amp;amp; 1 
\end{pmatrix}\]

&lt;p&gt;Comparing the two completed matrices $C_1^{*}$ and $C_1^{**}$, it appears that the matrix $C_1^{**}$ is much&lt;sup id=&quot;fnref:16&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:16&quot; class=&quot;footnote&quot;&gt;19&lt;/a&gt;&lt;/sup&gt; closer to the partial correlation matrix $C_1$ than the matrix $C_1^{*}$, which is consistent with the expected behaviour of &lt;strong&gt;Portfolio Optimizer&lt;/strong&gt;.&lt;/p&gt;
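&lt;p&gt;This can be confirmed numerically from the two completed matrices as displayed above (because the displayed correlations are rounded to two decimals, the resulting distances differ slightly from the exact footnoted values, but their ordering is the same):&lt;/p&gt;

```python
import numpy as np

def distance_to_c1(completed):
    # Frobenius distance to the partial correlation matrix C_1, restricted
    # to its initially specified off-diagonal entries, all equal to -1;
    # the (3,4)/(4,3) entry of C_1 was unspecified.
    specified = np.ones((4, 4), dtype=bool)
    np.fill_diagonal(specified, False)
    specified[2, 3] = specified[3, 2] = False
    return np.sqrt(np.sum((completed[specified] + 1.0) ** 2))

# van der Schans and Boer's completion C*_1, rounded to two decimals
C1_star = np.array([[ 1.00, -0.99, -0.07, -0.07],
                    [-0.99,  1.00, -0.07, -0.07],
                    [-0.07, -0.07,  1.00,  0.98],
                    [-0.07, -0.07,  0.98,  1.00]])

# Portfolio Optimizer's completion C**_1, rounded to two decimals
C1_dstar = np.array([[ 1.00, -0.30, -0.59, -0.59],
                     [-0.30,  1.00, -0.59, -0.59],
                     [-0.59, -0.59,  1.00,  1.00],
                     [-0.59, -0.59,  1.00,  1.00]])

d_star, d_dstar = distance_to_c1(C1_star), distance_to_c1(C1_dstar)
print(f"distance of C*_1: {d_star:.3f}, distance of C**_1: {d_dstar:.3f}")
assert d_star > d_dstar  # C**_1 is indeed closer to C_1 than C*_1
```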

&lt;h2 id=&quot;example-of-usage---completing-partial-correlation-matrices-from-financial-institutions&quot;&gt;Example of usage - Completing partial correlation matrices from financial institutions&lt;/h2&gt;

&lt;p&gt;Major financial institutions regularly provide forecasts of future risk/return characteristics for broad asset classes over the next 5 to 20 years, called &lt;em&gt;(Long Term) Capital Market Assumptions (LTCMA)&lt;/em&gt;.&lt;/p&gt;

&lt;p&gt;In addition to the future expected volatility, the risk forecasts sometimes also include future expected correlation matrices, which is for example the case with &lt;a href=&quot;https://am.jpmorgan.com/lu/en/asset-management/institutional/insights/portfolio-insights/ltcma/interactive-assumptions-matrices/&quot;&gt;J.P. Morgan&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Unfortunately, these correlation matrices are typically partially specified.&lt;/p&gt;

&lt;p&gt;In addition, even if they were fully specified, &lt;a href=&quot;/blog/capital-market-assumptions-combining-institutions-forecasts-for-improved-accuracy/&quot;&gt;combining capital market assumptions from several financial institutions&lt;/a&gt; would render them partial, because institutions do not all cover the same asset classes…&lt;/p&gt;

&lt;p&gt;So, as an example of usage, I propose to compute the maximum determinant completion of the partial correlation matrix provided by &lt;a href=&quot;https://www.blackrock.com/institutions/en-global/institutional-insights/thought-leadership/capital-market-assumptions&quot;&gt;Blackrock&lt;/a&gt; 
as part of their November 2025 5-year capital market assumptions and represented in Figure 6.&lt;/p&gt;

&lt;figure&gt;
  &lt;a href=&quot;/assets/images/blog/correlation-matrix-completion-blackrock.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/correlation-matrix-completion-blackrock-small.png&quot; alt=&quot;Figure 6. Five-year expected future partial correlations between major asset classes, NZD currency, 13th November 2025. Source: Blackrock.&quot; /&gt;&lt;/a&gt;
  &lt;figcaption&gt;Figure 6. Five-year expected future partial correlations between major asset classes, NZD currency, 13th November 2025. Source: Blackrock.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The resulting maximum determinant completed correlation matrix is displayed in Figure 7.&lt;/p&gt;

&lt;figure&gt;
  &lt;a href=&quot;/assets/images/blog/correlation-matrix-completion-blackrock-completed.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/correlation-matrix-completion-blackrock-completed-small.png&quot; alt=&quot;Figure 7. Five-year expected future Max-Det completed correlations between major asset classes, NZD currency, 13th November 2025.&quot; /&gt;&lt;/a&gt;
  &lt;figcaption&gt;Figure 7. Five-year expected future Max-Det completed correlations between major asset classes, NZD currency, 13th November 2025.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;I would like to mention that such an example of usage was inspired by &lt;a href=&quot;https://www.linkedin.com/posts/peterurbani_similarly-to-jp-morgan-httpsbitly3xbfmhb-activity-7397026488002224130-hRBZ?utm_source=share&amp;amp;utm_medium=member_desktop&amp;amp;rcm=ACoAADg4MJUBESNabpuXkl-TQnxmTK_DzOcFu0g&quot;&gt;a LinkedIn post from Peter Urbani&lt;/a&gt;, 
who proposes to complete Blackrock’s partial correlation matrix into the correlation matrix displayed in Figure 8 thanks to a 2-factor model built from the fully specified global equities/global bonds correlations.&lt;/p&gt;

&lt;figure&gt;
  &lt;a href=&quot;/assets/images/blog/correlation-matrix-completion-blackrock-completed-urbani.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/correlation-matrix-completion-blackrock-completed-urbani-small.png&quot; alt=&quot;Figure 8. Five-year expected future 2-factor model completed correlations between major asset classes, NZD currency, 13th November 2025. Source: Peter Urbani.&quot; /&gt;&lt;/a&gt;
  &lt;figcaption&gt;Figure 8. Five-year expected future 2-factor model completed correlations between major asset classes, NZD currency, 13th November 2025. Source: Peter Urbani.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Comparing the “Max-Det” completed correlation matrix with Peter Urbani’s “2-factor model” completed correlation matrix, it turns out that both matrices are perfectly identical&lt;sup id=&quot;fnref:17&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:17&quot; class=&quot;footnote&quot;&gt;20&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;As a closing (fun) remark, Olvera Astivia&lt;sup id=&quot;fnref:4:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt; highlights that &lt;em&gt;having half of the entries in a correlation matrix missing [can be] considered a rather extreme condition&lt;/em&gt;&lt;sup id=&quot;fnref:4:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;What could then be said about Blackrock’s initial partial correlation matrix, in which around 82% of the entries are missing?!&lt;/p&gt;

&lt;h2 id=&quot;conclusion&quot;&gt;Conclusion&lt;/h2&gt;

&lt;p&gt;With this first blog post of 2026, my hope is that you have added to your quantitative toolbox a useful methodology &lt;em&gt;to obtain, in a mathematically principled way, values implied in correlational structure of the data, even if said data has not (or cannot) be obtained&lt;/em&gt;&lt;sup id=&quot;fnref:4:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In any case, for more mathematics of correlation matrices, feel free to &lt;a href=&quot;https://www.linkedin.com/in/roman-rubsamen/&quot;&gt;connect with me on LinkedIn&lt;/a&gt; or to &lt;a href=&quot;https://twitter.com/portfoliooptim&quot;&gt;follow me on Twitter&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;–&lt;/p&gt;

&lt;div class=&quot;footnotes&quot; role=&quot;doc-endnotes&quot;&gt;
  &lt;ol&gt;
    &lt;li id=&quot;fn:1&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;&lt;a href=&quot;https://academic.oup.com/imajna/article-abstract/22/3/329/708688&quot;&gt;Nicholas J. Higham, Computing the Nearest Correlation Matrix—A Problem from Finance, IMA J. Numer. Anal. 22, 329–343, 2002.&lt;/a&gt;. &lt;a href=&quot;#fnref:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:18&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;In terms of &lt;a href=&quot;https://en.wikipedia.org/wiki/Matrix_norm#Frobenius_norm&quot;&gt;the Frobenius distance&lt;/a&gt;. &lt;a href=&quot;#fnref:18&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:6&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;&lt;a href=&quot;https://arxiv.org/abs/2111.12640&quot;&gt;Olaf Dreyer, Horst Kohler, Thomas Streuer, Completing correlation matrices, arXiv&lt;/a&gt;. &lt;a href=&quot;#fnref:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:6:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:6:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:6:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:19&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;That is, belonging to the set $\mathcal{E}^n$. &lt;a href=&quot;#fnref:19&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:8&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;&lt;a href=&quot;https://doi.org/10.1016/0024-3795(84)90207-6&quot;&gt;R. Grone, C.R. Johnson, E. Sa, H. Wolkowicz, Positive definite completions of partial Hermitian matrices, Linear Algebra Appl. 58 (1984) 109–124&lt;/a&gt;. &lt;a href=&quot;#fnref:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:8:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:8:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:8:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:8:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:2&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;&lt;a href=&quot;https://royalsocietypublishing.org/rsos/article/5/3/172348/87513/Explicit-solutions-to-correlation-matrix&quot;&gt;Georgescu DI, Higham NJ, Peters GW. 2018 Explicit solutions to correlation matrix completion problems, with an application to risk management and insurance&lt;/a&gt;. &lt;a href=&quot;#fnref:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:2:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:9&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;&lt;a href=&quot;https://doi.org/10.1007/BF02166030&quot;&gt;Fiedler, M. Matrix Inequalities. Numer. Math. 9, 109–119 (1966)&lt;/a&gt;. &lt;a href=&quot;#fnref:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:9:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:9:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:9:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:3&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;&lt;a href=&quot;https://doi.org/10.1016/j.laa.2008.04.020&quot;&gt;Ronald L. Smith, The positive definite completion problem revisited, Linear Algebra and its Applications, Volume 429, Issue 7, 2008, Pages 1442-1452&lt;/a&gt;. &lt;a href=&quot;#fnref:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:3:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:10&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;One interesting feature of that result is that the existence of a correlation matrix completion - which seems quite algebraic in nature - is actually equivalent to a “visual” condition. &lt;a href=&quot;#fnref:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:20&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;When one exists. &lt;a href=&quot;#fnref:20&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:5&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;&lt;a href=&quot;https://arxiv.org/abs/2112.03758&quot;&gt;Olaf Dreyer, Matrix completion and semidefinite matrices, arXiv&lt;/a&gt;. &lt;a href=&quot;#fnref:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:5:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:4&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;&lt;a href=&quot;https://doi.org/10.1080/15366367.2020.1827883&quot;&gt;Oscar L. Olvera Astivia (2021) A Note on the General Solution to Completing Partially Specified Correlation Matrices, Measurement: Interdisciplinary Research and Perspectives, 19:2, 115-123&lt;/a&gt;. &lt;a href=&quot;#fnref:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:4:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:4:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:4:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:4:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:12&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3748416&quot;&gt;van der Schans, Martin and Boer, Alex, A Heuristic for Completing Covariance And Correlation Matrices (March 14, 2013). Technical Working Paper 2014-01 November 2014&lt;/a&gt;. &lt;a href=&quot;#fnref:12&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:12:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:12:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:12:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:12:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:12:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:12:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:12:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:12:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:12:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;10&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:12:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;11&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:12:11&quot; class=&quot;reversefootnote&quot; 
role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;12&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:12:12&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;13&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:12:13&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;14&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:12:14&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;15&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:12:15&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;16&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:12:16&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;17&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:15&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The maximum determinant correlation matrix completion &lt;em&gt;maximizes the product of distances to the defining hyperplanes&lt;/em&gt;&lt;sup id=&quot;fnref:2:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:15&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:11&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://epubs.siam.org/doi/10.1137/S0895479896303430&quot;&gt;Vandenberghe, Lieven and Boyd, Stephen and Wu, Shao-Po, Determinant Maximization with Linear Matrix Inequality Constraints, SIAM Journal on Matrix Analysis and Applications, Volume 19, Number 2, Pages 499-533&lt;/a&gt;. &lt;a href=&quot;#fnref:11&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:13&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://link.springer.com/article/10.1007/s10589-020-00166-2&quot;&gt;Nakagaki, T., Fukuda, M., Kim, S. et al. A dual spectral projected gradient method for log-determinant semidefinite problems. Comput Optim Appl 76, 33–68 (2020)&lt;/a&gt;. &lt;a href=&quot;#fnref:13&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:14&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Or close to zero in case the nearest correlation matrix to the resulting approximate correlation matrix is computed. &lt;a href=&quot;#fnref:14&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:7&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;&lt;a href=&quot;https://doi.org/10.1016/S0024-3795(98)10211-2&quot;&gt;W. Glunt, T.L. Hayden, Charles R. Johnson, P. Tarazaga, Positive definite completions and determinant maximization, Linear Algebra and its Applications, Volume 288, 1999, Pages 1-10&lt;/a&gt;. &lt;a href=&quot;#fnref:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:16&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The Frobenius distance between $C_1$ and $C^*_1$ is $\approx 2.630$, while the Frobenius distance between $C_1$ and $C^{**}_1$ is $\approx 1.575$. &lt;a href=&quot;#fnref:16&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:17&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Incidentally, this allows one to conclude - due to the entropy maximization property of the maximum determinant correlation matrix completion under a multivariate normal model - that the 2-factor model used by Peter Urbani is a vanilla multivariate normal model. &lt;a href=&quot;#fnref:17&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
  &lt;/ol&gt;
&lt;/div&gt;</content><author><name>Roman R.</name></author><category term="correlation matrix" /><summary type="html">The previous post of this series on mathematical problems related to correlation matrices introduced the nearest correlation matrix problem1, which consists in determining the closest2 valid correlation matrix to an approximate correlation matrix In this blog post, I will now describe the correlation matrix completion problem, which consist in filling in the missing coefficients of a partially specified correlation matrix in order to produce a valid correlation matrix. As noted in Dreyer et al.3, such matrices often arise in financial applications when the number of stochastic variables becomes large or when several smaller models are combined in a larger model3. After a couple of reminders on correlation matrices, I will detail the mathematical formulation of the correlation matrix completion problem, discuss its exact and approximate solutions and I will finally illustrate how this problem naturally appears when working with capital market assumptions from financial institutions. Mathematical preliminaries Let $n$ be a number of assets. Correlation matrices Let $C \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ be a square matrix. 
$C$ is a correlation matrix if and only if $C$ is symmetric: $C {}^t = C$ $C$ is unit diagonal: $C_{i,i} = 1$, $i=1..n$ $C$ is positive semi-definite, that is, $x {}^t C x \geqslant 0, \forall x \in \mathbb{R}^n $ The sets of correlation matrices Let be the following convex sets: $\mathcal{S}^n_{+} = \{ X \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ such that $X {}^t = X$ and $X$ is positive semi-definite $\}$ $\mathcal{S}^n_{++} = \{ X \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ such that $X {}^t = X$ and $X$ is positive definite $\}$ $\mathcal{E}^n = \{ X \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ such that $X {}^t = X$ and $x_{ii} = 1, i = 1,…,n \}$ Then: The set of correlation matrices is the convex compact set $\mathcal{S}^n_{+} \cap \mathcal{E}^n$. The set of invertible correlation matrices is the open bounded convex set $\mathcal{S}^n_{++} \cap \mathcal{E}^n$. As a geometrical side note, the set of vectorized correlation matrices defines a subset of the unit hypercube in $\mathbb{R}^n$ called the elliptope, c.f. for example the website Convex Optimization. Partial correlation matrices completion A partial correlation matrix $C \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ is a symmetric unit diagonal matrix4 whose coefficients $c_{ij} = c_{ji}$ are not all specified. A partial correlation matrix is said to be partial positive (semi-)definite when its specified principal minors are positive (semi-)definite. A positive semi-definite completion - or simply a completion - of a partial correlation matrix $C \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ is a correlation matrix $C^* \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ such that $c_{ij}^{*} = c_{ij}$ whenever the coefficient $c_{ij}$ is specified in $C$. 
A positive definite completion of a partial correlation matrix $C \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ is a positive definite correlation matrix $C^* \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ such that $c_{ij}^{*} = c_{ij}$ whenever the coefficient $c_{ij}$ is specified in $C$. Undirected graph associated to a correlation matrix Let $C \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ be a partial correlation matrix. It is possible5 to associate an undirected graph $G = \left( V, E \right)$ to $C$, defined as follows: $V = \{ 1, 2, …, n \}$ is the set of vertices $E$ is the set of edges, with $(i,j) \in E$ whenever the coefficient $c_{ij}$ is specified, with $i \ne j$. The graph $G$ is then said to be chordal if every cycle of length greater than or equal to 4 has a chord, which is an edge that is not part of the cycle but connects two vertices of the cycle6. As a reminder, a cycle in G is a sequence of $k \geq 3$ pairwise distinct vertices $\left( v_1, …, v_s \right)$ such that $\left( v_1, v_2 \right)$, $\left( v_2, v_3 \right)$, …, $\left( v_{k-1} v_k \right)$, $\left( v_k v_1 \right)$ $\in E$, with $k$ called the length of the cycle. The correlation matrix completion problem Problem formulation Let be: $C \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ a partial correlation matrix. $\mathcal{N}$ the index set of the specified off-diagonal elements of $C$. $\mathcal{U}^n = \{ X \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ such that $x_{ij}=c_{ij}$ for $(i,j) \in \mathcal{N} \}$ Then: The correlation matrix completion problem is the problem of finding a correlation matrix $C^* \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ that completes the matrix $C$, that is, finding $C^* \in \mathcal{S}^n_{+} \cap \mathcal{U}^n$. 
The positive definite correlation matrix completion problem is the problem of finding a positive definite correlation matrix $C^* \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ that completes the matrix $C$, that is, finding $C^* \in \mathcal{S}^n_{++} \cap \mathcal{U}^n$. Existence of solutions Contrary to the nearest correlation matrix problem, the correlation matrix completion problem does not always admit a solution. Indeed, an already necessary condition for a completion to exist is for the partial correlation matrix to be partial positive semi-definite5. So, for example, the following partial correlation matrix does not admit any completion: \[C_1 = \begin{pmatrix} 1 &amp;amp; -1 &amp;amp; -1 &amp;amp; -1 \\ -1 &amp;amp; 1 &amp;amp; -1 &amp;amp; -1 \\ -1 &amp;amp; -1 &amp;amp; 1 &amp;amp; . \\ -1 &amp;amp; -1 &amp;amp; . &amp;amp; 1 \end{pmatrix}\] Beyond this necessary condition, Fiedler7, Grone et al.5 and Smith8 all establish the following result9: A partial positive semi-definite correlation matrix is completable regardless of the values of the specified correlations if and only if the undirected graph associated to these specified correlations is chordal. To be noted that when that graph is not chordal, nothing can be said in general because the existence of a completion then depends on the exact values of the specified correlations. As an illustration: Let $C_2 \in \mathcal{M} \left( \mathbb{R}^{4 \times 4} \right)$ be the following partial positive definite correlation matrix: \[C_2 = \begin{pmatrix} 1 &amp;amp; 1 &amp;amp; . &amp;amp; 1 \\ 1 &amp;amp; 1 &amp;amp; 1 &amp;amp; . \\ . &amp;amp; 1 &amp;amp; 1 &amp;amp; . \\ 1 &amp;amp; . &amp;amp; . &amp;amp; 1 \end{pmatrix}\] The undirected graph associated to that partial correlation matrix is the graph with the set of vertives $V = \{ 1, 2, 3, 4 \}$ and the set of edges $G = \{ (1,2), (1,4), (2,3) \}$, depicted in Figure 1. Figure 1. 
Undirected graph associated to the partial correlation matrix C2. From Figure 1, that graph is chordal, because there are no cycles of length &amp;gt;= 4. Consequently, the partial correlation matrix $C_2$ admit a positive definite completion and the existence of that completion does actually even not depend on the values of the specified correlations (here, all 1s). Let now $C_3 \in \mathcal{M} \left( \mathbb{R}^{4 \times 4} \right)$ be the following partial positive semi-definite correlation matrix: \[C_3 = \begin{pmatrix} 1 &amp;amp; 1 &amp;amp; . &amp;amp; 0 \\ 1 &amp;amp; 1 &amp;amp; 1 &amp;amp; . \\ . &amp;amp; 1 &amp;amp; 1 &amp;amp; 1 \\ 0 &amp;amp; . &amp;amp; 1 &amp;amp; 1 \end{pmatrix}\] The undirected graph associated to that partial correlation matrix is the graph with the set of vertives $V = \{ 1, 2, 3, 4 \}$ and the set of edges $G = \{ (1,2), (1,4), (2,3), (3,4) \}$, depicted in Figure 2. Figure 2. Undirected graph associated to the partial correlation matrix C3. From Figure 2, that graph is not chordal, because there is only one cycle of length 4 and that cycle does not have any chord. Consequently, the partial correlation matrix $C_3$ might or might not admit a completion, nothing can be said at this stage. For the interested reader, a couple of other theoretical results can be found in Fiedler7, which also notes that the general [completion] problem […] seems to be difficult7. Unicity of solutions Again contrary to the nearest correlation matrix problem, the correlation matrix completion problem does not generally admit a unique solution when one exists. Indeed, Grone et al.5 establishes that the set of all positive semi-definite completions of a partial correlation matrix is in general a convex compact set and not a singleton. This result can be illustrated with one and two missing correlations: One missing correlation The partial correlation matrix $C_4 = \begin{pmatrix} 1 &amp;amp; . \\ . 
&amp;amp; 1 \end{pmatrix} $ has one missing correlation. By introducing the variable $x$ representing that missing correlation, a completion is a valid completion if and only if it has a positive or null determinant, that is, $\det(x) = 1 - x^2 \geq 0$. That condition being equivalent to the condition $x \in [-1,1]$, all correlation matrices of the form $ C^*_4(x) = \begin{pmatrix} 1 &amp;amp; x \\ x &amp;amp; 1 \end{pmatrix} $ with $x \in [-1,1]$ are completions of $C_4$. Two missing correlations The partial correlation matrix $C_5 \in \mathcal{M} \left( \mathbb{R}^{5 \times 5} \right)$ represented in Figure 3 has two missing correlations. Figure 3. Partial correlation matrix with 2 missing correlations. Source: Georgescu et al. Because all the specified sub-matrices of $C_5$ are positive semi-definite, a completion is again a valid completion if and only if it has a positive or null determinant. By introducing the variables $x$ and $y$ representing the two missing correlations, the determinant of $C_5$ can be factored as $ \det(x,y) \approx -0.18 \left( 0.67 x^2 + 0.75 y^2 - xy - 0.32 x - 0.24 y + 0.17 \right)$. Non-negativity of that determinant is then equivalent to the condition \[0.67 x^2 + 0.75 y^2 - xy - 0.32 x - 0.24 y + 0.17 \leq 0\] , which defines the 2-dimensional ellipse (and its interior) depicted in Figure 4. Figure 4. Feasible region for Georgescu et al.'s partial correlation matrix completions. All correlation matrices $C_5^{*}(x,y) \in \mathcal{M} \left( \mathbb{R}^{5 \times 5} \right)$ with the same correlations as $C_5$, a correlation $c^{*}_{34}$ $=$ $x$ and a correlation $c^{*}_{35}$ $=$ $y$ - with $x,y$ satisfying the above relationship - are thus completions of $C_5$. The existence of infinitely many completions10 naturally leads to trying to find a best-estimate completion in some sense6, which does exist and is called the maximum determinant completion6. 
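The one-missing-correlation case above can be checked numerically. A minimal sketch (the numerical tolerance is an assumption):

```python
import numpy as np

def is_valid_completion(x):
    # A completion C*_4(x) is a valid correlation matrix if and only if
    # it is positive semi-definite, i.e. det = 1 - x^2 is non-negative.
    C = np.array([[1.0, x], [x, 1.0]])
    return bool(np.linalg.eigvalsh(C).min() > -1e-12)

# Any x in [-1, 1] yields a valid completion; values outside do not.
print([is_valid_completion(x) for x in (-1.0, 0.3, 1.0, 1.5)])
```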
The maximum determinant correlation matrix completion problem Fiedler7 originally demonstrated that if a positive definite partial correlation matrix admits a completion, then, there is a unique matrix in the (nonempty) class of all positive definite completions […] that has maximum determinant8. More recently, by introducing a generalized determinant that gives the determinant of the nonsingular part of [a] matrix11, Dreyer11 established a similar result for positive semi-definite partial correlation matrices. That completion, called the maximum determinant completion - or the Max-Det completion - has several interesting theoretical properties, which make it an ideal candidate for guaranteeing the unicity of the completion of a correlation matrix. Problem formulation Let: $C \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ be a partial correlation matrix. $\mathcal{N}$ be the index set of the specified off-diagonal elements of $C$. $\mathcal{U}^n = \{ X \in \mathcal{M} \left( \mathbb{R}^{n \times n} \right)$ such that $x_{ij}=c_{ij}$ for $(i,j) \in \mathcal{N} \}$ The maximum determinant correlation matrix completion problem can be cast as a generalization of a semidefinite programming problem: \[C^* = \operatorname{argmax} \log ( \det (X) ) \text{ s.t. } X \in \mathcal{S}^n_{+} \cap \mathcal{E}^n \cap \mathcal{U}^n\] Assuming that a solution exists (c.f. the previous section), Grone et al.5 and Olvera Astivia12 show, using standard results from mathematical optimization theory, that it is necessarily unique. 
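For intuition only, the maximum determinant completion can be computed by brute force in a toy case. The following sketch (a hypothetical 3x3 example with one missing correlation, not the method used by Portfolio Optimizer) maximizes the log-determinant over the missing entry; for this pattern, the optimum is known to be the product of the two specified correlations, i.e. a zero partial correlation:

```python
import numpy as np
from scipy.optimize import minimize_scalar

a, b = 0.6, 0.3  # specified correlations c12 and c23; c13 is missing

def neg_log_det(x):
    # Minimize -log det(X) over the missing entry, with a large
    # penalty outside the positive definite region.
    C = np.array([[1.0, a, x], [a, 1.0, b], [x, b, 1.0]])
    if np.linalg.eigvalsh(C).min() > 1e-12:
        return -np.log(np.linalg.det(C))
    return 1e6

res = minimize_scalar(neg_log_det, bounds=(-1.0, 1.0), method="bounded")
print(round(res.x, 3))  # the max-det completion sets c13 = a * b = 0.18
```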
Mathematical properties of the solution Georgescu et al.6 summarizes the main properties of the maximum determinant correlation matrix completion: For the multivariate normal model, it maximizes the entropy of the distribution described by the matrix van der Schans and Boer13 comments on this property as follows: Since the already specified [correlations] imply a dependence between variables between which no dependence is specified and dependence reduces the amount of uncertainty in a system, the intuitive interpretation of entropy maximization is not introducing more dependence than is already implied by the already specified [correlations]. Again for the multivariate normal model, it maximizes the likelihood of the correlation matrix It corresponds to the analytic centre of the feasible region described by the positive semi-definiteness constraints6 In other words, for a given partial correlation matrix, its maximum determinant completion lies as “deep” as possible14 inside the set of all its positive definite completions. Computation of the solution The original algorithmic approach to solving determinant maximization problems with matrix constraints has been to use interior point methods15. More recently, dual projected gradient methods have been developed that are better able to scale with the dimensionality of the problem16. Nevertheless, the drawbacks of these algorithms, from an application point of view, are that they are difficult to implement […], do not always converge if inconsistent starting [correlations] are specified and, finally, global optimization is too slow and memory consuming for large matrices13. More on “inconsistent starting correlations” later. For these reasons, it is tempting to [simply] set the missing [correlations] to zero3, but this approach has several shortcomings3, among which the fact that forcing unspecified correlations to zero17 might not be ideal for financial applications where most assets are correlated to each other. 
To illustrate this point, Figure 5 represents a partial correlation matrix with a block of missing correlations. Figure 5. Partial correlation matrix with a block of missing correlations. Source: Georgescu et al. The maximum determinant completion of the block of missing correlations is6 \[E_1 = \begin{pmatrix} 0.1000 &amp;amp; 0.1500 &amp;amp; 0.0500 &amp;amp; 0.0750 \\ 0.2400 &amp;amp; 0.3600 &amp;amp; 0.1200 &amp;amp; 0.1800 \\ 0.2200 &amp;amp; 0.3300 &amp;amp; 0.1100 &amp;amp; 0.1650 \\ 0.2600 &amp;amp; 0.3900 &amp;amp; 0.1300 &amp;amp; 0.1950 \\ 0 &amp;amp; 0 &amp;amp; 0 &amp;amp; 0 \end{pmatrix}\] , while the completion of the same block of missing correlations obtained after setting them to zero and computing the nearest correlation matrix to the resulting full approximate correlation matrix is6 \[E_2 = \begin{pmatrix} 0.0022 &amp;amp; 0.0084 &amp;amp; 0.0004 &amp;amp; 0.0035 \\ 0.0003 &amp;amp; 0.0011 &amp;amp; 0.0001 &amp;amp; 0.0005 \\ 0.0025 &amp;amp; 0.0098 &amp;amp; 0.0005 &amp;amp; 0.0040 \\ 0.0042 &amp;amp; 0.0164 &amp;amp; 0.0008 &amp;amp; 0.0067 \\ 0 &amp;amp; 0 &amp;amp; 0 &amp;amp; 0 \end{pmatrix}\] Comparing the two sub-matrices $E_1$ and $E_2$, it is clear that the completed correlations of $E_2$ are much closer to zero than those of $E_1$, thus imposing an additional soft constraint of “closeness to zero” to the correlation matrix completion problem. Reverting to simple heuristics for the correlation matrix completion problem might thus be potentially dangerous, depending on the context. 
Fortunately, van der Schans and Boer13 propose a heuristic which does not suffer from these shortcomings and that: Is fast, which makes it suitable for applications in which the computation time is important13 Tries not to introduce more dependence between variables than is implied by the initially specified [correlations]13, contrary to the “setting missing correlations to zero” heuristic Additionally corrects for inconsistencies in the already specified [correlations]13, but again, more on this later Empirically produces completions with an average correlation difference with the maxdet completion [that is] reasonable13 A drawback of this heuristic, though, is that it depends on the ordering of the rows and columns of the [correlation] matrix, i.e. the heuristic will yield a different result if first rows and columns of [the matrix] are interchanged before starting the procedure13, which might or might not be acceptable in practice. As a side note, explicit solutions of the maximum determinant correlation matrix completion problem are known in specific cases, for example for correlation matrices that follow an L-shaped, block diagonal pattern which is a common structure in insurance problems12, c.f. Georgescu et al.6 The infeasible maximum determinant correlation matrix completion problem In the previous section, a solution to the maximum determinant correlation matrix completion problem was assumed to exist. There are at least two practical problems with this assumption: A partial correlation matrix might theoretically admit a completion, but numerical round-off errors might prevent an algorithm from finding it An example of such an ill-behaved matrix can be found in Glunt et al.18, but it is a partial covariance matrix and not a partial correlation matrix… Building a similar example using a partial correlation matrix is certainly doable, but I failed to do so quickly, so no example here. 
More typically, the initially specified [correlations] can be inconsistent in the sense that no valid completion exists13 Here, the partial correlation matrix $C_1$ perfectly illustrates this point, even though it is a rather extreme example. In both cases, simply failing to compute a solution might be unacceptable (e.g. for downstream pipelines), so something must be done. Unfortunately, the currently existing literature does not focus on algorithms that both complete partially specified matrices and also correct for inconsistencies13. In other words, there is no well-established formulation of what could be called the infeasible maximum determinant correlation matrix completion problem. At minimum, what can be said is that any solution to this new problem must involve a trade-off between the closeness to the initial partial correlation matrix (for example in terms of the Frobenius distance) and the value of the determinant. Incidentally, this is exactly how the heuristic algorithm of van der Schans and Boer13 works: the specified [correlations] are adjusted as little as possible13 and the introduced extra (conditional) dependence between variables is as little as possible13. When applied to the partial correlation matrix $C_1$, van der Schans and Boer’s algorithm13 gives \[C^*_1 = \begin{pmatrix} 1 &amp;amp; -0.99 &amp;amp; -0.07 &amp;amp; -0.07 \\ -0.99 &amp;amp; 1 &amp;amp; -0.07 &amp;amp; -0.07 \\ -0.07 &amp;amp; -0.07 &amp;amp; 1 &amp;amp; 0.98 \\ -0.07 &amp;amp; -0.07 &amp;amp; 0.98 &amp;amp; 1 \end{pmatrix}\] As expected, the completed correlation matrix $C^*_1$ is very far from the partial correlation matrix $C_1$, with only one correlation close to its initial value of -1 (-0.99). Again, this is an extreme infeasible example, but it empirically demonstrates that adjusting the initially specified correlations, even as little as possible13, can lead to a completed correlation matrix that does not resemble the initial partial correlation matrix at all. 
Implementation in Portfolio Optimizer Portfolio Optimizer implements two methods to complete a partial correlation matrix through the endpoint /assets/correlation/matrix/completed: One proprietary method that is guaranteed to find the maximum determinant completion if it exists, or otherwise to minimally adjust - in terms of Frobenius distance - the partially specified correlation matrix so that a maximum determinant completion exists The heuristic method of van der Schans and Boer13, with additional tweaks to improve its numerical robustness For comparison with van der Schans and Boer’s algorithm13, the proprietary method of Portfolio Optimizer applied to the partial correlation matrix $C_1$ gives: \[C^{**}_1 = \begin{pmatrix} 1 &amp;amp; -0.30 &amp;amp; -0.59 &amp;amp; -0.59 \\ -0.30 &amp;amp; 1 &amp;amp; -0.59 &amp;amp; -0.59 \\ -0.59 &amp;amp; -0.59 &amp;amp; 1 &amp;amp; 1 \\ -0.59 &amp;amp; -0.59 &amp;amp; 1 &amp;amp; 1 \end{pmatrix}\] Comparing the two completed matrices $C_1^{*}$ and $C_1^{**}$, it appears that the matrix $C_1^{**}$ is much19 closer to the partial correlation matrix $C_1$ than the matrix $C_1^{*}$, which is consistent with the expected behaviour of Portfolio Optimizer. Example of usage - Completing partial correlation matrices from financial institutions Major financial institutions regularly provide forecasts of future risk/return characteristics for broad asset classes over the next 5 to 20 years, called (Long Term) Capital Market Assumptions (LTCMA). In addition to the future expected volatility, the risk forecasts sometimes also include future expected correlation matrices, which is for example the case with J.P. Morgan. Unfortunately, these correlation matrices are typically partially specified. 
In addition, even if they were fully specified, combining capital market assumptions from several financial institutions would render them partial, because not all institutions cover the same asset classes… So, as an example of usage, I propose to compute the maximum determinant completion of the partial correlation matrix provided by Blackrock as part of their November 2025 5-year capital market assumptions and represented in Figure 6. Figure 6. Five-year expected future partial correlations between major asset classes, NZD currency, 13th November 2025. Source: Blackrock. The resulting maximum determinant completed correlation matrix is displayed in Figure 7. Figure 7. Five-year expected future Max-Det completed correlations between major asset classes, NZD currency, 13th November 2025. I would like to mention that such an example of usage was inspired by a LinkedIn post from Peter Urbani, who proposes to complete Blackrock’s partial correlation matrix into the correlation matrix displayed in Figure 8 thanks to a 2-factor model built from the fully specified global equities/global bonds correlations. Figure 8. Five-year expected future 2-factor model completed correlations between major asset classes, NZD currency, 13th November 2025. Source: Peter Urbani. Comparing the “Max-Det” completed correlation matrix with Peter Urbani’s “2-factor model” completed correlation matrix, the two matrices are perfectly identical20. As a closing (fun) remark, Olvera Astivia12 highlights that having half of the entries in a correlation matrix missing [can be] considered a rather extreme condition12. What could then be said about Blackrock’s initial partial correlation matrix, in which around 82% of the entries are missing?! 
Conclusion With this first blog post of 2026, my hope is that you have added to your quantitative toolbox a useful methodology to obtain, in a mathematically principled way, values implied in correlational structure of the data, even if said data has not (or cannot) be obtained12. In any case, for more mathematics of correlation matrices, feel free to connect with me on LinkedIn or to follow me on Twitter. – Nicholas J. Higham, Computing the Nearest Correlation Matrix—A Problem from Finance, IMA J. Numer. Anal. 22, 329–343, 2002. &amp;#8617; In terms of the Frobenius distance. &amp;#8617; Olaf Dreyer, Horst Kohler, Thomas Streuer, Completing correlation matrices, arXiv. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 That is, belonging to the set $\mathcal{E}^n$. &amp;#8617; R. Grone, C.R. Johnson, E. Sa, H. Wolkowicz, Positive definite completions of partial Hermitian matrices, Linear Algebra Appl. 58 (1984) 109–124. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 Georgescu DI, Higham NJ, Peters GW. 2018 Explicit solutions to correlation matrix completion problems, with an application to risk management and insurance. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 &amp;#8617;7 &amp;#8617;8 Fiedler, M. Matrix Inequalities. Numer. Math. 9, 109–119 (1966). &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 Ronald L. Smith, The positive definite completion problem revisited, Linear Algebra and its Applications, Volume 429, Issue 7, 2008, Pages 1442-1452. &amp;#8617; &amp;#8617;2 One interesting feature of that result is that the existence of a correlation matrix completion - which seems quite algebraic in nature - is actually equivalent to a “visual” condition. &amp;#8617; When one exists. &amp;#8617; Olaf Dreyer, Matrix completion and semidefinite matrices, arXiv. &amp;#8617; &amp;#8617;2 Oscar L. 
Olvera Astivia (2021) A Note on the General Solution to Completing Partially Specified Correlation Matrices, Measurement: Interdisciplinary Research and Perspectives, 19:2, 115-123. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 See van der Schans, Martin and Boer, Alex, A Heuristic for Completing Covariance And Correlation Matrices (March 14, 2013). Technical Working Paper 2014-01 November 2014. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 &amp;#8617;7 &amp;#8617;8 &amp;#8617;9 &amp;#8617;10 &amp;#8617;11 &amp;#8617;12 &amp;#8617;13 &amp;#8617;14 &amp;#8617;15 &amp;#8617;16 &amp;#8617;17 The maximum determinant correlation matrix completion maximizes the product of distances to the defining hyperplanes6. &amp;#8617; See Vandenberghe, Lieven and Boyd, Stephen and Wu, Shao-Po, Determinant Maximization with Linear Matrix Inequality Constraints, SIAM Journal on Matrix Analysis and Applications, Volume 19, Number 2, Pages 499-533. &amp;#8617; See Nakagaki, T., Fukuda, M., Kim, S. et al. A dual spectral projected gradient method for log-determinant semidefinite problems. Comput Optim Appl 76, 33–68 (2020). &amp;#8617; Or close to zero in case the nearest correlation matrix to the resulting approximate correlation matrix is computed. &amp;#8617; W. Glunt, T.L. Hayden, Charles R. Johnson, P. Tarazaga, Positive definite completions and determinant maximization, Linear Algebra and its Applications, Volume 288, 1999, Pages 1-10. &amp;#8617; The Frobenius distance between $C_1$ and $C^*_1$ is $\approx 2.630$, while the Frobenius distance between $C_1$ and $C^{**}_1$ is $\approx 1.575$. &amp;#8617; Incidentally, this allows one to conclude - due to the entropy maximization property of the maximum determinant correlation matrix completion under a multivariate normal model - that the 2-factor model used by Peter Urbani is a vanilla multivariate normal model. 
&amp;#8617;</summary></entry><entry><title type="html">Covariance Matrix Forecasting: Average Oracle Method</title><link href="https://portfoliooptimizer.io/blog/covariance-matrix-forecasting-average-oracle-method/" rel="alternate" type="text/html" title="Covariance Matrix Forecasting: Average Oracle Method" /><published>2025-12-10T00:00:00-06:00</published><updated>2025-12-10T00:00:00-06:00</updated><id>https://portfoliooptimizer.io/blog/covariance-matrix-forecasting-average-oracle-method</id><content type="html" xml:base="https://portfoliooptimizer.io/blog/covariance-matrix-forecasting-average-oracle-method/">&lt;p&gt;Continuing this series on covariance matrix forecasting (c.f. &lt;a href=&quot;/blog/from-volatility-forecasting-to-covariance-matrix-forecasting-the-return-of-simple-and-exponentially-weighted-moving-average-models&quot;&gt;here&lt;/a&gt; and &lt;a href=&quot;/blog/covariance-matrix-forecasting-iterated-exponentially-weighted-moving-average-model&quot;&gt;there&lt;/a&gt; for the previous posts), 
I will now describe a relatively recent&lt;sup id=&quot;fnref:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; &lt;em&gt;data-driven, model-free, way to [forecast] covariance [and correlation] matrices of time-varying systems&lt;/em&gt;&lt;sup id=&quot;fnref:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; rooted in &lt;a href=&quot;https://en.wikipedia.org/wiki/Random_matrix&quot;&gt;random matrix theory&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;This method - introduced in Bongiorno et al.&lt;sup id=&quot;fnref:1:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; and called &lt;em&gt;Average Oracle&lt;/em&gt; - consists in replacing the eigenvalues of a (noisy) estimate of a time-varying covariance matrix by time-independent 
&lt;em&gt;eigenvalues that encode the average influence of the future on present eigenvalues&lt;/em&gt;&lt;sup id=&quot;fnref:1:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In this blog post, I will describe that method and, as now usual in this series, I will illustrate its empirical performance in the context of monthly covariance matrix forecasting for a multi-asset class ETF portfolio.&lt;/p&gt;

&lt;h2 id=&quot;mathematical-preliminaries&quot;&gt;Mathematical preliminaries&lt;/h2&gt;

&lt;p&gt;Some of these sub-sections contain reminders from a &lt;a href=&quot;/blog/from-volatility-forecasting-to-covariance-matrix-forecasting-the-return-of-simple-and-exponentially-weighted-moving-average-models/&quot;&gt;previous blog post&lt;/a&gt;.&lt;/p&gt;

&lt;h3 id=&quot;dynamic-covariance-and-correlation-matrices&quot;&gt;Dynamic covariance and correlation matrices&lt;/h3&gt;

&lt;p&gt;Let $n$ be the number of assets in a universe of assets and $r_t \in \mathbb{R}^n$ be the vector of the (&lt;a href=&quot;https://en.wikipedia.org/wiki/Rate_of_return#Logarithmic_or_continuously_compounded_return&quot;&gt;logarithmic&lt;/a&gt;) 
return process of these assets over a time period $t$ (a day, a week, a month…) over which their mean return vector $\mu_t \in \mathbb{R}^n$ is assumed to be null.&lt;/p&gt;

&lt;p&gt;Then:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;The asset covariance matrix $\Sigma_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ over the time period $t$ is defined as $\Sigma_t = \mathbb{E} \left[ r_t r_t {}^t \right]$.&lt;/p&gt;

    &lt;p&gt;That matrix is called&lt;sup id=&quot;fnref:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; the &lt;em&gt;population&lt;/em&gt; (or &lt;em&gt;true&lt;/em&gt;) covariance matrix of the asset returns over the time period $t$.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The asset correlation matrix $C_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ over the time period $t$ is defined as the correlation matrix $ C_t = V_t^{-1} \Sigma_t V_t^{-1} $ associated to the covariance matrix $\Sigma_t$, where $V_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ is the diagonal matrix of the asset standard deviations.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Let now be $T$ time periods $t = 1..T$.&lt;/p&gt;

&lt;p&gt;Then:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;The averaged&lt;sup id=&quot;fnref:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt; covariance matrix $\Sigma_{1:T} \in \mathcal{M}(\mathbb{R}^{n \times n})$ over the $T$ time periods $t = 1..T$ is defined as $ \Sigma_{1:T} = \frac{1}{T} \sum_{t=1}^{T} \Sigma_{t} $.&lt;/p&gt;

    &lt;p&gt;In case the return process is time-invariant, the averaged covariance matrix $\Sigma_{1:T}$ is equal to the (constant) population covariance matrix $\left( \Sigma = \right) \Sigma_t, t = 1..T$.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The averaged&lt;sup id=&quot;fnref:6:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt; correlation matrix $C_{1:T} \in \mathcal{M}(\mathbb{R}^{n \times n})$ over the $T$ time periods $t = 1..T$ is defined as $ C_{1:T} = \frac{1}{T} \sum_{t=1}^{T} C_{t} $.&lt;/p&gt;

    &lt;p&gt;In case the return process is time-invariant, the averaged correlation matrix $C_{1:T}$ is equal to the (constant) correlation matrix $\left( C = \right) C_t, t = 1..T$.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The pseudo-averaged&lt;sup id=&quot;fnref:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt; correlation matrix $C_{p, 1:T} \in \mathcal{M}(\mathbb{R}^{n \times n})$ over the $T$ time periods $t = 1..T$ is defined as the correlation matrix associated to the averaged covariance matrix $\Sigma_{1:T}$.&lt;/p&gt;

    &lt;p&gt;In case the return process is time-invariant, the pseudo-averaged correlation matrix $C_{p, 1:T} $ is equal to the averaged correlation matrix $C_{1:T}$, but in general, due to time-varying asset standard deviations, the two correlation matrices are different.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;
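&lt;p&gt;As a quick numerical illustration of the difference between the averaged and pseudo-averaged correlation matrices, here is a minimal sketch (two hypothetical periods sharing the same correlation matrix but with different volatilities):&lt;/p&gt;

```python
import numpy as np

C = np.array([[1.0, 0.5], [0.5, 1.0]])             # constant correlation
V1, V2 = np.diag([0.1, 0.2]), np.diag([0.4, 0.1])  # time-varying volatilities
Sigma1, Sigma2 = V1 @ C @ V1, V2 @ C @ V2

Sigma_avg = (Sigma1 + Sigma2) / 2                  # averaged covariance matrix
C_avg = (C + C) / 2                                # averaged correlation matrix
d = np.sqrt(np.diag(Sigma_avg))
C_pseudo = Sigma_avg / np.outer(d, d)              # pseudo-averaged correlation

print(C_avg[0, 1], round(C_pseudo[0, 1], 4))       # 0.5 versus roughly 0.33
```

&lt;p&gt;With time-invariant volatilities ($V_1 = V_2$), the two quantities would coincide, as stated above.&lt;/p&gt;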

&lt;h3 id=&quot;dynamic-covariance-and-correlation-matrices-sample-estimators&quot;&gt;Dynamic covariance and correlation matrices sample estimators&lt;/h3&gt;

&lt;p&gt;In practice, the asset return process $r_t$ is usually not known and the only available information is the vectors of realized asset returns $ \tilde{r}_1,…, \tilde{r}_T \in \mathbb{R}^n$ for $T$ time periods.&lt;/p&gt;

&lt;p&gt;From each of these vectors:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;&lt;em&gt;The classical way to estimate the covariances is to compute the empirical (or sample) covariance matrix thanks to Pearson estimator&lt;/em&gt;&lt;sup id=&quot;fnref:2:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; defined as $\tilde{\Sigma}_t = \tilde{r}_t \tilde{r}_t {}^t $ over each time period $t = 1..T$.&lt;/p&gt;

    &lt;p&gt;Here, the &lt;a href=&quot;https://en.wikipedia.org/wiki/Outer_product&quot;&gt;outer product&lt;/a&gt; of the realized asset returns $ \tilde{r}_t \tilde{r}_t {}^t $ over the time period $t$ is called a covariance estimate $\tilde{\Sigma}_t$ - or covariance proxy&lt;sup id=&quot;fnref:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt; - for the (unobserved) asset returns covariance matrix over that time period.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The empirical (or sample) correlation matrix over each time period $t = 1..T$ is defined as the correlation matrix $ \tilde{C}_t $ associated to the covariance matrix $\tilde{\Sigma}_t$.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The empirical averaged covariance matrix over the $T$ time periods $t = 1..T$ is typically estimated from the realized asset returns $ \tilde{r}_1,…, \tilde{r}_T \in \mathbb{R}^n$ thanks to the averaged Pearson estimator defined as $\tilde{\Sigma}_{1:T} = \frac{1}{T} \sum_{t=1}^{T} \tilde{r}_t \tilde{r}_t {}^t $.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The empirical averaged correlation matrix over the $T$ time periods $t = 1..T$ is defined as $ \tilde{C}_{1:T} = \frac{1}{T} \sum_{t=1}^{T} \tilde{C}_{t} $.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The empirical pseudo-averaged correlation matrix over the $T$ time periods $t = 1..T$ is defined as the correlation matrix $ \tilde{C}_{p, 1:T}$ associated to the empirical averaged covariance matrix $\tilde{\Sigma}_{1:T}$.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;
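&lt;p&gt;These estimators are straightforward to compute; a minimal sketch with simulated returns (the dimensions are arbitrary):&lt;/p&gt;

```python
import numpy as np

rng = np.random.default_rng(0)
n, T = 3, 500
R = rng.standard_normal((T, n))      # simulated zero-mean realized returns

# One-period covariance proxy: the outer product of realized returns.
Sigma_proxy = np.outer(R[0], R[0])

# Averaged Pearson estimator over the T time periods.
Sigma_avg = sum(np.outer(r, r) for r in R) / T

print(Sigma_proxy.shape, Sigma_avg.shape)
```

&lt;p&gt;Note that each single-period proxy has rank one and is therefore extremely noisy; it is the averaging over the $T$ time periods that makes the estimator usable.&lt;/p&gt;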

&lt;h3 id=&quot;covariance-and-correlation-matrices-rotationally-invariant-estimators&quot;&gt;Covariance and correlation matrices rotationally invariant estimators&lt;/h3&gt;

&lt;h4 id=&quot;rotationally-invariant-estimators&quot;&gt;Rotationally invariant estimators&lt;/h4&gt;

&lt;p&gt;As already mentioned in &lt;a href=&quot;/blog/correlation-matrices-denoising-results-from-random-matrix-theory&quot;&gt;a previous blog post on correlation matrices denoising&lt;/a&gt;, the estimation of empirical covariance and correlation matrices in finance is affected by noise, in the form of measurement error, 
due in part to the short length of the time series of asset returns typically used in their computation.&lt;/p&gt;

&lt;p&gt;Indeed:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;&lt;em&gt;Constructing a well-diversified portfolio requires many assets&lt;/em&gt;&lt;sup id=&quot;fnref:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:9&quot; class=&quot;footnote&quot;&gt;7&lt;/a&gt;&lt;/sup&gt;, that is, a big $n$.&lt;/li&gt;
  &lt;li&gt;&lt;em&gt;In contrast, rapid shifts in financial market dependencies can only be captured by short calibration windows&lt;/em&gt;&lt;sup id=&quot;fnref:9:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:9&quot; class=&quot;footnote&quot;&gt;7&lt;/a&gt;&lt;/sup&gt; for estimating asset correlations, that is, a small $T$.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This situation leads to an &lt;em&gt;aspect ratio&lt;/em&gt; $q = \frac{n}{T}$ of empirical matrices either close to 1 - or even worse, much greater than 1 -, which is catastrophic from an estimation perspective, 
c.f. the above blog post and references therein.&lt;/p&gt;

&lt;p&gt;Fortunately, &lt;em&gt;numerous techniques have been developed to improve the estimation of noisy covariance [or correlation] matrices&lt;/em&gt;&lt;sup id=&quot;fnref:9:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:9&quot; class=&quot;footnote&quot;&gt;7&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Several of these techniques, like &lt;a href=&quot;/blog/correlation-matrices-denoising-results-from-random-matrix-theory&quot;&gt;the eigenvalue clipping method&lt;/a&gt;&lt;sup id=&quot;fnref:15&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:15&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt;, involve a specific class of matrix estimators - known as &lt;em&gt;Rotationally Invariant Estimators (RIE)&lt;/em&gt; or &lt;em&gt;Orthogonally Invariant Estimators (OIE)&lt;/em&gt; - that leaves the eigenvectors of the empirical matrices untouched while altering their eigenvalues.&lt;/p&gt;
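&lt;p&gt;For illustration, here is a minimal sketch of such an RIE; the eigenvalue clipping rule used (keeping eigenvalues above the Marchenko-Pastur upper edge and averaging the rest, preserving the trace) is one common variant among several:&lt;/p&gt;

```python
import numpy as np

def clip_rie(C_emp, q):
    # Eigenvalue clipping: keep eigenvalues above the Marchenko-Pastur
    # upper edge, replace the others by their average (trace preserved);
    # the eigenvectors are left untouched.
    w, V = np.linalg.eigh(C_emp)
    edge = (1.0 + np.sqrt(q)) ** 2
    signal = w > edge
    w_clipped = np.where(signal, w, w[~signal].mean())
    return V @ np.diag(w_clipped) @ V.T

# Hypothetical example: empirical correlation matrix of pure-noise returns.
rng = np.random.default_rng(1)
n, T = 50, 100
C_emp = np.corrcoef(rng.standard_normal((T, n)), rowvar=False)
C_rie = clip_rie(C_emp, n / T)
print(np.isclose(np.trace(C_rie), n))
```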

&lt;p&gt;In other words, an RIE estimator $\Xi \left( \tilde{\Sigma}_t \right) \in \mathcal{M}(\mathbb{R}^{n \times n})$ of a true covariance or correlation matrix $\Sigma_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ obtained from its empirical counterpart $\tilde{\Sigma}_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ has the general form&lt;/p&gt;

\[\Xi \left( \tilde{\Sigma}_t \right) = \tilde{V}_t \Lambda_t \tilde{V}_t {}^t\]

&lt;p&gt;, where:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$\Lambda_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ is a diagonal matrix of &lt;em&gt;well-chosen&lt;/em&gt;&lt;sup id=&quot;fnref:1:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; eigenvalues.&lt;/li&gt;
  &lt;li&gt;$\tilde{V}_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ is the matrix of eigenvectors of $\tilde{\Sigma}_t$, defined through &lt;a href=&quot;https://en.wikipedia.org/wiki/Eigendecomposition_of_a_matrix&quot;&gt;the spectral decomposition&lt;/a&gt; $ \tilde{\Sigma}_t = \tilde{V}_t \tilde{\Lambda}_t \tilde{V}_t {}^t $ with $ \tilde{\Lambda}_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ the diagonal matrix of eigenvalues of $\tilde{\Sigma}_t$.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Bun et al.&lt;sup id=&quot;fnref:2:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; explains the underlying rationale as follows:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;The true matrix [$\Sigma_t$] is unknown and we do not have any particular insights on its components (the eigenvectors).&lt;/p&gt;

  &lt;p&gt;Therefore we would like our estimator [$\Xi \left( \tilde{\Sigma}_t \right)$] to be constructed in a rotationally invariant way from the noisy observation [$\tilde{\Sigma}_t$] that we have.&lt;/p&gt;

  &lt;p&gt;In simple terms, this means that there is no privileged direction in the $n$-dimensional space that would allow one to bias the eigenvectors of the estimator [$\Xi \left( \tilde{\Sigma}_t \right)$] in some special directions.&lt;/p&gt;

  &lt;p&gt;More formally, the estimator construction must obey: $ \Omega \Xi \left( \tilde{\Sigma}_t \right) \Omega {}^t$ $=$ $\Xi \left( \Omega \tilde{\Sigma}_t \Omega {}^t \right)$ for any rotation matrix $\Omega \in \mathcal{M}(\mathbb{R}^{n \times n})$.&lt;/p&gt;

  &lt;p&gt;Any estimator satisfying [that equation] will be referred to as a Rotational Invariant Estimator (RIE).&lt;/p&gt;

  &lt;p&gt;In this case, it turns out that the eigenvectors of the estimator [$\Xi \left( \tilde{\Sigma}_t \right)$] have to be the same as those of the noisy matrix [$\tilde{\Sigma}_t$].&lt;/p&gt;
&lt;/blockquote&gt;
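&lt;p&gt;The defining property above is easy to verify numerically. Below is a minimal sketch, assuming NumPy, in which an arbitrary eigenvalue transformation stands in for a real cleaning scheme; the function name &lt;code&gt;rie&lt;/code&gt; is mine:&lt;/p&gt;

```python
import numpy as np

def rie(sigma_tilde, f=np.sqrt):
    # Rotational Invariant Estimator sketch: keep the eigenvectors of the
    # noisy matrix and transform only its eigenvalues (here by an arbitrary
    # function f, standing in for a real eigenvalue cleaning scheme).
    eigvals, eigvecs = np.linalg.eigh(sigma_tilde)
    return eigvecs @ np.diag(f(eigvals)) @ eigvecs.T

rng = np.random.default_rng(0)
n = 5
x = rng.standard_normal((n, 50))
sigma_tilde = x @ x.T / 50  # a noisy sample covariance matrix

# A random rotation matrix, from the QR decomposition of a Gaussian matrix
omega, _ = np.linalg.qr(rng.standard_normal((n, n)))

# The RIE property: rotating then estimating equals estimating then rotating
lhs = omega @ rie(sigma_tilde) @ omega.T
rhs = rie(omega @ sigma_tilde @ omega.T)
assert np.allclose(lhs, rhs)
```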

&lt;h4 id=&quot;rotationally-invariant-oracle-estimator&quot;&gt;Rotationally invariant oracle estimator&lt;/h4&gt;

&lt;p&gt;Bun et al.&lt;sup id=&quot;fnref:2:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; shows that the optimal RIE estimator of the unknown matrix $\Sigma_t$ in terms of &lt;a href=&quot;https://en.wikipedia.org/wiki/Matrix_norm&quot;&gt;Frobenius norm&lt;/a&gt; is the one whose diagonal matrix of eigenvalues $\Lambda_O$ satisfies&lt;/p&gt;

\[\Lambda_O = \text{diag} \left(  \tilde{V}_t {}^t \Sigma_t \tilde{V}_t \right)\]

&lt;p&gt;That estimator is &lt;em&gt;sometimes called the oracle estimator because it depends explicitly on the knowledge of the true signal [$\Sigma_t$]&lt;/em&gt;&lt;sup id=&quot;fnref:2:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; and so is not directly usable in practice.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Remarkably, [though], asymptotically optimal RIEs that converge to the oracle estimator can be obtained without the knowledge of the true covariance; however, such estimators require that: i) the ground truth does not change, ii) the data matrix is very large, and iii) the data has at least finite fourth moments&lt;/em&gt;&lt;sup id=&quot;fnref:1:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Those conditions, and especially the first one, are definitely not satisfied by asset returns, which leads to suboptimal estimators in practice…&lt;/p&gt;

&lt;p&gt;As a side note, and maybe contrary to intuition, the optimal eigenvalues $\Lambda_O$ are NOT equal to the eigenvalues of $\Sigma_t$, because this would result in &lt;em&gt;a spectrum that is too wide&lt;/em&gt;&lt;sup id=&quot;fnref:2:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
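&lt;p&gt;Numerically, with a synthetic &quot;true&quot; matrix of my own choosing (a hedged sketch, assuming NumPy), the oracle eigenvalues $\Lambda_O$ indeed bring the RIE closer to the true matrix, in Frobenius norm, than the sample eigenvalues do:&lt;/p&gt;

```python
import numpy as np

rng = np.random.default_rng(1)
n, t_obs = 10, 40

# A "true" covariance matrix and a noisy sample estimate of it
a = rng.standard_normal((n, n))
sigma_true = a @ a.T / n + np.eye(n)
samples = rng.multivariate_normal(np.zeros(n), sigma_true, size=t_obs)
sigma_noisy = samples.T @ samples / t_obs

# Oracle eigenvalues: project the true matrix on the noisy eigenvectors
_, v_noisy = np.linalg.eigh(sigma_noisy)
lambda_oracle = np.diag(v_noisy.T @ sigma_true @ v_noisy)
xi_oracle = v_noisy @ np.diag(lambda_oracle) @ v_noisy.T

# The oracle RIE is closer to the truth (in Frobenius norm) than an RIE
# reusing the sample eigenvalues, i.e. the noisy matrix itself
err_oracle = np.linalg.norm(xi_oracle - sigma_true)
err_sample = np.linalg.norm(sigma_noisy - sigma_true)
assert err_oracle < err_sample
```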

&lt;h2 id=&quot;the-average-oracle-covariance-matrix-forecasting-method&quot;&gt;The Average Oracle covariance matrix forecasting method&lt;/h2&gt;

&lt;h3 id=&quot;forecasting-formulas&quot;&gt;Forecasting formulas&lt;/h3&gt;

&lt;p&gt;Let:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$n$ be the number of assets in a universe of assets&lt;/li&gt;
  &lt;li&gt;$\tilde{r}_t \tilde{r}_t {}^t$, $t=1..T$ the outer products of the observed asset returns over each of $T$ past periods&lt;/li&gt;
  &lt;li&gt;$1 \leq h_{in} \ll T$ a chosen number of past periods&lt;/li&gt;
  &lt;li&gt;$\mathcal{I}_{cal} = [t_{cal}, T - h_{next}]$ a long calibration window, with $t_{cal}$ chosen so that $t_{cal} \geq h_{in}$ and $| \mathcal{I}_{cal} | \gg h_{in}$&lt;/li&gt;
&lt;/ul&gt;

&lt;h4 id=&quot;asset-returns-averaged-covariance-matrix&quot;&gt;Asset returns averaged covariance matrix&lt;/h4&gt;

&lt;p&gt;The Average Oracle covariance matrix forecasting model estimates the asset returns averaged covariance matrix $\hat{\Sigma}_{T+1:T+h_{next}}$ over the next $h_{next} \geq 1$ periods as follows&lt;sup id=&quot;fnref:1:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;Choose a number of random time periods $n_B \geq 1$ to generate.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;For $b = 1..n_B$ do
    &lt;ul&gt;
      &lt;li&gt;
        &lt;p&gt;Select uniformly at random with replacement a time period $t^{(b)} \in \mathcal{I}_{cal}$&lt;/p&gt;
      &lt;/li&gt;
      &lt;li&gt;
        &lt;p&gt;Compute the “past” averaged asset returns covariance matrix $ \tilde{\Sigma}^{(b)}_{in} $ on the train window $\mathcal{I}^{(b)}_{in} = [t^{(b)} - h_{in} + 1, t^{(b)}]$, defined by $\tilde{\Sigma}^{(b)}_{in} = \tilde{\Sigma}_{t^{(b)} - h_{in} + 1:t^{(b)}} = \frac{1}{h_{in}} \sum_{t=t^{(b)} - h_{in} + 1}^{t^{(b)}} \tilde{r}_t \tilde{r}_t {}^t $ and its associated correlation matrix $\tilde{C}^{(b)}_{in} = \tilde{C}_{p, t^{(b)} - h_{in} + 1:t^{(b)}}$ whose spectral decomposition is given by $\tilde{C}^{(b)}_{in} = \tilde{V}^{(b)}_{in} \tilde{\Lambda}^{(b)}_{in} \tilde{V}^{(b)}_{in} {}^t $.&lt;/p&gt;
      &lt;/li&gt;
      &lt;li&gt;
        &lt;p&gt;Compute the “future” averaged asset returns covariance matrix $ \tilde{\Sigma}^{(b)}_{next} $ on the test window $\mathcal{I}^{(b)}_{next} = [t^{(b)} + 1, t^{(b)} + h_{next}]$, defined by  $ \tilde{\Sigma}^{(b)}_{next} = \tilde{\Sigma}_{t^{(b)} + 1:t^{(b)} + h_{next}} = \frac{1}{h_{next}} \sum_{t=t^{(b)} + 1}^{t^{(b)} + h_{next}} \tilde{r}_t \tilde{r}_t {}^t $ and its associated correlation matrix $\tilde{C}^{(b)}_{next} = \tilde{C}_{p, t^{(b)} + 1:t^{(b)} + h_{next}}$.&lt;/p&gt;
      &lt;/li&gt;
      &lt;li&gt;
        &lt;p&gt;Compute the diagonal matrix of oracle eigenvalues $\tilde{\Lambda}_O^{(b)} = \text{diag} \left( \tilde{V}^{(b)}_{in} {}^t \tilde{C}^{(b)}_{next} \tilde{V}^{(b)}_{in} \right) $.&lt;/p&gt;
      &lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Compute the diagonal matrix of Average Oracle eigenvalues $\tilde{\Lambda}_{AO} = \frac{1}{n_B} \sum_{b=1}^{n_B} \tilde{\Lambda}_O^{(b)} $.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Compute the most recent “past” averaged asset returns covariance matrix $ \tilde{\Sigma}_{in} $  on the window $\mathcal{I}_{in} = [T - h_{in} + 1, T]$, defined by $ \tilde{\Sigma}_{in}  = \tilde{\Sigma}_{T - h_{in} + 1:T} = \frac{1}{h_{in}} \sum_{t=T - h_{in} + 1}^{T} \tilde{r}_t \tilde{r}_t {}^t $ and its associated correlation matrix $\tilde{C}_{in} = \tilde{C}_{p, T - h_{in} + 1:T}$ whose spectral decomposition is given by $\tilde{C}_{in} = \tilde{V}_{in} \tilde{\Lambda}_{in} \tilde{V}_{in} {}^t $.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;Compute $\hat{\Sigma}_{T+1:T+h_{next}} = D_{in} \tilde{V}_{in} \tilde{\Lambda}_{AO} \tilde{V}_{in} {}^t D_{in} $, where $D_{in} \in \mathcal{M}(\mathbb{R}^{n \times n})$ is the diagonal matrix of the standard deviations $\sqrt{ \left( \tilde{\Sigma}_{in} \right)_{ii} }, i=1..n$.&lt;/li&gt;
&lt;/ul&gt;
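&lt;p&gt;The forecasting steps above can be sketched in Python as follows. This is a minimal sketch, assuming NumPy and zero-mean returns, with a calibration window spanning all admissible periods for simplicity; the function name and defaults are illustrative, and real implementations would add refinements such as the eigenvalue ordering correction discussed further below:&lt;/p&gt;

```python
import numpy as np

def cov_to_corr(sigma):
    # Correlation matrix associated with a covariance matrix
    d_inv = 1.0 / np.sqrt(np.diag(sigma))
    return sigma * np.outer(d_inv, d_inv)

def average_oracle_forecast(returns, h_in, h_next, n_b=1000, seed=0):
    # `returns` is a (T, n) array of (assumed zero-mean) asset returns; the
    # covariance over a window is the average of the outer products r_t r_t^t.
    rng = np.random.default_rng(seed)
    t_total, n = returns.shape

    def window_cov(start, end):  # inclusive 0-based window [start, end]
        r = returns[start:end + 1]
        return r.T @ r / len(r)

    # Bootstrap the oracle eigenvalues over the calibration window
    lambda_sum = np.zeros(n)
    for _ in range(n_b):
        t_b = rng.integers(h_in - 1, t_total - h_next)  # period index t^(b)
        c_in = cov_to_corr(window_cov(t_b - h_in + 1, t_b))    # train window
        c_next = cov_to_corr(window_cov(t_b + 1, t_b + h_next))  # test window
        _, v_in = np.linalg.eigh(c_in)  # same ordering convention every time
        lambda_sum += np.diag(v_in.T @ c_next @ v_in)  # oracle eigenvalues
    lambda_ao = lambda_sum / n_b  # Average Oracle eigenvalues

    # Apply the Average Oracle eigenvalues to the most recent in-sample window
    sigma_in = window_cov(t_total - h_in, t_total - 1)
    _, v_in = np.linalg.eigh(cov_to_corr(sigma_in))
    c_hat = v_in @ np.diag(lambda_ao) @ v_in.T
    d_in = np.sqrt(np.diag(sigma_in))
    return np.outer(d_in, d_in) * c_hat  # forecast covariance matrix

# Toy usage on random data
rng = np.random.default_rng(42)
rets = rng.standard_normal((500, 4)) * 0.01
sigma_hat = average_oracle_forecast(rets, h_in=60, h_next=20, n_b=200)
assert sigma_hat.shape == (4, 4)
assert np.allclose(sigma_hat, sigma_hat.T)
```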

&lt;p&gt;For more visual clarity, Figure 1 illustrates that process.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/covariance-matrix-forecasting-ao-bongiorno-method.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/covariance-matrix-forecasting-ao-bongiorno-method-small.png&quot; alt=&quot;Illustration of the average oracle eigenvalues computation process, which uses different windows included in a long calibration window. Source: Adapted from Bongiorno et al.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 1. Illustration of the Average Oracle eigenvalues computation process, which uses different windows included in a long calibration window. Source: Adapted from Bongiorno et al.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;h4 id=&quot;asset-returns-averaged-and-pseudo-averaged-correlation-matrix&quot;&gt;Asset returns averaged and pseudo-averaged correlation matrix&lt;/h4&gt;

&lt;p&gt;The Average Oracle covariance matrix forecasting model does not easily&lt;sup id=&quot;fnref:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:10&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt; allow the estimation of the asset returns averaged correlation matrix $\hat{C}_{T+1:T+h_{next}}$, because it does not rely on the estimation of the individual covariance matrices $\hat{\Sigma}_{T+1}$, $…$, $\hat{\Sigma}_{T+h_{next}}$.&lt;/p&gt;

&lt;p&gt;The asset returns pseudo-averaged correlation matrix $\hat{C}_{p, T+1:T+h_{next}}$ over the next $h_{next}$ periods, though, corresponds to the correlation matrix associated to the averaged covariance matrix $\hat{\Sigma}_{T+1:T+h_{next}}$.&lt;/p&gt;
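&lt;p&gt;In other words, the pseudo-averaged correlation matrix is obtained by normalizing the forecast averaged covariance matrix by its own volatilities. A minimal sketch, assuming NumPy, with illustrative numbers:&lt;/p&gt;

```python
import numpy as np

def pseudo_averaged_corr(sigma_hat):
    # Correlation matrix associated with the forecast averaged covariance
    # matrix: divide each entry by the product of the forecast volatilities.
    d_inv = 1.0 / np.sqrt(np.diag(sigma_hat))
    return sigma_hat * np.outer(d_inv, d_inv)

# A hypothetical 2-asset forecast covariance matrix
sigma_hat = np.array([[0.04, 0.006],
                      [0.006, 0.09]])
c_hat = pseudo_averaged_corr(sigma_hat)
assert np.allclose(np.diag(c_hat), 1.0)
assert np.isclose(c_hat[0, 1], 0.006 / (0.2 * 0.3))  # i.e. 0.1
```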

&lt;h3 id=&quot;rationale&quot;&gt;Rationale&lt;/h3&gt;

&lt;p&gt;The Average Oracle covariance matrix forecasting method &lt;em&gt;captures the average transition from two consecutive time windows&lt;/em&gt;&lt;sup id=&quot;fnref:1:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; by &lt;em&gt;averaging [oracle eigenvalues], rank-wise, over many randomly selected consecutive intervals taken from a long calibration window&lt;/em&gt;&lt;sup id=&quot;fnref:1:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;That covariance matrix forecasting method thus &lt;em&gt;tackles the evolution of [asset returns] dependencies with a time-invariant eigenvalue cleaning scheme&lt;/em&gt;&lt;sup id=&quot;fnref:1:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;As noted in Bongiorno et al.&lt;sup id=&quot;fnref:1:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;This is a zeroth order approximation, as the fluctuations of the optimal eigenvalue matrix around $\tilde{\Lambda}_{AO}$ most probably contain valuable additional information (as may do those of the eigenvectors). Nevertheless, this approximation is a powerful filtering tool and is easily computed from data without any modeling assumptions about the underlying system.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h3 id=&quot;performances&quot;&gt;Performances&lt;/h3&gt;

&lt;p&gt;From a practical perspective, Bongiorno et al.&lt;sup id=&quot;fnref:1:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; and Bongiorno and Challet&lt;sup id=&quot;fnref:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;10&lt;/a&gt;&lt;/sup&gt; empirically demonstrate that the Average Oracle covariance matrix forecasting method outperforms the &lt;em&gt;current state-of-the-art (and complex) methods, Dynamic Conditional Covariance coupled to Non-Linear Shrinkage (DCC+NLS)&lt;/em&gt;&lt;sup id=&quot;fnref:3:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;10&lt;/a&gt;&lt;/sup&gt;, 
both in terms of Frobenius distance - as highlighted in Figure 2 - and in terms of &lt;em&gt;four key portfolio metrics: Sharpe ratio, turnover, gross leverage, and diversification&lt;/em&gt;&lt;sup id=&quot;fnref:3:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;10&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/covariance-matrix-forecasting-ao-bongiorno-frobenius-distance.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/covariance-matrix-forecasting-ao-bongiorno-frobenius-distance-small.png&quot; alt=&quot;Average Frobenius distance between the forecasted and the out-of-sample covariance matrices of n = 100 U.S. stocks as a function of the number of past periods $h_{in}$. Source: Adapted from Bongiorno et al.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 2. Average Frobenius distance between the forecasted and the out-of-sample covariance matrices of n = 100 U.S. stocks as a function of the number of past periods $h_{in}$. Source: Adapted from Bongiorno et al.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;These performances are commented on as follows in Bongiorno et al.&lt;sup id=&quot;fnref:1:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;The fact that the Average Oracle is a better estimator for time-evolving covariance matrices implies that the most recent information contained in the sample eigenvalues is less relevant (and more noisy) than the AO ones that focus on the average transition.&lt;/p&gt;

  &lt;p&gt;Thus, the advantage of the Average Oracle is precisely that it captures some part of the average dynamics that is discarded by the assumption of a constant true covariance matrix [made in the DCC+NLS method].&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h3 id=&quot;implementation-details&quot;&gt;Implementation details&lt;/h3&gt;

&lt;h4 id=&quot;how-to-choose-the-number-of-past-periods-h_in&quot;&gt;How to choose the number of past periods $h_{in}$?&lt;/h4&gt;

&lt;p&gt;Through extensive simulations, Bongiorno et al.&lt;sup id=&quot;fnref:1:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; concludes that the Average Oracle eigenvalues $\tilde{\Lambda}_{AO}$ mainly depend on the number of assets $n$ and&lt;sup id=&quot;fnref:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:11&quot; class=&quot;footnote&quot;&gt;11&lt;/a&gt;&lt;/sup&gt; on the number of past periods $h_{in}$ over which to compute the averaged asset returns covariance matrix.&lt;/p&gt;

&lt;p&gt;A natural question - unfortunately neither answered in Bongiorno et al.&lt;sup id=&quot;fnref:1:13&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; nor in Bongiorno and Challet&lt;sup id=&quot;fnref:3:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;10&lt;/a&gt;&lt;/sup&gt; - is then how to choose the value of $h_{in}$ in order to obtain the best forecasting performances.&lt;/p&gt;

&lt;p&gt;One possible answer, which relies on the interpretation of the Average Oracle covariance matrix forecasting method as a covariance matrix &lt;em&gt;cleaning scheme&lt;/em&gt;&lt;sup id=&quot;fnref:1:14&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, is to select $h_{in}$ so as to maximize the forecasting performances of 
&lt;a href=&quot;/blog/from-volatility-forecasting-to-covariance-matrix-forecasting-the-return-of-simple-and-exponentially-weighted-moving-average-models/&quot;&gt;a simple moving average covariance matrix forecasting model&lt;/a&gt; with a window size equal to $h_{in}$ 
for the considered value of $h_{next}$.&lt;/p&gt;
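&lt;p&gt;That selection procedure can be sketched as follows - an illustrative implementation under my own choice of error measure (squared Frobenius distance between the SMA forecast and the realized covariance matrix over the next $h_{next}$ periods), not the proprietary procedure used by &lt;strong&gt;Portfolio Optimizer&lt;/strong&gt;:&lt;/p&gt;

```python
import numpy as np

def sma_h_in_selection(returns, candidates, h_next):
    # Pick the SMA window size h_in whose trailing-window covariance forecast
    # best matches the realized covariance over the next h_next periods.
    t_total, _ = returns.shape
    errors = {}
    for h_in in candidates:
        errs = []
        for t in range(h_in, t_total - h_next + 1, h_next):
            past = returns[t - h_in:t]
            future = returns[t:t + h_next]
            sigma_sma = past.T @ past / h_in          # SMA forecast
            sigma_real = future.T @ future / h_next   # realized covariance
            errs.append(np.linalg.norm(sigma_sma - sigma_real) ** 2)
        errors[h_in] = np.mean(errs)
    return min(errors, key=errors.get)

# Toy usage on random data
rng = np.random.default_rng(7)
rets = rng.standard_normal((600, 3)) * 0.01
best = sma_h_in_selection(rets, candidates=[20, 60, 120], h_next=20)
assert best in (20, 60, 120)
```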

&lt;h4 id=&quot;how-to-choose-the-number-of-random-time-periods-n_b&quot;&gt;How to choose the number of random time periods $n_B$?&lt;/h4&gt;

&lt;p&gt;The number of random time periods $n_B$ must be high enough to ensure that the Average Oracle eigenvalues are stable enough - in particular when $h_{next}$ is small - because &lt;em&gt;by reducing [$h_{next}$], 
the estimation becomes noisier and thus requires more train and test windows […] to yield average eigenvalues with the same level of precision&lt;/em&gt;&lt;sup id=&quot;fnref:1:15&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Two examples&lt;sup id=&quot;fnref:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Bongiorno et al.&lt;sup id=&quot;fnref:1:16&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; uses $n_B = 10,000$ together with $h_{in} = h_{next} = 252$.&lt;/li&gt;
  &lt;li&gt;Bongiorno and Challet&lt;sup id=&quot;fnref:3:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;10&lt;/a&gt;&lt;/sup&gt; uses $n_B = 10,000$ together with $h_{in} \in \lbrace 240, 1200 \rbrace$ and $h_{next} \in \lbrace 5, 20 \rbrace$.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Note, though, that depending on the length of the calibration window $\mathcal{I}_{cal}$ and/or on the number of assets $n$, the time periods do not need to be generated at random - they can just as well be generated deterministically so as to cover the whole calibration window.&lt;/p&gt;

&lt;h4 id=&quot;how-to-enforce-a-proper-ordering-of-the-average-oracle-eigenvalues&quot;&gt;How to enforce a proper ordering of the Average Oracle eigenvalues?&lt;/h4&gt;

&lt;p&gt;Bongiorno et al.&lt;sup id=&quot;fnref:1:17&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; stresses &lt;em&gt;that the columns of the eigenvectors [$\tilde{V}^{(b)}_{in}$ and $\tilde{V}_{in}$] must always follow the same eigenvalue ordering convention&lt;/em&gt;&lt;sup id=&quot;fnref:1:18&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Nevertheless, despite enforcing such a convention, the order of the resulting Average Oracle eigenvalues $\tilde{\Lambda}_{AO}$ is not necessarily preserved &lt;em&gt;due to the finite size of the sample&lt;/em&gt;&lt;sup id=&quot;fnref:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;This &lt;em&gt;may be an unwanted feature within a rotational invariant assumption&lt;/em&gt;&lt;sup id=&quot;fnref:5:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;, since &lt;em&gt;there is no reason a priori to expect that it is optimal to modify the order of the eigenvalues, that is to say, the variance associated with the principal components&lt;/em&gt;&lt;sup id=&quot;fnref:5:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Bun et al.&lt;sup id=&quot;fnref:5:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt; proposes two solutions to this problem:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Sort the resulting eigenvalues&lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Perform &lt;a href=&quot;https://en.wikipedia.org/wiki/Isotonic_regression&quot;&gt;an isotonic regression&lt;/a&gt; on the resulting eigenvalues&lt;/p&gt;

    &lt;p&gt;In the context of the cross-validated eigenvalues cleaning scheme described in Reigneron et al.&lt;sup id=&quot;fnref:13&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:13&quot; class=&quot;footnote&quot;&gt;14&lt;/a&gt;&lt;/sup&gt;, the impact of using an isotonic regression is depicted in Figure 3.&lt;/p&gt;

    &lt;figure&gt;
    &lt;a href=&quot;/assets/images/blog/covariance-matrix-forecasting-ao-reigneron-isotonic-regression.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/covariance-matrix-forecasting-ao-reigneron-isotonic-regression-small.png&quot; alt=&quot;Raw, cross-validated and isotonic eigenvalues as a function of in-sample eigenvalues. Source: Reigneron et al.&quot; /&gt;&lt;/a&gt;
    &lt;figcaption&gt;Figure 3. Raw, cross-validated and isotonic eigenvalues as a function of in-sample eigenvalues. Source: Reigneron et al.&lt;/figcaption&gt;
  &lt;/figure&gt;
  &lt;/li&gt;
&lt;/ul&gt;
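&lt;p&gt;As an illustration of the second solution, here is a minimal sketch, assuming NumPy, that enforces a non-decreasing ordering on noisy averaged eigenvalues using the standard pool adjacent violators algorithm rather than any specific library:&lt;/p&gt;

```python
import numpy as np

def isotonic_increasing(y):
    # Pool Adjacent Violators: the closest (in least squares) non-decreasing
    # sequence to y, used here to restore the ordering of noisy averaged
    # eigenvalues without simply sorting them.
    blocks_v, blocks_w = [], []
    for v in y:
        blocks_v.append(float(v))
        blocks_w.append(1)
        # Merge blocks while the monotonicity constraint is violated
        while len(blocks_v) > 1 and blocks_v[-2] > blocks_v[-1]:
            w_tot = blocks_w[-2] + blocks_w[-1]
            v_avg = (blocks_v[-2] * blocks_w[-2] + blocks_v[-1] * blocks_w[-1]) / w_tot
            blocks_v[-2:] = [v_avg]
            blocks_w[-2:] = [w_tot]
    return np.repeat(blocks_v, blocks_w)

# Averaged eigenvalues whose ascending order was broken by sampling noise
lambda_ao = np.array([0.2, 0.5, 0.4, 1.1, 3.0])
lambda_iso = isotonic_increasing(lambda_ao)
assert np.all(np.diff(lambda_iso) >= 0)  # ordering is restored
```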

&lt;h2 id=&quot;implementation-in-portfolio-optimizer&quot;&gt;Implementation in Portfolio Optimizer&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Portfolio Optimizer&lt;/strong&gt; implements the Average Oracle covariance and correlation matrix forecasting models through the endpoints 
&lt;a href=&quot;https://docs.portfoliooptimizer.io/&quot;&gt;&lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;/assets/covariance/matrix/forecast/average-oracle&lt;/code&gt;&lt;/a&gt; and &lt;a href=&quot;https://docs.portfoliooptimizer.io/&quot;&gt;&lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;/assets/correlation/matrix/forecast/average-oracle&lt;/code&gt;&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;These endpoints support the two covariance proxies below:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Squared (close-to-close) returns&lt;/li&gt;
  &lt;li&gt;Demeaned squared (close-to-close) returns&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;These endpoints also:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Implement an isotonic regression correction step for the Average Oracle eigenvalues.&lt;/li&gt;
  &lt;li&gt;Allow the number of past periods $h_{in}$ to be determined automatically, using a proprietary procedure.&lt;/li&gt;
  &lt;li&gt;Allow either generating a given number of time periods $n_B$ uniformly at random within the calibration window $\mathcal{I}_{cal}$ or using all the time periods available within that window.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2 id=&quot;example-of-usage---covariance-matrix-forecasting-at-monthly-level-for-a-portfolio-of-various-etfs&quot;&gt;Example of usage - Covariance matrix forecasting at monthly level for a portfolio of various ETFs&lt;/h2&gt;

&lt;p&gt;As an example of usage, I propose to evaluate the empirical performances of the Average Oracle covariance matrix forecasting model within the framework of &lt;a href=&quot;&quot;&gt;the previous blog post&lt;/a&gt;, whose aim is 
to forecast monthly covariance and correlation matrices for a portfolio of 10 ETFs representative&lt;sup id=&quot;fnref:17&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:17&quot; class=&quot;footnote&quot;&gt;15&lt;/a&gt;&lt;/sup&gt; of various asset classes:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;U.S. stocks (SPY ETF)&lt;/li&gt;
  &lt;li&gt;European stocks (EZU ETF)&lt;/li&gt;
  &lt;li&gt;Japanese stocks (EWJ ETF)&lt;/li&gt;
  &lt;li&gt;Emerging markets stocks (EEM ETF)&lt;/li&gt;
  &lt;li&gt;U.S. REITs (VNQ ETF)&lt;/li&gt;
  &lt;li&gt;International REITs (RWX ETF)&lt;/li&gt;
  &lt;li&gt;U.S. 7-10 year Treasuries (IEF ETF)&lt;/li&gt;
  &lt;li&gt;U.S. 20+ year Treasuries (TLT ETF)&lt;/li&gt;
  &lt;li&gt;Commodities (DBC ETF)&lt;/li&gt;
  &lt;li&gt;Gold (GLD ETF)&lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;results---covariance-matrix-forecasting&quot;&gt;Results - Covariance matrix forecasting&lt;/h3&gt;

&lt;p&gt;Results over the period 31st January 2008 - 31st July 2023&lt;sup id=&quot;fnref:22&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:22&quot; class=&quot;footnote&quot;&gt;16&lt;/a&gt;&lt;/sup&gt; for covariance matrices are the following&lt;sup id=&quot;fnref:23&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;17&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;table&gt;
  &lt;thead&gt;
    &lt;tr&gt;
      &lt;th&gt;Covariance matrix model&lt;/th&gt;
      &lt;th&gt;Covariance matrix MSE&lt;/th&gt;
    &lt;/tr&gt;
  &lt;/thead&gt;
  &lt;tbody&gt;
    &lt;tr&gt;
      &lt;td&gt;SMA, window size of all the previous months (historical average model)&lt;/td&gt;
      &lt;td&gt;$9.59 \times 10^{-6}$&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;SMA, window size of the previous year&lt;/td&gt;
      &lt;td&gt;$9.08 \times 10^{-6}$&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;&lt;strong&gt;Average Oracle, optimal&lt;sup id=&quot;fnref:24&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:24&quot; class=&quot;footnote&quot;&gt;18&lt;/a&gt;&lt;/sup&gt; $h_{in}$&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;$6.77 \times 10^{-6}$&lt;/strong&gt;&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;EWMA, optimal&lt;sup id=&quot;fnref:24:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:24&quot; class=&quot;footnote&quot;&gt;18&lt;/a&gt;&lt;/sup&gt; $\lambda$&lt;/td&gt;
      &lt;td&gt;$6.52 \times 10^{-6}$&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;IEWMA, optimal&lt;sup id=&quot;fnref:24:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:24&quot; class=&quot;footnote&quot;&gt;18&lt;/a&gt;&lt;/sup&gt; $\left(\lambda_{vol},\lambda_{cor}\right)$&lt;/td&gt;
      &lt;td&gt;$6.16 \times 10^{-6}$&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;SMA, window size of the previous month (random walk model)&lt;/td&gt;
      &lt;td&gt;$6.06 \times 10^{-6}$&lt;/td&gt;
    &lt;/tr&gt;
  &lt;/tbody&gt;
&lt;/table&gt;

&lt;p&gt;Within this specific evaluation framework, the Average Oracle covariance matrix forecasting model&lt;sup id=&quot;fnref:14&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:14&quot; class=&quot;footnote&quot;&gt;19&lt;/a&gt;&lt;/sup&gt; unfortunately does not seem to exhibit improved performances versus much simpler models, like the EWMA covariance matrix forecasting model.&lt;/p&gt;

&lt;h3 id=&quot;results---correlation-matrix-forecasting&quot;&gt;Results - Correlation matrix forecasting&lt;/h3&gt;

&lt;p&gt;Results over the period 31st January 2008 - 31st July 2023&lt;sup id=&quot;fnref:22:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:22&quot; class=&quot;footnote&quot;&gt;16&lt;/a&gt;&lt;/sup&gt; for the correlation matrices associated to the covariance matrices of the previous sub-section are the following&lt;sup id=&quot;fnref:23:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;17&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;table&gt;
  &lt;thead&gt;
    &lt;tr&gt;
      &lt;th&gt;Covariance matrix model&lt;/th&gt;
      &lt;th&gt;Correlation matrix MSE&lt;/th&gt;
    &lt;/tr&gt;
  &lt;/thead&gt;
  &lt;tbody&gt;
    &lt;tr&gt;
      &lt;td&gt;SMA, window size of the previous month (random walk model)&lt;/td&gt;
      &lt;td&gt;8.19&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;SMA, window size of all the previous months (historical average model)&lt;/td&gt;
      &lt;td&gt;8.10&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;&lt;strong&gt;Average Oracle, optimal&lt;sup id=&quot;fnref:24:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:24&quot; class=&quot;footnote&quot;&gt;18&lt;/a&gt;&lt;/sup&gt; $h_{in}$&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;6.59&lt;/strong&gt;&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;SMA, window size of the previous year&lt;/td&gt;
      &lt;td&gt;6.50&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;EWMA, optimal&lt;sup id=&quot;fnref:24:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:24&quot; class=&quot;footnote&quot;&gt;18&lt;/a&gt;&lt;/sup&gt; $\lambda$&lt;/td&gt;
      &lt;td&gt;5.87&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;IEWMA, optimal&lt;sup id=&quot;fnref:24:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:24&quot; class=&quot;footnote&quot;&gt;18&lt;/a&gt;&lt;/sup&gt; $\left(\lambda_{vol},\lambda_{cor}\right)$&lt;/td&gt;
      &lt;td&gt;5.70&lt;/td&gt;
    &lt;/tr&gt;
  &lt;/tbody&gt;
&lt;/table&gt;

&lt;p&gt;Here again, the Average Oracle model&lt;sup id=&quot;fnref:14:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:14&quot; class=&quot;footnote&quot;&gt;19&lt;/a&gt;&lt;/sup&gt; does not seem to particularly shine versus simpler models…&lt;/p&gt;

&lt;h3 id=&quot;comments&quot;&gt;Comments&lt;/h3&gt;

&lt;p&gt;Results from the previous sub-sections seem contradictory to those obtained in Bongiorno et al.&lt;sup id=&quot;fnref:1:19&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;However, as noted in Tan and Zohren&lt;sup id=&quot;fnref:4:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;, &lt;em&gt;this is probably due to the fact that at relatively small dimensions, a covariance estimator may benefit more from picking up more recent time series variations&lt;/em&gt;&lt;sup id=&quot;fnref:4:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; like the EWMA and IEWMA covariance estimators.&lt;/p&gt;

&lt;p&gt;In other words, the Average Oracle is certainly a good choice &lt;em&gt;when the dimension of the problem becomes very large&lt;/em&gt;&lt;sup id=&quot;fnref:2:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; but is probably not competitive otherwise.&lt;/p&gt;

&lt;h2 id=&quot;conclusion&quot;&gt;Conclusion&lt;/h2&gt;

&lt;p&gt;The Average Oracle is a covariance and correlation matrix forecasting method very different in spirit from the moving average-based methods already described in the previous posts of this series.&lt;/p&gt;

&lt;p&gt;Unfortunately, the empirical performances of the Average Oracle method in terms of covariance and correlation matrix forecasting do not seem to improve over the much simpler EWMA and IEWMA methods, at least under the specific asset allocation context described in the previous section.&lt;/p&gt;

&lt;p&gt;Note, though, that this conclusion might be different with other Oracle-based covariance matrix forecasting methods, like those described in Tan and Zohren&lt;sup id=&quot;fnref:4:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; or in Reigneron et al.&lt;sup id=&quot;fnref:13:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:13&quot; class=&quot;footnote&quot;&gt;14&lt;/a&gt;&lt;/sup&gt;…&lt;/p&gt;

&lt;p&gt;Anyway, feel free to &lt;a href=&quot;https://www.linkedin.com/in/roman-rubsamen/&quot;&gt;connect with me on LinkedIn&lt;/a&gt; or to &lt;a href=&quot;https://twitter.com/portfoliooptim&quot;&gt;follow me on Twitter&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;–&lt;/p&gt;

&lt;div class=&quot;footnotes&quot; role=&quot;doc-endnotes&quot;&gt;
  &lt;ol&gt;
    &lt;li id=&quot;fn:4&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.pm-research.com/content/iijpormgmt/51/4/83&quot;&gt;Vincent Tan, Stefan Zohren, Estimation of Large Financial Covariances: A Cross-Validation Approach, The Journal of Portfolio Management  February 2025, 51 (4) 83-95&lt;/a&gt;. &lt;a href=&quot;#fnref:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:4:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:4:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:4:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:1&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://iopscience.iop.org/article/10.1088/1742-5468/acb7ed/meta&quot;&gt;Bongiorno, C., Challet, D. and Loeper, G., Filtering time-dependent covariance matrices using time-independent eigenvalues. J. Stat. Mech.: Theory and Experiment, 2023, 2, 023402&lt;/a&gt;. &lt;a href=&quot;#fnref:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:1:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;10&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;11&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:11&quot; class=&quot;reversefootnote&quot; 
role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;12&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:12&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;13&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:13&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;14&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:14&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;15&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:15&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;16&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:16&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;17&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:17&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;18&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:18&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;19&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:19&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;20&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:2&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://ieeexplore.ieee.org/document/7587390&quot;&gt;J. Bun, R. Allez, J. -P. Bouchaud and M. Potters, Rotational Invariant Estimator for General Noisy Matrices, IEEE Transactions on Information Theory, vol. 62, no. 12, pp. 7475-7490, Dec. 2016&lt;/a&gt;. &lt;a href=&quot;#fnref:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:2:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:6&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://doi.org/10.1016/j.jbankfin.2022.106426&quot;&gt;Gianluca De Nard, Robert F. Engle, Olivier Ledoit, Michael Wolf, Large dynamic covariance matrices: Enhancements based on intraday data, Journal of Banking &amp;amp; Finance, Volume 138, 2022, 106426&lt;/a&gt;. &lt;a href=&quot;#fnref:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:6:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:7&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;For lack of a better term. &lt;a href=&quot;#fnref:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:8&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://link.springer.com/chapter/10.1007/978-3-540-71297-8_36&quot;&gt;Patton, A.J., Sheppard, K. (2009). Evaluating Volatility and Correlation Forecasts. In: Mikosch, T., Kreiß, JP., Davis, R., Andersen, T. (eds) Handbook of Financial Time Series. Springer, Berlin, Heidelberg&lt;/a&gt;. &lt;a href=&quot;#fnref:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:9&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://doi.org/10.1016/j.physa.2024.130225&quot;&gt;Christian Bongiorno and Lamia Lamrani, Quantifying the information lost in optimal covariance matrix cleaning, Physica A: Statistical Mechanics and its Applications, 657, 130225, 2025&lt;/a&gt;. &lt;a href=&quot;#fnref:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:9:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:9:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:15&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://journals.aps.org/prl/abstract/10.1103/PhysRevLett.83.1467&quot;&gt;Laurent Laloux, Pierre Cizeau, Jean-Philippe Bouchaud, and Marc Potters, Noise Dressing of Financial Correlation Matrices, Phys. Rev. Lett. 83, 1467&lt;/a&gt;. &lt;a href=&quot;#fnref:15&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:10&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;At least directly; by varying $h_{next}$, it is possible - but cumbersome - to compute the individual covariance matrices $\hat{\Sigma}_{T+1}$, $…$, $\hat{\Sigma}_{T+h_{next}}$ and deduce the averaged correlation matrix $\hat{C}_{T+1:T+h_{next}}$. &lt;a href=&quot;#fnref:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:3&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.tandfonline.com/doi/pdf/10.1080/14697688.2024.2372053&quot;&gt;Bongiorno, C., &amp;amp; Challet, D. (2024). Covariance matrix filtering and portfolio optimisation: the average oracle vs non-linear shrinkage and all the variants of DCC-NLS. Quantitative Finance, 24(9), 1227–1234&lt;/a&gt;. &lt;a href=&quot;#fnref:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:3:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:11&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;And interestingly, not so much on the value of $h_{next}$. &lt;a href=&quot;#fnref:11&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:12&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;In both cases, the random selection of 10 000 time periods also incorporates a random selection of U.S. stocks within each time window. &lt;a href=&quot;#fnref:12&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:5&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://journals.aps.org/pre/abstract/10.1103/PhysRevE.98.052145&quot;&gt;Bun, J., Bouchaud, J. P., &amp;amp; Potters, M., Overlaps between eigenvectors of correlated random matrices. Physical Review E, 98(5), 052145 (2018)&lt;/a&gt;. &lt;a href=&quot;#fnref:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:5:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:5:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:5:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:13&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.pm-research.com/content/iijpormgmt/46/4/22&quot;&gt;Pierre-Alain Reigneron, Vincent Nguyen, Stefano Ciliberti, Philip Seager, Jean-Philippe Bouchaud, Agnostic Allocation Portfolios: A Sweet Spot in the Risk-Based Jungle?, The Journal of Portfolio Management  March 2020, 46 (4) 22-38&lt;/a&gt;. &lt;a href=&quot;#fnref:13&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:13:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:17&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;These ETFs are used in the &lt;em&gt;Adaptive Asset Allocation&lt;/em&gt; strategy from &lt;a href=&quot;https://investresolve.com/&quot;&gt;ReSolve Asset Management&lt;/a&gt;, described in the paper &lt;em&gt;Adaptive Asset Allocation: A Primer&lt;/em&gt;&lt;sup id=&quot;fnref:18&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:18&quot; class=&quot;footnote&quot;&gt;20&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:17&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:22&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;(Adjusted) daily prices have been retrieved using &lt;a href=&quot;https://api.tiingo.com/&quot;&gt;Tiingo&lt;/a&gt;. &lt;a href=&quot;#fnref:22&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:22:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:23&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Using the outer product of asset returns - assuming a mean return of 0 - as covariance proxy, and using an expanding historical window of asset returns. &lt;a href=&quot;#fnref:23&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:23:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:24&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;As determined by &lt;strong&gt;Portfolio Optimizer&lt;/strong&gt; at the end of every month using all the available asset returns history up to that point in time; thus, there is no look-ahead bias. &lt;a href=&quot;#fnref:24&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:24:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:24:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:24:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:24:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:24:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:14&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Results using a couple of fixed values for $h_{in}$ (21,…) are worse and so are not presented here. &lt;a href=&quot;#fnref:14&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:14:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:18&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2328254&quot;&gt;Butler, Adam and Philbrick, Mike and Gordillo, Rodrigo and Varadi, David, Adaptive Asset Allocation: A Primer&lt;/a&gt;. &lt;a href=&quot;#fnref:18&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
  &lt;/ol&gt;
&lt;/div&gt;</content><author><name>Roman R.</name></author><category term="covariance matrix" /><summary type="html">Continuing this series on covariance matrix forecasting (c.f. here and there for the previous posts), I will now describe a relatively recent1 data-driven, model-free, way to [forecast] covariance [and correlation] matrices of time-varying systems2 rooted in random matrix theory. This method - introduced in Bongiorno et al.2 and called Average Oracle - consists in replacing the eigenvalues of a (noisy) estimate of a time-varying covariance matrix by time-independent eigenvalues that encode the average influence of the future on present eigenvalues2. In this blog post, I will describe that method and, as now usual in this series, I will illustrate its empirical performances in the context of monthly covariance matrix forecasting for a multi-asset class ETF portfolio. Mathematical preliminaries Some of these sub-sections contain reminders from a previous blog post. Dynamic covariance and correlation matrices Let $n$ be the number of assets in a universe of assets and $r_t \in \mathbb{R}^n$ be the vector of the (logarithmic) return process of these assets over a time period $t$ (a day, a week, a month…) over which their mean return vector $\mu_t \in \mathbb{R}^n$ is assumed to be null. Then: The asset covariance matrix $\Sigma_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ over the time period $t$ is defined as $\Sigma_t = \mathbb{E} \left[ r_t r_t {}^t \right]$. That matrix is called3 the population (or true) covariance matrix of the asset returns over the time period $t$. The asset correlation matrix $C_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ over the time period $t$ is defined as the correlation matrix $ C_t = V_t^{-1} \Sigma_t V_t^{-1} $ associated to the covariance matrix $\Sigma_t$, where $V_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ is the diagonal matrix of the asset standard deviations. Let there now be $T$ time periods $t = 1..T$. 
Then: The averaged4 covariance matrix $\Sigma_{1:T} \in \mathcal{M}(\mathbb{R}^{n \times n})$ over the $T$ time periods $t = 1..T$ is defined as $ \Sigma_{1:T} = \frac{1}{T} \sum_{t=1}^{T} \Sigma_{t} $. In case the return process is time-invariant, the averaged covariance matrix $\Sigma_{1:T}$ is equal to the (constant) population covariance matrix $\left( \Sigma = \right) \Sigma_t, t = 1..T$. The averaged4 correlation matrix $C_{1:T} \in \mathcal{M}(\mathbb{R}^{n \times n})$ over the $T$ time periods $t = 1..T$ is defined as $ C_{1:T} = \frac{1}{T} \sum_{t=1}^{T} C_{t} $. In case the return process is time-invariant, the averaged correlation matrix $C_{1:T}$ is equal to the (constant) correlation matrix $\left( C = \right) C_t, t = 1..T$. The pseudo-averaged5 correlation matrix $C_{p, 1:T} \in \mathcal{M}(\mathbb{R}^{n \times n})$ over the $T$ time periods $t = 1..T$ is defined as the correlation matrix associated to the averaged covariance matrix $\Sigma_{1:T}$. In case the return process is time-invariant, the pseudo-averaged correlation matrix $C_{p, 1:T} $ is equal to the averaged correlation matrix $C_{1:T}$, but in general, due to time-varying asset standard deviations, the two correlation matrices are different. Dynamic covariance and correlation matrices sample estimators In practice, the asset return process $r_t$ is usually not known and the only available information is the vectors of realized asset returns $ \tilde{r}_1,…, \tilde{r}_T \in \mathbb{R}^n$ for $T$ time periods. From each of these vectors: The classical way to estimate the covariances is to compute the empirical (or sample) covariance matrix thanks to the Pearson estimator3 defined as $\tilde{\Sigma}_t = \tilde{r}_t \tilde{r}_t {}^t $ over each time period $t = 1..T$. 
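To make the estimators above concrete, here is a minimal Python sketch (numpy assumed available; the returns are simulated toy data, not taken from this post): it builds the per-period covariance proxies as outer products of realized returns and averages them into the averaged Pearson estimator.

```python
import numpy as np

rng = np.random.default_rng(0)
n, T = 4, 60  # n assets, T time periods
returns = rng.standard_normal((T, n)) * 0.01  # simulated realized returns

# Per-period covariance proxy: outer product of realized returns (mean assumed null)
proxies = np.array([np.outer(r, r) for r in returns])

# Averaged Pearson estimator over the T time periods
sigma_avg = proxies.mean(axis=0)

# Each one-period proxy has rank 1, hence is a very noisy estimate on its own
print(np.linalg.matrix_rank(proxies[0]))  # 1
print(sigma_avg.shape)  # (4, 4)
```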
Here, the outer product of the realized asset returns $ \tilde{r}_t \tilde{r}_t {}^t $ over the time period $t$ is called a covariance estimate $\tilde{\Sigma}_t$ - or covariance proxy6 - for the (unobserved) asset returns covariance matrix over that time period. The empirical (or sample) correlation matrix over each time period $t = 1..T$ is defined as the correlation matrix $ \tilde{C}_t $ associated to the covariance matrix $\tilde{\Sigma}_t$. The empirical averaged covariance matrix over the $T$ time periods $t = 1..T$ is typically estimated from the realized asset returns $ \tilde{r}_1,…, \tilde{r}_T \in \mathbb{R}^n$ thanks to the averaged Pearson estimator defined as $\tilde{\Sigma}_{1:T} = \frac{1}{T} \sum_{t=1}^{T} \tilde{r}_t \tilde{r}_t {}^t $. The empirical averaged correlation matrix over the $T$ time periods $t = 1..T$ is defined as $ \tilde{C}_{1:T} = \frac{1}{T} \sum_{t=1}^{T} \tilde{C}_{t} $. The empirical pseudo-averaged correlation matrix over the $T$ time periods $t = 1..T$ is defined as the correlation matrix $ \tilde{C}_{p, 1:T}$ associated to the empirical averaged covariance matrix $\tilde{\Sigma}_{1:T}$. Covariance and correlation matrices rotationally invariant estimators Rotationally invariant estimators As already mentioned in a previous blog post on correlation matrices denoising, the estimation of empirical covariance and correlation matrices in finance is affected by noise, in the form of measurement error, due in part to the short length of the time series of asset returns typically used in their computation. Indeed: Constructing a well-diversified portfolio requires many assets7, that is, a big $n$. In contrast, rapid shifts in financial market dependencies can only be captured by short calibration windows7 for estimating asset correlations, that is, a small $T$. 
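The difference between the averaged and the pseudo-averaged correlation matrices can also be illustrated numerically; in this toy Python sketch (numpy assumed, hypothetical numbers), the two periods share the same correlation but have different volatilities, so the two notions disagree:

```python
import numpy as np

def corr_from_cov(sigma):
    # Correlation matrix associated to a covariance matrix: C = V^-1 Sigma V^-1
    v = np.sqrt(np.diag(sigma))
    return sigma / np.outer(v, v)

# Two periods with identical correlation (0.5) but time-varying volatilities
c = np.array([[1.0, 0.5], [0.5, 1.0]])
sigma1 = np.diag([1.0, 1.0]) @ c @ np.diag([1.0, 1.0])
sigma2 = np.diag([2.0, 0.5]) @ c @ np.diag([2.0, 0.5])

# Averaged correlation matrix: mean of the per-period correlation matrices
c_avg = (corr_from_cov(sigma1) + corr_from_cov(sigma2)) / 2
# Pseudo-averaged correlation matrix: correlation of the averaged covariance matrix
c_pseudo = corr_from_cov((sigma1 + sigma2) / 2)

print(c_avg[0, 1])     # 0.5
print(c_pseudo[0, 1])  # about 0.4: different as soon as volatilities vary over time
```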
This situation leads to an aspect ratio $q = \frac{n}{T}$ of empirical matrices either close to 1 - or even worse, much greater than 1 -, which is catastrophic from an estimation perspective, c.f. the above blog post and references therein. Fortunately, numerous techniques have been developed to improve the estimation of noisy covariance [or correlation] matrices7. Several of these techniques, like the eigenvalue clipping method8, involve a specific class of matrix estimators - known as Rotationally Invariant Estimators (RIE) or Orthogonally Invariant Estimators (OIE) - that leaves the eigenvectors of the empirical matrices untouched while altering their eigenvalues. In other words, an RIE estimator $\Xi \left( \tilde{\Sigma}_t \right) \in \mathcal{M}(\mathbb{R}^{n \times n})$ of a true covariance or correlation matrix $\Sigma_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ obtained from its empirical counterpart $\tilde{\Sigma}_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ has the general form \[\Xi \left( \tilde{\Sigma}_t \right) = \tilde{V}_t \Lambda_t \tilde{V}_t {}^t\], where: $\Lambda_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ is a diagonal matrix of well-chosen2 eigenvalues. $\tilde{V}_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ is the matrix of eigenvectors of $\tilde{\Sigma}_t$, defined through the spectral decomposition $ \tilde{\Sigma}_t = \tilde{V}_t \tilde{\Lambda}_t \tilde{V}_t {}^t $ with $ \tilde{\Lambda}_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ the diagonal matrix of eigenvalues of $\tilde{\Sigma}_t$. Bun et al.3 explains the underlying rationale as follows: The true matrix [$\Sigma_t$] is unknown and we do not have any particular insights on its components (the eigenvectors). Therefore we would like our estimator [$\Xi \left( \tilde{\Sigma}_t \right)$] to be constructed in a rotationally invariant way from the noisy observation [$\tilde{\Sigma}_t$] that we have. 
In simple terms, this means that there is no privileged direction in the $n$-dimensional space that would allow one to bias the eigenvectors of the estimator [$\Xi \left( \tilde{\Sigma}_t \right)$] in some special directions. More formally, the estimator construction must obey: $ \Omega \Xi \left( \tilde{\Sigma}_t \right) \Omega {}^t$ $=$ $\Xi \left( \Omega \tilde{\Sigma}_t \Omega {}^t \right)$ for any rotation matrix $\Omega \in \mathcal{M}(\mathbb{R}^{n \times n})$. Any estimator satisfying [that equation] will be referred to as a Rotational Invariant Estimator (RIE). In this case, it turns out that the eigenvectors of the estimator [$\Xi \left( \tilde{\Sigma}_t \right)$] have to be the same as those of the noisy matrix [$\tilde{\Sigma}_t$]. Rotationally invariant oracle estimator Bun et al.3 shows that the optimal RIE estimator of the unknown matrix $\Sigma_t$ in terms of Frobenius norm is the RIE estimator whose diagonal matrix of eigenvalues $\Lambda_O$ satisfies \[\Lambda_O = \text{diag} \left( \tilde{V}_t {}^t \Sigma_t \tilde{V}_t \right)\] That estimator is sometimes called the oracle estimator because it depends explicitly on the knowledge of the true signal [$\Sigma_t$]3 and so is not directly usable in practice. Remarkably, [though], asymptotically optimal RIEs that converge to the oracle estimator can be obtained without the knowledge of the true covariance; however, such estimators require that: i) the ground truth does not change, ii) the data matrix is very large, and iii) the data has at least finite fourth moments2. Those conditions, and especially the first one, are definitely not satisfied by asset returns, which leads to suboptimal estimators in practice… As a side note, and maybe contrary to intuition, the optimal eigenvalues $\Lambda_O$ are NOT equal to the eigenvalues of $\Sigma_t$, because this would result in a spectrum that is too wide3.
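As an illustration of the oracle formula, the following Python sketch (numpy assumed; the "true" matrix is simulated, which is precisely what makes the oracle unusable on real data) keeps the empirical eigenvectors and replaces the empirical eigenvalues by the oracle ones:

```python
import numpy as np

rng = np.random.default_rng(1)
n, T = 5, 20

# A simulated "true" covariance matrix and a noisy empirical counterpart
a = rng.standard_normal((n, 2 * n))
sigma_true = a @ a.T / (2 * n)
r = rng.multivariate_normal(np.zeros(n), sigma_true, size=T)
sigma_emp = r.T @ r / T

# Eigenvectors of the empirical matrix, left untouched by any RIE
_, v_emp = np.linalg.eigh(sigma_emp)

# Oracle eigenvalues: diagonal of V~^t Sigma_true V~
lambda_oracle = np.diag(v_emp.T @ sigma_true @ v_emp)

# Oracle RIE: same eigenvectors as the empirical matrix, oracle eigenvalues
xi = v_emp @ np.diag(lambda_oracle) @ v_emp.T

# The oracle RIE is at least as close to the truth (Frobenius norm) as the
# empirical matrix, which is itself an RIE with unmodified eigenvalues
print(np.linalg.norm(sigma_emp - sigma_true) >= np.linalg.norm(xi - sigma_true))  # True
```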
The Average Oracle covariance matrix forecasting method Forecasting formulas Let: $n$ be the number of assets in a universe of assets; $\tilde{r}_t \tilde{r}_t {}^t$, $t=1..T$ be the outer products of the observed asset returns over each of $T$ past periods; $1 \leq h_{in} \ll T$ be a chosen number of past periods; and $\mathcal{I}_{cal} = [t_{cal}, T - h_{next}]$ be a long calibration window, with $t_{cal}$ chosen so that $t_{cal} \geq h_{in} - 1$ and $| \mathcal{I}_{cal} | \gg h_{in}$. Asset returns averaged covariance matrix The Average Oracle covariance matrix forecasting model estimates the asset returns averaged covariance matrix $\hat{\Sigma}_{T+1:T+h_{next}}$ over the next $h_{next} \geq 1$ periods as follows2: Choose a number of random time periods $n_B \geq 1$ to generate. For $b = 1..n_B$ do Select uniformly at random with replacement a time period $t^{(b)} \in \mathcal{I}_{cal}$. Compute the “past” averaged asset returns covariance matrix $ \tilde{\Sigma}^{(b)}_{in} $ on the train window $\mathcal{I}^{(b)}_{in} = [t^{(b)} - h_{in} + 1, t^{(b)}]$, defined by $\tilde{\Sigma}^{(b)}_{in} = \tilde{\Sigma}_{t^{(b)} - h_{in} + 1:t^{(b)}} = \frac{1}{h_{in}} \sum_{t=t^{(b)} - h_{in} + 1}^{t^{(b)}} \tilde{r}_t \tilde{r}_t {}^t $ and its associated correlation matrix $\tilde{C}^{(b)}_{in} = \tilde{C}_{p, t^{(b)} - h_{in} + 1:t^{(b)}}$ whose spectral decomposition is given by $\tilde{C}^{(b)}_{in} = \tilde{V}^{(b)}_{in} \tilde{\Lambda}^{(b)}_{in} \tilde{V}^{(b)}_{in} {}^t $. Compute the “future” averaged asset returns covariance matrix $ \tilde{\Sigma}^{(b)}_{next} $ on the test window $\mathcal{I}^{(b)}_{next} = [t^{(b)} + 1, t^{(b)} + h_{next}]$, defined by $ \tilde{\Sigma}^{(b)}_{next} = \tilde{\Sigma}_{t^{(b)} + 1:t^{(b)} + h_{next}} = \frac{1}{h_{next}} \sum_{t=t^{(b)} + 1}^{t^{(b)} + h_{next}} \tilde{r}_t \tilde{r}_t {}^t $ and its associated correlation matrix $\tilde{C}^{(b)}_{next} = \tilde{C}_{p, t^{(b)} + 1:t^{(b)} + h_{next}}$. 
Compute the diagonal matrix of oracle eigenvalues $\tilde{\Lambda}_O^{(b)} = \text{diag} \left( \tilde{V}^{(b)}_{in} {}^t \tilde{C}^{(b)}_{next} \tilde{V}^{(b)}_{in} \right) $. Compute the diagonal matrix of Average Oracle eigenvalues $\tilde{\Lambda}_{AO} = \frac{1}{n_B} \sum_{b=1}^{n_B} \tilde{\Lambda}_O^{(b)} $. Compute the most recent “past” averaged asset returns covariance matrix $ \tilde{\Sigma}_{in} $ on the window $\mathcal{I}_{in} = [T - h_{in} + 1, T]$, defined by $ \tilde{\Sigma}_{in} = \tilde{\Sigma}_{T - h_{in} + 1:T} = \frac{1}{h_{in}} \sum_{t=T - h_{in} + 1}^{T} \tilde{r}_t \tilde{r}_t {}^t $ and its associated correlation matrix $\tilde{C}_{in} = \tilde{C}_{p, T - h_{in} + 1:T}$ whose spectral decomposition is given by $\tilde{C}_{in} = \tilde{V}_{in} \tilde{\Lambda}_{in} \tilde{V}_{in} {}^t $. Compute $\hat{\Sigma}_{T+1:T+h_{next}} = D_{in} \tilde{V}_{in} \tilde{\Lambda}_{AO} \tilde{V}_{in} {}^t D_{in} $, where $D_{in} \in \mathcal{M}(\mathbb{R}^{n \times n})$ is the diagonal matrix of the standard deviations $\sqrt{ \left( \tilde{\Sigma}_{in} \right)_{ii} }, i=1..n$. For more visual clarity, Figure 1 illustrates that process. Figure 1. Illustration of the Average Oracle eigenvalues computation process, which uses different windows included in a long calibration window. Source: Adapted from Bongiorno et al. Asset returns averaged and pseudo-averaged correlation matrix The Average Oracle covariance matrix forecasting model does not easily9 allow one to estimate the asset returns averaged correlation matrix $\hat{C}_{T+1:T+h_{next}}$, because it does not rely on the estimation of the individual covariance matrices $\hat{\Sigma}_{T+1}$, $…$, $\hat{\Sigma}_{T+h_{next}}$. The asset returns pseudo-averaged correlation matrix $\hat{C}_{p, T+1:T+h_{next}}$ over the next $h_{next}$ periods, though, corresponds to the correlation matrix associated to the averaged covariance matrix $\hat{\Sigma}_{T+1:T+h_{next}}$. 
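The forecasting procedure above can be condensed into a short Python sketch (numpy assumed; the simulated returns, the 0-based window bookkeeping and the final sorting of the averaged eigenvalues are implementation choices of this illustration, not prescriptions of the original paper):

```python
import numpy as np

def corr_from_cov(sigma):
    # Correlation matrix associated to a covariance matrix
    v = np.sqrt(np.diag(sigma))
    return sigma / np.outer(v, v)

def average_oracle_forecast(returns, h_in, h_next, n_b, seed=0):
    # Average Oracle forecast of the averaged covariance matrix over the
    # next h_next periods, from a (T, n) array of realized asset returns
    rng = np.random.default_rng(seed)
    T, n = returns.shape
    # Per-period covariance proxies: outer products of the realized returns
    proxies = np.einsum("ti,tj->tij", returns, returns)

    def avg_cov(start, end):
        # Averaged Pearson estimator on the periods start..end (inclusive)
        return proxies[start:end + 1].mean(axis=0)

    # Calibration window for the last index t of each train window
    lo, hi = h_in - 1, T - h_next - 1
    lambda_ao = np.zeros(n)
    for _ in range(n_b):
        t = rng.integers(lo, hi + 1)
        # "Past" averaged correlation matrix and its eigenvectors; eigh
        # returns eigenvalues in ascending order, a fixed ordering convention
        _, v_in = np.linalg.eigh(corr_from_cov(avg_cov(t - h_in + 1, t)))
        # "Future" averaged (pseudo-)correlation matrix
        c_next = corr_from_cov(avg_cov(t + 1, t + h_next))
        # Oracle eigenvalues for this random draw
        lambda_ao += np.diag(v_in.T @ c_next @ v_in)
    lambda_ao /= n_b
    # Re-sort the averaged eigenvalues, one of the fixes for an ordering
    # broken by finite-sample noise
    lambda_ao = np.sort(lambda_ao)

    # Most recent "past" window and final forecast
    sigma_in = avg_cov(T - h_in, T - 1)
    _, v_in = np.linalg.eigh(corr_from_cov(sigma_in))
    d_in = np.diag(np.sqrt(np.diag(sigma_in)))
    return d_in @ v_in @ np.diag(lambda_ao) @ v_in.T @ d_in

rng = np.random.default_rng(42)
returns = rng.standard_normal((500, 4)) * 0.01  # toy data
forecast = average_oracle_forecast(returns, h_in=60, h_next=20, n_b=200)
print(forecast.shape)  # (4, 4)
```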
Rationale The Average Oracle covariance matrix forecasting method captures the average transition from two consecutive time windows2 by averaging [oracle eigenvalues], rank-wise, over many randomly selected consecutive intervals taken from a long calibration window2. That covariance matrix forecasting method thus tackles the evolution of [asset returns] dependencies with a time-invariant eigenvalue cleaning scheme2. As noted in Bongiorno et al.2: This is a zeroth order approximation, as the fluctuations of the optimal eigenvalue matrix around $\tilde{\Lambda}_{AO}$ sometimes most probably contain valuable additional information (as may do those of the eigenvectors). Nevertheless, this approximation is a powerful filtering tool and is easily computed from data without any modeling assumptions about the underlying system. Performances From a practical perspective, Bongiorno et al.2 and Bongiorno and Challet10 empirically demonstrate that the Average Oracle covariance matrix forecasting method outperforms the current state-of-the-art (and complex) method, Dynamic Conditional Correlation coupled to Non-Linear Shrinkage (DCC+NLS)10, both in terms of Frobenius distance - as highlighted in Figure 2 - and in terms of four key portfolio metrics: Sharpe ratio, turnover, gross leverage, and diversification10. Figure 2. Average Frobenius distance between the forecasted and the out-of-sample covariance matrices of n = 100 U.S. stocks as a function of the number of past periods $h_{in}$. Source: Adapted from Bongiorno et al. These performances are commented on as follows in Bongiorno et al.2: The fact that the Average Oracle is a better estimator for time-evolving covariance matrices most often implies that the most recent information contained in the sample eigenvalues is less relevant (and more noisy) than the AO ones that focus on the average transition. 
Thus, the advantage of the Average Oracle is precisely that it captures some part of the average dynamics that is discarded by the assumption of a constant true covariance matrix [made in the DCC+NLS method]. Implementation details How to choose the number of past periods $h_{in}$? Through extensive simulations, Bongiorno et al.2 concludes that the Average Oracle eigenvalues $\tilde{\Lambda}_{AO}$ mainly depend on the number of assets $n$ and11 on the number of past periods $h_{in}$ over which to compute the averaged asset returns covariance matrix. A natural question - unfortunately neither answered in Bongiorno et al.2 nor in Bongiorno and Challet10 - is then how to choose the value of $h_{in}$ in order to obtain the best forecasting performances. One possible answer, which relies on the interpretation of the Average Oracle covariance matrix forecasting method as a covariance matrix cleaning scheme2, is to select $h_{in}$ so as to maximize the forecasting performances of a simple moving average covariance matrix forecasting model with a window size equal to $h_{in}$ for the considered value of $h_{next}$. How to choose the number of random time periods $n_B$? The number of random time periods $n_B$ must be high enough to ensure that the Average Oracle eigenvalues are stable enough - in particular when $h_{next}$ is small - because by reducing [$h_{next}$], the estimation becomes noisier and thus requires more train and test windows […] to yield average eigenvalues with the same level of precision2. Two examples12: Bongiorno et al.2 uses $n_B = 10 000$ together with $h_{in} = h_{next} = 252$. Bongiorno and Challet10 uses $n_B = 10 000$ together with $h_{in} \in \lbrace 240, 1200 \rbrace$ and $h_{next} \in \lbrace 5, 20 \rbrace$. 
To be noted, though, that depending on the length of the calibration window $\mathcal{I}_{cal}$ and/or on the number of assets $n$, the time periods do not need to be generated at random - they can perfectly well be generated deterministically so as to cover the whole calibration window. How to enforce a proper ordering of the Average Oracle eigenvalues? Bongiorno et al.2 stresses that the columns of the eigenvectors [$\tilde{V}^{(b)}_{in}$ and $\tilde{V}_{in}$] must always follow the same eigenvalue ordering convention2. Nevertheless, despite enforcing such a convention, the order of the resulting Average Oracle eigenvalues $\tilde{\Lambda}_{AO}$ is not necessarily preserved due to the finite size of the sample13. This may be an unwanted feature within a rotational invariant assumption13, since there is no reason a priori to expect that it is optimal to modify the order of the eigenvalues, that is to say, the variance associated with the principal components13. Bun et al.13 proposes two solutions to this problem: Sort the resulting eigenvalues Perform an isotonic regression on the resulting eigenvalues In the context of the cross-validated eigenvalues cleaning scheme described in Reigneron et al.14, the impact of using an isotonic regression is depicted in Figure 3. Figure 3. Raw, cross-validated and isotonic eigenvalues as a function of in-sample eigenvalues. Source: Reigneron et al. Implementation in Portfolio Optimizer Portfolio Optimizer implements the Average Oracle covariance and correlation matrix forecasting models through the endpoints /assets/covariance/matrix/forecast/average-oracle and /assets/correlation/matrix/forecast/average-oracle. These endpoints support the two covariance proxies below: Squared (close-to-close) returns Demeaned squared (close-to-close) returns These endpoints also: Implement an isotonic regression correction step for the Average Oracle eigenvalues. 
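As a sketch of such an isotonic regression correction, here is a hand-rolled pool-adjacent-violators pass in Python (numpy assumed; scikit-learn's IsotonicRegression would be a production alternative), applied to hypothetical, noise-disordered eigenvalues:

```python
import numpy as np

def isotonic_pava(y):
    # Pool Adjacent Violators: nearest (least squares) non-decreasing sequence
    levels = []  # blocks of [pooled value, weight]
    for v in np.asarray(y, dtype=float):
        levels.append([v, 1.0])
        # Merge adjacent blocks while the non-decreasing order is violated
        while len(levels) > 1 and levels[-2][0] > levels[-1][0]:
            v2, w2 = levels.pop()
            v1, w1 = levels.pop()
            levels.append([(v1 * w1 + v2 * w2) / (w1 + w2), w1 + w2])
    out = []
    for v, w in levels:
        out.extend([v] * int(w))
    return np.array(out)

# Hypothetical Average Oracle eigenvalues whose order was broken by noise
raw = np.array([0.2, 0.5, 0.4, 1.1, 2.8])
print(isotonic_pava(raw))  # non-decreasing: 0.2, 0.45, 0.45, 1.1, 2.8
```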
Allow to automatically determine the number of past periods $h_{in}$, using a proprietary procedure. Allow to either generate a given number of time periods $n_B$ uniformly at random within the calibration window $\mathcal{I}_{cal}$ or use all the time periods available within that window. Example of usage - Covariance matrix forecasting at monthly level for a portfolio of various ETFs As an example of usage, I propose to evaluate the empirical performances of the Average Oracle covariance matrix forecasting model within the framework of the previous blog post, whose aim is to forecast monthly covariance and correlation matrices for a portfolio of 10 ETFs representative15 of various asset classes: U.S. stocks (SPY ETF) European stocks (EZU ETF) Japanese stocks (EWJ ETF) Emerging markets stocks (EEM ETF) U.S. REITs (VNQ ETF) International REITs (RWX ETF) U.S. 7-10 year Treasuries (IEF ETF) U.S. 20+ year Treasuries (TLT ETF) Commodities (DBC ETF) Gold (GLD ETF) Results - Covariance matrix forecasting Results over the period 31st January 2008 - 31st July 202316 for covariance matrices are the following17: Covariance matrix model Covariance matrix MSE SMA, window size of all the previous months (historical average model) 9.59 $10^{-6}$ SMA, window size of the previous year 9.08 $10^{-6}$ Average Oracle, optimal18 $h_{in}$ 6.77 $10^{-6}$ EWMA, optimal18 $\lambda$ 6.52 $10^{-6}$ IEWMA, optimal18 $\left(\lambda_{vol},\lambda_{cor}\right)$ 6.16 $10^{-6}$ SMA, window size of the previous month (random walk model) 6.06 $10^{-6}$ Within this specific evaluation framework, the Average Oracle covariance matrix forecasting model19 unfortunately does not seem to exhibit improved performances vs. much simpler models, like the EWMA covariance matrix forecasting model. 
Results - Correlation matrix forecasting Results over the period 31st January 2008 - 31st July 202316 for the correlation matrices associated with the covariance matrices of the previous sub-section are the following17: Covariance matrix model Correlation matrix MSE SMA, window size of the previous month (random walk model) 8.19 SMA, window size of all the previous months (historical average model) 8.10 Average Oracle, optimal18 $h_{in}$ 6.59 SMA, window size of the previous year 6.50 EWMA, optimal18 $\lambda$ 5.87 IEWMA, optimal18 $\left(\lambda_{vol},\lambda_{cor}\right)$ 5.70 Here again, the Average Oracle model19 does not seem to particularly shine v.s. simpler models… Comments Results from the previous sub-sections seem contradictory to those obtained in Bongiorno et al.2. However, as noted in Tan and Zohren1, this is probably due to the fact that at relatively small dimensions, a covariance estimator may benefit more from picking up more recent time series variations1 like the EWMA and IEWMA covariance estimators. In other words, the Average Oracle is certainly a good choice when the dimension of the problem becomes very large3 but is probably not competitive otherwise. Conclusion The Average Oracle is a covariance and correlation matrix forecasting method very different in spirit from the moving average-based methods already described in the previous posts of this series. Unfortunately, the empirical performances of the Average Oracle method in terms of covariance and correlation matrix forecasting do not seem to improve over the much simpler EWMA and IEWMA methods, at least under the specific asset allocation context described in the previous section. To be noted, though, that this conclusion might be different with other Oracle-based covariance matrix forecasting methods, like those described in Tan and Zohren1 or in Reigneron et al.14… Anyway, feel free to connect with me on LinkedIn or to follow me on Twitter. 
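The isotonic regression correction of the Average Oracle eigenvalues mentioned earlier can be sketched with a generic pool-adjacent-violators pass - a minimal least-squares sketch of the idea, not the actual Portfolio Optimizer implementation:

```python
def isotonic_non_increasing(values):
    """Least-squares fit of the closest non-increasing sequence to `values`,
    via the pool-adjacent-violators algorithm - e.g. to restore a proper
    ordering of averaged eigenvalues without simply sorting them."""
    blocks = []  # list of [block mean, block size]
    for v in values:
        blocks.append([float(v), 1])
        # merge adjacent blocks while the sequence increases (a violation)
        while len(blocks) > 1 and blocks[-1][0] > blocks[-2][0]:
            m2, w2 = blocks.pop()
            m1, w1 = blocks.pop()
            blocks.append([(m1 * w1 + m2 * w2) / (w1 + w2), w1 + w2])
    fitted = []
    for m, w in blocks:
        fitted.extend([m] * w)
    return fitted
```

For example, a mildly disordered sequence like [3, 1, 2, 0] is fitted to [3, 1.5, 1.5, 0], whereas an already non-increasing sequence is left unchanged.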
See Vincent Tan, Stefan Zohren, Estimation of Large Financial Covariances: A Cross-Validation Approach, The Journal of Portfolio Management February 2025, 51 (4) 83-95. 
See Bongiorno, C., Challet, D. and Loeper, G., Filtering time-dependent covariance matrices using time-independent eigenvalues. J. Stat. Mech.: Theory and Experiment, 2023, 2, 023402. 
See J. Bun, R. Allez, J.-P. Bouchaud and M. Potters, Rotational Invariant Estimator for General Noisy Matrices, IEEE Transactions on Information Theory, vol. 62, no. 12, pp. 7475-7490, Dec. 2016. 
See Gianluca De Nard, Robert F. Engle, Olivier Ledoit, Michael Wolf, Large dynamic covariance matrices: Enhancements based on intraday data, Journal of Banking &amp;amp; Finance, Volume 138, 2022, 106426. 
For the lack of a better term. 
See Patton, A.J., Sheppard, K. (2009). Evaluating Volatility and Correlation Forecasts. In: Mikosch, T., Kreiß, JP., Davis, R., Andersen, T. (eds) Handbook of Financial Time Series. Springer, Berlin, Heidelberg. 
See Christian Bongiorno and Lamia Lamrani, Quantifying the information lost in optimal covariance matrix cleaning, Physica A: Statistical Mechanics and its Applications, 657, 130225, 2025. 
See Laurent Laloux, Pierre Cizeau, Jean-Philippe Bouchaud, and Marc Potters, Noise Dressing of Financial Correlation Matrices, Phys. Rev. Lett. 83, 1467. 
At least directly; by varying $h_{next}$, it is possible - but cumbersome - to compute the individual covariance matrices $\hat{\Sigma}_{T+1}$, $…$, $\hat{\Sigma}_{T+h_{next}}$ and deduce the averaged correlation matrix $\hat{C}_{T+1:T+h_{next}}$. 
See Bongiorno, C., &amp;amp; Challet, D. (2024). Covariance matrix filtering and portfolio optimisation: the average oracle vs non-linear shrinkage and all the variants of DCC-NLS. Quantitative Finance, 24(9), 1227–1234. 
And interestingly, not so much on the value of $h_{next}$. 
In both cases, the 10 000 time periods random selection also incorporates a random selection of U.S. stocks within each time window. 
See Bun, J., Bouchaud, J. P., &amp;amp; Potters, M., Overlaps between eigenvectors of correlated random matrices. Physical Review E, 98(5), 052145 (2018). 
See Pierre-Alain Reigneron, Vincent Nguyen, Stefano Ciliberti, Philip Seager, Jean-Philippe Bouchaud, Agnostic Allocation Portfolios: A Sweet Spot in the Risk-Based Jungle?, The Journal of Portfolio Management March 2020, 46 (4) 22-38. 
These ETFs are used in the Adaptive Asset Allocation strategy from ReSolve Asset Management, described in the paper Adaptive Asset Allocation: A Primer20. 
(Adjusted) daily prices have been retrieved using Tiingo. 
Using the outer product of asset returns - assuming a mean return of 0 - as covariance proxy, and using an expanding historical window of asset returns. 
As determined by Portfolio Optimizer at the end of every month using all the available asset returns history up to that point in time; thus, there is no look-ahead bias. 
Results using a couple of fixed values for $h_{in}$ (21,…) are worse and so are not presented here. 
See Butler, Adam and Philbrick, Mike and Gordillo, Rodrigo and Varadi, David, Adaptive Asset Allocation: A Primer.</summary></entry><entry><title type="html">Value at Risk: Univariate Estimation Methods</title><link href="https://portfoliooptimizer.io/blog/value-at-risk-univariate-estimation-methods/" rel="alternate" type="text/html" title="Value at Risk: Univariate Estimation Methods" /><published>2025-10-28T00:00:00-05:00</published><updated>2025-10-28T00:00:00-05:00</updated><id>https://portfoliooptimizer.io/blog/value-at-risk-univariate-estimation-methods</id><content type="html" xml:base="https://portfoliooptimizer.io/blog/value-at-risk-univariate-estimation-methods/">&lt;p&gt;Value-at-Risk (&lt;em&gt;VaR&lt;/em&gt;) is &lt;em&gt;one of the most commonly used risk measures in the financial industry&lt;/em&gt;&lt;sup id=&quot;fnref:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; in part thanks to its simplicity - because &lt;em&gt;VaR reduces the market risk associated with any portfolio to just one number&lt;/em&gt;&lt;sup id=&quot;fnref:15&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:15&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; - and in part due to regulatory 
requirements (&lt;a href=&quot;https://en.wikipedia.org/wiki/Basel_Accords&quot;&gt;Basel market risk frameworks&lt;/a&gt;&lt;sup id=&quot;fnref:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:9&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;&lt;sup id=&quot;fnref:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt;, &lt;a href=&quot;https://www.sec.gov/resources-small-businesses/small-business-compliance-guides/use-derivatives-registered-investment-companies-business-development-companies-small-entity&quot;&gt;SEC Rule 18f-4&lt;/a&gt;&lt;sup id=&quot;fnref:14&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:14&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt;…).&lt;/p&gt;

&lt;p&gt;Nevertheless, when it comes to actual computations, the above definition is &lt;em&gt;by no means constructive&lt;/em&gt;&lt;sup id=&quot;fnref:1:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; and accurately estimating VaR is &lt;em&gt;a very challenging statistical problem&lt;/em&gt;&lt;sup id=&quot;fnref:15:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:15&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; for which several methods have been developed.&lt;/p&gt;

&lt;p&gt;In this blog post, I will describe some of the most well-known univariate VaR estimation methods, ranging from non-parametric methods based on empirical quantiles to semi-parametric methods involving &lt;a href=&quot;https://en.wikipedia.org/wiki/Kernel_smoother&quot;&gt;kernel smoothing&lt;/a&gt; 
or &lt;a href=&quot;https://en.wikipedia.org/wiki/Extreme_value_theory&quot;&gt;extreme value theory&lt;/a&gt; and to parametric methods relying on distributional assumptions.&lt;/p&gt;

&lt;h2 id=&quot;value-at-risk&quot;&gt;Value-at-Risk&lt;/h2&gt;

&lt;h3 id=&quot;definition&quot;&gt;Definition&lt;/h3&gt;

&lt;p&gt;The Value-at-Risk of a portfolio of financial instruments corresponds to &lt;em&gt;the maximum potential change in value of [that portfolio] with a given probability over a certain horizon&lt;/em&gt;&lt;sup id=&quot;fnref:15:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:15&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;More formally, the Value-at-Risk $VaR_{\alpha}$ of a portfolio over a time horizon $T$ (1 day, 10 days, 20 days…) and at a confidence level $\alpha \in ]0,1[$ (95%, 97.5%, 99%…) can be defined&lt;sup id=&quot;fnref:82&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:82&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt; 
as the opposite&lt;sup id=&quot;fnref:20&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:20&quot; class=&quot;footnote&quot;&gt;7&lt;/a&gt;&lt;/sup&gt; of the $1 - \alpha$ quantile of the portfolio return distribution over the time horizon $T$&lt;/p&gt;

\[\text{VaR}_{\alpha} = - \inf_{x} \left\{x \in \mathbb{R}, P(X \leq x) \geq 1 - \alpha \right\}\]

&lt;p&gt;, where $X$ is a random variable representing the portfolio return over the time horizon $T$.&lt;/p&gt;

&lt;p&gt;This formula is also equivalent&lt;sup id=&quot;fnref:18&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:18&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt; to&lt;/p&gt;

\[\text{VaR}_{\alpha} = - F_X^{-1}(1 - \alpha)\]

&lt;p&gt;, where $F_X^{-1}$ is the inverse cumulative distribution function, also called the &lt;a href=&quot;https://en.wikipedia.org/wiki/Quantile_function&quot;&gt;quantile function&lt;/a&gt;, of the random variable $X$.&lt;/p&gt;
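&lt;p&gt;As a quick illustration of this quantile-function form of the definition, here is a minimal Python sketch for a portfolio whose returns are assumed Gaussian - an assumption made purely for illustration, with hypothetical parameter values:&lt;/p&gt;

```python
from statistics import NormalDist

def gaussian_var(mu, sigma, alpha):
    """VaR at confidence level alpha, as the opposite of the (1 - alpha)
    quantile of a Gaussian return distribution with mean mu and volatility sigma."""
    return -NormalDist(mu, sigma).inv_cdf(1 - alpha)

# Hypothetical portfolio: 0.05% mean daily return, 1% daily volatility
var_95 = gaussian_var(0.0005, 0.01, 0.95)  # roughly a 1.6% daily VaR
```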

&lt;p&gt;Graphically, this definition is illustrated in:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;Figure 1, for a continuous portfolio return distribution at a generic confidence level $\alpha$% and over a generic horizon.&lt;/p&gt;

    &lt;figure&gt;
  &lt;a href=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-definition-yamai.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-definition-yamai-small.png&quot; alt=&quot;Graphical illustration of a portfolio VaR as a quantile of its continuous return distribution. Source: Adapted from Yamai and Yoshiba.&quot; /&gt;&lt;/a&gt;
  &lt;figcaption&gt;Figure 1. Graphical illustration of a portfolio VaR as a quantile of its continuous return distribution. Source: Adapted from Yamai and Yoshiba.&lt;/figcaption&gt;
&lt;/figure&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Figure 2, for a discrete portfolio return distribution at a confidence level $\alpha$ = 99% and over a 1-month horizon, which is commented as follows in Jorion&lt;sup id=&quot;fnref:16&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:16&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

    &lt;blockquote&gt;
      &lt;p&gt;We need to find the loss that will not be exceeded in 99% of cases, or such that 1% of [the 624] observations - that is, 6 out of 624 occurrences - are lower. From Figure [2], this number is about -3.6%, [resulting in a portfolio VaR of 3.6%].&lt;/p&gt;
    &lt;/blockquote&gt;

    &lt;figure&gt;
  &lt;a href=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-definition-jorion.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-definition-jorion-small.png&quot; alt=&quot;Graphical illustration of a portfolio VaR as a quantile of its discrete monthly return distribution, $\alpha$ = 99%. Source: Jorion.&quot; /&gt;&lt;/a&gt;
  &lt;figcaption&gt;Figure 2. Graphical illustration of a portfolio VaR as a quantile of its discrete monthly return distribution, $\alpha$ = 99%. Source: Jorion.&lt;/figcaption&gt;
&lt;/figure&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;univariate-vs-multivariate-value-at-risk&quot;&gt;Univariate v.s. multivariate Value-at-Risk&lt;/h3&gt;

&lt;p&gt;A portfolio can be considered as both:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;An asset in itself, with its own return distribution.&lt;/li&gt;
  &lt;li&gt;A weighted collection of individual assets, each with their own return distribution.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This raises the question of whether &lt;em&gt;first to aggregate profit and loss data and proceed with a univariate [VaR] model for the aggregate, or to start with disaggregate data&lt;/em&gt;&lt;sup id=&quot;fnref:85&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:85&quot; class=&quot;footnote&quot;&gt;10&lt;/a&gt;&lt;/sup&gt; and proceed with a multivariate VaR model from the disaggregated data.&lt;/p&gt;

&lt;p&gt;In this blog post, I will only discuss univariate&lt;sup id=&quot;fnref:89&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:89&quot; class=&quot;footnote&quot;&gt;11&lt;/a&gt;&lt;/sup&gt; VaR models - originally suggested by Zangari&lt;sup id=&quot;fnref:87&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:87&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt; as &lt;em&gt;simple and effective approach[es] for calculating Value-at-Risk&lt;/em&gt;&lt;sup id=&quot;fnref:87:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:87&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt; - in which portfolio returns are considered as a univariate time series &lt;em&gt;without reference to the portfolio constituents&lt;/em&gt;&lt;sup id=&quot;fnref:22&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:22&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Indeed, &lt;em&gt;since the goal of VaR is to measure the market risk of a portfolio, it seems reasonable to model the portfolio return series directly&lt;/em&gt;&lt;sup id=&quot;fnref:87:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:87&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;h3 id=&quot;arithmetic-returns-vs-logarithmic-returns-in-value-at-risk-calculations&quot;&gt;Arithmetic returns v.s. logarithmic returns in Value-at-Risk calculations&lt;/h3&gt;

&lt;p&gt;In VaR calculations, it is usually preferred, &lt;em&gt;for a variety of reasons, to work with logarithmic returns rather than arithmetic (simple, linear) ones&lt;/em&gt;&lt;sup id=&quot;fnref:22:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:22&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;, c.f. Ballotta&lt;sup id=&quot;fnref:22:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:22&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt; and Jorion&lt;sup id=&quot;fnref:16:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:16&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt; for more details.&lt;/p&gt;

&lt;p&gt;In that case, though, because &lt;em&gt;investors are primarily interested in simple returns&lt;/em&gt;&lt;sup id=&quot;fnref:68&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:68&quot; class=&quot;footnote&quot;&gt;14&lt;/a&gt;&lt;/sup&gt;, the logarithmic VaR $\text{VaR}_{\alpha}^{(l)} $ needs to be converted into an arithmetic VaR $\text{VaR}_{\alpha}^{(a)} $.&lt;/p&gt;

&lt;p&gt;Thanks to the definition of VaR as a quantile of the portfolio return distribution and the relationship between arithmetic and logarithmic returns, this is easily done through the formula&lt;/p&gt;

\[\text{VaR}_{\alpha}^{(a)}  = 1 - \exp \left( - \text{VaR}_{\alpha}^{(l)} \right)\]

&lt;p&gt;While not frequently mentioned in the literature&lt;sup id=&quot;fnref:17&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:17&quot; class=&quot;footnote&quot;&gt;15&lt;/a&gt;&lt;/sup&gt;, it is important to be aware of this subtlety.&lt;/p&gt;
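&lt;p&gt;This conversion, which follows from the relationship $r^{(a)} = e^{r^{(l)}} - 1$ between arithmetic and logarithmic returns, can be sketched as follows (the 5% figure below is purely hypothetical):&lt;/p&gt;

```python
import math

def log_to_arithmetic_var(var_log):
    """Convert a VaR on logarithmic returns into a VaR on arithmetic returns,
    using the relationship r_arithmetic = exp(r_logarithmic) - 1."""
    return 1.0 - math.exp(-var_log)

# A hypothetical 5% logarithmic VaR maps to a slightly lower arithmetic VaR
var_a = log_to_arithmetic_var(0.05)  # about 4.88%
```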

&lt;h3 id=&quot;history-of-value-at-risk&quot;&gt;History of Value-at-Risk&lt;/h3&gt;

&lt;p&gt;&lt;em&gt;Searching for the best means to represent the risk exposure of a financial institution’s trading portfolio in a single number&lt;/em&gt;&lt;sup id=&quot;fnref:28&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:28&quot; class=&quot;footnote&quot;&gt;16&lt;/a&gt;&lt;/sup&gt; is a quest that &lt;em&gt;folklore attributes the inception of to Dennis Weatherstone at J.P. Morgan [in the late 1980s], who was looking for a way to convey meaningful 
risk exposure information to the financial institution’s board without the need for significant technical expertise on the part of the board members&lt;/em&gt;&lt;sup id=&quot;fnref:28:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:28&quot; class=&quot;footnote&quot;&gt;16&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;It is then Till Guldimann, head of global research at J.P. Morgan at that time, who designed what would come to be known as the J.P. Morgan’s daily VaR report&lt;sup id=&quot;fnref:46&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:46&quot; class=&quot;footnote&quot;&gt;17&lt;/a&gt;&lt;/sup&gt; and who thus &lt;em&gt;can be viewed as the creator of the term Value-at-Risk&lt;/em&gt;&lt;sup id=&quot;fnref:16:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:16&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;The interested reader is referred to Holton&lt;sup id=&quot;fnref:21&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:21&quot; class=&quot;footnote&quot;&gt;18&lt;/a&gt;&lt;/sup&gt; for an historical perspective on Value-at-Risk, in which the origins of VaR as a measure of risk are even &lt;em&gt;traced back as far as 1922 to capital requirements the New York Stock Exchange imposed on member firms&lt;/em&gt;&lt;sup id=&quot;fnref:21:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:21&quot; class=&quot;footnote&quot;&gt;18&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;h2 id=&quot;value-at-risk-estimation&quot;&gt;Value-at-Risk estimation&lt;/h2&gt;

&lt;p&gt;When a sample of portfolio returns over a given time horizon $r_1,…,r_n$ is available - like in ex post analysis -, computing the Value-at-Risk of that portfolio over the same horizon&lt;sup id=&quot;fnref:27&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:27&quot; class=&quot;footnote&quot;&gt;19&lt;/a&gt;&lt;/sup&gt; at a confidence level $\alpha \in ]0,1[$ is a textbook example of VaR calculation, as 
the opposite of the $(1 - \alpha)$% quantile of the empirical return distribution $r_1,…,r_n$.&lt;/p&gt;

&lt;p&gt;Problem is, the discrete nature of the extreme returns of interest makes it difficult to accurately compute that quantile, as explained in Danielsson and de Vries&lt;sup id=&quot;fnref:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;20&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;In the interior, the empirical sampling distribution is very dense, with adjacent observations very close to each other. As a result the sampling distribution is very smooth in the interior and is the mean squared error consistent estimate of the true distribution.&lt;/p&gt;

  &lt;p&gt;The closer one gets to the extremes, the longer the interval between adjacent returns becomes. This can be seen in [Figure 3] where the 7 largest and smallest returns on the stocks in the sample portfolio and SP-500 Index for 10 years are listed. These extreme observations are typically the most important for VaR analysis, however since these values are clearly discrete, the VaR will also be discrete, and hence be either underpredicted or overpredicted.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;figure&gt;
  &lt;a href=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-extreme-returns-danielsson-devries.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-extreme-returns-danielsson-devries-small.png&quot; alt=&quot;Extreme daily returns for select U.S. stocks and S&amp;amp;P 500, 1987-1996. Source: Danielsson and de Vries.&quot; /&gt;&lt;/a&gt;
  &lt;figcaption&gt;Figure 3. Extreme daily returns for select U.S. stocks and S&amp;amp;P 500, 1987-1996. Source: Danielsson and de Vries.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;In other words, &lt;em&gt;the quantile corresponding to the estimation of the Value-at-Risk […] rather depends on the realizations of the [portfolio returns] than on their probability distribution&lt;/em&gt;&lt;sup id=&quot;fnref:1:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;, so that 
&lt;em&gt;the Value-at-Risk calculated with a quantile of the empirical distribution will be highly unstable, especially when considering a Value-at-Risk with a high confidence level with only few available data&lt;/em&gt;&lt;sup id=&quot;fnref:1:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;This is why VaR estimation is &lt;em&gt;a very challenging statistical problem&lt;/em&gt;&lt;sup id=&quot;fnref:15:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:15&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, sharing many similarities with the problem of estimating the frequency and/or severity of extreme events in other domains, like flood frequency estimation&lt;sup id=&quot;fnref:66&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:66&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt; in hydrology.&lt;/p&gt;

&lt;p&gt;In order to compute a &lt;a href=&quot;https://en.wikipedia.org/wiki/Estimator&quot;&gt;statistical estimator&lt;/a&gt; of a portfolio Value-at-Risk, three main approaches exist:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;Non-parametric approaches, that do not make any specific distributional assumptions on the portfolio return distribution and whose VaR estimators do not depend on any auxiliary parameter.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Semi-parametric approaches, that do not make any specific distributional assumptions on the portfolio return distribution but whose VaR estimators depend on one or several auxiliary parameters.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Parametric approaches, that make a specific distributional assumption on the portfolio return distribution and whose VaR estimators depend on one or several auxiliary parameters.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;To be noted that non-parametric and semi-parametric approaches might still make distributional assumptions, in particular for convergence proofs - like assuming that returns are &lt;a href=&quot;https://en.wikipedia.org/wiki/Independent_and_identically_distributed_random_variables&quot;&gt;independent and identically distributed (i.i.d.)&lt;/a&gt; -, 
but these assumptions are then generic in nature, contrary to parametric approaches which assume a very specific return distribution, like &lt;a href=&quot;https://en.wikipedia.org/wiki/Normal_distribution&quot;&gt;a Gaussian distribution&lt;/a&gt;, which is &lt;em&gt;one of the most widely applied parametric probability distribution&lt;/em&gt;&lt;sup id=&quot;fnref:83&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:83&quot; class=&quot;footnote&quot;&gt;22&lt;/a&gt;&lt;/sup&gt; in finance.&lt;/p&gt;

&lt;h2 id=&quot;non-parametric-and-semi-parametric-value-at-risk-estimation&quot;&gt;Non-parametric and semi-parametric Value-at-Risk estimation&lt;/h2&gt;

&lt;p&gt;Chen and Yong Tang&lt;sup id=&quot;fnref:29&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:29&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt; notes that non-parametric and semi-parametric VaR estimators &lt;em&gt;have the advantages of (i) being free of distributional assumptions […] while being able to capture fat-tail and asymmetry distribution of returns automatically; 
and (ii) imposing much weaker assumptions on the dynamics of the return process and allowing data “speak for themselves”&lt;/em&gt;&lt;sup id=&quot;fnref:29:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:29&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;h3 id=&quot;empirical-quantile-of-the-portfolio-return-distribution&quot;&gt;Empirical quantile of the portfolio return distribution&lt;/h3&gt;

&lt;p&gt;A well-known estimator of the $(1 - \alpha)$% quantile of any probability distribution is the empirical $(1 - \alpha)$% quantile of that distribution, which relies on &lt;a href=&quot;https://en.wikipedia.org/wiki/Order_statistic&quot;&gt;order statistics&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;In the context of VaR estimation, the underlying idea is explained in Dowd&lt;sup id=&quot;fnref:23&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;24&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;If we have a sample of $n$ profit and loss (P/L) observations, we can regard each observation as giving an estimate of VaR at an implied probability level. For example, if $n$ = 100, we can take the 5% VaR as the negative of the sixth&lt;sup id=&quot;fnref:33&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:33&quot; class=&quot;footnote&quot;&gt;25&lt;/a&gt;&lt;/sup&gt; smallest P/L observation, the 1% VaR as the negative of the second-smallest, and so on.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;This leads to the empirical portfolio VaR estimator, defined&lt;sup id=&quot;fnref:35&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:35&quot; class=&quot;footnote&quot;&gt;26&lt;/a&gt;&lt;/sup&gt; as the opposite of the $n (1 - \alpha) + 1$-th lowest portfolio return&lt;sup id=&quot;fnref:44&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:44&quot; class=&quot;footnote&quot;&gt;27&lt;/a&gt;&lt;/sup&gt;&lt;/p&gt;

\[\text{VaR}_{\alpha} = -r_{\left( n (1 - \alpha) + 1 \right)}\]

&lt;p&gt;, where $r_{(1)} \leq r_{(2)} \leq … \leq r_{(n-1)} \leq r_{(n)}$ are the order statistics of the portfolio returns.&lt;/p&gt;

&lt;p&gt;Now, for an arbitrary confidence level $\alpha$, there is little chance that $n (1 - \alpha) + 1$ is an integer.&lt;/p&gt;

&lt;p&gt;In that case, two&lt;sup id=&quot;fnref:35:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:35&quot; class=&quot;footnote&quot;&gt;26&lt;/a&gt;&lt;/sup&gt; possible choices are:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
&lt;p&gt;Either to define the opposite of the $\lfloor n (1 - \alpha)  \rfloor + 1$-th lowest portfolio return&lt;sup id=&quot;fnref:44:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:44&quot; class=&quot;footnote&quot;&gt;27&lt;/a&gt;&lt;/sup&gt; as the empirical portfolio VaR estimator&lt;sup id=&quot;fnref:23:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;24&lt;/a&gt;&lt;/sup&gt;&lt;/p&gt;

\[\text{VaR}_{\alpha} = - r_{\left(  \lfloor n (1 - \alpha)  \rfloor + 1 \right)}\]
  &lt;/li&gt;
  &lt;li&gt;
&lt;p&gt;Or to define a linear interpolation&lt;sup id=&quot;fnref:24&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:24&quot; class=&quot;footnote&quot;&gt;28&lt;/a&gt;&lt;/sup&gt; between the opposite of the $\lfloor (n+1) \left( 1 - \alpha \right) \rfloor $-th and $ \lfloor (n+1)  \left( 1 - \alpha \right) \rfloor + 1$-th lowest portfolio returns&lt;sup id=&quot;fnref:44:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:44&quot; class=&quot;footnote&quot;&gt;27&lt;/a&gt;&lt;/sup&gt; as the empirical portfolio VaR estimator&lt;sup id=&quot;fnref:26&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:26&quot; class=&quot;footnote&quot;&gt;29&lt;/a&gt;&lt;/sup&gt;&lt;sup id=&quot;fnref:43&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:43&quot; class=&quot;footnote&quot;&gt;30&lt;/a&gt;&lt;/sup&gt;&lt;sup id=&quot;fnref:22:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:22&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;&lt;/p&gt;

\[\text{VaR}_{\alpha} = - \left( 1 - \gamma \right) r_{\left(  \lfloor  (n+1)  \left( 1 - \alpha \right) \rfloor \right)} - \gamma r_{\left( \lfloor (n+1) \left( 1 - \alpha \right) \rfloor + 1 \right)}\]

    &lt;p&gt;, with $\gamma$ $= (n+1) \left( 1 - \alpha \right)$  $- \lfloor (n+1) \left( 1 - \alpha \right) \rfloor$.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;
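&lt;p&gt;The two choices above can be sketched in Python as follows - a minimal sketch that assumes the sample is large enough, relative to the requested confidence level, for the involved order statistics to exist:&lt;/p&gt;

```python
import math

def empirical_var(returns, alpha, interpolate=False):
    """Empirical VaR estimator at confidence level alpha, as the opposite
    of an empirical (1 - alpha) quantile of the sample of returns."""
    r = sorted(returns)  # order statistics r_(1), ..., r_(n)
    n = len(r)
    if not interpolate:
        # opposite of the (floor(n * (1 - alpha)) + 1)-th lowest return
        k = math.floor(n * (1 - alpha)) + 1
        return -r[k - 1]
    # linear interpolation between two adjacent order statistics
    g = (n + 1) * (1 - alpha)
    j = math.floor(g)  # assumed to lie strictly between 0 and n here
    gamma = g - j
    return -(1.0 - gamma) * r[j - 1] - gamma * r[j]
```

&lt;p&gt;On a hypothetical sample of $n$ = 100 returns with $\alpha$ = 95%, the first variant picks the sixth-smallest observation, consistently with Dowd's description quoted above.&lt;/p&gt;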

&lt;p&gt;An interesting property of the resulting portfolio VaR estimator is that it is &lt;a href=&quot;https://en.wikipedia.org/wiki/Consistent_estimator&quot;&gt;consistent&lt;/a&gt; in the presence of &lt;a href=&quot;https://en.wikipedia.org/wiki/Mixing_(mathematics)#Mixing_in_stochastic_processes&quot;&gt;weak dependence&lt;/a&gt;&lt;sup id=&quot;fnref:37&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:37&quot; class=&quot;footnote&quot;&gt;31&lt;/a&gt;&lt;/sup&gt; between portfolio returns, c.f. Chen and Yong Tang&lt;sup id=&quot;fnref:29:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:29&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In terms of drawbacks, the two major limitations of the empirical portfolio VaR estimator are that:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;It &lt;em&gt;only takes into account a small part of the information contained in the [portfolio returns] distribution function&lt;/em&gt;&lt;sup id=&quot;fnref:1:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; - that is, at most two returns - which is highly &lt;a href=&quot;https://en.wikipedia.org/wiki/Efficiency_(statistics)&quot;&gt;inefficient&lt;/a&gt;, especially when the number of portfolio returns is already relatively small.&lt;/li&gt;
  &lt;li&gt;It &lt;em&gt;cannot generate any information about the tail of the return distribution beyond the smallest sample observation&lt;/em&gt;&lt;sup id=&quot;fnref:28:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:28&quot; class=&quot;footnote&quot;&gt;16&lt;/a&gt;&lt;/sup&gt;, which might lead to a severe underestimation of the true risk of the portfolio.&lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;kernel-smoothed-quantile-of-the-portfolio-return-distribution&quot;&gt;Kernel-smoothed quantile of the portfolio return distribution&lt;/h3&gt;

&lt;p&gt;&lt;em&gt;Another way to account for the information available in the empirical […] distribution&lt;/em&gt;&lt;sup id=&quot;fnref:1:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; than using the empirical quantile estimator discussed in the previous sub-section is to use the kernel-smoothed quantile estimator introduced in Gourieroux et al.&lt;sup id=&quot;fnref:31&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:31&quot; class=&quot;footnote&quot;&gt;32&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Kernel-smoothing is a methodology belonging to statistics and probability theory that &lt;em&gt;can be thought of as a way of generalizing a histogram constructed with the sample data&lt;/em&gt;&lt;sup id=&quot;fnref:28:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:28&quot; class=&quot;footnote&quot;&gt;16&lt;/a&gt;&lt;/sup&gt;, as illustrated in Figure 4.&lt;/p&gt;

&lt;figure&gt;
  &lt;a href=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-kernel-smoothing-wikipedia.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-kernel-smoothing-wikipedia-small.png&quot; alt=&quot;Histogram vs. kernel-smoothed density for the same sample of data. Source: Wikipedia.&quot; /&gt;&lt;/a&gt;
  &lt;figcaption&gt;Figure 4. Histogram vs. kernel-smoothed density for the same sample of data. Source: Wikipedia.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;In Figure 4, &lt;em&gt;where a histogram results in a density that is piecewise constant, a kernel[-smoothed] approximation results in a smooth density&lt;/em&gt;&lt;sup id=&quot;fnref:29:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:29&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Coming back to the estimator of Gourieroux et al.&lt;sup id=&quot;fnref:31:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:31&quot; class=&quot;footnote&quot;&gt;32&lt;/a&gt;&lt;/sup&gt;, it is defined as the $(1 - \alpha)$% quantile of a kernel-smoothed approximation of the portfolio return distribution, which essentially results 
in &lt;em&gt;a weighted average of the order statistics around [$r_{\left( \lfloor n (1 - \alpha) \rfloor + 1 \right)}$] rather than […] a single order statistic&lt;/em&gt;&lt;sup id=&quot;fnref:29:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:29&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt; or a linear interpolation between two order statistics.&lt;/p&gt;

&lt;p&gt;From a practical perspective, that VaR estimator is computed as follows:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;Select &lt;a href=&quot;https://en.wikipedia.org/wiki/Kernel_(statistics)#Kernel_functions_in_common_use&quot;&gt;a kernel function&lt;/a&gt; $K$, usually&lt;sup id=&quot;fnref:29:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:29&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt; taken as a symmetric&lt;sup id=&quot;fnref:40&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:40&quot; class=&quot;footnote&quot;&gt;33&lt;/a&gt;&lt;/sup&gt; probability density function.&lt;/p&gt;

    &lt;p&gt;The theoretically optimal&lt;sup id=&quot;fnref:39&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:39&quot; class=&quot;footnote&quot;&gt;34&lt;/a&gt;&lt;/sup&gt; choice for such a kernel function is the Epanechnikov kernel, defined as $ K(u) = \frac{3}{4} \left( 1 - u^2 \right) I_{|u| \leq 1}$.&lt;/p&gt;

    &lt;p&gt;However, the literature suggests that &lt;em&gt;the form of the kernel has little effect on the [accuracy] of the [kernel-smoothed approximation of the return distribution]&lt;/em&gt;&lt;sup id=&quot;fnref:31:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:31&quot; class=&quot;footnote&quot;&gt;32&lt;/a&gt;&lt;/sup&gt;, mainly because:&lt;/p&gt;
    &lt;ul&gt;
      &lt;li&gt;The theoretical framework used to establish the optimality of the Epanechnikov kernel relies on large-sample asymptotics, moreover in a debatable way&lt;sup id=&quot;fnref:32&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:32&quot; class=&quot;footnote&quot;&gt;35&lt;/a&gt;&lt;/sup&gt;.&lt;/li&gt;
      &lt;li&gt;The performance&lt;sup id=&quot;fnref:38&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:38&quot; class=&quot;footnote&quot;&gt;36&lt;/a&gt;&lt;/sup&gt; of other commonly used kernel functions is in any case very close to that of the Epanechnikov kernel&lt;sup id=&quot;fnref:30&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:30&quot; class=&quot;footnote&quot;&gt;37&lt;/a&gt;&lt;/sup&gt;.&lt;/li&gt;
    &lt;/ul&gt;

    &lt;p&gt;So, in applications, the most common&lt;sup id=&quot;fnref:29:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:29&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt;&lt;sup id=&quot;fnref:31:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:31&quot; class=&quot;footnote&quot;&gt;32&lt;/a&gt;&lt;/sup&gt; choice for a kernel function is rather the Gaussian kernel, defined as $ K(u) = \frac{1}{\sqrt{2 \pi }} e^{- \frac{u^2}{2} }$.&lt;/p&gt;

    &lt;p&gt;As a side note, using a kernel function to approximate the portfolio return distribution might look like a parametric approach to VaR estimation in disguise, but Butler and Schachter&lt;sup id=&quot;fnref:28:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:28&quot; class=&quot;footnote&quot;&gt;16&lt;/a&gt;&lt;/sup&gt; explain why this is not the case:&lt;/p&gt;

    &lt;blockquote&gt;
      &lt;p&gt;Note that use of a normal or Gaussian kernel estimator does not make the ultimate estimation of the VaR parametric.&lt;/p&gt;

      &lt;p&gt;As the sample size grows, the net sum of all the smoothed points approaches the true [portfolio return distribution], whatever that may be, irrespective of the method of smoothing the data. This is because the influence of each point becomes arbitrarily small as the sample size grows, so the choice of kernel imposes no restrictions on the results.&lt;/p&gt;
    &lt;/blockquote&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Select a kernel bandwidth parameter $h &amp;gt; 0$ for the kernel function.&lt;/p&gt;

    &lt;p&gt;Gourieroux et al.&lt;sup id=&quot;fnref:31:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:31&quot; class=&quot;footnote&quot;&gt;32&lt;/a&gt;&lt;/sup&gt; describe that parameter as follows:&lt;/p&gt;

    &lt;blockquote&gt;
      &lt;p&gt;The bandwidth parameter controls the range of data points that will be used to estimate the distribution.&lt;/p&gt;

      &lt;p&gt;A small bandwidth results in a rough distribution that does not improve appreciably on the original data, while a large bandwidth over-smoothes the density curve and erases the underlying structure.&lt;/p&gt;
    &lt;/blockquote&gt;

    &lt;p&gt;This latter point is illustrated in Figure 5 and Figure 6.&lt;/p&gt;

    &lt;figure&gt;
    &lt;a href=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-kernel-smoothing-wand-jones.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-kernel-smoothing-wand-jones-small.png&quot; alt=&quot;Influence of the bandwidth parameter on the kernel-smoothed approximation of a normal mixture distribution (dashed) from n = 1000 observations. Source: Adapted from Wand and Jones.&quot; /&gt;&lt;/a&gt;
    &lt;figcaption&gt;Figure 5. Influence of the bandwidth parameter on the kernel-smoothed approximation of a normal mixture distribution (dashed) from n = 1000 observations. Source: Adapted from Wand and Jones.&lt;/figcaption&gt;
  &lt;/figure&gt;

    &lt;figure&gt;
    &lt;a href=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-kernel-smoothing-KDE_bw_animation.gif&quot;&gt;&lt;img src=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-kernel-smoothing-KDE_bw_animation.gif&quot; alt=&quot;Dynamic influence of the bandwidth parameter on the kernel-smoothed approximation of a normal distribution. Source: KDEpy.&quot; /&gt;&lt;/a&gt;
    &lt;figcaption&gt;Figure 6. Dynamic influence of the bandwidth parameter on the kernel-smoothed approximation of a normal distribution. Source: &lt;a href=&quot;https://kdepy.readthedocs.io/en/latest/bandwidth.html&quot;&gt;KDEpy&lt;/a&gt;.&lt;/figcaption&gt;
  &lt;/figure&gt;

    &lt;p&gt;In Figure 5, it is clearly visible that:&lt;/p&gt;
    &lt;ul&gt;
      &lt;li&gt;
        &lt;p&gt;a) The estimate of the normal mixture distribution is &lt;em&gt;very rough&lt;/em&gt;&lt;sup id=&quot;fnref:30:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:30&quot; class=&quot;footnote&quot;&gt;37&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

        &lt;p&gt;This corresponds to a bandwidth parameter $h$ that is too small and undersmoothes the observations.&lt;/p&gt;
      &lt;/li&gt;
      &lt;li&gt;
        &lt;p&gt;b) The estimate of the normal mixture distribution smoothes away its bimodality structure.&lt;/p&gt;

        &lt;p&gt;This corresponds to a bandwidth parameter $h$ that is too large and oversmoothes the observations.&lt;/p&gt;
      &lt;/li&gt;
      &lt;li&gt;
        &lt;p&gt;c) The estimate of the normal mixture distribution &lt;em&gt;is not overly noisy, yet the essential structure of the underlying density has been recovered&lt;/em&gt;&lt;sup id=&quot;fnref:30:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:30&quot; class=&quot;footnote&quot;&gt;37&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

        &lt;p&gt;This corresponds to an adequate bandwidth parameter $h$.&lt;/p&gt;
      &lt;/li&gt;
    &lt;/ul&gt;

    &lt;p&gt;In Figure 6, the situation is the same as in Figure 5, except that the bandwidth parameter $h$ is being dynamically increased from 0 to ~20.&lt;/p&gt;

    &lt;p&gt;Figure 5 and Figure 6 empirically demonstrate that the choice of the bandwidth is &lt;em&gt;of crucial importance&lt;/em&gt;&lt;sup id=&quot;fnref:42&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:42&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt;, although &lt;em&gt;a difficult task, especially when smoothing the tails of underlying distributions with possible data scarcity&lt;/em&gt;&lt;sup id=&quot;fnref:42:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:42&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

    &lt;p&gt;The interested reader is referred to Wand and Jones&lt;sup id=&quot;fnref:30:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:30&quot; class=&quot;footnote&quot;&gt;37&lt;/a&gt;&lt;/sup&gt;, Tsybakov&lt;sup id=&quot;fnref:32:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:32&quot; class=&quot;footnote&quot;&gt;35&lt;/a&gt;&lt;/sup&gt; and Cheng and Sun&lt;sup id=&quot;fnref:48&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:48&quot; class=&quot;footnote&quot;&gt;39&lt;/a&gt;&lt;/sup&gt; for the description of several methods to choose the optimal bandwidth for a kernel function.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Compute the kernel-smoothed portfolio VaR estimator as&lt;/p&gt;

\[\text{VaR}_{\alpha} = - \hat{F}^{-1}(1 - \alpha)\]

    &lt;p&gt;, where $ \hat{F}^{-1}(1 - \alpha) $ is the solution of the quantile equation&lt;/p&gt;

\[\hat{F}(x) = \int_{-\infty}^x \hat{f}(u) \, du = 1 - \alpha\]

    &lt;p&gt;with $ \hat{f}(x) = \frac{1}{n h} \sum_{i=1}^n K \left(  \frac{x - r_i}{h} \right) $.&lt;/p&gt;

    &lt;p&gt;That part is typically done with a numerical algorithm, like the &lt;a href=&quot;https://en.wikipedia.org/wiki/Gauss%E2%80%93Newton_algorithm&quot;&gt;Gauss–Newton algorithm&lt;/a&gt; mentioned in Gourieroux et al.&lt;sup id=&quot;fnref:31:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:31&quot; class=&quot;footnote&quot;&gt;32&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;
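&lt;p&gt;The steps above can be sketched in Python. This is a minimal illustration, not the exact procedure of Gourieroux et al.: it assumes a Gaussian kernel, uses Silverman's rule-of-thumb bandwidth $h = 1.06 \hat{\sigma} n^{-1/5}$ as a default (an assumption, since no bandwidth rule is prescribed above), and solves the quantile equation by simple bisection rather than the Gauss-Newton algorithm, which is valid because $\hat{F}$ is increasing.&lt;/p&gt;

```python
import math

def gaussian_cdf(z):
    """Standard normal c.d.f. via the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def kernel_smoothed_var(returns, alpha, h=None):
    """Kernel-smoothed VaR with a Gaussian kernel.

    Solves F_hat(x) = 1 - alpha by bisection, where F_hat is the
    kernel-smoothed c.d.f. of the returns, and returns -x.
    """
    n = len(returns)
    if h is None:
        # Silverman's rule-of-thumb bandwidth (an assumed default)
        mu = sum(returns) / n
        sd = math.sqrt(sum((r - mu) ** 2 for r in returns) / (n - 1))
        h = 1.06 * sd * n ** (-0.2)

    def f_cdf(x):
        # F_hat(x): average of Gaussian c.d.f.s centred on each observation
        return sum(gaussian_cdf((x - r) / h) for r in returns) / n

    # F_hat is strictly increasing, so bisection converges to the quantile
    lo, hi = min(returns) - 10.0 * h, max(returns) + 10.0 * h
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if f_cdf(mid) < 1.0 - alpha:
            lo = mid
        else:
            hi = mid
    return -0.5 * (lo + hi)
```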

&lt;p&gt;Two important positive results on the kernel-smoothed VaR estimator are established in Chen and Yong Tang&lt;sup id=&quot;fnref:29:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:29&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;Theoretically, it is consistent in the presence of weak dependence between portfolio returns.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Empirically, it produces more precise estimates&lt;sup id=&quot;fnref:49&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:49&quot; class=&quot;footnote&quot;&gt;40&lt;/a&gt;&lt;/sup&gt; than those obtained with the empirical VaR estimator - especially when the number of observations is small - which &lt;em&gt;can translate to a large amount in financial terms&lt;/em&gt;&lt;sup id=&quot;fnref:29:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:29&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

    &lt;p&gt;Similar results - in a non-financial context - are reported in Cheng and Sun&lt;sup id=&quot;fnref:48:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:48&quot; class=&quot;footnote&quot;&gt;39&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

    &lt;blockquote&gt;
      &lt;p&gt;It turns out that kernel smoothed quantile estimators, with no matter which bandwidth selection method used, are more efficient than the empirical quantile estimator in most situations.&lt;/p&gt;

      &lt;p&gt;And when sample size is relatively small, kernel smoothed estimators are especially more efficient than the empirical quantile estimator.&lt;/p&gt;
    &lt;/blockquote&gt;

    &lt;p&gt;In other words, &lt;em&gt;the extra effort of smoothing pays off at the end&lt;/em&gt;&lt;sup id=&quot;fnref:29:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:29&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt;!&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The major limitation of the kernel-smoothed VaR estimator, though, is that if the selected kernel function does not reflect the tail features of the true portfolio return distribution, &lt;em&gt;some problems may arise when the quantile to be estimated requires an extrapolation […] 
far beyond the range of observed data&lt;/em&gt;&lt;sup id=&quot;fnref:45&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:45&quot; class=&quot;footnote&quot;&gt;41&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In the words of Danielsson and de Vries&lt;sup id=&quot;fnref:7:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;20&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;Almost all kernels are estimated with the entire data set, with interior observations dominating the kernel estimation.&lt;/p&gt;

  &lt;p&gt;While even the most careful kernel estimation will provide good estimates for the interior, there is no reason to believe that the kernel will describe the tails adequately.&lt;/p&gt;

  &lt;p&gt;Tail bumpiness is a common problem in kernel estimation.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;So, while the kernel-smoothed VaR estimator is capable of tail extrapolation - contrary to the empirical VaR estimator - that capability should be used with extreme caution&lt;sup id=&quot;fnref:75&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:75&quot; class=&quot;footnote&quot;&gt;42&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;A proper portfolio VaR estimator when tail extrapolation is needed thus remains elusive at this stage.&lt;/p&gt;

&lt;h3 id=&quot;extrapolated-empirical-quantile-of-the-portfolio-return-distribution&quot;&gt;Extrapolated empirical quantile of the portfolio return distribution&lt;/h3&gt;

&lt;p&gt;In order to solve the problem of tail extrapolation while retaining the simplicity of the empirical quantile estimator, Hutson&lt;sup id=&quot;fnref:43:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:43&quot; class=&quot;footnote&quot;&gt;30&lt;/a&gt;&lt;/sup&gt; proposes to extend the linearly interpolated quantile function 
into &lt;em&gt;a tail extrapolation quantile function&lt;/em&gt;&lt;sup id=&quot;fnref:32:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:32&quot; class=&quot;footnote&quot;&gt;35&lt;/a&gt;&lt;/sup&gt; that &lt;em&gt;allows for non-parametric extrapolation beyond the observed data&lt;/em&gt;&lt;sup id=&quot;fnref:43:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:43&quot; class=&quot;footnote&quot;&gt;30&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In terms of VaR estimation, Hutson’s work translates into the following extrapolated empirical portfolio VaR estimator:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;For $0 &amp;lt; \left( 1 - \alpha \right) \leq \frac{1}{n+1}$&lt;/p&gt;

\[\text{VaR}_{\alpha} = - r_{(1)} - \left( r_{(2)} - r_{(1)} \right) \log \left( (n+1) \left( 1 - \alpha \right) \right)\]
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;For $\left( 1 - \alpha \right) \in ]\frac{1}{n+1} , \frac{n}{n+1}[$, $ \text{VaR}_{\alpha} $ is defined as the standard empirical portfolio VaR estimator&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;For $\frac{n}{n+1} &amp;lt; \left( 1 - \alpha \right) &amp;lt; 1$&lt;/p&gt;

\[\text{VaR}_{\alpha} = - r_{(n)} + \left(  r_{(n)} - r_{(n-1)} \right) \log \left( (n+1) \alpha \right)\]
  &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Hutson&lt;sup id=&quot;fnref:43:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:43&quot; class=&quot;footnote&quot;&gt;30&lt;/a&gt;&lt;/sup&gt; establishes the consistency of his quantile estimator for i.i.d. observations and empirically demonstrates using various theoretical distributions that it &lt;em&gt;fits well to the ideal sample for all distributions for ideal samples as small as $n = 10$&lt;/em&gt;&lt;sup id=&quot;fnref:43:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:43&quot; class=&quot;footnote&quot;&gt;30&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Unfortunately for financial applications, Hutson&lt;sup id=&quot;fnref:43:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:43&quot; class=&quot;footnote&quot;&gt;30&lt;/a&gt;&lt;/sup&gt; also notes that his quantile estimator &lt;em&gt;appears to be [unable] to completely capture the tail behavior of &lt;a href=&quot;https://en.wikipedia.org/wiki/Heavy-tailed_distribution&quot;&gt;heavy-tailed&lt;/a&gt;&lt;sup id=&quot;fnref:57&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:57&quot; class=&quot;footnote&quot;&gt;43&lt;/a&gt;&lt;/sup&gt; distributions such as Cauchy&lt;/em&gt;&lt;sup id=&quot;fnref:43:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:43&quot; class=&quot;footnote&quot;&gt;30&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;This is confirmed by Banfi et al.&lt;sup id=&quot;fnref:45:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:45&quot; class=&quot;footnote&quot;&gt;41&lt;/a&gt;&lt;/sup&gt;, who find that Hutson’s method provides &lt;em&gt;competitive results when light-tailed distributions are of interest&lt;/em&gt;&lt;sup id=&quot;fnref:45:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:45&quot; class=&quot;footnote&quot;&gt;41&lt;/a&gt;&lt;/sup&gt; but &lt;em&gt;generates large biases in the case of heavy-tailed distributions&lt;/em&gt;&lt;sup id=&quot;fnref:45:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:45&quot; class=&quot;footnote&quot;&gt;41&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;h3 id=&quot;extreme-value-theory-based-quantile-of-the-portfolio-return-distribution&quot;&gt;Extreme value theory-based quantile of the portfolio return distribution&lt;/h3&gt;

&lt;p&gt;&lt;em&gt;Another possibility to improve [the] tail extrapolation&lt;/em&gt;&lt;sup id=&quot;fnref:45:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:45&quot; class=&quot;footnote&quot;&gt;41&lt;/a&gt;&lt;/sup&gt; properties of the empirical quantile estimator is to rely on extreme value theory (EVT), which is a domain of statistics 
&lt;em&gt;concerned with the study of the asymptotical distribution of extreme events, that is to say events which are rare in frequency and huge with respect to the majority of observations&lt;/em&gt;&lt;sup id=&quot;fnref:60&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:60&quot; class=&quot;footnote&quot;&gt;44&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Indeed, because VaR &lt;em&gt;only deals with extreme quantiles of the distribution&lt;/em&gt;&lt;sup id=&quot;fnref:60:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:60&quot; class=&quot;footnote&quot;&gt;44&lt;/a&gt;&lt;/sup&gt;, EVT sounds like a natural framework for providing &lt;em&gt;more reliable VaR estimates than the usual ones, given that [it] directly concentrates on the tails of the distribution, thus avoiding a major flaw 
of [other] approaches whose estimates are somehow biased by the credit they give to the central part of the distribution, thus underestimating extremes and outliers, which are exactly what one is interested in when calculating VaR&lt;/em&gt;&lt;sup id=&quot;fnref:60:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:60&quot; class=&quot;footnote&quot;&gt;44&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Two preliminary remarks before proceeding:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;One notational remark&lt;/p&gt;

    &lt;p&gt;The EVT literature&lt;sup id=&quot;fnref:54&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:54&quot; class=&quot;footnote&quot;&gt;45&lt;/a&gt;&lt;/sup&gt; typically focuses on the upper tail of distributions, so that the portfolio returns $r_1,\ldots,r_n$ need to be replaced by their opposites $r^{'}_1 = -r_1,\ldots,r^{'}_n = -r_n$.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;One important remark for applying EVT results in finance&lt;/p&gt;

    &lt;p&gt;&lt;em&gt;Most existing EVT methods require [i.i.d.] observations, whereas financial time series exhibit obvious serial dependence features such as volatility clustering&lt;/em&gt;&lt;sup id=&quot;fnref:60:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:60&quot; class=&quot;footnote&quot;&gt;44&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

    &lt;p&gt;It turns out that &lt;em&gt;this issue has been addressed in works dealing with weak serial dependence [and] the main message from these studies is that usual EVT methods are still valid&lt;/em&gt;&lt;sup id=&quot;fnref:60:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:60&quot; class=&quot;footnote&quot;&gt;44&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

    &lt;p&gt;For example, the Hill estimator discussed below was originally derived under the assumption of i.i.d. observations&lt;sup id=&quot;fnref:65&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:65&quot; class=&quot;footnote&quot;&gt;46&lt;/a&gt;&lt;/sup&gt;, but it has also been proved to be usable with weakly dependent observations&lt;sup id=&quot;fnref:72&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:72&quot; class=&quot;footnote&quot;&gt;47&lt;/a&gt;&lt;/sup&gt;; the same applies&lt;sup id=&quot;fnref:71&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:71&quot; class=&quot;footnote&quot;&gt;48&lt;/a&gt;&lt;/sup&gt; to the Weissman quantile estimator also discussed below.&lt;/p&gt;

    &lt;p&gt;In addition, even though &lt;em&gt;the [EVT] estimators obtained may be less accurate and neglecting this fact could lead to inadequate resolutions in order to cope with the risk of occurrence of extreme events&lt;/em&gt;&lt;sup id=&quot;fnref:60:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:60&quot; class=&quot;footnote&quot;&gt;44&lt;/a&gt;&lt;/sup&gt;, they are &lt;em&gt;consistent and unbiased in the presence of higher moment dependence&lt;/em&gt;&lt;sup id=&quot;fnref:68:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:68&quot; class=&quot;footnote&quot;&gt;14&lt;/a&gt;&lt;/sup&gt; and it is even possible to &lt;em&gt;explicitly model extreme dependence using the [notion of] extremal index&lt;/em&gt;&lt;sup id=&quot;fnref:68:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:68&quot; class=&quot;footnote&quot;&gt;14&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;With these remarks in mind, EVT offers two main methods&lt;sup id=&quot;fnref:68:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:68&quot; class=&quot;footnote&quot;&gt;14&lt;/a&gt;&lt;/sup&gt; to model the upper tail of the distribution of the negated portfolio returns $r^{'}_1,\ldots,r^{'}_n$ and compute an EVT-based portfolio VaR estimator:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;A fully parametric method based on the &lt;a href=&quot;https://en.wikipedia.org/wiki/Generalized_Pareto_distribution&quot;&gt;generalized Pareto distribution (GPD)&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;A semi-parametric method based on the Hill (or similar) estimator&lt;/li&gt;
&lt;/ul&gt;

&lt;h4 id=&quot;generalized-pareto-distribution-based-method&quot;&gt;Generalized Pareto distribution-based method&lt;/h4&gt;

&lt;p&gt;This method consists in fitting a generalized Pareto distribution to the upper tail of the portfolio return &lt;a href=&quot;https://en.wikipedia.org/wiki/Cumulative_distribution_function&quot;&gt;cumulative distribution function (c.d.f.)&lt;/a&gt; and computing its $\alpha$% quantile.&lt;/p&gt;

&lt;p&gt;This is for example done in the seminal paper of McNeil and Frey&lt;sup id=&quot;fnref:52&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:52&quot; class=&quot;footnote&quot;&gt;49&lt;/a&gt;&lt;/sup&gt;, in which such a distribution is fitted to the returns&lt;sup id=&quot;fnref:99&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:99&quot; class=&quot;footnote&quot;&gt;50&lt;/a&gt;&lt;/sup&gt; of different financial instruments.&lt;/p&gt;

&lt;p&gt;The theoretical justification for this method lies in &lt;a href=&quot;https://en.wikipedia.org/wiki/Pickands%E2%80%93Balkema%E2%80%93De_Haan_theorem&quot;&gt;the Pickands-Balkema-de Haan theorem&lt;/a&gt;, which states that &lt;em&gt;EVT holds sufficiently far out in the tails such 
that we can obtain the distribution not only of the maxima but also of other extremely large observations&lt;/em&gt;&lt;sup id=&quot;fnref:68:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:68&quot; class=&quot;footnote&quot;&gt;14&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In practice, fitting a GPD to the upper tail of the portfolio return distribution is a two-step process:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;First, a threshold $r^{'}_{(n - k)}, k \geq 1$ beyond which returns should be considered as belonging to the upper tail needs to be selected.&lt;/p&gt;

    &lt;p&gt;This threshold corresponds to the &lt;em&gt;location parameter&lt;/em&gt; $u \in \mathbb{R}$ of the GPD.&lt;/p&gt;

    &lt;p&gt;Unfortunately, &lt;em&gt;the choice of how many of the $k$-largest observations should be considered extreme is not straightforward&lt;/em&gt;&lt;sup id=&quot;fnref:45:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:45&quot; class=&quot;footnote&quot;&gt;41&lt;/a&gt;&lt;/sup&gt; and is actually &lt;em&gt;a central issue to any application of EVT&lt;/em&gt;&lt;sup id=&quot;fnref:60:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:60&quot; class=&quot;footnote&quot;&gt;44&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

    &lt;p&gt;Indeed, as detailed in de Haan et al.&lt;sup id=&quot;fnref:50&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:50&quot; class=&quot;footnote&quot;&gt;51&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

    &lt;blockquote&gt;
      &lt;p&gt;Theoretically, the statistical properties of EVT-based estimators are established for $k$ such that $k \to \infty$ and $k/n \to 0$ as $n \to \infty$.&lt;/p&gt;

      &lt;p&gt;In applications with a finite sample size, it is necessary to investigate how to choose the number of high observations used in estimation.&lt;/p&gt;

      &lt;p&gt;For financial practitioners, two difficulties arise: firstly, there is no straightforward procedure for the selection; secondly, the performance of the EVT estimators is rather sensitive to this choice.&lt;/p&gt;

      &lt;p&gt;More specifically, there is a bias–variance tradeoff: with a low level of $k$, the estimation variance is at a high level which may not be acceptable for the application; by increasing $k$, i.e., using progressively more data, the variance is reduced, but at the cost of an increasing bias.&lt;/p&gt;
    &lt;/blockquote&gt;

    &lt;p&gt;The literature offers some guidance on how to choose an adequate cut-off between the central part and the upper tail of the return distribution but that choice remains notoriously difficult in general, cf. Benito et al.&lt;sup id=&quot;fnref:70&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:70&quot; class=&quot;footnote&quot;&gt;52&lt;/a&gt;&lt;/sup&gt; for a review.&lt;/p&gt;

    &lt;p&gt;Fortunately, in the specific context of VaR estimation, &lt;em&gt;there is a large set of thresholds that provide similar GPD quantiles estimators and as a consequence similar market risk measures&lt;/em&gt;&lt;sup id=&quot;fnref:103&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:103&quot; class=&quot;footnote&quot;&gt;53&lt;/a&gt;&lt;/sup&gt; - from about the 80th percentile of observations to the 95th percentile of observations&lt;sup id=&quot;fnref:103:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:103&quot; class=&quot;footnote&quot;&gt;53&lt;/a&gt;&lt;/sup&gt; - so that &lt;em&gt;the researchers and practitioners should not focus excessively on the threshold choice&lt;/em&gt;&lt;sup id=&quot;fnref:103:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:103&quot; class=&quot;footnote&quot;&gt;53&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

    &lt;p&gt;Figure 7 and Figure 8 illustrate this point with a GPD fitted to the lower tail of daily percentage returns of Deutsche mark/British pound (DEM/GBP) exchange rates from 3rd January 1984 to 31st December 1991&lt;sup id=&quot;fnref:107&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:107&quot; class=&quot;footnote&quot;&gt;54&lt;/a&gt;&lt;/sup&gt;, using two different thresholds:&lt;/p&gt;
    &lt;ul&gt;
      &lt;li&gt;
        &lt;p&gt;Figure 7 - A threshold $u \approx -1.2292$, corresponding to ~2% of the observations.&lt;/p&gt;

        &lt;figure&gt;
    &lt;a href=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-bollerslev-three.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-bollerslev-three-small.png&quot; alt=&quot;Lower tail of daily percentage returns of Deutsche mark/British pound (DEM/GBP) exchange rates, GPD fit with threshold u = -1.2292, 3rd January 1984 to 31st December 1991.&quot; /&gt;&lt;/a&gt;
    &lt;figcaption&gt;Figure 7. Lower tail of daily percentage returns of Deutsche mark/British pound (DEM/GBP) exchange rates, GPD fit with threshold u = -1.2292, 3rd January 1984 to 31st December 1991.&lt;/figcaption&gt;
  &lt;/figure&gt;
      &lt;/li&gt;
      &lt;li&gt;
        &lt;p&gt;Figure 8 - A threshold $u \approx -0.2683$, corresponding to ~20% of the observations.&lt;/p&gt;

        &lt;figure&gt;
    &lt;a href=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-bollerslev-two.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-bollerslev-two-small.png&quot; alt=&quot;Lower tail of daily percentage returns of Deutsche mark/British pound (DEM/GBP) exchange rates, GPD fit with threshold u = -0.2683, 3rd January 1984 to 31st December 1991.&quot; /&gt;&lt;/a&gt;
    &lt;figcaption&gt;Figure 8. Lower tail of daily percentage returns of Deutsche mark/British pound (DEM/GBP) exchange rates, GPD fit with threshold u = -0.2683, 3rd January 1984 to 31st December 1991.&lt;/figcaption&gt;
  &lt;/figure&gt;
      &lt;/li&gt;
    &lt;/ul&gt;

    &lt;p&gt;From Figure 7 and Figure 8, both thresholds lead to an equally good GPD fit of the extreme lower tail, which is confirmed numerically by goodness-of-fit measures.&lt;/p&gt;

    &lt;p&gt;Consequently, in order to keep the threshold selection step as simple as possible for VaR estimation, the early suggestion of DuMouchel&lt;sup id=&quot;fnref:102&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:102&quot; class=&quot;footnote&quot;&gt;55&lt;/a&gt;&lt;/sup&gt; to use the 90th percentile of observations seems a very good starting point.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Second, the &lt;em&gt;shape parameter&lt;/em&gt; $\xi \in \mathbb{R}$ and the &lt;em&gt;scale parameter&lt;/em&gt; $\sigma &amp;gt; 0$ of the GPD need to be estimated.&lt;/p&gt;

    &lt;p&gt;This is usually done through &lt;a href=&quot;https://en.wikipedia.org/wiki/Maximum_likelihood_estimation&quot;&gt;likelihood maximization&lt;/a&gt;, but other procedures are described in the literature (method of moments&lt;sup id=&quot;fnref:100&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:100&quot; class=&quot;footnote&quot;&gt;56&lt;/a&gt;&lt;/sup&gt;, maximization of goodness-of-fit estimators&lt;sup id=&quot;fnref:101&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:101&quot; class=&quot;footnote&quot;&gt;57&lt;/a&gt;&lt;/sup&gt;, etc.).&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;
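&lt;p&gt;As an illustration of the second step, below is a minimal Python sketch of the method of moments for the GPD, under the assumption that the threshold exceedances have already been extracted; the function name is mine, and the estimator is only valid when the shape parameter is below one half, so that the mean and the variance of the GPD exist:&lt;/p&gt;

```python
import statistics

def gpd_fit_moments(exceedances):
    # Method-of-moments estimates of the GPD shape xi and scale sigma,
    # from the sample mean m and sample variance v of the exceedances:
    #   xi    = (1 - m^2/v) / 2
    #   sigma = m * (1 + m^2/v) / 2
    # Only valid when the true shape parameter is below one half.
    m = statistics.mean(exceedances)
    v = statistics.variance(exceedances)
    r = m * m / v
    return (1.0 - r) / 2.0, m * (1.0 + r) / 2.0
```

&lt;p&gt;In practice, likelihood maximization is usually preferred, with the method-of-moments estimates serving for example as starting values for the numerical optimization.&lt;/p&gt;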

&lt;p&gt;Once the parameters of the GPD have been determined, the EVT/GPD-based portfolio VaR estimator&lt;sup id=&quot;fnref:52:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:52&quot; class=&quot;footnote&quot;&gt;49&lt;/a&gt;&lt;/sup&gt; is defined as the $\alpha$% quantile of that GPD through the formula&lt;/p&gt;

\[\text{VaR}^{\text{GPD}}_{\alpha} = 
      \begin{cases}
          r^{'}_{(n - k)} + \frac{\hat{\sigma}}{\hat{\xi}} \left( \left( \frac{k}{n ( 1 - \alpha)} \right)^{\hat{\xi}} -1 \right) &amp;amp;\text{if } \hat{\xi} \ne 0 \\
          r^{'}_{(n - k)} - \hat{\sigma} \ln \frac{n ( 1 - \alpha)}{k} &amp;amp;\text{if } \hat{\xi} = 0 \\ 
      \end{cases}\]

&lt;p&gt;, where:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$\hat{\xi}$ is an estimator of the shape parameter $\xi$  of the GPD.&lt;/li&gt;
  &lt;li&gt;$\hat{\sigma}$ is an estimator of the scale parameter $\sigma$ of the GPD.&lt;/li&gt;
&lt;/ul&gt;
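&lt;p&gt;The formula above translates into a few lines of Python (a minimal sketch, with the function name and arguments mine; the estimates $\hat{\xi}$ and $\hat{\sigma}$ are assumed to have been computed beforehand, and the sample size is written n):&lt;/p&gt;

```python
import math

def gpd_var(u, xi_hat, sigma_hat, k, n, alpha):
    # EVT/GPD-based VaR estimator.
    # u: the threshold r'_(n-k), i.e. the (k+1)-th largest of the n losses
    # xi_hat, sigma_hat: estimated GPD shape and scale parameters
    # k: number of exceedances above u; alpha: confidence level, e.g. 0.99
    ratio = k / (n * (1.0 - alpha))
    if xi_hat != 0.0:
        return u + (sigma_hat / xi_hat) * (ratio ** xi_hat - 1.0)
    # xi_hat = 0 case: u - sigma_hat * ln(n * (1 - alpha) / k)
    return u + sigma_hat * math.log(ratio)
```

&lt;p&gt;Note how the two branches agree in the limit $\hat{\xi} \to 0$.&lt;/p&gt;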

&lt;p&gt;Figure 9, taken from Danielsson&lt;sup id=&quot;fnref:68:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:68&quot; class=&quot;footnote&quot;&gt;14&lt;/a&gt;&lt;/sup&gt;, illustrates the near-perfect fit that can be obtained when such a method is applied to the upper and lower tails of the daily S&amp;amp;P 500 returns over the period 1970-2009.&lt;/p&gt;

&lt;figure&gt;
  &lt;a href=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-sp500-danielsson.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-sp500-danielsson-small.png&quot; alt=&quot;Upper and lower tails of daily S&amp;amp;P 500 returns fitted with an EVT-estimated distribution vs. a normal distribution, 1970-2009. Source: Danielsson.&quot; /&gt;&lt;/a&gt;
  &lt;figcaption&gt;Figure 9. Upper and lower tails of daily S&amp;amp;P 500 returns fitted with an EVT-estimated distribution vs. a normal distribution, 1970-2009. Source: Danielsson.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;h4 id=&quot;hill-estimator-based-method&quot;&gt;Hill estimator-based method&lt;/h4&gt;

&lt;p&gt;Under the assumption that the portfolio return distribution belongs&lt;sup id=&quot;fnref:51&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:51&quot; class=&quot;footnote&quot;&gt;58&lt;/a&gt;&lt;/sup&gt; to the generic family of heavy-tailed distributions, this method consists in deriving an asymptotic estimator of its $\alpha$% quantile.&lt;/p&gt;

&lt;p&gt;This is for example done in Danielsson and de Vries&lt;sup id=&quot;fnref:55&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:55&quot; class=&quot;footnote&quot;&gt;59&lt;/a&gt;&lt;/sup&gt; and in Drees&lt;sup id=&quot;fnref:71:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:71&quot; class=&quot;footnote&quot;&gt;48&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;This method is justified by stylized facts&lt;sup id=&quot;fnref:56&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:56&quot; class=&quot;footnote&quot;&gt;60&lt;/a&gt;&lt;/sup&gt; of asset returns, as explained in Danielsson and de Vries&lt;sup id=&quot;fnref:7:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;20&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;[…] because we know that financial return data are heavy tailed distributed, one can rely on a limit expansion for the tail behavior that is shared by all heavy tailed distributions. The importance of &lt;a href=&quot;https://en.wikipedia.org/wiki/Fisher%E2%80%93Tippett%E2%80%93Gnedenko_theorem&quot;&gt;the central limit law for extremes&lt;/a&gt; is similar to the importance of the central limit law, i.e. one does not have to choose a particular parametric distribution.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Under that generic assumption, it can be demonstrated&lt;sup id=&quot;fnref:61&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:61&quot; class=&quot;footnote&quot;&gt;61&lt;/a&gt;&lt;/sup&gt; that the upper tail of the portfolio returns decays as a power function multiplied by a slowly varying function, that is&lt;/p&gt;

\[1 - F(x) = x^{-\gamma} L(x), x &amp;gt; 0\]

&lt;p&gt;, where:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$F$ is the c.d.f. of the (opposite of the) portfolio returns.&lt;/li&gt;
  &lt;li&gt;$\gamma = \frac{1}{\xi} &amp;gt; 0$ is the &lt;em&gt;tail index&lt;/em&gt;&lt;sup id=&quot;fnref:59&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:59&quot; class=&quot;footnote&quot;&gt;62&lt;/a&gt;&lt;/sup&gt; of $F$, with $\xi$ the same shape parameter as in the GPD method.&lt;/li&gt;
  &lt;li&gt;$L$ is a slowly varying function in a sense defined in Rocco&lt;sup id=&quot;fnref:60:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:60&quot; class=&quot;footnote&quot;&gt;44&lt;/a&gt;&lt;/sup&gt;.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;From this asymptotic behaviour, the EVT/Weissman-based portfolio VaR estimator is defined as the $\alpha$% Weissman&lt;sup id=&quot;fnref:64&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:64&quot; class=&quot;footnote&quot;&gt;63&lt;/a&gt;&lt;/sup&gt; quantile estimator of a heavy-tailed distribution through the formula&lt;/p&gt;

\[\text{VaR}^{\text{WM}}_{\alpha} = r^{'}_{(n - k)} \left( \frac{k}{n \left( 1 - \alpha \right) } \right)^{\frac{1}{\hat{\gamma}}}\]

&lt;p&gt;, where&lt;sup id=&quot;fnref:73&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:73&quot; class=&quot;footnote&quot;&gt;64&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;$\hat{\gamma}$ is an estimator of the tail index $\gamma$.&lt;/p&gt;

    &lt;p&gt;The most frequently employed estimator of the tail index is &lt;em&gt;by far the Hill estimator&lt;/em&gt;&lt;sup id=&quot;fnref:60:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:60&quot; class=&quot;footnote&quot;&gt;44&lt;/a&gt;&lt;/sup&gt; introduced in Hill&lt;sup id=&quot;fnref:65:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:65&quot; class=&quot;footnote&quot;&gt;46&lt;/a&gt;&lt;/sup&gt; and defined conditionally on $k$ as&lt;/p&gt;

\[\hat{\gamma}  = \left( \frac{1}{k} \sum_{i=1}^k \ln \frac{r^{'}_{(n - i + 1)}}{r^{'}_{(n - k)}} \right)^{-1}\]

    &lt;p&gt;The interested reader is referred to Fedotenkov&lt;sup id=&quot;fnref:67&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:67&quot; class=&quot;footnote&quot;&gt;65&lt;/a&gt;&lt;/sup&gt;, in which more than one hundred tail index estimators proposed in the literature are reviewed.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;$k$ is the number of observations $r^{'}_{(n - k + 1)}$, …, $r^{'}_{(n)}$ that should be considered extreme.&lt;/p&gt;

    &lt;p&gt;Here, and contrary to the GPD method, the threshold index $k$ has a huge influence on the Weissman quantile estimator and thus on the EVT/Weissman-based VaR estimator.&lt;/p&gt;

    &lt;p&gt;As an illustration, Figure 10 depicts the daily VaR of the Dow Jones Industrial Average index at a confidence level $\alpha = 99.9$%&lt;sup id=&quot;fnref:105&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:105&quot; class=&quot;footnote&quot;&gt;66&lt;/a&gt;&lt;/sup&gt; as a function of $k$, when estimated by the EVT/Weissman-based VaR estimator.&lt;/p&gt;

    &lt;figure&gt;
    &lt;a href=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-weissman-dehaan.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-weissman-dehaan-small.png&quot; alt=&quot;Impact of the threshold index k between the central part and the upper tail of the Dow Jones Industrial Average daily return distribution, Weissman quantile estimator of VaR 99.9%, 1980-2010. Source: de Haan et al.&quot; /&gt;&lt;/a&gt;
    &lt;figcaption&gt;Figure 10. Impact of the threshold index k between the central part and the upper tail of the Dow Jones Industrial Average daily return distribution, Weissman quantile estimator of VaR 99.9%, 1980-2010. Source: de Haan et al.&lt;/figcaption&gt;
  &lt;/figure&gt;

    &lt;p&gt;In Figure 10, three distinct ranges of values for the index $k$ are visible:&lt;/p&gt;
    &lt;ul&gt;
      &lt;li&gt;
        &lt;p&gt;An initial range of values for $k \in [1, 150]$ that results in a bumpy VaR estimate, although in this specific figure the bumpiness is not that pronounced.&lt;/p&gt;

        &lt;p&gt;This is the “high variance” region of the estimator.&lt;/p&gt;
      &lt;/li&gt;
      &lt;li&gt;
        &lt;p&gt;A second range of values for $k \in [150, 300]$ that results in a flat-ish VaR estimate.&lt;/p&gt;

        &lt;p&gt;This is the “optimal bias-variance” region of the estimator, in which the exact value of $k$ is not important.&lt;/p&gt;

        &lt;p&gt;Obtaining an estimate of $k$ belonging to that region is the ultimate goal.&lt;/p&gt;
      &lt;/li&gt;
      &lt;li&gt;
        &lt;p&gt;A third range of values for $k \geq 300$ that results in a diverging VaR estimate.&lt;/p&gt;

        &lt;p&gt;This is the “high bias” region of the estimator.&lt;/p&gt;
      &lt;/li&gt;
    &lt;/ul&gt;

    &lt;p&gt;Fortunately, this problem can be solved by using a bias-corrected Weissman quantile estimator.&lt;/p&gt;

    &lt;p&gt;For example, Figure 11 depicts the daily VaR of the Dow Jones Industrial Average index at a confidence level $\alpha = 99.9$% as a function of $k$, when estimated by the EVT/unbiased Weissman-based VaR estimator introduced in de Haan et al.&lt;sup id=&quot;fnref:50:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:50&quot; class=&quot;footnote&quot;&gt;51&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

    &lt;figure&gt;
    &lt;a href=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-weissman-corrected-dehaan.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-weissman-corrected-dehaan-small.png&quot; alt=&quot;Impact of the threshold index k between the central part and the upper tail of the Dow Jones Industrial Average daily return distribution, unbiased Weissman quantile estimator of VaR 99.9%, 1980-2010. Source: de Haan et al.&quot; /&gt;&lt;/a&gt;
    &lt;figcaption&gt;Figure 11. Impact of the threshold index k between the central part and the upper tail of the Dow Jones Industrial Average daily return distribution, unbiased Weissman quantile estimator of VaR 99.9%, 1980-2010. Source: de Haan et al.&lt;/figcaption&gt;
  &lt;/figure&gt;

    &lt;p&gt;Comparing Figure 11 to Figure 10, the improvement in stability of the quantile estimator is striking.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;
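&lt;p&gt;Putting the Hill and Weissman estimators together gives the following minimal Python sketch (function name and return convention mine); the average log-exceedance ratio estimates the shape parameter $\xi$, whose reciprocal is the tail index estimate $\hat{\gamma}$, and the losses above the threshold are assumed positive so that the logarithms are well defined:&lt;/p&gt;

```python
import math

def hill_weissman_var(returns, k, alpha):
    # EVT/Weissman-based VaR estimator, with the tail index estimated
    # from the k largest losses via the Hill estimator.
    losses = sorted(-r for r in returns)   # losses in increasing order
    n = len(losses)
    u = losses[n - k - 1]                  # threshold r'_(n-k), assumed positive
    # Average log-exceedance ratio: estimates xi = 1/gamma
    xi_hat = sum(math.log(losses[n - i] / u) for i in range(1, k + 1)) / k
    gamma_hat = 1.0 / xi_hat               # tail index estimate
    # Weissman quantile estimator of the alpha% VaR
    var = u * (k / (n * (1.0 - alpha))) ** (1.0 / gamma_hat)
    return var, gamma_hat
```

&lt;p&gt;Re-running this sketch for different values of $k$ reproduces the sensitivity to the threshold index discussed above.&lt;/p&gt;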

&lt;p&gt;One important limitation of this method compared to the GPD-based method is that, depending on the exact financial data used (financial instrument, time period, returns frequency…), the stylized fact of heavy-tailed return distributions might be violated, 
with return distributions being thin-tailed or short-tailed&lt;sup id=&quot;fnref:63&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:63&quot; class=&quot;footnote&quot;&gt;67&lt;/a&gt;&lt;/sup&gt; instead of heavy-tailed, c.f. Longin and Solnik&lt;sup id=&quot;fnref:63:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:63&quot; class=&quot;footnote&quot;&gt;67&lt;/a&gt;&lt;/sup&gt; and Drees&lt;sup id=&quot;fnref:71:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:71&quot; class=&quot;footnote&quot;&gt;48&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Incidentally, the two lower-tail GPD fits depicted in Figure 7 and Figure 8 both correspond to a short-tailed distribution, since $\hat{\xi} \approx -0.2304$ in Figure 7 and $\hat{\xi} \approx -0.021$ in Figure 8.&lt;/p&gt;

&lt;h4 id=&quot;practical-performances&quot;&gt;Practical performance&lt;/h4&gt;

&lt;p&gt;In terms of practical performance, Danielsson&lt;sup id=&quot;fnref:68:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:68&quot; class=&quot;footnote&quot;&gt;14&lt;/a&gt;&lt;/sup&gt; highlights that EVT-based VaR estimation &lt;em&gt;delivers good probability–quantile estimates where EVT holds&lt;/em&gt;&lt;sup id=&quot;fnref:68:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:68&quot; class=&quot;footnote&quot;&gt;14&lt;/a&gt;&lt;/sup&gt;, but that &lt;em&gt;there are no rules that tell us when [EVT] becomes inaccurate&lt;/em&gt;&lt;sup id=&quot;fnref:68:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:68&quot; class=&quot;footnote&quot;&gt;14&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Indeed, &lt;em&gt;it depends on the underlying distribution of the data. In some cases, it may be accurate up to 1% or even 5%, while in other cases it is not reliable even up to 0.1%&lt;/em&gt;&lt;sup id=&quot;fnref:68:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:68&quot; class=&quot;footnote&quot;&gt;14&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In addition, the accuracy of EVT also depends on the selected threshold.&lt;/p&gt;

&lt;p&gt;This is illustrated in Figure 12, which is identical to Figure 7 except that the GPD fit has been graphically extended to the right of the threshold $u \approx -1.2292$.&lt;/p&gt;

&lt;figure&gt;
  &lt;a href=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-bollerslev-one.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/univariate-value-at-risk-estimation-methods-bollerslev-one-small.png&quot; alt=&quot;Lower tail of daily percentage returns of Deutsche mark/British pound (DEM/GBP) exchange rates, GPD fit with threshold u = -1.2292 extended, 3rd January 1984 to 31st December 1991.&quot; /&gt;&lt;/a&gt;
  &lt;figcaption&gt;Figure 12. Lower tail of daily percentage returns of Deutsche mark/British pound (DEM/GBP) exchange rates, GPD fit with threshold u = -1.2292 extended, 3rd January 1984 to 31st December 1991.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;From Figure 7 and Figure 8, both GPD fits seem usable below $u \approx -1.2292$.&lt;/p&gt;

&lt;p&gt;But while the GPD fit of Figure 8 is valid up to $u \approx -0.2683$, Figure 12 makes it clear that the GPD fit of Figure 7 is completely off above $u \approx -1.2292$.&lt;/p&gt;

&lt;h3 id=&quot;other-estimators&quot;&gt;Other estimators&lt;/h3&gt;

&lt;p&gt;It is out of the scope of this blog post to list all the non-parametric and semi-parametric portfolio VaR estimators discussed in the literature, but I would like to finish this section by mentioning estimators 
based on &lt;a href=&quot;https://en.wikipedia.org/wiki/Smoothing_spline&quot;&gt;smoothing splines&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;An example of such an estimator is described in Shaker-Akhtekhane and Poorabbas&lt;sup id=&quot;fnref:76&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:76&quot; class=&quot;footnote&quot;&gt;68&lt;/a&gt;&lt;/sup&gt;, in which it is empirically demonstrated to &lt;em&gt;outperform common historical, parametric, and kernel-based methods&lt;/em&gt;&lt;sup id=&quot;fnref:76:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:76&quot; class=&quot;footnote&quot;&gt;68&lt;/a&gt;&lt;/sup&gt; 
when applied to the S&amp;amp;P500 index at VaR confidence levels of $\alpha = 95$% and $\alpha = 99$%.&lt;/p&gt;

&lt;p&gt;Miscellaneous points of attention for such estimators:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;Monotonicity constraints need to be imposed on smoothing splines in order to properly approximate the c.d.f. of the portfolio returns, as highlighted in Wood&lt;sup id=&quot;fnref:78&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:78&quot; class=&quot;footnote&quot;&gt;69&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

    &lt;p&gt;These are not mentioned in Shaker-Akhtekhane and Poorabbas&lt;sup id=&quot;fnref:76:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:76&quot; class=&quot;footnote&quot;&gt;68&lt;/a&gt;&lt;/sup&gt;, but are required in practice.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Similar to kernel-smoothing, a smoothing parameter needs to be selected.&lt;/p&gt;

    &lt;p&gt;The same kinds of problems arise, with solutions that are very close in spirit, like generalized cross-validation, c.f. Wood&lt;sup id=&quot;fnref:78:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:78&quot; class=&quot;footnote&quot;&gt;69&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Again similar to kernel-smoothing, special care must be taken when extrapolating beyond the range of observed values.&lt;/p&gt;

    &lt;p&gt;In particular, the estimator described in Shaker-Akhtekhane and Poorabbas&lt;sup id=&quot;fnref:76:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:76&quot; class=&quot;footnote&quot;&gt;68&lt;/a&gt;&lt;/sup&gt; cannot extrapolate beyond the smallest and the highest portfolio returns due to the splines constraints associated with these two points.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;
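&lt;p&gt;To make these points of attention concrete, here is a dependency-light Python stand-in, which replaces the smoothing spline by a piecewise-linear - hence automatically monotone - interpolation of the empirical c.d.f.; like the spline-based estimator of Shaker-Akhtekhane and Poorabbas, it cannot extrapolate beyond the smallest and highest observed returns (the function name and the choice of plotting positions are mine):&lt;/p&gt;

```python
import numpy as np

def interpolated_cdf_var(returns, alpha):
    # Stand-in for a spline-smoothed c.d.f.: interpolate the empirical
    # c.d.f. with a monotone (here, piecewise-linear) function, then invert it.
    x = np.sort(np.asarray(returns, dtype=float))
    n = x.size
    p = (np.arange(1, n + 1) - 0.5) / n     # plotting positions
    # np.interp clamps outside [p[0], p[-1]]: like the spline-based
    # estimator, there is no extrapolation beyond the observed returns.
    q = np.interp(1.0 - alpha, p, x)        # (1 - alpha) quantile of returns
    return -q                               # VaR is the opposite of that quantile
```

&lt;p&gt;An actual spline-based estimator would additionally smooth the interpolated c.d.f., subject to the monotonicity constraints mentioned above.&lt;/p&gt;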

&lt;h2 id=&quot;parametric-value-at-risk-estimation&quot;&gt;Parametric Value-at-Risk estimation&lt;/h2&gt;

&lt;p&gt;Contrary to the non-parametric and semi-parametric approaches, parametric - also called analytical - approaches make the assumption&lt;sup id=&quot;fnref:91&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:91&quot; class=&quot;footnote&quot;&gt;70&lt;/a&gt;&lt;/sup&gt; that the whole portfolio return distribution 
can be described by a parametric distribution $F_{\theta}$ whose parameters $\theta$ need to be estimated from observations.&lt;/p&gt;

&lt;p&gt;The $(1 - \alpha)$% quantile of the portfolio return distribution is then simply obtained by inverting that parametric distribution, which leads to the definition of the parametric portfolio VaR estimator as&lt;/p&gt;

\[\text{VaR}_{\alpha} = - F_{\theta}^{-1} (1 - \alpha)\]

&lt;p&gt;Parametric approaches thus replace the problem of accurately computing the quantile of an empirical distribution of observations by the problem of choosing a parametric distribution that best fits these observations.&lt;/p&gt;

&lt;p&gt;A couple of examples:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;If the portfolio returns are assumed to be distributed according to a Gaussian distribution, the associated portfolio VaR is called &lt;em&gt;Gaussian Value-at-Risk&lt;/em&gt; (GVaR) and is computed through the formula&lt;sup id=&quot;fnref:81&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:81&quot; class=&quot;footnote&quot;&gt;71&lt;/a&gt;&lt;/sup&gt;&lt;/p&gt;

\[\text{GVaR}_{\alpha} (X) = - \mu - \sigma z_{1 - \alpha}\]

    &lt;p&gt;, where:&lt;/p&gt;
    &lt;ul&gt;
      &lt;li&gt;The location parameter $\mu$ and the scale parameter $\sigma$ are usually&lt;sup id=&quot;fnref:80&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:80&quot; class=&quot;footnote&quot;&gt;72&lt;/a&gt;&lt;/sup&gt; estimated by their sample counterparts.&lt;/li&gt;
      &lt;li&gt;$z_{1 - \alpha}$ is the $1 - \alpha$ &lt;a href=&quot;https://en.wikipedia.org/wiki/Normal_distribution#Quantile_function&quot;&gt;quantile of the standard normal distribution&lt;/a&gt;.&lt;/li&gt;
    &lt;/ul&gt;

    &lt;p&gt;This is the assumption made in the RiskMetrics model&lt;sup id=&quot;fnref:84&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:84&quot; class=&quot;footnote&quot;&gt;73&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;If the portfolio returns are assumed to be distributed according to a heavy-tailed distribution, several distributions can be used:&lt;/p&gt;
    &lt;ul&gt;
      &lt;li&gt;A &lt;a href=&quot;/blog/corrected-cornish-fisher-expansion-improving-the-accuracy-of-modified-value-at-risk/&quot;&gt;Cornish-Fisher distribution&lt;/a&gt;, whose associated portfolio VaR is known as &lt;em&gt;Modified Value-at-Risk&lt;/em&gt;&lt;/li&gt;
      &lt;li&gt;A &lt;a href=&quot;/blog/beyond-modified-value-at-risk-application-of-gaussian-mixtures-to-the-computation-of-value-at-risk/&quot;&gt;Gaussian mixture distribution&lt;/a&gt;&lt;/li&gt;
      &lt;li&gt;
        &lt;p&gt;…&lt;/p&gt;

        &lt;p&gt;Here, as a side note, even though &lt;em&gt;it is known that financial time series usually exhibit skewed and fat-tailed distributions&lt;sup id=&quot;fnref:56:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:56&quot; class=&quot;footnote&quot;&gt;60&lt;/a&gt;&lt;/sup&gt;, there is no complete agreement on what distribution could fit them best&lt;/em&gt;&lt;sup id=&quot;fnref:60:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:60&quot; class=&quot;footnote&quot;&gt;44&lt;/a&gt;&lt;/sup&gt;, so that finding the best distribution to use is as much art as science…&lt;/p&gt;
      &lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;
&lt;/ul&gt;
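&lt;p&gt;For instance, the Gaussian VaR formula above reduces to a few lines thanks to the Python standard library (the function name is mine):&lt;/p&gt;

```python
from statistics import NormalDist, mean, stdev

def gaussian_var(returns, alpha):
    # Gaussian VaR: mu and sigma estimated by their sample counterparts,
    # z_(1 - alpha) the standard normal quantile.
    mu = mean(returns)
    sigma = stdev(returns)
    z = NormalDist().inv_cdf(1.0 - alpha)
    return -mu - sigma * z
```

&lt;p&gt;For $\alpha = 95$%, $z_{1 - \alpha} \approx -1.645$, so that $\text{GVaR}_{95\%} \approx 1.645 \sigma - \mu$.&lt;/p&gt;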

&lt;p&gt;Parametric VaR estimation approaches might be very successful, especially with long risk horizons (months, years…) and/or with not too extreme confidence levels (90%, 95%…).&lt;/p&gt;

&lt;p&gt;Nevertheless, they might also &lt;em&gt;not be able to provide an adequate description of the whole range of data, resulting in a good fit of the body but a non accurate description of the tails&lt;/em&gt;&lt;sup id=&quot;fnref:45:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:45&quot; class=&quot;footnote&quot;&gt;41&lt;/a&gt;&lt;/sup&gt;, leading to biases in VaR estimates.&lt;/p&gt;

&lt;h2 id=&quot;implementation-in-portfolio-optimizer&quot;&gt;Implementation in Portfolio Optimizer&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Portfolio Optimizer&lt;/strong&gt; supports different portfolio Value-at-Risk estimation methods, c.f. &lt;a href=&quot;https://docs.portfoliooptimizer.io/&quot;&gt;the documentation&lt;/a&gt;:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Empirical portfolio VaR estimation&lt;/li&gt;
  &lt;li&gt;Extrapolated empirical portfolio VaR estimation&lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Kernel-smoothed empirical portfolio VaR estimation&lt;/p&gt;

    &lt;p&gt;For that estimation method, the kernel bandwidth parameter $h$ is automatically computed using a proprietary variation of the improved Sheather and Jones rule described in Botev et al.&lt;sup id=&quot;fnref:108&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:108&quot; class=&quot;footnote&quot;&gt;74&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;EVT-based portfolio VaR estimation (both GPD-based and Hill estimator-based)&lt;/p&gt;

    &lt;p&gt;For these estimation methods:&lt;/p&gt;
    &lt;ul&gt;
      &lt;li&gt;The number of extreme observations $k$ is automatically computed using a proprietary variation of the goodness-of-fit procedure described in El-Aroui and Diebolt&lt;sup id=&quot;fnref:109&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:109&quot; class=&quot;footnote&quot;&gt;75&lt;/a&gt;&lt;/sup&gt;.&lt;/li&gt;
      &lt;li&gt;The estimated GPD parameters are unbiased through the formulas described in Giles et al.&lt;sup id=&quot;fnref:110&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:110&quot; class=&quot;footnote&quot;&gt;76&lt;/a&gt;&lt;/sup&gt;.&lt;/li&gt;
      &lt;li&gt;The estimated Hill and Weissman estimators are unbiased through the formulas described in de Haan et al.&lt;sup id=&quot;fnref:50:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:50&quot; class=&quot;footnote&quot;&gt;51&lt;/a&gt;&lt;/sup&gt; and corrected in Chavez-Demoulin and Guillou&lt;sup id=&quot;fnref:104&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:104&quot; class=&quot;footnote&quot;&gt;77&lt;/a&gt;&lt;/sup&gt;.&lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Parametric portfolio VaR estimation&lt;/p&gt;

    &lt;p&gt;The supported parametric distributions are:&lt;/p&gt;
    &lt;ul&gt;
      &lt;li&gt;The Gaussian distribution&lt;/li&gt;
      &lt;li&gt;The Gaussian mixture distribution&lt;/li&gt;
      &lt;li&gt;The Cornish-Fisher distribution&lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;h2 id=&quot;conclusion&quot;&gt;Conclusion&lt;/h2&gt;

&lt;p&gt;This blog post described some of the most well-known methods for univariate Value-at-Risk estimation.&lt;/p&gt;

&lt;p&gt;Thanks to these, it is possible to analyze the past behaviour of a financial portfolio, but their real interest lies in univariate Value-at-Risk forecasting, which will be the subject of the next blog post in this series.&lt;/p&gt;

&lt;p&gt;Stay tuned!&lt;/p&gt;

&lt;p&gt;Meanwhile, feel free to &lt;a href=&quot;https://www.linkedin.com/in/roman-rubsamen/&quot;&gt;connect with me on LinkedIn&lt;/a&gt; or to &lt;a href=&quot;https://twitter.com/portfoliooptim&quot;&gt;follow me on Twitter&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;–&lt;/p&gt;

&lt;div class=&quot;footnotes&quot; role=&quot;doc-endnotes&quot;&gt;
  &lt;ol&gt;
    &lt;li id=&quot;fn:1&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.ressources-actuarielles.net/C1256CFC001E6549/0/738BB4496121216AC1257A7C006702BA&quot;&gt;Gueant, O., Computing the Value at Risk of a Portfolio: Academic literature and Practionners’ response, EMMA, Working Paper&lt;/a&gt;. &lt;a href=&quot;#fnref:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:1:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:15&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://papers.ssrn.com/sol3/papers.cfm?abstract_id=356220&quot;&gt;Manganelli, Simone and Engle, Robert F., Value at Risk Models in Finance (August 2001). ECB Working Paper No. 75&lt;/a&gt;. &lt;a href=&quot;#fnref:15&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:15:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:15:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:15:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:9&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Basel II requires&lt;sup id=&quot;fnref:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:11&quot; class=&quot;footnote&quot;&gt;78&lt;/a&gt;&lt;/sup&gt; to calculate market risk capital requirements using VaR at a 99% confidence level over a 10-day horizon. &lt;a href=&quot;#fnref:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:12&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Basel III requires&lt;sup id=&quot;fnref:13&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:13&quot; class=&quot;footnote&quot;&gt;79&lt;/a&gt;&lt;/sup&gt; internal backtesting procedures based on VaR and Stress VaR (VaR applied to a market stress period). &lt;a href=&quot;#fnref:12&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:14&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The SEC Rule 18f-4 requires companies to calculate daily VaR at a 99% confidence level over a 20-day horizon and using at least 3 years of historical data; it also requires companies to backtest their VaR models daily over a 1-day horizon. &lt;a href=&quot;#fnref:14&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:82&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://climateimpact.edhec.edu/publications/sensitivity-portfolio-var-and-cvar-portfolio&quot;&gt;Stoyan V. Stoyanov, Svetlozar T. Rachev, Frank J. Fabozzi, Sensitivity of portfolio VaR and CVaR to portfolio return characteristics, Working paper&lt;/a&gt;. &lt;a href=&quot;#fnref:82&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:20&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;C.f. Dowd&lt;sup id=&quot;fnref:23:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;24&lt;/a&gt;&lt;/sup&gt;, &lt;em&gt;the VaR is the negative of the relevant P/L observation because P/L is positive for profitable outcomes and negative for losses, and the VaR is the maximum likely loss (rather than profit) at the specified probability&lt;/em&gt;&lt;sup id=&quot;fnref:23:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;24&lt;/a&gt;&lt;/sup&gt;, so that VaR is a positive percentage; to be noted, though, that VaR can be negative when no loss is incurred within the confidence level, in which case it is meaningless; c.f. Daníelsson&lt;sup id=&quot;fnref:68:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:68&quot; class=&quot;footnote&quot;&gt;14&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:20&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:18&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;This is the case when the portfolio return &lt;a href=&quot;https://en.wikipedia.org/wiki/Cumulative_distribution_function&quot;&gt;cumulative distribution function&lt;/a&gt; is strictly increasing and continuous; otherwise, a similar formula is still valid, with $F_X^{-1}$ the generalized inverse distribution function of $X$, but these subtleties - important in mathematical proofs and in numerical implementations - are out of scope of this blog post. &lt;a href=&quot;#fnref:18&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:16&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://merage.uci.edu/~jorion/var/&quot;&gt;Jorion, P. (2007). Value at risk: The new benchmark for managing financial risk. New York, NY: McGraw-Hill&lt;/a&gt;. &lt;a href=&quot;#fnref:16&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:16:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:16:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:85&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://academic.oup.com/jfec/article-abstract/4/1/53/833052?redirectedFrom=fulltext&quot;&gt;Keith Kuester, Stefan Mittnik, Marc S. Paolella, Value-at-Risk Prediction: A Comparison of Alternative Strategies, Journal of Financial Econometrics, Volume 4, Issue 1, Winter 2006, Pages 53–89&lt;/a&gt;. &lt;a href=&quot;#fnref:85&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:89&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Also called top-down VaR models&lt;sup id=&quot;fnref:22:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:22&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt; or portfolio aggregation-based models. &lt;a href=&quot;#fnref:89&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:87&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.msci.com/www/research-report/riskmetrics-monitor-riskmetrics/018920692&quot;&gt;Zangari, Peter, 1997, Streamlining the market risk measurement process, RiskMetrics Monitor, 1, 29–35&lt;/a&gt;. &lt;a href=&quot;#fnref:87&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:87:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:87:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:22&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;http://dx.doi.org/10.2139/ssrn.2942138&quot;&gt;Ballotta, L. ORCID: 0000-0002-2059-6281 and Fusai, G. ORCID: 0000-0001-9215-2586 (2017). A Gentle Introduction to Value at Risk&lt;/a&gt;. &lt;a href=&quot;#fnref:22&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:22:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:22:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:22:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:22:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:68&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://onlinelibrary.wiley.com/doi/book/10.1002/9781119205869&quot;&gt;Jon Danielsson, Financial Risk Forecasting: The Theory and Practice of Forecasting Market Risk, with Implementation in R and Matlab, Wiley 2011&lt;/a&gt;. &lt;a href=&quot;#fnref:68&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:68:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:68:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:68:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:68:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:68:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:68:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:68:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:68:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:68:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;10&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:68:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;11&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:17&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Danielsson&lt;sup id=&quot;fnref:68:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:68&quot; class=&quot;footnote&quot;&gt;14&lt;/a&gt;&lt;/sup&gt; comes to mind. &lt;a href=&quot;#fnref:17&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:28&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://econwpa.ub.uni-muenchen.de/econ-wp/fin/papers/9605/9605001.pdf&quot;&gt;J. S. Butler &amp;amp; Barry Schachter, 1996. Improving Value-At-Risk Estimates By Combining Kernel Estimation With Historical Simulation, Finance 9605001, University Library of Munich, Germany&lt;/a&gt;. &lt;a href=&quot;#fnref:28&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:28:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:28:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:28:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:28:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:46&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;That report was required by 4:15pm, which is why it originally became known as the 4.15 report. &lt;a href=&quot;#fnref:46&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:21&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://econpapers.repec.org/paper/wpawuwpmh/0207001.htm&quot;&gt;Glyn A. Holton, (2002), History of Value-at-Risk: 1922-1998, Method and Hist of Econ Thought, University Library of Munich, Germany&lt;/a&gt;. &lt;a href=&quot;#fnref:21&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:21:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:27&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The two horizons might be different, but this is out of scope of this blog post. &lt;a href=&quot;#fnref:27&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:7&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.jstor.org/stable/20076262&quot;&gt;Danielsson, Jon, and Casper G. De Vries. Value-at-Risk and Extreme Returns. Annales d’Économie et de Statistique, no. 60, 2000, pp. 239–70&lt;/a&gt;. &lt;a href=&quot;#fnref:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:7:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:66&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://agupubs.onlinelibrary.wiley.com/doi/abs/10.1029/92WR02466&quot;&gt;Lall, U., Y. Moon, and K. Bosworth (1993), Kernel flood frequency estimators: Bandwidth selection and kernel choice, Water Resour. Res.,29(4), 1003–1015&lt;/a&gt;. &lt;a href=&quot;#fnref:66&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:83&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.msci.com/documents/10199/5915b101-4206-4ba0-aee2-3449d5c7e95a&quot;&gt;RiskMetrics. Technical Document, J.P.Morgan/Reuters, New York, 1996. Fourth Edition&lt;/a&gt;. &lt;a href=&quot;#fnref:83&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:29&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://academic.oup.com/jfec/article-abstract/3/2/227/834153&quot;&gt;Song Xi Chen, Cheng Yong Tang, Nonparametric Inference of Value-at-Risk for Dependent Financial Returns, Journal of Financial Econometrics, Volume 3, Issue 2, Spring 2005, Pages 227–255&lt;/a&gt;. &lt;a href=&quot;#fnref:29&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:29:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:29:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:29:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:29:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:29:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:29:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:29:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:29:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:29:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;10&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:23&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.pm-research.com/content/iijderiv/8/3/23&quot;&gt;Kevin Dowd, Estimating VaR with Order Statistics, The Journal of Derivatives, Spring 2001, 8 (3) 23-30&lt;/a&gt;. &lt;a href=&quot;#fnref:23&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:23:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:23:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:23:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:33&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;To be noted that Dowd&lt;sup id=&quot;fnref:23:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;24&lt;/a&gt;&lt;/sup&gt; proposes to &lt;em&gt;take the sixth observation as [the] 5% VaR because we want 5% of the probability mass to lie to the left of [the] VaR&lt;/em&gt;&lt;sup id=&quot;fnref:23:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;24&lt;/a&gt;&lt;/sup&gt;, but other authors propose to use the fifth observation instead&lt;sup id=&quot;fnref:22:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:22&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;&lt;sup id=&quot;fnref:26:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:26&quot; class=&quot;footnote&quot;&gt;29&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:33&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
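The two order-statistics conventions can be compared with a minimal sketch (simulated returns, numpy assumed; not the author's own code):

```python
import numpy as np

# 100 hypothetical daily portfolio returns, sorted in ascending order
rng = np.random.default_rng(42)
returns = np.sort(rng.normal(0.0, 0.01, 100))

# 5% VaR from the order statistics of n = 100 observations:
# - Dowd's convention: the sixth-lowest observation, so that 5% of the
#   probability mass lies strictly to the left of the VaR
# - alternative convention: the fifth-lowest observation
var_dowd = -returns[5]  # sixth observation (0-based index 5)
var_alt = -returns[4]   # fifth observation (0-based index 4)
```

Since the returns are sorted ascending, Dowd's choice is never larger than the alternative one.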
    &lt;li id=&quot;fn:35&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Many other choices - at least 9 - are possible though, c.f. Hyndman and Fan&lt;sup id=&quot;fnref:34&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:34&quot; class=&quot;footnote&quot;&gt;80&lt;/a&gt;&lt;/sup&gt;; ultimately, what is needed is an estimator of the $(1 - \alpha)$% quantile of the empirical portfolio return distribution. &lt;a href=&quot;#fnref:35&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:35:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
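For illustration, several of the Hyndman and Fan quantile estimator types are exposed through numpy's `method` parameter (illustrative data; not the author's own code):

```python
import numpy as np

# Hypothetical sample of 10 daily portfolio returns
x = np.array([-0.05, -0.03, -0.02, -0.01, 0.0, 0.01, 0.02, 0.03, 0.04, 0.06])

# A few of the Hyndman-Fan quantile definitions, as implemented by numpy;
# each one gives a (slightly) different estimate of the 5% quantile
for method in ("inverted_cdf", "averaged_inverted_cdf", "linear", "median_unbiased"):
    print(method, np.quantile(x, 0.05, method=method))
```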
    &lt;li id=&quot;fn:44&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;For $1 - \alpha \in ]\frac{1}{n+1}, \frac{n}{n+1}[$, that is, no extrapolation beyond the observed sample is possible. &lt;a href=&quot;#fnref:44&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:44:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:44:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:24&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Such a linearly interpolated quantile estimator has been known since at least Parzen&lt;sup id=&quot;fnref:25&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:25&quot; class=&quot;footnote&quot;&gt;81&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:24&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:26&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://academic.oup.com/jrsssb/article-abstract/63/4/717/7083398?redirectedFrom=fulltext&quot;&gt;Peter Hall and Andrew Rieck. (2001). Improving Coverage Accuracy of Nonparametric Prediction Intervals. Journal of the Royal Statistical Society. Series B (Statistical Methodology), 63(4), 717–725&lt;/a&gt;. &lt;a href=&quot;#fnref:26&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:26:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:43&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://link.springer.com/article/10.1023/A:1020783911574&quot;&gt;Hutson, A.D. A Semi-Parametric Quantile Function Estimator for Use in Bootstrap Estimation Procedures. Statistics and Computing 12, 331–338 (2002)&lt;/a&gt;. &lt;a href=&quot;#fnref:43&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:43:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:43:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:43:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:43:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:43:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:43:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:37&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;To be noted that weak dependence is a kind of misnomer, because this type of dependence actually covers a very broad range of time series models! &lt;a href=&quot;#fnref:37&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:31&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://doi.org/10.1016/S0927-5398(00)00011-6&quot;&gt;Gourieroux, C., Scaillet, O. and Laurent, J.P. (2000). Sensitivity analysis of Values at Risk. Journal of Empirical Finance, 7, 225-245&lt;/a&gt;. &lt;a href=&quot;#fnref:31&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:31:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:31:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:31:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:31:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:31:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:40&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Asymmetric kernel functions also exist, c.f. for example Abadir and Lawford&lt;sup id=&quot;fnref:41&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:41&quot; class=&quot;footnote&quot;&gt;82&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:40&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:39&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;In a mean squared-error sense. &lt;a href=&quot;#fnref:39&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:32&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://link.springer.com/book/10.1007/b13794&quot;&gt;Alexandre B. Tsybakov. 2008. Introduction to Nonparametric Estimation (1st. ed.). Springer Publishing Company, Incorporated&lt;/a&gt;. &lt;a href=&quot;#fnref:32&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:32:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:32:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:38&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The performance of a kernel estimator $K$ is defined in terms of &lt;em&gt;the ratio of sample sizes necessary to obtain the same minimum asymptotic mean integrated squared error (for a given [function $f$ that is being kernel-smoothed]) when using $K$ as when using [the Epanechnikov kernel]&lt;/em&gt;&lt;sup id=&quot;fnref:30:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:30&quot; class=&quot;footnote&quot;&gt;37&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:38&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:30&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.taylorfrancis.com/books/mono/10.1201/b14876/kernel-smoothing-wand-jones&quot;&gt;Wand, M.P., &amp;amp; Jones, M.C. (1994). Kernel Smoothing (1st ed.). Chapman and Hall/CRC&lt;/a&gt;. &lt;a href=&quot;#fnref:30&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:30:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:30:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:30:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:30:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:42&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.risk.net/journal-risk/2161066/kernel-quantile-based-estimation-expected-shortfall&quot;&gt;Keming Yu &amp;amp; Abdallah K. Ally &amp;amp; Shanchao Yang &amp;amp; David J. Hand, Kernel quantile based estimation of expected shortfall, Journal of Risk&lt;/a&gt;. &lt;a href=&quot;#fnref:42&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:42:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:48&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.airitilibrary.com/Article/Detail/05296528-200609-44-3-271-295-a&quot;&gt;Cheng, M.-Y. and S. Sun (2006). Bandwidth selection for kernel quantile estimation. Journal of the Chinese Statistical Association 44 (3), 271–295&lt;/a&gt;. &lt;a href=&quot;#fnref:48&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:48:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:49&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Yet, Chen and Tang&lt;sup id=&quot;fnref:29:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:29&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt; note that &lt;em&gt;the reduction in RMSE is not very large for large samples&lt;/em&gt;&lt;sup id=&quot;fnref:29:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:29&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt;, confirming the theoretical results that &lt;em&gt;the reduction is of second order only&lt;/em&gt;&lt;sup id=&quot;fnref:29:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:29&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:49&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:45&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://link.springer.com/article/10.1007/s00477-021-02102-0&quot;&gt;Banfi, F., Cazzaniga, G. &amp;amp; De Michele, C. Nonparametric extrapolation of extreme quantiles: a comparison study. Stoch Environ Res Risk Assess 36, 1579–1596 (2022)&lt;/a&gt;. &lt;a href=&quot;#fnref:45&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:45:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:45:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:45:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:45:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:45:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:45:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:75&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Potential solutions to this problem might be to use 1) a data-driven mixture of a Gaussian and a Cauchy kernel as proposed in Banfi et al.&lt;sup id=&quot;fnref:45:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:45&quot; class=&quot;footnote&quot;&gt;41&lt;/a&gt;&lt;/sup&gt; or 2) a preliminary transformation of the data with a Champernowne distribution as proposed in Buch-Kromann et al.&lt;sup id=&quot;fnref:98&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:98&quot; class=&quot;footnote&quot;&gt;83&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:75&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:57&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The family of heavy-tailed distributions encompasses all the &lt;a href=&quot;https://en.wikipedia.org/wiki/Fat-tailed_distribution&quot;&gt;fat-tailed distributions&lt;/a&gt; encountered in finance, like &lt;a href=&quot;https://en.wikipedia.org/wiki/Student%27s_t-distribution&quot;&gt;the Student-t distribution&lt;/a&gt;, and more. &lt;a href=&quot;#fnref:57&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:60&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://papers.ssrn.com/sol3/papers.cfm?abstract_id=1998740&quot;&gt;Rocco, Marco, Extreme Value Theory for Finance: A Survey (February 3, 2012). Bank of Italy Occasional Paper No. 99&lt;/a&gt;. &lt;a href=&quot;#fnref:60&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:60:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:60:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:60:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:60:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:60:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:60:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:60:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:60:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:60:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;10&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:54&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://doi.org/10.1016/S0927-5398(97)00008-X&quot;&gt;Danielsson, J., de Vries, C., 1997. Tail index and quantile estimation with very high frequency data. Journal of Empirical Finance 4, 241–257&lt;/a&gt;. &lt;a href=&quot;#fnref:54&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:65&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.jstor.org/stable/2958370&quot;&gt;Hill, B.M. (1975) A Simple General Approach to Inference About the Tail of a Distribution. Annals of Statistics, 3, 1163-1174&lt;/a&gt;. &lt;a href=&quot;#fnref:65&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:65:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:72&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.intechopen.com/chapters/65787&quot;&gt;B. Karima and B. Youcef, Asymptotic Normality of Hill’s Estimator under Weak Dependence, Statistical Methodologies. IntechOpen, Feb. 26, 2020&lt;/a&gt;. &lt;a href=&quot;#fnref:72&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:71&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.jstor.org/stable/3318788&quot;&gt;Drees, Holger. Extreme Quantile Estimation for Dependent Data, with Applications to Finance. Bernoulli, vol. 9, no. 4, 2003, pp. 617–57&lt;/a&gt;. &lt;a href=&quot;#fnref:71&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:71:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:71:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:52&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://doi.org/10.1016/S0927-5398(00)00012-8&quot;&gt;Alexander J. McNeil, Rudiger Frey, Estimation of tail-related risk measures for heteroscedastic financial time series: an extreme value approach, Journal of Empirical Finance, Volume 7, Issues 3–4, 2000, Pages 271-300&lt;/a&gt;. &lt;a href=&quot;#fnref:52&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:52:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:99&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;More precisely, to the standardized residuals of an AR(1)-GARCH(1,1) model of these asset returns; the rationale is that modeling excesses over a threshold is better justified for AR(1)-GARCH(1,1) standardized residuals, which are approximately i.i.d., than for raw asset returns. &lt;a href=&quot;#fnref:99&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
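To make the construction concrete, here is a minimal sketch with fixed, purely illustrative AR(1)-GARCH(1,1) parameters; a real application would fit them by maximum likelihood (simulated returns, numpy assumed; not the author's own code):

```python
import numpy as np

rng = np.random.default_rng(0)
r = rng.normal(0.0, 0.01, 500)  # hypothetical raw asset returns

# Illustrative (not fitted) AR(1)-GARCH(1,1) parameters
phi, omega, alpha, beta = 0.05, 1e-6, 0.08, 0.90

mu = np.zeros_like(r)    # conditional means
sig2 = np.empty_like(r)  # conditional variances
sig2[0] = np.var(r)
for t in range(1, len(r)):
    mu[t] = phi * r[t - 1]                                                      # AR(1) mean
    sig2[t] = omega + alpha * (r[t - 1] - mu[t - 1]) ** 2 + beta * sig2[t - 1]  # GARCH(1,1)

# Standardized residuals: approximately i.i.d., hence better suited
# to a peaks-over-threshold (EVT) analysis than the raw returns
z = (r - mu) / np.sqrt(sig2)
```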
    &lt;li id=&quot;fn:50&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://link.springer.com/article/10.1007/s00780-015-0287-6&quot;&gt;de Haan, L., Mercadier, C. &amp;amp; Zhou, C. Adapting extreme value statistics to financial time series: dealing with bias and serial dependence. Finance Stoch 20, 321–354 (2016)&lt;/a&gt;. &lt;a href=&quot;#fnref:50&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:50:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:50:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:70&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://pmc.ncbi.nlm.nih.gov/articles/PMC9818059/&quot;&gt;Benito S, Lopez-Martín C, Navarro MA. Assessing the importance of the choice threshold in quantifying market risk under the POT approach (EVT). Risk Manag. 2023;25(1)&lt;/a&gt;. &lt;a href=&quot;#fnref:70&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:103&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://link.springer.com/article/10.1057/s41283-022-00106-w&quot;&gt;Benito, S., Lopez-Martín, C. &amp;amp; Navarro, M.A. Assessing the importance of the choice threshold in quantifying market risk under the POT approach (EVT). Risk Manag 25, 6 (2023)&lt;/a&gt;. &lt;a href=&quot;#fnref:103&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:103:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:103:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:107&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.tandfonline.com/doi/abs/10.1080/07350015.1996.10524640&quot;&gt;Bollerslev, T., and Ghysels, E. (1996). Periodic Autoregressive Conditional Heteroskedasticity. Journal of Business &amp;amp; Economic Statistics, 14, 139–151&lt;/a&gt;. &lt;a href=&quot;#fnref:107&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:102&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://projecteuclid.org/journals/annals-of-statistics/volume-11/issue-4/Estimating-the-Stable-Index-alpha-in-Order-to-Measure-Tail/10.1214/aos/1176346318.full&quot;&gt;William H. DuMouchel. “Estimating the Stable Index α in Order to Measure Tail Thickness: A Critique.” Ann. Statist. 11 (4) 1019 - 1031, December, 1983&lt;/a&gt;. &lt;a href=&quot;#fnref:102&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:100&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.jstor.org/stable/1269343&quot;&gt;J. R. M. Hosking and J. R. Wallis, Parameter and Quantile Estimation for the Generalized Pareto Distribution, Technometrics, Vol. 29, No. 3 (Aug., 1987), pp. 339-349&lt;/a&gt;. &lt;a href=&quot;#fnref:100&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:101&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://doi.org/10.1016/j.csda.2005.09.011&quot;&gt;Alberto Luceno, Fitting the generalized Pareto distribution to data using maximum goodness-of-fit estimators, Computational Statistics &amp;amp; Data Analysis, Volume 51, Issue 2, 2006, Pages 904-917&lt;/a&gt;. &lt;a href=&quot;#fnref:101&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:51&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;In EVT terms, the true portfolio return distribution is assumed to be in the Fréchet domain of attraction. &lt;a href=&quot;#fnref:51&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:55&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://tinbergen.nl/discussion-paper/4174/98-016-2-beyond-the-sample-extreme-quantile-and-probability-estimation&quot;&gt;Danielsson J., de Vries C. G. (1997). Beyond the Sample: Extreme Quantile and Probability Estimation, Mimeo, Tinbergen Institute Rotterdam&lt;/a&gt;. &lt;a href=&quot;#fnref:55&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:56&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.tandfonline.com/doi/abs/10.1080/713665670&quot;&gt;R. Cont (2001) Empirical properties of asset returns: stylized facts and statistical issues, Quantitative Finance, 1:2, 223-236&lt;/a&gt;. &lt;a href=&quot;#fnref:56&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:56:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:61&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The cumulative distribution function of the portfolio returns is in the maximum domain of attraction of a Fréchet-type extreme value distribution if and only if $1 - F$ has that form, c.f. Rocco&lt;sup id=&quot;fnref:60:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:60&quot; class=&quot;footnote&quot;&gt;44&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:61&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:59&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The tail index is also known as the &lt;em&gt;extreme value index&lt;/em&gt;. &lt;a href=&quot;#fnref:59&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:64&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.jstor.org/stable/2286285&quot;&gt;Weissman, I.: Estimation of parameters and large quantiles based on the k largest observations. J. Am. Stat. Assoc. 73, 812–815 (1978)&lt;/a&gt;. &lt;a href=&quot;#fnref:64&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:73&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The formula for the EVT/Weissman-based portfolio VaR estimator is the one in Nieto and Ruiz&lt;sup id=&quot;fnref:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;84&lt;/a&gt;&lt;/sup&gt; and is slightly different from the one in Danielsson and de Vries&lt;sup id=&quot;fnref:55:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:55&quot; class=&quot;footnote&quot;&gt;59&lt;/a&gt;&lt;/sup&gt;, c.f. Danielsson&lt;sup id=&quot;fnref:68:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:68&quot; class=&quot;footnote&quot;&gt;14&lt;/a&gt;&lt;/sup&gt;, in which a general threshold $u$ is used instead of $r'_{(\hat{k})}$ or $r'_{(\hat{k}+1)}$. &lt;a href=&quot;#fnref:73&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:67&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://rivista-statistica.unibo.it/article/view/9533&quot;&gt;Fedotenkov, I. (2020). A Review of More than One Hundred Pareto-Tail Index Estimators. Statistica, 80(3), 245–299&lt;/a&gt;. &lt;a href=&quot;#fnref:67&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:105&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;As required under Basel III framework&lt;sup id=&quot;fnref:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;85&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:105&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:63&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://ebslgwp.hhs.se/heccah/abs/heccah0646.htm&quot;&gt;Longin, F.M., and B. Solnik (1997). Correlation structure of international equity markets during extremely volatile periods. Working Paper 97-039, ESSEC, Cergy-Pontoise, France&lt;/a&gt;. &lt;a href=&quot;#fnref:63&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:63:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:76&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.scienpress.com/journal_focus.asp?main_id=56&amp;amp;Sub_id=IV&amp;amp;Issue=2530847&quot;&gt;Saeed Shaker-Akhtekhane &amp;amp; Solmaz Poorabbas, 2023. Value-at-Risk Estimation Using an Interpolated Distribution of Financial Returns Series,  Journal of Applied Finance &amp;amp; Banking, SCIENPRESS Ltd, vol. 13(1), pages 1-6&lt;/a&gt;. &lt;a href=&quot;#fnref:76&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:76:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:76:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:76:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:78&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://epubs.siam.org/doi/10.1137/0915069&quot;&gt;Wood, S. N., Monotonic Smoothing Splines Fitted by Cross Validation, 1994, SIAM Journal on Scientific Computing, 1126-1133, 15, 5&lt;/a&gt;. &lt;a href=&quot;#fnref:78&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:78:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:91&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;In addition, with multivariate VaR models, parametric methods also need to &lt;em&gt;us[e] approximations of the pricing formulas of each [non simple] asset in the portfolio&lt;/em&gt;&lt;sup id=&quot;fnref:1:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;, leading to methods like Delta-Normal or Delta-Gamma-(Theta-)Normal based on Taylor expansions of the asset pricing formulas, c.f. Gueant&lt;sup id=&quot;fnref:1:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:91&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:81&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://papers.ssrn.com/sol3/papers.cfm?abstract_id=1024151&quot;&gt;Boudt, Kris and Peterson, Brian G. and Croux, Christophe, Estimation and Decomposition of Downside Risk for Portfolios with Non-Normal Returns (October 31, 2007). Journal of Risk, Vol. 11, No. 2, pp. 79-103, 2008&lt;/a&gt;. &lt;a href=&quot;#fnref:81&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:80&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2692543&quot;&gt;Martin, R. Douglas and Arora, Rohit, Inefficiency of Modified VaR and ES&lt;/a&gt;. &lt;a href=&quot;#fnref:80&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:84&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Gueant&lt;sup id=&quot;fnref:1:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; notes that &lt;em&gt;the RiskMetrics model for the distribution of the evolution of the risk factors is based on the assumption that log-returns of prices (or variations in the case of interest rates) are independent across time and normally distributed, when appropriately scaled by an appropriate measure of volatility&lt;/em&gt;&lt;sup id=&quot;fnref:1:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:84&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:108&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://projecteuclid.org/journals/annals-of-statistics/volume-38/issue-5/Kernel-density-estimation-via-diffusion/10.1214/10-AOS799.full&quot;&gt;Z. I. Botev. J. F. Grotowski. D. P. Kroese. “Kernel density estimation via diffusion.” Ann. Statist. 38 (5) 2916 - 2957, October 2010&lt;/a&gt;. &lt;a href=&quot;#fnref:108&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:109&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://doi.org/10.1016/S0167-9473(01)00087-1&quot;&gt;Mhamed-Ali El-Aroui, Jean Diebolt, On the use of the peaks over thresholds method for estimating out-of-sample quantiles, Computational Statistics &amp;amp; Data Analysis, Volume 39, Issue 4, 2002, Pages 453-475&lt;/a&gt;. &lt;a href=&quot;#fnref:109&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:110&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;http://dx.doi.org/10.1080/03610926.2014.887104&quot;&gt;David E. Giles, Hui Feng &amp;amp; Ryan T. Godwin (2016) Bias-corrected maximum likelihood estimation of the parameters of the generalized Pareto distribution, Communications in Statistics - Theory and Methods, 45:8, 2465-2483&lt;/a&gt;. &lt;a href=&quot;#fnref:110&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:104&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://doi.org/10.1016/j.insmatheco.2018.09.004&quot;&gt;Valerie Chavez-Demoulin, Armelle Guillou, Extreme quantile estimation for beta-mixing time series and applications, Insurance: Mathematics and Economics, Volume 83, 2018, Pages 59-74&lt;/a&gt;. &lt;a href=&quot;#fnref:104&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:11&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;http://www.bis.org/publ/bcbs193.pdf&quot;&gt;Basel Committee on Banking Supervision. Revisions to the Basel II market risk framework (Updated as of 31 December 2010). 2011&lt;/a&gt;. &lt;a href=&quot;#fnref:11&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:13&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.bis.org/publ/bcbs265.htm&quot;&gt;Basel Committee on Banking Supervision. Fundamental review of the trading book&lt;/a&gt;. &lt;a href=&quot;#fnref:13&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:34&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.jstor.org/stable/2684934&quot;&gt;Hyndman, R. J., &amp;amp; Fan, Y. (1996). Sample Quantiles in Statistical Packages. The American Statistician, 50(4), 361–365&lt;/a&gt;. &lt;a href=&quot;#fnref:34&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:25&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.jstor.org/stable/2286734&quot;&gt;Parzen E (1979), Nonparametric statistical data modeling, J Am Stat Assoc 74(365):105–121&lt;/a&gt;. &lt;a href=&quot;#fnref:25&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:41&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://doi.org/10.1016/j.econlet.2003.07.017&quot;&gt;Karim M Abadir, Steve Lawford, Optimal asymmetric kernels, Economics Letters, Volume 83, Issue 1, 2004, Pages 61-68&lt;/a&gt;. &lt;a href=&quot;#fnref:41&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:98&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://papers.ssrn.com/sol3/papers.cfm?abstract_id=704903&quot;&gt;Buch-Kromann, Tine and Nielsen, Jens Perch and Guillen, Montserrat and Bolancé, Catalina, Kernel Density Estimation for Heavy-Tailed Distributions Using the Champernowne Transformation (January 2005)&lt;/a&gt;. &lt;a href=&quot;#fnref:98&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:6&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://doi.org/10.1016/j.ijforecast.2015.08.003&quot;&gt;Maria Rosa Nieto, Esther Ruiz, Frontiers in VaR forecasting and backtesting, International Journal of Forecasting, Volume 32, Issue 2, 2016, Pages 475-501&lt;/a&gt;. &lt;a href=&quot;#fnref:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:5&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Basel III requires&lt;sup id=&quot;fnref:13:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:13&quot; class=&quot;footnote&quot;&gt;79&lt;/a&gt;&lt;/sup&gt; that the VaR measures for risks on both trading and banking books must be calculated at a 99.9% confidence level. &lt;a href=&quot;#fnref:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
  &lt;/ol&gt;
&lt;/div&gt;</content><author><name>Roman R.</name></author><category term="value at risk" /><category term="extreme value theory" /><category term="kernel smoothing" /><summary type="html">Value-at-Risk (VaR) is one of the most commonly used risk measures in the financial industry1 in part thanks to its simplicity - because VaR reduces the market risk associated with any portfolio to just one number2 - and in part due to regulatory requirements (Basel market risk frameworks34, SEC Rule 18f-45…). Nevertheless, when it comes to actual computations, the above definition is by no means constructive1 and accurately estimating VaR is a very challenging statistical problem2 for which several methods have been developed. In this blog post, I will describe some of the most well-known univariate VaR estimation methods, ranging from non-parametric methods based on empirical quantiles to semi-parametric methods involving kernel smoothing or extreme value theory and to parametric methods relying on distributional assumptions. Value-at-Risk Definition The Value-at-Risk of a portfolio of financial instruments corresponds to the maximum potential change in value of [that portfolio] with a given probability over a certain horizon2. More formally, the Value-at-Risk $VaR_{\alpha}$ of a portfolio over a time horizon $T$ (1 day, 10 days, 20 days…) and at a confidence level $\alpha$% $\in ]0,1[$ (95%, 97.5%, 99%…) can be defined6 as the opposite7 of the $1 - \alpha$ quantile of the portfolio return distribution over the time horizon $T$ \[\text{VaR}_{\alpha} = - \inf_{x} \left\{x \in \mathbb{R}, P(X \leq x) \geq 1 - \alpha \right\}\] , where $X$ is a random variable representing the portfolio return over the time horizon $T$. This formula is also equivalent8 to \[\text{VaR}_{\alpha} = - F_X^{-1}(1 - \alpha)\] , where $F_X^{-1}$ is the inverse cumulative distribution function, also called the quantile function, of the random variable $X$. 
Graphically, this definition is illustrated in: Figure 1, for a continuous portfolio return distribution at a generic confidence level $\alpha$% and over a generic horizon. Figure 1. Graphical illustration of a portfolio VaR as a quantile of its continuous return distribution. Source: Adapted from Yamai and Yoshiba. Figure 2, for a discrete portfolio return distribution at a confidence level $\alpha$ = 99% and over a 1-month horizon, which Jorion9 comments as follows: We need to find the loss that will not be exceeded in 99% of cases, or such that 1% of [the 624] observations - that is, 6 out of 624 occurrences - are lower. From Figure [2], this number is about -3.6%, [resulting in a portfolio VaR of 3.6%]. Figure 2. Graphical illustration of a portfolio VaR as a quantile of its discrete monthly return distribution, $\alpha$ = 99%. Source: Jorion. Univariate vs. multivariate Value-at-Risk A portfolio can be considered as both: An asset in itself, with its own return distribution. A weighted collection of individual assets, each with their own return distribution. This raises the question of whether first to aggregate profit and loss data and proceed with a univariate [VaR] model for the aggregate, or to start with disaggregate data10 and proceed with a multivariate VaR model from the disaggregated data. In this blog post, I will only discuss univariate11 VaR models - originally suggested by Zangari12 as simple and effective approach[es] for calculating Value-at-Risk12 - in which portfolio returns are considered as a univariate time series without reference to the portfolio constituents13. Indeed, since the goal of VaR is to measure the market risk of a portfolio, it seems reasonable to model the portfolio return series directly12. Arithmetic returns vs. logarithmic returns in Value-at-Risk calculations In VaR calculations, it is usually preferred, for a variety of reasons, to work with logarithmic returns rather than arithmetic (simple, linear) ones13, c.f.
Ballotta13 and Jorion9 for more details. In that case, though, because investors are primarily interested in simple returns14, the logarithmic VaR $\text{VaR}_{\alpha}^{(l)} $ needs to be converted into an arithmetic VaR $\text{VaR}_{\alpha}^{(a)} $. Thanks to the definition of VaR as a quantile of the portfolio return distribution and the relationship between arithmetic and logarithmic returns, this is easily done through the formula \[\text{VaR}_{\alpha}^{(a)} = 1 - \exp \left( - \text{VaR}_{\alpha}^{(l)} \right)\] While not frequently mentioned in the literature15, it is important to be aware of this subtlety. History of Value-at-Risk Searching for the best means to represent the risk exposure of a financial institution’s trading portfolio in a single number16 is a quest whose inception folklore attributes to Dennis Weatherstone at J.P. Morgan [in the late 1980s], who was looking for a way to convey meaningful risk exposure information to the financial institution’s board without the need for significant technical expertise on the part of the board members16. It is then Till Guldimann, head of global research at J.P. Morgan at that time, who designed what would come to be known as J.P. Morgan’s daily VaR report17 and who thus can be viewed as the creator of the term Value-at-Risk9. The interested reader is referred to Holton18 for an historical perspective on Value-at-Risk, in which the origins of VaR as a measure of risk are even traced back as far as 1922 to capital requirements the New York Stock Exchange imposed on member firms18. Value-at-Risk estimation When a sample of portfolio returns over a given time horizon $r_1,…,r_n$ is available - like in ex post analysis -, a textbook way to calculate the Value-at-Risk of that portfolio over the same horizon19 at a confidence level $\alpha \in ]0,1[$ is as the opposite of the $(1 - \alpha)$% quantile of the empirical return distribution $r_1,…,r_n$.
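This conversion can be sketched in a few lines; a minimal illustration, relying only on the relationship $r_a = e^{r_l} - 1$ between arithmetic and logarithmic returns:

```python
import math

def arithmetic_var_from_log_var(log_var: float) -> float:
    """Convert a logarithmic VaR into an arithmetic (simple-return) VaR.

    Follows from r_a = exp(r_l) - 1 applied to the return quantile that
    defines the VaR: VaR_a = 1 - exp(-VaR_l).
    """
    return 1.0 - math.exp(-log_var)

# Example with a hypothetical 5% logarithmic VaR; the arithmetic VaR is
# slightly below 5%.
print(arithmetic_var_from_log_var(0.05))
```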
Problem is, the discrete nature of the extreme returns of interest makes it difficult to accurately compute that quantile, as explained in Danielsson and de Vries20: In the interior, the empirical sampling distribution is very dense, with adjacent observations very close to each other. As a result the sampling distribution is very smooth in the interior and is the mean squared error consistent estimate of the true distribution. The closer one gets to the extremes, the longer the interval between adjacent returns becomes. This can be seen in [Figure 3] where the 7 largest and smallest returns on the stocks in the sample portfolio and SP-500 Index for 10 years are listed. These extreme observations are typically the most important for VaR analysis, however since these values are clearly discrete, the VaR will also be discrete, and hence be either underpredicted or overpredicted. Figure 3. Extreme daily returns for select U.S. stocks and S&amp;amp;P 500, 1987-1996. Source: Danielsson and de Vries. In other words, the quantile corresponding to the estimation of the Value-at-Risk […] rather depends on the realizations of the [portfolio returns] than on their probability distribution1, so that the Value-at-Risk calculated with a quantile of the empirical distribution will be highly unstable, especially when considering a Value-at-Risk with a high confidence level with only few available data1. This is why VaR estimation is a very challenging statistical problem2, sharing many similarities with the problem of estimating the frequency and/or severity of extreme events in other domains, like floods frequency estimation21 in hydrology. In order to compute a statistical estimator of a portfolio Value-at-Risk, three main approaches exist: Non-parametric approaches, that do not make any specific distributional assumptions on the portfolio return distribution and whose VaR estimators do not depend on any auxiliary parameter. 
Semi-parametric approaches, that do not make any specific distributional assumptions on the portfolio return distribution but whose VaR estimators depend on one or several auxiliary parameters. Parametric approaches, that make a specific distributional assumption on the portfolio return distribution and whose VaR estimators depend on one or several auxiliary parameters. Note that non-parametric and semi-parametric approaches might still make distributional assumptions, in particular for convergence proofs - like assuming that returns are independent and identically distributed (i.i.d.) -, but these assumptions are then generic in nature, contrary to parametric approaches which assume a very specific return distribution, like a Gaussian distribution, which is one of the most widely applied parametric probability distributions22 in finance. Non-parametric and semi-parametric Value-at-Risk estimation Chen and Yong Tang23 note that non-parametric and semi-parametric VaR estimators have the advantages of (i) being free of distributional assumptions […] while being able to capture fat-tail and asymmetry distribution of returns automatically; and (ii) imposing much weaker assumptions on the dynamics of the return process and allowing data “speak for themselves”23. Empirical quantile of the portfolio return distribution A well-known estimator of the $(1 - \alpha)$% quantile of any probability distribution is the empirical $(1 - \alpha)$% quantile of that distribution, which relies on order statistics. In the context of VaR estimation, the underlying idea is explained in Dowd24: If we have a sample of $n$ profit and loss (P/L) observations, we can regard each observation as giving an estimate of VaR at an implied probability level. For example, if $n$ = 100, we can take the 5% VaR as the negative of the sixth25 smallest P/L observation, the 1% VaR as the negative of the second-smallest, and so on.
This leads to the empirical portfolio VaR estimator, defined26 as the opposite of the $n (1 - \alpha) + 1$-th lowest portfolio return27 \[\text{VaR}_{\alpha} = -r_{\left( n (1 - \alpha) + 1 \right)}\] , where $r_{(1)} \leq r_{(2)} \leq … \leq r_{(n-1)} \leq r_{(n)}$ are the order statistics of the portfolio returns. Now, for arbitrary values of $n$ and $\alpha$, there is little chance that $n (1 - \alpha) + 1$ is an integer. In that case, two26 possible choices are: Either to define the opposite of the $\lfloor n (1 - \alpha) \rfloor + 1$-th lowest portfolio return27 as the empirical portfolio VaR estimator24 \[\text{VaR}_{\alpha} = - r_{\left( \lfloor n (1 - \alpha) \rfloor + 1 \right)}\] Or to define a linear interpolation28 between the opposite of the $\lfloor (n+1) \left( 1 - \alpha \right) \rfloor $-th and $ \lfloor (n+1) \left( 1 - \alpha \right) \rfloor + 1$-th lowest portfolio returns27 as the empirical portfolio VaR estimator293013 \[\text{VaR}_{\alpha} = - \left( 1 - \gamma \right) r_{\left( \lfloor (n+1) \left( 1 - \alpha \right) \rfloor \right)} - \gamma r_{\left( \lfloor (n+1) \left( 1 - \alpha \right) \rfloor + 1 \right)}\] , with $\gamma$ $= (n+1) \left( 1 - \alpha \right)$ $- \lfloor (n+1) \left( 1 - \alpha \right) \rfloor$. An interesting property of the resulting portfolio VaR estimator is that it is consistent in the presence of weak dependence31 between portfolio returns, c.f. Chen and Yong Tang23. In terms of drawbacks, the two major limitations of the empirical portfolio VaR estimator are that: It only takes into account a small part of the information contained in the [portfolio returns] distribution function1 - that is, at most two returns - which is highly inefficient, especially when the number of portfolio returns is already relatively small.
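The two conventions above can be sketched as follows; this is a minimal illustration of the order-statistic formulas, not a production implementation:

```python
import math

def empirical_var(returns, alpha):
    """Empirical VaR as the opposite of the (floor(n*(1-alpha)) + 1)-th
    order statistic of the returns sorted in increasing order."""
    r = sorted(returns)
    n = len(r)
    k = math.floor(n * (1 - alpha))
    return -r[k]  # 0-based index k <=> (k+1)-th order statistic

def empirical_var_interpolated(returns, alpha):
    """Empirical VaR with linear interpolation between the two order
    statistics surrounding (n+1)*(1-alpha)."""
    r = sorted(returns)
    n = len(r)
    h = (n + 1) * (1 - alpha)
    j = math.floor(h)
    g = h - j  # the gamma weight of the formula above
    j = min(max(j, 1), n - 1)  # clamp when the index falls outside the sample
    return -((1 - g) * r[j - 1] + g * r[j])

# Dowd's example: with n = 100 returns, the 5% VaR is the negative of the
# sixth smallest observation.
sample = [i / 100 for i in range(1, 101)]
print(empirical_var(sample, 0.95), empirical_var_interpolated(sample, 0.95))
```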
It cannot generate any information about the tail of the return distribution beyond the smallest sample observation16, which might lead to severely underestimating the true risk of the portfolio. Kernel-smoothed quantile of the portfolio return distribution A way to make better use of the information available in the empirical […] distribution1 than the empirical quantile estimator discussed in the previous sub-section is the kernel-smoothed quantile estimator introduced in Gourieroux et al.32. Kernel-smoothing is a methodology belonging to statistics and probability theory that can be thought of as a way of generalizing a histogram constructed with the sample data16, as illustrated in Figure 4. Figure 4. Histogram vs. kernel-smoothed density for the same sample of data. Source: Wikipedia. On Figure 4, where a histogram results in a density that is piecewise constant, a kernel[-smoothed] approximation results in a smooth density23. Coming back to the estimator of Gourieroux et al.32, it is defined as the $(1 - \alpha)$% quantile of a kernel-smoothed approximation of the portfolio return distribution, which essentially results in a weighted average of the order statistics around [$r_{\left( \lfloor n (1 - \alpha) \rfloor + 1 \right)}$] rather than […] a single order statistic23 or a linear interpolation between two order statistics. From a practical perspective, that VaR estimator is computed as follows: Select a kernel function $K$, usually23 taken as a symmetric33 probability density function. The theoretically optimal34 choice for such a kernel function is the Epanechnikov kernel, defined as $ K(u) = \frac{3}{4} \left( 1 - u^2 \right) I_{|u| \leq 1}$.
However, the literature suggests that the form of the kernel has little effect on the [accuracy] of the [kernel-smoothed approximation of the return distribution]32, mainly because: The theoretical framework used to establish the optimality of the Epanechnikov kernel relies on large-sample asymptotics, moreover in a debatable way35. The performances36 of other commonly used kernel functions are in any case very close to those of the Epanechnikov kernel37. So, in applications, the most common2332 choice for a kernel function is rather the Gaussian kernel, defined as $ K(u) = \frac{1}{\sqrt{2 \pi }} e^{- \frac{u^2}{2} }$. As a side note, using a kernel function to approximate the portfolio return distribution might look like a parametric approach to VaR estimation in disguise, but Butler and Schachter16 explain why this is not the case: Note that use of a normal or Gaussian kernel estimator does not make the ultimate estimation of the VaR parametric. As the sample size grows, the net sum of all the smoothed points approaches the true [portfolio return distribution], whatever that may be, irrespective of the method of smoothing the data. This is because the influence of each point becomes arbitrarily small as the sample size grows, so the choice of kernel imposes no restrictions on the results. Select a kernel bandwidth parameter $h &amp;gt; 0$ for the kernel function. Gourieroux et al.32 describe that parameter as follows: The bandwidth parameter controls the range of data points that will be used to estimate the distribution. A small bandwidth results in a rough distribution that does not improve appreciably on the original data, while a large bandwidth over-smoothes the density curve and erases the underlying structure. This latter point is illustrated in Figure 5 and Figure 6. Figure 5. Influence of the bandwidth parameter on the kernel-smoothed approximation of a normal mixture distribution (dashed) from n = 1000 observations. Source: Adapted from Wand and Jones.
Figure 6. Dynamic influence of the bandwidth parameter on the kernel-smoothed approximation of a normal distribution. Source: KDEpy. On Figure 5, it is clearly visible that: a) The estimate of the normal mixture distribution is very rough37. This corresponds to too small a bandwidth parameter $h$, which undersmoothes the observations. b) The estimate of the normal mixture distribution smoothes away its bimodality structure. This corresponds to too large a bandwidth parameter $h$, which oversmoothes the observations. c) The estimate of the normal mixture distribution is not overly noisy, yet the essential structure of the underlying density has been recovered37. This corresponds to an adequate bandwidth parameter $h$. On Figure 6, the situation is the same as in Figure 5, except that the bandwidth parameter $h$ is being dynamically increased from 0 to ~20. Figure 5 and Figure 6 empirically demonstrate that the choice of the bandwidth is of crucial importance38, although it is a difficult task, especially when smoothing the tails of underlying distributions with possible data scarcity38. The interested reader is referred to Wand and Jones37, Tsybakov35 and Cheng and Sun39 for the description of several methods to choose the optimal bandwidth for a kernel function. Compute the kernel-smoothed portfolio VaR estimator as \[\text{VaR}_{\alpha} = - \hat{F}^{-1}(1 - \alpha)\] , where $ \hat{F}^{-1}(1 - \alpha) $ is the solution of the quantile equation \[\hat{F}(x) = \int_{-\infty}^x \hat{f}(u) \, du = 1 - \alpha\] with $ \hat{f}(x) = \frac{1}{n h} \sum_{i=1}^n K \left( \frac{x - r_i}{h} \right) $. That part is typically done with a numerical algorithm, like the Gauss–Newton algorithm mentioned in Gourieroux et al.32. Two important positive results on the kernel-smoothed VaR estimator are established in Chen and Tang23: Theoretically, it is consistent in the presence of weak dependence between portfolio returns. 
Empirically, it produces more precise estimates40 than those obtained with the empirical VaR estimator - especially when the number of observations is small - which can translate to a large amount in financial terms23. Similar results - in a non-financial context - are reported in Cheng and Sun39: It turns out that kernel smoothed quantile estimators, with no matter which bandwidth selection method used, are more efficient than the empirical quantile estimator in most situations. And when sample size is relatively small, kernel smoothed estimators are especially more efficient than the empirical quantile estimator. In other words, the extra effort of smoothing pays off in the end23! The major limitation of the kernel-smoothed VaR estimator, though, is that if the selected kernel function does not reflect the tail features of the true portfolio return distribution, some problems may arise when the quantile to be estimated requires an extrapolation […] far beyond the range of observed data41. In the words of Danielsson and de Vries20: Almost all kernels are estimated with the entire data set, with interior observations dominating the kernel estimation. While even the most careful kernel estimation will provide good estimates for the interior, there is no reason to believe that the kernel will describe the tails adequately. Tail bumpiness is a common problem in kernel estimation. So, while the kernel-smoothed VaR estimator is capable of tail extrapolation - contrary to the empirical VaR estimator - that capability should be used with extreme caution42. A proper portfolio VaR estimator when tail extrapolation is needed thus remains elusive at this stage. 
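To make the kernel-smoothing procedure concrete, here is a minimal, hypothetical sketch in Python of the kernel-smoothed VaR estimator with a Gaussian kernel. Two simplifying assumptions are made: the bandwidth defaults to Silverman's rule-of-thumb (a simple stand-in for the more refined selectors discussed above), and the quantile equation is solved by bisection rather than by the Gauss–Newton algorithm mentioned in Gourieroux et al.

```python
import math
import random

def kernel_smoothed_var(returns, alpha=0.95, h=None):
    """VaR at confidence level alpha from a Gaussian-kernel-smoothed
    approximation of the return distribution (illustrative sketch)."""
    n = len(returns)
    if h is None:
        # Silverman's rule-of-thumb bandwidth, used here as a simple
        # stand-in for the more refined selectors discussed in the text
        mu = sum(returns) / n
        sd = math.sqrt(sum((r - mu) ** 2 for r in returns) / (n - 1))
        h = 1.06 * sd * n ** (-0.2)

    def f_cdf(x):
        # smoothed c.d.f.: average of the Gaussian kernel c.d.f.s
        return sum(0.5 * (1.0 + math.erf((x - r) / (h * math.sqrt(2.0))))
                   for r in returns) / n

    # solve the quantile equation F_hat(x) = 1 - alpha by bisection
    lo, hi = min(returns) - 5 * h, max(returns) + 5 * h
    for _ in range(100):
        mid = (lo + hi) / 2.0
        if f_cdf(mid) < 1.0 - alpha:
            lo = mid
        else:
            hi = mid
    return -(lo + hi) / 2.0  # VaR is the negative of the quantile

random.seed(0)
sample = [random.gauss(0.0005, 0.01) for _ in range(500)]
var_95 = kernel_smoothed_var(sample, alpha=0.95)
```

Because the smoothed c.d.f. is monotone by construction, the bisection is guaranteed to converge; the design choice of Silverman's rule is only adequate for roughly unimodal samples.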
Extrapolated empirical quantile of the portfolio return distribution In order to solve the problem of tail extrapolation while retaining the simplicity of the empirical quantile estimator, Hutson30 proposes to extend the linearly interpolated quantile function into a tail extrapolation quantile function35 that allows for non-parametric extrapolation beyond the observed data30. In terms of VaR estimation, Hutson’s work translates into the following extrapolated empirical portfolio VaR estimator: For $0 &amp;lt; \left( 1 - \alpha \right) \leq \frac{1}{n+1}$ \[\text{VaR}_{\alpha} = - r_{(1)} - \left( r_{(2)} - r_{(1)} \right) \log \left( (n+1) \left( 1 - \alpha \right) \right)\] For $\left( 1 - \alpha \right) \in ]\frac{1}{n+1} , \frac{n}{n+1}[$, $ \text{VaR}_{\alpha} $ is defined as the standard empirical portfolio VaR estimator For $\frac{n}{n+1} &amp;lt; \left( 1 - \alpha \right) &amp;lt; 1$ \[\text{VaR}_{\alpha} = - r_{(n)} + \left( r_{(n)} - r_{(n-1)} \right) \log \left( (n+1) \alpha \right)\] Hutson30 establishes the consistency of his quantile estimator for i.i.d. observations and empirically demonstrates, using various theoretical distributions, that it fits well to the ideal sample for all distributions for ideal samples as small as $n = 10$30. Unfortunately for financial applications, Hutson30 also notes that his quantile estimator appears to be [unable] to completely capture the tail behavior of heavy-tailed43 distributions such as Cauchy30. This is confirmed by Banfi et al.41, who find that Hutson’s method provides competitive results when light-tailed distributions are of interest41 but generates large biases in the case of heavy-tailed distributions41. 
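The three-branch estimator above can be sketched in a few lines of Python. One assumption is made for the interior branch: the "standard empirical portfolio VaR estimator" is taken to be the linearly interpolated quantile with plotting positions $\frac{i}{n+1}$, consistent with Hutson's construction.

```python
import math

def hutson_var(returns, alpha=0.99):
    """Extrapolated empirical VaR following Hutson's tail-extrapolated
    quantile function (sketch). The interior branch assumes the linearly
    interpolated empirical quantile with plotting positions i/(n+1)."""
    r = sorted(returns)
    n = len(r)
    p = 1.0 - alpha  # quantile level of the return distribution
    if p <= 1.0 / (n + 1):
        # lower-tail extrapolation, below the smallest observation
        return -r[0] - (r[1] - r[0]) * math.log((n + 1) * p)
    if p >= n / (n + 1):
        # upper-tail extrapolation, above the largest observation
        return -r[-1] + (r[-1] - r[-2]) * math.log((n + 1) * alpha)
    # interior: linear interpolation between adjacent order statistics
    g, j = math.modf((n + 1) * p)
    j = int(j)
    q = r[j - 1] + g * (r[j] - r[j - 1])
    return -q

returns = [-0.05, -0.03, -0.01, 0.0, 0.01, 0.02, 0.03, 0.04, 0.05, 0.06]
```

Note how, for $\alpha$ large enough that $1 - \alpha \leq \frac{1}{n+1}$, the logarithmic term becomes negative and the estimator extrapolates a loss larger than the worst observed return, which is precisely the capability the empirical estimator lacks.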
Extreme value theory-based quantile of the portfolio return distribution Another possibility to improve [the] tail extrapolation41 properties of the empirical quantile estimator is to rely on extreme value theory (EVT), which is a domain of statistics concerned with the study of the asymptotic distribution of extreme events, that is to say events which are rare in frequency and huge with respect to the majority of observations44. Indeed, because VaR only deals with extreme quantiles of the distribution44, EVT sounds like a natural framework for providing more reliable VaR estimates than the usual ones, given that [it] directly concentrates on the tails of the distribution, thus avoiding a major flaw of [other] approaches whose estimates are somehow biased by the credit they give to the central part of the distribution, thus underestimating extremes and outliers, which are exactly what one is interested in when calculating VaR44. Two preliminary remarks before proceeding: The first is notational: the EVT literature45 typically focuses on the upper tail of distributions, so that the portfolio returns $r_1,…,r_n$ need to be replaced by their opposites $r^{'}_1 = -r_1$,…,$r^{'}_n = -r_n$. The second is important for applying EVT results in finance: most existing EVT methods require [i.i.d.] observations, whereas financial time series exhibit obvious serial dependence features such as volatility clustering44. It turns out that this issue has been addressed in works dealing with weak serial dependence [and] the main message from these studies is that usual EVT methods are still valid44. For example, the Hill estimator discussed below was originally derived under the assumptions of i.i.d. observations46, but it has also been proved to be usable with weakly dependent observations47; the same applies48 to the Weissman quantile estimator also discussed below. 
In addition, even though the [EVT] estimators obtained may be less accurate and neglecting this fact could lead to inadequate resolutions in order to cope with the risk of occurrence of extreme events44, they are consistent and unbiased in the presence of higher moment dependence14 and it is even possible to explicitly model extreme dependence using the [notion of] extremal index14. With these remarks in mind, EVT offers two main methods14 to model the upper tail of the portfolio return distribution $r^{'}_1$,…,$r^{'}_n$ and compute an EVT-based portfolio VaR estimator: A fully parametric method based on the generalized Pareto distribution (GPD) A semi-parametric method based on the Hill (or similar) estimator Generalized Pareto distribution-based method This method consists in fitting a generalized Pareto distribution to the upper tail of the portfolio return cumulative distribution function (c.d.f.) and computing its $\alpha$% quantile. This is for example done in the seminal paper of McNeil and Frey49, in which such a distribution is fitted to the returns50 of different financial instruments. The theoretical justification for this method lies in the Pickands–Balkema–de Haan theorem, which states that EVT holds sufficiently far out in the tails such that we can obtain the distribution not only of the maxima but also of other extremely large observations14. In practice, fitting a GPD to the upper tail of the portfolio return distribution is a two-step process: First, a threshold $r^{'}_{(n - k)}$, $k \geq 1$ beyond which returns should be considered as belonging to the upper tail needs to be selected. This threshold corresponds to the location parameter $u \in \mathbb{R}$ of the GPD. Unfortunately, the choice of the number $k$ of largest observations that should be considered extreme is not straightforward41 and is actually a central issue to any application of EVT44. 
Indeed, as detailed in de Haan et al.51: Theoretically, the statistical properties of EVT-based estimators are established for $k$ such that $k \to \infty$ and $k/n \to 0$ as $n \to \infty$. In applications with a finite sample size, it is necessary to investigate how to choose the number of high observations used in estimation. For financial practitioners, two difficulties arise: firstly, there is no straightforward procedure for the selection; secondly, the performance of the EVT estimators is rather sensitive to this choice. More specifically, there is a bias–variance tradeoff: with a low level of $k$, the estimation variance is at a high level which may not be acceptable for the application; by increasing $k$, i.e., using progressively more data, the variance is reduced, but at the cost of an increasing bias. The literature offers some guidance on how to choose an adequate cut-off between the central part and the upper tail of the return distribution but that choice remains notoriously difficult in general, c.f. Benito et al.52 for a review. Fortunately, in the specific context of VaR estimation, there is a large set of thresholds that provide similar GPD quantiles estimators and as a consequence similar market risk measures53 - from about the 80th percentile of observations to the 95th percentile of observations53 - so that the researchers and practitioners should not focus excessively on the threshold choice53. Figure 7 and Figure 8 illustrate this point with a GPD fitted to the lower tail of daily percentage returns of Deutsche mark/British pound (DEM/GBP) exchange rates from 3rd January 1984 to 31st December 199154, using two different thresholds: Figure 7 - A threshold $u \approx -1.2292$, corresponding to ~2% of the observations. Figure 7. Lower tail of daily percentage returns of Deutsche mark/British pound (DEM/GBP) exchange rates, GPD fit with threshold u = -1.2292, 3rd January 1984 to 31st December 1991. 
Figure 8 - A threshold $u \approx -0.2683$, corresponding to ~20% of the observations. Figure 8. Lower tail of daily percentage returns of Deutsche mark/British pound (DEM/GBP) exchange rates, GPD fit with threshold u = -0.2683, 3rd January 1984 to 31st December 1991. From Figure 7 and Figure 8, both thresholds lead to an equally good extreme lower tail GPD fit, which is confirmed numerically by goodness-of-fit measures. Consequently, in order to keep the threshold selection step as simple as possible for VaR estimation, the early suggestion of DuMouchel55 to use the 90th percentile of observations seems a very good starting point. Second, the shape parameter $\xi \in \mathbb{R}$ and the scale parameter $\sigma &amp;gt; 0$ of the GPD need to be estimated. This is usually done through likelihood maximization, but other procedures are described in the literature (method of moments56, maximization of goodness-of-fit estimators57, etc.). Once the parameters of the GPD have been determined, the EVT/GPD-based portfolio VaR estimator49 is defined as the $\alpha$% quantile of that GPD through the formula \[\text{VaR}^{\text{GPD}}_{\alpha} = \begin{cases} r^{'}_{(n - k)} + \frac{\hat{\sigma}}{\hat{\xi}} \left( \left( \frac{k}{n ( 1 - \alpha)} \right)^{\hat{\xi}} -1 \right) &amp;amp;\text{if } \hat{\xi} \ne 0 \\ r^{'}_{(n - k)} - \hat{\sigma} \ln \frac{n ( 1 - \alpha)}{k} &amp;amp;\text{if } \hat{\xi} = 0 \\ \end{cases}\] , where: $\hat{\xi}$ is an estimator of the shape parameter $\xi$ of the GPD. $\hat{\sigma}$ is an estimator of the scale parameter $\sigma$ of the GPD. Figure 9, taken from Danielsson14, illustrates the near-perfect fit that can be obtained when such a method is applied to the upper and lower tails of the daily S&amp;amp;P 500 returns over the period 1970-2009. Figure 9. Upper and lower tails of daily S&amp;amp;P 500 returns fitted with an EVT-estimated distribution vs. a normal distribution, 1970-2009. Source: Danielsson. 
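The two-step GPD procedure can be sketched as follows in Python. Two assumptions are made for simplicity: the threshold defaults to DuMouchel's 90th-percentile suggestion, and the GPD parameters are estimated by the method of moments (cf. the Hosking and Wallis reference) rather than by likelihood maximization, since the former has a closed form.

```python
import math
import random

def gpd_var(returns, alpha=0.99, k=None):
    """EVT/GPD-based VaR (sketch): fit a GPD to the k largest losses by
    the method of moments and invert its quantile. No bias correction."""
    losses = sorted(-r for r in returns)   # work on the upper tail of losses
    n = len(losses)
    if k is None:
        k = n // 10  # DuMouchel's 90th-percentile starting point
    u = losses[n - k - 1]                  # threshold r'_(n-k)
    excesses = [x - u for x in losses[n - k:]]
    # method of moments for the GPD: with m the mean and v the variance
    # of the excesses, m^2/v = 1 - 2*xi and sigma = m*(1 - xi)
    m = sum(excesses) / k
    v = sum((x - m) ** 2 for x in excesses) / (k - 1)
    xi = 0.5 * (1.0 - m * m / v)           # shape estimate
    sigma = 0.5 * m * (1.0 + m * m / v)    # scale estimate
    if abs(xi) < 1e-9:                     # exponential-tail limit
        return u + sigma * math.log(k / (n * (1.0 - alpha)))
    return u + sigma / xi * ((k / (n * (1.0 - alpha))) ** xi - 1.0)

random.seed(0)
sample = [random.gauss(0.0, 0.01) for _ in range(2000)]
var_99 = gpd_var(sample, alpha=0.99)
```

The method-of-moments estimates are only usable when the tail is not too heavy ($\xi$ below $\frac{1}{2}$, so that the variance of the excesses exists), which is one reason likelihood maximization is the usual choice in production code.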
Hill estimator-based method Under the assumption that the portfolio return distribution belongs58 to the generic family of heavy-tailed distributions, this method consists in deriving an asymptotic estimator of its $\alpha$% quantile. This is for example done in Danielsson and de Vries59 and in Drees48. This method is justified by stylized facts60 of asset returns, as explained in Danielsson and de Vries20: […] because we know that financial return data are heavy tailed distributed, one can rely on a limit expansion for the tail behavior that is shared by all heavy tailed distributions. The importance of the central limit law for extremes is similar to the importance of the central limit law, i.e. one does not have to choose a particular parametric distribution. Under that generic assumption, it can be demonstrated61 that the upper tail of the portfolio returns decays as a power function multiplied by a slowly varying function, that is \[1 - F(x) = x^{-\gamma} L(x), x &amp;gt; 0\] , where: $F$ is the c.d.f. of the (opposite of the) portfolio returns. $\gamma = \frac{1}{\xi} &amp;gt; 0$ is the tail index62 of $F$, with $\xi$ the same shape parameter as in the GPD method. $L$ is a slowly varying function in a sense defined in Rocco44. From this asymptotic behaviour, the EVT/Weissman-based portfolio VaR estimator is defined as the $\alpha$% Weissman63 quantile estimator of a heavy-tailed distribution through the formula \[\text{VaR}^{\text{WM}}_{\alpha} = r^{'}_{(n - k)} \left( \frac{k}{n \left( 1 - \alpha \right) } \right)^{\frac{1}{\hat{\gamma}}}\] , where64: $\hat{\gamma}$ is an estimator of the tail index $\gamma$. 
The most frequently employed estimator of the tail index is by far the Hill estimator44 introduced in Hill46, defined conditionally on $k$ - as the reciprocal of the mean log-excess over the threshold, since $\gamma = \frac{1}{\xi}$ - as \[\hat{\gamma} = \left( \frac{1}{k} \sum_{i=1}^k \ln \frac{r^{'}_{(n - i + 1)}}{r^{'}_{(n - k)}} \right)^{-1}\] The interested reader is referred to Fedotenkov65, in which more than one hundred tail index estimators proposed in the literature are reviewed. $k$ is the number of observations $r^{'}_{(n - k + 1)}$, …, $r^{'}_{(n)}$ that should be considered extreme. Here, and contrary to the GPD method, the threshold index $k$ has a huge influence on the Weissman quantile estimator and thus on the EVT/Weissman-based VaR estimator. As an illustration, Figure 10 depicts the daily VaR of the Dow Jones Industrial Average index at a confidence level $\alpha = 99.9$%66 as a function of $k$, when estimated by the EVT/Weissman-based VaR estimator. Figure 10. Impact of the threshold index k between the central part and the upper tail of the Dow Jones Industrial Average daily return distribution, Weissman quantile estimator of VaR 99.9%, 1980-2010. Source: de Haan et al. On Figure 10, three distinct ranges of values for the index $k$ are visible: An initial range of values for $k \in [1, 150]$ that results in a bumpy VaR estimate, although on this specific figure the bumpiness is not that pronounced. This is the “high variance” region of the estimator. A second range of values for $k \in [150, 300]$ that results in a flat-ish VaR estimate. This is the “optimal bias-variance” region of the estimator, in which the exact value of $k$ is not important. Obtaining an estimate of $k$ belonging to that region is the ultimate goal. A third range of values for $k \geq 300$ that results in a diverging VaR estimate. This is the “high bias” region of the estimator. Fortunately, this problem can be solved by using a bias-corrected Weissman quantile estimator. 
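Before turning to bias corrections, the plain (uncorrected) Hill/Weissman pipeline can be sketched in Python. The only assumption beyond the formulas above is that the losses beyond the threshold are strictly positive, as required for the logarithms in the Hill estimator.

```python
import math
import random

def weissman_var(returns, alpha=0.999, k=100):
    """EVT/Weissman-based VaR (sketch): plain Hill estimate of the tail
    index on the k largest losses, with no bias correction."""
    losses = sorted(-r for r in returns)
    n = len(losses)
    u = losses[n - k - 1]  # threshold r'_(n-k), assumed > 0
    # Hill estimate: gamma_hat is the reciprocal of the mean log-excess
    mean_log = sum(math.log(losses[n - i] / u) for i in range(1, k + 1)) / k
    gamma_hat = 1.0 / mean_log
    # Weissman quantile estimator, extrapolating beyond the sample
    return u * (k / (n * (1.0 - alpha))) ** (1.0 / gamma_hat)

# losses drawn from a Pareto distribution with tail index 3, whose
# true 99.9% loss quantile is 0.001 ** (-1/3) = 10
random.seed(0)
sample = [-random.paretovariate(3.0) for _ in range(2000)]
var_999 = weissman_var(sample, alpha=0.999)
```

Re-running this with different values of $k$ reproduces, in miniature, the three regions of Figure 10: small $k$ gives noisy estimates, intermediate $k$ a plateau, and large $k$ a diverging bias.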
For example, Figure 11 depicts the daily VaR of the Dow Jones Industrial Average index at a confidence level $\alpha = 99.9$% as a function of $k$, when estimated by the EVT/unbiased Weissman-based VaR estimator introduced in de Haan et al.51. Figure 11. Impact of the threshold index k between the central part and the upper tail of the Dow Jones Industrial Average daily return distribution, unbiased Weissman quantile estimator of VaR 99.9%, 1980-2010. Source: de Haan et al. Comparing Figure 11 to Figure 10, the improvement in stability of the quantile estimator is striking. One important limitation of this method compared to the GPD-based method is that depending on the exact financial data used (financial instrument, time period, returns frequency…), the stylized heavy-tailed nature of return distributions might be violated, with return distributions being thin-tailed or short-tailed67 instead of heavy-tailed, c.f. Longin and Solnik67 and Drees48. Incidentally, the two lower tail GPD fits depicted in Figure 7 and Figure 8 both correspond to a short-tailed distribution since $\hat{\xi} \approx -0.2304$ on Figure 7 and $\hat{\xi} \approx -0.021$ on Figure 8. Practical performance In terms of practical performance, Danielsson14 highlights that EVT-based VaR estimation delivers good probability–quantile estimates where EVT holds14, but that there are no rules that tell us when [EVT] becomes inaccurate14. Indeed, it depends on the underlying distribution of the data. In some cases, it may be accurate up to 1% or even 5%, while in other cases it is not reliable even up to 0.1%14. In addition, the accuracy of EVT also depends on the selected threshold. This is illustrated in Figure 12, which is identical to Figure 7 except that the GPD fit has been graphically extended to the right of the threshold $u \approx -1.2292$. Figure 12. 
Lower tail of daily percentage returns of Deutsche mark/British pound (DEM/GBP) exchange rates, GPD fit with threshold u = -1.2292 extended, 3rd January 1984 to 31st December 1991. From Figure 7 and Figure 8, both GPD fits seem usable below $u \approx -1.2292$. But while the GPD fit of Figure 8 is valid up to $u \approx -0.2683$, Figure 12 makes it clear that the GPD fit of Figure 7 is completely off above $u \approx -1.2292$. Other estimators It is out of the scope of this blog post to list all non-parametric and semi-parametric portfolio VaR estimators discussed in the literature, but I would like to finish this section by mentioning estimators based on smoothing splines. An example of such an estimator is described in Shaker-Akhtekhane and Poorabbas68, in which it is empirically demonstrated to outperform common historical, parametric, and kernel-based methods68 when applied to the S&amp;amp;P500 index at VaR confidence levels of $\alpha = 95$% and $\alpha = 99$%. A few points of attention for such estimators: Monotonicity constraints need to be imposed on smoothing splines in order to properly approximate the c.d.f. of the portfolio returns, as highlighted in Wood69. These are not mentioned in Shaker-Akhtekhane and Poorabbas68, but are required in practice. Similar to kernel-smoothing, a smoothing parameter needs to be selected. The same kind of problems arise, with solutions that are very close in spirit like generalized cross-validation, c.f. Wood69. Again similar to kernel-smoothing, special care must be taken when extrapolating beyond the range of observed values. In particular, the estimator described in Shaker-Akhtekhane and Poorabbas68 cannot extrapolate beyond the smallest and the highest portfolio returns due to the splines constraints associated with these two points. 
Parametric Value-at-Risk estimation Contrary to the non-parametric and semi-parametric approaches, parametric - also called analytical - approaches make the assumption70 that the whole portfolio return distribution can be described by a parametric distribution $F_{\theta}$ whose parameters $\theta$ need to be estimated from observations. The $(1 - \alpha)$% quantile of the portfolio return distribution is then simply obtained by inverting that parametric distribution, which leads to the definition of the parametric portfolio VaR estimator as \[\text{VaR}_{\alpha} = - F_{\theta}^{-1} (1 - \alpha)\] Parametric approaches thus replace the problem of accurately computing the quantile of an empirical distribution of observations by the problem of choosing a parametric distribution that best fits these observations. A couple of examples: If the portfolio returns are assumed to be distributed according to a Gaussian distribution, the associated portfolio VaR is called Gaussian Value-at-Risk (GVaR) and is computed through the formula71 \[\text{GVaR}_{\alpha} (X) = - \mu - \sigma z_{1 - \alpha}\] , where: The location parameter $\mu$ and the scale parameter $\sigma$ are usually72 estimated by their sample counterparts. $z_{1 - \alpha}$ is the $1 - \alpha$ quantile of the standard normal distribution. This is the assumption made in the RiskMetrics model73. 
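As a concrete illustration of the parametric approach in its simplest form, the GVaR formula above amounts to three lines of Python; the sample mean and standard deviation stand in for $\mu$ and $\sigma$, as in the RiskMetrics setting.

```python
import statistics

def gaussian_var(returns, alpha=0.95):
    """Parametric (Gaussian) VaR: -mu - sigma * z_{1-alpha}, with mu and
    sigma estimated by their sample counterparts."""
    mu = statistics.fmean(returns)
    sigma = statistics.stdev(returns)
    # z_{1-alpha} is negative for alpha > 0.5, so the VaR is positive
    z = statistics.NormalDist().inv_cdf(1.0 - alpha)
    return -mu - sigma * z

returns = [-0.02, -0.01, 0.0, 0.01, 0.02]
var_95 = gaussian_var(returns, alpha=0.95)
```

For this toy sample, $\mu = 0$ and $\sigma \approx 0.0158$, so the 95% GVaR is about $1.645 \times 0.0158 \approx 0.026$; the same skeleton applies to any other parametric distribution by swapping the quantile function.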
If the portfolio returns are assumed to be distributed according to a heavy-tailed distribution, several distributions can be used: A Cornish-Fisher distribution, whose associated portfolio VaR is known as Modified Value-at-Risk A Gaussian mixture distribution … Here, as a side note, even though it is known that financial time series usually exhibit skewed and fat-tailed distributions60, there is no complete agreement on what distribution could fit them best44, so that finding the best distribution to use is as much art as science… Parametric VaR estimation approaches might be very successful, especially with long risk horizons (months, years…) and/or with not too extreme confidence levels (90%, 95%…). Nevertheless, they might also not be able to provide an adequate description of the whole range of data, resulting in a good fit of the body but a non accurate description of the tails41, leading to biases in VaR estimates. Implementation in Portfolio Optimizer Portfolio Optimizer supports different portfolio Value-at-Risk estimation methods, c.f. the documentation: Empirical portfolio VaR estimation Extrapolated empirical portfolio VaR estimation Kernel-smoothed empirical portfolio VaR estimation For that estimation method, the kernel bandwidth parameter $h$ is automatically computed using a proprietary variation of the improved Sheather and Jones rule described in Botev et al.74 EVT-based portfolio VaR estimation (both GPD-based and Hill estimator-based) For these estimation methods: The number of extreme observations $k$ is automatically computed using a proprietary variation of the goodness-of-fit procedure described in El-Aroui and Diebolt75. The estimated GPD parameters are bias-corrected using the formulas described in Giles et al.76. The estimated Hill and Weissman estimators are bias-corrected using the formulas described in de Haan et al.51 and corrected in Chavez-Demoulin and Guillou77. 
Parametric portfolio VaR estimation The supported parametric distributions are: The Gaussian distribution The Gaussian mixture distribution The Cornish-Fisher distribution Conclusion This blog post described some of the most well-known methods for univariate Value-at-Risk estimation. Thanks to these, it is possible to analyze the past behaviour of a financial portfolio, but their real interest lies in univariate Value-at-Risk forecasting, which will be the subject of the next blog post in this series. Stay tuned! Meanwhile, feel free to connect with me on LinkedIn or to follow me on Twitter. – See Gueant, O., Computing the Value at Risk of a Portfolio: Academic literature and Practitioners’ response, EMMA, Working Paper. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 See Manganelli, Simone and Engle, Robert F., Value at Risk Models in Finance (August 2001). ECB Working Paper No. 75. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 Basel II requires78 banks to calculate market risk capital requirements using VaR at a 99% confidence level over a 10-day horizon. &amp;#8617; Basel III requires79 internal backtesting procedures based on VaR and Stress VaR (VaR applied to a market stress period). &amp;#8617; The SEC Rule 18f-4 requires companies to calculate daily VaR at a 99% confidence level over a 20-day horizon and using at least 3 years of historical data; it also requires companies to backtest their VaR models daily over a 1-day horizon. &amp;#8617; See Stoyan V. Stoyanov, Svetlozar T. Rachev, Frank J. Fabozzi, Sensitivity of portfolio VaR and CVaR to portfolio return characteristics, Working paper. &amp;#8617; C.f. 
Dowd24, the VaR is the negative of the relevant P/L observation because P/L is positive for profitable outcomes and negative for losses, and the VaR is the maximum likely loss (rather than profit) at the specified probability24, so that VaR is a positive percentage; to be noted, though, that VaR can be negative when no loss is incurred within the confidence level, in which case it is meaningless; c.f. Daníelsson14. &amp;#8617; This is the case when the portfolio return cumulative distribution function is strictly increasing and continuous; otherwise, a similar formula is still valid, with $F_X^{-1}$ the generalized inverse distribution function of $X$, but these subtleties - important in mathematical proofs and in numerical implementations - are out of scope of this blog post. &amp;#8617; See Jorion, P. (2007). Value at risk: The new benchmark for managing financial risk. New York, NY: McGraw-Hill. &amp;#8617; &amp;#8617;2 &amp;#8617;3 See Keith Kuester, Stefan Mittnik, Marc S. Paolella, Value-at-Risk Prediction: A Comparison of Alternative Strategies, Journal of Financial Econometrics, Volume 4, Issue 1, Winter 2006, Pages 53–89. &amp;#8617; Also called top-down VaR models13 or portfolio aggregation-based models. &amp;#8617; See Zangari, Peter, 1997, Streamlining the market risk measurement process, RiskMetrics Monitor, 1, 29–35. &amp;#8617; &amp;#8617;2 &amp;#8617;3 See Ballotta, L. ORCID: 0000-0002-2059-6281 and Fusai, G. ORCID: 0000-0001-9215-2586 (2017). A Gentle Introduction to Value at Risk. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 See Jon Danielsson, Financial Risk Forecasting: The Theory and Practice of Forecasting Market Risk, with Implementation in R and Matlab, Wiley 2011. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 &amp;#8617;7 &amp;#8617;8 &amp;#8617;9 &amp;#8617;10 &amp;#8617;11 Danielsson14 comes to my mind. &amp;#8617; See J. S. Butler &amp;amp; Barry Schachter, 1996. 
Improving Value-At-Risk Estimates By Combining Kernel Estimation With Historical Simulation, Finance 9605001, University Library of Munich, Germany. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 That report was required by 4.15pm and originally became known as the 4.15 report. &amp;#8617; See Glyn A. Holton, (2002), History of Value-at-Risk: 1922-1998, Method and Hist of Econ Thought, University Library of Munich, Germany. &amp;#8617; &amp;#8617;2 The two horizons might be different, but this is out of scope of this blog post. &amp;#8617; See Danielsson, Jon, and Casper G. De Vries. Value-at-Risk and Extreme Returns. Annales d’Économie et de Statistique, no. 60, 2000, pp. 239–70. &amp;#8617; &amp;#8617;2 &amp;#8617;3 See Lall, U., Y. Moon, and K. Bosworth (1993), Kernel flood frequency estimators: Bandwidth selection and kernel choice, Water Resour. Res.,29(4), 1003–1015. &amp;#8617; See RiskMetrics. Technical Document, J.P.Morgan/Reuters, New York, 1996. Fourth Edition. &amp;#8617; See Song Xi Chen, Cheng Yong Tang, Nonparametric Inference of Value-at-Risk for Dependent Financial Returns, Journal of Financial Econometrics, Volume 3, Issue 2, Spring 2005, Pages 227–255. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 &amp;#8617;7 &amp;#8617;8 &amp;#8617;9 &amp;#8617;10 See Kevin Dowd, Estimating VaR with Order Statistics, The Journal of Derivatives Spring 2001, 8 (3) 23-30 &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 To be noted that Dowd24 proposes to take the sixth observation as [the] 5% VaR because we want 5% of the probability mass to lie to the left of [the] VaR24, but other authors propose to use the fifth observation instead1329. &amp;#8617; Many other choices - at least 9 - are possible though, c.f. Hyndman and Fan80; ultimately, what is needed is an estimator of the $(1 - \alpha)$% quantile of the empirical portfolio return distribution. 
&amp;#8617; &amp;#8617;2 For $ 1 - \alpha \in ]\frac{1}{n+1} , \frac{n}{n+1}[$, that is, no extrapolation is possible. &amp;#8617; &amp;#8617;2 &amp;#8617;3 Such a linearly interpolated quantile estimator has been known since at least Parzen81. &amp;#8617; See Peter Hall and Andrew Rieck. (2001). Improving Coverage Accuracy of Nonparametric Prediction Intervals. Journal of the Royal Statistical Society. Series B (Statistical Methodology), 63(4), 717–725. &amp;#8617; &amp;#8617;2 See Hutson, A.D. A Semi-Parametric Quantile Function Estimator for Use in Bootstrap Estimation Procedures. Statistics and Computing 12, 331–338 (2002). &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 &amp;#8617;7 To be noted that weak dependence is a kind of misnomer, because this type of dependence actually covers a very broad range of time series models! &amp;#8617; See Gourieroux, C., Scaillet, O. and Laurent, J.P. (2000). Sensitivity analysis of Values at Risk. Journal of Empirical Finance, 7, 225-245. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 Asymmetric kernel functions also exist, c.f. for example Abadir and Lawford82. &amp;#8617; In a mean squared-error sense. &amp;#8617; See Alexandre B. Tsybakov. 2008. Introduction to Nonparametric Estimation (1st. ed.). Springer Publishing Company, Incorporated. &amp;#8617; &amp;#8617;2 &amp;#8617;3 The performance of a kernel estimator $K$ is defined in terms of the ratio of sample sizes necessary to obtain the same minimum asymptotic mean integrated squared error (for a given [function $f$ that is being kernel-smoothed] when using $K$ as when using [the Epanechnikov kernel]37. &amp;#8617; See Wand, M.P., &amp;amp; Jones, M.C. (1994). Kernel Smoothing (1st ed.). Chapman and Hall/CRC. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 See Keming Yu &amp;amp; Abdallah K. Ally &amp;amp; Shanchao Yang &amp;amp; David J. 
Hand, Kernel quantile based estimation of expected shortfall, Journal of Risk. &amp;#8617; &amp;#8617;2 See Cheng, M.-Y. and S. Sun (2006). Bandwidth selection for kernel quantile estimation. Journal of the Chinese Statistical Association 44 (3), 271–295. &amp;#8617; &amp;#8617;2 Yet, Chen and Tang23 note that the reduction in RMSE is not very large for large samples23, confirming the theoretical results that the reduction is of second order only23. &amp;#8617; See Banfi, F., Cazzaniga, G. &amp;amp; De Michele, C. Nonparametric extrapolation of extreme quantiles: a comparison study. Stoch Environ Res Risk Assess 36, 1579–1596 (2022). &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 &amp;#8617;7 Potential solutions to this problem might be to use 1) a data-driven mixture of a Gaussian and a Cauchy kernel as proposed in Banfi et al.41 or 2) a preliminary transformation of the data with a Champernowne distribution as proposed in Buch-Kromann et al.83. &amp;#8617; The family of heavy-tailed distributions encompasses all the fat-tailed distributions encountered in finance, like the Student-t distribution, and more. &amp;#8617; See Rocco, Marco, Extreme Value Theory for Finance: A Survey (February 3, 2012). Bank of Italy Occasional Paper No. 99. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 &amp;#8617;7 &amp;#8617;8 &amp;#8617;9 &amp;#8617;10 See Danielsson, J., de Vries, C., 1997. Tail index and quantile estimation with very high frequency data. Journal of Empirical Finance 4, 241–257. &amp;#8617; See Hill, B.M. (1975) A Simple General Approach to Inference About the Tail of a Distribution. Annals of Statistics, 3, 1163-1174. &amp;#8617; &amp;#8617;2 See B. Karima and B. Youcef, Asymptotic Normality of Hill’s Estimator under Weak Dependence, Statistical Methodologies. IntechOpen, Feb. 26, 2020. &amp;#8617; See Drees, Holger. 
Extreme Quantile Estimation for Dependent Data, with Applications to Finance. Bernoulli, vol. 9, no. 4, 2003, pp. 617–57. &amp;#8617; &amp;#8617;2 &amp;#8617;3 See Alexander J. McNeil, Rudiger Frey, Estimation of tail-related risk measures for heteroscedastic financial time series: an extreme value approach, Journal of Empirical Finance, Volume 7, Issues 3–4, 2000, Pages 271-300. &amp;#8617; &amp;#8617;2 More precisely, to the standardized residuals of an AR(1)-GARCH(1,1) model of these asset returns; the rationale is that excesses over threshold should be more justified with AR(1)-GARCH(1,1) standardized residuals than with raw asset returns. &amp;#8617; See de Haan, L., Mercadier, C. &amp;amp; Zhou, C. Adapting extreme value statistics to financial time series: dealing with bias and serial dependence. Finance Stoch 20, 321–354 (2016). &amp;#8617; &amp;#8617;2 &amp;#8617;3 See Benito S, Lopez-Martín C, Navarro MA. Assessing the importance of the choice threshold in quantifying market risk under the POT approach (EVT). Risk Manag. 2023;25(1). &amp;#8617; See Benito, S., Lopez-Martín, C. &amp;amp; Navarro, M.A. Assessing the importance of the choice threshold in quantifying market risk under the POT approach (EVT). Risk Manag 25, 6 (2023). &amp;#8617; &amp;#8617;2 &amp;#8617;3 See Bollerslev, T., and Ghysels, E. (1996). Periodic Autoregressive Conditional Heteroskedasticity. Journal of Business &amp;amp; Economic Statistics, 14, 139–151. &amp;#8617; See William H. DuMouchel. “Estimating the Stable Index α in Order to Measure Tail Thickness: A Critique.” Ann. Statist. 11 (4) 1019 - 1031, December, 1983. &amp;#8617; See J. R. M. Hosking and J. R. Wallis, Parameter and Quantile Estimation for the Generalized Pareto Distribution, Technometrics, Vol. 29, No. 3 (Aug., 1987), pp. 339-349. 
&amp;#8617; See Alberto Luceno, Fitting the generalized Pareto distribution to data using maximum goodness-of-fit estimators, Computational Statistics &amp;amp; Data Analysis, Volume 51, Issue 2, 2006, Pages 904-917. &amp;#8617; In EVT terms, the true portfolio return distribution is assumed to be in the Fréchet domain of attraction. &amp;#8617; See Danielsson J., de Vries C. G. (1997). Beyond the Sample: Extreme Quantile and Probability Estimation, Mimeo, Tinbergen Institute Rotterdam. &amp;#8617; See R. Cont (2001) Empirical properties of asset returns: stylized facts and statistical issues, Quantitative Finance, 1:2, 223-236. &amp;#8617; &amp;#8617;2 The cumulative distribution function of the portfolio returns is in the maximum domain of attraction of a Fréchet-type extreme value distribution if and only if $1 - F$ has that form, c.f. Rocco44. &amp;#8617; The tail index is also known as the extreme value index. &amp;#8617; See Weissman, I.: Estimation of parameters and large quantiles based on the k largest observations. J. Am. Stat. Assoc. 73, 812–815 (1978). &amp;#8617; The formula for the EVT/Weissman-based portfolio VaR estimator is the one in Nieto and Ruiz84 and is slightly different from the one in Danielsson and de Vries59, c.f. Danielsson14 in which a general threshold $u$ is used instead of $r^’_{(\hat{k})}$ or $r^’_{(\hat{k}+1)}$. &amp;#8617; See Fedotenkov, I. (2020). A Review of More than One Hundred Pareto-Tail Index Estimators. Statistica, 80(3), 245–299. &amp;#8617; As required under Basel III framework85. &amp;#8617; See Longin, F.M., and B. Solnik (1997). Correlation structure of international equity markets during extremely volatile periods. Working Paper 97-039, ESSEC, Cergy-Pontoise, France. &amp;#8617; &amp;#8617;2 See Saeed Shaker-Akhtekhane &amp;amp; Solmaz Poorabbas, 2023. Value-at-Risk Estimation Using an Interpolated Distribution of Financial Returns Series, Journal of Applied Finance &amp;amp; Banking, SCIENPRESS Ltd, vol. 
13(1), pages 1-6. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 See Wood, S. N., Monotonic Smoothing Splines Fitted by Cross Validation, 1994, SIAM Journal on Scientific Computing, 1126-1133, 15, 5. &amp;#8617; &amp;#8617;2 In addition, with multivariate VaR models, parametric methods also needs to us[e] approximations of the pricing formulas of each [non simple] asset in the portfolio1, leading to methods like Delta-Normal or Delta-Gamma-(Theta-)Normal based on Taylor expansions of the assets pricing formulas, c.f. Gueant1. &amp;#8617; See Boudt, Kris and Peterson, Brian G. and Croux, Christophe, Estimation and Decomposition of Downside Risk for Portfolios with Non-Normal Returns (October 31, 2007). Journal of Risk, Vol. 11, No. 2, pp. 79-103, 2008. &amp;#8617; See Martin, R. Douglas and Arora, Rohit, Inefficiency of Modified VaR and ES. &amp;#8617; Gueant1 notes that the RiskMetrics model for the distribution of the evolution of the risk factors is based on the assumption that log-returns of prices (or variations in the case of interest rates) are independent across time and normally distributed, when appropriately scaled by an appropriate measure of volatility1. &amp;#8617; See Z. I. Botev. J. F. Grotowski. D. P. Kroese. “Kernel density estimation via diffusion.” Ann. Statist. 38 (5) 2916 - 2957, October 2010. &amp;#8617; See Mhamed-Ali El-Aroui, Jean Diebolt, On the use of the peaks over thresholds method for estimating out-of-sample quantiles, Computational Statistics &amp;amp; Data Analysis, Volume 39, Issue 4, 2002, Pages 453-475. &amp;#8617; See David E. Giles, Hui Feng &amp;amp; Ryan T. Godwin (2016) Bias-corrected maximum likelihood estimation of the parameters of the generalized Pareto distribution, Communications in Statistics - Theory and Methods, 45:8, 2465-2483. 
&amp;#8617; See Valerie Chavez-Demoulin, Armelle Guillou, Extreme quantile estimation for beta-mixing time series and applications, Insurance: Mathematics and Economics, Volume 83, 2018, Pages 59-74. &amp;#8617; See Basel Committee on Banking Supervision. Revisions to the Basel II market risk framework (Updated as of 31 December 2010). 2011. &amp;#8617; See Basel Committee on Banking Supervision. Fundamental review of the trading book. &amp;#8617; See Hyndman, R. J., &amp;amp; Fan, Y. (1996). Sample Quantiles in Statistical Packages. The American Statistician, 50(4), 361–365. &amp;#8617; See Parzen E (1979), Nonparametric statistical data modeling, J Am Stat Assoc 74(365):105–121. &amp;#8617; See Karim M Abadir, Steve Lawford, Optimal asymmetric kernels, Economics Letters, Volume 83, Issue 1, 2004, Pages 61-68. &amp;#8617; See Buch-Kromann, Tine and Nielsen, Jens Perch and Guillen, Montserrat and Bolancé, Catalina, Kernel Density Estimation for Heavy-Tailed Distributions Using the Champernowne Transformation (January 2005). &amp;#8617; See Maria Rosa Nieto, Esther Ruiz, Frontiers in VaR forecasting and backtesting, International Journal of Forecasting, Volume 32, Issue 2, 2016, Pages 475-501. &amp;#8617; Basel III requires79 that the VaR measures for risks on both trading and banking books must be calculated at a 99.9% confidence level. 
&amp;#8617;</summary></entry><entry><title type="html">Supervised Portfolios: A Supervised Machine Learning Approach to Portfolio Optimization</title><link href="https://portfoliooptimizer.io/blog/supervised-portfolios-a-supervised-machine-learning-approach-to-portfolio-optimization/" rel="alternate" type="text/html" title="Supervised Portfolios: A Supervised Machine Learning Approach to Portfolio Optimization" /><published>2025-06-07T00:00:00-05:00</published><updated>2025-06-07T00:00:00-05:00</updated><id>https://portfoliooptimizer.io/blog/supervised-portfolios-a-supervised-machine-learning-approach-to-portfolio-optimization</id><content type="html" xml:base="https://portfoliooptimizer.io/blog/supervised-portfolios-a-supervised-machine-learning-approach-to-portfolio-optimization/">&lt;p&gt;Standard portfolio allocation algorithms like &lt;a href=&quot;https://en.wikipedia.org/wiki/Modern_portfolio_theory&quot;&gt;Markowitz mean-variance optimization&lt;/a&gt; or &lt;a href=&quot;/blog/the-diversification-ratio-measuring-portfolio-diversification/&quot;&gt;Choueifaty diversification ratio optimization&lt;/a&gt; usually 
take as input asset information (expected returns, estimated covariance matrix…) as well as investor constraints and preferences (maximum asset weights, risk aversion…) to produce as output portfolio weights satisfying a selected mathematical objective like the maximization of 
the portfolio &lt;a href=&quot;https://en.wikipedia.org/wiki/Sharpe_ratio&quot;&gt;Sharpe ratio&lt;/a&gt; or &lt;a href=&quot;/blog/the-diversification-ratio-measuring-portfolio-diversification/&quot;&gt;Diversification ratio&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Chevalier et al.&lt;sup id=&quot;fnref:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; introduces a non-standard portfolio allocation framework - represented in Figure 1 - under which the same input is first used to “learn” in-sample optimized portfolio weights in &lt;a href=&quot;https://en.wikipedia.org/wiki/Supervised_learning&quot;&gt;a supervised training phase&lt;/a&gt; 
and then used to produce out-of-sample optimized portfolio weights in an inference phase.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/supervised-portfolios-comparative-diagram-chevalier.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/supervised-portfolios-comparative-diagram-chevalier-small.png&quot; alt=&quot;Standard vs. supervised portfolio allocation framework. Source: Adapted from Chevalier et al.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 1. Standard vs. supervised portfolio allocation framework. Source: Adapted from Chevalier et al.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;In this blog post, I will provide some details about that framework when used with the &lt;a href=&quot;https://en.wikipedia.org/wiki/K-nearest_neighbors_algorithm&quot;&gt;$k$-nearest neighbors&lt;/a&gt; supervised machine learning algorithm, which is an idea originally proposed in Varadi and Teed&lt;sup id=&quot;fnref:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;&lt;sup id=&quot;fnref:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;As an example of usage, I will compare the performances of a $k$-nearest neighbors supervised portfolio with those of a “direct” mean-variance portfolio in the context of a monthly tactical asset allocation strategy for a 2-asset class portfolio made of U.S. equities and U.S. Treasury bonds.&lt;/p&gt;

&lt;h2 id=&quot;mathematical-preliminaries&quot;&gt;Mathematical preliminaries&lt;/h2&gt;

&lt;h3 id=&quot;supervised-machine-learning-algorithms&quot;&gt;Supervised machine learning algorithms&lt;/h3&gt;

&lt;p&gt;Let $\left( X_1, Y_1 \right)$, …, $\left( X_n, Y_n \right)$ be $n$ pairs of data points in&lt;sup id=&quot;fnref:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt; $\mathbb{R}^m \times \mathbb{R}$, $m \geq 1$&lt;sup id=&quot;fnref:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt;, where:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Each data point $X_1, …, X_n$ represents an object - like the pixels of an image - and is called a &lt;a href=&quot;https://en.wikipedia.org/wiki/Feature_vector&quot;&gt;&lt;em&gt;feature vector&lt;/em&gt;&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;Each data point $Y_1,…,Y_n$ represents a characteristic of its associated object - like what kind of animal is depicted in an image (discrete characteristic) or the angle of the rotation between a rotated image and its original version (continuous characteristic) - and is called a &lt;em&gt;label&lt;/em&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Given a feature vector $x \in \mathbb{R}^m$, the aim of a supervised machine learning algorithm is then to estimate the “most appropriate” label associated to $x$ - $\hat{y} \in  \mathbb{R}$ - thanks to 
the information contained in the training dataset $\left( X_1, Y_1 \right)$, …, $\left( X_n, Y_n \right)$.&lt;/p&gt;

&lt;h3 id=&quot;k-nearest-neighbors-regression-algorithm&quot;&gt;$k$-nearest neighbors regression algorithm&lt;/h3&gt;

&lt;p&gt;Let $d$ be a distance metric&lt;sup id=&quot;fnref:43&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:43&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt; on $\mathbb{R}^m$, like the standard &lt;a href=&quot;https://en.wikipedia.org/wiki/Euclidean_distance&quot;&gt;Euclidean distance&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;The $k$-nearest neighbor ($k$-NN) regression algorithm is &lt;em&gt;an early&lt;sup id=&quot;fnref:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:9&quot; class=&quot;footnote&quot;&gt;7&lt;/a&gt;&lt;/sup&gt; [supervised] machine learning algorithm&lt;/em&gt;&lt;sup id=&quot;fnref:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt; that uses the “neighborhood” of a feature vector in order to estimate its label.&lt;/p&gt;

&lt;p&gt;In more detail, let $\left( X_{(i)}(x), Y_{(i)}(x) \right)$, $i=1..n$, denote the training data points $\left( X_1, Y_1 \right)$, …, $\left( X_n, Y_n \right)$ reordered by increasing distance to $x$, so that $d \left(x, X_{(1)}(x) \right)$ $\leq … \leq$ $d \left(x, X_{(n)}(x) \right)$.&lt;/p&gt;

&lt;p&gt;By definition, the $k$-NN estimate for the label associated to $x$ is then&lt;sup id=&quot;fnref:15&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:15&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt; the uniformly or non-uniformly weighted average label of the $k \in \{ 1,…,n \}$ nearest neighbors $Y_{(1)}(x)$,…,$Y_{(k)}(x)$&lt;/p&gt;

\[\hat{y} = \frac{1}{k} \sum_{i=1}^k Y_{(i)}(x)\]

&lt;p&gt;or&lt;/p&gt;

\[\hat{y} = \sum_{i=1}^k w_i Y_{(i)}(x)\]

&lt;p&gt;, where $w_i \geq 0$ is the weight associated to the $i$-th nearest neighbor $Y_{(i)}(x)$ and all the weights $w_i$, $i=1..k$ sum to one, that is, $\sum_{i=1}^k w_i = 1$.&lt;/p&gt;
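As a minimal illustration of the two estimators above, here is a sketch in NumPy with the Euclidean distance; the function name and toy data are mine, purely for illustration.

```python
import numpy as np

def knn_regression(X_train, y_train, x, k, weights=None):
    """Estimate the label of the feature vector x as the (weighted)
    average label of its k nearest neighbors in the training set."""
    # Distances d(x, X_i) from x to every training feature vector
    distances = np.linalg.norm(X_train - x, axis=1)
    # Indices of the k training points closest to x
    nearest = np.argsort(distances)[:k]
    if weights is None:
        # Uniform weighting: plain average of the k neighbor labels
        return np.mean(y_train[nearest])
    # Non-uniform weighting: weights are non-negative and sum to one
    w = np.asarray(weights, dtype=float)
    return np.dot(w, y_train[nearest])

# Toy training set in R^2 with scalar labels
X_train = np.array([[0.0, 0.0], [1.0, 0.0], [5.0, 5.0]])
y_train = np.array([1.0, 3.0, 10.0])

# Uniform 2-NN estimate at x = (0.4, 0): the two closest training
# points are the first two, so the estimate is (1 + 3) / 2
print(knn_regression(X_train, y_train, np.array([0.4, 0.0]), k=2))  # 2.0
```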

&lt;p&gt;For illustration purposes, the process of selecting the 2 nearest neighbors $X_{(1)}(x)$ and $X_{(2)}(x)$ of a data point $x$ in $\mathbb{R}^2$ is outlined in Figure 2.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/supervised-portfolios-knn-regression-example.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/supervised-portfolios-knn-regression-example-small.png&quot; alt=&quot;Example of $k$-NN nearest neighbors selection process in m = 2 dimensions, with n = 3 training data points and k = 2 nearest neighbors.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 2. Example of $k$-NN nearest neighbors selection process in m = 2 dimensions, with n = 3 training data points and k = 2 nearest neighbors.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;blockquote&gt;
  &lt;p&gt;&lt;strong&gt;&lt;em&gt;Notes:&lt;/em&gt;&lt;/strong&gt;&lt;/p&gt;
  &lt;ul&gt;
    &lt;li&gt;There also exists a $k$-NN classification algorithm, which is a variant of the $k$-NN regression algorithm where the label space is not $\mathbb{R}$ but a finite subset of $\mathbb{N}$.&lt;/li&gt;
  &lt;/ul&gt;
&lt;/blockquote&gt;

&lt;h4 id=&quot;theoretical-guarantees&quot;&gt;Theoretical guarantees&lt;/h4&gt;

&lt;p&gt;Since the seminal paper of Cover and Hart&lt;sup id=&quot;fnref:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:11&quot; class=&quot;footnote&quot;&gt;10&lt;/a&gt;&lt;/sup&gt; - proving under mild conditions that the $k$-NN classification algorithm &lt;em&gt;achieves an error rate that is at most twice the best error rate achievable&lt;/em&gt;&lt;sup id=&quot;fnref:7:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt; -, 
several convergence results have been established for $k$-NN methods.&lt;/p&gt;

&lt;p&gt;For example, under an asymptotic regime where the number of training data points $n$ and the number of nearest neighbors $k$ both go to infinity, it has been demonstrated&lt;sup id=&quot;fnref:15:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:15&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt; that the $k$-NN regression algorithm is able to learn any functional 
relationship of the form $Y_i = f \left( X_i \right) + \epsilon_i$, $i=1..n$, where $f$ is an unknown function and $\epsilon_i$ represents additive noise.&lt;/p&gt;

&lt;p&gt;As another example, this time under a finite&lt;sup id=&quot;fnref:17&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:17&quot; class=&quot;footnote&quot;&gt;11&lt;/a&gt;&lt;/sup&gt; sample regime, Jiang&lt;sup id=&quot;fnref:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt; derives &lt;em&gt;the first sup-norm finite-sample [“convergence”] result&lt;/em&gt;&lt;sup id=&quot;fnref:6:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt; for the $k$-NN regression algorithm and shows that it achieves a maximum error rate that is equal to the best maximum error rate achievable &lt;em&gt;up to logarithmic factors&lt;/em&gt;&lt;sup id=&quot;fnref:6:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt;, with high probability.&lt;/p&gt;

&lt;p&gt;In addition to these convergence results, $k$-NN methods also exhibit interesting properties w.r.t. the dimensionality of the feature space $\mathbb{R}^m$.&lt;/p&gt;

&lt;p&gt;For example, while &lt;em&gt;&lt;a href=&quot;https://en.wikipedia.org/wiki/Curse_of_dimensionality&quot;&gt;the curse of dimensionality&lt;/a&gt; forces &lt;a href=&quot;https://en.wikipedia.org/wiki/Nonparametric_statistics&quot;&gt;non-parametric methods&lt;/a&gt; such as $k$-NN to require an exponential-in-dimension sample complexity&lt;/em&gt;&lt;sup id=&quot;fnref:6:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt;, the $k$-NN regression algorithm &lt;em&gt;actually adapts to the local 
&lt;a href=&quot;https://en.wikipedia.org/wiki/Intrinsic_dimension&quot;&gt;intrinsic dimension&lt;/a&gt; without any modifications to the procedure or data&lt;/em&gt;&lt;sup id=&quot;fnref:6:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In other words, if the feature vectors belong to $\mathbb{R}^m$ but have a “true” dimensionality equal to $\mathbb{R}^p, p &amp;lt; m$, then the $k$-NN regression algorithm &lt;em&gt;will [behave] as if it were in the lower dimensional space [of dimension $p$] and independent of the ambient dimension [$m$]&lt;/em&gt;&lt;sup id=&quot;fnref:6:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Further properties of $k$-NN methods can be found in Chen and Shah&lt;sup id=&quot;fnref:7:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt; and in Biau and Devroye&lt;sup id=&quot;fnref:15:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:15&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;h4 id=&quot;practical-performances&quot;&gt;Practical performances&lt;/h4&gt;

&lt;p&gt;Like all supervised machine learning algorithms, the practical performances of the $k$-NN regression algorithm heavily depend on the problem at hand.&lt;/p&gt;

&lt;p&gt;Yet, in general, &lt;em&gt;it often yields competitive results [vs. other more complex algorithms like &lt;a href=&quot;https://en.wikipedia.org/wiki/Neural_network_(machine_learning)&quot;&gt;neural networks&lt;/a&gt;], and in certain domains, when cleverly combined with prior knowledge, it has significantly advanced the state-of-the-art&lt;/em&gt;&lt;sup id=&quot;fnref:18&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:18&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Beyond these competitive performances, Chen and Shah&lt;sup id=&quot;fnref:7:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt; also highlights other important practical aspects of $k$-NN methods that contributed to &lt;em&gt;their empirical success over the years&lt;/em&gt;&lt;sup id=&quot;fnref:7:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Their flexibility in choosing a problem-specific definition of “near” through a custom distance metric&lt;sup id=&quot;fnref:19&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:19&quot; class=&quot;footnote&quot;&gt;14&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
  &lt;li&gt;Their computational efficiency, &lt;em&gt;which has enabled these methods to scale to massive datasets (“big data”)&lt;/em&gt;&lt;sup id=&quot;fnref:7:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt; thanks to approaches like approximate nearest neighbor search&lt;sup id=&quot;fnref:25&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:25&quot; class=&quot;footnote&quot;&gt;15&lt;/a&gt;&lt;/sup&gt; or random projections&lt;sup id=&quot;fnref:26&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:26&quot; class=&quot;footnote&quot;&gt;16&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
  &lt;li&gt;Their non-parametric nature, in that &lt;em&gt;they make very few assumptions on the underlying model for the data&lt;/em&gt;&lt;sup id=&quot;fnref:7:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
  &lt;li&gt;Their ease of interpretability, since &lt;em&gt;they provide evidence for their predictions by exhibiting the nearest neighbors found&lt;/em&gt;&lt;sup id=&quot;fnref:7:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h2 id=&quot;k-nn-based-supervised-portfolios&quot;&gt;$k$-NN based supervised portfolios&lt;/h2&gt;

&lt;h3 id=&quot;supervised-portfolios&quot;&gt;Supervised portfolios&lt;/h3&gt;

&lt;p&gt;Chevalier et al.&lt;sup id=&quot;fnref:1:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; describes &lt;em&gt;an asset allocation strategy that engineers optimal weights before feeding them to a supervised learning algorithm&lt;/em&gt;&lt;sup id=&quot;fnref:1:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;, represented in the lower part of Figure 1.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Given a training dataset of past [financial] observations&lt;/em&gt;&lt;sup id=&quot;fnref:1:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; like past asset returns, past macroeconomic indicators, etc., it proceeds as follows:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;For any relevant date&lt;sup id=&quot;fnref:23&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;17&lt;/a&gt;&lt;/sup&gt; $t=t_1,…$ in the training dataset
    &lt;ul&gt;
      &lt;li&gt;
        &lt;p&gt;Compute optimal (in-sample) future portfolio weights $w_{t+1}$ over a (also in-sample) desired future horizon&lt;sup id=&quot;fnref:30&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:30&quot; class=&quot;footnote&quot;&gt;18&lt;/a&gt;&lt;/sup&gt;, using a selected portfolio optimization algorithm with financial observations up to the time $t+1$&lt;/p&gt;

        &lt;p&gt;&lt;strong&gt;These optimal future portfolio weights are the labels $Y_t$, $t=t_1,…$, of the training data points.&lt;/strong&gt;&lt;/p&gt;

        &lt;p&gt;Note that &lt;em&gt;by lagging the data, we can use the in-sample future realized returns to compute all the [returns-based] estimates&lt;/em&gt;&lt;sup id=&quot;fnref:1:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; required by the portfolio optimization algorithm like the expected asset returns, the asset covariance matrix, etc. This &lt;em&gt;allows to be forward-looking in the training sample, while at the same time avoiding any look-ahead bias&lt;/em&gt;&lt;sup id=&quot;fnref:1:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

        &lt;p&gt;During this step, &lt;em&gt;constraints can of course be added in order to satisfy targets and policies&lt;/em&gt;&lt;sup id=&quot;fnref:1:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
      &lt;/li&gt;
      &lt;li&gt;
        &lt;p&gt;Compute a chosen set of predictors supposed to be linked to the in-sample future portfolio weights $w_{t+1}$, using financial observations up to the time $t$&lt;/p&gt;

        &lt;p&gt;&lt;strong&gt;These predictors are the feature vectors $X_t$, $t=t_1,…$, of the training data points.&lt;/strong&gt;&lt;/p&gt;
      &lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;
  &lt;li&gt;Train and tune a supervised machine learning algorithm using the training data points $\left( X_t, Y_t \right)$, $t=t_1,…$.&lt;/li&gt;
&lt;/ul&gt;
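The training-set construction above can be sketched in Python as follows. Everything here is an illustrative assumption of mine, not the specification of Chevalier et al.: the features are trailing per-asset mean returns and volatilities, and the label engineering uses a simple softmax over in-sample future returns as a stand-in for a real (constrained) portfolio optimizer.

```python
import numpy as np

def training_set(returns, lookback=12, horizon=1):
    """Build supervised training pairs (X_t, Y_t) from a (T, n_assets)
    array of asset returns; label and feature engineering are illustrative."""
    X, Y = [], []
    T = returns.shape[0]
    for t in range(lookback, T - horizon):
        past = returns[t - lookback:t]    # observations up to time t
        future = returns[t:t + horizon]   # in-sample future window
        # Feature vector X_t: trailing mean return and volatility per asset
        X.append(np.concatenate([past.mean(axis=0), past.std(axis=0)]))
        # Label Y_t: in-sample "optimal" future weights w_{t+1}; a softmax
        # over realized future returns stands in for a real optimizer here
        scores = np.exp(future.mean(axis=0) * 100.0)
        Y.append(scores / scores.sum())   # long-only, fully invested
    return np.array(X), np.array(Y)
```

These pairs can then be fed to any supervised machine learning algorithm, $k$-NN regression included.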

&lt;p&gt;Once the training phase is completed, the supervised portfolio allocation algorithm is ready to be used with test data&lt;sup id=&quot;fnref:27&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:27&quot; class=&quot;footnote&quot;&gt;19&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;For any relevant (out-of-sample) test date $t’=t’_1,…$
    &lt;ul&gt;
      &lt;li&gt;
        &lt;p&gt;Compute the set of predictors chosen during the training phase, using financial observations up to the time $t’$&lt;/p&gt;

        &lt;p&gt;&lt;strong&gt;These predictors are the test feature vectors $x_{t’}$, $t’=t’_1,…$.&lt;/strong&gt;&lt;/p&gt;
      &lt;/li&gt;
      &lt;li&gt;
        &lt;p&gt;Provide that set of predictors as an input test feature vector to the supervised machine learning algorithm to receive in output the estimated optimal portfolio weights $\hat{w}_{t’+1}$ over the (out-of-sample) future horizon&lt;/p&gt;

        &lt;p&gt;&lt;strong&gt;These estimated optimal portfolio weights are the estimated labels $\hat{y}_{t’}$, $t’=t’_1,…$.&lt;/strong&gt;&lt;/p&gt;

        &lt;p&gt;Here, depending on the exact supervised machine learning algorithm, the estimated portfolio weights $\hat{w}_{t’+1}$ might not satisfy the portfolio constraints&lt;sup id=&quot;fnref:29&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:29&quot; class=&quot;footnote&quot;&gt;20&lt;/a&gt;&lt;/sup&gt; imposed in the training phase, in which case a post-processing phase would be required.&lt;/p&gt;
      &lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;
&lt;/ul&gt;
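The inference phase above, including a simple post-processing of the estimated weights, could be sketched as follows with a uniformly weighted $k$-NN regression; this is a hedged sketch of mine under long-only, fully-invested constraints, not the implementation of Chevalier et al.

```python
import numpy as np

def predict_weights(X_train, Y_train, x_test, k=5):
    """Estimate out-of-sample portfolio weights as the average of the k
    in-sample optimal weight vectors whose feature vectors are closest
    to the test feature vector (illustrative sketch)."""
    distances = np.linalg.norm(X_train - x_test, axis=1)
    nearest = np.argsort(distances)[:k]
    w_hat = Y_train[nearest].mean(axis=0)   # average of neighbor labels
    # Post-processing: re-impose long-only, fully-invested constraints,
    # since in general an estimated label need not satisfy them exactly
    w_hat = np.clip(w_hat, 0.0, None)
    return w_hat / w_hat.sum()
```

With $k$-NN regression and convex-combination labels, the averaged weights already satisfy these particular constraints, but the post-processing step matters for other algorithms or other constraint sets.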

&lt;p&gt;The portfolio allocation framework of Chevalier et al.&lt;sup id=&quot;fnref:1:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; described above &lt;em&gt;allows the algorithm to learn from past time series of in-sample optimal weights and to infer the best weights from variables such as past performance, risk, and proxies of the macro-economic outlook&lt;/em&gt;&lt;sup id=&quot;fnref:1:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;This contrasts with the standard practice of directly forecasting the input of a portfolio optimization algorithm, making that framework rather original.&lt;/p&gt;

&lt;p&gt;In terms of empirical performances, Chevalier et al.&lt;sup id=&quot;fnref:1:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; finds that &lt;em&gt;predicting the optimal weights directly instead of the traditional two step approach leads to more stable portfolios with statistically better risk-adjusted performance measures&lt;/em&gt;&lt;sup id=&quot;fnref:1:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; 
when using mean-variance optimization as the selected portfolio optimization algorithm and &lt;a href=&quot;https://en.wikipedia.org/wiki/Gradient_boosting#Gradient_tree_boosting&quot;&gt;gradient boosting decision trees&lt;/a&gt; as the selected supervised machine learning algorithm&lt;sup id=&quot;fnref:28&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:28&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Some of these risk-adjusted performance measures are displayed in Figure 3 in the case of 4 asset classes&lt;sup id=&quot;fnref:48&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:48&quot; class=&quot;footnote&quot;&gt;22&lt;/a&gt;&lt;/sup&gt;, for the 3 horizons of predicted returns and the 3 risk aversion levels used in Chevalier et al.&lt;sup id=&quot;fnref:1:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/supervised-portfolios-chevalier-results-summary.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/supervised-portfolios-chevalier-results-summary-small.png&quot; alt=&quot;&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 3. Performances of supervised portfolios vs. direct mean-variance optimized portfolios, 4 asset classes. Source: Adapted from Chevalier et al.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;blockquote&gt;
  &lt;p&gt;&lt;strong&gt;&lt;em&gt;Notes:&lt;/em&gt;&lt;/strong&gt;&lt;/p&gt;
  &lt;ul&gt;
    &lt;li&gt;Additional information can be found in the follow-up paper Chevalier et al.&lt;sup id=&quot;fnref:31&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:31&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt; and in &lt;a href=&quot;https://www.youtube.com/watch?v=mGPR2s1-N4k&quot;&gt;a video of Thomas Raffinot for QuantMinds International&lt;/a&gt;.&lt;/li&gt;
  &lt;/ul&gt;
&lt;/blockquote&gt;

&lt;h3 id=&quot;k-nn-based-supervised-portfolios-1&quot;&gt;$k$-NN-based supervised portfolios&lt;/h3&gt;

&lt;p&gt;Theoretically, the supervised machine learning model used in the portfolio allocation framework of Chevalier et al.&lt;sup id=&quot;fnref:1:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; is trained to learn the following model&lt;sup id=&quot;fnref:1:13&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

\[w_{t+1} = g_t \left(X_t \right) + \epsilon_{t+1}\]

&lt;p&gt;, where:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$X_t$ is the feature vector made of the chosen set of predictors computed at time $t$&lt;/li&gt;
  &lt;li&gt;$w_{t+1}$ is the vector of optimal portfolio weights over the desired future horizon $t+1$&lt;/li&gt;
  &lt;li&gt;$g_t$ is an unknown function&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Because such a model describes a functional relationship compatible with a $k$-NN regression algorithm, it is reasonable to think about using that algorithm as the supervised machine learning algorithm in the above framework.&lt;/p&gt;

&lt;p&gt;Enter $k$-NN-based supervised portfolios, a portfolio allocation framework originally introduced in Varadi and Teed&lt;sup id=&quot;fnref:2:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; as follows:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;This naturally leads us down the path of creating algorithms that can learn from past data and evolve over time to change the method for creating portfolio allocations.&lt;/p&gt;

  &lt;p&gt;The simplest and most intuitive machine-learning algorithm is the K-Nearest Neighbor method ($k$-NN) […, which] is a form of “case-based” reasoning. That is, it learns from examples that are similar to current situation by looking at the past [and says: “what happened historically when I saw patterns that are close to the current pattern?”].&lt;/p&gt;

  &lt;p&gt;It shares a lot in common with how human beings make decisions. When portfolio managers talk about having 20 years of experience, they are really saying that they have a large inventory of past “case studies” in memory to make superior decisions about the current environment.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;As a side note, Varadi and Teed&lt;sup id=&quot;fnref:2:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; is not the first paper to apply a $k$-NN regression algorithm to the problem of portfolio allocation, c.f. for example Gyorfi et al.&lt;sup id=&quot;fnref:20&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:20&quot; class=&quot;footnote&quot;&gt;24&lt;/a&gt;&lt;/sup&gt; in the setting of &lt;a href=&quot;https://en.wikipedia.org/wiki/Online_portfolio_selection&quot;&gt;online portfolio selection&lt;/a&gt;, 
but Varadi and Teed&lt;sup id=&quot;fnref:2:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; is - to my knowledge - the first paper about the same “kind” of supervised portfolios as in Chevalier et al.&lt;sup id=&quot;fnref:1:14&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;A couple of practical advantages of $k$-NN-based supervised portfolios v.s. for example “gradient boosting decision trees”-based supervised portfolios as used in Chevalier et al.&lt;sup id=&quot;fnref:1:15&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; are:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;The simplicity of the training&lt;/p&gt;

    &lt;p&gt;Since nearest neighbor methods are &lt;a href=&quot;https://en.wikipedia.org/wiki/Lazy_learning&quot;&gt;lazy learners&lt;/a&gt;, there is strictly speaking no real training phase.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The simplicity of the tuning&lt;/p&gt;

    &lt;p&gt;There can be no tuning at all if no “advanced” technique (automated features selection, distance learning…) is used.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The guarantee that (convex) portfolio constraints learned during the training phase are satisfied during the test phase&lt;/p&gt;

    &lt;p&gt;In $k$-NN regression&lt;sup id=&quot;fnref:33&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:33&quot; class=&quot;footnote&quot;&gt;25&lt;/a&gt;&lt;/sup&gt;, the estimate for the label associated to a test point is a &lt;a href=&quot;https://en.wikipedia.org/wiki/Convex_combination&quot;&gt;convex combination&lt;/a&gt; of the labels of that point’s nearest neighbors.&lt;/p&gt;

    &lt;p&gt;As a consequence, the estimated portfolio weights $\hat{w}_{t’+1}$ are guaranteed&lt;sup id=&quot;fnref:33:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:33&quot; class=&quot;footnote&quot;&gt;25&lt;/a&gt;&lt;/sup&gt; to satisfy any learned convex portfolio constraints, thereby avoiding any post-processing that could degrade the “quality” of the estimated weights.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The ease of interpretability&lt;/p&gt;

    &lt;p&gt;Due to &lt;a href=&quot;https://en.wikipedia.org/wiki/Algorithm_aversion&quot;&gt;algorithm aversion&lt;/a&gt;, Chevalier et al.&lt;sup id=&quot;fnref:31:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:31&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt; highlights the need to be able to &lt;em&gt;transform a black box nonlinear predictive algorithm [like gradient boosting decision trees] into a simple combination of rules&lt;/em&gt;&lt;sup id=&quot;fnref:31:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:31&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt; in order to make it interpretable for humans.&lt;/p&gt;

    &lt;p&gt;With a $k$-NN regression algorithm, which is one of the most transparent supervised machine learning algorithms in existence, that step is probably not useful&lt;sup id=&quot;fnref:32&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:32&quot; class=&quot;footnote&quot;&gt;26&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;
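&lt;p&gt;The constraint-preservation point can be checked numerically: since the $k$-NN estimate is a convex combination of training weight vectors, long-only and fully-invested training weights produce a long-only and fully-invested estimate. Below is a minimal sketch with made-up neighbor weights and uniform combination coefficients:&lt;/p&gt;

```python
# Hypothetical learned weight vectors (long-only, fully invested) of the
# k = 3 nearest neighbors of some test feature vector
neighbor_weights = [[0.6, 0.4, 0.0], [0.2, 0.5, 0.3], [0.1, 0.1, 0.8]]

# Convex combination coefficients (non-negative, summing to one);
# uniform coefficients correspond to a vanilla k-NN regression
coeffs = [1 / 3, 1 / 3, 1 / 3]

# The estimated weights inherit the convex constraints satisfied by
# the training weights: non-negativity and full investment
estimate = [sum(c * w[j] for c, w in zip(coeffs, neighbor_weights))
            for j in range(3)]
print(estimate, sum(estimate))
```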

&lt;p&gt;In terms of empirical performances, Varadi and Teed&lt;sup id=&quot;fnref:2:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; concludes that $k$-NN-based supervised portfolios &lt;em&gt;consistently outperformed [vanilla maximum Sharpe ratio portfolios] on both heterogeneous and homogenous data sets on a risk-adjusted basis&lt;/em&gt;&lt;sup id=&quot;fnref:2:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, 
with the $k$-NN-based &lt;em&gt;approach [exhibiting] a Sharpe ratio [… up to] over 30% higher than [the direct maximum Sharpe ratio approach]&lt;/em&gt;&lt;sup id=&quot;fnref:2:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Average performance measures for the $k$-NN-based supervised portfolios in Varadi and Teed&lt;sup id=&quot;fnref:2:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; are reported in Figure 4.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/supervised-portfolios-varadi-results-summary.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/supervised-portfolios-varadi-results-summary-small.png&quot; alt=&quot;Performances of supervised portfolios v.s. direct mean-variance optimized portfolios. Source: Adapted from Varadi and Teed.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 4. Performances of $k$-NN-based supervised portfolios v.s. direct mean-variance optimized portfolios. Source: Adapted from Varadi and Teed.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;h2 id=&quot;implementing-k-nn-based-supervised-portfolios&quot;&gt;Implementing $k$-NN-based supervised portfolios&lt;/h2&gt;

&lt;h3 id=&quot;features-selection&quot;&gt;Features selection&lt;/h3&gt;

&lt;p&gt;Biau and Devroye&lt;sup id=&quot;fnref:15:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:15&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt; describes &lt;a href=&quot;https://en.wikipedia.org/wiki/Feature_selection&quot;&gt;features selection&lt;/a&gt; as:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;[…] the process of choosing relevant components of the [feature] vector $X$ for use in model construction.&lt;/p&gt;

  &lt;p&gt;There are many potential benefits of such an operation: facilitating data visualization and data understanding, reducing the measurement and storage requirements, decreasing training and utilization times, and defying the curse of dimensionality to improve prediction performance.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;, and provides some &lt;em&gt;rules of thumb that should be followed&lt;/em&gt;&lt;sup id=&quot;fnref:15:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:15&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;&lt;em&gt;Noisy measurements, that is, components that are independent of $Y$, should be avoided&lt;/em&gt;&lt;sup id=&quot;fnref:15:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:15&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt;, especially because nearest neighbor methods &lt;em&gt;are extremely sensitive to the features used&lt;/em&gt;&lt;sup id=&quot;fnref:38&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:38&quot; class=&quot;footnote&quot;&gt;27&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
  &lt;li&gt;&lt;em&gt;Adding a component that is a function of other components is useless&lt;/em&gt;&lt;sup id=&quot;fnref:15:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:15&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Beyond these generic rules, and although it &lt;em&gt;has been an active research area in the statistics, machine learning, and data mining communities&lt;/em&gt;&lt;sup id=&quot;fnref:1:16&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;, features selection is unfortunately strongly problem-dependent.&lt;/p&gt;

&lt;p&gt;In the context of supervised portfolios, Chevalier et al.&lt;sup id=&quot;fnref:1:17&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; and Varadi and Teed&lt;sup id=&quot;fnref:2:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; both propose to use:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Past asset returns over different horizons&lt;sup id=&quot;fnref:14&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:14&quot; class=&quot;footnote&quot;&gt;28&lt;/a&gt;&lt;/sup&gt; &lt;em&gt;so as to assess momentum and reversals&lt;/em&gt;&lt;sup id=&quot;fnref:1:18&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
  &lt;li&gt;Past asset volatilities over different horizons&lt;sup id=&quot;fnref:14:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:14&quot; class=&quot;footnote&quot;&gt;28&lt;/a&gt;&lt;/sup&gt;, to &lt;em&gt;approximate asset-specific risk&lt;/em&gt;&lt;sup id=&quot;fnref:1:19&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Varadi and Teed&lt;sup id=&quot;fnref:2:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; additionally proposes to include past asset correlations over different horizons&lt;sup id=&quot;fnref:14:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:14&quot; class=&quot;footnote&quot;&gt;28&lt;/a&gt;&lt;/sup&gt; &lt;em&gt;to ensure that [the] $k$-NN algorithm [doesn’t] have access to any information that the [direct mean-variance optimization] [doesn’t] have, but merely use it differently&lt;/em&gt;&lt;sup id=&quot;fnref:2:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Chevalier et al.&lt;sup id=&quot;fnref:1:20&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;, building on the stock asset pricing literature, does not suggest including returns-based indicators other than past asset returns and volatilities, but instead suggests including various macroeconomic indicators (yield curve, VIX…).&lt;/p&gt;

&lt;h3 id=&quot;features-scaling&quot;&gt;Features scaling&lt;/h3&gt;

&lt;p&gt;Typical distance metrics&lt;sup id=&quot;fnref:39&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:39&quot; class=&quot;footnote&quot;&gt;29&lt;/a&gt;&lt;/sup&gt; used with nearest neighbor methods like the Euclidean distance are said to be &lt;em&gt;scale variant&lt;/em&gt;, meaning that the definition of a nearest neighbor is influenced by the relative and absolute scale of the different features.&lt;/p&gt;

&lt;p&gt;For example, when using the Euclidean distance with features such as a person’s height and a person’s age:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;The height feature disproportionately influences the definition of a neighbor if the height feature is measured in millimeters and age in years&lt;/li&gt;
  &lt;li&gt;The age feature disproportionately influences the definition of a neighbor if the height feature is measured in meters and age in days&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;For this reason, features are usually scaled to a similar range before being provided as input to a $k$-NN algorithm&lt;sup id=&quot;fnref:41&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:41&quot; class=&quot;footnote&quot;&gt;30&lt;/a&gt;&lt;/sup&gt;, a pre-processing step called &lt;a href=&quot;https://en.wikipedia.org/wiki/Feature_scaling&quot;&gt;features scaling&lt;/a&gt;.&lt;/p&gt;
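&lt;p&gt;The height/age example can be made concrete with a short computation (the individuals are made up): with heights in millimeters, the height feature dominates the Euclidean distance, while with heights in meters, the age feature dominates.&lt;/p&gt;

```python
import math

def euclidean(a, b):
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))

# (height in millimeters, age in years): the height feature dominates,
# so Bob (40 years older) looks closer to Alice than Carol (1 year older)
alice, bob, carol = (1700, 30), (1710, 70), (1900, 31)
print(euclidean(alice, bob), euclidean(alice, carol))

# (height in meters, age in years): the age feature now dominates,
# and Carol becomes Alice's nearest neighbor instead
alice_m, bob_m, carol_m = (1.70, 30), (1.71, 70), (1.90, 31)
print(euclidean(alice_m, bob_m), euclidean(alice_m, carol_m))
```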

&lt;p&gt;A couple of techniques for features scaling are described in Arora et al.&lt;sup id=&quot;fnref:40&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:40&quot; class=&quot;footnote&quot;&gt;31&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;Min-max scaling, which scales all the values of a feature $\left( X_i \right)_j$, $j \in \{ 1,…,m \}$, $i=1..n$ to a given interval - like $[0,1]$ -, based on the minimum and the maximum values of that feature:&lt;/p&gt;

\[\left( X_i \right)_j' = \frac{\left( X_i \right)_j - \min_{i=1..n} \left( X_i \right)_j }{\max_{i=1..n} \left( X_i \right)_j - \min_{i=1..n} \left( X_i \right)_j }, i=1..n\]
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Standardization, also called z-score normalization, which transforms all the values of a feature $\left( X_i \right)_j$, $j \in \{ 1,…,m \}$, $i=1..n$ into values that approximately follow a standard normal distribution:&lt;/p&gt;

\[\left( X_i \right)_j' = \frac{ \left( X_i \right)_j - \overline{\left( X_i \right)_j}}{ \sigma_{\left( X_i \right)_j} }\]
  &lt;/li&gt;
&lt;/ul&gt;
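&lt;p&gt;Both techniques can be sketched in plain Python, with the minimum, maximum, mean and standard deviation computed over the $n$ observed values of a feature (the feature values below are illustrative):&lt;/p&gt;

```python
import math

def min_max_scale(values, low=0.0, high=1.0):
    # Map the observed values of one feature to the interval [low, high]
    lo, hi = min(values), max(values)
    return [low + (high - low) * (v - lo) / (hi - lo) for v in values]

def standardize(values):
    # Z-score normalization: subtract the sample mean, divide by the
    # (population) sample standard deviation
    n = len(values)
    mean = sum(values) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    return [(v - mean) / std for v in values]

returns = [0.02, -0.01, 0.05, 0.00]  # hypothetical values of one feature
print(min_max_scale(returns))
print(standardize(returns))
```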

&lt;p&gt;In the context of supervised portfolios, additional techniques are described in Chevalier et al.&lt;sup id=&quot;fnref:1:21&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;Quantile normal transformation for a “time series”-like feature, which &lt;em&gt;standardizes the time-series into quantile and then map the values to a normal distribution&lt;/em&gt;&lt;sup id=&quot;fnref:1:22&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;&lt;/p&gt;

    &lt;p&gt;It is important to note that at any given date, the quantiles should be computed using information up to that date only to avoid &lt;em&gt;forward looking leakage&lt;/em&gt;&lt;sup id=&quot;fnref:1:23&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

    &lt;p&gt;In addition, a lookback window over which to compute the quantiles should be chosen, with possible impacts on the performances of the supervised machine learning algorithm.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Cross sectional normalization for a regular feature, which &lt;em&gt;scales the cross sectional values between 0 and 1 using the empirical cumulative distribution function&lt;/em&gt;&lt;sup id=&quot;fnref:1:24&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;&lt;/p&gt;

    &lt;p&gt;At any given date, this normalization can be performed either fully in the cross-section at that date (if there are enough assets), or in the cross-section at that date but using information up to that date to compute the empirical cumulative distribution function.&lt;/p&gt;

    &lt;p&gt;In the latter case, c.f. the previous point.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Hyperbolic tangent function ($\tanh$) scaling for labels, in order to &lt;em&gt;center [them] and make them more comparable by taming outliers&lt;/em&gt;&lt;sup id=&quot;fnref:1:25&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

\[Y' = 0.5 \tanh{\left( 0.01 \frac{Y - \overline{Y}}{ \sigma_Y } \right) }\]

    &lt;p&gt;Naturally, &lt;em&gt;the reverse transformation is performed after the prediction to transform back the labels into its original values&lt;/em&gt;&lt;sup id=&quot;fnref:1:26&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;
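&lt;p&gt;As an illustration of the last technique, below is a sketch of the $\tanh$ label scaling and of its reverse transformation, using the sample mean and standard deviation of the labels; the labels themselves are made up:&lt;/p&gt;

```python
import math

def tanh_scale(y, mean, std):
    # Y' = 0.5 * tanh(0.01 * (Y - mean(Y)) / std(Y))
    return 0.5 * math.tanh(0.01 * (y - mean) / std)

def tanh_unscale(y_scaled, mean, std):
    # Reverse transformation, applied after prediction to map the
    # labels back to their original scale
    return mean + std * math.atanh(y_scaled / 0.5) / 0.01

labels = [0.10, 0.25, 0.40, 0.25]  # hypothetical labels
n = len(labels)
mean = sum(labels) / n
std = math.sqrt(sum((y - mean) ** 2 for y in labels) / n)

scaled = [tanh_scale(y, mean, std) for y in labels]
recovered = [tanh_unscale(s, mean, std) for s in scaled]
print(scaled)
print(recovered)
```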

&lt;p&gt;Finally, in the specific context of $k$-NN-based supervised portfolios, two additional techniques are described in Varadi and Teed&lt;sup id=&quot;fnref:2:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, which are variations of the techniques of Chevalier et al.&lt;sup id=&quot;fnref:1:27&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;h3 id=&quot;distance-metric-selection&quot;&gt;Distance metric selection&lt;/h3&gt;

&lt;p&gt;As already mentioned in the previous sub-section, the distance metric used with a nearest neighbor method influences the definition of a nearest neighbor due to its scale variant or scale invariant nature.&lt;/p&gt;

&lt;p&gt;But that’s not all, because different distance metrics behave differently with regards to outliers, to noise, to the dimension of the feature space, etc. On top of that, the chosen distance metric is sometimes not a proper metric&lt;sup id=&quot;fnref:43:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:43&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;…&lt;/p&gt;

&lt;p&gt;So, what to do in the specific context of $k$-NN-based supervised portfolios?&lt;/p&gt;

&lt;p&gt;From the empirical results in Varadi and Teed&lt;sup id=&quot;fnref:2:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, the Euclidean distance seems to be a good choice as long as the chosen predictors are properly scaled.&lt;/p&gt;

&lt;p&gt;From the empirical results later in this blog post, a little-known distance metric called the &lt;em&gt;Hassanat distance&lt;/em&gt;&lt;sup id=&quot;fnref:44&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:44&quot; class=&quot;footnote&quot;&gt;32&lt;/a&gt;&lt;/sup&gt; also seems to be a good choice and additionally does not require&lt;sup id=&quot;fnref:47&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:47&quot; class=&quot;footnote&quot;&gt;33&lt;/a&gt;&lt;/sup&gt; the chosen predictors to be scaled because it is scale invariant&lt;sup id=&quot;fnref:46&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:46&quot; class=&quot;footnote&quot;&gt;34&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;That distance - noted $HasD(x,y)$ - is defined between two vectors $x = \left(x_1,…,x_m\right)$ and $y = \left(y_1,…,y_m\right)$ as follows:&lt;/p&gt;

\[HasD(x,y) = \sum_{i=1}^m D(x_i,y_i)\]

&lt;p&gt;, with&lt;/p&gt;

\[D(x_i,y_i) = \begin{cases}  1 - \frac{1 + \min(x_i,y_i)}{1 + \max(x_i,y_i)}, &amp;amp;\text{if } \min(x_i,y_i) \geq 0 \\ 1 - \frac{1}{1 + \max(x_i,y_i) - \min(x_i,y_i) }, &amp;amp;\text{if } \min(x_i,y_i) &amp;lt; 0 \end{cases}\]
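&lt;p&gt;A direct Python transcription of these formulas (a sketch, with inputs taken as plain sequences of numbers):&lt;/p&gt;

```python
def hassanat_component(xi, yi):
    # Per-dimension term D(x_i, y_i) of the Hassanat distance
    lo, hi = min(xi, yi), max(xi, yi)
    if lo >= 0:
        return 1 - (1 + lo) / (1 + hi)
    return 1 - 1 / (1 + hi - lo)

def hassanat_distance(x, y):
    # HasD(x, y) is the sum of the per-dimension terms
    return sum(hassanat_component(xi, yi) for xi, yi in zip(x, y))

print(hassanat_distance([0, 0], [10, -10]))
```

&lt;p&gt;Each per-dimension term lies in $[0,1)$, which caps the influence of any single feature whatever its scale.&lt;/p&gt;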

&lt;p&gt;Figure 5 illustrates the 1-dimensional Hassanat distance $HasD(0,n)$ with $n \in [-10,10]$.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/supervised-portfolios-hassanat-distance.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/supervised-portfolios-hassanat-distance-small.png&quot; alt=&quot;Representation of the 1-dimensional Hassanat distance between the points 0 and n. Source: Abu Alfeilat et al.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 5. Representation of the 1-dimensional Hassanat distance between the points 0 and n. Source: Abu Alfeilat et al.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;As a side note, the Hassanat distance has been empirically demonstrated to &lt;em&gt;perform the best when applied on most data sets comparing with the other tested distances&lt;/em&gt;&lt;sup id=&quot;fnref:42&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:42&quot; class=&quot;footnote&quot;&gt;35&lt;/a&gt;&lt;/sup&gt; in Abu Alfeilat et al.&lt;sup id=&quot;fnref:42:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:42&quot; class=&quot;footnote&quot;&gt;35&lt;/a&gt;&lt;/sup&gt;, which compares the performances of 54 distance metrics used in $k$-NN classification.&lt;/p&gt;

&lt;h3 id=&quot;how-to-select-the-number-of-nearest-neighbors&quot;&gt;How to select the number of nearest neighbors?&lt;/h3&gt;

&lt;p&gt;Together with the distance metric $d$, the number of nearest neighbors $k$ is the other hyperparameter that has to be selected in nearest neighbor methods.&lt;/p&gt;

&lt;p&gt;Varadi and Teed&lt;sup id=&quot;fnref:2:13&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; explains:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;The choice of the number of nearest matches (or neighbors) is the $k$ in $k$-NN.&lt;/p&gt;

  &lt;p&gt;This is an important variable that allows [one] to trade off accuracy versus reliability. Choosing a value for $k$ that is too high will lead to matches that are not appropriate to the current case. Choosing a value that is too low will lead to exact matches but poor generalizability and high sensitivity to noise.&lt;/p&gt;

  &lt;p&gt;The optimal value for K that maximizes out-of-sample forecast accuracy will vary depending on the data and the features chosen.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;In practice, &lt;em&gt;the number of nearest neighbors $k$ […] [is] usually selected via cross-validation or more simply data splitting&lt;/em&gt;&lt;sup id=&quot;fnref:7:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt; and &lt;em&gt;the selected [value] minimizes an objective function which is often &lt;a href=&quot;https://en.wikipedia.org/wiki/Root_mean_square_deviation&quot;&gt;the Root Mean Square Error (RMSE)&lt;/a&gt; 
or sometimes &lt;a href=&quot;https://en.wikipedia.org/wiki/Mean_absolute_error&quot;&gt;the Mean Absolute Error (MAE)&lt;/a&gt;&lt;/em&gt;&lt;sup id=&quot;fnref:21&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:21&quot; class=&quot;footnote&quot;&gt;36&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
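&lt;p&gt;Such a selection by data splitting can be sketched as follows, with a univariate $k$-NN regression, hypothetical data and the RMSE as objective function:&lt;/p&gt;

```python
import math

def knn_predict(train_x, train_y, x, k):
    # Univariate k-NN regression: average the labels of the k nearest points
    order = sorted(range(len(train_x)), key=lambda i: abs(train_x[i] - x))
    return sum(train_y[i] for i in order[:k]) / k

def select_k(train_x, train_y, valid_x, valid_y, candidates):
    # Select the k minimizing the RMSE on the validation split
    def rmse(k):
        errors = [(knn_predict(train_x, train_y, x, k) - y) ** 2
                  for x, y in zip(valid_x, valid_y)]
        return math.sqrt(sum(errors) / len(errors))
    return min(candidates, key=rmse)

# Hypothetical noiseless relationship y = 2x, split into train/validation
train_x, train_y = [0.0, 1.0, 2.0, 3.0, 4.0], [0.0, 2.0, 4.0, 6.0, 8.0]
valid_x, valid_y = [0.5, 2.5], [1.0, 5.0]

print(select_k(train_x, train_y, valid_x, valid_y, candidates=[1, 2, 3]))
```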

&lt;p&gt;That being said, Guegan and Huck&lt;sup id=&quot;fnref:21:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:21&quot; class=&quot;footnote&quot;&gt;36&lt;/a&gt;&lt;/sup&gt; cautions about that practice by highlighting that:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;&lt;em&gt;The estimation of $k$ via in sample predictions leads to choose high values, near or on the border of one has tabulated because the RMSE is a decreasing function of the number of neighbors&lt;/em&gt;&lt;sup id=&quot;fnref:21:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:21&quot; class=&quot;footnote&quot;&gt;36&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
  &lt;li&gt;A high value for the number of nearest neighbors is &lt;em&gt;an erroneous usage of the [method] because the neighbors are thus not near the pattern they should mimic&lt;/em&gt;&lt;sup id=&quot;fnref:21:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:21&quot; class=&quot;footnote&quot;&gt;36&lt;/a&gt;&lt;/sup&gt;, leading to (useless) forecasts &lt;em&gt;very close to the mean of the sample&lt;/em&gt;&lt;sup id=&quot;fnref:21:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:21&quot; class=&quot;footnote&quot;&gt;36&lt;/a&gt;&lt;/sup&gt;.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Another &lt;em&gt;direction is to adaptively choose the number of nearest neighbors $k$ […] depending on the test feature vector&lt;/em&gt;&lt;sup id=&quot;fnref:7:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;For example, Anava and Levy&lt;sup id=&quot;fnref:49&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:49&quot; class=&quot;footnote&quot;&gt;37&lt;/a&gt;&lt;/sup&gt; proposes &lt;em&gt;solving an optimization problem to adaptively choose what $k$ to use for [a given feature vector] in an approach called $k^*$-NN&lt;/em&gt;&lt;sup id=&quot;fnref:7:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In the specific context of $k$-NN-based supervised portfolios, and again to avoid choosing an explicit number of nearest neighbors, Varadi and Teed&lt;sup id=&quot;fnref:2:14&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; suggests to select &lt;em&gt;a range&lt;sup id=&quot;fnref:54&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:54&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt; of $k$’s to make [the] selection base more robust to potential changes in an “optimal” $k$ selection&lt;/em&gt;&lt;sup id=&quot;fnref:2:15&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Surprisingly, it turns out that this method is an ensemble method similar in spirit to the method described in Hassanat et al.&lt;sup id=&quot;fnref:50&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:50&quot; class=&quot;footnote&quot;&gt;39&lt;/a&gt;&lt;/sup&gt; for $k$-NN classification, which consists in using a base $k$-NN classifier with $k=1,2,…,\lfloor \sqrt{n} \rfloor$ and to combine the $\lfloor \sqrt{n} \rfloor$ classification results using inverse logarithmic weights.&lt;/p&gt;
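&lt;p&gt;Transposed to regression, this ensemble idea might be sketched as follows; the inverse logarithmic weights $1/\ln(1+k)$ are my own reading of the description above, and the base predictor and data are made up:&lt;/p&gt;

```python
import math

def knn_predict(train_x, train_y, x, k):
    # Univariate base k-NN regression
    order = sorted(range(len(train_x)), key=lambda i: abs(train_x[i] - x))
    return sum(train_y[i] for i in order[:k]) / k

def ensemble_knn_predict(train_x, train_y, x):
    # Run the base predictor for k = 1, ..., floor(sqrt(n)) and blend the
    # results with (assumed) inverse logarithmic weights 1 / ln(1 + k)
    n = len(train_x)
    ks = range(1, math.isqrt(n) + 1)
    blend_weights = [1 / math.log(1 + k) for k in ks]
    predictions = [knn_predict(train_x, train_y, x, k) for k in ks]
    return (sum(w * p for w, p in zip(blend_weights, predictions))
            / sum(blend_weights))

train_x = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
train_y = [x * x for x in train_x]  # hypothetical labels
print(ensemble_knn_predict(train_x, train_y, 4.2))
```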

&lt;h3 id=&quot;misc-remarks&quot;&gt;Misc. remarks&lt;/h3&gt;

&lt;h4 id=&quot;importance-of-training-dataset-diversity&quot;&gt;Importance of training dataset diversity&lt;/h4&gt;

&lt;p&gt;Asymptotic convergence results for the $k$-NN regression algorithm guarantee that &lt;em&gt;by increasing the amount of [training] data, […] the error probability gets arbitrarily close to the optimum for every training sequence&lt;/em&gt;&lt;sup id=&quot;fnref:15:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:15&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;But the amount of data available for training $k$-NN-based supervised portfolios is not infinite and might even in some cases be extremely limited&lt;sup id=&quot;fnref:51&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:51&quot; class=&quot;footnote&quot;&gt;40&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In that case, there is a high risk that the training data is “unevenly balanced” in the feature space, a situation illustrated in Figure 6 in the case of a univariate feature whose underlying distribution is Gaussian.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/supervised-portfolios-importance-training-set-size-chen.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/supervised-portfolios-importance-training-set-size-chen-small.png&quot; alt=&quot;Univariate $k$-NN regression with low accuracy, Gaussian feature distribution, far away training data. Source: Chen and Shah.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 6. Univariate $k$-NN regression with low accuracy, Gaussian feature distribution, far away training data. Source: Chen and Shah.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;From Figure 6, it is clear that such a lack of training data - or more precisely, such a lack of diversity in the training data - would force the $k$-NN regression algorithm to use far away nearest neighbors, which would severely degrade the quality of the forecasted portfolio weights.&lt;/p&gt;

&lt;p&gt;So, particular attention must be paid to the size and the diversity of the training dataset when using $k$-NN-based supervised portfolios, with for example ad-hoc procedures used whenever needed to simulate past asset returns for assets without return histories (&lt;a href=&quot;/blog/the-mathematics-of-bonds-simulating-the-returns-of-constant-maturity-government-bond-etfs/&quot;&gt;here&lt;/a&gt;) 
or to extend return histories for assets with shorter return histories than others (&lt;a href=&quot;/blog/managing-missing-asset-returns-in-portfolio-analysis-and-optimization-backfilling-through-residuals-recycling/&quot;&gt;here&lt;/a&gt;).&lt;/p&gt;

&lt;h4 id=&quot;avoiding-the-curse-of-dimensionality&quot;&gt;Avoiding the curse of dimensionality&lt;/h4&gt;

&lt;p&gt;The number of features selected by Varadi and Teed&lt;sup id=&quot;fnref:2:16&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; grows quadratically with the number of assets.&lt;/p&gt;

&lt;p&gt;At some point&lt;sup id=&quot;fnref:35&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:35&quot; class=&quot;footnote&quot;&gt;41&lt;/a&gt;&lt;/sup&gt;, the underlying $k$-NN regression algorithm will then inevitably face issues due to:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Distance concentration, which is &lt;em&gt;the tendency of distances between all pairs of points in high-dimensional data to become almost equal&lt;/em&gt;&lt;sup id=&quot;fnref:24&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:24&quot; class=&quot;footnote&quot;&gt;42&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
  &lt;li&gt;Poor discrimination of the nearest and farthest points for a given test point, which is an issue on top of the distance concentration problem, c.f. Beyer et al.&lt;sup id=&quot;fnref:34&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:34&quot; class=&quot;footnote&quot;&gt;43&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
  &lt;li&gt;Hubness&lt;sup id=&quot;fnref:24:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:24&quot; class=&quot;footnote&quot;&gt;42&lt;/a&gt;&lt;/sup&gt;, defined as the emergence of points called hubs which appear overly similar to many others&lt;/li&gt;
  &lt;li&gt;…&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;In addition, the higher the number of features selected, the more training data is required to learn enough combinations of these different features, which further compounds the problem mentioned in the previous sub-section…&lt;/p&gt;

&lt;p&gt;All in all, that approach is not scalable, but fortunately a solution is also proposed in Varadi and Teed&lt;sup id=&quot;fnref:2:17&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;[…] to explore multi-asset portfolios [without introducing the problem of dimensionality with too-large a feature space], we took the average weight of each security from a single-pair run, and averaged them across all pair runs.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;While this proposal may look like an ad-hoc workaround, it actually corresponds to an ensemble method that has been empirically shown to be effective for $k$-NN classification in high dimension in both:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Domeniconi and Yan&lt;sup id=&quot;fnref:36&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:36&quot; class=&quot;footnote&quot;&gt;44&lt;/a&gt;&lt;/sup&gt;, with a deterministic selection of features as in Varadi and Teed&lt;sup id=&quot;fnref:2:18&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
  &lt;li&gt;Bay&lt;sup id=&quot;fnref:37&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:37&quot; class=&quot;footnote&quot;&gt;45&lt;/a&gt;&lt;/sup&gt;, with a random&lt;sup id=&quot;fnref:38:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:38&quot; class=&quot;footnote&quot;&gt;27&lt;/a&gt;&lt;/sup&gt; selection of features&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The underlying idea of that ensemble method is to &lt;em&gt;exploit [the] instability of $k$-NN classifiers with respect to different choices of features to generate an effective and diverse set of NN classifiers with possibly uncorrelated errors&lt;/em&gt;&lt;sup id=&quot;fnref:36:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:36&quot; class=&quot;footnote&quot;&gt;44&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
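To make this ensemble method concrete, below is a minimal Python sketch (the function names and toy data are mine, not taken from any of the cited papers): one $k$-NN prediction is computed per feature subset - for example, one subset per asset pair as in Varadi and Teed - and the predicted weight vectors are then averaged.

```python
import numpy as np

def knn_predict_weights(X_train, Y_train, x_test, k):
    """Predict a weight vector for x_test by averaging the label vectors
    (here, in-sample optimal portfolio weights) of its k nearest training
    points under the Euclidean distance."""
    d = np.linalg.norm(X_train - x_test, axis=1)
    nearest = np.argsort(d)[:k]
    return Y_train[nearest].mean(axis=0)

def ensemble_knn_predict_weights(X_train, Y_train, x_test, k, feature_subsets):
    """Ensemble k-NN: run one k-NN prediction per subset of features and
    average the resulting weight vectors, mirroring the averaging of
    single-pair runs in Varadi and Teed."""
    predictions = [knn_predict_weights(X_train[:, s], Y_train, x_test[s], k)
                   for s in feature_subsets]
    return np.mean(predictions, axis=0)
```

Because each prediction is a convex combination of valid weight vectors, the ensemble output remains a valid fully-invested, long-only weight vector.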

&lt;h2 id=&quot;implementations&quot;&gt;Implementations&lt;/h2&gt;

&lt;h3 id=&quot;implementation-in-portfolio-optimizer&quot;&gt;Implementation in Portfolio Optimizer&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;Portfolio Optimizer&lt;/strong&gt; supports $k$-NN-based supervised portfolios through the endpoint &lt;a href=&quot;https://docs.portfoliooptimizer.io/&quot;&gt;&lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;/portfolios/optimization/supervised/nearest-neighbors-based&lt;/code&gt;&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;This endpoint supports 2 different distance metrics:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;The Euclidean distance metric&lt;/li&gt;
  &lt;li&gt;The Hassanat distance metric (default)&lt;/li&gt;
&lt;/ul&gt;
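Portfolio Optimizer's exact implementation is not public, but the Hassanat distance itself is easy to sketch; the function below follows my reading of its original definition, summing over each coordinate a bounded dissimilarity term, which explains why no prior feature scaling is needed:

```python
import numpy as np

def hassanat_distance(a, b):
    """Hassanat distance: a bounded, scale-insensitive dissimilarity that
    sums, over each coordinate, 1 - (1 + min) / (1 + max), after shifting
    both values by |min| whenever min(a_i, b_i) is negative."""
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    lo, hi = np.minimum(a, b), np.maximum(a, b)
    shift = np.where(lo < 0, -lo, 0.0)  # shift so the smaller value is >= 0
    return float(np.sum(1.0 - (1.0 + lo + shift) / (1.0 + hi + shift)))
```

Identical vectors are at distance 0, and each coordinate contributes strictly less than 1 to the total, whatever the scale of the inputs.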

&lt;p&gt;As for the selection of the number of nearest neighbors, this endpoint supports:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;A manually-defined number of nearest neighbors&lt;/li&gt;
  &lt;li&gt;A dynamically-determined number of nearest neighbors together with their individual weights through:
    &lt;ul&gt;
      &lt;li&gt;The $k$-NN ensemble method of Hassanat et al.&lt;sup id=&quot;fnref:50:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:50&quot; class=&quot;footnote&quot;&gt;39&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
      &lt;li&gt;A proprietary variation of the $k^*$-NN method of Anava and Levy&lt;sup id=&quot;fnref:49:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:49&quot; class=&quot;footnote&quot;&gt;37&lt;/a&gt;&lt;/sup&gt; (default)&lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;
&lt;/ul&gt;
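The proprietary variation is, by definition, undocumented, but the published $k^*$-NN method can be sketched as follows, based on my reading of Anava and Levy's paper (in which the distances are additionally scaled by a Lipschitz constant, omitted here; that scaling influences how many neighbors end up selected):

```python
import numpy as np

def k_star_nn_weights(distances):
    """Greedy weight computation of the k*-NN method (Anava and Levy):
    neighbors are added in order of increasing distance while the running
    threshold lambda exceeds the next distance; selected neighbors then get
    weights proportional to (lambda - distance). Returned weights are
    ordered by increasing distance."""
    beta = np.sort(np.asarray(distances, dtype=float))
    n = len(beta)
    k, lam = 0, beta[0] + 1.0
    while k < n and lam > beta[k]:
        k += 1
        c, v = beta[:k].sum(), (beta[:k] ** 2).sum()
        lam = (c + np.sqrt(max(k + c * c - k * v, 0.0))) / k
    w = np.maximum(lam - beta, 0.0)
    return w / w.sum()
```

With one neighbor much closer than the rest, the method collapses to 1-NN; with many comparably close neighbors, it spreads the weights over all of them, which is the adaptive behavior discussed later in this post.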

&lt;h3 id=&quot;implementation-elsewhere&quot;&gt;Implementation elsewhere&lt;/h3&gt;

&lt;p&gt;Chevalier et al.&lt;sup id=&quot;fnref:1:28&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; kindly provide &lt;a href=&quot;https://colab.research.google.com/drive/1yrhuV5i_o2g0Ju-xJEyONR5s2PFqyk3k?usp=shari#scrollTo=nsZYmc5nLF75&quot;&gt;Python code&lt;/a&gt; to experiment with “gradient boosting decision trees”-based supervised portfolios.&lt;/p&gt;

&lt;h2 id=&quot;example-of-usage---learning-maximum-sharpe-ratio-portfolios&quot;&gt;Example of usage - Learning maximum Sharpe ratio portfolios&lt;/h2&gt;

&lt;p&gt;Because &lt;em&gt;most portfolio allocation decisions for active portfolio managers revolve around the optimal allocation between stocks and bonds&lt;/em&gt;&lt;sup id=&quot;fnref:2:19&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, I propose to reproduce the results of Varadi and Teed&lt;sup id=&quot;fnref:2:20&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; in the case of a 2-asset class portfolio made of:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;U.S. equities, represented by the SPY ETF&lt;/li&gt;
  &lt;li&gt;U.S. long-term Treasury bonds, represented by the TLT ETF&lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;methodology&quot;&gt;Methodology&lt;/h3&gt;

&lt;p&gt;Varadi and Teed&lt;sup id=&quot;fnref:2:21&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; follow the general procedure of Chevalier et al.&lt;sup id=&quot;fnref:1:29&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; to train a $k$-NN-based supervised portfolio allocation algorithm for learning portfolio weights maximizing the Sharpe ratio.&lt;/p&gt;

&lt;p&gt;For this, and without entering into the details:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;The selected features are asset returns, standard deviations and correlations over different&lt;sup id=&quot;fnref:14:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:14&quot; class=&quot;footnote&quot;&gt;28&lt;/a&gt;&lt;/sup&gt; past lookback periods, scaled through a specific normal distribution standardization&lt;/li&gt;
  &lt;li&gt;The selected distance metric is the standard Euclidean distance&lt;/li&gt;
  &lt;li&gt;The selected number of nearest neighbors is not a single value but a range of values related to the size of the training dataset&lt;sup id=&quot;fnref:54:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:54&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
  &lt;li&gt;The relevant initial training dates are the 2000 daily dates present in Varadi and Teed&lt;sup id=&quot;fnref:2:22&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;’s dataset from 4/13/1976 minus 2000 days to 4/12/1976&lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The relevant subsequent training dates and test dates are all the daily dates present in Varadi and Teed&lt;sup id=&quot;fnref:2:23&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;’s dataset from 4/13/1976 to 12/31/2013&lt;/p&gt;

    &lt;p&gt;Note that the training data is used in a rolling window manner over a 2000-day lookback.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;The future horizon over which maximum Sharpe ratio portfolio weights are learned during the training phase and evaluated during the test phase is a 20-day horizon&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;On my side:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;The selected features will be:
    &lt;ul&gt;
      &lt;li&gt;Past 12-month asset arithmetic returns, cross-sectionally normalized using the procedure described in Almgren and Neil&lt;sup id=&quot;fnref:52&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:52&quot; class=&quot;footnote&quot;&gt;46&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
      &lt;li&gt;Future aggregated asset covariances forecasted over the next month using &lt;a href=&quot;/blog/from-volatility-forecasting-to-covariance-matrix-forecasting-the-return-of-simple-and-exponentially-weighted-moving-average-models/&quot;&gt;an exponentially weighted moving average covariance matrix forecasting model&lt;/a&gt; with daily squared (close-to-close) returns&lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The selected distance metric will be the Hassanat distance&lt;/p&gt;

    &lt;p&gt;This avoids the need for further feature scaling.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;The selected number of nearest neighbors will be:
    &lt;ul&gt;
      &lt;li&gt;1&lt;/li&gt;
      &lt;li&gt;10&lt;/li&gt;
      &lt;li&gt;Dynamically determined with their individual weights by:
        &lt;ul&gt;
          &lt;li&gt;The $k$-NN ensemble method of Hassanat et al.&lt;sup id=&quot;fnref:50:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:50&quot; class=&quot;footnote&quot;&gt;39&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
          &lt;li&gt;The $k^*$-NN method of Anava and Levy&lt;sup id=&quot;fnref:49:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:49&quot; class=&quot;footnote&quot;&gt;37&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
        &lt;/ul&gt;
      &lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The relevant initial training dates will be all month-end dates present in a SPY/TLT ETFs-like training dataset from 1st January 1979 to 30th November 2003&lt;/p&gt;

    &lt;p&gt;Due to the relatively recent inception dates of both the SPY ETF (22nd January 1993) and the TLT ETF (22nd July 2002), it is required to use proxies to extend the returns history of these assets:&lt;/p&gt;
    &lt;ul&gt;
      &lt;li&gt;The daily U.S. market returns $Mkt$ provided in &lt;a href=&quot;https://mba.tuck.dartmouth.edu/pages/faculty/ken.french/data_library.html&quot;&gt;the Fama and French data library&lt;/a&gt;, as a proxy for the SPY ETF daily returns&lt;/li&gt;
      &lt;li&gt;&lt;a href=&quot;/blog/the-mathematics-of-bonds-simulating-the-returns-of-constant-maturity-government-bond-etfs/&quot;&gt;The simulated daily returns&lt;/a&gt; associated to the daily FRED &lt;a href=&quot;https://fred.stlouisfed.org/series/DGS30&quot;&gt;30-Year Treasury Constant Maturity Rates&lt;/a&gt;, as a proxy for the TLT ETF daily returns&lt;/li&gt;
    &lt;/ul&gt;

    &lt;p&gt;With these, the earliest date for which daily SPY/TLT ETFs-like returns are available is 16th February 1977; adding 1 year of data for computing the past 12-month returns gives 16th February 1978; rounded to 1st January 1979.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The relevant subsequent training dates and test dates will be all month-end dates present in the SPY/TLT ETFs test dataset from 1st January 2004 to 28th February 2025&lt;sup id=&quot;fnref:53&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:53&quot; class=&quot;footnote&quot;&gt;47&lt;/a&gt;&lt;/sup&gt;&lt;/p&gt;

    &lt;p&gt;The earliest date for which daily SPY/TLT ETFs returns are available is 29th July 2002; adding 1 year of data for computing the past 12-month returns gives 29th July 2003; rounded to 1st January 2004.&lt;/p&gt;

    &lt;p&gt;Note that the training data is used in an expanding window manner.&lt;/p&gt;

    &lt;p&gt;As a consequence, the training dataset is made of 299 data points on 1st January 2004, expanding up to 552 data points on 28th February 2025 when the last forecast is made.&lt;/p&gt;

    &lt;p&gt;This is in stark contrast with Varadi and Teed&lt;sup id=&quot;fnref:2:24&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;’s training dataset which 1) contains 2000 data points and 2) is not expanding but is being rolled forward to &lt;em&gt;keep the algorithm more robust to market changes in feature relevance&lt;/em&gt;&lt;sup id=&quot;fnref:2:25&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

    &lt;p&gt;As mentioned in a previous section, such a difference in quantity and in “local” diversity of the training dataset might impact my results v.s. those of Varadi and Teed&lt;sup id=&quot;fnref:2:26&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The future horizon over which maximum Sharpe ratio portfolio weights are learned during the training phase and evaluated during the test phase is a 1-month horizon at the daily level&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The risk free rate is set to 0% when computing maximum Sharpe ratio portfolio weights&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;The cash portion of the different SPY/TLT portfolios - if any - is allocated to U.S. short-term Treasury bonds, represented by the SHY ETF&lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;results&quot;&gt;Results&lt;/h3&gt;

&lt;p&gt;Figure 7 compares the standard direct approach for maximizing the Sharpe ratio to the $k$-NN-based supervised portfolios approach, with the 4 choices of nearest neighbors proposed in the previous sub-section.&lt;/p&gt;

&lt;p&gt;In both cases, like in Varadi and Teed&lt;sup id=&quot;fnref:2:27&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, the same features are used as input to the two algorithms.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/supervised-portfolios-reproduction.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/supervised-portfolios-reproduction-small.png&quot; alt=&quot;MSR portfolio v.s. $k$-NN-based learned MSR portfolios, SPY/TLT ETFs, 1st January 2004 - 31st March 2025.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 7. MSR portfolio v.s. $k$-NN-based learned MSR portfolios, SPY/TLT ETFs, 1st January 2004 - 31st March 2025.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Summary statistics:&lt;/p&gt;

&lt;table&gt;
  &lt;thead&gt;
    &lt;tr&gt;
      &lt;th&gt;Portfolio&lt;/th&gt;
      &lt;th&gt;CAGR&lt;/th&gt;
      &lt;th&gt;Average Exposure&lt;/th&gt;
      &lt;th&gt;Annualized Sharpe Ratio&lt;/th&gt;
      &lt;th&gt;Maximum (Monthly) Drawdown&lt;/th&gt;
    &lt;/tr&gt;
  &lt;/thead&gt;
  &lt;tbody&gt;
    &lt;tr&gt;
      &lt;td&gt;Maximum Sharpe ratio (MSR)&lt;/td&gt;
      &lt;td&gt;~5.6%&lt;/td&gt;
      &lt;td&gt;~61%&lt;/td&gt;
      &lt;td&gt;~0.73&lt;/td&gt;
      &lt;td&gt;~14.4%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;$k$-NN learned MSR, $k=1$&lt;/td&gt;
      &lt;td&gt;~6.4%&lt;/td&gt;
      &lt;td&gt;~49%&lt;/td&gt;
      &lt;td&gt;~0.86&lt;/td&gt;
      &lt;td&gt;~14.4%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;$k$-NN learned MSR, $k=10$&lt;/td&gt;
      &lt;td&gt;~5.2%&lt;/td&gt;
      &lt;td&gt;~51%&lt;/td&gt;
      &lt;td&gt;~0.99&lt;/td&gt;
      &lt;td&gt;~15.8%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;$k$-NN learned MSR, $k=kEnsemble$&lt;/td&gt;
      &lt;td&gt;~5.4%&lt;/td&gt;
      &lt;td&gt;~51%&lt;/td&gt;
      &lt;td&gt;~1.01&lt;/td&gt;
      &lt;td&gt;~15.1%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;$k$-NN learned MSR, $k=k^*$&lt;/td&gt;
      &lt;td&gt;~6.8%&lt;/td&gt;
      &lt;td&gt;~49%&lt;/td&gt;
      &lt;td&gt;~1.00&lt;/td&gt;
      &lt;td&gt;~14.0%&lt;/td&gt;
    &lt;/tr&gt;
  &lt;/tbody&gt;
&lt;/table&gt;
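For reference, the summary statistics reported in this table can be computed from a series of monthly portfolio returns along the following lines (a minimal sketch, with the risk-free rate taken as 0% in the Sharpe ratio, consistent with the methodology above):

```python
import numpy as np

def summary_statistics(monthly_returns):
    """CAGR, annualized Sharpe ratio (risk-free rate 0%) and maximum
    (monthly) drawdown, computed from monthly arithmetic returns."""
    r = np.asarray(monthly_returns, dtype=float)
    n_years = len(r) / 12.0
    equity = np.cumprod(1.0 + r)
    cagr = equity[-1] ** (1.0 / n_years) - 1.0
    sharpe = np.sqrt(12.0) * r.mean() / r.std(ddof=1)
    drawdowns = 1.0 - equity / np.maximum.accumulate(equity)
    return {"cagr": cagr, "sharpe": sharpe, "max_drawdown": drawdowns.max()}
```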

&lt;h3 id=&quot;comments&quot;&gt;Comments&lt;/h3&gt;

&lt;p&gt;A couple of comments are in order:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;Consistent with Varadi and Teed&lt;sup id=&quot;fnref:2:28&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, &lt;em&gt;the results demonstrate that the [$k$-NN-based supervised portfolio allocation] approach tends to outperform [the direct] MVO portfolio allocation [approach] on a risk-adjusted basis&lt;/em&gt;&lt;sup id=&quot;fnref:2:29&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, with a Sharpe ratio ~18%-38% higher.&lt;/p&gt;

    &lt;p&gt;This is quite interesting to highlight since the objective of the direct approach is supposed to be the maximization of the portfolio Sharpe ratio!&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The average exposure of the MSR portfolio is ~61% v.s. a markedly lower exposure of ~50% for all the $k$-NN learned MSR portfolios&lt;/p&gt;

    &lt;p&gt;Since the Sharpe ratio of all the $k$-NN learned MSR portfolios is higher than that of the MSR portfolio, the changes in exposure must be pretty well “timed”.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The $k$-NN learned MSR portfolios with $k=10$ and $k=kEnsemble$ are nearly identical&lt;/p&gt;

    &lt;p&gt;This is confirmed by examining the underlying asset weights (not shown here).&lt;/p&gt;

    &lt;p&gt;The $k$-NN ensemble portfolio has the advantage of not requiring a specific value of $k$ to be chosen, though, and should definitely be preferred.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The $k$-NN learned MSR portfolios with $k=1$ and $k=k^*$ are close in terms of raw performance, but not in terms of Sharpe ratio&lt;/p&gt;

    &lt;p&gt;A closer look (not detailed here) reveals that this is because the $k$-NN learned MSR portfolio with $k=k^*$ regularly selects only 1 nearest neighbor when the other neighbors are too “far away”, but also regularly selects many more neighbors when they are “close enough”.&lt;/p&gt;

    &lt;p&gt;I interpret this as an empirical demonstration of the ability of the $k^*$-NN method of Anava and Levy&lt;sup id=&quot;fnref:49:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:49&quot; class=&quot;footnote&quot;&gt;37&lt;/a&gt;&lt;/sup&gt; &lt;em&gt;to adaptively choose the number of nearest neighbors $k$ […] depending on the test feature vector&lt;/em&gt;&lt;sup id=&quot;fnref:7:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The maximum drawdowns are comparable across all portfolios&lt;/p&gt;

    &lt;p&gt;This shows that the $k$-NN learned MSR portfolios, despite their attractive risk-adjusted performance, are not able to magically avoid “dramatic” events.&lt;/p&gt;

    &lt;p&gt;Another layer of risk management, better return predictors, or both, is probably needed for that.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The winner of this horse race is the $k$-NN learned MSR portfolio with $k=k^*$, but this comes at a price in terms of turnover v.s. the $k$-NN learned MSR portfolio with $k=kEnsemble$&lt;/p&gt;

    &lt;p&gt;Also consistent with Varadi and Teed&lt;sup id=&quot;fnref:2:30&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, the asset weights of the $k$-NN learned MSR portfolio with $k=kEnsemble$ (and with $k=10$) are relatively stable and on average similar to an equal weight portfolio, while those of the MSR portfolio &lt;em&gt;show considerable noise and turnover&lt;/em&gt;&lt;sup id=&quot;fnref:2:31&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

    &lt;p&gt;This is visible on the portfolio transition maps displayed in Figures 8 and 9.&lt;/p&gt;

    &lt;figure&gt;
      &lt;a href=&quot;/assets/images/blog/supervised-portfolios-reproduction-weights-knn-kensemble.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/supervised-portfolios-reproduction-weights-knn-kensemble-small.png&quot; alt=&quot;$k$-NN-based learned MSR portfolio, $k=kEnsemble$, SPY/TLT/SHY ETFs allocations through time, 1st January 2004 - 31st March 2025.&quot; /&gt;&lt;/a&gt;
      &lt;figcaption&gt;Figure 8. $k$-NN-based learned MSR portfolio, $k=kEnsemble$, SPY/TLT/SHY ETFs allocations through time, 1st January 2004 - 31st March 2025.&lt;/figcaption&gt;
  &lt;/figure&gt;

    &lt;figure&gt;
      &lt;a href=&quot;/assets/images/blog/supervised-portfolios-reproduction-weights-msr.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/supervised-portfolios-reproduction-weights-msr-small.png&quot; alt=&quot;MSR portfolio, SPY/TLT/SHY allocations through time, 1st January 2004 - 31st March 2025.&quot; /&gt;&lt;/a&gt;
      &lt;figcaption&gt;Figure 9. MSR portfolio, SPY/TLT/SHY allocations through time, 1st January 2004 - 31st March 2025.&lt;/figcaption&gt;
  &lt;/figure&gt;

    &lt;p&gt;For Varadi and Teed&lt;sup id=&quot;fnref:2:32&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, this demonstrates &lt;em&gt;the general uncertainty of the portfolio indicator inputs in aggregate&lt;/em&gt;&lt;sup id=&quot;fnref:2:33&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; and that $k$-NN learned MSR portfolio with $k=kEnsemble$ &lt;em&gt;manages to dynamically balance this uncertainty over time and shift more towards a probabilistic allocation that did not overweight or over-react to poor information&lt;/em&gt;&lt;sup id=&quot;fnref:2:34&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

    &lt;p&gt;This statement is slightly less applicable to the $k$-NN learned MSR portfolio with $k=k^*$, because its better raw performance is explained by a more aggressive allocation, resulting in a much higher turnover, as can be seen by comparing Figure 8 to Figure 10.&lt;/p&gt;

    &lt;figure&gt;
      &lt;a href=&quot;/assets/images/blog/supervised-portfolios-reproduction-weights-knn-kstar.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/supervised-portfolios-reproduction-weights-knn-kstar-small.png&quot; alt=&quot;$k$-NN-based learned MSR portfolio, $k=k^*$, SPY/TLT/SHY ETFs allocations through time, 1st January 2004 - 31st March 2025.&quot; /&gt;&lt;/a&gt;
      &lt;figcaption&gt;Figure 10. $k$-NN-based learned MSR portfolio, $k=k^*$, SPY/TLT/SHY ETFs allocations through time, 1st January 2004 - 31st March 2025.&lt;/figcaption&gt;
  &lt;/figure&gt;
  &lt;/li&gt;
&lt;/ul&gt;
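The turnover differences mentioned above can be quantified, for instance, as the average one-way turnover between consecutive rebalancing dates; a minimal sketch:

```python
import numpy as np

def average_turnover(weight_history):
    """Average one-way turnover: half the mean L1 distance between
    consecutive rows of portfolio weights (each row summing to 1)."""
    w = np.asarray(weight_history, dtype=float)
    return 0.5 * np.abs(np.diff(w, axis=0)).sum(axis=1).mean()
```

A constant allocation has a turnover of 0, while fully rotating between two assets at every rebalancing date has a turnover of 1 (100%) per period.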

&lt;h3 id=&quot;conclusion&quot;&gt;Conclusion&lt;/h3&gt;

&lt;p&gt;Exactly like in Varadi and Teed&lt;sup id=&quot;fnref:2:35&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, and despite the differences in implementation and in the size of the training dataset&lt;sup id=&quot;fnref:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;48&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;The results of this section show that &lt;em&gt;a traditional mean-variance/Markowitz/MPT framework under-performs [a $k$-NN-based supervised portfolio allocation] framework in terms of maximizing the Sharpe ratio&lt;/em&gt;&lt;sup id=&quot;fnref:2:36&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
  &lt;li&gt;&lt;em&gt;The data further implies that traditional MPT makes far too many trades and takes on too many extreme positions as a function of how it is supposed to generate portfolio weights&lt;/em&gt;&lt;sup id=&quot;fnref:2:37&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Varadi and Teed&lt;sup id=&quot;fnref:2:38&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; provide the following explanation:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;This occurs because the inputs - especially the returns - are very noisy and may also demonstrate non-linear or counter-intuitive relationships. In contrast, by learning how the inputs map historically to optimal portfolios at the asset level, the resulting [$k$-NN-based supervised portfolios] allocations drift in a more stable manner over time.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2 id=&quot;final-conclusion&quot;&gt;Final conclusion&lt;/h2&gt;

&lt;p&gt;Supervised portfolios as introduced in Chevalier et al.&lt;sup id=&quot;fnref:1:30&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; are able to &lt;em&gt;learn from past time series of in-sample optimal weights&lt;/em&gt;&lt;sup id=&quot;fnref:1:31&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; and &lt;em&gt;to infer the best weights from variables such as past performance, risk, and proxies of the macro-economic outlook&lt;/em&gt;&lt;sup id=&quot;fnref:1:32&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In this blog post, I empirically demonstrated that this capability allows one of their simplest embodiments - $k$-NN-based supervised portfolios - to outperform a traditional mean-variance framework that seeks to maximize the Sharpe ratio of a portfolio, which independently confirms the prior results of Varadi and Teed&lt;sup id=&quot;fnref:2:39&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;To keep discovering non-standard portfolio allocation frameworks, feel free to &lt;a href=&quot;https://www.linkedin.com/in/roman-rubsamen/&quot;&gt;connect with me on LinkedIn&lt;/a&gt; or to &lt;a href=&quot;https://twitter.com/portfoliooptim&quot;&gt;follow me on Twitter&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;–&lt;/p&gt;

&lt;div class=&quot;footnotes&quot; role=&quot;doc-endnotes&quot;&gt;
  &lt;ol&gt;
    &lt;li id=&quot;fn:1&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.tandfonline.com/doi/full/10.1080/14697688.2022.2122543&quot;&gt;Chevalier, G., Coqueret, G., &amp;amp; Raffinot, T. (2022). Supervised portfolios. Quantitative Finance, 22(12), 2275–2295&lt;/a&gt;. &lt;a href=&quot;#fnref:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:1:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;10&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;11&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:11&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;12&lt;/sup&gt;&lt;/a&gt; &lt;a 
href=&quot;#fnref:1:12&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;13&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:13&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;14&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:14&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;15&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:15&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;16&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:16&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;17&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:17&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;18&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:18&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;19&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:19&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;20&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:20&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;21&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:21&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;22&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:22&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;23&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:23&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;24&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:24&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;25&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:25&quot; class=&quot;reversefootnote&quot; 
role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;26&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:26&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;27&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:27&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;28&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:28&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;29&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:29&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;30&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:30&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;31&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:31&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;32&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:32&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;33&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:2&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://cssanalytics.wordpress.com/2014/05/06/adaptive-portfolio-allocations/&quot;&gt;David Varadi, Jason Teed, Adaptive Portfolio Allocations, NAAIM paper&lt;/a&gt;. &lt;a href=&quot;#fnref:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:2:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;10&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;11&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:11&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;12&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:12&quot; 
class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;13&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:13&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;14&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:14&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;15&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:15&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;16&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:16&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;17&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:17&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;18&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:18&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;19&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:19&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;20&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:20&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;21&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:21&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;22&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:22&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;23&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:23&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;24&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:24&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;25&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:25&quot; class=&quot;reversefootnote&quot; 
role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;26&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:26&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;27&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:27&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;28&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:28&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;29&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:29&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;30&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:30&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;31&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:31&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;32&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:32&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;33&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:33&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;34&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:34&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;35&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:35&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;36&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:36&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;37&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:37&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;38&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:38&quot; class=&quot;reversefootnote&quot; 
role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;39&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:39&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;40&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:3&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Varadi and Teed&lt;sup id=&quot;fnref:2:40&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; was submitted to the 2014 &lt;a href=&quot;https://naaim.org/&quot;&gt;NAAIM&lt;/a&gt; annual white paper competition known as the &lt;a href=&quot;https://naaim.org/programs/naaim-founders-award/&quot;&gt;NAAIM Founders Award&lt;/a&gt;. &lt;a href=&quot;#fnref:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:5&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Note that the data points could belong to a more generic space than $\mathbb{R}^m \times \mathbb{R}$. &lt;a href=&quot;#fnref:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:4&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;$\mathbb{R}^m$ is usually called the &lt;em&gt;feature space&lt;/em&gt;. &lt;a href=&quot;#fnref:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:43&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;In practice, $d$ might not necessarily be a proper metric; for example, like &lt;a href=&quot;https://en.wikipedia.org/wiki/Cosine_similarity&quot;&gt;the cosine “distance”&lt;/a&gt;, it might not satisfy the triangle inequality. &lt;a href=&quot;#fnref:43&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:43:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:9&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;For a historical perspective on the $k$-NN algorithm going beyond the usual technical report from Fix and Hodges&lt;sup id=&quot;fnref:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:10&quot; class=&quot;footnote&quot;&gt;49&lt;/a&gt;&lt;/sup&gt; and the seminal paper from Cover and Hart&lt;sup id=&quot;fnref:11:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:11&quot; class=&quot;footnote&quot;&gt;10&lt;/a&gt;&lt;/sup&gt;, the interested reader is referred to Chen and Shah&lt;sup id=&quot;fnref:6:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt;, which mentions that the $k$-NN classification algorithm was already described in a text from the early 11th century. &lt;a href=&quot;#fnref:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:7&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://ieeexplore.ieee.org/document/8384208&quot;&gt;George H. Chen; Devavrat Shah, Explaining the Success of Nearest Neighbor Methods in Prediction, now, 2018&lt;/a&gt;. &lt;a href=&quot;#fnref:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:7:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;10&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;11&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:11&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;12&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:15&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://link.springer.com/book/10.1007/978-3-319-25388-6&quot;&gt;Gerard Biau, Luc Devroye, Lectures on the Nearest Neighbor Method, Springer Series in the Data Sciences&lt;/a&gt;. &lt;a href=&quot;#fnref:15&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:15:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:15:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:15:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:15:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:15:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:15:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:15:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:11&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://ieeexplore.ieee.org/document/1053964&quot;&gt;Cover, T. M. and P. E. Hart (1967). “Nearest neighbor pattern classification”. IEEE Transactions on Information Theory&lt;/a&gt;. &lt;a href=&quot;#fnref:11&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:11:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:17&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Although &lt;em&gt;$n$ must be sufficiently large in order for there to exist a $k$ that satisfies the conditions&lt;/em&gt;&lt;sup id=&quot;fnref:6:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt; required by the main theorem of Jiang&lt;sup id=&quot;fnref:6:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:17&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:6&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://ojs.aaai.org/index.php/AAAI/article/view/4292&quot;&gt;Jiang, H. (2019). Non-Asymptotic Uniform Rates of Consistency for $k$-NN Regression. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 3999-4006&lt;/a&gt;. &lt;a href=&quot;#fnref:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:6:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:6:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:6:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:6:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:6:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:6:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:6:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:6:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:18&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://ieeexplore.ieee.org/document/5569740&quot;&gt;S. Sun and R. Huang, “An adaptive k-nearest neighbor algorithm,” 2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery, Yantai, China, 2010, pp. 91-94&lt;/a&gt;. &lt;a href=&quot;#fnref:18&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:19&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See for example &lt;a href=&quot;https://proceedings.neurips.cc/paper/1992/hash/26408ffa703a72e8ac0117e74ad46f33-Abstract.html&quot;&gt;P.Y. Simard, Y. LeCun and J. Decker, “Efficient pattern recognition using a new transformation distance,” In Advances in Neural Information Processing Systems, vol. 6, 1993, pp. 50-58&lt;/a&gt;, in which the Euclidean distance between images of handwritten digits is replaced by an ad-hoc distance invariant with respect to geometric transformations of such images (rotation, translation, scaling, etc.). &lt;a href=&quot;#fnref:19&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:25&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://dl.acm.org/doi/10.1145/276698.276876&quot;&gt;Har-Peled, S., P. Indyk, and R. Motwani (2012). “Approximate Nearest Neighbor: Towards Removing the Curse of Dimensionality.” Theory of Computing&lt;/a&gt;. &lt;a href=&quot;#fnref:25&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:26&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://dl.acm.org/doi/10.1145/258533.258653&quot;&gt;Kleinberg, J. M. (1997). “Two algorithms for nearest-neighbor search in high dimensions”. In: Symposium on Theory of Computing&lt;/a&gt;. &lt;a href=&quot;#fnref:26&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:23&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;For example, the end of each month for learning a monthly asset allocation strategy. &lt;a href=&quot;#fnref:23&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:30&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;A day, a week, a month, etc. &lt;a href=&quot;#fnref:30&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:27&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Also called &lt;em&gt;inference data&lt;/em&gt;, that is, data not “seen” during the training phase. &lt;a href=&quot;#fnref:27&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:29&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Like budget constraints, asset weights constraints, asset group constraints, portfolio exposure constraints, etc. &lt;a href=&quot;#fnref:29&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:28&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Chevalier et al.&lt;sup id=&quot;fnref:1:33&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; note that these empirical results &lt;em&gt;still hold when replacing boosted trees by simple regressions&lt;/em&gt;&lt;sup id=&quot;fnref:1:34&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:28&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:48&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Developed equities, emerging equities, global corporate bonds, global government bonds. &lt;a href=&quot;#fnref:48&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:31&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.pm-research.com/content/iijjfds/6/2/10&quot;&gt;Chevalier, Guillaume, Coqueret, Guillaume, Raffinot, Thomas, Interpretable Supervised Portfolios, The Journal of Financial Data Science, Spring 2024, 6(2), 10-34&lt;/a&gt;. &lt;a href=&quot;#fnref:31&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:31:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:31:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:20&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.degruyterbrill.com/document/doi/10.1524/stnd.2008.0917/html?srsltid=AfmBOoqmKAyZ2Y3692M6LMwYdPB7o_nLBKPzW5Ma-JPmsizet-DGNvor&quot;&gt;L. Gyorfi, F. Udina, and H. Walk. Nonparametric nearest neighbor based empirical portfolio selection strategies. Statistics &amp;amp; Decisions, International Mathematical Journal for Stochastic Methods and Models, 26(2):145–157, 2008&lt;/a&gt;. &lt;a href=&quot;#fnref:20&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:33&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Unless a very specific variation of $k$-NN regression is used. &lt;a href=&quot;#fnref:33&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:33:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:32&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;At least from an algorithm aversion perspective. Nevertheless, there can be other benefits, cf. Chevalier et al.&lt;sup id=&quot;fnref:31:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:31&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:32&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:38&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Similar to &lt;a href=&quot;https://en.wikipedia.org/wiki/Random_subspace_method&quot;&gt;random subspace optimization&lt;/a&gt;. &lt;a href=&quot;#fnref:38&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:38:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:14&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;1 month, 2 months, 3 months, 6 months and 12 months. &lt;a href=&quot;#fnref:14&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:14:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:14:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:14:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:39&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://arxiv.org/abs/2408.07706v1&quot;&gt;Avivit Levy, B. Riva Shalom, Michal Chalamish, A Guide to Similarity Measures, arXiv&lt;/a&gt; for a very long list of distance metrics. &lt;a href=&quot;#fnref:39&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:41&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Or more generally, to most supervised machine learning algorithms. &lt;a href=&quot;#fnref:41&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:40&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://api.semanticscholar.org/CorpusID:251472012&quot;&gt;Ishan Arora and Namit Khanduja and Mayank Bansal, Effect of Distance Metric and Feature Scaling on KNN Algorithm while Classifying X-rays, RIF, 2022&lt;/a&gt;. &lt;a href=&quot;#fnref:40&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:44&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;http://www.dx.doi.org/10.7537/marsjas100814.31&quot;&gt;Hassanat, A.B., 2014. Dimensionality Invariant Similarity Measure. Journal of American Science, 10(8), pp.221-26&lt;/a&gt;. &lt;a href=&quot;#fnref:44&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:47&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Note that feature scaling might still be performed to try to improve the performance of the $k$-NN regression algorithm. &lt;a href=&quot;#fnref:47&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:46&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Other properties of the Hassanat distance include, for example, robustness to noise and linear growth with the dimension of the feature space, cf. Abu Alfeilat et al.&lt;sup id=&quot;fnref:42:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:42&quot; class=&quot;footnote&quot;&gt;35&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:46&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:42&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://pubmed.ncbi.nlm.nih.gov/31411491/&quot;&gt;Abu Alfeilat HA, Hassanat ABA, Lasassmeh O, Tarawneh AS, Alhasanat MB, Eyal Salman HS, Prasath VBS. Effects of Distance Measure Choice on K-Nearest Neighbor Classifier Performance: A Review. Big Data. 2019 Dec;7(4):221-248&lt;/a&gt;. &lt;a href=&quot;#fnref:42&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:42:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:42:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:21&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://shs.cairn.info/revue-finance-2005-2-page-67?lang=en&quot;&gt;Guegan, D. and Huck, N. (2005). On the Use of Nearest Neighbors in Finance. Finance, 26(2), 67-86&lt;/a&gt;. &lt;a href=&quot;#fnref:21&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:21:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:21:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:21:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:21:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:49&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://papers.nips.cc/paper_files/paper/2016/hash/2c6ae45a3e88aee548c0714fad7f8269-Abstract.html&quot;&gt;Oren Anava, Kfir Levy, k*-Nearest Neighbors: From Global to Local, Advances in Neural Information Processing Systems 29 (NIPS 2016)&lt;/a&gt;. &lt;a href=&quot;#fnref:49&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:49:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:49:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:49:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:54&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;In more detail, Varadi and Teed&lt;sup id=&quot;fnref:2:41&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; choose the $k$’s &lt;em&gt;in percentages of the size of the training space, which were 5%, 10%, 15% and 20% resulting essentially in a weighted average of the top instances&lt;/em&gt;&lt;sup id=&quot;fnref:2:42&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:54&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:54:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:50&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://sites.google.com/site/ijcsis/all-volumes-issues/vol-12-no-8-aug-2014&quot;&gt;Hassanat, A.B., Mohammad Ali Abbadi, Ghada Awad Altarawneh, Ahmad Ali Alhasanat, 2014. Solving the Problem of the K Parameter in the KNN Classifier Using an Ensemble Learning Approach. International Journal of Computer Science and Information Security, 12(8), pp.33-39&lt;/a&gt;. &lt;a href=&quot;#fnref:50&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:50:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:50:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:51&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;For example, due to the limited price history of some assets or due to the length of the desired horizon over which optimal portfolio weights need to be computed. &lt;a href=&quot;#fnref:51&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:35&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Possibly &lt;em&gt;for as few as 10-15 dimensions&lt;/em&gt;&lt;sup id=&quot;fnref:34:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:34&quot; class=&quot;footnote&quot;&gt;43&lt;/a&gt;&lt;/sup&gt;! &lt;a href=&quot;#fnref:35&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:24&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://jmlr.org/papers/v11/radovanovic10a.html&quot;&gt;Radovanovic, Milos and Nanopoulos, Alexandros and Ivanovic, Mirjana, Hubs in Space: Popular Nearest Neighbors in High-Dimensional Data, Journal of Machine Learning Research 11 (2010) 2487-2531&lt;/a&gt;. &lt;a href=&quot;#fnref:24&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:24:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:34&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://link.springer.com/chapter/10.1007/3-540-49257-7_15&quot;&gt;Beyer, K., Goldstein, J., Ramakrishnan, R., Shaft, U. (1999). When Is “Nearest Neighbor” Meaningful?. In: Beeri, C., Buneman, P. (eds) Database Theory — ICDT’99. ICDT 1999. Lecture Notes in Computer Science, vol 1540. Springer, Berlin, Heidelberg&lt;/a&gt;. &lt;a href=&quot;#fnref:34&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:34:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:36&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;http://doi.ieeecomputersociety.org/10.1109/ICPR.2004.1334065&quot;&gt;Domeniconi, C., &amp;amp; Yan, B. (2004). Nearest neighbor ensemble. In International Conference on Pattern Recognition, Vol. 1 (pp. 228–231). Los Alamitos, CA, USA: IEEE Computer Society&lt;/a&gt;. &lt;a href=&quot;#fnref:36&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:36:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:37&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.sciencedirect.com/science/article/abs/pii/S1088467X99000189&quot;&gt;Bay S.D. Nearest neighbor classification from multiple feature subsets Intelligent Data Analysis, 3 (1999), pp. 191-209&lt;/a&gt;. &lt;a href=&quot;#fnref:37&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:52&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://ssrn.com/abstract=633801&quot;&gt;Almgren, Robert and Chriss, Neil A., Optimal Portfolios from Ordering Information (December 2004)&lt;/a&gt;. &lt;a href=&quot;#fnref:52&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:53&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;(Adjusted) daily prices have been retrieved using &lt;a href=&quot;https://api.tiingo.com/&quot;&gt;Tiingo&lt;/a&gt;. &lt;a href=&quot;#fnref:53&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:12&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;This empirically confirms that, despite their dependency on the size of the training dataset, nearest neighbor methods &lt;em&gt;can learn from a small set of examples&lt;/em&gt;&lt;sup id=&quot;fnref:37:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:37&quot; class=&quot;footnote&quot;&gt;45&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:12&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:10&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.jstor.org/stable/1403797&quot;&gt;Fix, E. and Hodges, J. L., “Discriminatory analysis, nonparametric discrimination: Consistency properties”. Technical report, USAF School of Aviation Medicine&lt;/a&gt;. &lt;a href=&quot;#fnref:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
  &lt;/ol&gt;
&lt;/div&gt;</content><author><name>Roman R.</name></author><category term="mean-variance optimization" /><category term="machine learning" /><summary type="html">Standard portfolio allocation algorithms like Markowitz mean-variance optimization or Choueifaty diversification ratio optimization usually take as input asset information (expected returns, estimated covariance matrix…) as well as investor constraints and preferences (maximum asset weights, risk aversion…) to produce as output portfolio weights satisfying a selected mathematical objective like the maximization of the portfolio Sharpe ratio or Diversification ratio. Chevalier et al.1 introduce a non-standard portfolio allocation framework - represented in Figure 1 - under which the same input is first used to “learn” in-sample optimized portfolio weights in a supervised training phase and then used to produce out-of-sample optimized portfolio weights in an inference phase. Figure 1. Standard vs. supervised portfolio allocation framework. Source: Adapted from Chevalier et al. In this blog post, I will provide some details about that framework when used with the $k$-nearest neighbors supervised machine learning algorithm, which is an idea originally proposed in Varadi and Teed23. As an example of usage, I will compare the performance of a $k$-nearest neighbors supervised portfolio with that of a “direct” mean-variance portfolio in the context of a monthly tactical asset allocation strategy for a 2-asset class portfolio made of U.S. equities and U.S. Treasury bonds. 
Mathematical preliminaries Supervised machine learning algorithms Let $\left( X_1, Y_1 \right)$, …, $\left( X_n, Y_n \right)$ be $n$ pairs of data points in4 $\mathbb{R}^m \times \mathbb{R}$, $m \geq 1$5, where: Each data point $X_1, …, X_n$ represents an object - like the pixels of an image - and is called a feature vector Each data point $Y_1,…,Y_n$ represents a characteristic of its associated object - like what kind of animal is depicted in an image (discrete characteristic) or the angle of the rotation between a rotated image and its original version (continuous characteristic) - and is called a label Given a feature vector $x \in \mathbb{R}^m$, the aim of a supervised machine learning algorithm is then to estimate the “most appropriate” label associated to $x$ - $\hat{y} \in \mathbb{R}$ - thanks to the information contained in the training dataset $\left( X_1, Y_1 \right)$, …, $\left( X_n, Y_n \right)$. $k$-nearest neighbors regression algorithm Let $d$ be a distance metric6 on $\mathbb{R}^m$, like the standard Euclidean distance. The $k$-nearest neighbor ($k$-NN) regression algorithm is an early7 [supervised] machine learning algorithm8 that uses the “neighborhood” of a feature vector in order to estimate its label. In more detail, let $\left( X_{(i)}(x), Y_{(i)}(x) \right)$, $i=1..n$ denote the $i$-th training data point closest to $x$ among all the training data points $\left( X_1, Y_1 \right)$, …, $\left( X_n, Y_n \right)$ such that the distance of each training data point to $x$ satisfies $d \left(x, X_{(1)}(x) \right)$ $\leq … \leq$ $d \left(x, X_{(n)}(x) \right)$. 
By definition, the $k$-NN estimate for the label associated to $x$ is then9 the uniformly or non-uniformly weighted average label of the $k \in \{ 1,…,n \}$ nearest neighbors $Y_{(1)}(x)$,…,$Y_{(k)}(x)$ \[\hat{y} = \frac{1}{k} \sum_{i=1}^k Y_{(i)}(x)\] or \[\hat{y} = \sum_{i=1}^k w_i Y_{(i)}(x)\] , where $w_i \geq 0$ is the weight associated to the $i$-th nearest neighbor $Y_{(i)}(x)$ and all the weights $w_i$, $i=1..k$ sum to one, that is, $\sum_{i=1}^k w_i = 1$. For illustration purposes, the process of selecting the 2 nearest neighbors $X_{(1)}(x)$ and $X_{(2)}(x)$ of a data point $x$ in $\mathbb{R}^2$ is outlined in Figure 2. Figure 2. Example of $k$-NN nearest neighbors selection process in m = 2 dimensions, with n = 3 training data points and k = 2 nearest neighbors. Notes: There additionally exists the $k$-NN classification algorithm, which is a variant of the $k$-NN regression algorithm where the label space is not $\mathbb{R}$ but a finite subset of $\mathbb{N}$. Theoretical guarantees Since the seminal paper of Cover and Hart10 - proving under mild conditions that the $k$-NN classification algorithm achieves an error rate that is at most twice the best error rate achievable8 -, several convergence results have been established for $k$-NN methods. For example, under an asymptotic regime where the number of training data points $n$ and the number of nearest neighbors $k$ both go to infinity, it has been demonstrated9 that the $k$-NN regression algorithm is able to learn any functional relationship of the form $Y_i = f \left( X_i \right) + \epsilon_i$, $i=1..n$, where $f$ is an unknown function and $\epsilon_i$ represents additive noise. As another example, this time under a finite11 sample regime, Jiang12 derives the first sup-norm finite-sample [“convergence”] result12 for the $k$-NN regression algorithm and shows that it achieves a maximum error rate that is equal to the best maximum error rate achievable up to logarithmic factors12, with high probability. 
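To make the two estimates above concrete, here is a minimal Python sketch of $k$-NN regression with the Euclidean distance (a toy illustration only, not the implementation used anywhere else in this post):

```python
import numpy as np

def knn_regress(x, X, Y, k, weights=None):
    """k-NN regression estimate of the label of x, given training pairs
    (X_i, Y_i): the uniformly (weights=None) or non-uniformly weighted
    average label of the k nearest neighbors of x."""
    X, Y = np.asarray(X, dtype=float), np.asarray(Y, dtype=float)
    # Euclidean distances d(x, X_i), i = 1..n
    d = np.linalg.norm(X - np.asarray(x, dtype=float), axis=1)
    # Indices of the k training data points closest to x
    nearest = np.argsort(d)[:k]
    if weights is None:
        return Y[nearest].mean(axis=0)  # y_hat = (1/k) * sum of nearest labels
    w = np.asarray(weights, dtype=float)
    return w @ Y[nearest]  # weights w_i >= 0 assumed to sum to one

# Toy usage: n = 3 training data points in R^2, with scalar labels
X = [[0.0, 0.0], [1.0, 0.0], [10.0, 10.0]]
Y = [1.0, 3.0, 100.0]
print(knn_regress([0.4, 0.0], X, Y, k=2))  # average of labels 1.0 and 3.0 -> 2.0
```
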
In addition to these convergence results, $k$-NN methods also exhibit interesting properties w.r.t. the dimensionality of the feature space $\mathbb{R}^m$. For example, while the curse of dimensionality forces non-parametric methods such as $k$-NN to require an exponential-in-dimension sample complexity12, the $k$-NN regression algorithm actually adapts to the local intrinsic dimension without any modifications to the procedure or data12. In other words, if the feature vectors belong to $\mathbb{R}^m$ but have a “true” dimensionality equal to $\mathbb{R}^p, p &amp;lt; m$, then the $k$-NN regression algorithm will [behave] as if it were in the lower dimensional space [of dimension $p$] and independent of the ambient dimension [$m$]12. Further properties of $k$-NN methods can be found in Chen and Shah8 and in Biau and Devroye9. Practical performances Like all supervised machine learning algorithms, the practical performances of the $k$-NN regression algorithm heavily depend on the problem at hand. Yet, in general, it often yields competitive results [v.s. other more complex algorithms like neural networks], and in certain domains, when cleverly combined with prior knowledge, it has significantly advanced the state-of-the-art13. 
Beyond these competitive performances, Chen and Shah8 also highlights other important practical aspects of $k$-NN methods that contributed to their empirical success over the years8: Their flexibility in choosing a problem-specific definition of “near” through a custom distance metric14 Their computational efficiency, which has enabled these methods to scale to massive datasets (“big data”)8 thanks to approaches like approximate nearest neighbor search15 or random projections16 Their non-parametric nature, in that they make very few assumptions on the underlying model for the data8 Their ease of interpretability, since they provide evidence for their predictions by exhibiting the nearest neighbors found8 $k$-NN based supervised portfolios Supervised portfolios Chevalier et al.1 describes an asset allocation strategy that engineers optimal weights before feeding them to a supervised learning algorithm1, represented in the lower part of Figure 1. Given a training dataset of past [financial] observations1 like past asset returns, past macroeconomic indicators, etc., it proceeds as follows: For any relevant date17 $t=t_1,…$ in the training dataset Compute optimal (in-sample) future portfolio weights $w_{t+1}$ over a (also in-sample) desired future horizon18, using a selected portfolio optimization algorithm with financial observations up to the time $t+1$ These optimal future portfolio weights are the labels $Y_t$, $t=t_1,…$, of the training data points. To be noted that by lagging the data, we can use the in-sample future realized returns to compute all the [returns-based] estimates1 required by the portfolio optimization algorithm like the expected asset returns, the asset covariance matrix, etc. This allows to be forward-looking in the training sample, while at the same time avoiding any look-ahead bias1. During this step, constraints can of course be added in order to satisfy targets and policies1. 
Compute a chosen set of predictors supposed to be linked to the in-sample future portfolio weights $w_{t+1}$, using financial observations up to the time $t$ These predictors are the feature vectors $X_t$, $t=t_1,…$, of the training data points. Train and tune a supervised machine learning algorithm using the training data points $\left( X_t, Y_t \right)$, $t=t_1,…$. Once the training phase is completed, the supervised portfolio allocation algorithm is ready to be used with test data19. For any relevant (out-of-sample) test date $t’=t’_1,…$ Compute the set of predictors chosen during the training phase, using financial observations up to the time $t’$ These predictors are the test feature vectors $x_{t’}$, $t’=t’_1,…$. Provide that set of predictors as an input test feature vector to the supervised machine learning algorithm to receive in output the estimated optimal portfolio weights $\hat{w}_{t’+1}$ over the (out-of-sample) future horizon These estimated optimal portfolio weights are the estimated labels $\hat{y}_{t’}$, $t’=t’_1,…$. Here, depending on the exact supervised machine learning algorithm, the estimated portfolio weights $\hat{w}_{t’+1}$ might not satisfy the portfolio constraints20 imposed in the training phase, in which case a post-processing phase would be required. The portfolio allocation framework of Chevalier et al.1 described above allows the algorithm to learn from past time series of in-sample optimal weights and to infer the best weights from variables such as past performance, risk, and proxies of the macro-economic outlook1. This contrasts with the standard practice of directly forecasting the input of a portfolio optimization algorithm, making that framework rather original. 
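The training and inference phases described above can be sketched end-to-end with a toy $k$-NN learner. The data below (random stand-ins for the predictors $X_t$ and the in-sample optimal weights $Y_t = w_{t+1}$) is purely illustrative of the mechanics, not the actual pipeline of Chevalier et al.:

```python
import numpy as np

rng = np.random.default_rng(42)
n_train, m, n_assets = 250, 4, 2

# Training phase: one pair (X_t, Y_t) per relevant training date t, where
# X_t holds the predictors computed with data up to t and Y_t = w_{t+1}
# holds the in-sample optimal future portfolio weights. Random stand-ins
# are used here in place of real financial observations.
X_train = rng.normal(size=(n_train, m))                   # feature vectors
Y_train = rng.dirichlet(np.ones(n_assets), size=n_train)  # long-only weights

# Inference phase: given the predictors x_{t'} computed at an out-of-sample
# test date t', estimate w_{t'+1} as the average of the in-sample optimal
# weights of the k training dates whose predictors are closest to x_{t'}.
def infer_weights(x_test, k=10):
    d = np.linalg.norm(X_train - x_test, axis=1)
    return Y_train[np.argsort(d)[:k]].mean(axis=0)

w_hat = infer_weights(rng.normal(size=m))
# A uniform average of long-only weights summing to one is itself
# long-only and sums to one, so no post-processing is needed here.
print(w_hat, w_hat.sum())
```

With non-uniform neighbor weights that are non-negative and sum to one, the same convexity argument applies to the estimated weights.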
In terms of empirical performances, Chevalier et al.1 finds that predicting the optimal weights directly instead of the traditional two step approach leads to more stable portfolios with statistically better risk-adjusted performance measures1 when using mean-variance optimization as the selected portfolio optimization algorithm and gradient boosting decision trees as the selected supervised machine learning algorithm21. Some of these risk-adjusted performance measures are displayed in Figure 3 in the case of 4 asset classes22, for the 3 horizons of predicted returns and the 3 risk aversion levels used in Chevalier et al.1. Figure 3. Performances of supervised portfolios v.s. direct mean-variance optimized portfolios, 4 asset classes. Source: Adapted from Chevalier et al. Notes: Additional information can be found in the follow-up paper Chevalier et al.23 and in a video of Thomas Raffinot for QuantMinds International. $k$-NN-based supervised portfolios Theoretically, the supervised machine learning model used in the portfolio allocation framework of Chevalier et al.1 is trained to learn the following model1: \[w_{t+1} = g_t \left(X_t \right) + \epsilon_{t+1}\] , where: $X_t $ is the feature vector made of the chosen set of predictors computed at time $t$ $w_{t+1}$ is the vector of optimal portfolio weights over the desired future horizon $t+1$ $g$ is an unknown function Because such a model describes a functional relationship compatible with a $k$-NN regression algorithm, it is reasonable to think about using that algorithm as the supervised machine learning algorithm in the above framework. Enter $k$-NN-based supervised portfolios, a portfolio allocation framework originally introduced in Varadi and Teed2 as follows: This naturally leads us down the path of creating algorithms that can learn from past data and evolve over time to change the method for creating portfolio allocations. 
The simplest and most intuitive machine-learning algorithm is the K-Nearest Neighbor method ($k$-NN) […, which] is a form of “case-based” reasoning. That is, it learns from examples that are similar to current situation by looking at the past [and says: “what happened historically when I saw patterns that are close to the current pattern?”]. It shares a lot in common with how human beings make decisions. When portfolio managers talk about having 20 years of experience, they are really saying that they have a large inventory of past “case studies” in memory to make superior decisions about the current environment. As a side note, Varadi and Teed2 is not the first paper to apply a $k$-NN regression algorithm to the problem of portfolio allocation, c.f. for example Gyorfi et al.24 in the setting of online portfolio selection, but Varadi and Teed2 is - to my knowledge - the first paper about the same “kind” of supervised portfolios as in Chevalier et al.1. A couple of practical advantages of $k$-NN-based supervised portfolios v.s. for example “gradient boosting decision trees”-based supervised portfolios as used in Chevalier et al.1 are: The simplicity of the training Since nearest neighbor methods are lazy learners, there is strictly speaking no real training phase. The simplicity of the tuning There can be no tuning at all if no “advanced” technique (automated features selection, distance learning…) is used. The guarantee that (convex) portfolio constraints learned during the training phase are satisfied during the test phase In $k$-NN regression25, the estimate for the label associated to a test point is a convex combination of the labels of that point’s nearest neighbors. As a consequence, the estimated portfolio weights $\hat{w}_{t’+1}$ are guaranteed25 to satisfy any learned convex portfolio constraints, thereby avoiding any post-processing that could degrade the “quality” of the estimated weights. 
The ease of interpretability Due to algorithm aversion, Chevalier et al.23 highlights the need to be able to transform a black box nonlinear predictive algorithm [like gradient boosting decision trees] into a simple combination of rules23 in order to make it interpretable for humans. With a $k$-NN regression algorithm, which is one of the most transparent supervised machine learning algorithms in existence, that step is probably not useful26. In terms of empirical performances, Varadi and Teed2 concludes that $k$-NN-based supervised portfolios consistently outperformed [vanilla maximum Sharpe ratio portfolios] on both heterogeneous and homogeneous data sets on a risk-adjusted basis2, with the $k$-NN-based approach [exhibiting] a Sharpe ratio [… up to] over 30% higher than [the direct maximum Sharpe ratio approach]2. Average performance measures for the $k$-NN-based supervised portfolios in Varadi and Teed2 are reported in Figure 4. Figure 4. Performances of $k$-NN-based supervised portfolios v.s. direct mean-variance optimized portfolios. Source: Adapted from Varadi and Teed. Implementing $k$-NN-based supervised portfolios Features selection Biau and Devroye9 describes features selection as: […] the process of choosing relevant components of the [feature] vector $X$ for use in model construction. There are many potential benefits of such an operation: facilitating data visualization and data understanding, reducing the measurement and storage requirements, decreasing training and utilization times, and defying the curse of dimensionality to improve prediction performance. 
, and provides some rules of thumb that should be followed9: Noisy measurements, that is, components that are independent of $Y$, should be avoided9, especially because nearest neighbor methods are extremely sensitive to the features used27 Adding a component that is a function of other components is useless9 Beyond these generic rules, and although it has been an active research area in the statistics, machine learning, and data mining communities1, features selection is unfortunately strongly problem-dependent. In the context of supervised portfolios, Chevalier et al.1 and Varadi and Teed2 both propose to use: Past asset returns over different horizons28 so as to assess momentum and reversals1 Past asset volatilities over different horizons28, to approximate asset-specific risk1 Varadi and Teed2 additionally proposes to include past asset correlations over different horizons28 to ensure that [the] $k$-NN algorithm [doesn’t] have access to any information that the [direct mean-variance optimization] [doesn’t] have, but merely use it differently2. Chevalier et al.1, building on the stocks asset pricing literature, does not suggest including returns-based indicators other than past asset returns and volatilities but suggests instead including various macroeconomic indicators (yield curve, VIX…). Features scaling Typical distance metrics29 used with nearest neighbor methods like the Euclidean distance are said to be scale variant, meaning that the definition of a nearest neighbor is influenced by the relative and absolute scale of the different features. 
For example, when using the Euclidean distance with features such as a person’s height and a person’s age: The height feature disproportionately influences the definition of a neighbor if the height feature is measured in millimeters and age in years The age feature disproportionately influences the definition of a neighbor if the height feature is measured in meters and age in days For this reason, features are usually scaled to a similar range before being provided in input to a $k$-NN algorithm30, which is a pre-processing step called features scaling. A couple of techniques for features scaling are described in Arora et al.31: Min-max scaling, which scales all the values of a feature $\left( X_i \right)_j$, $j \in \{ 1,…,m \}$, $i=1..n$ to a given interval - like $[0,1]$ -, based on the minimum and the maximum values of that feature: \[\left( X_i \right)_j' = \frac{\left( X_i \right)_j - \min_{l=1..n} \left( X_l \right)_j }{\max_{l=1..n} \left( X_l \right)_j - \min_{l=1..n} \left( X_l \right)_j }, i=1..n\] Standardization, also called z-score normalization, which transforms all the values of a feature $\left( X_i \right)_j$, $j \in \{ 1,…,m \}$ , $i=1..n$ into values that are approximately standard normally distributed: \[\left( X_i \right)_j' = \frac{ \left( X_i \right)_j - \overline{\left( X_i \right)_j}}{ \sigma_{\left( X_i \right)_j} }\] In the context of supervised portfolios, additional techniques are described in Chevalier et al.1: Quantile normal transformation for a “time series”-like feature, which standardizes the time-series into quantile and then map the values to a normal distribution1 It is important to note that at any given date, the quantiles should be computed using information up to that date only to avoid forward looking leakage1. In addition, a lookback window over which to compute the quantiles should be chosen, with possible impacts on the performances of the supervised machine learning algorithm. 
Cross sectional normalization for a regular feature, which scales the cross sectional values between 0 and 1 using the empirical cumulative distribution function1 At any given date, this normalization can be performed fully in the cross-section at that date if there are enough assets or in the cross-section at that date using information up to that date to compute the empirical cumulative distribution function. In the latter case, c.f. the previous point. Hyperbolic tangent function ($\tanh$) scaling for labels, in order to center [them] and make them more comparable by taming outliers1: \[Y' = 0.5 \tanh{\left( 0.01 \frac{Y − \overline{Y}}{ \sigma_Y } \right) }\] Naturally, the reverse transformation is performed after the prediction to transform back the labels into its original values1. Finally, in the specific context of $k$-NN-based supervised portfolios, 2 additional techniques are described in Varadi and Teed2, which are variations of the techniques of Chevalier et al.1. Distance metric selection As already mentioned in the previous sub-section, the distance metric used with a nearest neighbor method influences the definition of a nearest neighbor due to its scale variant or scale invariant nature. But that’s not all, because different distance metrics behave differently with regards to outliers, to noise, to the dimension of the feature space, etc. On top of that, the chosen distance metric is sometimes not a proper metric6… So, what to do in the specific context of $k$-NN-based supervised portfolios? From the empirical results in Varadi and Teed2, the Euclidean distance seems to be a good choice as long as the chosen predictors are properly scaled. From the empirical results later in this blog post, a little-known distance metric called the Hassanat distance32 also seems to be a good choice and additionally does not require33 the chosen predictors to be scaled because it is scale invariant34. 
That distance - noted $HasD(x,y)$ - is defined between two vectors $x = \left(x_1,…,x_m\right)$ and $y = \left(y_1,…,y_m\right)$ as follows: \[HasD(x,y) = \sum_{i=1}^m D(x_i,y_i)\] , with \[D(x_i,y_i) = \begin{cases} 1 - \frac{1 + \min(x_i,y_i)}{1 + \max(x_i,y_i)}, &amp;amp;\text{if } \min(x_i,y_i) \geq 0 \\ 1 - \frac{1}{1 + \max(x_i,y_i) - \min(x_i,y_i) }, &amp;amp;\text{if } \min(x_i,y_i) &amp;lt; 0 \end{cases}\] Figure 5 illustrates the 1-dimensional Hassanat distance $HasD(0,n)$ with $n \in [-10,10]$. Figure 5. Representation of the 1-dimensional Hassanat distance between the points 0 and n. Source: Abu Alfeilat et al. As a side note, the Hassanat distance has been empirically demonstrated to perform the best when applied on most data sets comparing with the other tested distances35 in Abu Alfeilat et al.35, which compares the performances of 54 distance metrics used in $k$-NN classification. How to select the number of nearest neighbors? Together with the distance metric $d$, the number of nearest neighbors $k$ is the other hyperparameter that has to be selected in nearest neighbor methods. Varadi and Teed2 explains: The choice of the number of nearest matches (or neighbors) is the $k$ in $k$-NN. This is an important variable that allows [the user] the ability to trade-off accuracy versus reliability. Choosing a value for $k$ that is too high will lead to matches that are not appropriate to the current case. Choosing a value that is too low will lead to exact matches but poor generalizability and high sensitivity to noise. The optimal value for K that maximizes out-of-sample forecast accuracy will vary depending on the data and the features chosen. In practice, the number of nearest neighbors $k$ […] [is] usually selected via cross-validation or more simply data splitting8 and the selected [value] minimizes an objective function which is often the Root Mean Square Error (RMSE) or sometimes the Mean Absolute Error (MAE)36. 
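Coming back to the Hassanat distance defined earlier in this sub-section, its definition translates almost line by line into Python (a toy transcription, not Portfolio Optimizer’s implementation):

```python
def hassanat_distance(x, y):
    """Hassanat distance HasD(x, y) = sum over i of D(x_i, y_i), where
    each coordinate contributes a bounded amount in [0, 1) regardless of
    its scale, which is why no prior features scaling is required."""
    d = 0.0
    for xi, yi in zip(x, y):
        lo, hi = min(xi, yi), max(xi, yi)
        if lo >= 0:
            d += 1.0 - (1.0 + lo) / (1.0 + hi)
        else:
            d += 1.0 - 1.0 / (1.0 + hi - lo)
    return d

# 1-dimensional illustration, matching Figure 5
print(hassanat_distance([0.0], [0.0]))    # identical points -> 0.0
print(hassanat_distance([0.0], [10.0]))   # 1 - 1/11, i.e. ~0.909
print(hassanat_distance([0.0], [-10.0]))  # also 1 - 1/11, by symmetry
```
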
That being said, Guegan and Huck36 cautions about that practice by highlighting that: The estimation of $k$ via in sample predictions leads to choose high values, near or on the border of one has tabulated because the RMSE is a decreasing function of the number of neighbors36 A high value for the number of nearest neighbors is an erroneous usage of the [method] because the neighbors are thus not near the pattern they should mimic36, leading to (useless) forecasts very close to the mean of the sample36. Another direction is to adaptively choose the number of nearest neighbors $k$ […] depending on the test feature vector8. For example, Anava and Levy37 proposes solving an optimization problem to adaptively choose what $k$ to use for [a given feature vector] in an approach called $k^*$-NN8. In the specific context of $k$-NN-based supervised portfolios, and again to avoid choosing an explicit number of nearest neighbors, Varadi and Teed2 suggests to select a range38 of $k$’s to make [the] selection base more robust to potential changes in an “optimal” $k$ selection2. Surprisingly, it turns out that this method is an ensemble method similar in spirit to the method described in Hassanat et al.39 for $k$-NN classification, which consists in using a base $k$-NN classifier with $k=1,2,…,\lfloor \sqrt{n} \rfloor$ and to combine the $\lfloor \sqrt{n} \rfloor$ classification results using inverse logarithmic weights. Misc. remarks Importance of training dataset diversity Asymptotic convergence results for the $k$-NN regression algorithm guarantee that by increasing the amount of [training] data, […] the error probability gets arbitrarily close to the optimum for every training sequence9. But the amount of data available for training $k$-NN-based supervised portfolios is not infinite and might even in some cases be extremely limited40. 
In that case, there is a high risk that the training data is “unevenly balanced” in the feature space, a situation illustrated in Figure 6 in the case of a univariate feature whose underlying distribution is Gaussian. Figure 6. Univariate $k$-NN regression with low accuracy, Gaussian feature distribution, far away training data. Source: Chen and Shah. From Figure 6, it is clear that such a lack of training data - or more precisely, such a lack of diversity in the training data - would force the $k$-NN regression algorithm to use far away nearest neighbors, which would severely degrade the quality of the forecasted portfolio weights. So, particular attention must be paid to the size and the diversity of the training dataset when using $k$-NN-based supervised portfolios, with for example ad-hoc procedures used whenever needed to simulate past asset returns for assets without return histories (here) or to extend return histories for assets with shorter return histories than others (here). Avoiding the curse of dimensionality The number of features selected by Varadi and Teed2 grows quadratically with the number of assets. At some point41, the underlying $k$-NN regression algorithm will then inevitably face issues due to: Distance concentration, which is the tendency of distances between all pairs of points in high-dimensional data to become almost equal42 Poor discrimination of the nearest and farthest points for a given test point, which is an issue on top of the distance concentration problem, c.f. 
Beyer et al.43 Hubness42, defined as the emergence of points called hubs which appear overly similar to many others … In addition, the higher the number of features selected, the more training data is required to learn enough combinations of these different features, which further compounds the problem mentioned in the previous sub-section… All in all, that approach is not scalable but fortunately, a solution is also proposed in Varadi and Teed2: […] to explore multi-asset portfolios [without introducing the problem of dimensionality with too-large a feature space], we took the average weight of each security from a single-pair run, and averaged them across all pair runs. While this proposal may look like an ad-hoc workaround, it actually corresponds to an ensemble method that has been empirically shown to be effective for $k$-NN classification in high dimension in both: Domeniconi and Yan44, with a deterministic selection of features as in Varadi and Teed2 Bay45, with a random27 selection of features The underlying idea of that ensemble method is to exploit [the] instability of $k$-NN classifiers with respect to different choices of features to generate an effective and diverse set of NN classifiers with possibly uncorrelated errors44. Implementations Implementation in Portfolio Optimizer Portfolio Optimizer supports $k$-NN-based supervised portfolios through the endpoint /portfolios/optimization/supervised/nearest-neighbors-based. 
This endpoint supports 2 different distance metrics: The Euclidean distance metric The Hassanat distance metric (default) As for the selection of the number of nearest neighbors, this endpoint supports: A manually-defined number of nearest neighbors A dynamically-determined number of nearest neighbors together with their individual weights through: The $k$-NN ensemble method of Hassanat et al.39 A proprietary variation of the $k^*$-NN method of Anava and Levy37 (default) Implementation elsewhere Chevalier et al.1 kindly provides Python code to experiment with “gradient boosting decision trees”-based supervised portfolios. Example of usage - Learning maximum Sharpe ratio portfolios Because most portfolio allocation decisions for active portfolio managers revolve around the optimal allocation between stocks and bonds2, I propose to reproduce the results of Varadi and Teed2 in the case of a 2-asset class portfolio made of: U.S. equities, represented by the SPY ETF U.S. long-term Treasury bonds, represented by the TLT ETF Methodology Varadi and Teed2 follows the general procedure of Chevalier et al.1 to train a $k$-NN-based supervised portfolio allocation algorithm for learning portfolio weights maximizing the Sharpe ratio. 
For this, and without entering into the details: The selected features are asset returns, standard deviations and correlations over different28 past lookback periods, scaled through a specific normal distribution standardization The selected distance metric is the standard Euclidean distance The selected number of nearest neighbors is not a single value but a range of values related to the size of the training dataset38 The relevant initial training dates are the 2000 daily dates present in Varadi and Teed2’s dataset from 4/13/1976 minus 2000 days to 4/12/1976 The relevant subsequent training dates and test dates are all the daily dates present in Varadi and Teed2’s dataset from 4/13/1976 to 12/31/13 To be noted that the training data is used in a rolling window manner over a 2000-day lookback. The future horizon over which maximum Sharpe ratio portfolio weights are learned during the training phase and evaluated during the test phase is a 20-day horizon On my side: The selected features will be: Past 12-month asset arithmetic returns, cross-sectionally normalized using the procedure described in Almgren and Neil46 Future aggregated asset covariances forecasted over the next month using an exponentially weighted moving average covariance matrix forecasting model with daily squared (close-to-close) returns The selected distance metric will be the Hassanat distance This avoids the need for further features scaling. 
The selected number of nearest neighbors will be: 1 10 Dynamically determined with their individual weights by: The $k$-NN ensemble method of Hassanat et al.39 The $k^*$-NN method of Anava and Levy37 The relevant initial training dates will be all month-end dates present in a SPY/TLT ETFs-like training dataset from 1st January 1979 to 30th November 2003 Due to the relatively recent inception dates of both the SPY ETF (22nd January 1993) and the TLT ETF (22nd July 2002), it is required to use proxies to extend the returns history of these assets: The daily U.S. market returns $Mkt$ provided in the Fama and French data library, as a proxy for the SPY ETF daily returns The simulated daily returns associated to the daily FRED 30-Year Treasury Constant Maturity Rates, as a proxy for the TLT ETF daily returns With these, the earliest date for which daily SPY/TLT ETFs-like returns are available is 16th February 1977; adding 1 year of data for computing the past 12-month returns gives 16th February 1978; rounded to 1st January 1979. The relevant subsequent training dates and test dates will be all month-end dates present in the SPY/TLT ETFs test dataset from 1st January 2004 to 28th February 202547 The earliest date for which daily SPY/TLT ETFs returns are available is 29th July 2002; adding 1 year of data for computing the past 12-month returns gives 29th July 2003; rounded to 1st January 2004. To be noted that the training data is used in an expanding window manner. As a consequence, the training dataset is made of 299 data points on 1st January 2004, expanding up to 552 data points on 28th February 2025 when the last forecast is made. This is in stark contrast with Varadi and Teed2’s training dataset which 1) contains 2000 data points and 2) is not expanding but is being rolled forward to keep the algorithm more robust to market changes in feature relevance2. 
As mentioned in a previous section, such a difference in quantity and in “local” diversity of the training dataset might impact my results v.s. those of Varadi and Teed2. The future horizon over which maximum Sharpe ratio portfolio weights are learned during the training phase and evaluated during the test phase is a 1-month horizon at daily level The risk free rate is set to 0% when computing maximum Sharpe ratio portfolio weights The cash portion of the different SPY/TLT portfolios - if any - is allocated to U.S. short-term Treasury bonds, represented by the SHY ETF Results Figure 7 compares the standard direct approach for maximizing the Sharpe ratio to the $k$-NN-based supervised portfolios approach, with the 4 choices of nearest neighbors proposed in the previous sub-section. In both cases, like in Varadi and Teed2, the same features are used in input of the two algorithms. Figure 7. MSR portfolio v.s. $k$-NN-based learned MSR portfolios, SPY/TLT ETFs, 1st January 2004 - 31st March 2025. Summary statistics: 
Portfolio | CAGR | Average Exposure | Annualized Sharpe Ratio | Maximum (Monthly) Drawdown
Maximum Sharpe ratio (MSR) | ~5.6% | ~61% | ~0.73 | ~14.4%
$k$-NN learned MSR, $k=1$ | ~6.4% | ~49% | ~0.86 | ~14.4%
$k$-NN learned MSR, $k=10$ | ~5.2% | ~51% | ~0.99 | ~15.8%
$k$-NN learned MSR, $k=kEnsemble$ | ~5.4% | ~51% | ~1.01 | ~15.1%
$k$-NN learned MSR, $k=k^*$ | ~6.8% | ~49% | ~1.00 | ~14.0%
Comments A couple of comments are in order: Consistent with Varadi and Teed2, the results demonstrate that the [$k$-NN-based supervised portfolio allocation] approach tends to outperform [the direct] MVO portfolio allocation [approach] on a risk-adjusted basis2, with a Sharpe ratio ~18%-38% higher. This is quite interesting to highlight since the objective of the direct approach is supposed to be the maximization of the portfolio Sharpe ratio! The average exposure of the MSR portfolio is ~61% v.s. 
a relatively much lower exposure of ~50% for all the $k$-NN learned MSR portfolios. The Sharpe ratio of all the $k$-NN learned MSR portfolios being higher than that of the MSR portfolio, it implies that the changes in exposure are pretty well “timed”.
- The $k$-NN learned MSR portfolios with $k=10$ and $k=kEnsemble$ are nearly identical. This is confirmed by examining the underlying asset weights (not shown here). The $k$-NN ensemble portfolio has the advantage of not requiring a specific value of $k$ to be chosen, though, and should definitely be preferred.
- The $k$-NN learned MSR portfolios with $k=1$ and $k=k^*$ are close in terms of raw performances, but not in terms of Sharpe ratio. A closer look (not detailed here) reveals that this is because the $k$-NN learned MSR portfolio with $k=k^*$ regularly selects only 1 nearest neighbor when the other neighbors are too “far away”, but also regularly selects many more neighbors when the other neighbors are “close enough”. I interpret this as an empirical demonstration of the ability of the $k^*$-NN method of Anava and Levy[37] to adaptively choose the number of nearest neighbors $k$ […] depending on the test feature vector[8].
- The maximum drawdowns are comparable across all portfolios. This shows that the $k$-NN learned MSR portfolios, despite their attractive risk-adjusted performances, are not able to magically avoid “dramatic” events. Another layer of risk management, better return predictors, or both, is probably needed for that.
- The winner of this horse race is the $k$-NN learned MSR portfolio with $k=k^*$, but this comes at a price in terms of turnover vs. the $k$-NN learned MSR portfolio with $k=kEnsemble$.

Also consistent with Varadi and Teed[2], the asset weights of the $k$-NN learned MSR portfolio with $k=kEnsemble$ (and with $k=10$) are relatively stable and on average similar to an equal weight portfolio, while those of the MSR portfolio show considerable noise and turnover[2].
This is visible on the portfolio transition maps displayed in Figures 8 and 9.

Figure 8. $k$-NN-based learned MSR portfolio, $k=kEnsemble$, SPY/TLT/SHY ETFs allocations through time, 1st January 2004 - 31st March 2025.

Figure 9. MSR portfolio, SPY/TLT/SHY allocations through time, 1st January 2004 - 31st March 2025.

For Varadi and Teed[2], this demonstrates the general uncertainty of the portfolio indicator inputs in aggregate[2] and that the $k$-NN learned MSR portfolio with $k=kEnsemble$ manages to dynamically balance this uncertainty over time and shift more towards a probabilistic allocation that did not overweight or over-react to poor information[2]. This statement is slightly less applicable to the $k$-NN learned MSR portfolio with $k=k^*$, because its better raw performances are explained by a more aggressive allocation, resulting in a much higher turnover, as can be seen by comparing Figure 8 to Figure 10.

Figure 10. $k$-NN-based learned MSR portfolio, $k=k^*$, SPY/TLT/SHY ETFs allocations through time, 1st January 2004 - 31st March 2025.

Conclusion

Exactly like in Varadi and Teed[2], and despite the differences in implementation and in the size of the training dataset[48]:

- The results of this section show that a traditional mean-variance/Markowitz/MPT framework under-performs [a $k$-NN-based supervised portfolio allocation] framework in terms of maximizing the Sharpe ratio[2]
- The data further implies that traditional MPT makes far too many trades and takes on too many extreme positions as a function of how it is supposed to generate portfolio weights[2]

Varadi and Teed[2] provide the following explanation: This occurs because the inputs - especially the returns - are very noisy and may also demonstrate non-linear or counter-intuitive relationships. In contrast, by learning how the inputs map historically to optimal portfolios at the asset level, the resulting [$k$-NN-based supervised portfolios] allocations drift in a more stable manner over time.
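The core forecasting step of such a $k$-NN-based supervised portfolio can be sketched as follows (a minimal sketch, assuming Euclidean distance, uniform neighbor weights and pre-scaled features; function and variable names are illustrative, and the in-sample optimal weights are assumed to have been computed beforehand):

```python
import numpy as np

def knn_learned_weights(train_features, train_weights, test_feature, k=10):
    """Forecast portfolio weights at a test date as the average of the
    in-sample optimal (e.g. maximum Sharpe ratio) weights associated with
    the k historical dates whose feature vectors are nearest."""
    # Euclidean distances from the test feature vector to all training ones
    d = np.linalg.norm(train_features - test_feature, axis=1)
    nearest = np.argsort(d)[:k]
    # Average the optimal weights of the k nearest dates, renormalized
    w = train_weights[nearest].mean(axis=0)
    return w / w.sum()
```

With $k=1$ this reduces to replaying the optimal weights of the single most similar historical date, while larger $k$'s smooth the forecast over several similar market configurations.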
Final conclusion

Supervised portfolios as introduced in Chevalier et al.[1] are able to learn from past time series of in-sample optimal weights[1] and to infer the best weights from variables such as past performance, risk, and proxies of the macro-economic outlook[1]. In this blog post, I empirically demonstrated that this capability allows one of their simplest embodiments - $k$-NN-based supervised portfolios - to outperform a traditional mean-variance framework that seeks to maximize the Sharpe ratio of a portfolio, which independently confirms the prior results of Varadi and Teed[2].

To keep discovering non-standard portfolio allocation frameworks, feel free to connect with me on LinkedIn or to follow me on Twitter.

–

- See Chevalier, G., Coqueret, G., &amp;amp; Raffinot, T. (2022). Supervised portfolios. Quantitative Finance, 22(12), 2275–2295.
- See David Varadi, Jason Teed, Adaptive Portfolio Allocations, NAAIM paper.
- Varadi and Teed[2] has been submitted to the 2014 NAAIM annual white paper competition known as the NAAIM Founders Award.
- To be noted that the data points could belong to a more generic space than $\mathbb{R}^m \times \mathbb{R}$.
- $\mathbb{R}^m$ is usually called the feature space.
- In practice, $d$ might not necessarily be a proper metric; for example, it might not satisfy the triangle inequality property, like the cosine “distance”.
- For a historical perspective on the $k$-NN algorithm going beyond the usual technical report from Fix and Hodges[49] and the seminal paper from Cover and Hart[10], the interested reader is referred to Chen and Shah[12], which mentions that the $k$-NN classification algorithm was already described in a text from the early 11th century.
- See George H. Chen, Devavrat Shah, Explaining the Success of Nearest Neighbor Methods in Prediction, now, 2018.
- See Gerard Biau, Luc Devroye, Lectures on the Nearest Neighbor Method, Springer Series in the Data Sciences.
- See Cover, T. M. and P. E. Hart (1967). “Nearest neighbor pattern classification”. IEEE Transactions on Information Theory.
- Although $n$ must be sufficiently large in order for there to exist a $k$ that satisfies the conditions[12] required by the main theorem of Jiang[12].
- See Jiang, H. (2019). Non-Asymptotic Uniform Rates of Consistency for $k$-NN Regression. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 3999-4006.
- See S. Sun and R. Huang, “An adaptive k-nearest neighbor algorithm,” 2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery, Yantai, China, 2010, pp. 91-94.
- See for example P.Y. Simard, Y. LeCun and J. Denker, “Efficient pattern recognition using a new transformation distance,” in Advances in Neural Information Processing Systems, vol. 6, 1993, pp. 50-58, in which the Euclidean distance between images of handwritten digits is replaced by an ad hoc distance invariant with respect to geometric transformations of such images (rotation, translation, scaling, etc.).
- See Har-Peled, S., P. Indyk, and R. Motwani (2012). “Approximate Nearest Neighbor: Towards Removing the Curse of Dimensionality.” Theory of Computing.
- See Kleinberg, J. M. (1997). “Two algorithms for nearest-neighbor search in high dimensions”. In: Symposium on Theory of Computing.
- For example, the end of each month for learning a monthly asset allocation strategy.
- A day, a week, a month, etc.
- Also called inference data, that is, data not “seen” during the training phase.
- Like budget constraints, asset weights constraints, asset group constraints, portfolio exposure constraints, etc.
- Chevalier et al.[1] notes that these empirical results still hold when replacing boosted trees by simple regressions[1].
- Developed equities, emerging equities, global corporate bonds, global government bonds.
- See Chevalier, Guillaume, Coqueret, Guillaume, Raffinot, Thomas, Interpretable Supervised Portfolios, The Journal of Financial Data Science, Spring 2024, 6 (2), 10-34.
- See L. Gyorfi, F. Udina, and H. Walk. Nonparametric nearest neighbor based empirical portfolio selection strategies. Statistics &amp;amp; Decisions, International Mathematical Journal for Stochastic Methods and Models, 26(2):145–157, 2008.
- Unless a very specific variation of $k$-NN regression is used.
- At least from an algorithm aversion perspective. Nevertheless, there can be other benefits, c.f. Chevalier et al.[23].
- Similar to random subspace optimization.
- 1 month, 2 months, 3 months, 6 months and 12 months.
- See Avivit Levy, B. Riva Shalom, Michal Chalamish, A Guide to Similarity Measures, arXiv, for a very long list of distance metrics.
- Or more generally, to most supervised machine learning algorithms.
- See Ishan Arora, Namit Khanduja and Mayank Bansal, Effect of Distance Metric and Feature Scaling on KNN Algorithm while Classifying X-rays, RIF, 2022.
- See Hassanat, A.B., 2014. Dimensionality Invariant Similarity Measure. Journal of American Science, 10(8), pp.221-26.
- To be noted that feature scaling might still be performed to try to improve the performances of the $k$-NN regression algorithm.
- Other properties of the Hassanat distance are for example robustness to noise and linear growth with the dimension of the feature space, c.f. Abu Alfeilat et al.[35].
- See Abu Alfeilat HA, Hassanat ABA, Lasassmeh O, Tarawneh AS, Alhasanat MB, Eyal Salman HS, Prasath VBS. Effects of Distance Measure Choice on K-Nearest Neighbor Classifier Performance: A Review. Big Data. 2019 Dec;7(4):221-248.
- See Guegan, D. and Huck, N. (2005).
On the Use of Nearest Neighbors in Finance. Finance, 26(2), 67-86.
- See Oren Anava, Kfir Levy, k*-Nearest Neighbors: From Global to Local, Advances in Neural Information Processing Systems 29 (NIPS 2016).
- In more detail, Varadi and Teed[2] chooses the $k$’s in percentages of the size of the training space - 5%, 10%, 15% and 20% - resulting essentially in a weighted average of the top instances[2].
- See Hassanat, A.B., Mohammad Ali Abbadi, Ghada Awad Altarawneh, Ahmad Ali Alhasanat, 2014. Solving the Problem of the K Parameter in the KNN Classifier Using an Ensemble Learning Approach. International Journal of Computer Science and Information Security, 12(8), pp.33-39.
- For example, due to the limited price history of some assets or due to the length of the desired horizon over which optimal portfolio weights need to be computed.
- Possibly for as few as 10-15 dimensions[43]!
- See Radovanovic, Milos, Nanopoulos, Alexandros and Ivanovic, Mirjana, Hubs in Space: Popular Nearest Neighbors in High-Dimensional Data, Journal of Machine Learning Research 11 (2010) 2487-2531.
- See Beyer, K., Goldstein, J., Ramakrishnan, R., Shaft, U. (1999). When Is “Nearest Neighbor” Meaningful?. In: Beeri, C., Buneman, P. (eds) Database Theory — ICDT’99. ICDT 1999. Lecture Notes in Computer Science, vol 1540. Springer, Berlin, Heidelberg.
- See Domeniconi, C., &amp;amp; Yan, B. (2004). Nearest neighbor ensemble. In Pattern recognition, international conference on, Vol. 1 (pp. 228–231). Los Alamitos, CA, USA: IEEE Computer Society.
- See Bay S.D. Nearest neighbor classification from multiple feature subsets. Intelligent Data Analysis, 3 (1999), pp. 191-209.
- See Almgren, Robert and Chriss, Neil A., Optimal Portfolios from Ordering Information (December 2004).
- (Adjusted) daily prices have been retrieved using Tiingo.
- This empirically confirms that, despite their dependency on the size of the training dataset, nearest neighbor methods can learn from a small set of examples[45].
- See Fix, E. and Hodges, J.L. “Discriminatory analysis, nonparametric discrimination: Consistency properties”. Technical report, USAF School of Aviation Medicine.</summary></entry><entry><title type="html">Correlation-Based Clustering: Spectral Clustering Methods</title><link href="https://portfoliooptimizer.io/blog/correlation-based-clustering-spectral-clustering-methods/" rel="alternate" type="text/html" title="Correlation-Based Clustering: Spectral Clustering Methods" /><published>2025-05-01T00:00:00-05:00</published><updated>2025-05-01T00:00:00-05:00</updated><id>https://portfoliooptimizer.io/blog/correlation-based-clustering-spectral-clustering-methods</id><content type="html" xml:base="https://portfoliooptimizer.io/blog/correlation-based-clustering-spectral-clustering-methods/">&lt;p&gt;&lt;a href=&quot;https://en.wikipedia.org/wiki/Cluster_analysis&quot;&gt;Clustering&lt;/a&gt; consists in &lt;em&gt;trying to identify groups of “similar behavior”&lt;/em&gt;&lt;sup id=&quot;fnref:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; - called clusters - from a dataset, according to some chosen characteristics.&lt;/p&gt;

&lt;p&gt;An example of such a characteristic in finance is the correlation coefficient between two time series of asset returns, whose usage to partition a universe of assets into groups of “close” and “distant” assets 
thanks to a &lt;a href=&quot;https://en.wikipedia.org/wiki/Hierarchical_clustering&quot;&gt;hierarchical clustering method&lt;/a&gt; was originally&lt;sup id=&quot;fnref:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; proposed in Mantegna&lt;sup id=&quot;fnref:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In this blog post, I will describe two correlation-based clustering methods belonging to the family of &lt;em&gt;&lt;a href=&quot;https://en.wikipedia.org/wiki/Spectral_clustering&quot;&gt;spectral clustering methods&lt;/a&gt;&lt;/em&gt;: the Blockbuster method introduced in Brownlees et al.&lt;sup id=&quot;fnref:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt; 
and the SPONGE method introduced in Cucuringu et al.&lt;sup id=&quot;fnref:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;As examples of usage, I will discuss 1) how to automatically group U.S. stocks together without relying on external information like industry classification and 2) how to identify risk-on and risk-off assets within a U.S.-centric universe of assets.&lt;/p&gt;

&lt;h2 id=&quot;mathematical-preliminaries&quot;&gt;Mathematical preliminaries&lt;/h2&gt;

&lt;p&gt;Let $x_1$, …, $x_n$ be $n$ points&lt;sup id=&quot;fnref:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:10&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt; in $\mathbb{R}^m, m \geq 1$ to be partitioned into $k \geq 2$ subsets.&lt;/p&gt;

&lt;h3 id=&quot;spectral-clustering&quot;&gt;Spectral clustering&lt;/h3&gt;

&lt;p&gt;Spectral clustering, like other approaches to clustering - geometric approaches such as &lt;a href=&quot;https://en.wikipedia.org/wiki/K-means_clustering&quot;&gt;$k$-means &lt;/a&gt;, density-based approaches such as &lt;a href=&quot;https://fr.wikipedia.org/wiki/DBSCAN&quot;&gt;DBSCAN&lt;/a&gt;… - initially relies on pairwise similarities 
$s(x_i, x_j)$ between points $i,j=1..n$ with $s$ &lt;em&gt;some similarity function which is symmetric and non-negative&lt;/em&gt;&lt;sup id=&quot;fnref:1:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Once the corresponding similarity matrix $S_{ij} = s(x_i, x_j)$, $i,j=1…n$, has been computed, a spectral clustering method then usually follows a three-step process:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;Compute an affinity matrix $A \in \mathbb{R}^{n \times n}$ from the similarity matrix $S$&lt;/p&gt;

    &lt;p&gt;The affinity matrix $A$ corresponds to the &lt;a href=&quot;https://en.wikipedia.org/wiki/Adjacency_matrix&quot;&gt;adjacency matrix&lt;/a&gt; of an underlying &lt;a href=&quot;https://en.wikipedia.org/wiki/Graph_(discrete_mathematics)&quot;&gt;graph&lt;/a&gt; whose vertices represent the points $x_1$, …, $x_n$ and whose edges &lt;em&gt;model the local&lt;sup id=&quot;fnref:54&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:54&quot; class=&quot;footnote&quot;&gt;7&lt;/a&gt;&lt;/sup&gt; neighborhood relationships between the data points&lt;/em&gt;&lt;sup id=&quot;fnref:1:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Compute a matrix $L \in \mathbb{R}^{n \times n}$ derived from the affinity matrix $A$&lt;/p&gt;

    &lt;p&gt;Due to the relationship between spectral clustering and graph theory&lt;sup id=&quot;fnref:1:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;, the matrix $L$ is typically called &lt;a href=&quot;https://en.wikipedia.org/wiki/Laplacian_matrix&quot;&gt;a Laplacian matrix&lt;/a&gt;.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Compute a matrix $Y \in \mathbb{R}^{n \times k}$ derived from the eigenvectors corresponding to the $k$ smallest (or sometimes largest) eigenvalues of the matrix $L$ and cluster its $n$ rows $y_1$, …, $y_n$ using the $k$-means algorithm&lt;/p&gt;

    &lt;p&gt;The result of that clustering represents the desired clustering of the original $n$ points $x_1$, …, $x_n$.&lt;/p&gt;

    &lt;p&gt;It should be emphasized that &lt;em&gt;there is nothing principled about using the $k$-means algorithm in this step&lt;/em&gt;&lt;sup id=&quot;fnref:1:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;, c.f. for example Huang et al.&lt;sup id=&quot;fnref:33&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:33&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt; in which spectral rotations are used instead of $k$-means, but &lt;em&gt;one can argue that at least the Euclidean distance between the [rows of the matrix $Y$] is a meaningful quantity to look at&lt;/em&gt;&lt;sup id=&quot;fnref:1:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;
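&lt;p&gt;As an illustration, the three-step process above can be sketched in Python (a minimal sketch only, assuming one possible set of choices: a Gaussian similarity, the symmetric normalized Laplacian, and a plain farthest-point-initialized $k$-means; function and parameter names are illustrative):&lt;/p&gt;

```python
import numpy as np

def spectral_clustering(X, k, sigma=1.0, n_iter=50):
    # Step 0: pairwise Gaussian similarities s(x_i, x_j)
    sq = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
    # Step 1: affinity matrix A, with a zeroed diagonal
    A = np.exp(-sq / (2.0 * sigma ** 2))
    np.fill_diagonal(A, 0.0)
    # Step 2: symmetric normalized Laplacian L = I - D^(-1/2) A D^(-1/2)
    d_inv_sqrt = np.diag(1.0 / np.sqrt(A.sum(axis=1)))
    L = np.eye(len(X)) - d_inv_sqrt @ A @ d_inv_sqrt
    # Step 3: embed the points with the eigenvectors associated with the k
    # smallest eigenvalues of L, normalize the rows, and run k-means on them
    _, vecs = np.linalg.eigh(L)          # eigh returns ascending eigenvalues
    Y = vecs[:, :k]
    Y = Y / np.linalg.norm(Y, axis=1, keepdims=True)
    centers = [Y[0]]                     # farthest-point initialization
    for _ in range(k - 1):
        dist = ((Y[:, None] - np.array(centers)[None]) ** 2).sum(-1).min(axis=1)
        centers.append(Y[np.argmax(dist)])
    centers = np.array(centers)
    for _ in range(n_iter):              # standard k-means iterations
        labels = np.argmin(((Y[:, None] - centers[None]) ** 2).sum(-1), axis=1)
        centers = np.array([Y[labels == j].mean(axis=0) if np.any(labels == j)
                            else centers[j] for j in range(k)])
    return labels
```

&lt;p&gt;With a suitable $\sigma$, well-separated groups in the affinity graph become nearly orthogonal directions in the embedding, which is why a simple $k$-means suffices in the last step.&lt;/p&gt;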

&lt;h3 id=&quot;the-ng-jordan-weiss-spectral-clustering-method&quot;&gt;The Ng-Jordan-Weiss spectral clustering method&lt;/h3&gt;

&lt;p&gt;Different ways of computing the affinity matrix $A$, the Laplacian matrix $L$ or the matrix $Y$ lead to different spectral clustering methods&lt;sup id=&quot;fnref:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:11&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;One popular spectral clustering method is the Ng-Jordan-Weiss (NJW) method&lt;sup id=&quot;fnref:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:9&quot; class=&quot;footnote&quot;&gt;10&lt;/a&gt;&lt;/sup&gt;, detailed in Figure 1 taken from Ng et al.&lt;sup id=&quot;fnref:9:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:9&quot; class=&quot;footnote&quot;&gt;10&lt;/a&gt;&lt;/sup&gt; where $\sigma^2$ is a scaling parameter that &lt;em&gt;controls how rapidly the affinity $A_{ij}$ falls off with the distance between $s_i$ and $s_j$&lt;/em&gt;&lt;sup id=&quot;fnref:9:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:9&quot; class=&quot;footnote&quot;&gt;10&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/correlation-spectral-clustering-njw-method.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/correlation-spectral-clustering-njw-method-small.png&quot; alt=&quot;Ng-Jordan-Weiss spectral clustering method. Source: Ng et al.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 1. Ng-Jordan-Weiss spectral clustering method. Source: Ng et al.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;h3 id=&quot;rationale-behind-spectral-clustering&quot;&gt;Rationale behind spectral clustering&lt;/h3&gt;

&lt;p&gt;&lt;em&gt;At first sight, [spectral clustering] seem to make little sense. Since we run $k$-means [on points $y_1$, …, $y_n$], why not just apply $k$-means directly to the [original points $x_1$, …, $x_n$]&lt;/em&gt;&lt;sup id=&quot;fnref:9:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:9&quot; class=&quot;footnote&quot;&gt;10&lt;/a&gt;&lt;/sup&gt;?&lt;/p&gt;

&lt;p&gt;A visual justification of spectral clustering is provided in Figure 2 adapted from Ng et al.&lt;sup id=&quot;fnref:9:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:9&quot; class=&quot;footnote&quot;&gt;10&lt;/a&gt;&lt;/sup&gt;, which displays data points in $\mathbb{R}^2$ forming two circles (e) and their associated spectral representation in $\mathbb{R}^2$ using the NJW spectral clustering method (h).&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/correlation-spectral-clustering-njw-circles.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/correlation-spectral-clustering-njw-circles-small.png&quot; alt=&quot;Data points forming two circles, 2D plane and spectral plane. Source: Ng et al.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 2. Data points forming two circles, 2D plane and spectral plane. Source: Ng et al.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;In Figure 2, it is clearly visible that the transformed data points (h) form two well-separated convex clusters in the spectral plane, which is the ideal situation for the $k$-means algorithm.&lt;/p&gt;

&lt;p&gt;More generally&lt;sup id=&quot;fnref:55&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:55&quot; class=&quot;footnote&quot;&gt;11&lt;/a&gt;&lt;/sup&gt;, it can be demonstrated that the change of representation from the original data points $x_1$, …, $x_n$ to the embedded data points $y_1$, …, $y_n$ &lt;em&gt;enhances the cluster-properties in the data, 
so that clusters can be trivially detected [by the $k$-means clustering algorithm] in the new representation&lt;/em&gt;&lt;sup id=&quot;fnref:1:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;h2 id=&quot;correlation-based-spectral-clustering&quot;&gt;Correlation-based spectral clustering&lt;/h2&gt;

&lt;p&gt;Let $x_1$, …, $x_n$ be $n$ variables whose pairwise correlations&lt;sup id=&quot;fnref:13&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:13&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt; $\rho_{ij}$, $i,j=1…n$, have been assembled in a correlation matrix $C \in \mathbb{R}^{n \times n}$.&lt;/p&gt;

&lt;p&gt;Because correlation is a measure of dependency, a legitimate question is whether it is possible to use spectral clustering to partition these variables w.r.t. their correlations.
Ideally, we would like to have highly correlated variables grouped together in the same clusters, with low or even negative correlations between the clusters themselves.&lt;/p&gt;

&lt;p&gt;Problem is, as noted in Mantegna&lt;sup id=&quot;fnref:3:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;, &lt;em&gt;the correlation coefficient of a pair of [variables] cannot be used as a distance [or as a similarity] between the two [variables] because it does not fulfill the three axioms that define a metric&lt;/em&gt;&lt;sup id=&quot;fnref:3:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;, which a priori excludes 
its direct usage in a similarity matrix or in an affinity matrix.&lt;/p&gt;

&lt;p&gt;That being said, &lt;em&gt;a&lt;sup id=&quot;fnref:14&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:14&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt; metric&lt;sup id=&quot;fnref:56&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:56&quot; class=&quot;footnote&quot;&gt;14&lt;/a&gt;&lt;/sup&gt; can be defined using as distance a function of the correlation coefficient&lt;/em&gt;&lt;sup id=&quot;fnref:3:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; - like the distance $ d_{ij} = \sqrt{ 2 \left( 1 - \rho_{ij} \right) } $ -, which makes it possible to indirectly use any spectral clustering method&lt;sup id=&quot;fnref:16&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:16&quot; class=&quot;footnote&quot;&gt;15&lt;/a&gt;&lt;/sup&gt; with correlations.&lt;/p&gt;
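&lt;p&gt;As a short illustration, converting a correlation matrix into such a distance matrix is a one-liner (a minimal sketch; the clipping only guards against tiny negative values due to floating-point noise):&lt;/p&gt;

```python
import numpy as np

def correlation_distance(C):
    # d_ij = sqrt(2 * (1 - rho_ij)): 0 for perfectly correlated variables,
    # sqrt(2) for uncorrelated ones, 2 for perfectly anti-correlated ones
    return np.sqrt(np.clip(2.0 * (1.0 - np.asarray(C)), 0.0, None))
```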

&lt;p&gt;But what if we insist on working directly with correlations? In this case, a couple of specific spectral clustering methods have been developed, among which:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;The &lt;em&gt;Blockbuster&lt;/em&gt; spectral clustering method of Brownlees et al.&lt;sup id=&quot;fnref:4:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
  &lt;li&gt;The &lt;em&gt;SPONGE&lt;/em&gt; spectral clustering method of Cucuringu et al.&lt;sup id=&quot;fnref:5:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;the-blockbuster-spectral-clustering-method&quot;&gt;The Blockbuster spectral clustering method&lt;/h3&gt;

&lt;p&gt;Brownlees et al.&lt;sup id=&quot;fnref:4:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt; introduces a network model&lt;sup id=&quot;fnref:22&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:22&quot; class=&quot;footnote&quot;&gt;16&lt;/a&gt;&lt;/sup&gt; under which &lt;em&gt;large panels of time series 
 are partitioned into latent groups such that correlation is higher within groups than between them&lt;/em&gt;&lt;sup id=&quot;fnref:4:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt; and proposes an algorithm relying on the eigenvectors of the sample covariance matrix&lt;sup id=&quot;fnref:21&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:21&quot; class=&quot;footnote&quot;&gt;17&lt;/a&gt;&lt;/sup&gt; $\Sigma \in \mathbb{R}^{n \times n}$ to detect these groups.&lt;/p&gt;

&lt;p&gt;As can be seen in Figure 3 taken from Brownlees et al.&lt;sup id=&quot;fnref:4:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt;, this algorithm - called &lt;em&gt;Blockbuster&lt;/em&gt; - is surprisingly close to the NJW spectral clustering method.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/correlation-spectral-clustering-blockbuster-method.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/correlation-spectral-clustering-blockbuster-method-small.png&quot; alt=&quot;Blockbuster algorithm. Source:  Brownlees et al.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 3. Blockbuster algorithm. Source:  Brownlees et al.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Brownlees et al.&lt;sup id=&quot;fnref:4:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt; establishes that the Blockbuster algorithm &lt;em&gt;consistently detects the [groups] when the number of observations $T$ and the dimension of the panel $n$ are sufficiently&lt;sup id=&quot;fnref:20&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:20&quot; class=&quot;footnote&quot;&gt;18&lt;/a&gt;&lt;/sup&gt; large&lt;/em&gt;&lt;sup id=&quot;fnref:4:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt;, as long as 
a couple of assumptions are satisfied, like:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$T \geq n$, with &lt;em&gt;the more fat-tailed and dependent the data are, the larger $T$ has to be&lt;/em&gt;&lt;sup id=&quot;fnref:4:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href=&quot;https://en.wikipedia.org/wiki/Precision_(statistics)&quot;&gt;The precision matrix&lt;/a&gt; $\Sigma^{-1}$ exists and contains only non-positive entries (or its fraction of positive entries is appropriately controlled&lt;sup id=&quot;fnref:18&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:18&quot; class=&quot;footnote&quot;&gt;19&lt;/a&gt;&lt;/sup&gt;)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;An important feature of the Blockbuster algorithm is that &lt;em&gt;it allows one to detect the [groups] without estimating the network structure of the data&lt;/em&gt;&lt;sup id=&quot;fnref:4:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt;. In other words, while the Blockbuster algorithm is a spectral clustering method, nowhere is an affinity matrix computed!&lt;/p&gt;

&lt;p&gt;The black magic at work here is that the underlying network model of Brownlees et al.&lt;sup id=&quot;fnref:4:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt; additionally assumes that the (symmetric normalized) Laplacian matrix $L$ is a function of the precision matrix $\Sigma^{-1}$ through&lt;/p&gt;

\[L = \frac{\sigma^2}{\phi} \Sigma^{-1} - \frac{1}{\phi} I_n\]

&lt;p&gt;where:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$\sigma^2$ is a network variance parameter that does not influence the detection of the groups&lt;/li&gt;
  &lt;li&gt;$\phi$ is a network dependence parameter that does not influence the detection of the groups&lt;/li&gt;
  &lt;li&gt;$I_n \in \mathbb{R}^{n \times n}$ is the identity matrix of order $n$&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This formula clarifies the otherwise mysterious connection&lt;sup id=&quot;fnref:19&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:19&quot; class=&quot;footnote&quot;&gt;20&lt;/a&gt;&lt;/sup&gt; between the Blockbuster algorithm and the NJW method.&lt;/p&gt;
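&lt;p&gt;To give a feel for the procedure, here is a Blockbuster-style sketch (a minimal sketch only: it clusters the row-normalized eigenvectors associated with the largest eigenvalues of the sample correlation matrix using a plain farthest-point-initialized $k$-means, and omits the refinements of the actual algorithm of Brownlees et al.; function and variable names are illustrative):&lt;/p&gt;

```python
import numpy as np

def blockbuster_sketch(R, k):
    """Cluster the n columns of a T x n panel of returns R into k groups
    from the spectrum of their sample correlation matrix."""
    C = np.corrcoef(R, rowvar=False)
    _, vecs = np.linalg.eigh(C)              # ascending eigenvalues
    Y = vecs[:, -k:]                         # eigenvectors of the k largest
    Y = Y / np.linalg.norm(Y, axis=1, keepdims=True)
    centers = [Y[0]]                         # farthest-point initialization
    for _ in range(k - 1):
        dist = ((Y[:, None] - np.array(centers)[None]) ** 2).sum(-1).min(axis=1)
        centers.append(Y[np.argmax(dist)])
    centers = np.array(centers)
    for _ in range(50):                      # standard k-means iterations
        labels = np.argmin(((Y[:, None] - centers[None]) ** 2).sum(-1), axis=1)
        centers = np.array([Y[labels == j].mean(axis=0) if np.any(labels == j)
                            else centers[j] for j in range(k)])
    return labels
```

&lt;p&gt;On simulated panels with a block-correlation structure, such a procedure recovers the latent groups without any affinity matrix ever being formed, in line with the discussion above.&lt;/p&gt;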

&lt;h3 id=&quot;the-sponge-spectral-clustering-method&quot;&gt;The SPONGE spectral clustering method&lt;/h3&gt;

&lt;p&gt;Cucuringu et al.&lt;sup id=&quot;fnref:5:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt; extends the spectral clustering framework described in the previous section to the case of a signed affinity matrix $A$ whose underlying 
&lt;a href=&quot;https://en.wikipedia.org/wiki/Complete_graph&quot;&gt;complete weighted graph&lt;/a&gt; represents the variables $x_1$, …, $x_n$ and their pairwise correlations.&lt;/p&gt;

&lt;p&gt;Figure 4, taken from Jin et al.&lt;sup id=&quot;fnref:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt;, illustrates the idea of Cucuringu et al.&lt;sup id=&quot;fnref:5:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt;, which is &lt;em&gt;to minimize the number of violations in the constructed partition, with a violation being, as in this figure, when there are negative edges in a cluster and positive edges across clusters&lt;/em&gt;&lt;sup id=&quot;fnref:8:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/correlation-spectral-clustering-sponge-violations.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/correlation-spectral-clustering-sponge-violations-small.png&quot; alt=&quot;Illustration of the idea behind the SPONGE clustering algorithm - minimizing the number of violations in the constructed partition. Source: Jin et al.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 4. Illustration of the idea behind the SPONGE clustering algorithm - minimizing the number of violations in the constructed partition. Source: Jin et al.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;More formally, Cucuringu et al.&lt;sup id=&quot;fnref:5:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt; proposes a three-step algorithm&lt;sup id=&quot;fnref:27&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:27&quot; class=&quot;footnote&quot;&gt;22&lt;/a&gt;&lt;/sup&gt; - called &lt;em&gt;SPONGE (Signed Positive Over Negative Generalized Eigenproblem)&lt;/em&gt; - that aims to &lt;em&gt;find a partition [of the underlying graph] into k clusters such that most edges within clusters are positive, 
and most edges across clusters are negative&lt;/em&gt;&lt;sup id=&quot;fnref:5:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Decompose the adjacency matrix as $A = A^+ - A^-$, with
    &lt;ul&gt;
      &lt;li&gt;$A^+ \in \mathbb{R}^{n \times n}$, with $A^+_{ij} = A_{ij}$ if $A_{ij} \geq 0$ and $A^+_{ij} = 0$ if $A_{ij} &amp;lt; 0$&lt;/li&gt;
      &lt;li&gt;$A^- \in \mathbb{R}^{n \times n}$, with $A^-_{ij} = -A_{ij}$ if $A_{ij} \leq 0$ and $A^-_{ij} = 0$ if $A_{ij} &amp;gt; 0$&lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;
  &lt;li&gt;Compute the (unnormalized) positive and negative Laplacian matrices $L^+$ and $L^-$, with
    &lt;ul&gt;
      &lt;li&gt;$L^+ \in \mathbb{R}^{n \times n}$, with $L^+ = D^+ - A^+$ and $D^+ \in \mathbb{R}^{n \times n}$ a diagonal matrix satisfying $D^+_{ii} = \sum_{j=1}^n A^+_{ij}$&lt;/li&gt;
      &lt;li&gt;$L^- \in \mathbb{R}^{n \times n}$, with $L^- = D^- - A^-$ and $D^- \in \mathbb{R}^{n \times n}$ a diagonal matrix satisfying $D^-_{ii} = \sum_{j=1}^n A^-_{ij}$&lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;
  &lt;li&gt;Compute the matrix $Y \in \mathbb{R}^{n \times k}$ made of the eigenvectors corresponding to the $k$ smallest eigenvalues of the matrix $\left( L^- + \tau^+ D^+ \right)^{-1/2} \left( L^+ + \tau^- D^- \right) \left( L^- + \tau^+ D^+ \right)^{-1/2}$, where $\tau^+ &amp;gt; 0$ and $\tau^- &amp;gt; 0$ are regularization parameters&lt;sup id=&quot;fnref:26&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:26&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt;, and cluster its $n$ rows using the $k$-means algorithm&lt;/li&gt;
&lt;/ul&gt;
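&lt;p&gt;The three steps above can be sketched as follows (a minimal illustration, with arbitrary default values for the regularization parameters $\tau^+$ and $\tau^-$):&lt;/p&gt;

```python
import numpy as np
from scipy.cluster.vq import kmeans2

def sponge(A, k, tau_plus=1.0, tau_minus=1.0):
    """Sketch of the SPONGE algorithm on a signed affinity matrix A."""
    # Step 1 - decompose A = A_plus - A_minus into its positive and negative parts
    A_plus = np.where(A >= 0, A, 0.0)
    A_minus = np.where(A >= 0, 0.0, -A)

    # Step 2 - unnormalized positive and negative Laplacian matrices
    D_plus = np.diag(A_plus.sum(axis=1))
    D_minus = np.diag(A_minus.sum(axis=1))
    L_plus = D_plus - A_plus
    L_minus = D_minus - A_minus

    # Step 3 - eigenvectors of the k smallest eigenvalues of the regularized matrix
    M = L_minus + tau_plus * D_plus
    w, V = np.linalg.eigh(M)
    M_inv_sqrt = V @ np.diag(1.0 / np.sqrt(w)) @ V.T
    B = M_inv_sqrt @ (L_plus + tau_minus * D_minus) @ M_inv_sqrt
    _, V2 = np.linalg.eigh(B)  # eigenvalues in ascending order
    Y = V2[:, :k]

    # k-means on the n rows of Y
    _, labels = kmeans2(Y, k, seed=0, minit="++")
    return labels
```

On a toy signed affinity matrix with positive entries within two blocks and negative entries across them, this sketch should recover the two blocks.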

&lt;p&gt;Cucuringu et al.&lt;sup id=&quot;fnref:5:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt; establishes the consistency&lt;sup id=&quot;fnref:23&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;24&lt;/a&gt;&lt;/sup&gt; of the SPONGE algorithm in the case of $k = 2$ equally-sized clusters, provided &lt;em&gt;$\tau^-$ is sufficiently small&lt;sup id=&quot;fnref:24&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:24&quot; class=&quot;footnote&quot;&gt;25&lt;/a&gt;&lt;/sup&gt; compared to $\tau^+$&lt;/em&gt;&lt;sup id=&quot;fnref:5:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt; and the number of variables $n$ is &lt;em&gt;sufficiently large&lt;sup id=&quot;fnref:57&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:57&quot; class=&quot;footnote&quot;&gt;26&lt;/a&gt;&lt;/sup&gt; for a clustering to be recoverable&lt;/em&gt;&lt;sup id=&quot;fnref:5:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In subsequent work, Cucuringu et al.&lt;sup id=&quot;fnref:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;27&lt;/a&gt;&lt;/sup&gt; establish the consistency&lt;sup id=&quot;fnref:23:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;24&lt;/a&gt;&lt;/sup&gt; of a variant of the SPONGE algorithm - called &lt;em&gt;symmetric&lt;sup id=&quot;fnref:25&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:25&quot; class=&quot;footnote&quot;&gt;28&lt;/a&gt;&lt;/sup&gt; SPONGE&lt;/em&gt; - in the case of $k \geq 2$ unequal-sized clusters &lt;em&gt;when $n$ is large enough&lt;/em&gt;&lt;sup id=&quot;fnref:6:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;27&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;h2 id=&quot;how-to-choose-the-number-of-clusters-in-correlation-based-spectral-clustering&quot;&gt;How to choose the number of clusters in correlation-based spectral clustering?&lt;/h2&gt;

&lt;p&gt;&lt;em&gt;Choosing the number $k$ of clusters is a general problem for all clustering algorithms, and a variety of more or less successful methods have been devised for this problem&lt;/em&gt;&lt;sup id=&quot;fnref:1:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Correlation-based spectral clustering being 1) a clustering method 2) based on the spectrum of a matrix derived from 3) a correlation matrix, there are at least three families of methods which can be used to determine the “optimal” number of clusters:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Generic methods&lt;/li&gt;
  &lt;li&gt;Specific methods tailored to spectral clustering&lt;/li&gt;
  &lt;li&gt;Specific methods tailored to correlation-based clustering or to correlation-based factor analysis&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;However, even if it is mathematically satisfying to find such an optimal number, it is important to keep in mind that &lt;em&gt;just because we find an [optimal] partition […] does not preclude the possibility of other good partitions&lt;/em&gt;&lt;sup id=&quot;fnref:32&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:32&quot; class=&quot;footnote&quot;&gt;29&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Indeed, as Wang and Rohe&lt;sup id=&quot;fnref:32:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:32&quot; class=&quot;footnote&quot;&gt;29&lt;/a&gt;&lt;/sup&gt; put it:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;We must disabuse ourselves of the notion of “the correct partition”. Instead, there are several “reasonable partitions”: some of these clusterings might be consistent with one another (as might be imagined in a hierarchical clustering), others might not be consistent.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h3 id=&quot;generic-methods&quot;&gt;Generic methods&lt;/h3&gt;

&lt;p&gt;Any black box method to select an optimal number of clusters in a clustering algorithm - like &lt;em&gt;&lt;a href=&quot;https://en.wikipedia.org/wiki/Silhouette_(clustering)&quot;&gt;the silhouette index&lt;/a&gt;&lt;/em&gt;&lt;sup id=&quot;fnref:29&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:29&quot; class=&quot;footnote&quot;&gt;30&lt;/a&gt;&lt;/sup&gt; or &lt;em&gt;&lt;a href=&quot;https://en.wikipedia.org/wiki/Determining_the_number_of_clusters_in_a_data_set#The_gap_statistics&quot;&gt;the gap statistic&lt;/a&gt;&lt;/em&gt;&lt;sup id=&quot;fnref:28&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:28&quot; class=&quot;footnote&quot;&gt;31&lt;/a&gt;&lt;/sup&gt; - 
can be used in correlation-based spectral clustering.&lt;/p&gt;

&lt;p&gt;On an opinionated note, though, &lt;em&gt;&lt;a href=&quot;https://en.wikipedia.org/wiki/Elbow_method_(clustering)&quot;&gt;the elbow criterion&lt;/a&gt;&lt;/em&gt; should not be used, for reasons detailed in Schubert&lt;sup id=&quot;fnref:31&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:31&quot; class=&quot;footnote&quot;&gt;32&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;h3 id=&quot;specific-methods-taylored-to-spectral-clustering&quot;&gt;Specific methods tailored to spectral clustering&lt;/h3&gt;

&lt;p&gt;In the context of spectral clustering, several specific methods for determining the optimal number of clusters have been proposed.&lt;/p&gt;

&lt;p&gt;The most well-known of these methods is &lt;em&gt;the eigengap heuristic&lt;/em&gt;&lt;sup id=&quot;fnref:1:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;, whose aim is &lt;em&gt;to choose the number $k$ such that all eigenvalues $\lambda_1$,…,$\lambda_k$ [of the Laplacian matrix] are very small, but $\lambda_{k+1}$ is relatively large&lt;/em&gt;&lt;sup id=&quot;fnref:1:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;. In practice, 
this method &lt;em&gt;works well if the clusters in the data are very well pronounced&lt;/em&gt;&lt;sup id=&quot;fnref:1:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;. Nevertheless, &lt;em&gt;the more noisy or overlapping the clusters are, the less effective is this heuristic&lt;/em&gt;&lt;sup id=&quot;fnref:1:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;, which is obviously a problem for financial applications&lt;sup id=&quot;fnref:30&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:30&quot; class=&quot;footnote&quot;&gt;33&lt;/a&gt;&lt;/sup&gt;. 
Furthermore, &lt;em&gt;just because there is a gap, it doesn’t mean that the rest of the eigenvectors are noise&lt;/em&gt;&lt;sup id=&quot;fnref:32:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:32&quot; class=&quot;footnote&quot;&gt;29&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
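&lt;p&gt;A minimal sketch of the eigengap heuristic, assuming the eigenvalues of the Laplacian matrix have already been computed:&lt;/p&gt;

```python
import numpy as np

def eigengap_number_of_clusters(laplacian_eigenvalues, k_max=None):
    """Eigengap heuristic: choose k such that the first k eigenvalues are
    small and the gap to the (k+1)-th eigenvalue is the largest."""
    w = np.sort(np.asarray(laplacian_eigenvalues))
    if k_max is None:
        k_max = len(w) - 1
    gaps = np.diff(w[:k_max + 1])   # gaps[i] = w[i+1] - w[i]
    return int(np.argmax(gaps)) + 1  # +1 because gaps[0] corresponds to k = 1
```

For example, on the eigenvalues $0, 0.01, 0.02, 0.9, 1, 1.1$, the largest gap occurs after the third eigenvalue, so the heuristic selects $k = 3$.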

&lt;p&gt;Another popular method is the one used in the self-tuning spectral clustering algorithm of Zelnik-Manor and Perona&lt;sup id=&quot;fnref:34&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:34&quot; class=&quot;footnote&quot;&gt;34&lt;/a&gt;&lt;/sup&gt;, which selects the optimal number of clusters as the number that “best” aligns the (rows of the) eigenvectors of the Laplacian matrix with the vectors of the canonical basis of $\mathbb{R}^{k}$.&lt;/p&gt;

&lt;h3 id=&quot;specific-methods-taylored-to-correlation-based-clustering-or-to-correlation-based-factor-analysis&quot;&gt;Specific methods tailored to correlation-based clustering or to correlation-based factor analysis&lt;/h3&gt;

&lt;p&gt;Correlation-based clustering, whether spectral or not, revolves around a very special object - a correlation matrix - whose properties can be used to find the optimal number of clusters.&lt;/p&gt;

&lt;h4 id=&quot;random-matrix-theory-based-methods&quot;&gt;Random matrix theory-based methods&lt;/h4&gt;

&lt;p&gt;From &lt;a href=&quot;https://en.wikipedia.org/wiki/Random_matrix&quot;&gt;random matrix theory&lt;/a&gt;, the distribution of the eigenvalues of a large random correlation matrix follows a particular distribution - called &lt;a href=&quot;https://en.wikipedia.org/wiki/Marchenko%E2%80%93Pastur_distribution&quot;&gt;the Marchenko-Pastur distribution&lt;/a&gt; - 
independent&lt;sup id=&quot;fnref:35&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:35&quot; class=&quot;footnote&quot;&gt;35&lt;/a&gt;&lt;/sup&gt; of the underlying observations.&lt;/p&gt;

&lt;p&gt;This distribution involves a threshold $\lambda_{+} \geq 0$ beyond which no eigenvalue of a random correlation matrix is expected to be found.&lt;/p&gt;

&lt;p&gt;Laloux et al.&lt;sup id=&quot;fnref:36&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:36&quot; class=&quot;footnote&quot;&gt;36&lt;/a&gt;&lt;/sup&gt; uses this threshold to define a correlation matrix denoising procedure called the &lt;em&gt;eigenvalues clipping method&lt;/em&gt;, c.f. &lt;a href=&quot;/blog/correlation-matrices-denoising-results-from-random-matrix-theory&quot;&gt;a previous blog post&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;In the context at hand, Jin et al.&lt;sup id=&quot;fnref:8:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt; proposes to determine the optimal number of clusters as the number of &lt;em&gt;eigenvalues of the correlation matrix that exceed [this threshold], which are the eigenvalues associated with dominant factors or patterns in the [original data]&lt;/em&gt;&lt;sup id=&quot;fnref:8:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;To illustrate this method, Figure 5, taken from Laloux et al.&lt;sup id=&quot;fnref:36:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:36&quot; class=&quot;footnote&quot;&gt;36&lt;/a&gt;&lt;/sup&gt;, depicts two different Marchenko-Pastur distributions fitted to a correlation matrix of&lt;sup id=&quot;fnref:37&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:37&quot; class=&quot;footnote&quot;&gt;37&lt;/a&gt;&lt;/sup&gt; 406 stocks belonging to the S&amp;amp;P 500 index.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/correlation-spectral-clustering-rmt.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/correlation-spectral-clustering-rmt-small.png&quot; alt=&quot;Smoothed density of the eigenvalues of the correlation matrix of 406 assets belonging to the S&amp;amp;P 500 and two fitted Marchenko-Pastur distributions, 1991 – 1996. Source: Laloux et al.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 5. Smoothed density of the eigenvalues of the correlation matrix of 406 assets belonging to the S&amp;amp;P 500 and two fitted Marchenko-Pastur distributions, 1991 – 1996. Source: Laloux et al.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;From this figure, the number of clusters corresponding to the method of Jin et al.&lt;sup id=&quot;fnref:8:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt; when applied to the dotted Marchenko-Pastur distribution would be around 15.&lt;/p&gt;
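&lt;p&gt;As a minimal sketch of this method: under the assumptions of the Marchenko-Pastur theorem, the threshold for a correlation matrix estimated from $T$ observations of $n$ variables is $\lambda_{+} = (1 + \sqrt{n/T})^2$, so that it suffices to count the sample correlation matrix eigenvalues exceeding it (the function name below is illustrative):&lt;/p&gt;

```python
import numpy as np

def number_of_clusters_mp(X):
    """Number of clusters taken as the number of sample correlation matrix
    eigenvalues exceeding the Marchenko-Pastur upper edge (1 + sqrt(n/T))^2."""
    T, n = X.shape
    corr = np.corrcoef(X, rowvar=False)
    eigenvalues = np.linalg.eigvalsh(corr)
    lambda_plus = (1 + np.sqrt(n / T)) ** 2
    return int(np.sum(eigenvalues > lambda_plus))
```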

&lt;h4 id=&quot;non-linear-shrinkage-based-methods&quot;&gt;Non-linear shrinkage-based methods&lt;/h4&gt;

&lt;p&gt;De Nard&lt;sup id=&quot;fnref:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt; notes that &lt;em&gt;non-linear shrinkage [of the sample covariance matrix] pushes the sample eigenvalues toward their closest and most numerous neighbors, thus toward (local) cluster[s]&lt;/em&gt;&lt;sup id=&quot;fnref:7:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;As a consequence, it should be possible to &lt;em&gt;use the information from non-linear shrinkage — the number of
jumps in the shrunk eigenvalues — or directly the distribution of the sample eigenvalues — the number of sample eigenvalue clusters — to obtain&lt;/em&gt;&lt;sup id=&quot;fnref:7:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt; the optimal number of clusters.&lt;/p&gt;

&lt;p&gt;This method is illustrated in Figure 6, taken from de Nard&lt;sup id=&quot;fnref:7:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt;, in which 2 clusters of eigenvalues are visible.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/correlation-spectral-clustering-nl-shrinkage.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/correlation-spectral-clustering-nl-shrinkage-small.png&quot; alt=&quot;Distribution of the sample eigenvalues of a covariance matrix and optimal number of groups corresponding to the centroids of that distribution (0.8 and 2) Source: de Nard.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 6. Distribution of the sample eigenvalues of a covariance matrix and optimal number of groups corresponding to the centroids of that distribution (0.8 and 2). Source: de Nard.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;From a practical perspective, de Nard&lt;sup id=&quot;fnref:7:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt; suggests using 1D &lt;a href=&quot;https://en.wikipedia.org/wiki/Kernel_density_estimation&quot;&gt;KDE clustering&lt;/a&gt; in order to determine the number of sample eigenvalue clusters.&lt;/p&gt;
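&lt;p&gt;A minimal sketch of this suggestion, counting the modes - i.e., the local maxima - of a 1D kernel density estimate of the sample eigenvalues (the bandwidth and evaluation grid below are arbitrary choices):&lt;/p&gt;

```python
import numpy as np
from scipy.stats import gaussian_kde

def number_of_eigenvalue_clusters(eigenvalues, bandwidth=0.2, grid_size=512):
    """Count the clusters of sample eigenvalues as the number of local maxima
    of a 1D kernel density estimate of those eigenvalues."""
    eigenvalues = np.asarray(eigenvalues, dtype=float)
    kde = gaussian_kde(eigenvalues, bw_method=bandwidth)
    grid = np.linspace(eigenvalues.min() - 0.5, eigenvalues.max() + 0.5, grid_size)
    density = kde(grid)
    # Interior grid points strictly greater than both neighbors are local maxima
    left = density[1:-1] > density[:-2]
    right = density[1:-1] > density[2:]
    return int(np.sum(np.logical_and(left, right)))
```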

&lt;h4 id=&quot;factor-analysis-based-methods&quot;&gt;Factor analysis-based methods&lt;/h4&gt;

&lt;p&gt;Although the ultimate goal of cluster analysis and &lt;a href=&quot;https://en.wikipedia.org/wiki/Factor_analysis&quot;&gt;factor analysis&lt;/a&gt; is different, &lt;em&gt;the underlying logic of both techniques is dimension reduction (i.e., summarizing information on multiple 
variables into just a few variables)&lt;/em&gt;&lt;sup id=&quot;fnref:38&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:38&quot; class=&quot;footnote&quot;&gt;39&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Based on this similarity, de Nard&lt;sup id=&quot;fnref:7:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt; discusses the &lt;em&gt;eigenvalue ratio estimator&lt;/em&gt;&lt;sup id=&quot;fnref:39&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:39&quot; class=&quot;footnote&quot;&gt;40&lt;/a&gt;&lt;/sup&gt; of Ahn and Horenstein&lt;sup id=&quot;fnref:39:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:39&quot; class=&quot;footnote&quot;&gt;40&lt;/a&gt;&lt;/sup&gt; that consists in &lt;em&gt;maximizing the ratio of two adjacent eigenvalues [of the sample correlation matrix&lt;sup id=&quot;fnref:40&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:40&quot; class=&quot;footnote&quot;&gt;41&lt;/a&gt;&lt;/sup&gt;] to determine the number of 
factors (here clusters)&lt;/em&gt;&lt;sup id=&quot;fnref:7:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt; in &lt;em&gt;economic or financial data&lt;/em&gt;&lt;sup id=&quot;fnref:39:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:39&quot; class=&quot;footnote&quot;&gt;40&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Unfortunately, the eigenvalue ratio estimator &lt;em&gt;often cannot identify any cluster [in a large universe of U.S. stocks] and sets $k = 1$&lt;/em&gt;&lt;sup id=&quot;fnref:7:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt;, which corresponds to the situation described in Ahn and Horenstein&lt;sup id=&quot;fnref:39:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:39&quot; class=&quot;footnote&quot;&gt;40&lt;/a&gt;&lt;/sup&gt; where &lt;em&gt;one factor 
[- here, the market factor -] has extremely strong explanatory power for response variables&lt;/em&gt;&lt;sup id=&quot;fnref:39:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:39&quot; class=&quot;footnote&quot;&gt;40&lt;/a&gt;&lt;/sup&gt;. In such a situation, the &lt;em&gt;growth ratio estimator&lt;/em&gt;&lt;sup id=&quot;fnref:39:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:39&quot; class=&quot;footnote&quot;&gt;40&lt;/a&gt;&lt;/sup&gt; of Ahn and Horenstein&lt;sup id=&quot;fnref:39:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:39&quot; class=&quot;footnote&quot;&gt;40&lt;/a&gt;&lt;/sup&gt; - this time maximizing the ratio of the logarithmic growth rate of 
two consecutive eigenvalues to determine the number of factors - should be used instead, because it empirically appears to be able to &lt;em&gt;mitigate the effect of the dominant factor&lt;/em&gt;&lt;sup id=&quot;fnref:39:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:39&quot; class=&quot;footnote&quot;&gt;40&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
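&lt;p&gt;Both estimators can be sketched as follows (the eigenvalues are sorted internally in descending order, and the maximum number of factors to consider is an input):&lt;/p&gt;

```python
import numpy as np

def eigenvalue_ratio_estimator(eigenvalues, k_max):
    """Number of factors maximizing the ratio of two adjacent eigenvalues."""
    w = np.sort(np.asarray(eigenvalues, dtype=float))[::-1]  # descending order
    ratios = w[:k_max] / w[1:k_max + 1]
    return int(np.argmax(ratios)) + 1

def growth_ratio_estimator(eigenvalues, k_max):
    """Number of factors maximizing the ratio of the logarithmic growth rates
    of two consecutive eigenvalues."""
    w = np.sort(np.asarray(eigenvalues, dtype=float))[::-1]
    v = np.cumsum(w[::-1])[::-1]        # v[k] = sum of eigenvalues from rank k on
    growth = np.log(v[:-1] / v[1:])     # logarithmic growth rates
    ratios = growth[:k_max] / growth[1:k_max + 1]
    return int(np.argmax(ratios)) + 1
```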

&lt;p&gt;Yet another estimator of the number of common factors is the &lt;em&gt;adjusted correlation thresholding estimator&lt;/em&gt;&lt;sup id=&quot;fnref:41&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:41&quot; class=&quot;footnote&quot;&gt;42&lt;/a&gt;&lt;/sup&gt; of Fan et al.&lt;sup id=&quot;fnref:41:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:41&quot; class=&quot;footnote&quot;&gt;42&lt;/a&gt;&lt;/sup&gt;, which determines the number of common factors as &lt;em&gt;the number of eigenvalues greater than 1 of the population correlation matrix […], 
taking into account the sampling variabilities and biases of top sample eigenvalues&lt;/em&gt;&lt;sup id=&quot;fnref:41:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:41&quot; class=&quot;footnote&quot;&gt;42&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;h2 id=&quot;implementation-in-portfolio-optimizer&quot;&gt;Implementation in Portfolio Optimizer&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Portfolio Optimizer&lt;/strong&gt; implements correlation-based spectral clustering through the endpoint &lt;a href=&quot;https://docs.portfoliooptimizer.io/&quot;&gt;&lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;/assets/clustering/spectral/correlation-based&lt;/code&gt;&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;This endpoint supports three different methods:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;The Blockbuster spectral clustering method&lt;/li&gt;
  &lt;li&gt;The SPONGE spectral clustering method&lt;/li&gt;
  &lt;li&gt;The symmetric SPONGE spectral clustering method (default)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;As for the number of clusters to use, this endpoint supports:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;A manually-defined number of clusters&lt;/li&gt;
  &lt;li&gt;An automatically-determined number of clusters, computed through a proprietary variation of &lt;a href=&quot;https://en.wikipedia.org/wiki/Parallel_analysis&quot;&gt;Horn’s parallel analysis method&lt;/a&gt;&lt;sup id=&quot;fnref:42&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:42&quot; class=&quot;footnote&quot;&gt;43&lt;/a&gt;&lt;/sup&gt; (default)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;As a side note, Horn’s parallel analysis method seems to be little known in finance, but &lt;em&gt;numerous studies [in psychology] have consistently shown that [it] 
is the most nearly accurate methodology for determining the number of factors to retain in an exploratory factor analysis&lt;/em&gt;&lt;sup id=&quot;fnref:43&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:43&quot; class=&quot;footnote&quot;&gt;44&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
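&lt;p&gt;While the exact variation implemented in &lt;strong&gt;Portfolio Optimizer&lt;/strong&gt; is proprietary, the plain Horn’s parallel analysis method can be sketched as follows: retain the leading factors whose sample correlation matrix eigenvalues exceed a high percentile of the eigenvalues obtained from random data of identical dimensions:&lt;/p&gt;

```python
import numpy as np

def parallel_analysis(X, n_simulations=100, percentile=95, seed=0):
    """Sketch of Horn's parallel analysis: number of leading factors whose
    sample correlation matrix eigenvalues exceed the given percentile of the
    eigenvalues obtained from random data of the same dimensions."""
    rng = np.random.default_rng(seed)
    T, n = X.shape
    sample_eigenvalues = np.sort(np.linalg.eigvalsh(np.corrcoef(X, rowvar=False)))[::-1]
    simulated = np.empty((n_simulations, n))
    for i in range(n_simulations):
        Z = rng.standard_normal((T, n))
        simulated[i] = np.sort(np.linalg.eigvalsh(np.corrcoef(Z, rowvar=False)))[::-1]
    thresholds = np.percentile(simulated, percentile, axis=0)
    exceeds = sample_eigenvalues > thresholds
    # Retain the leading factors only, up to the first non-exceedance
    k = 0
    for flag in exceeds:
        if not flag:
            break
        k += 1
    return k
```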

&lt;h2 id=&quot;examples-of-usage&quot;&gt;Examples of usage&lt;/h2&gt;

&lt;h3 id=&quot;automated-clustering-of-us-stocks&quot;&gt;Automated clustering of U.S. stocks&lt;/h3&gt;

&lt;p&gt;De Nard&lt;sup id=&quot;fnref:7:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt; and Jin et al.&lt;sup id=&quot;fnref:8:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt; both study the automated clustering of a dynamic universe of U.S. stocks through correlation-based&lt;sup id=&quot;fnref:44&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:44&quot; class=&quot;footnote&quot;&gt;45&lt;/a&gt;&lt;/sup&gt; spectral clustering:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;De Nard&lt;sup id=&quot;fnref:7:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt;, in the context of covariance matrix shrinkage&lt;/p&gt;

    &lt;p&gt;This study uses the Blockbuster method, together with a 1D KDE clustering method to automatically determine the number of clusters.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Jin et al.&lt;sup id=&quot;fnref:8:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt;, in the context of the construction of statistical arbitrage portfolios&lt;/p&gt;

    &lt;p&gt;This study uses the SPONGE and symmetric SPONGE methods, together with a Marchenko-Pastur distribution-based method&lt;sup id=&quot;fnref:45&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:45&quot; class=&quot;footnote&quot;&gt;46&lt;/a&gt;&lt;/sup&gt; to automatically determine the number of clusters.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;An important remark at this point.&lt;/p&gt;

&lt;p&gt;When clustering stocks, it is possible to rely on what de Nard&lt;sup id=&quot;fnref:7:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt; calls &lt;em&gt;external information&lt;/em&gt;&lt;sup id=&quot;fnref:7:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt;, for example industry classifications like:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;&lt;a href=&quot;https://fr.wikipedia.org/wiki/Standard_Industrial_Classification&quot;&gt;The Standard Industrial Classification&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href=&quot;https://www.msci.com/our-solutions/indexes/gics&quot;&gt;The MSCI Global Industry Classification Standard&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Nevertheless, such external information &lt;em&gt;may fail to create (a valid number of) homogeneous groups&lt;/em&gt;&lt;sup id=&quot;fnref:7:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In the words of de Nard&lt;sup id=&quot;fnref:7:13&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;For example, if we cluster the covariance matrix into groups of financial and nonfinancial firms, arguably, there will be some nonfinancial firm(s) that has (have) some stock characteristics more similar to the financial stocks [than] to the non-financial stocks and is (are) therefore misclassified. Especially in large dimension one would expect a few of such misclassifications in both directions. To overcome this misclassification problem and to really create homogeneous groups, [a data-driven procedure should be used instead].&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Results of de Nard&lt;sup id=&quot;fnref:7:14&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt; and Jin et al.&lt;sup id=&quot;fnref:8:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt; are the following:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Within a universe of 500 stocks over the period January 1998 to December 2017, de Nard&lt;sup id=&quot;fnref:7:15&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt; finds that the number of clusters is limited to only 1 or 2 about 75% of the time.&lt;/li&gt;
  &lt;li&gt;Within a universe of ~600 stocks over the period January 2000 to December 2022, Jin et al.&lt;sup id=&quot;fnref:8:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt; finds that the number of clusters is relatively stable around 19, even though it tends to drop &lt;em&gt;during financial hardships of the United States&lt;/em&gt;&lt;sup id=&quot;fnref:8:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt; down to a minimum of 8, as displayed in Figure 7 adapted from Jin et al.&lt;sup id=&quot;fnref:8:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt;.&lt;/li&gt;
&lt;/ul&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/correlation-spectral-clustering-rmt-number-of-clusters.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/correlation-spectral-clustering-rmt-number-of-clusters-small.png&quot; alt=&quot;Evolution of the number of clusters found by a Marchenko-Pastur distribution-based method, January 2000 - December 2022. Source: Jin et al.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 7. Evolution of the number of clusters found by a Marchenko-Pastur distribution-based method, January 2000 - December 2022. Source: Jin et al.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;These diverging results on closely related data sets show that different methods to determine the number of clusters to use in clustering analysis might be completely at odds with one another&lt;sup id=&quot;fnref:46&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:46&quot; class=&quot;footnote&quot;&gt;47&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;But these results also show that the “optimal” number of clusters - whatever it is - is not constant over time and that &lt;em&gt;methods that dynamically determine [it] can capture changes in market dynamics, especially when there [are] significant downside risks in the market&lt;/em&gt;&lt;sup id=&quot;fnref:8:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt;. 
Incidentally, this is another argument in favor of not relying on (slowly evolving) external information to determine the number of clusters to use.&lt;/p&gt;

&lt;h3 id=&quot;identification-of-risk-onrisk-off-assets-in-a-us-centric-universe-of-assets&quot;&gt;Identification of risk-on/risk-off assets in a U.S.-centric universe of assets&lt;/h3&gt;

&lt;p&gt;In &lt;a href=&quot;https://en.wikipedia.org/wiki/2025_stock_market_crash&quot;&gt;April 2025&lt;/a&gt;, “risk on/risk off” has been making the headlines as a phrase &lt;em&gt;describing investment and asset price behavior&lt;/em&gt;&lt;sup id=&quot;fnref:48&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:48&quot; class=&quot;footnote&quot;&gt;48&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Lee&lt;sup id=&quot;fnref:48:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:48&quot; class=&quot;footnote&quot;&gt;48&lt;/a&gt;&lt;/sup&gt; summarizes the underlying meaning as follows&lt;sup id=&quot;fnref:48:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:48&quot; class=&quot;footnote&quot;&gt;48&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;[…] risk on/risk off generally refers to an investment environment in which asset price behavior is largely driven by how risk appetite advances or retreats over time, usually in a synchronized way at a faster than normal pace across global regions and assets.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;And describes it in more detail as follows&lt;sup id=&quot;fnref:48:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:48&quot; class=&quot;footnote&quot;&gt;48&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;Depending on the environment, investors tend to buy or sell risky assets across the board, paying less attention to the unique characteristics of these assets. Volatilities and, most noticeably, correlations of assets that are perceived as risky will jump, particularly during risk-off periods, inspiring comments such as “correlations go to one” during a crisis. Assets, such as U.S. Treasury bonds, and currencies, such as the Japanese yen, tend to move in the opposite direction of risky assets and are generally perceived as the safer assets to hold in the event of a flight to safety.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;In this sub-section, I propose to study the capability of correlation-based spectral clustering to identify risk-on/risk-off assets in a U.S.-centric universe of assets.&lt;/p&gt;

&lt;p&gt;For this:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;I will use as a universe of assets 13 ETFs representative&lt;sup id=&quot;fnref:50&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:50&quot; class=&quot;footnote&quot;&gt;49&lt;/a&gt;&lt;/sup&gt; of various asset classes:
    &lt;ul&gt;
      &lt;li&gt;U.S. stocks (SPY ETF)&lt;/li&gt;
      &lt;li&gt;European stocks (EZU ETF)&lt;/li&gt;
      &lt;li&gt;Japanese stocks (EWJ ETF)&lt;/li&gt;
      &lt;li&gt;Emerging markets stocks (EEM ETF)&lt;/li&gt;
      &lt;li&gt;U.S. REITs (VNQ ETF)&lt;/li&gt;
      &lt;li&gt;International REITs (RWX ETF)&lt;/li&gt;
      &lt;li&gt;Cash (SHY ETF)&lt;/li&gt;
      &lt;li&gt;U.S. 7-10 year Treasuries (IEF ETF)&lt;/li&gt;
      &lt;li&gt;U.S. 20+ year Treasuries (TLT ETF)&lt;/li&gt;
      &lt;li&gt;U.S. Investment Grade Corporate Bonds (LQD ETF)&lt;/li&gt;
      &lt;li&gt;U.S. High Yield Corporate Bonds (HYG ETF)&lt;/li&gt;
      &lt;li&gt;Commodities (DBC ETF)&lt;/li&gt;
      &lt;li&gt;Gold (GLD ETF)&lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;I will compute the correlation matrix of the daily returns of these assets over the period 1st April 2025 - 30th April 2025&lt;sup id=&quot;fnref:52&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:52&quot; class=&quot;footnote&quot;&gt;50&lt;/a&gt;&lt;/sup&gt;&lt;/p&gt;

    &lt;p&gt;Figure 8 displays that correlation matrix using a planar &lt;a href=&quot;https://en.wikipedia.org/wiki/T-distributed_stochastic_neighbor_embedding&quot;&gt;t-SNE representation&lt;/a&gt; of its rows, considered as points in $\mathbb{R}^{13}$.&lt;/p&gt;

    &lt;figure&gt;
      &lt;a href=&quot;/assets/images/blog/correlation-spectral-clustering-eaam-corr-mat.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/correlation-spectral-clustering-eaam-corr-mat-small.png&quot; alt=&quot;t-SNE representation of the asset correlation matrix, April 2025.&quot; /&gt;&lt;/a&gt;
      &lt;figcaption&gt;Figure 8. t-SNE representation of the asset correlation matrix, April 2025.&lt;/figcaption&gt;
  &lt;/figure&gt;

    &lt;p&gt;To be noted that the t-SNE representation of Figure 8 is a bit misleading in the context of spectral clustering, because spectral clustering does not operate in the t-SNE plane. Still, it is a helpful representation to give a sense of asset “closeness” in terms of correlations.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;I will cluster these assets using the symmetric SPONGE spectral clustering method with $k = 2$ and $k = 3$ clusters, the rationale being that:
    &lt;ul&gt;
      &lt;li&gt;All the assets clustered together with Cash should correspond to risk-off assets ($k = 2,3$)&lt;/li&gt;
      &lt;li&gt;All the assets clustered together with U.S. stocks should correspond to risk-on assets ($k = 2,3$)&lt;/li&gt;
      &lt;li&gt;All the assets clustered in the remaining cluster should correspond to another category to be determined ($k = 3$ only)&lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;
&lt;/ul&gt;
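&lt;p&gt;For readers wishing to reproduce this kind of analysis, the symmetric SPONGE clustering step can be sketched in Python with NumPy and SciPy. This is a minimal illustration of the generalized eigenproblem formulation with the regularization $\tau^+ = \tau^- = 1$, not the reference implementation from the signet package, and the &lt;code&gt;sponge_sym&lt;/code&gt; helper below is purely illustrative:&lt;/p&gt;

```python
import numpy as np
from scipy.linalg import eigh
from scipy.cluster.vq import kmeans2

def sponge_sym(C, k, tau_pos=1.0, tau_neg=1.0, seed=0):
    # Signed adjacency matrix: the correlation matrix with its diagonal
    # zeroed, split into positive and negative parts.
    n = C.shape[0]
    A = C - np.diag(np.diag(C))
    Ap, An = np.maximum(A, 0.0), np.maximum(-A, 0.0)

    def lap_sym(W):
        # Symmetrically normalized Laplacian D^{-1/2} (D - W) D^{-1/2}.
        d = W.sum(axis=1)
        d_safe = np.where(d > 0, d, 1.0)
        d_inv_sqrt = np.where(d > 0, 1.0 / np.sqrt(d_safe), 0.0)
        L = np.diag(d) - W
        return d_inv_sqrt[:, None] * L * d_inv_sqrt[None, :]

    Lp, Ln = lap_sym(Ap), lap_sym(An)
    # Generalized eigenproblem (L+_sym + tau^- I) v = lambda (L-_sym + tau^+ I) v;
    # eigh returns eigenvalues in ascending order, so keep the k smallest.
    vals, vecs = eigh(Lp + tau_neg * np.eye(n), Ln + tau_pos * np.eye(n))
    embedding = vecs[:, :k]
    # Cluster the rows of the spectral embedding with k-means.
    _, labels = kmeans2(embedding, k, minit="++", seed=seed)
    return labels
```

&lt;p&gt;With the 13-asset correlation matrix of Figure 8 as input, calling this helper with $k = 2$ and $k = 3$ would produce the cluster assignments discussed below.&lt;/p&gt;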

&lt;p&gt;Figures 9 and 10 show the resulting clusterings.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/correlation-spectral-clustering-spongesym-eaam-two-clusters.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/correlation-spectral-clustering-spongesym-eaam-two-clusters-small.png&quot; alt=&quot;Symmetric SPONGE clustering of the asset correlation matrix, two clusters, April 2025.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 9. Symmetric SPONGE clustering of the asset correlation matrix, two clusters, April 2025.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/correlation-spectral-clustering-spongesym-eaam-three-clusters.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/correlation-spectral-clustering-spongesym-eaam-three-clusters-small.png&quot; alt=&quot;Symmetric SPONGE clustering of the asset correlation matrix, three clusters, April 2025.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 10. Symmetric SPONGE clustering of the asset correlation matrix, three clusters, April 2025.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;From these figures, and over the considered period:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Intermediate-term U.S. Treasuries is found to be a risk-off asset when $k = 2$&lt;/li&gt;
  &lt;li&gt;Gold is found to be a risk-off asset when $k = 3$, with Intermediate-term and Long-term U.S. Treasuries now moved into their own cluster&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The evolution of the classification of Gold as a risk-off asset is particularly interesting&lt;sup id=&quot;fnref:53&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:53&quot; class=&quot;footnote&quot;&gt;51&lt;/a&gt;&lt;/sup&gt; and also serves to illustrate one of the limitations of the t-SNE representation of the asset correlation matrix.&lt;/p&gt;

&lt;p&gt;Indeed, from that representation, it makes no sense that Cash and Gold could belong to the same cluster since they are complete opposites in the t-SNE plane! But in reality, if the t-SNE plane is diagonally “folded”, Cash and Gold truly are close neighbors…&lt;/p&gt;

&lt;h2 id=&quot;conclusion&quot;&gt;Conclusion&lt;/h2&gt;

&lt;p&gt;This (abruptly?) concludes this already too long overview of correlation-based spectral clustering.&lt;/p&gt;

&lt;p&gt;Waiting for the next blog post on correlation-based clustering, feel free to &lt;a href=&quot;https://www.linkedin.com/in/roman-rubsamen/&quot;&gt;connect with me on LinkedIn&lt;/a&gt; or to &lt;a href=&quot;https://twitter.com/portfoliooptim&quot;&gt;follow me on Twitter&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;–&lt;/p&gt;

&lt;div class=&quot;footnotes&quot; role=&quot;doc-endnotes&quot;&gt;
  &lt;ol&gt;
    &lt;li id=&quot;fn:1&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://link.springer.com/article/10.1007/s11222-007-9033-z&quot;&gt;von Luxburg, U. A tutorial on spectral clustering. Stat Comput 17, 395–416 (2007)&lt;/a&gt;. &lt;a href=&quot;#fnref:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:1:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;10&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;11&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:11&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;12&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:2&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.cambridge.org/core/books/introduction-to-econophysics/6A2727FE42578790E6E1021B7955EE30&quot;&gt;R. N. Mantegna, H. E. Stanley, Introduction to econophysics: correlations and complexity in finance, Cambridge university press, 1999&lt;/a&gt;. &lt;a href=&quot;#fnref:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:3&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://link.springer.com/article/10.1007/s100510050929&quot;&gt;Mantegna, R.N. Hierarchical structure in financial markets. Eur. Phys. J. B 11, 193–197 (1999)&lt;/a&gt;. &lt;a href=&quot;#fnref:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:3:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:4&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.tandfonline.com/doi/abs/10.1080/07350015.2020.1798241&quot;&gt;Brownlees, C., Guðmundsson, G. S., &amp;amp; Lugosi, G. (2020). Community Detection in Partial Correlation Network Models. Journal of Business &amp;amp; Economic Statistics, 40(1), 216–226&lt;/a&gt;. &lt;a href=&quot;#fnref:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:4:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:4:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:4:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:4:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:4:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:4:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:4:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:4:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:4:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;10&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:5&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://proceedings.mlr.press/v89/cucuringu19a.html&quot;&gt;Mihai Cucuringu, Peter Davies, Aldo Glielmo, and Hemant Tyagi. SPONGE: A generalized eigenproblem for clustering signed networks. In Artificial Intelligence and Statistics, volume 89 of Proceedings of Machine Learning Research, pages 1088–1098. PMLR, 2019&lt;/a&gt;. &lt;a href=&quot;#fnref:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:5:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:5:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:5:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:5:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:5:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:5:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:5:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:5:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:10&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;$x_1, …, x_n$ can be more general “objects”, as long as a distance between these objects is defined. &lt;a href=&quot;#fnref:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:54&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;As opposed to their global relationships that are already available from the similarity matrix $S$. &lt;a href=&quot;#fnref:54&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:33&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://dl.acm.org/doi/10.5555/2891460.2891520&quot;&gt;Jin Huang, Feiping Nie, and Heng Huang. 2013. Spectral rotation versus K-means in spectral clustering. In Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence (AAAI’13). AAAI Press, 431–437&lt;/a&gt;. &lt;a href=&quot;#fnref:33&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:11&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The similarity function $s$ is also very important in spectral clustering, although &lt;em&gt;ultimately, the choice of the similarity function depends on the domain the data comes from, and no general advice can be given&lt;/em&gt;&lt;sup id=&quot;fnref:1:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:11&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:9&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://dl.acm.org/doi/10.5555/2980539.2980649&quot;&gt;Andrew Y. Ng, Michael I. Jordan, and Yair Weiss. 2001. On spectral clustering: analysis and an algorithm. In Proceedings of the 15th International Conference on Neural Information Processing Systems: Natural and Synthetic (NIPS’01). MIT Press, Cambridge, MA, USA, 849–856&lt;/a&gt;. &lt;a href=&quot;#fnref:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:9:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:9:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:9:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:9:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:55&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Thanks to the properties of the Laplacian matrix $L$. &lt;a href=&quot;#fnref:55&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:13&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Typically the &lt;a href=&quot;https://en.wikipedia.org/wiki/Pearson_correlation_coefficient&quot;&gt;Pearson correlation&lt;/a&gt;, but other measures like the &lt;a href=&quot;https://en.wikipedia.org/wiki/Spearman%27s_rank_correlation_coefficient&quot;&gt;Spearman correlation&lt;/a&gt; or the &lt;a href=&quot;/blog/the-gerber-statistic-a-robust-co-movement-measure-for-correlation-matrix-estimation&quot;&gt;Gerber correlation&lt;/a&gt; can also be used as long as the resulting correlation matrix $C$ is &lt;a href=&quot;/blog/when-a-correlation-matrix-is-not-a-correlation-matrix-the-nearest-correlation-matrix-problem&quot;&gt;a valid correlation matrix&lt;/a&gt;. &lt;a href=&quot;#fnref:13&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:14&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;It is actually possible to define a whole family of metrics as a function of the Pearson or Spearman correlation coefficient, c.f. van Dongen and Enright&lt;sup id=&quot;fnref:15&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:15&quot; class=&quot;footnote&quot;&gt;52&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:14&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:56&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Such correlation-based metrics are also used elsewhere in finance, for example in de Prado’s&lt;sup id=&quot;fnref:17&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:17&quot; class=&quot;footnote&quot;&gt;53&lt;/a&gt;&lt;/sup&gt; &lt;a href=&quot;/blog/hierarchical-risk-parity-introducing-graph-theory-and-machine-learning-in-portfolio-optimizer/&quot;&gt;&lt;em&gt;Hierarchical Risk Parity&lt;/em&gt; portfolio optimization algorithm&lt;/a&gt;. &lt;a href=&quot;#fnref:56&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:16&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;To be noted that depending on the exact spectral clustering method used, it might be required to first convert that distance to a &lt;a href=&quot;https://en.wikipedia.org/wiki/Similarity_measure&quot;&gt;similarity measure&lt;/a&gt;. &lt;a href=&quot;#fnref:16&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:22&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Called the &lt;em&gt;Generalised Stochastic Block Model&lt;/em&gt; in Brownlees et al.&lt;sup id=&quot;fnref:4:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt;, which is an extension of the vanilla &lt;a href=&quot;https://en.wikipedia.org/wiki/Stochastic_block_model&quot;&gt;stochastic block model&lt;/a&gt;. &lt;a href=&quot;#fnref:22&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:21&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Or of the sample correlation matrix, since a correlation matrix is a covariance matrix of a special kind. &lt;a href=&quot;#fnref:21&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:20&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Both the simulation study and the empirical application in Brownlees et al.&lt;sup id=&quot;fnref:4:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt; show that the Blockbuster algorithm already performs &lt;em&gt;satisfactorily&lt;/em&gt;&lt;sup id=&quot;fnref:4:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt; with $n = 50$, depending on the strength of the correlations; “sufficiently” large is thus not necessarily “very” large, that is, the Blockbuster algorithm seems to be well-behaved in finite sample. &lt;a href=&quot;#fnref:20&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:18&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The definition of “appropriately controlled” is left for future research in Brownlees et al.&lt;sup id=&quot;fnref:4:13&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:18&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:19&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;In particular, this formula explains why the $k$ “largest” eigenvectors of the matrix $\Sigma$ are extracted by the Blockbuster algorithm - it is because they correspond to the $k$ “smallest” eigenvectors of the matrix $\Sigma^{-1}$ (and of the matrix $L$). &lt;a href=&quot;#fnref:19&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:8&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://ora.ox.ac.uk/objects/uuid:c60358c0-24f0-4c66-b973-f84776f66f8a&quot;&gt;Qi Jin, Mihai Cucuringu, and Álvaro Cartea. 2023. Correlation Matrix Clustering for Statistical Arbitrage Portfolios. In Proceedings of the Fourth ACM International Conference on AI in Finance (ICAIF ‘23). Association for Computing Machinery, New York, NY, USA, 557–564&lt;/a&gt;. &lt;a href=&quot;#fnref:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:8:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:8:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:8:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:8:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:8:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:8:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:8:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:8:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:8:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;10&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:8:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;11&lt;/sup&gt;&lt;/a&gt; &lt;a 
href=&quot;#fnref:8:11&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;12&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:27&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;A Python implementation of the SPONGE and symmetric SPONGE algorithms is available at &lt;a href=&quot;https://github.com/alan-turing-institute/signet&quot;&gt;https://github.com/alan-turing-institute/signet&lt;/a&gt;. &lt;a href=&quot;#fnref:27&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:26&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;These two regularization parameters aim to &lt;em&gt;promote clusterizations that avoid small-sized clusters&lt;/em&gt;&lt;sup id=&quot;fnref:8:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:26&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:23&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Under a &lt;em&gt;signed stochastic block model&lt;/em&gt;&lt;sup id=&quot;fnref:5:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt;, which is an extension of the vanilla &lt;a href=&quot;https://en.wikipedia.org/wiki/Stochastic_block_model&quot;&gt;stochastic block model&lt;/a&gt;. &lt;a href=&quot;#fnref:23&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:23:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:24&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;In numerical experiments, Cucuringu et al.&lt;sup id=&quot;fnref:5:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt; use $\tau^+ = \tau^- = 1$, which nearly &lt;em&gt;always falls within the region of maximum recovery when it is present&lt;/em&gt;&lt;sup id=&quot;fnref:5:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:24&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:57&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;In numerical experiments, Cucuringu et al.&lt;sup id=&quot;fnref:5:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt; use $n$ ranging from 50 to 11259. &lt;a href=&quot;#fnref:57&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:6&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;http://jmlr.org/papers/v22/20-1289.html&quot;&gt;Mihai Cucuringu, Apoorv Vikram Singh, Deborah Sulem, and Hemant Tyagi. 2021. Regularized spectral methods for clustering signed networks. Journal of Machine Learning Research 22, 264 (2021), 1–79&lt;/a&gt;. &lt;a href=&quot;#fnref:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:6:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:25&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The symmetric SPONGE algorithm uses symmetric positive and negative Laplacian matrices $L^+_{sym} = \left( D^+ \right)^{-1/2} L^+ \left( D^+ \right)^{-1/2}$ and $L^-_{sym} = \left( D^- \right)^{-1/2} L^- \left( D^- \right)^{-1/2}$ instead of unnormalized ones, and uses the eigenvectors associated with the $k$ smallest eigenvalues of the matrix $\left( L^-_{sym} + \tau^+ I_n \right)^{-1/2} \left( L^+_{sym} + \tau^- I_n \right) \left( L^-_{sym} + \tau^+ I_n \right)^{-1/2}$, c.f. Cucuringu et al.&lt;sup id=&quot;fnref:5:13&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:25&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:32&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://projecteuclid.org/journals/annals-of-applied-statistics/volume-10/issue-4/Discussion-of-Coauthorship-and-citation-networks-for-statisticians/10.1214/16-AOAS977.full&quot;&gt;Song Wang. Karl Rohe. “Discussion of “Coauthorship and citation networks for statisticians”.” Ann. Appl. Stat. 10 (4) 1820 - 1826, December 2016&lt;/a&gt;. &lt;a href=&quot;#fnref:32&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:32:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:32:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:29&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.sciencedirect.com/science/article/pii/0377042787901257?via%3Dihub&quot;&gt;Peter J. Rousseeuw (1987). Silhouettes: a Graphical Aid to the Interpretation and Validation of Cluster Analysis. Computational and Applied Mathematics. 20: 53–65&lt;/a&gt;. &lt;a href=&quot;#fnref:29&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:28&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://academic.oup.com/jrsssb/article-abstract/63/2/411/7083348?redirectedFrom=fulltext&quot;&gt;Tibshirani, R., Walther, G., and Hastie, T. Estimating the number of clusters in a data set via the gap statistic. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 63, 2 (2001), 411–423&lt;/a&gt;. &lt;a href=&quot;#fnref:28&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:31&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://dl.acm.org/doi/10.1145/3606274.3606278&quot;&gt;Erich Schubert. 2023. Stop using the elbow criterion for k-means and how to choose the number of clusters instead. SIGKDD Explor. Newsl. 25, 1 (June 2023), 36–42&lt;/a&gt;. &lt;a href=&quot;#fnref:31&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:30&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;For example, de Nard&lt;sup id=&quot;fnref:7:16&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt; finds that &lt;em&gt;for the estimation problem of asset return covariance matrices, the eigengap heuristic often cannot identify any cluster and sets $k=1$&lt;/em&gt;&lt;sup id=&quot;fnref:7:17&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:30&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:34&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://papers.nips.cc/paper_files/paper/2004/hash/40173ea48d9567f1f393b20c855bb40b-Abstract.html&quot;&gt;Zelnik-Manor, L. and P. Perona (2004). Self-tuning spectral clustering. In Advances in neural information processing systems, pp. 1601–1608&lt;/a&gt;. &lt;a href=&quot;#fnref:34&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:35&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Under suitable technical assumptions. &lt;a href=&quot;#fnref:35&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:36&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://journals.aps.org/prl/abstract/10.1103/PhysRevLett.83.1467&quot;&gt;Laurent Laloux, Pierre Cizeau, Jean-Philippe Bouchaud, and Marc Potters, Noise Dressing of Financial Correlation Matrices, Phys. Rev. Lett. 83, 1467&lt;/a&gt;. &lt;a href=&quot;#fnref:36&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:36:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:37&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The returns of. &lt;a href=&quot;#fnref:37&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:7&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://academic.oup.com/jfec/article-abstract/20/4/569/5960228?redirectedFrom=fulltext&quot;&gt;Gianluca de Nard, Oops! I Shrunk the Sample Covariance Matrix Again: Blockbuster Meets Shrinkage, Journal of Financial Econometrics, Volume 20, Issue 4, Fall 2022, Pages 569–611&lt;/a&gt;. &lt;a href=&quot;#fnref:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:7:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;10&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;11&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:11&quot; class=&quot;reversefootnote&quot; 
role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;12&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:12&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;13&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:13&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;14&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:14&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;15&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:15&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;16&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:16&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;17&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:7:17&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;18&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:38&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.sciencedirect.com/science/article/abs/pii/S0091743514002485&quot;&gt;Hedwig Hofstetter, Elise Dusseldorp, Pepijn van Empelen, Theo W.G.M. Paulussen, A primer on the use of cluster analysis or factor analysis to assess co-occurrence of risk behaviors, Preventive Medicine, Volume 67, 2014, Pages 141-146&lt;/a&gt;. &lt;a href=&quot;#fnref:38&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:39&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://onlinelibrary.wiley.com/doi/abs/10.3982/ECTA8968&quot;&gt;Ahn, S., and A. Horenstein. 2013. Eigenvalue Ratio Test for the Number of Factors. Econometrica 80: 1203–1227&lt;/a&gt;. &lt;a href=&quot;#fnref:39&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:39:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:39:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:39:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:39:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:39:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:39:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:39:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:40&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The eigenvalue ratio estimator and the growth ratio estimator of Ahn and Horenstein&lt;sup id=&quot;fnref:39:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:39&quot; class=&quot;footnote&quot;&gt;40&lt;/a&gt;&lt;/sup&gt; both rely on the covariance matrix, of which the correlation matrix is a special case. In addition, for reasons detailed in Fan et al.&lt;sup id=&quot;fnref:41:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:41&quot; class=&quot;footnote&quot;&gt;42&lt;/a&gt;&lt;/sup&gt;, using the covariance matrix to determine the number of factors to retain in a factor analysis is generally a bad idea. &lt;a href=&quot;#fnref:40&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:41&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.tandfonline.com/doi/abs/10.1080/01621459.2020.1825448&quot;&gt;Fan, J., Guo, J., &amp;amp; Zheng, S. (2020). Estimating Number of Factors by Adjusted Eigenvalues Thresholding. Journal of the American Statistical Association, 117(538), 852–861&lt;/a&gt;. &lt;a href=&quot;#fnref:41&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:41:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:41:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:41:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:42&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://pubmed.ncbi.nlm.nih.gov/14306381/&quot;&gt;Horn, J. L. (1965). A rationale and test for the number of factors in factor analysis. Psychometrika, 30(2), 179–185&lt;/a&gt;. &lt;a href=&quot;#fnref:42&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:43&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://journals.sagepub.com/doi/10.1177/0013164495055003002&quot;&gt;Glorfeld, L. W. (1995). An Improvement on Horn’s Parallel Analysis Methodology for Selecting the Correct Number of Factors to Retain. Educational and Psychological Measurement, 55(3), 377-393&lt;/a&gt;. &lt;a href=&quot;#fnref:43&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:44&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Due to the context of the paper, de Nard&lt;sup id=&quot;fnref:7:18&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt; uses covariance-based spectral clustering. &lt;a href=&quot;#fnref:44&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:45&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Jin et al.&lt;sup id=&quot;fnref:8:13&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt; also uses another method to determine the number of clusters, based on the percentage of variance explained by selecting the top $k$ eigenvalues of the asset correlation matrix. &lt;a href=&quot;#fnref:45&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:46&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;It should be noted, though, that Jin et al.&lt;sup id=&quot;fnref:8:14&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt;’s methodology for computing the asset correlation matrix differs from de Nard&lt;sup id=&quot;fnref:7:19&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt;’s, especially in that Jin et al.&lt;sup id=&quot;fnref:8:15&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt; consider residual asset returns rather than raw asset returns. Such a difference could - and actually, should - influence the computation of the optimal number of clusters, whatever the method used. Nevertheless, my personal experience with de Nard&lt;sup id=&quot;fnref:7:20&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;38&lt;/a&gt;&lt;/sup&gt;’s 1D KDE clustering method is that it is too sensitive to its parameters (kernel and bandwidth) to be confidently used in practice. &lt;a href=&quot;#fnref:46&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:48&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.pm-research.com/content/iijpormgmt/38/3/28&quot;&gt;Lee, Wai, Risk On/Risk Off, The Journal of Portfolio Management  Spring 2012, 38 (3) 28-39&lt;/a&gt;. &lt;a href=&quot;#fnref:48&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:48:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:48:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:48:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:50&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Ten of these ETFs are used in the &lt;em&gt;Adaptive Asset Allocation&lt;/em&gt; strategy from &lt;a href=&quot;https://investresolve.com/&quot;&gt;ReSolve Asset Management&lt;/a&gt;, described in the paper &lt;em&gt;Adaptive Asset Allocation: A Primer&lt;/em&gt;&lt;sup id=&quot;fnref:51&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:51&quot; class=&quot;footnote&quot;&gt;54&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:50&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:52&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;(Adjusted) prices have been retrieved using &lt;a href=&quot;https://api.tiingo.com/&quot;&gt;Tiingo&lt;/a&gt;. &lt;a href=&quot;#fnref:52&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:53&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;My own interpretation is that if one prefers to avoid investing in Intermediate-term U.S. Treasuries, Gold then represents the “closest” risk-off asset. &lt;a href=&quot;#fnref:53&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:15&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://arxiv.org/abs/1208.3145&quot;&gt;Stijn van Dongen, Anton J. Enright, Metric distances derived from cosine similarity and Pearson and Spearman correlations, arXiv&lt;/a&gt;. &lt;a href=&quot;#fnref:15&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:17&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://jpm.pm-research.com/content/42/4/59&quot;&gt;Lopez de Prado, M. (2016). Building diversified portfolios that outperform out-of-sample. Journal of Portfolio Management, 42(4), 59–69&lt;/a&gt;. &lt;a href=&quot;#fnref:17&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:51&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2328254&quot;&gt;Butler, Adam and Philbrick, Mike and Gordillo, Rodrigo and Varadi, David, Adaptive Asset Allocation: A Primer&lt;/a&gt;. &lt;a href=&quot;#fnref:51&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
  &lt;/ol&gt;
&lt;/div&gt;</content><author><name>Roman R.</name></author><category term="clustering" /><category term="correlation matrix" /><summary type="html">Clustering consists in trying to identify groups of “similar behavior”1 - called clusters - from a dataset, according to some chosen characteristics. An example of such a characteristic in finance is the correlation coefficient between two time series of asset returns, whose usage to partition a universe of assets into groups of “close” and “distant” assets thanks to a hierarchical clustering method was originally2 proposed in Mantegna3. In this blog post, I will describe two correlation-based clustering methods belonging to the family of spectral clustering methods: the Blockbuster method introduced in Brownlees et al.4 and the SPONGE method introduced in Cucuringu et al.5. As examples of usage, I will discuss 1) how to automatically group U.S. stocks together without relying on external information like industry classification and 2) how to identify risk-on and risk-off assets within a U.S. centric universe of assets. Mathematical preliminaries Let $x_1$, …, $x_n$ be $n$ points6 in $\mathbb{R}^m, m \geq 1$ to be partitioned into $k \geq 2$ subsets. Spectral clustering Spectral clustering, like other approaches to clustering - geometric approaches such as $k$-means , density-based approaches such as DBSCAN… - initially relies on pairwise similarities $s(x_i, x_j)$ between points $i,j=1..n$ with $s$ some similarity function which is symmetric and non-negative1. 
Once the corresponding similarity matrix $S_{ij} = s(x_i, x_j)$, $i,j=1…n$, has been computed, a spectral clustering method then usually follows a three-step process: Compute an affinity matrix $A \in \mathbb{R}^{n \times n}$ from the similarity matrix $S$ The affinity matrix $A$ corresponds to the adjacency matrix of an underlying graph whose vertices represent the points $x_1$, …, $x_n$ and whose edges model the local7 neighborhood relationships between the data points1. Compute a matrix $L \in \mathbb{R}^{n \times n}$ derived from the affinity matrix $A$ Due to the relationship between spectral clustering and graph theory1, the matrix $L$ is typically called a Laplacian matrix. Compute a matrix $Y \in \mathbb{R}^{n \times k}$ derived from the eigenvectors corresponding to the $k$ smallest (or sometimes largest) eigenvalues of the matrix $L$ and cluster its $n$ rows $y_1$, …, $y_n$ using the $k$-means algorithm The result of that clustering represents the desired clustering of the original $n$ points $x_1$, …, $x_n$. It should be emphasized that there is nothing principled about using the $k$-means algorithm in this step1, cf. for example Huang et al.8 in which spectral rotations are used instead of $k$-means, but one can argue that at least the Euclidean distance between the [rows of the matrix $Y$] is a meaningful quantity to look at1. The Ng-Jordan-Weiss spectral clustering method Different ways of computing the affinity matrix $A$, the Laplacian matrix $L$ or the matrix $Y$ lead to different spectral clustering methods9. One popular spectral clustering method is the Ng-Jordan-Weiss (NJW) method10, detailed in Figure 1 taken from Ng et al.10 where $\sigma^2$ is a scaling parameter that controls how rapidly the affinity $A_{ij}$ falls off with the distance between $s_i$ and $s_j$10. Figure 1. Ng-Jordan-Weiss spectral clustering method. Source: Ng et al. Rationale behind spectral clustering At first sight, [spectral clustering] seems to make little sense. 
Since we run $k$-means [on points $y_1$, …, $y_n$], why not just apply $k$-means directly to the [original points $x_1$, …, $x_n$]10? A visual justification of spectral clustering is provided in Figure 2 adapted from Ng et al.10, which displays data points in $\mathbb{R}^2$ forming two circles (e) and their associated spectral representation in $\mathbb{R}^2$ using the NJW spectral clustering method (h). Figure 2. Data points forming two circles, 2D plane and spectral plane. Source: Ng et al. On Figure 2, it is clearly visible that the transformed data points (h) form two well-separated convex clusters in the spectral plane, which is the ideal situation for the $k$-means algorithm. More generally11, it can be demonstrated that the change of representation from the original data points $x_1$, …, $x_n$ to the embedded data points $y_1$, …, $y_n$ enhances the cluster-properties in the data, so that clusters can be trivially detected [by the $k$-means clustering algorithm] in the new representation1. Correlation-based spectral clustering Let $x_1$, …, $x_n$ be $n$ variables whose pairwise correlations12 $\rho_{ij}$, $i,j=1…n$, have been assembled in a correlation matrix $C \in \mathbb{R}^{n \times n}$. Because correlation is a measure of dependency, a legitimate question is whether it is possible to use spectral clustering to partition these variables w.r.t. their correlations. Ideally, we would like to have highly correlated variables grouped together in the same clusters, with low or even negative correlations between the clusters themselves. The problem is, as noted in Mantegna3, the correlation coefficient of a pair of [variables] cannot be used as a distance [or as a similarity] between the two [variables] because it does not fulfill the three axioms that define a metric3, which a priori excludes its direct usage in a similarity matrix or in an affinity matrix. 
That being said, a13 metric14 can be defined using as distance a function of the correlation coefficient3 - like the distance $ d_{ij} = \sqrt{ 2 \left( 1 - \rho_{ij} \right) } $ -, which makes it possible to indirectly use any spectral clustering method15 with correlations. But what if we insist on working directly with correlations? In this case, a couple of specific spectral clustering methods have been developed, among which: The Blockbuster spectral clustering method of Brownlees et al.4 The SPONGE spectral clustering method of Cucuringu et al.5 The Blockbuster spectral clustering method Brownlees et al.4 introduces a network model16 under which large panels of time series are partitioned into latent groups such that correlation is higher within groups than between them4 and proposes an algorithm relying on the eigenvectors of the sample covariance matrix17 $\Sigma \in \mathbb{R}^{n \times n}$ to detect these groups. As can be seen in Figure 3 taken from Brownlees et al.4, this algorithm - called Blockbuster - is surprisingly close to the NJW spectral clustering method. Figure 3. Blockbuster algorithm. Source: Brownlees et al. Brownlees et al.4 establishes that the Blockbuster algorithm consistently detects the [groups] when the number of observations $T$ and the dimension of the panel $n$ are sufficiently18 large4, as long as a couple of assumptions are satisfied, like: $T \geq n$, with the more fat-tailed and dependent the data are, the larger $T$ has to be4 The precision matrix $\Sigma^{-1}$ exists and contains only non-positive entries (or its fraction of positive entries is appropriately controlled19) An important feature of the Blockbuster algorithm is that it allows one to detect the [groups] without estimating the network structure of the data4. In other words, while the Blockbuster algorithm is a spectral clustering method, nowhere is an affinity matrix computed! 
The black magic at work here is that the underlying network model of Brownlees et al.4 additionally assumes that the (symmetric normalized) Laplacian matrix $L$ is a function of the precision matrix $\Sigma^{-1}$ through \[L = \frac{\sigma^2}{\phi} \Sigma^{-1} - \frac{1}{\phi} I_n\] , where: $\sigma^2$ is a network variance parameter that does not influence the detection of the groups $\phi$ is a network dependence parameter that does not influence the detection of the groups $I_n \in \mathbb{R}^{n \times n}$ is the identity matrix of order $n$ This formula clarifies the otherwise mysterious connection20 between the Blockbuster algorithm and the NJW method. The SPONGE spectral clustering method Cucuringu et al.5 extends the spectral clustering framework described in the previous section to the case of a signed affinity matrix $A$ whose underlying complete weighted graph represents the variables $x_1$, …, $x_n$ and their pairwise correlations. Figure 4 taken from Jin et al.21 illustrates the idea of Cucuringu et al.5, which is to minimize the number of violations in the constructed partition, where a violation, as in this figure, is when there are negative edges in a cluster and positive edges across clusters21. Figure 4. Illustration of the idea behind the SPONGE clustering algorithm - minimizing the number of violations in the constructed partition. Source: Jin et al. 
More formally, Cucuringu et al.5 proposes a three-step algorithm22 - called SPONGE (Signed Positive Over Negative Generalized Eigenproblem) - that aims to find a partition [of the underlying graph] into k clusters such that most edges within clusters are positive, and most edges across clusters are negative5: Decompose the adjacency matrix as $A = A^+ - A^-$, with $A^+ \in \mathbb{R}^{n \times n}$, with $A^+_{ij} = A_{ij}$ if $A_{ij} \geq 0$ and $A^+_{ij} = 0$ if $A_{ij} &amp;lt; 0$ $A^- \in \mathbb{R}^{n \times n}$, with $A^-_{ij} = A_{ij}$ if $A_{ij} \leq 0$ and $A^-_{ij} = 0$ if $A_{ij} &amp;gt; 0$ Compute the (unnormalized) positive and negative Laplacian matrices $L^+$ and $L^-$, with $L^+ \in \mathbb{R}^{n \times n}$, with $L^+ = D^+ - A^+$ and $D^+ \in \mathbb{R}^{n \times n}$ a diagonal matrix satisfying $D^+_{ii} = \sum_{j=1}^n A^+_{ij}$ $L^- \in \mathbb{R}^{n \times n}$, with $L^- = D^- - A^-$ and $D^- \in \mathbb{R}^{n \times n}$ a diagonal matrix satisfying $D^-_{ii} = \sum_{j=1}^n A^-_{ij}$ Compute the matrix $Y \in \mathbb{R}^{n \times k}$ made of the eigenvectors corresponding to the $k$ smallest eigenvalues of the matrix $\left( L^- + \tau^+ D^+ \right)^{-1/2} \left( L^+ + \tau^- D^- \right) \left( L^- + \tau^+ D^+ \right)^{-1/2}$, where $\tau^+ &amp;gt; 0$ and $\tau^- &amp;gt; 0$ are regularization parameters23, and cluster its $n$ rows using the $k$-means algorithm Cucuringu et al.5 establishes the consistency24 of the SPONGE algorithm in the case of $k = 2$ equally-sized clusters, provided $\tau^-$ is sufficiently small25 compared to $\tau^+$5 and the number of variables $n$ is sufficiently large26 for a clustering to be recoverable5. In subsequent work, Cucuringu et al.27 establish the consistency24 of a variant of the SPONGE algorithm - called symmetric28 SPONGE - in the case of $k \geq 2$ unequal-sized clusters when $n$ is large enough27. How to choose the number of clusters in correlation-based spectral clustering? 
Choosing the number $k$ of clusters is a general problem for all clustering algorithms, and a variety of more or less successful methods have been devised for this problem1. Correlation-based spectral clustering being 1) a clustering method 2) based on the spectrum of a matrix derived from 3) a correlation matrix, there are at least three families of methods which can be used to determine the “optimal” number of clusters: Generic methods Specific methods tailored to spectral clustering Specific methods tailored to correlation-based clustering or to correlation-based factor analysis However, even if it is mathematically satisfying to find such an optimal number, it is important to keep in mind that just because we find an [optimal] partition […] does not preclude the possibility of other good partitions29. Indeed, as Wang and Rohe29 put it: We must disabuse ourselves of the notion of “the correct partition”. Instead, there are several “reasonable partitions” some of these clusterings might be consistent with one another (as might be imagined in a hierarchical clustering), others might not be consistent. Generic methods Any black box method to select an optimal number of clusters in a clustering algorithm - like the silhouette index30 or the gap statistic31 - can be used in correlation-based spectral clustering. On an opinionated note, though, the elbow criterion should not be used, for reasons detailed in Schubert32. Specific methods tailored to spectral clustering In the context of spectral clustering, several specific methods for determining the optimal number of clusters have been proposed. The most well-known of these methods is the eigengap heuristic1, whose aim is to choose the number $k$ such that all eigenvalues $\lambda_1$,…,$\lambda_k$ [of the Laplacian matrix] are very small, but $\lambda_{k+1}$ is relatively large1. In practice, this method works well if the clusters in the data are very well pronounced1. 
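As an illustration only (this sketch is mine, not code from any of the referenced papers; the function name, the symmetric normalized Laplacian choice, and the toy block-diagonal affinity matrix are all assumptions for the example), the eigengap heuristic can be sketched in a few lines of NumPy:

```python
# Minimal sketch of the eigengap heuristic, assuming a symmetric
# non-negative affinity matrix A and the symmetric normalized Laplacian.
import numpy as np

def eigengap_number_of_clusters(A, k_max=10):
    """Choose k as the position of the largest gap among the smallest
    eigenvalues of the symmetric normalized Laplacian of A."""
    d = A.sum(axis=1)                       # vertex degrees
    D_inv_sqrt = np.diag(1.0 / np.sqrt(d))  # D^{-1/2}
    L = np.eye(A.shape[0]) - D_inv_sqrt @ A @ D_inv_sqrt
    eigvals = np.sort(np.linalg.eigvalsh(L))[:k_max]
    gaps = np.diff(eigvals)                 # lambda_{i+1} - lambda_i
    return int(np.argmax(gaps)) + 1

# Toy affinity matrix: two perfectly separated blocks of 3 points each
A = np.kron(np.eye(2), np.ones((3, 3)))
print(eigengap_number_of_clusters(A))  # prints 2
```

On this idealized block-diagonal example the first two Laplacian eigenvalues are exactly zero and the rest equal one, so the largest gap sits after the second eigenvalue and the heuristic recovers $k = 2$.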
Nevertheless, the more noisy or overlapping the clusters are, the less effective is this heuristic1, which is obviously a problem for financial applications33. Furthermore, just because there is a gap, it doesn’t mean that the rest of the eigenvectors are noise29. Another popular method is the method used in the self-tuning spectral clustering algorithm of Zelnik-Manor and Perona34, which selects the optimal number of clusters as the number that “best” aligns the (rows of the) eigenvectors of the Laplacian matrix with the vectors of the canonical basis of $\mathbb{R}^{k}$. Specific methods tailored to correlation-based clustering or to correlation-based factor analysis Correlation-based clustering, whether spectral or not, revolves around a very special object - a correlation matrix -, whose properties can be used to find the optimal number of clusters. Random matrix theory-based methods From random matrix theory, the distribution of the eigenvalues of a large random correlation matrix follows a particular distribution - called the Marchenko-Pastur distribution - independent35 of the underlying observations. That distribution involves a threshold $\lambda_{+} \geq 0$ beyond which it is not expected to find any eigenvalue in a random correlation matrix. Laloux et al.36 uses this threshold to define a correlation matrix denoising procedure called the eigenvalues clipping method, cf. a previous blog post. In the context at hand, Jin et al.21 proposes to determine the optimal number of clusters as the number of eigenvalues of the correlation matrix that exceed [this threshold], which are the eigenvalues associated with dominant factors or patterns in the [original data]21. To illustrate this method, Figure 5, taken from Laloux et al.36, depicts two different Marchenko-Pastur distributions fitted to a correlation matrix of37 406 stocks belonging to the S&amp;amp;P 500 index. Figure 5. 
Smoothed density of the eigenvalues of the correlation matrix of 406 assets belonging to the S&amp;amp;P 500 and two fitted Marchenko-Pastur distributions, 1991 – 1996. Source: Laloux et al. From this figure, the number of clusters corresponding to the method of Jin et al.21 when applied to the dotted Marchenko-Pastur distribution would be around 15. Non-linear shrinkage-based methods De Nard38 notes that non-linear shrinkage [of the sample covariance matrix] pushes the sample eigenvalues toward their closest and most numerous neighbors, thus toward (local) cluster38. As a consequence, it should be possible to use the information from non-linear shrinkage — the number of jumps in the shrunk eigenvalues — or directly the distribution of the sample eigenvalues — the number of sample eigenvalue clusters — to obtain38 the optimal number of clusters. This method is illustrated in Figure 6 taken from de Nard38, on which 2 clusters are visible. Figure 6. Distribution of the sample eigenvalues of a covariance matrix and optimal number of groups corresponding to the centroids of that distribution (0.8 and 2). Source: de Nard. From a practical perspective, de Nard38 suggests using 1D KDE clustering in order to determine the number of sample eigenvalue clusters. Factor analysis-based methods Although the ultimate goal of cluster analysis and factor analysis is different, the underlying logic of both techniques is dimension reduction (i.e., summarizing information on multiple variables into just a few variables)39. Based on this similarity, de Nard38 discusses the eigenvalue ratio estimator40 of Ahn and Horenstein40 that consists in maximizing the ratio of two adjacent eigenvalues [of the sample correlation matrix41] to determine the number of factors (here clusters)38 in economic or financial data40. Unfortunately, the eigenvalue ratio estimator often cannot identify any cluster [in a large universe of U.S. 
stocks] and sets $k = 1$38, which corresponds to the situation described in Ahn and Horenstein40 where one factor [- here, the market factor -] has extremely strong explanatory power for response variables40. In such a situation, the growth ratio estimator40 of Ahn and Horenstein40 - this time maximizing the ratio of the logarithmic growth rate of two consecutive eigenvalues to determine the number of factors - should be used instead, because it empirically appears to be able to mitigate the effect of the dominant factor40. Yet another estimator of the number of common factors is the adjusted correlation thresholding estimator42 of Fan et al.42, which determines the number of common factors as the number of eigenvalues greater than 1 of the population correlation matrix […], taking into account the sampling variabilities and biases of top sample eigenvalues42. Implementation in Portfolio Optimizer Portfolio Optimizer implements correlation-based spectral clustering through the endpoint /assets/clustering/spectral/correlation-based. This endpoint supports three different methods: The Blockbuster spectral clustering method The SPONGE spectral clustering method The symmetric SPONGE spectral clustering method (default) As for the number of clusters to use, this endpoint supports: A manually-defined number of clusters An automatically-determined number of clusters, computed through a proprietary variation of Horn’s parallel analysis method43 (default) As a side note, Horn’s parallel analysis method seems to be little known in finance, but numerous studies [in psychology] have consistently shown that [it] is the most nearly accurate methodology for determining the number of factors to retain in an exploratory factor analysis44. Examples of usage Automated clustering of U.S. stocks De Nard38 and Jin et al.21 both study the automated clustering of a dynamic universe of U.S. 
stocks through correlation-based45 spectral clustering: De Nard38, in the context of covariance matrix shrinkage It uses the Blockbuster method, together with a 1D KDE clustering method to automatically determine the number of clusters. Jin et al.21, in the context of the construction of statistical arbitrage portfolios It uses the SPONGE and SPONGE symmetric methods, together with a Marchenko-Pastur distribution-based method46 to automatically determine the number of clusters. An important remark at this point. When clustering stocks, it is possible to rely on what de Nard38 calls external information38, for example industry classifications like: The Standard Industrial Classification The MSCI Global Industry Classification Standard Nevertheless, such an external information may fail to create (a valid number of) homogeneous groups38. In the words of de Nard38: For example, if we cluster the covariance matrix into groups of financial and nonfinancial firms, arguably, there will be some nonfinancial firm(s) that has (have) some stock characteristics more similar to the financial stocks as to the non-financial stocks and is (are) therefore misclassified. Especially in large dimension one would expect a few of such misclassifications in both directions. To overcome this misclassification problem and to really create homogeneous groups, [a data-driven procedure should be used instead]. Results of de Nard38 and Jin et al.21 are the following: Within a universe of 500 stocks over the period January 1998 to December 2017, de Nard38 finds that the number of clusters is limited to only 1 or 2 about 75% of the time. Within a universe of ~600 stocks over the period January 2000 to December 2022, Jin et al.21 finds that the number of clusters is relatively stable around 19, even though it tends to drop during financial hardships of the United States21 down to a minimum of 8, as displayed in Figure 7 adapted from Jin et al.21. Figure 7. 
Evolution of the number of clusters found by a Marchenko-Pastur distribution-based method, January 2000 - December 2022. Source: Jin et al. These diverging results on closely related data sets show that different methods to determine the number of clusters to use in clustering analysis might be completely at odds with one another47. But these results also show that the “optimal” number of clusters - whatever it is - is not constant over time and that methods that dynamically determine [it] can capture changes in market dynamics, especially when there is significant downside risks in the market21. Incidentally, this is another argument in favor of not relying on (slowly evolving) external information to determine the number of clusters to use. Identification of risk-on/risk-off assets in a U.S.-centric universe of assets In April 2025, “risk on/risk off” has been making the headlines as a phrase describing investment and asset price behavior48. Lee48 summarizes the underlying meaning as follows48: […] risk on/risk off generally refers to an investment environment in which asset price behavior is largely driven by how risk appetite advances or retreats over time, usually in a synchronized way at a faster than normal pace across global regions and assets. And describes it in more details as follows48: Depending on the environment, investors tend to buy or sell risky assets across the board, paying less attention to the unique characteristics of these assets. Volatilities and, most noticeably, correlations of assets that are perceived as risky will jump, particularly during risk-off periods, inspiring comments such as “correlations go to one” during a crisis. Assets, such as U.S. Treasury bonds, and currencies, such as the Japanese yen, tend to move in the opposite direction of risky assets and are generally perceived as the safer assets to hold in the event of a flight to safety. 
In this sub-section, I propose to study the capability of correlation-based spectral clustering to identify risk-on/risk-off assets in a U.S.-centric universe of assets. For this: I will use as a universe of assets 13 ETFs representative49 of misc. asset classes: U.S. stocks (SPY ETF) European stocks (EZU ETF) Japanese stocks (EWJ ETF) Emerging markets stocks (EEM ETF) U.S. REITs (VNQ ETF) International REITs (RWX ETF) Cash (SHY ETF) U.S. 7-10 year Treasuries (IEF ETF) U.S. 20+ year Treasuries (TLT ETF) U.S. Investment Grade Corporate Bonds (LQD ETF) U.S. High Yield Corporate Bonds (HYG ETF) Commodities (DBC ETF) Gold (GLD ETF) I will compute the correlation matrix of the daily returns of these assets over the period 1st April 2025 - 30th April 202550 Figure 8 displays that correlation matrix using a planar t-SNE representation of its rows, considered as points in $\mathbb{R}^{13}$. Figure 8. t-SNE representation of the asset correlation matrix, April 2025. To be noted that the t-SNE representation of Figure 8 is a bit misleading in the context of spectral clustering, because spectral clustering does not operate in the t-SNE plane. Still, it is a helpful representation to give a sense of asset “closeness” in terms of correlations. I will cluster these assets using the symmetric SPONGE spectral clustering method with $k = 2$ and $k = 3$ clusters, the rationale being that: All the assets clustered together with Cash should correspond to risk-off assets ($k = 2,3$) All the assets clustered together with U.S. stocks should correspond to risk-on assets ($k = 2,3$) All the assets clustered in the remaining cluster should correspond to another category to be determined ($k = 3$ only) Figures 9 and 10 show the resulting clusterings. Figure 9. Symmetric SPONGE clustering of the asset correlation matrix, two clusters, April 2025. Figure 10. Symmetric SPONGE clustering of the asset correlation matrix, three clusters, April 2025.
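As a concrete, if highly simplified, illustration of how clustering a correlation matrix can separate risk-on from risk-off assets, here is a minimal two-cluster spectral partition in Python. This is a sketch only: it uses a plain unnormalized Laplacian on a correlation-derived similarity rather than the symmetric SPONGE method used above, and the 4-asset correlation matrix is invented for illustration.

```python
import numpy as np

def two_way_spectral_cut(similarity):
    """Minimal 2-cluster spectral partition: build the unnormalized
    Laplacian L = D - S and split assets by the sign of the eigenvector
    associated with the second-smallest eigenvalue (the Fiedler vector)."""
    s = np.asarray(similarity, dtype=float)
    laplacian = np.diag(s.sum(axis=1)) - s
    eigenvalues, eigenvectors = np.linalg.eigh(laplacian)
    fiedler = eigenvectors[:, 1]  # eigh sorts eigenvalues in ascending order
    return (fiedler > 0).astype(int)

# Toy correlation matrix: two "risk-on" assets highly correlated with each
# other, two "risk-off" assets likewise, negative correlation across groups
corr = np.array([
    [ 1.0,  0.8, -0.3, -0.4],
    [ 0.8,  1.0, -0.2, -0.3],
    [-0.3, -0.2,  1.0,  0.7],
    [-0.4, -0.3,  0.7,  1.0],
])

# Vanilla spectral clustering needs a non-negative similarity;
# (1 + corr) / 2 maps correlations from [-1, 1] to [0, 1]
similarity = (1.0 + corr) / 2.0
labels = two_way_spectral_cut(similarity)
# The first two assets should fall in one cluster, the last two in the other
```

Signed methods such as SPONGE avoid this ad hoc mapping of negative correlations to small positive similarities, which is precisely why they are better suited to correlation matrices.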
From these figures, and over the considered period: Intermediate-term U.S. Treasuries is found to be a risk-off asset when $k = 2$ Gold is found to be a risk-off asset when $k = 3$, with Intermediate-term and Long-term U.S. Treasuries now moved into their own cluster The evolution of the classification of Gold as a risk-off asset is particularly interesting51 and also serves to illustrate one of the limitations of the t-SNE representation of the asset correlation matrix. Indeed, from that representation, it makes no sense that Cash and Gold could belong to the same cluster since they are complete opposites in the t-SNE plane! But in reality, if the t-SNE plane is diagonally “folded”, Cash and Gold truly are close neighbors… Conclusion This (abruptly?) concludes this already too long overview of correlation-based spectral clustering. Waiting for the next blog post on correlation-based clustering, feel free to connect with me on LinkedIn or to follow me on Twitter. – See von Luxburg, U. A tutorial on spectral clustering. Stat Comput 17, 395–416 (2007). &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 &amp;#8617;7 &amp;#8617;8 &amp;#8617;9 &amp;#8617;10 &amp;#8617;11 &amp;#8617;12 See R. N. Mantegna, H. E. Stanley, Introduction to econophysics: correlations and complexity in finance, Cambridge university press, 1999. &amp;#8617; See Mantegna, R.N. Hierarchical structure in financial markets. Eur. Phys. J. B 11, 193–197 (1999). &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 See Brownlees, C., Guðmundsson, G. S., &amp;amp; Lugosi, G. (2020). Community Detection in Partial Correlation Network Models. Journal of Business &amp;amp; Economic Statistics, 40(1), 216–226. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 &amp;#8617;7 &amp;#8617;8 &amp;#8617;9 &amp;#8617;10 See Mihai Cucuringu, Peter Davies, Aldo Glielmo, and Hemant Tyagi. SPONGE: A generalized eigenproblem for clustering signed networks. 
In Artificial Intelligence and Statistics, volume 89 of Proceedings of Machine Learning Research, pages 1088–1098. PMLR, 2019. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 &amp;#8617;7 &amp;#8617;8 &amp;#8617;9 $x_1, …, x_n$ can be more general “objects”, as long as a distance between these objects is defined. &amp;#8617; As opposed to their global relationships that are already available from the similarity matrix $S$. &amp;#8617; See Jin Huang, Feiping Nie, and Heng Huang. 2013. Spectral rotation versus K-means in spectral clustering. In Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence (AAAI’13). AAAI Press, 431–437. &amp;#8617; The similarity function $s$ is also very important in spectral clustering, although ultimately, the choice of the similarity function depends on the domain the data comes from, and no general advice can be given1. &amp;#8617; See Andrew Y. Ng, Michael I. Jordan, and Yair Weiss. 2001. On spectral clustering: analysis and an algorithm. In Proceedings of the 15th International Conference on Neural Information Processing Systems: Natural and Synthetic (NIPS’01). MIT Press, Cambridge, MA, USA, 849–856. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 Thanks to the properties of the Laplacian matrix $L$. &amp;#8617; Typically the Pearson correlation, but other measures like the Spearman correlation or the Gerber correlation can also be used as long as the resulting correlation matrix $C$ is a valid correlation matrix. &amp;#8617; It is actually possible to define a whole family of metrics as functions of the Pearson or Spearman correlation coefficient, cf. van Dongen and Enright52. &amp;#8617; Such correlation-based metrics are also used elsewhere in finance, for example in de Prado’s53 Hierarchical Risk Parity portfolio optimization algorithm.
&amp;#8617; To be noted that depending on the exact spectral clustering method used, it might be required to first convert that distance to a similarity measure. &amp;#8617; Called the Generalised Stochastic Block Model in Brownlees et al.4, which is an extension of the vanilla stochastic block model. &amp;#8617; Or of the sample correlation matrix, since a correlation matrix is a covariance matrix of a special kind. &amp;#8617; Both the simulation study and the empirical application in Brownlees et al.4 show that the Blockbuster algorithm already performs satisfactorily4 with $n = 50$, depending on the strength of the correlations; “sufficiently” large is thus not necessarily “very” large, that is, the Blockbuster algorithm seems to be well-behaved in finite sample. &amp;#8617; The definition of “appropriately controlled” is left for future research in Brownlees et al.4. &amp;#8617; In particular, this formula explains why the $k$ “largest” eigenvectors of the matrix $\Sigma$ are extracted by the Blockbuster algorithm - it is because they correspond to the $k$ “smallest” eigenvectors of the matrix $\Sigma^{-1}$ (and of the matrix $L$). &amp;#8617; See Qi Jin, Mihai Cucuringu, and Álvaro Cartea. 2023. Correlation Matrix Clustering for Statistical Arbitrage Portfolios. In Proceedings of the Fourth ACM International Conference on AI in Finance (ICAIF ‘23). Association for Computing Machinery, New York, NY, USA, 557–564. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 &amp;#8617;7 &amp;#8617;8 &amp;#8617;9 &amp;#8617;10 &amp;#8617;11 &amp;#8617;12 A Python implementation of the SPONGE and symmetric SPONGE algorithms is available at https://github.com/alan-turing-institute/signet. &amp;#8617; These two regularization parameters aim to promote clusterings that avoid small-sized clusters21. &amp;#8617; Under a signed stochastic block model5, which is an extension of the vanilla stochastic block model.
&amp;#8617; &amp;#8617;2 In numerical experiments, Cucuringu et al.5 uses $\tau^+$ = $\tau^-$ = 1, which nearly always falls within the region of maximum recovery when it is present5. &amp;#8617; In numerical experiments, Cucuringu et al.5 uses $n$ ranging from 50 to 11259. &amp;#8617; See Mihai Cucuringu, Apoorv Vikram Singh, Deborah Sulem, and Hemant Tyagi. 2021. Regularized spectral methods for clustering signed networks. Journal of Machine Learning Research 22, 264 (2021), 1–79. &amp;#8617; &amp;#8617;2 The symmetric SPONGE algorithm uses symmetric positive and negative Laplacian matrices $L^+_{sym} = \left( D^+ \right)^{-1/2} L^+ \left( D^+ \right)^{-1/2}$ and $L^-_{sym} = \left( D^- \right)^{-1/2} L^- \left( D^- \right)^{-1/2}$ instead of unnormalized ones and the $k$ smallest eigenvalues of the matrix $\left( L^-_{sym} + \tau^+ I_n \right)^{-1/2} \left( L^+_{sym} + \tau^- I_n \right) \left( L^-_{sym} + \tau^+ I_n \right)^{-1/2}$, c.f. Cucuringu et al.5. &amp;#8617; See Song Wang. Karl Rohe. “Discussion of “Coauthorship and citation networks for statisticians”.” Ann. Appl. Stat. 10 (4) 1820 - 1826, December 2016. &amp;#8617; &amp;#8617;2 &amp;#8617;3 See Peter J. Rousseeuw (1987). Silhouettes: a Graphical Aid to the Interpretation and Validation of Cluster Analysis. Computational and Applied Mathematics. 20: 53–65. &amp;#8617; See Tibshirani, R., Walther, G., and Hastie, T. Estimating the number of clusters in a data set via the gap statistic. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 63, 2 (2001), 411–423. &amp;#8617; See Erich Schubert. 2023. Stop using the elbow criterion for k-means and how to choose the number of clusters instead. SIGKDD Explor. Newsl. 25, 1 (June 2023), 36–42. &amp;#8617; For example, de Nard38 finds that for the estimation problem of asset return covariance matrices, the eigengap heuristic often cannot identify any cluster and sets $k=1$38. &amp;#8617; See Zelnik-Manor, L. and P. Perona (2004). 
Self-tuning spectral clustering. In Advances in neural information processing systems, pp. 1601–1608. &amp;#8617; Under suitable technical assumptions. &amp;#8617; See Laurent Laloux, Pierre Cizeau, Jean-Philippe Bouchaud, and Marc Potters, Noise Dressing of Financial Correlation Matrices, Phys. Rev. Lett. 83, 1467. &amp;#8617; &amp;#8617;2 The returns of. &amp;#8617; See Gianluca de Nard, Oops! I Shrunk the Sample Covariance Matrix Again: Blockbuster Meets Shrinkage, Journal of Financial Econometrics, Volume 20, Issue 4, Fall 2022, Pages 569–611. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 &amp;#8617;7 &amp;#8617;8 &amp;#8617;9 &amp;#8617;10 &amp;#8617;11 &amp;#8617;12 &amp;#8617;13 &amp;#8617;14 &amp;#8617;15 &amp;#8617;16 &amp;#8617;17 &amp;#8617;18 See Hedwig Hofstetter, Elise Dusseldorp, Pepijn van Empelen, Theo W.G.M. Paulussen, A primer on the use of cluster analysis or factor analysis to assess co-occurrence of risk behaviors, Preventive Medicine, Volume 67, 2014, Pages 141-146. &amp;#8617; See Ahn, S., and A. Horenstein. 2013. Eigenvalue Ratio Test for the Number of Factors. Econometrica 80: 1203–1227. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 &amp;#8617;7 &amp;#8617;8 The eigenvalue ratio estimator and the growth ratio estimator of Ahn and Horenstein40 both rely on the covariance matrix, of which the correlation matrix is a special kind. In addition, for reasons detailed in Fan et al.42, using the covariance matrix to determine the number of factors to retain in a factor analysis is generally a bad idea. &amp;#8617; See Fan, J., Guo, J., &amp;amp; Zheng, S. (2020). Estimating Number of Factors by Adjusted Eigenvalues Thresholding. Journal of the American Statistical Association, 117(538), 852–861. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 See Horn, J. L. (1965). A rationale and test for the number of factors in factor analysis. Psychometrika, 30(2), 179–185.
&amp;#8617; See Glorfeld, L. W. (1995). An Improvement on Horn’s Parallel Analysis Methodology for Selecting the Correct Number of Factors to Retain. Educational and Psychological Measurement, 55(3), 377-393. &amp;#8617; Due to the context of the paper, de Nard38 uses covariance-based spectral clustering. &amp;#8617; Jin et al.21 also uses another method to determine the number of clusters, based on the percentage of variance explained by selecting the top $k$ eigenvalues of the asset correlation matrix. &amp;#8617; To be noted, though, that Jin et al.21’s methodology to compute the asset correlation matrix is different from de Nard38’s, especially in that Jin et al.21 consider residual asset returns and not raw asset returns. Such a difference could - and actually, should - influence the computation of the optimal number of clusters, whatever the method used. Nevertheless, my personal experience with de Nard38’s 1D KDE clustering method is that it is anyway too sensitive to its parameters (kernel and bandwidth) to be confidently used in practice. &amp;#8617; See Lee, Wai, Risk On/Risk Off, The Journal of Portfolio Management Spring 2012, 38 (3) 28-39. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 10 of these ETFs are used in the Adaptive Asset Allocation strategy from ReSolve Asset Management, described in the paper Adaptive Asset Allocation: A Primer54. &amp;#8617; (Adjusted) prices have been retrieved using Tiingo. &amp;#8617; My own interpretation is that if one would prefer to avoid investing in Intermediate-term U.S. Treasuries, Gold would then represent the “closest” risk-off asset. &amp;#8617; See Stijn van Dongen, Anton J. Enright, Metric distances derived from cosine similarity and Pearson and Spearman correlations, arXiv. &amp;#8617; Lopez de Prado, M. (2016). Building diversified portfolios that outperform out-of-sample. Journal of Portfolio Management, 42(4), 59–69.
&amp;#8617; See Butler, Adam and Philbrick, Mike and Gordillo, Rodrigo and Varadi, David, Adaptive Asset Allocation: A Primer. &amp;#8617;</summary></entry><entry><title type="html">Volatility Forecasting: HExp Model</title><link href="https://portfoliooptimizer.io/blog/volatility-forecasting-hexp-model/" rel="alternate" type="text/html" title="Volatility Forecasting: HExp Model" /><published>2025-03-09T00:00:00-06:00</published><updated>2025-03-09T00:00:00-06:00</updated><id>https://portfoliooptimizer.io/blog/volatility-forecasting-hexp-model</id><content type="html" xml:base="https://portfoliooptimizer.io/blog/volatility-forecasting-hexp-model/">&lt;p&gt;In this series on volatility forecasting, I &lt;a href=&quot;/blog/volatility-forecasting-har-model/&quot;&gt;previously&lt;/a&gt; detailed the &lt;em&gt;Heterogeneous AutoRegressive (HAR)&lt;/em&gt; volatility forecasting model that &lt;em&gt;has become the workhorse of the volatility forecasting literature&lt;/em&gt;&lt;sup id=&quot;fnref:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; since its introduction by Corsi&lt;sup id=&quot;fnref:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;I will now describe an extension of that model due to Bollerslev et al.&lt;sup id=&quot;fnref:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;, called the &lt;em&gt;Heterogeneous Exponential (HExp)&lt;/em&gt; volatility forecasting model, in which the lagged HAR volatility components are exponentially - rather than arithmetically - averaged.&lt;/p&gt;

&lt;p&gt;In addition, I will also discuss the panel-based estimation procedure for the HExp and the HAR model parameters proposed in Bollerslev et al.&lt;sup id=&quot;fnref:2:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;, which is empirically demonstrated&lt;sup id=&quot;fnref:2:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; to improve the out-of-sample forecasting performance of these two volatility forecasting models when compared to the standard&lt;sup id=&quot;fnref:3:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; individual asset-based procedure.&lt;/p&gt;

&lt;p&gt;Finally, I will illustrate the practical performance of the HExp volatility forecasting model and its panel-based parameter estimation procedure in the context of monthly volatility forecasting for various ETFs.&lt;/p&gt;

&lt;h2 id=&quot;mathematical-preliminaries-reminders&quot;&gt;Mathematical preliminaries (reminders)&lt;/h2&gt;

&lt;p&gt;This section contains reminders from the &lt;a href=&quot;/blog/volatility-forecasting-simple-and-exponentially-weighted-moving-average-models/&quot;&gt;first blog post&lt;/a&gt; of this series.&lt;/p&gt;

&lt;h3 id=&quot;volatility-modelling-and-volatility-proxies&quot;&gt;Volatility modelling and volatility proxies&lt;/h3&gt;

&lt;p&gt;Let $r_t$ be the &lt;a href=&quot;https://en.wikipedia.org/wiki/Rate_of_return#Logarithmic_or_continuously_compounded_return&quot;&gt;logarithmic&lt;/a&gt; return of an asset over a time period $t$ (a day, a week, a month…), over which its (conditional) mean return is assumed to be zero.&lt;/p&gt;

&lt;p&gt;Then:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;The asset (conditional) variance is defined as $ \sigma_t^2 = \mathbb{E} \left[ r_t^2 \right] $&lt;/p&gt;

    &lt;p&gt;From this definition, the squared return $r_t^2$ of an asset is a (noisy&lt;sup id=&quot;fnref:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt;) &lt;em&gt;variance estimator&lt;/em&gt; - or &lt;em&gt;variance proxy&lt;/em&gt;&lt;sup id=&quot;fnref:6:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt; - for that asset variance over the considered time period.&lt;/p&gt;

    &lt;p&gt;Another example of an asset variance proxy is &lt;a href=&quot;/blog/range-based-volatility-estimators-overview-and-examples-of-usage/&quot;&gt;the Parkinson range&lt;/a&gt; of an asset.&lt;/p&gt;

&lt;p&gt;Yet another example of an asset variance proxy, this time over a specific time period $t$ of one day, is the &lt;em&gt;daily realized variance&lt;/em&gt; $RV_t$, which is defined as the sum of the asset squared intraday returns sampled at a high frequency (1 minute, 5 minutes, 15 minutes…).&lt;/p&gt;

    &lt;p&gt;The generic notation for an asset variance proxy in this blog post is $\tilde{\sigma}_t^2$.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The asset (conditional) volatility is defined as $ \sigma_t = \sqrt { \sigma_t^2 } $&lt;/p&gt;

    &lt;p&gt;The generic notation for an asset volatility proxy in this blog post is $\tilde{\sigma}_t$.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;weighted-moving-average-volatility-forecasting-model&quot;&gt;Weighted moving average volatility forecasting model&lt;/h3&gt;

&lt;p&gt;Boudoukh et al.&lt;sup id=&quot;fnref:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt; shows that many seemingly different methods of volatility forecasting actually share the same underlying representation of the estimate of an asset next period’s 
variance $\hat{\sigma}_{T+1}^2$ as a weighted moving average of that asset past periods’ variance proxies $\tilde{\sigma}^2_t$, $t=1..T$, with&lt;/p&gt;

\[\hat{\sigma}_{T+1}^2 =  w_0 + \sum_{i=1}^{k} w_i \tilde{\sigma}^2_{T+1-i}\]

&lt;p&gt;, where:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$k$, with $1 \leq k \leq T$, is the size of the moving average, possibly time-dependent&lt;/li&gt;
  &lt;li&gt;$w_i, i=0..k$ are the weights of the moving average, possibly time-dependent as well&lt;/li&gt;
&lt;/ul&gt;
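This common representation can be sketched in a few lines of Python (a minimal illustration, with variable names of my own choosing); the simple moving average and the RiskMetrics-style EWMA are two classic special cases of the weights $w_i$:

```python
import numpy as np

def wma_variance_forecast(variance_proxies, weights, w0=0.0):
    """Weighted moving average forecast of next period's variance:
    sigma2_hat = w0 + sum over i of w_i * proxy_{T+1-i}."""
    proxies = np.asarray(variance_proxies, dtype=float)
    w = np.asarray(weights, dtype=float)
    k = len(w)
    # proxy_{T+1-i}, i=1..k, i.e. the last k proxies in reverse time order
    return w0 + float(w @ proxies[-k:][::-1])

proxies = [0.01, 0.02, 0.015, 0.03]  # made-up daily variance proxies

# Simple moving average over the last k = 3 proxies: w_i = 1/3
sma = wma_variance_forecast(proxies, [1/3, 1/3, 1/3])

# RiskMetrics-style EWMA with decay 0.94 over all available proxies
lam, k = 0.94, len(proxies)
w = (1 - lam) * lam ** np.arange(k)
ewma = wma_variance_forecast(proxies, w / w.sum())
```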

&lt;h3 id=&quot;the-original-har-volatility-forecasting-model&quot;&gt;The original HAR volatility forecasting model&lt;/h3&gt;

&lt;p&gt;&lt;a href=&quot;/blog/volatility-forecasting-har-model/&quot;&gt;The HAR volatility forecasting model&lt;/a&gt; is &lt;em&gt;an additive cascade model of different volatility components&lt;/em&gt;&lt;sup id=&quot;fnref:3:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; subject to &lt;em&gt;economically meaningful restrictions&lt;/em&gt;&lt;sup id=&quot;fnref:3:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Under that model, an asset next day’s daily realized variance $RV_{T+1}$ is forecasted through the formula&lt;sup id=&quot;fnref:6:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

\[\hat{RV}_{T+1} = \beta + \beta_d RV_{T} + \beta_w RV_{T}^w + \beta_m RV_{T}^m\]

&lt;p&gt;, where:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$\hat{RV}_{T+1}$ is the forecast at time $T$ of the asset next day’s daily realized variance $RV_{T+1}$&lt;/li&gt;
  &lt;li&gt;$RV_T$ is the asset daily realized variance at time $T$&lt;/li&gt;
  &lt;li&gt;$RV_T^w = \frac{1}{5} \sum_{i=1}^5 RV_{T-i+1}$ is the asset weekly realized variance at time $T$&lt;/li&gt;
  &lt;li&gt;$RV_T^m = \frac{1}{22} \sum_{i=1}^{22} RV_{T-i+1}$ is the asset monthly realized variance at time $T$&lt;/li&gt;
  &lt;li&gt;$\beta$, $\beta_d$, $\beta_w$ and $\beta_m$ are the HAR model parameters, to be determined&lt;/li&gt;
&lt;/ul&gt;
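As a minimal sketch of the formula above (the $\beta$ parameters would in practice be estimated by OLS; the values used here are placeholders):

```python
import numpy as np

def har_forecast(rv, beta, beta_d, beta_w, beta_m):
    """One-step-ahead HAR forecast of the daily realized variance.
    rv: array of daily realized variances up to time T (rv[-1] = RV_T)."""
    rv = np.asarray(rv, dtype=float)
    rv_d = rv[-1]           # daily component RV_T
    rv_w = rv[-5:].mean()   # weekly component, 5-day average
    rv_m = rv[-22:].mean()  # monthly component, 22-day average
    return beta + beta_d * rv_d + beta_w * rv_w + beta_m * rv_m

# Illustrative parameter values on a made-up realized variance series
rv_series = np.full(30, 0.01)
forecast = har_forecast(rv_series, 0.0, 0.4, 0.3, 0.3)
```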

&lt;h2 id=&quot;the-hexp-volatility-forecasting-model&quot;&gt;The HExp volatility forecasting model&lt;/h2&gt;

&lt;h3 id=&quot;discontinuity-of-the-har-volatility-forecasting-model&quot;&gt;Discontinuity of the HAR volatility forecasting model&lt;/h3&gt;

&lt;p&gt;Under the HAR volatility forecasting model, &lt;em&gt;forecasted future volatilities depends on the past volatilities in a way that is [dis]continuous […] in the lag lengths&lt;/em&gt;&lt;sup id=&quot;fnref:2:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; due to the presence of &lt;a href=&quot;/blog/volatility-forecasting-simple-and-exponentially-weighted-moving-average-models/&quot;&gt;simple moving averages&lt;/a&gt;, 
which might lead to &lt;em&gt;potential variance estimation issues&lt;/em&gt;&lt;sup id=&quot;fnref:2:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;As noted by Bollerslev et al.&lt;sup id=&quot;fnref:2:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;The stepwise nature of the volatility factors employed in the HAR models, imply that the forecasts from the models are subject to potentially abrupt changes as an unusually large/small daily lagged [realized variance] drops out of the sums for the longer-horizon lagged volatility factors.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Figure 1, adapted from Bollerslev et al.&lt;sup id=&quot;fnref:2:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;, illustrates &lt;em&gt;the lag coefficients implied by the regression coefficients&lt;/em&gt;&lt;sup id=&quot;fnref:2:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; of the HAR model together with those of a 21-day simple moving average model.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/volatility-forecasting-har-weights-bollerslev.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/volatility-forecasting-har-weights-bollerslev-small.png&quot; alt=&quot;Lag coefficients of the HAR and of the 21-day simple moving average volatility forecasting models. Source: Bollerslev et al.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 1. Lag coefficients of the HAR and of the 21-day simple moving average volatility forecasting models. Source: Bollerslev et al.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The discontinuity of the HAR model at the 1-day, 5-day and 21-day lags&lt;sup id=&quot;fnref:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt; is apparent, similar in spirit to the discontinuity of the 21-day simple moving average model at the 21-day lag.&lt;/p&gt;
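The stepwise lag coefficients behind Figure 1 can be reproduced with a few lines of Python; the $\beta$ values below are illustrative only, and the 22-day monthly window follows the HAR definition recalled above:

```python
# Implied lag coefficients of the HAR model: each lagged daily RV enters
# through the daily, weekly (5-day) and monthly (22-day) averages, so its
# total weight steps down abruptly after lags 1, 5 and 22.
beta_d, beta_w, beta_m = 0.4, 0.3, 0.25  # illustrative values

def har_lag_weight(i):
    """Weight of RV_{T+1-i} in the HAR forecast of RV_{T+1}."""
    return (beta_d * (i == 1)
            + beta_w / 5 * (i in range(1, 6))
            + beta_m / 22 * (i in range(1, 23)))

weights = [har_lag_weight(i) for i in range(1, 26)]
# weights drops to zero beyond the 22-day lag
```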

&lt;h3 id=&quot;the-original-hexp-volatility-forecasting-model&quot;&gt;The original HExp volatility forecasting model&lt;/h3&gt;

&lt;p&gt;In order to &lt;em&gt;avoid the stepwise changes inherent in the forecast from the HAR component-type structure&lt;/em&gt;&lt;sup id=&quot;fnref:2:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;, Bollerslev et al.&lt;sup id=&quot;fnref:2:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; proposes to replace the simple moving averages appearing in the HAR model by &lt;a href=&quot;/blog/volatility-forecasting-simple-and-exponentially-weighted-moving-average-models/&quot;&gt;exponentially weighted moving averages&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Under the resulting volatility forecasting model - denoted &lt;em&gt;the Heterogeneous Exponential realized volatility model (HExp for short)&lt;/em&gt;&lt;sup id=&quot;fnref:2:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; - an asset next day’s daily realized variance $RV_{T+1}$ is forecasted through the formula&lt;sup id=&quot;fnref:2:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;&lt;sup id=&quot;fnref:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;7&lt;/a&gt;&lt;/sup&gt;&lt;/p&gt;

\[\hat{RV}_{T+1} = \beta + \beta_d ExpVP_T^{\lambda(1)} + \beta_w ExpVP_T^{\lambda(5)} + \beta_m ExpVP_T^{\lambda(25)} + \beta_h ExpVP_T^{\lambda(125)}\]

&lt;p&gt;, where:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$\hat{RV}_{T+1}$ is the forecast at time $T$ of the asset next day’s daily realized variance $RV_{T+1}$&lt;/li&gt;
  &lt;li&gt;$ExpVP_T^{\lambda(CoM)}$ $=$ $\sum_{i=1}^T \frac{e^{-i \lambda(CoM)}}{\sum_{j=1}^T e^{-j \lambda(CoM)}} RV_{T+1-i} $&lt;/li&gt;
  &lt;li&gt;$\lambda \left(CoM\right)$ $=$ $\log \left( 1 + \frac{1}{CoM} \right)$, with &lt;em&gt;CoM&lt;/em&gt; standing for &lt;em&gt;center-of-mass&lt;/em&gt;&lt;sup id=&quot;fnref:2:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
  &lt;li&gt;$RV_i$ is the asset daily realized variance at time $i$, $i=1..T$&lt;/li&gt;
  &lt;li&gt;$\beta$, $\beta_d$, $\beta_w$, $\beta_m$ and $\beta_h$ are the HExp model parameters, to be determined&lt;/li&gt;
&lt;/ul&gt;
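A minimal sketch of the two formulas above (variable names mine, $\beta$ values illustrative; as for the HAR model, they would in practice be estimated by regression):

```python
import numpy as np

def exp_vp(rv, com):
    """Exponentially weighted variance proxy ExpVP_T^{lambda(CoM)}, with
    lambda(CoM) = log(1 + 1/CoM) and weights normalized over i = 1..T."""
    rv = np.asarray(rv, dtype=float)
    lam = np.log(1.0 + 1.0 / com)
    i = np.arange(1, len(rv) + 1)
    w = np.exp(-lam * i)
    w /= w.sum()
    # the weight e^{-i * lambda} applies to RV_{T+1-i}: reverse time order
    return float(w @ rv[::-1])

def hexp_forecast(rv, beta, beta_d, beta_w, beta_m, beta_h):
    """One-step-ahead HExp forecast, using the centers-of-mass
    1, 5, 25 and 125 of Bollerslev et al."""
    return (beta
            + beta_d * exp_vp(rv, 1)
            + beta_w * exp_vp(rv, 5)
            + beta_m * exp_vp(rv, 25)
            + beta_h * exp_vp(rv, 125))

# Illustrative usage on a made-up realized variance series
rv_series = np.full(300, 0.01)
forecast = hexp_forecast(rv_series, 0.0, 0.4, 0.3, 0.2, 0.1)
```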

&lt;p&gt;To be noted that each center-of-mass used in the HExp model (1, 5, 25 and 125) &lt;em&gt;effectively summarizes the “average” horizon of the lagged realized volatilities that it uses&lt;/em&gt;&lt;sup id=&quot;fnref:2:13&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; and that they all have been chosen in Bollerslev et al.&lt;sup id=&quot;fnref:2:14&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; so as to &lt;em&gt;“span” the universe of past [realized variance]’s in a way that is both parsimonious and “smooth”&lt;/em&gt;&lt;sup id=&quot;fnref:2:15&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Speaking of smoothness, Figure 2, again adapted from Bollerslev et al.&lt;sup id=&quot;fnref:2:16&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;, compares &lt;em&gt;the lag coefficients implied by the regression coefficients&lt;/em&gt;&lt;sup id=&quot;fnref:2:17&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; of the HAR model with those of the HExp model.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/volatility-forecasting-hexp-weights-bollerslev.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/volatility-forecasting-hexp-weights-bollerslev-small.png&quot; alt=&quot;Lag coefficients of the HAR and of the HExp volatility forecasting models. Source: Bollerslev et al.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 2. Lag coefficients of the HAR and of the HExp volatility forecasting models. Source: Bollerslev et al.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;The continuous nature of the HExp volatility forecasting model is clearly visible.&lt;/p&gt;

&lt;p&gt;In terms of practical performance, the HExp model &lt;em&gt;perform[s] well in out-of-sample risk forecasting&lt;/em&gt;&lt;sup id=&quot;fnref:2:18&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; and even slightly outperforms the HAR model in terms of out-of-sample $R$-squared, as can be seen in Figure 3, adapted from Bollerslev et al.&lt;sup id=&quot;fnref:2:19&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/volatility-forecasting-hexp-oos-performances-bollerslev.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/volatility-forecasting-hexp-oos-performances-bollerslev-small.png&quot; alt=&quot;Out-of-sample $R$-squared of the HAR model vs. the HExp model for predicting the 20-day future realized volatility of several assets and for different methods of parameter estimation. Source: Bollerslev et al.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 3. Out-of-sample $R$-squared of the HAR model vs. the HExp model for predicting the 20-day future realized volatility of several assets and for different methods of parameter estimation (&lt;i&gt;Ind&lt;/i&gt;, &lt;i&gt;Panel&lt;/i&gt;, &lt;i&gt;Mega&lt;/i&gt;, that will be discussed in the next section). Source: Bollerslev et al.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;h3 id=&quot;realized-variance-vs-generic-variance-proxy&quot;&gt;Realized variance vs. generic variance proxy&lt;/h3&gt;

&lt;p&gt;The original HExp model described in the previous subsection relies, for its definition, on a very specific asset variance proxy - the asset’s realized variance - computed over a very specific time period - a day.&lt;/p&gt;

&lt;p&gt;Similarly to &lt;a href=&quot;/blog/volatility-forecasting-har-model/&quot;&gt;the HAR model&lt;/a&gt;, it is possible to replace the daily realized variance by any generic daily variance estimator like daily squared returns&lt;sup id=&quot;fnref:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:10&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt; or any daily &lt;a href=&quot;/blog/range-based-volatility-estimators-overview-and-examples-of-usage/&quot;&gt;range-based variance estimator&lt;/a&gt; (Parkinson, Garman-Klass, Rogers-Satchell…).&lt;/p&gt;

&lt;p&gt;This leads to the generic HExp volatility forecasting model, under which an asset’s next day’s conditional variance $\sigma_{T+1}^2$ is forecasted through the formula&lt;/p&gt;

\[\hat{\sigma}_{T+1}^2 = \beta + \beta_d ExpVP_T^{\lambda(1)} + \beta_w ExpVP_T^{\lambda(5)} + \beta_m ExpVP_T^{\lambda(25)} + \beta_h ExpVP_T^{\lambda(125)}\]

&lt;p&gt;, where:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$\hat{\sigma}_{T+1}^2$ is the forecast at time $T$ of the asset next day’s conditional variance $\sigma_{T+1}^2$&lt;/li&gt;
  &lt;li&gt;$ExpVP_T^{\lambda(CoM)} = \sum_{i=1}^T \frac{e^{-i \lambda(CoM)}}{\sum_{j=1}^T e^{-j \lambda(CoM)}} \tilde{\sigma}^2_{T+1-i} $&lt;/li&gt;
  &lt;li&gt;$\lambda \left(CoM\right) = \log \left( 1 + \frac{1}{CoM} \right) $&lt;/li&gt;
  &lt;li&gt;$\tilde{\sigma}^2_{i}$ is the asset daily variance estimator at time $i$, $i=1..T$&lt;/li&gt;
  &lt;li&gt;$\beta$, $\beta_d$, $\beta_w$, $\beta_m$ and $\beta_h$ are the HExp model parameters, to be determined&lt;/li&gt;
&lt;/ul&gt;
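
&lt;p&gt;The generic forecasting formula above can be sketched directly in Python (hypothetical parameter values, for illustration only; in practice $\beta$, $\beta_d$, $\beta_w$, $\beta_m$ and $\beta_h$ are estimated as described later in this post):&lt;/p&gt;

```python
import numpy as np

def hexp_forecast(var_proxy, betas, coms=(1, 5, 25, 125)):
    """One-day-ahead HExp conditional variance forecast.

    var_proxy - array of daily variance estimates, var_proxy[-1] at time T
    betas     - (beta, beta_d, beta_w, beta_m, beta_h), to be estimated
    coms      - the centers-of-mass, 1/5/25/125 in Bollerslev et al.
    """
    T = var_proxy.shape[0]
    forecast = betas[0]  # the intercept beta
    for b, com in zip(betas[1:], coms):
        lam = np.log(1.0 + 1.0 / com)
        w = np.exp(-lam * np.arange(1, T + 1))
        forecast += b * (w @ var_proxy[::-1]) / w.sum()  # b * ExpVP_T
    return forecast

# hypothetical parameter values, for illustration only
betas = (0.0, 0.3, 0.3, 0.25, 0.15)
rv = np.full(250, 0.0002)
print(hexp_forecast(rv, betas))
```

&lt;p&gt;With a constant variance proxy and slope parameters summing to one, the forecast reduces to that constant, which makes for a quick sanity check.&lt;/p&gt;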

&lt;h3 id=&quot;relationship-with-the-generic-weighted-moving-average-model&quot;&gt;Relationship with the generic weighted moving average model&lt;/h3&gt;

&lt;p&gt;From its definition, it is not too difficult to see that the HExp volatility forecasting model is a specific kind of weighted moving average volatility forecasting model, with:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$w_0 = \beta$&lt;/li&gt;
  &lt;li&gt;$w_1 = \beta_d \frac{e^{- \lambda(1)}}{\sum_{j=1}^T e^{-j \lambda(1)}}$ $+$ $\beta_w \frac{e^{- \lambda(5)}}{\sum_{j=1}^T e^{-j \lambda(5)}}$ $+$ $\beta_m \frac{e^{- \lambda(25)}}{\sum_{j=1}^T e^{-j \lambda(25)}}$ $+$ $\beta_h \frac{e^{- \lambda(125)}}{\sum_{j=1}^T e^{-j \lambda(125)}} $&lt;/li&gt;
  &lt;li&gt;$w_2 = …$&lt;/li&gt;
&lt;/ul&gt;
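
&lt;p&gt;This equivalence is easy to check numerically: the sketch below, with arbitrary parameter values, compares $w_1$ computed from the formula above with the coefficient that the HExp forecast effectively applies to the most recent variance proxy:&lt;/p&gt;

```python
import numpy as np

T = 500
betas = {1: 0.3, 5: 0.3, 25: 0.25, 125: 0.15}  # hypothetical beta_d..beta_h

# w_1 from the displayed formula
w1 = 0.0
for com, b in betas.items():
    lam = np.log(1.0 + 1.0 / com)
    norm = np.exp(-lam * np.arange(1, T + 1)).sum()
    w1 += b * np.exp(-lam) / norm

# coefficient effectively applied to sigma^2_T by the HExp forecast:
# bump the most recent proxy by 1 and measure the change in the forecast
def forecast(vp):
    f = 0.0
    for com, b in betas.items():
        lam = np.log(1.0 + 1.0 / com)
        w = np.exp(-lam * np.arange(1, T + 1))
        f += b * (w @ vp[::-1]) / w.sum()
    return f

vp = np.zeros(T)
bumped = vp.copy()
bumped[-1] = 1.0
print(w1, forecast(bumped) - forecast(vp))  # the two quantities coincide
```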

&lt;h3 id=&quot;volatility-forecasting-formulas&quot;&gt;Volatility forecasting formulas&lt;/h3&gt;

&lt;p&gt;Under an HExp volatility forecasting model, the generic weighted moving average volatility forecasting formula becomes:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;To estimate an asset next day’s volatility:&lt;/p&gt;

\[\hat{\sigma}_{T+1} = \sqrt{ \beta + \beta_d ExpVP_T^{\lambda(1)} + \beta_w ExpVP_T^{\lambda(5)} + \beta_m ExpVP_T^{\lambda(25)} + \beta_h ExpVP_T^{\lambda(125)} }\]
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;To estimate an asset’s volatility $h$ days ahead&lt;sup id=&quot;fnref:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt;, $h \geq 2$, using an indirect&lt;sup id=&quot;fnref:4:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; multi-step ahead forecast scheme:&lt;/p&gt;

\[\hat{\sigma}_{T+h} = \sqrt{  \beta + \beta_d ExpVP_{T+h-1}^{\lambda(1)} + \beta_w ExpVP_{T+h-1}^{\lambda(5)} + \beta_m ExpVP_{T+h-1}^{\lambda(25)} + \beta_h ExpVP_{T+h-1}^{\lambda(125)} }\]
  &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;, where:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;$ExpVP_{T+h-1}^{\lambda(CoM)}$ $=$ $\sum_{i=1}^T \frac{e^{-(i+h-1) \lambda(CoM)}}{\sum_{j=1}^{T+h-1} e^{-j \lambda(CoM)}} \tilde{\sigma}^2_{T+1-i} $ $+$ $\sum_{i=1}^{h-1} \frac{e^{-i \lambda(CoM)}}{\sum_{j=1}^{T+h-1} e^{-j \lambda(CoM)}}  \hat{\sigma}^2_{T+h-i} $&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;To estimate an asset’s aggregated volatility&lt;sup id=&quot;fnref:8:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt; over the next $h$ days:&lt;/p&gt;

\[\hat{\sigma}_{T+1:T+h} = \sqrt{ \sum_{i=1}^{h} \hat{\sigma}^2_{T+i} }\]
  &lt;/li&gt;
&lt;/ul&gt;
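
&lt;p&gt;These formulas translate into a simple iterated loop, in which each variance forecast is appended to the history before producing the next one. A minimal sketch, with hypothetical parameter values:&lt;/p&gt;

```python
import numpy as np

def hexp_multi_step(var_proxy, betas, h, coms=(1, 5, 25, 125)):
    """Iterated (indirect) HExp forecasts of the next h daily variances.

    Each forecasted variance is fed back into the history, so that the
    step-k forecast uses ExpVP computed over T + k - 1 observations.
    """
    history = list(var_proxy)  # oldest first; forecasts will be appended
    out = []
    for _ in range(h):
        x = np.asarray(history)
        n = x.shape[0]
        f = betas[0]
        for b, com in zip(betas[1:], coms):
            lam = np.log(1.0 + 1.0 / com)
            w = np.exp(-lam * np.arange(1, n + 1))
            f += b * (w @ x[::-1]) / w.sum()
        out.append(f)
        history.append(f)  # recursive substitution of the forecast
    return np.asarray(out)

betas = (0.0, 0.3, 0.3, 0.25, 0.15)
fc = hexp_multi_step(np.full(250, 0.0002), betas, h=5)
print(np.sqrt(fc.sum()))  # aggregated volatility over the next 5 days
```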

&lt;h2 id=&quot;estimating-the-hexp-model-parameters&quot;&gt;Estimating the HExp model parameters&lt;/h2&gt;

&lt;h3 id=&quot;individual-estimation&quot;&gt;Individual estimation&lt;/h3&gt;

&lt;p&gt;As for &lt;a href=&quot;/blog/volatility-forecasting-har-model/&quot;&gt;the HAR model&lt;/a&gt;, the easiest way to estimate the HExp model parameters is &lt;em&gt;by applying simple linear regression&lt;/em&gt;&lt;sup id=&quot;fnref:3:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; &lt;em&gt;on an asset-by-asset basis&lt;/em&gt;&lt;sup id=&quot;fnref:2:20&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;, in which case the asset-specific &lt;a href=&quot;https://en.wikipedia.org/wiki/Ordinary_least_squares&quot;&gt;ordinary least squares (OLS) estimator&lt;/a&gt; 
of the parameters $\beta$, $\beta_d$, $\beta_w$, $\beta_m$ and $\beta_h$ at time $T$ is the solution of the minimization problem&lt;sup id=&quot;fnref:6:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt;&lt;/p&gt;

\[\argmin_{ \left( \beta, \beta_d, \beta_w, \beta_m, \beta_h \right) \in \mathbb{R}^{5}} \sum_{t=1}^T \left( \tilde{\sigma}_{t}^2 - \beta - \beta_d ExpVP_{t-1}^{\lambda(1)} - \beta_w ExpVP_{t-1}^{\lambda(5)} - \beta_m ExpVP_{t-1}^{\lambda(25)} - \beta_h ExpVP_{t-1}^{\lambda(125)} \right)^2\]

&lt;p&gt;Alternatively, following Clements and Preve&lt;sup id=&quot;fnref:6:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt; and Clements et al.&lt;sup id=&quot;fnref:4:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;, more complex asset-specific least squares estimators than OLS can be used to try to improve forecasting performance (&lt;a href=&quot;https://en.wikipedia.org/wiki/Weighted_least_squares&quot;&gt;weighted least squares estimators (WLS)&lt;/a&gt;, &lt;a href=&quot;https://en.wikipedia.org/wiki/Robust_regression&quot;&gt;robust least squares estimators (RLS)&lt;/a&gt;…).&lt;/p&gt;
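
&lt;p&gt;For concreteness, the OLS estimator above can be obtained with any linear least squares routine; below is a minimal sketch on simulated data (the regressors at time $t$ only use observations up to time $t-1$):&lt;/p&gt;

```python
import numpy as np

def estimate_hexp_ols(var_proxy, coms=(1, 5, 25, 125)):
    """OLS estimation of the HExp parameters by linear regression of
    sigma^2_t on an intercept and the four ExpVP_{t-1} regressors."""
    vp = np.asarray(var_proxy)
    T = vp.shape[0]
    rows = []
    for t in range(1, T):  # regressors only use data up to time t-1
        row = [1.0]
        x = vp[:t]
        for com in coms:
            lam = np.log(1.0 + 1.0 / com)
            w = np.exp(-lam * np.arange(1, t + 1))
            row.append((w @ x[::-1]) / w.sum())
        rows.append(row)
    X = np.asarray(rows)
    y = vp[1:]
    betas, *_ = np.linalg.lstsq(X, y, rcond=None)
    return betas  # (beta, beta_d, beta_w, beta_m, beta_h)

rng = np.random.default_rng(42)
vp = rng.uniform(0.0001, 0.0005, size=750)
print(estimate_hexp_ols(vp))
```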

&lt;h3 id=&quot;panel-based-estimation&quot;&gt;Panel-based estimation&lt;/h3&gt;

&lt;p&gt;Bollerslev et al.&lt;sup id=&quot;fnref:2:21&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; establishes that the dynamics of realized volatility are common across many different financial assets.&lt;/p&gt;

&lt;p&gt;This is illustrated in Figure 4, directly taken from Bollerslev et al.&lt;sup id=&quot;fnref:2:22&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;, which depicts &lt;em&gt;the unconditional distributions of daily normalized realized volatilities&lt;/em&gt; for different asset classes.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/volatility-forecasting-hexp-daily-rv-distributions-bollerslev.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/volatility-forecasting-hexp-daily-rv-distributions-bollerslev-small.png&quot; alt=&quot;Normalized unconditional daily realized variance distributions for misc. asset classes. Source: Bollerslev et al.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 4. Normalized unconditional daily realized variance distributions for misc. asset classes. Source: Bollerslev et al.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;From this figure, volatility indeed seems to behave &lt;em&gt;similarly across asset classes&lt;/em&gt;&lt;sup id=&quot;fnref:2:23&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; and Bollerslev et al.&lt;sup id=&quot;fnref:2:24&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; proposes &lt;em&gt;to exploit these strong similarities in the distributions of the volatilities across and within asset classes&lt;/em&gt;&lt;sup id=&quot;fnref:2:25&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; by using 
&lt;em&gt;panel regression techniques that force the [HExp model parameters] to be the same within and across different asset classes&lt;/em&gt;&lt;sup id=&quot;fnref:2:26&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In more detail, Bollerslev et al.&lt;sup id=&quot;fnref:2:27&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; reformulates the generic HExp volatility forecasting model as follows:&lt;/p&gt;

\[\hat{\sigma}_{T+1}^2 = \tilde{\sigma}_{T}^{2, LR} + \beta_d^P \left( ExpVP_T^{\lambda(1)} - \tilde{\sigma}_{T}^{2, LR} \right) + \beta_w^P \left( ExpVP_T^{\lambda(5)} - \tilde{\sigma}_{T}^{2, LR} \right) + \beta_m^P \left( ExpVP_T^{\lambda(25)} - \tilde{\sigma}_{T}^{2, LR} \right) + \beta_h^P \left( ExpVP_T^{\lambda(125)} - \tilde{\sigma}_{T}^{2, LR} \right)\]

&lt;p&gt;, where:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$ \tilde{\sigma}_{T}^{2, LR}$ is &lt;em&gt;a long-run volatility factor, equal to the expanding sample mean of [the asset daily variance estimator] from the start of the sample up until day $T$&lt;/em&gt;&lt;sup id=&quot;fnref:2:28&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;.&lt;/li&gt;
  &lt;li&gt;$\beta_d^P$, $\beta_w^P$, $\beta_m^P$ and $\beta_h^P$ are the HExp “panel” model parameters, to be determined&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Such a reformulation - called &lt;em&gt;centering&lt;/em&gt;&lt;sup id=&quot;fnref:2:29&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; in Bollerslev et al.&lt;sup id=&quot;fnref:2:30&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; - &lt;em&gt;eliminat[es] the level of the [asset] volatility&lt;/em&gt;&lt;sup id=&quot;fnref:2:31&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; from the HExp model and enables&lt;sup id=&quot;fnref:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:11&quot; class=&quot;footnote&quot;&gt;10&lt;/a&gt;&lt;/sup&gt; the parameters $\beta_d^P$, $\beta_w^P$, $\beta_m^P$ and $\beta_h^P$ to be estimated simultaneously for all assets by &lt;a href=&quot;https://en.wikipedia.org/wiki/Cross-sectional_data&quot;&gt;panel&lt;/a&gt; regression techniques 
&lt;em&gt;that add power by exploiting the similarities in the cross-asset risk characteristics&lt;/em&gt;&lt;sup id=&quot;fnref:2:32&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Additionally&lt;sup id=&quot;fnref:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:9&quot; class=&quot;footnote&quot;&gt;11&lt;/a&gt;&lt;/sup&gt;, that specific reformulation &lt;em&gt;ensures that the iterated long-run forecasts from the model constructed on day $T$ converges to this day $T$ estimate of the “unconditional” volatility&lt;/em&gt;&lt;sup id=&quot;fnref:2:33&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
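
&lt;p&gt;Glossing over the exact regression machinery used in Bollerslev et al., this centered reformulation can be sketched by stacking the centered data of several (here, simulated) assets and running a single pooled OLS, so that the same parameters are shared by all assets:&lt;/p&gt;

```python
import numpy as np

def centered_regressors(var_proxy, coms=(1, 5, 25, 125)):
    """Centered HExp regressors: ExpVP_t^{lambda(CoM)} minus the long-run
    factor (the expanding sample mean of the variance proxy up to day t)."""
    vp = np.asarray(var_proxy)
    T = vp.shape[0]
    X, y = [], []
    for t in range(1, T):
        x = vp[:t]
        long_run = x.mean()  # expanding sample mean up to day t
        row = []
        for com in coms:
            lam = np.log(1.0 + 1.0 / com)
            w = np.exp(-lam * np.arange(1, t + 1))
            row.append((w @ x[::-1]) / w.sum() - long_run)
        X.append(row)
        y.append(vp[t] - long_run)  # the volatility level is eliminated
    return np.asarray(X), np.asarray(y)

# stack the centered data of several assets and run one pooled OLS,
# so that the same parameters are estimated for all assets at once
rng = np.random.default_rng(7)
Xs, ys = zip(*(centered_regressors(rng.uniform(1e-4, 5e-4, 500))
               for _ in range(3)))
X, y = np.vstack(Xs), np.concatenate(ys)
betas_panel, *_ = np.linalg.lstsq(X, y, rcond=None)
print(betas_panel)  # (beta_d^P, beta_w^P, beta_m^P, beta_h^P)
```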

&lt;p&gt;From a practical perspective, Figure 3 shows that estimating the HExp parameters&lt;sup id=&quot;fnref:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt; through panel-based estimation (lines &lt;em&gt;Panel&lt;/em&gt; and &lt;em&gt;Mega&lt;/em&gt;) leads to much better performance than their individual estimation (lines &lt;em&gt;Ind&lt;/em&gt;).&lt;/p&gt;

&lt;h2 id=&quot;implementation-in-portfolio-optimizer&quot;&gt;Implementation in Portfolio Optimizer&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Portfolio Optimizer&lt;/strong&gt; implements the HExp volatility forecasting model - together with &lt;a href=&quot;/blog/volatility-forecasting-har-model/&quot;&gt;all the extensions of its predecessor&lt;/a&gt; (the insanity filter described in Clements and Preve&lt;sup id=&quot;fnref:6:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt;, the log transformation…) - through the endpoint &lt;a href=&quot;https://docs.portfoliooptimizer.io/&quot;&gt;&lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;/assets/volatility/forecast/hexp&lt;/code&gt;&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;This endpoint supports the 4 variance proxies below:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Squared close-to-close returns&lt;/li&gt;
  &lt;li&gt;Demeaned squared close-to-close returns&lt;/li&gt;
  &lt;li&gt;The Parkinson range&lt;/li&gt;
  &lt;li&gt;The jump-adjusted Parkinson range&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This endpoint also supports:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;Individual and panel-based estimation of the HExp model parameters.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Using up to 5 centers-of-mass for the variance proxies, the default ones being 1, 5 and 25&lt;sup id=&quot;fnref:15&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:15&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;h2 id=&quot;example-of-usage---volatility-forecasting-at-monthly-level-for-various-etfs&quot;&gt;Example of usage - Volatility forecasting at monthly level for various ETFs&lt;/h2&gt;

&lt;p&gt;As an example of usage, I propose to enrich the results of &lt;a href=&quot;/blog/volatility-forecasting-har-model/&quot;&gt;the previous blog post&lt;/a&gt;, in which monthly forecasts produced by different volatility models&lt;sup id=&quot;fnref:16&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:16&quot; class=&quot;footnote&quot;&gt;14&lt;/a&gt;&lt;/sup&gt; 
are compared - using Mincer-Zarnowitz&lt;sup id=&quot;fnref:26&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:26&quot; class=&quot;footnote&quot;&gt;15&lt;/a&gt;&lt;/sup&gt; regressions - to the next month’s close-to-close observed volatility for 10 ETFs representative&lt;sup id=&quot;fnref:30&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:30&quot; class=&quot;footnote&quot;&gt;16&lt;/a&gt;&lt;/sup&gt; of misc. asset classes:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;U.S. stocks (SPY ETF)&lt;/li&gt;
  &lt;li&gt;European stocks (EZU ETF)&lt;/li&gt;
  &lt;li&gt;Japanese stocks (EWJ ETF)&lt;/li&gt;
  &lt;li&gt;Emerging markets stocks (EEM ETF)&lt;/li&gt;
  &lt;li&gt;U.S. REITs (VNQ ETF)&lt;/li&gt;
  &lt;li&gt;International REITs (RWX ETF)&lt;/li&gt;
  &lt;li&gt;U.S. 7-10 year Treasuries (IEF ETF)&lt;/li&gt;
  &lt;li&gt;U.S. 20+ year Treasuries (TLT ETF)&lt;/li&gt;
  &lt;li&gt;Commodities (DBC ETF)&lt;/li&gt;
  &lt;li&gt;Gold (GLD ETF)&lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;individual-estimation-1&quot;&gt;Individual estimation&lt;/h3&gt;

&lt;p&gt;Averaged results for all ETFs/regression models over each ETF price history&lt;sup id=&quot;fnref:27&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:27&quot; class=&quot;footnote&quot;&gt;17&lt;/a&gt;&lt;/sup&gt; are the following&lt;sup id=&quot;fnref:28&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:28&quot; class=&quot;footnote&quot;&gt;18&lt;/a&gt;&lt;/sup&gt;, when adding the HExp volatility forecasting model and its log variation&lt;sup id=&quot;fnref:14&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:14&quot; class=&quot;footnote&quot;&gt;19&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;table&gt;
  &lt;thead&gt;
    &lt;tr&gt;
      &lt;th&gt;Volatility model&lt;/th&gt;
      &lt;th&gt;Variance proxy&lt;/th&gt;
      &lt;th&gt;$\bar{\alpha}$&lt;/th&gt;
      &lt;th&gt;$\bar{\beta}$&lt;/th&gt;
      &lt;th&gt;$\bar{R^2}$&lt;/th&gt;
    &lt;/tr&gt;
  &lt;/thead&gt;
  &lt;tbody&gt;
    &lt;tr&gt;
      &lt;td&gt;EWMA, optimal $\lambda$&lt;/td&gt;
      &lt;td&gt;Squared close-to-close returns&lt;/td&gt;
      &lt;td&gt;4.7%&lt;/td&gt;
      &lt;td&gt;0.73&lt;/td&gt;
      &lt;td&gt;45%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;HAR&lt;/td&gt;
      &lt;td&gt;Squared close-to-close returns&lt;/td&gt;
      &lt;td&gt;-0.7%&lt;/td&gt;
      &lt;td&gt;0.95&lt;/td&gt;
      &lt;td&gt;46%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;HAR (log)&lt;/td&gt;
      &lt;td&gt;Squared close-to-close returns&lt;/td&gt;
      &lt;td&gt;0.5%&lt;/td&gt;
      &lt;td&gt;0.62&lt;/td&gt;
      &lt;td&gt;40%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;&lt;strong&gt;HExp&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;Squared close-to-close returns&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;-0.7%&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;0.93&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;48%&lt;/strong&gt;&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;&lt;strong&gt;HExp (log)&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;Squared close-to-close returns&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;2.1%&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;0.57&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;42%&lt;/strong&gt;&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;EWMA, optimal $\lambda$&lt;/td&gt;
      &lt;td&gt;Parkinson range&lt;/td&gt;
      &lt;td&gt;4.3%&lt;/td&gt;
      &lt;td&gt;1.06&lt;/td&gt;
      &lt;td&gt;48%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;HAR&lt;/td&gt;
      &lt;td&gt;Parkinson range&lt;/td&gt;
      &lt;td&gt;0.1%&lt;/td&gt;
      &lt;td&gt;1.25&lt;/td&gt;
      &lt;td&gt;44%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;HAR (log)&lt;/td&gt;
      &lt;td&gt;Parkinson range&lt;/td&gt;
      &lt;td&gt;1.9%&lt;/td&gt;
      &lt;td&gt;1.22&lt;/td&gt;
      &lt;td&gt;50%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;&lt;strong&gt;HExp&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;Parkinson range&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;-0.5%&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;1.29&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;47%&lt;/strong&gt;&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;&lt;strong&gt;HExp (log)&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;Parkinson range&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;2.0%&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;1.21&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;51%&lt;/strong&gt;&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;EWMA, optimal $\lambda$&lt;/td&gt;
      &lt;td&gt;Jump-adjusted Parkinson range&lt;/td&gt;
      &lt;td&gt;4.0%&lt;/td&gt;
      &lt;td&gt;0.76&lt;/td&gt;
      &lt;td&gt;45%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;HAR&lt;/td&gt;
      &lt;td&gt;Jump-adjusted Parkinson range&lt;/td&gt;
      &lt;td&gt;-1.4%&lt;/td&gt;
      &lt;td&gt;0.99&lt;/td&gt;
      &lt;td&gt;47%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;HAR (log)&lt;/td&gt;
      &lt;td&gt;Jump-adjusted Parkinson range&lt;/td&gt;
      &lt;td&gt;0.9%&lt;/td&gt;
      &lt;td&gt;0.92&lt;/td&gt;
      &lt;td&gt;51%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;&lt;strong&gt;HExp&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;Jump-adjusted Parkinson range&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;-1.5%&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;0.98&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;49%&lt;/strong&gt;&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;&lt;strong&gt;HExp (log)&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;Jump-adjusted Parkinson range&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;1.1%&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;0.91&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;52%&lt;/strong&gt;&lt;/td&gt;
    &lt;/tr&gt;
  &lt;/tbody&gt;
&lt;/table&gt;

&lt;h3 id=&quot;panel-based-estimation-1&quot;&gt;Panel-based estimation&lt;/h3&gt;

&lt;p&gt;Averaged results for all ETFs/regression models over the common ETF price history&lt;sup id=&quot;fnref:13&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:13&quot; class=&quot;footnote&quot;&gt;20&lt;/a&gt;&lt;/sup&gt; are the following&lt;sup id=&quot;fnref:28:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:28&quot; class=&quot;footnote&quot;&gt;18&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;When using the EWMA, HAR and HExp volatility forecasting models with an asset-specific parameter estimation procedure (for reference):&lt;/p&gt;

    &lt;table&gt;
      &lt;thead&gt;
        &lt;tr&gt;
          &lt;th&gt;Volatility model&lt;/th&gt;
          &lt;th&gt;Variance proxy&lt;/th&gt;
          &lt;th&gt;$\bar{\alpha}$&lt;/th&gt;
          &lt;th&gt;$\bar{\beta}$&lt;/th&gt;
          &lt;th&gt;$\bar{R^2}$&lt;/th&gt;
        &lt;/tr&gt;
      &lt;/thead&gt;
      &lt;tbody&gt;
        &lt;tr&gt;
          &lt;td&gt;EWMA, optimal $\lambda$ (individual est.)&lt;/td&gt;
          &lt;td&gt;Squared close-to-close returns&lt;/td&gt;
          &lt;td&gt;5%&lt;/td&gt;
          &lt;td&gt;0.72&lt;/td&gt;
          &lt;td&gt;43%&lt;/td&gt;
        &lt;/tr&gt;
        &lt;tr&gt;
          &lt;td&gt;HAR (individual est.)&lt;/td&gt;
          &lt;td&gt;Squared close-to-close returns&lt;/td&gt;
          &lt;td&gt;-0.2%&lt;/td&gt;
          &lt;td&gt;0.89&lt;/td&gt;
          &lt;td&gt;46%&lt;/td&gt;
        &lt;/tr&gt;
        &lt;tr&gt;
          &lt;td&gt;HExp (individual est.)&lt;/td&gt;
          &lt;td&gt;Squared close-to-close returns&lt;/td&gt;
          &lt;td&gt;0.02%&lt;/td&gt;
          &lt;td&gt;0.87&lt;/td&gt;
          &lt;td&gt;47%&lt;/td&gt;
        &lt;/tr&gt;
        &lt;tr&gt;
          &lt;td&gt;EWMA, optimal $\lambda$ (individual est.)&lt;/td&gt;
          &lt;td&gt;Parkinson range&lt;/td&gt;
          &lt;td&gt;4.8%&lt;/td&gt;
          &lt;td&gt;1.02&lt;/td&gt;
          &lt;td&gt;45%&lt;/td&gt;
        &lt;/tr&gt;
        &lt;tr&gt;
          &lt;td&gt;HAR (individual est.)&lt;/td&gt;
          &lt;td&gt;Parkinson range&lt;/td&gt;
          &lt;td&gt;0.7%&lt;/td&gt;
          &lt;td&gt;1.17&lt;/td&gt;
          &lt;td&gt;46%&lt;/td&gt;
        &lt;/tr&gt;
        &lt;tr&gt;
          &lt;td&gt;HExp (individual est.)&lt;/td&gt;
          &lt;td&gt;Parkinson range&lt;/td&gt;
          &lt;td&gt;1.2%&lt;/td&gt;
          &lt;td&gt;1.15&lt;/td&gt;
          &lt;td&gt;47%&lt;/td&gt;
        &lt;/tr&gt;
        &lt;tr&gt;
          &lt;td&gt;EWMA, optimal $\lambda$ (individual est.)&lt;/td&gt;
          &lt;td&gt;Jump-adjusted Parkinson range&lt;/td&gt;
          &lt;td&gt;4.5%&lt;/td&gt;
          &lt;td&gt;0.73&lt;/td&gt;
          &lt;td&gt;43%&lt;/td&gt;
        &lt;/tr&gt;
        &lt;tr&gt;
          &lt;td&gt;HAR (individual est.)&lt;/td&gt;
          &lt;td&gt;Jump-adjusted Parkinson range&lt;/td&gt;
          &lt;td&gt;-0.9%&lt;/td&gt;
          &lt;td&gt;0.94&lt;/td&gt;
          &lt;td&gt;46%&lt;/td&gt;
        &lt;/tr&gt;
        &lt;tr&gt;
          &lt;td&gt;HExp (individual est.)&lt;/td&gt;
          &lt;td&gt;Jump-adjusted Parkinson range&lt;/td&gt;
          &lt;td&gt;-0.4%&lt;/td&gt;
          &lt;td&gt;0.90&lt;/td&gt;
          &lt;td&gt;46%&lt;/td&gt;
        &lt;/tr&gt;
      &lt;/tbody&gt;
    &lt;/table&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;When using the HAR and HExp volatility forecasting models with a panel-based parameter estimation procedure comparable to the &lt;em&gt;Mega&lt;/em&gt; procedure described in Bollerslev et al.&lt;sup id=&quot;fnref:2:34&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

    &lt;table&gt;
      &lt;thead&gt;
        &lt;tr&gt;
          &lt;th&gt;Volatility model&lt;/th&gt;
          &lt;th&gt;Variance proxy&lt;/th&gt;
          &lt;th&gt;$\bar{\alpha}$&lt;/th&gt;
          &lt;th&gt;$\bar{\beta}$&lt;/th&gt;
          &lt;th&gt;$\bar{R^2}$&lt;/th&gt;
        &lt;/tr&gt;
      &lt;/thead&gt;
      &lt;tbody&gt;
        &lt;tr&gt;
          &lt;td&gt;HAR (panel est.)&lt;/td&gt;
          &lt;td&gt;Squared close-to-close returns&lt;/td&gt;
          &lt;td&gt;2.2%&lt;/td&gt;
          &lt;td&gt;0.76&lt;/td&gt;
          &lt;td&gt;47%&lt;/td&gt;
        &lt;/tr&gt;
        &lt;tr&gt;
          &lt;td&gt;HExp (panel est.)&lt;/td&gt;
          &lt;td&gt;Squared close-to-close returns&lt;/td&gt;
          &lt;td&gt;1.1%&lt;/td&gt;
          &lt;td&gt;0.80&lt;/td&gt;
          &lt;td&gt;47%&lt;/td&gt;
        &lt;/tr&gt;
        &lt;tr&gt;
          &lt;td&gt;HAR (panel est.)&lt;/td&gt;
          &lt;td&gt;Parkinson range&lt;/td&gt;
          &lt;td&gt;3.1%&lt;/td&gt;
          &lt;td&gt;1.11&lt;/td&gt;
          &lt;td&gt;50%&lt;/td&gt;
        &lt;/tr&gt;
        &lt;tr&gt;
          &lt;td&gt;HExp (panel est.)&lt;/td&gt;
          &lt;td&gt;Parkinson range&lt;/td&gt;
          &lt;td&gt;3.6%&lt;/td&gt;
          &lt;td&gt;1.08&lt;/td&gt;
          &lt;td&gt;50%&lt;/td&gt;
        &lt;/tr&gt;
        &lt;tr&gt;
          &lt;td&gt;HAR (panel est.)&lt;/td&gt;
          &lt;td&gt;Jump-adjusted Parkinson range&lt;/td&gt;
          &lt;td&gt;0.09%&lt;/td&gt;
          &lt;td&gt;0.88&lt;/td&gt;
          &lt;td&gt;46%&lt;/td&gt;
        &lt;/tr&gt;
        &lt;tr&gt;
          &lt;td&gt;HExp (panel est.)&lt;/td&gt;
          &lt;td&gt;Jump-adjusted Parkinson range&lt;/td&gt;
          &lt;td&gt;-0.07%&lt;/td&gt;
          &lt;td&gt;0.89&lt;/td&gt;
          &lt;td&gt;48%&lt;/td&gt;
        &lt;/tr&gt;
      &lt;/tbody&gt;
    &lt;/table&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;comments&quot;&gt;Comments&lt;/h3&gt;

&lt;p&gt;From the results of the two previous subsections, it is possible to make the following comments:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Consistent with Bollerslev et al.&lt;sup id=&quot;fnref:2:35&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;, the HExp model is uniformly better than the HAR model in terms of $R$-squared.&lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Contrary to Bollerslev et al.&lt;sup id=&quot;fnref:2:36&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;, the panel-based estimation procedure does not seem to dramatically improve the performance of the HAR/HExp models, except when the Parkinson range is used as a daily variance proxy.&lt;/p&gt;

    &lt;p&gt;Comparing lines #1, 2, 5 and 6 with lines #3 and 4 suggests performing the same test with (high-frequency) realized variances in order to confirm that this behavior is due to the “quality” of the daily variance proxy used.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;h2 id=&quot;conclusion&quot;&gt;Conclusion&lt;/h2&gt;

&lt;p&gt;This blog post empirically confirmed that the HExp volatility forecasting model of Bollerslev et al.&lt;sup id=&quot;fnref:2:37&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; belongs to the category of the &lt;em&gt;state-of-the-art dynamic [risk models]&lt;/em&gt;&lt;sup id=&quot;fnref:2:38&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; published in the literature.&lt;/p&gt;

&lt;p&gt;This blog post also concludes this series on volatility forecasting by weighted moving average models, at least until I find a better such model.&lt;/p&gt;

&lt;p&gt;While waiting for that to happen, or for a blog post on volatility forecasting by a non-weighted moving average model, feel free to &lt;a href=&quot;https://www.linkedin.com/in/roman-rubsamen/&quot;&gt;connect with me on LinkedIn&lt;/a&gt; or to &lt;a href=&quot;https://twitter.com/portfoliooptim&quot;&gt;follow me on Twitter&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;–&lt;/p&gt;

&lt;div class=&quot;footnotes&quot; role=&quot;doc-endnotes&quot;&gt;
  &lt;ol&gt;
    &lt;li id=&quot;fn:4&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4733597&quot;&gt;Clements, Adam and Preve, Daniel P. A. and Tee, Clarence, Harvesting the HAR-X Volatility Model&lt;/a&gt;. &lt;a href=&quot;#fnref:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:4:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:4:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:3&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://academic.oup.com/jfec/article-abstract/7/2/174/856522&quot;&gt;Fulvio Corsi, A Simple Approximate Long-Memory Model of Realized Volatility, Journal of Financial Econometrics, Volume 7, Issue 2, Spring 2009, Pages 174–196&lt;/a&gt;. &lt;a href=&quot;#fnref:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:3:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:2&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://academic.oup.com/rfs/article/31/7/2729/5001472&quot;&gt;Tim Bollerslev, Benjamin Hood, John Huss, Lasse Heje Pedersen, Risk Everywhere: Modeling and Managing Volatility, The Review of Financial Studies, Volume 31, Issue 7, July 2018, Pages 2729–2773&lt;/a&gt;. &lt;a href=&quot;#fnref:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:2:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;10&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;11&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:11&quot; class=&quot;reversefootnote&quot; 
role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;12&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:12&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;13&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:13&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;14&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:14&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;15&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:15&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;16&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:16&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;17&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:17&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;18&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:18&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;19&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:19&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;20&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:20&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;21&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:21&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;22&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:22&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;23&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:23&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;24&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:24&quot; class=&quot;reversefootnote&quot; 
role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;25&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:25&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;26&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:26&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;27&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:27&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;28&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:28&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;29&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:29&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;30&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:30&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;31&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:31&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;32&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:32&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;33&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:33&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;34&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:34&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;35&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:35&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;36&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:36&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;37&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:37&quot; class=&quot;reversefootnote&quot; 
role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;38&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:38&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;39&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:6&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.sciencedirect.com/science/article/abs/pii/S0378426621002417&quot;&gt;Adam Clements, Daniel P.A. Preve, A Practical Guide to harnessing the HAR volatility model, Journal of Banking &amp;amp; Finance, Volume 133, 2021&lt;/a&gt;. &lt;a href=&quot;#fnref:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:6:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:6:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:6:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:6:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:6:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:1&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.pm-research.com/content/iijderiv/4/3/63&quot;&gt;Boudoukh, J., Richardson, M., &amp;amp; Whitelaw, R.F. (1997). Investigation of a class of volatility estimators, Journal of Derivatives, 4 Spring, 63-71&lt;/a&gt;. &lt;a href=&quot;#fnref:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:5&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;These lags correspond to the daily, weekly and monthly realized variance components of the original HAR model. &lt;a href=&quot;#fnref:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:7&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;In Bollerslev et al.&lt;sup id=&quot;fnref:2:39&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;, the calculation of $ExpVP_T^{\lambda(CoM)}$ uses only the first 500 lags (i.e., is truncated to $T=500$) because &lt;em&gt;the influence of the remaining lags is numerically immaterial&lt;/em&gt;&lt;sup id=&quot;fnref:2:40&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:10&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Bollerslev et al.&lt;sup id=&quot;fnref:2:41&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; discusses the impact of replacing realized volatilities by daily squared returns in the HExp model. &lt;a href=&quot;#fnref:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:8&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://centaur.reading.ac.uk/21316/&quot;&gt;Brooks, Chris and Persand, Gitanjali (2003) Volatility forecasting for risk management. Journal of Forecasting, 22(1). pp. 1-22&lt;/a&gt;. &lt;a href=&quot;#fnref:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:8:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:11&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Eliminating the level of the asset volatility is a prerequisite for using panel regression techniques because &lt;em&gt;the very different volatility levels for different asset classes means that&lt;/em&gt;&lt;sup id=&quot;fnref:2:42&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; 1) &lt;em&gt;it is unreasonable to force the $\beta$ intercepts to be the same&lt;/em&gt;&lt;sup id=&quot;fnref:2:43&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; for all assets and 2) it is necessary &lt;em&gt;to ensure that all [the remaining] parameters are “scale-free” in the sense that they do not depend on the level of risk&lt;/em&gt;&lt;sup id=&quot;fnref:2:44&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:11&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:9&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Thanks to that reformulation, the coefficients $\beta_d$, $\beta_w$, $\beta_m$ and $\beta_h$ are also free &lt;em&gt;(i.e., need not sum to one)&lt;/em&gt;&lt;sup id=&quot;fnref:2:45&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;, which allows easy fitting via OLS. &lt;a href=&quot;#fnref:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:12&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;As a side note, the centering reformulation described in Bollerslev et al.&lt;sup id=&quot;fnref:2:46&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; is also applicable to the HAR volatility forecasting model and is done in Bollerslev et al.&lt;sup id=&quot;fnref:2:47&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:12&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:15&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Contrary to Bollerslev et al.&lt;sup id=&quot;fnref:2:48&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;, the center-of-mass 125 is not included by default in the HExp model as implemented in &lt;strong&gt;Portfolio Optimizer&lt;/strong&gt;; this choice was made to make the default HExp model implementation directly comparable with the default HAR model implementation. &lt;a href=&quot;#fnref:15&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:16&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Using &lt;strong&gt;Portfolio Optimizer&lt;/strong&gt;. &lt;a href=&quot;#fnref:16&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:26&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://econpapers.repec.org/bookchap/nbrnberch/1214.htm&quot;&gt;Mincer, J. and V. Zarnowitz (1969). The evaluation of economic forecasts. In J. Mincer (Ed.), Economic Forecasts and Expectations&lt;/a&gt;. &lt;a href=&quot;#fnref:26&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:30&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;These ETFs are used in the &lt;em&gt;Adaptive Asset Allocation&lt;/em&gt; strategy from &lt;a href=&quot;https://investresolve.com/&quot;&gt;ReSolve Asset Management&lt;/a&gt;, described in the paper &lt;em&gt;Adaptive Asset Allocation: A Primer&lt;/em&gt;&lt;sup id=&quot;fnref:31&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:31&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:30&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:27&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The common ending price history of all the ETFs is 31 August 2023, but there is no common starting price history, as all ETFs started trading on different dates. &lt;a href=&quot;#fnref:27&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:28&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;For all models, I used an expanding window for the volatility forecast computation. &lt;a href=&quot;#fnref:28&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:28:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:14&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The log HExp model is similar in spirit to the log HAR model described in &lt;a href=&quot;/blog/volatility-forecasting-har-model/&quot;&gt;the previous blog post&lt;/a&gt;. &lt;a href=&quot;#fnref:14&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:13&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The common starting price history of the ETFs is 31 July 2007 and their common ending price history is 31 August 2023. &lt;a href=&quot;#fnref:13&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:31&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2328254&quot;&gt;Butler, Adam and Philbrick, Mike and Gordillo, Rodrigo and Varadi, David, Adaptive Asset Allocation: A Primer&lt;/a&gt;. &lt;a href=&quot;#fnref:31&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
  &lt;/ol&gt;
&lt;/div&gt;</content><author><name>Roman R.</name></author><category term="volatility" /><summary type="html">In this series on volatility forecasting, I previously detailed the Heterogeneous AutoRegressive (HAR) volatility forecasting model that has become the workhorse of the volatility forecasting literature1 since its introduction by Corsi2. I will now describe an extension of that model due to Bollerslev et al.3, called the Heterogeneous Exponential (HExp) volatility forecasting model, in which the lagged HAR volatility components are exponentially - rather than arithmetically - averaged. In addition, I will also discuss the panel-based estimation procedure for the HExp and the HAR model parameters proposed in Bollerslev et al.3, which is empirically demonstrated3 to improve the out-of-sample forecasting performances of these two volatility forecasting models when compared to the standard2 individual asset-based procedure. Finally, I will illustrate the practical performances of the HExp volatility forecasting model and its panel-based parameters estimation procedure in the context of monthly volatility forecasting for various ETFs. Mathematical preliminaries (reminders) This section contains reminders from the first blog post of this series. Volatility modelling and volatility proxies Let $r_t$ be the logarithmic return of an asset over a time period $t$ (a day, a week, a month..), over which its (conditional) mean return is supposed to be null. Then: The asset (conditional) variance is defined as $ \sigma_t^2 = \mathbb{E} \left[ r_t^2 \right] $ From this definition, the squared return $r_t^2$ of an asset is a (noisy4) variance estimator - or variance proxy4 - for that asset variance over the considered time period. Another example of an asset variance proxy is the Parkinson range of an asset. 
Yet another example of an asset variance proxy, this time over a specific time period $t$ of one day, is the daily realized variance $RV_t$, which is defined as the sum of the asset squared intraday returns sampled at a high frequency (1 minutes, 5 minutes, 15 minutes…). The generic notation for an asset variance proxy in this blog post is $\tilde{\sigma}_t^2$. The asset (conditional) volatility is defined as $ \sigma_t = \sqrt { \sigma_t^2 } $ The generic notation for an asset volatility proxy in this blog post is $\tilde{\sigma}_t$. Weighted moving average volatility forecasting model Boudoukh et al.5 shows that many seemingly different methods of volatility forecasting actually share the same underlying representation of the estimate of an asset next period’s variance $\hat{\sigma}_{T+1}^2$ as a weighted moving average of that asset past periods’ variance proxies $\tilde{\sigma}^2_t$, $t=1..T$, with \[\hat{\sigma}_{T+1}^2 = w_0 + \sum_{i=1}^{k} w_i \tilde{\sigma}^2_{T+1-i}\] , where: $k$, with $1 \leq k \leq T$, is the size of the moving average, possibly time-dependent $w_i, i=0..k$ are the weights of the moving average, possibly time-dependent as well The original HAR volatility forecasting model The HAR volatility forecasting model is an additive cascade model of different volatility components2 subject to economically meaningful restrictions2. 
Under that model, an asset next day’s daily realized variance $RV_{T+1}$ is forecasted through the formula4: \[\hat{RV}_{T+1} = \beta + \beta_d RV_{T} + \beta_w RV_{T}^w + \beta_m RV_{T}^m\] , where: $\hat{RV}_{T+1}$ is the forecast at time $T$ of the asset next day’s daily realized variance $RV_{T+1}$ $RV_T$ is the asset daily realized variance at time $T$ $RV_T^w = \frac{1}{5} \sum_{i=1}^5 RV_{T-i+1}$ is the asset weekly realized variance at time $T$ $RV_T^m = \frac{1}{22} \sum_{i=1}^{22} RV_{T-i+1}$ is the asset monthly realized variance at time $T$ $\beta$, $\beta_d$, $\beta_w$ and $\beta_m$ are the HAR model parameters, to be determined The HExp volatility forecasting model Discontinuity of the HAR volatility forecasting model Under the HAR volatility forecasting model, forecasted future volatilities depends on the past volatilities in a way that is [dis]continuous […] in the lag lengths3 due to the presence of simple moving averages, which might lead to potential variance estimation issues3. As noted by Bollerslev et al.3: The stepwise nature of the volatility factors employed in the HAR models, imply that the forecasts from the models are subject to potentially abrupt changes as an unusually large/small daily lagged [realized variance] drops out of the sums for the longer-horizon lagged volatility factors. Figure 1, adapted from Bollerslev et al.3, illustrates the lag coefficients implied by the regression coefficients3 of the HAR model together with those of a 21-day simple moving average model. Figure 1. Lag coefficients of the HAR and of the 21-day simple moving average volatility forecasting models. Source: Bollerslev et al. The discontinuity of the HAR model at the 1-day, 5-day and 21-day lags6 is apparent, similar in spirit to the discontinuity of the 21-day simple moving average model at the 21-day lag. 
The original HExp volatility forecasting model In order to avoid the stepwise changes inherent in the forecast from the HAR component-type structure3, Bollerslev et al.3 proposes to replace the simple moving averages appearing in the HAR model by exponentially weighted moving averages. Under the resulting volatility forecasting model - denoted the Heterogenous Exponential realized volatility model (HExp for short)3 - an asset next day’s daily realized variance $RV_{T+1}$ is forecasted through the formula37 \[\hat{RV}_{T+1} = \beta + \beta_d ExpVP_T^{\lambda(1)} + \beta_w ExpVP_T^{\lambda(5)} + \beta_m ExpVP_T^{\lambda(25)} + \beta_h ExpVP_T^{\lambda(125)}\] , where: $\hat{RV}_{T+1}$ is the forecast at time $T$ of the asset next day’s daily realized variance $RV_{T+1}$ $ExpVP_T^{\lambda(CoM)}$ $=$ $\sum_{i=1}^T \frac{e^{-i \lambda(CoM)}}{\sum_{j=1}^T e^{-j \lambda(CoM)}} RV_{T+1-i} $ $\lambda \left(CoM\right)$ $=$ $\log \left( 1 + \frac{1}{CoM} \right)$, with CoM standing for center-of-mass3 $RV_i$ is the asset daily realized variance at time $i$, $i=1..T$ $\beta$, $\beta_d$, $\beta_w$, $\beta_m$ and $\beta_h$ are the HExp model parameters, to be determined To be noted that each center-of-mass used in the HExp model (1, 5, 25 and 125) effectively summarizes the “average” horizon of the lagged realized volatilities that it uses3 and that they all have been chosen in Bollerslev et al.3 so as to “span” the universe of past [realized variance]’s in a way that is both parsimonious and “smooth”3. Speaking of smoothness, Figure 2, again adapted from Bollerslev et al.3, compares the lag coefficients implied by the regression coefficients3 of the HAR model with those of the HExp model. Figure 2. Lag coefficients of the HAR and of the HExp volatility forecasting models. Source: Bollerslev et al. The continuous nature of the HExp volatility forecasting model is clearly visible. 
In terms of practical performances, the HExp model perform[s] well in out-of-sample risk forecasting3 and is even slightly more performant than the HAR model in terms of $r$-squared, as can be seen on Figure 3 adapted from Bollerslev et al.3. Figure 3. Out-of-sample $r$-squared of the HAR model v.s. the HExp model for predicting the 20-day future realized volatility of several assets and for different methods of parameters estimation (Ind, Panel, Mega, that will be discussed in the next section). Source: Bollerslev et al. Realized variance v.s. generic variance proxy The original HExp model described in the previous subsection relies on a very specific asset variance proxy - the realized variance of an asset - over a very specific time period - a day - for its definition. Similarly to the HAR model, it is possible to replace the daily realized variance by any generic daily variance estimator like daily squared returns8 or any daily range-based variance estimator (Parkinson, Garman-Klass, Rogers-Satchell…). 
This leads to the generic HExp volatility forecasting model, under which an asset next days’s conditional variance $\sigma_{T+1}^2$ is forecasted through the formula \[\hat{\sigma}_{T+1}^2 = \beta + \beta_d ExpVP_T^{\lambda(1)} + \beta_w ExpVP_T^{\lambda(5)} + \beta_m ExpVP_T^{\lambda(25)} + \beta_h ExpVP_T^{\lambda(125)}\] , where: $\hat{\sigma}_{T+1}^2$ is the forecast at time $T$ of the asset next day’s conditional variance $\sigma_{T+1}^2$ $ExpVP_T^{\lambda(CoM)} = \sum_{i=1}^T \frac{e^{-i \lambda(CoM)}}{\sum_{j=1}^T e^{-j \lambda(CoM)}} \tilde{\sigma}^2_{T+1-i} $ $\lambda \left(CoM\right) = \log \left( 1 + \frac{1}{CoM} \right) $ $\tilde{\sigma}^2_{i}$ is the asset daily variance estimator at time $i$, $i=1..T$ $\beta$, $\beta_d$, $\beta_w$, $\beta_m$ and $\beta_h$ are the HExp model parameters, to be determined Relationship with the generic weighted moving average model From its definition, it is not too difficult to see that the HExp volatility forecasting model is a specific kind of weighted moving average volatility forecasting model, with: $w_0 = \beta$ $w_1 = \beta_d \frac{e^{- \lambda(1)}}{\sum_{j=1}^T e^{-j \lambda(1)}}$ $+$ $\beta_w \frac{e^{- \lambda(5)}}{\sum_{j=1}^T e^{-j \lambda(5)}}$ $+$ $\beta_m \frac{e^{- \lambda(25)}}{\sum_{j=1}^T e^{-j \lambda(25)}}$ $+$ $\beta_h \frac{e^{- \lambda(125)}}{\sum_{j=1}^T e^{-j \lambda(125)}} $ $w_2 = …$ Volatility forecasting formulas Under an HExp volatility forecasting model, the generic weighted moving average volatility forecasting formula becomes: To estimate an asset next day’s volatility: \[\hat{\sigma}_{T+1} = \sqrt{ \beta + \beta_d ExpVP_T^{\lambda(1)} + \beta_w ExpVP_T^{\lambda(5)} + \beta_m ExpVP_T^{\lambda(25)} + \beta_h ExpVP_T^{\lambda(125)} }\] To estimate an asset next $h$-day’s ahead volatility9, $h \geq 2$, using an indirect1 multi-step ahead forecast scheme: \[\hat{\sigma}_{T+h} = \sqrt{ \beta + \beta_d ExpVP_{T+h-1}^{\lambda(1)} + \beta_w ExpVP_{T+h-1}^{\lambda(5)} + \beta_m 
ExpVP_{T+h-1}^{\lambda(25)} + \beta_h ExpVP_{T+h-1}^{\lambda(125)} }\] , where: $ExpVP_{T+h-1}^{\lambda(CoM)}$ $=$ $\sum_{i=1}^T \frac{e^{-i \lambda(CoM)}}{\sum_{j=1}^{T+h-1} e^{-j \lambda(CoM)}} \tilde{\sigma}^2_{T+1-i} $ $+$ $\sum_{i=1}^{h-1} \frac{e^{-i \lambda(CoM)}}{\sum_{j=1}^{T+h-1} e^{-j \lambda(CoM)}} \hat{\sigma}^2_{T+h-i} $ To estimate an asset aggregated volatility9 over the next $h$ days: \[\hat{\sigma}_{T+1:T+h} = \sqrt{ \sum_{i=1}^{h} \hat{\sigma}^2_{T+i} }\] Estimating the HExp model parameters Individual estimation As for the HAR model, the easiest way to estimate the HExp model parameters is by applying simple linear regression2 on an asset-by-asset basis3, in which case the asset-specific ordinary least squares (OLS) estimator of the parameters $\beta$, $\beta_d$, $\beta_w$, $\beta_m$ and $\beta_h$ at time $T$ is the solution of the minimization problem4 \[\argmin_{ \left( \beta, \beta_d, \beta_w, \beta_m, \beta_h \right) \in \mathbb{R}^{5}} \sum_{t=1}^T \left( \tilde{\sigma}_{t}^2 - \beta - \beta_d ExpVP_{t-1}^{\lambda(1)} - \beta_w ExpVP_{t-1}^{\lambda(5)} - \beta_m ExpVP_{t-1}^{\lambda(25)} - \beta_h ExpVP_{t-1}^{\lambda(125)} \right)^2\] Alternatively, following Clements and Preve4 and Clements et al.1, more complex asset-specific least squares estimators than OLS can be used to try to improve forecast performances (weighted least squares estimators (WLS), robust least squares estimators (RLS)…) Panel-based estimation Bollerslev et al.3 establishes that the dynamics of realized volatility are common across many different financial assets. This is illustrated in Figure 4, directly taken from Bollerslev et al.3, which depicts the unconditional distributions of daily normalized realized volatilities for different asset classes. Figure 4. Normalized unconditional daily realized variance distributions for misc. asset classes. Source: Bollerslev et al. 
From this figure, volatility indeed seems to behave similarly across asset classes3 and Bollerslev et al.3 proposes to exploit these strong similarities in the distributions of the volatilities across and within asset classes3 by using panel regression techniques that force the [HExp model parameters] to be the same within and across different asset classes3. In more details, Bollerslev et al.3 reformulates the generic HExp volatility forecasting model as follows: \[\hat{\sigma}_{T+1}^2 = \tilde{\sigma}_{T}^{2, LR} + \beta_d^P \left( ExpVP_T^{\lambda(1)} - \tilde{\sigma}_{T}^{2, LR} \right) + \beta_w^P \left( ExpVP_T^{\lambda(5)} - \tilde{\sigma}_{T}^{2, LR} \right) + \beta_m^P \left( ExpVP_T^{\lambda(25)} - \tilde{\sigma}_{T}^{2, LR} \right) + \beta_h^P \left( ExpVP_T^{\lambda(125)} - \tilde{\sigma}_{T}^{2, LR} \right)\] , where: $ \tilde{\sigma}_{T}^{2, LR}$ is a long-run volatility factor, equal to the expanding sample mean of [the asset daily variance estimator] from the start of the sample up until day $T$3. $\beta_d^P$, $\beta_w^P$, $\beta_m^P$ and $\beta_h^P$ are the HExp “panel” model parameters, to be determined Such a reformulation - called centering3 in Bollerslev et al.3 - eliminat[es] the level of the [asset] volatility3 from the HExp model and enables10 the parameters $\beta_d^P$, $\beta_w^P$, $\beta_m^P$ and $\beta_h^P$ to be estimated simultaneously for all assets by panel regression techniques that add power by exploiting the similarities in the cross-asset risk characteristics3. Additionally11, that specific reformulation ensures that the iterated long-run forecasts from the model constructed on day $T$ converges to this day $T$ estimate of the “unconditional” volatility3. From a practical perspective, Figure 3 shows that estimating the HExp parameters12 through panel-based estimation (lines Panel and Mega) leads to much better performances v.s. their individual estimation (lines Ind). 
Implementation in Portfolio Optimizer Portfolio Optimizer implements the HExp volatility forecasting model - together with all the extensions of its predecessor (the insanity filter described in Clements and Preve4, the log transformation…) - through the endpoint /assets/volatility/forecast/hexp. This endpoint supports the 4 variance proxies below: Squared close-to-close returns Demeaned squared close-to-close returns The Parkinson range The jump-adjusted Parkinson range This endpoint also supports: Individual and panel-based estimation of the HExp model parameters. Using up to 5 centers-of-mass for the variance proxies, the default ones being 1, 5 and 2513. Example of usage - Volatility forecasting at monthly level for various ETFs As an example of usage, I propose to enrich the results of the previous blog post, in which monthly forecasts produced by different volatility models14 are compared - using Mincer-Zarnowitz15 regressions - to the next month’s close-to-close observed volatility for 10 ETFs representative16 of misc. asset classes: U.S. stocks (SPY ETF) European stocks (EZU ETF) Japanese stocks (EWJ ETF) Emerging markets stocks (EEM ETF) U.S. REITs (VNQ ETF) International REITs (RWX ETF) U.S. 7-10 year Treasuries (IEF ETF) U.S. 
20+ year Treasuries (TLT ETF) Commodities (DBC ETF) Gold (GLD ETF) Individual estimation Averaged results for all ETFs/regression models over each ETF price history17 are the following18, when adding the HExp volatility forecasting model and its log variation19: Volatility model Variance proxy $\bar{\alpha}$ $\bar{\beta}$ $\bar{R^2}$ EWMA, optimal $\lambda$ Squared close-to-close returns 4.7% 0.73 45% HAR Squared close-to-close returns -0.7% 0.95 46% HAR (log) Squared close-to-close returns 0.5% 0.62 40% HExp Squared close-to-close returns -0.7% 0.93 48% HExp (log) Squared close-to-close returns 2.1% 0.57 42% EWMA, optimal $\lambda$ Parkinson range 4.3% 1.06 48% HAR Parkinson range 0.1% 1.25 44% HAR (log) Parkinson range 1.9% 1.22 50% HExp Parkinson range -0.5% 1.29 47% HExp (log) Parkinson range 2.0% 1.21 51% EWMA, optimal $\lambda$ Jump-adjusted Parkinson range 4.0% 0.76 45% HAR Jump-adjusted Parkinson range -1.4% 0.99 47% HAR (log) Jump-adjusted Parkinson range 0.9% 0.92 51% HExp Jump-adjusted Parkinson range -1.5% 0.98 49% HExp (log) Jump-adjusted Parkinson range 1.1% 0.91 52% Panel-based estimation Averaged results for all ETFs/regression models over the common ETF price history20 are the following18: When using the EWMA, HAR and HExp volatility forecasting models with an asset-specific parameters estimation procedure (for reference): Volatility model Variance proxy $\bar{\alpha}$ $\bar{\beta}$ $\bar{R^2}$ EWMA, optimal $\lambda$ (individual est.) Squared close-to-close returns 5% 0.72 43% HAR (individual est.) Squared close-to-close returns -0.2% 0.89 46% HExp (individual est.) Squared close-to-close returns 0.02% 0.87 47% EWMA, optimal $\lambda$ (individual est.) Parkinson range 4.8% 1.02 45% HAR (individual est.) Parkinson range 0.7% 1.17 46% HExp (individual est.) Parkinson range 1.2% 1.15 47% EWMA, optimal $\lambda$ (individual est.) Jump-adjusted Parkinson range 4.5% 0.73 43% HAR (individual est.) 
Jump-adjusted Parkinson range -0.9% 0.94 46% HExp (individual est.) Jump-adjusted Parkinson range -0.4% 0.90 46% When using the HAR and HExp volatility forecasting models with a panel-based parameters estimation procedure comparable to the Mega procedure described in Bollerslev et al.3: Volatility model Variance proxy $\bar{\alpha}$ $\bar{\beta}$ $\bar{R^2}$ HAR (panel est.) Squared close-to-close returns 2.2% 0.76 47% HExp (panel est.) Squared close-to-close returns 1.1% 0.80 47% HAR (panel est.) Parkinson range 3.1% 1.11 50% HExp (panel est.) Parkinson range 3.6% 1.08 50% HAR (panel est.) Jump-adjusted Parkinson range 0.09% 0.88 46% HExp (panel est.) Jump-adjusted Parkinson range -0.07% 0.89 48% Comments From the results of the two previous subsections, it is possible to make the following comments: Consistent with Bollerslev et al.3, the HExp model is uniformly better than the HAR model in terms of r-squared. Contrary to Bollerslev et al.3, the panel-based estimation procedure does not seem to dramatically improve the HAR/HExp models' performance, except when the Parkinson range is used as a daily variance proxy. Comparing lines #1,2,5,6 with lines #3,4 suggests performing the same test with (high frequency) realized variances in order to confirm that this is due to the “quality” of the daily variance proxy used. Conclusion This blog post empirically confirmed that the HExp volatility forecasting model of Bollerslev et al.3 belongs to the category of the state-of-the-art dynamic [risk models]3 published in the literature. This blog post also concludes this series on volatility forecasting by weighted moving average models, at least until I find a better such model than the HExp model. While waiting for that to happen, or for a blog post on volatility forecasting by a non-weighted moving average model, feel free to connect with me on LinkedIn or to follow me on Twitter. – See Clements, Adam and Preve, Daniel P. A. 
and Tee, Clarence, Harvesting the HAR-X Volatility Model. &amp;#8617; &amp;#8617;2 &amp;#8617;3 See Fulvio Corsi, A Simple Approximate Long-Memory Model of Realized Volatility, Journal of Financial Econometrics, Volume 7, Issue 2, Spring 2009, Pages 174–196. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 See Tim Bollerslev, Benjamin Hood, John Huss, Lasse Heje Pedersen, Risk Everywhere: Modeling and Managing Volatility, The Review of Financial Studies, Volume 31, Issue 7, July 2018, Pages 2729–2773. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 &amp;#8617;7 &amp;#8617;8 &amp;#8617;9 &amp;#8617;10 &amp;#8617;11 &amp;#8617;12 &amp;#8617;13 &amp;#8617;14 &amp;#8617;15 &amp;#8617;16 &amp;#8617;17 &amp;#8617;18 &amp;#8617;19 &amp;#8617;20 &amp;#8617;21 &amp;#8617;22 &amp;#8617;23 &amp;#8617;24 &amp;#8617;25 &amp;#8617;26 &amp;#8617;27 &amp;#8617;28 &amp;#8617;29 &amp;#8617;30 &amp;#8617;31 &amp;#8617;32 &amp;#8617;33 &amp;#8617;34 &amp;#8617;35 &amp;#8617;36 &amp;#8617;37 &amp;#8617;38 &amp;#8617;39 See Adam Clements, Daniel P.A. Preve, A Practical Guide to harnessing the HAR volatility model, Journal of Banking &amp;amp; Finance, Volume 133, 2021. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 See Boudoukh, J., Richardson, M., &amp;amp; Whitelaw, R.F. (1997). Investigation of a class of volatility estimators, Journal of Derivatives, 4 Spring, 63-71. &amp;#8617; These lags correspond to the daily, weekly and monthly realized variance components of the original HAR model. &amp;#8617; In Bollerslev et al.3, the calculation of $ExpVP_T^{\lambda(CoM)}$ uses only the first 500 lags (i.e., is truncated to $T=500$) because the influence of the remaining lags is numerically immaterial3. &amp;#8617; Bollerslev et al.3 discusses the impact of replacing realized volatilities by daily squared returns in the HExp model. 
&amp;#8617; See Brooks, Chris and Persand, Gitanjali (2003) Volatility forecasting for risk management. Journal of Forecasting, 22(1). pp. 1-22. &amp;#8617; &amp;#8617;2 Eliminating the level of the asset volatility is a pre-requisite in order to use panel regression techniques because the very different volatility levels for different asset classes means that3 1) it is unreasonable to force the $\beta$ intercepts to be the same3 for all assets and 2) it is necessary to ensure that all [the remaining] parameters are “scale-free” in the sense that they do not depend on the level of risk3. &amp;#8617; Thanks to that reformulation, the coefficients $\beta_d$, $\beta_w$, $\beta_m$ and $\beta_h$ are also free (i.e., need not sum to one)3, which allows an easy fitting through OLS. &amp;#8617; As a side note, the centering reformulation described in Bollerslev et al.3 is also applicable to the HAR volatility forecasting model and is done in Bollerslev et al.3. &amp;#8617; Contrary to Bollerslev et al.3, the center-of-mass 125 is not included by default in the HExp model as implemented in Portfolio Optimizer; this choice was made to make the default HExp model implementation directly comparable with the default HAR model implementation. &amp;#8617; Using Portfolio Optimizer. &amp;#8617; See Mincer, J. and V. Zarnowitz (1969). The evaluation of economic forecasts. In J. Mincer (Ed.), Economic Forecasts and Expectations. &amp;#8617; These ETFs are used in the Adaptative Asset Allocation strategy from ReSolve Asset Management, described in the paper Adaptive Asset Allocation: A Primer21. &amp;#8617; The common ending price history of all the ETFs is 31 August 2023, but there is no common starting price history, as all ETFs started trading on different dates. &amp;#8617; For all models, I used an expanding window for the volatility forecast computation. &amp;#8617; &amp;#8617;2 The log HExp model is similar in spirit to the log HAR model described in the previous blog post. 
&amp;#8617; The common starting price history of the ETFs is 31 July 2007 and their common ending price history is 31 August 2023. &amp;#8617; See Butler, Adam and Philbrick, Mike and Gordillo, Rodrigo and Varadi, David, Adaptive Asset Allocation: A Primer. &amp;#8617;</summary></entry><entry><title type="html">The Mathematics of Portfolio Return: Simple Return, Money-Weighted Return and Time-Weighted Return</title><link href="https://portfoliooptimizer.io/blog/the-mathematics-of-portfolio-return-simple-return-money-weighted-return-and-time-weighted-return/" rel="alternate" type="text/html" title="The Mathematics of Portfolio Return: Simple Return, Money-Weighted Return and Time-Weighted Return" /><published>2025-02-05T00:00:00-06:00</published><updated>2025-02-05T00:00:00-06:00</updated><id>https://portfoliooptimizer.io/blog/the-mathematics-of-portfolio-return-simple-return-money-weighted-return-and-time-weighted-return</id><content type="html" xml:base="https://portfoliooptimizer.io/blog/the-mathematics-of-portfolio-return-simple-return-money-weighted-return-and-time-weighted-return/">&lt;p&gt;&lt;em&gt;Whether we manage our own investment assets or choose to hire others to manage the assets on our behalf we are keen to know how well our […] portfolio of assets is performing&lt;/em&gt;&lt;sup id=&quot;fnref:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; and &lt;em&gt;the calculation of portfolio return is the first step in [that] performance measurement process&lt;/em&gt;&lt;sup id=&quot;fnref:1:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Now, while &lt;em&gt;the matter of measuring the rate of return of [a portfolio] appears, on the surface, to be simple enough&lt;/em&gt;&lt;sup id=&quot;fnref:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, the presence of external cash flows - either contributions to the portfolio or withdrawals from the portfolio - leads to the definition of several rate of returns, with &lt;em&gt;no single rate of return measure [being] appropriate for every purpose&lt;/em&gt;&lt;sup id=&quot;fnref:3:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In this blog post, strongly inspired by the book &lt;em&gt;Practical Portfolio Performance Measurement and Attribution&lt;/em&gt;&lt;sup id=&quot;fnref:3:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; from &lt;a href=&quot;https://www.linkedin.com/in/carl-bacon-86984b101&quot;&gt;Carl Bacon&lt;/a&gt;, I will describe the two main methods of portfolio return calculation in the presence of external cash flows.&lt;/p&gt;

&lt;p&gt;As an example of usage, I will illustrate the investor gap in the case of an MSCI World ETF, that is, I will show that the returns of that ETF are different from the returns achieved by the average investor in that ETF.&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;&lt;strong&gt;&lt;em&gt;Notes:&lt;/em&gt;&lt;/strong&gt;&lt;/p&gt;
  &lt;ul&gt;
    &lt;li&gt;A fully functional Google sheet corresponding to this post is available &lt;a href=&quot;https://docs.google.com/spreadsheets/d/1dqHW6uGbrTZnv8uHCr0qHgHRXBtYLM6JJQiRbpb8TlQ/edit?usp=sharing&quot;&gt;here&lt;/a&gt;&lt;/li&gt;
  &lt;/ul&gt;
&lt;/blockquote&gt;

&lt;h2 id=&quot;external-vs-internal-cash-flows&quot;&gt;External vs. internal cash flows&lt;/h2&gt;

&lt;p&gt;In the context of portfolio return calculation, two types of cash flows are usually distinguished:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;External cash flows, corresponding to &lt;em&gt;any new money added to or taken from [a] portfolio, whether in the form of cash or other assets&lt;/em&gt;&lt;sup id=&quot;fnref:1:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
  &lt;li&gt;Internal cash flows, corresponding to &lt;em&gt;transactions funded from within [a] portfolio&lt;/em&gt;&lt;sup id=&quot;fnref:1:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; like dividend and coupon payments, free shares attributed by companies, positions rebalancing, etc.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;In other words, external cash flows change the valuation of a portfolio through means other than the growth (positive or negative) of the funds invested in that portfolio, while internal cash flows have no impact on the portfolio valuation.&lt;/p&gt;

&lt;h2 id=&quot;portfolio-return-in-the-absence-of-external-cash-flows&quot;&gt;Portfolio return in the absence of external cash flows&lt;/h2&gt;

&lt;p&gt;Let be:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$T$, the number of observations of the value of a portfolio over a time period $1..T$, with $t_1 &lt; t_2 &lt; … &lt; t_T$ the $T$ observation times&lt;/li&gt;
  &lt;li&gt;$V_1$ the value of the portfolio at the initial observation time $t_1$&lt;/li&gt;
  &lt;li&gt;$V_T$ the value of the portfolio at the final observation time $t_T$&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;When no external cash flow occurred over a time period, the return $r_p$ of a portfolio over that period is defined as &lt;em&gt;the change of the portfolio value relative to its beginning value&lt;/em&gt;&lt;sup id=&quot;fnref:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Mathematically, using the notations above, this gives:&lt;/p&gt;

\[r_p = \frac{V_T - V_1}{V_1} = \frac{V_T}{V_1} - 1\]

&lt;p&gt;$r_p$ is called the &lt;em&gt;simple rate of return&lt;/em&gt; or the &lt;em&gt;arithmetic rate of return&lt;/em&gt; of the portfolio over the time period $1..T$.&lt;/p&gt;

&lt;p&gt;Note that, by definition, the arithmetic return of a portfolio over a time period takes into account any internal cash flow generated by the portfolio constituents&lt;sup id=&quot;fnref:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt; 
(dividends for stocks, coupon payments for bonds…); for this reason, the arithmetic return of a portfolio is also sometimes called the &lt;em&gt;total (arithmetic) return&lt;/em&gt; of the portfolio&lt;sup id=&quot;fnref:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:11&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
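As a quick numerical illustration of the formula above, here is a minimal Python sketch; the portfolio values 100 and 112 are hypothetical, made up for the example:

```python
# Simple (arithmetic) rate of return: the change in portfolio value
# relative to its beginning value, valid only when no external cash
# flow occurred over the period.
def simple_return(v_initial: float, v_final: float) -> float:
    return v_final / v_initial - 1

# Hypothetical portfolio worth 100 at t_1 and 112 at t_T
r_p = simple_return(100.0, 112.0)
print(f"{r_p:.2%}")  # 12.00%
```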

&lt;h2 id=&quot;portfolio-return-in-the-presence-of-external-cash-flows&quot;&gt;Portfolio return in the presence of external cash flows&lt;/h2&gt;

&lt;p&gt;Let now be:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$T$, the number of observations of the value of a portfolio over a time period $1..T$, with $t_1 &lt; t_2 &lt; … &lt; t_T$ the $T$ observation times&lt;/li&gt;
  &lt;li&gt;$V_i$ the value of the portfolio at the observation time $t_i$, $i=1..T$&lt;/li&gt;
  &lt;li&gt;$C_i$ the value of a potential external cash flow ($C_i &amp;gt; 0$ for a contribution, $C_i &amp;lt; 0$ for a withdrawal) at the observation time $t_i$, $i=1..T$, assumed to occur immediately after the observation time $t_i$ (i.e., the cash flow $C_i$ is assumed to be excluded from the portfolio value $V_i$)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;When an external cash flow occurred over a time period, &lt;em&gt;the cash flow itself will contribute to the [portfolio] valuation&lt;/em&gt;&lt;sup id=&quot;fnref:1:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;, so that &lt;em&gt;the calculation of [the portfolio return] […] must compensate for the fact that the increase in market value is not entirely due to investment gain[/loss] during the period&lt;/em&gt;&lt;sup id=&quot;fnref:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Since at least 1968&lt;sup id=&quot;fnref:1:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;, there have been two main measures of such a compensated portfolio return:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;The portfolio &lt;em&gt;money-weighted return (MWR)&lt;/em&gt;, which integrates the timing and the amount of the external cash flows in the portfolio return, leading to a measure of portfolio return that includes the impacts of both the decisions to contribute money in (resp. withdraw money from) the portfolio and &lt;em&gt;the decisions about asset allocation and security selection&lt;/em&gt;&lt;sup id=&quot;fnref:2:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;.&lt;/li&gt;
  &lt;li&gt;The portfolio &lt;em&gt;time-weighted return (TWR)&lt;/em&gt;, which eliminates the impact of the external cash flows from the portfolio return, leading to a measure of portfolio return that isolates the &lt;em&gt;decisions about asset allocation and security selection&lt;/em&gt;&lt;sup id=&quot;fnref:2:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;.&lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;money-weighted-return&quot;&gt;Money-weighted return&lt;/h3&gt;

&lt;p&gt;The money-weighted return of a portfolio is &lt;em&gt;a performance statistic reflecting how much money was earned during the measurement period&lt;/em&gt;&lt;sup id=&quot;fnref:2:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt; and is thus representative of &lt;em&gt;the return an investor actually experiences&lt;/em&gt;&lt;sup id=&quot;fnref:2:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;As such, the money-weighted return is influenced by both:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;&lt;em&gt;The decisions made by the [portfolio] manager [- who can be the investor herself] - to allocate assets and select securities within the portfolio&lt;/em&gt;&lt;sup id=&quot;fnref:2:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;&lt;em&gt;The timing of [the investor’s] decisions to contribute to or withdraw money from [the portfolio]&lt;/em&gt;&lt;sup id=&quot;fnref:2:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;, with the different cases being summarized in Figure 1, taken from Feibel&lt;sup id=&quot;fnref:2:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

    &lt;figure&gt;
    &lt;a href=&quot;/assets/images/blog/portfolio-return-irr-cash-flows-impact-feibel.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/portfolio-return-irr-cash-flows-impact-feibel-small.png&quot; alt=&quot;Impact of external cash flows on money-weighted portfolio performances. Source: Feibel.&quot; /&gt;&lt;/a&gt;
    &lt;figcaption&gt;Figure 1. Impact of external cash flows on money-weighted portfolio performances. Source: Feibel.&lt;/figcaption&gt;
  &lt;/figure&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;h4 id=&quot;calculation&quot;&gt;Calculation&lt;/h4&gt;

&lt;p&gt;To &lt;em&gt;borrow a methodology used throughout finance&lt;/em&gt;&lt;sup id=&quot;fnref:1:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;, the money-weighted return of a portfolio corresponds to the &lt;a href=&quot;https://en.wikipedia.org/wiki/Internal_rate_of_return&quot;&gt;internal rate of return (IRR)&lt;/a&gt; of that portfolio, 
defined as the rate of return which &lt;em&gt;reconciles the beginning market value and additional cash flows into the portfolio to the ending market value&lt;/em&gt;&lt;sup id=&quot;fnref:2:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;h5 id=&quot;internal-rate-of-return-method&quot;&gt;Internal rate of return method&lt;/h5&gt;

&lt;p&gt;Mathematically, the calculation of the (annualized) money-weighted return $r_{mw,irr}$ of a portfolio over a time period through the IRR method is done by solving what is usually called &lt;em&gt;the IRR equation&lt;/em&gt;&lt;sup id=&quot;fnref:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;7&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Using the notations of this section, that equation is:&lt;/p&gt;

\[V_1 + \sum_{i=1}^{T} \frac{C_i}{ \left(1 + r_{mw,irr}\right)^{yearfrac(t_1, t_i)}} - \frac{V_T}{ \left(1 + r_{mw,irr}\right)^{yearfrac(t_1, t_T)}} = 0\]

&lt;p&gt;, where $yearfrac$ is the number of (fractional) years between two dates using a given &lt;a href=&quot;https://en.wikipedia.org/wiki/Day_count_convention&quot;&gt;day count convention&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Numerically, the IRR equation is usually solved by an iterative algorithm, such as &lt;a href=&quot;https://en.wikipedia.org/wiki/Newton%27s_method&quot;&gt;the Newton-Raphson method&lt;/a&gt; or similar root-finding methods.&lt;/p&gt;
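As an illustration, here is a minimal Newton-Raphson sketch solving the IRR equation, using the convention that contributions ($C_i &gt; 0$) discounted at the IRR, together with the initial value, reconcile to the discounted final value. The dates, cash flows and the ACT/365 day count convention are all assumptions made up for the example, and this is not a production-grade solver (no safeguards against divergence or multiple roots):

```python
from datetime import date

def yearfrac(d1: date, d2: date) -> float:
    """Fractional years between two dates, ACT/365 convention (an assumption)."""
    return (d2 - d1).days / 365.0

def irr(v1, vT, t1, tT, flows, guess=0.1, tol=1e-10, max_iter=100):
    """Solve V_1 + sum_i C_i/(1+r)^yearfrac(t_1,t_i)
             - V_T/(1+r)^yearfrac(t_1,t_T) = 0
    by Newton-Raphson; `flows` is a list of (date, amount) external
    cash flows (positive = contribution, negative = withdrawal)."""
    r = guess
    for _ in range(max_iter):
        # Residual f(r) of the IRR equation and its derivative f'(r)
        f = v1 - vT * (1 + r) ** -yearfrac(t1, tT)
        df = vT * yearfrac(t1, tT) * (1 + r) ** (-yearfrac(t1, tT) - 1)
        for ti, ci in flows:
            tau = yearfrac(t1, ti)
            f += ci * (1 + r) ** -tau
            df -= ci * tau * (1 + r) ** (-tau - 1)
        if abs(f) < tol:
            break
        r -= f / df  # Newton-Raphson update
    return r

# Hypothetical example: 100 invested on 2020-01-01, 50 contributed on
# 2020-07-01, portfolio worth 165 on 2021-01-01
r_mw = irr(100.0, 165.0, date(2020, 1, 1), date(2021, 1, 1),
           [(date(2020, 7, 1), 50.0)])
print(f"{r_mw:.2%}")
```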

&lt;h5 id=&quot;modified-dietz-method&quot;&gt;Modified Dietz method&lt;/h5&gt;

&lt;p&gt;The calculation of $r_{mw,irr}$ described in the previous sub-section was &lt;em&gt;a problem when computer CPU time was very expensive and needed to be conserved&lt;/em&gt;&lt;sup id=&quot;fnref:1:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;, which &lt;em&gt;led to the development of various IRR estimation techniques that did not require [an] iterative algorithm&lt;/em&gt;&lt;sup id=&quot;fnref:1:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;One of these approximations - the &lt;em&gt;modified Dietz&lt;/em&gt; method - is &lt;em&gt;a first-order [closed-form] linear approximation to the IRR&lt;/em&gt;&lt;sup id=&quot;fnref:1:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; method and is &lt;em&gt;the most common&lt;sup id=&quot;fnref:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:9&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt; way of calculating periodic investment returns&lt;/em&gt;&lt;sup id=&quot;fnref:1:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Mathematically, the calculation of the (total) money-weighted return $r_{mw,md}$ of a portfolio over a time period through the modified Dietz method is done by incorporating “weighted” external cash flows into the formula of the simple rate of return.&lt;/p&gt;

&lt;p&gt;Using the notations of this section, this gives:&lt;/p&gt;

\[r_{mw,md} = \frac{V_T - V_1 - \sum_{i=1}^T C_i}{V_1 + \sum_{i=1}^T \left( 1 - \frac{yearfrac(t_1, t_i)}{yearfrac(t_1, t_T)} \right) C_i }\]

&lt;p&gt;, where the terms $ 1 - \frac{yearfrac(t_1, t_i)}{yearfrac(t_1, t_T)} $, $i=1..T$, are used to weight &lt;em&gt;the [external cash] flows by the number of days they were available for investment during the period&lt;/em&gt;&lt;sup id=&quot;fnref:1:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;h4 id=&quot;caveats&quot;&gt;Caveats&lt;/h4&gt;

&lt;p&gt;The two money-weighted return calculation methods discussed in the previous sub-sections have certain limitations to be aware of:&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;The IRR method provides no theoretical guarantee in general&lt;sup id=&quot;fnref:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:7&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt; as to the existence or the uniqueness of an internal rate of return.&lt;/p&gt;

    &lt;p&gt;Indeed, the IRR equation might not have any solution or might have several solutions&lt;sup id=&quot;fnref:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;10&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

    &lt;p&gt;In addition, even when the IRR equation has a unique solution, it might be numerically very hard to find…&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The modified Dietz method has its own issues, for example &lt;em&gt;the accuracy of the result [being] dependent on relatively small capital flows, low volatility and frequent valuations&lt;/em&gt;&lt;sup id=&quot;fnref:13&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:13&quot; class=&quot;footnote&quot;&gt;11&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

    &lt;p&gt;In particular, a very important point of attention is that the &lt;em&gt;modified Dietz [method] is less useful as an approximation to [IRR] over longer time periods&lt;/em&gt;&lt;sup id=&quot;fnref:1:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;h4 id=&quot;when-to-use-it&quot;&gt;When to use it?&lt;/h4&gt;

&lt;p&gt;Calculating the money-weighted return of a portfolio makes it possible to answer the question&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;&lt;em&gt;How much did a specific investor’s portfolio grow, thanks to both its underlying strategy and its pattern of external cash flows?&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Thus, the money-weighted return of a portfolio is an appropriate measure of portfolio performance when it is desired to:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Compute the rate of return earned on a specific portfolio&lt;/li&gt;
  &lt;li&gt;Compare the rate of return earned on a specific portfolio to another rate (rate of inflation, rate of return earned on another portfolio&lt;sup id=&quot;fnref:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt;, rate of return earned on an alternative investment like real estate or private equity…)&lt;/li&gt;
  &lt;li&gt;Analyse the pattern of external cash flows for a specific portfolio&lt;sup id=&quot;fnref:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:10&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;time-weighted-return&quot;&gt;Time-weighted return&lt;/h3&gt;

&lt;p&gt;The time-weighted return of a portfolio is &lt;em&gt;a form of total return that measures the performance […] for the complete measurement period&lt;/em&gt;&lt;sup id=&quot;fnref:2:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt; in a way that fully &lt;em&gt;eliminates the timing effect that external portfolio cash flows have&lt;/em&gt;&lt;sup id=&quot;fnref:2:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;So, contrary to its money-weighted counterpart, the time-weighted return of a portfolio measures &lt;em&gt;only the effects of the market and manager decision&lt;/em&gt;&lt;sup id=&quot;fnref:2:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;As a side note, the time-weighted return of a portfolio is &lt;em&gt;the only well-behaved rate of return that is not influenced by contributions or withdrawals&lt;/em&gt;&lt;sup id=&quot;fnref:3:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;, c.f. for example Gray and Dewar&lt;sup id=&quot;fnref:3:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;h4 id=&quot;calculation-1&quot;&gt;Calculation&lt;/h4&gt;

&lt;p&gt;The calculation of the time-weighted return of a portfolio over a time period is easily done through the &lt;em&gt;unit price or unitised method&lt;/em&gt;&lt;sup id=&quot;fnref:1:13&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;, described in Bacon&lt;sup id=&quot;fnref:1:14&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;A standardised unit price or “net asset value” price is calculated immediately before each external cash flow by dividing the market value by the number of units previously allocated. Units are then added or subtracted (bought or sold) in the portfolio at the unit price corresponding to the time of the cash flow […].&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Mathematically, using the notations of this section, let be $U_i$, $i=1..T$, corresponding to the following transformation&lt;sup id=&quot;fnref:15&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:15&quot; class=&quot;footnote&quot;&gt;14&lt;/a&gt;&lt;/sup&gt; of $V_i$, $i=1..T$:&lt;/p&gt;

\[U_1 = 1 \newline U_{i+1} = U_i \frac{V_{i+1}}{V_i + C_i}, i=1..T-1\]

&lt;p&gt;The (total) time-weighted return $r_{tw}$ of the portfolio over the time period $1..T$ is then defined as the simple rate of return of the transformed portfolio values $U_i$, $i=1..T$:&lt;/p&gt;

\[r_{tw} = \frac{U_T - U_1}{U_1} = \frac{U_T}{U_1} - 1\]
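The unitised method above is a one-pass computation; here is a minimal Python sketch (the portfolio values and cash flows are hypothetical, chosen for the example):

```python
def time_weighted_return(values, flows):
    """Time-weighted return via the unitised method: `values` are the
    portfolio values V_1..V_T at the observation times and `flows[i]`
    is the external cash flow C_i occurring immediately after the
    i-th observation (0 if none)."""
    u = 1.0
    for i in range(len(values) - 1):
        # Each sub-period return is computed on the value *after*
        # the cash flow, which neutralizes its timing impact
        u *= values[i + 1] / (values[i] + flows[i])
    return u - 1  # simple return of the transformed values U_1..U_T

# Hypothetical example: 100 at t_1, 105 at t_2 followed by a 50
# contribution, 165 at t_3
r_tw = time_weighted_return([100.0, 105.0, 165.0], [0.0, 50.0, 0.0])
print(f"{r_tw:.2%}")  # about 11.8%
```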

&lt;h4 id=&quot;caveats-1&quot;&gt;Caveats&lt;/h4&gt;

&lt;p&gt;The time-weighted return has two main limitations to be aware of:&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;One is practical, with Feibel&lt;sup id=&quot;fnref:2:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt; noting that &lt;em&gt;there is […]  a potentially significant hurdle to implementing this method: the time-weighted return methodology requires valuation of the portfolio before each cash flow&lt;/em&gt;&lt;sup id=&quot;fnref:2:13&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

    &lt;p&gt;For an individual investor, and depending on the exact tools used, this might or might not be a real issue, though.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;One is methodological, with the time-weighted return of a portfolio being sometimes positive while the overall portfolio is at a loss&lt;sup id=&quot;fnref:16&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:16&quot; class=&quot;footnote&quot;&gt;15&lt;/a&gt;&lt;/sup&gt;!&lt;/p&gt;

    &lt;p&gt;Bacon&lt;sup id=&quot;fnref:1:15&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; and Feibel&lt;sup id=&quot;fnref:2:14&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt; provide examples of such cases, the bottom line being that &lt;em&gt;it is important [for the portfolio manager] to perform well in the […] period[s] when the majority of client money is invested&lt;/em&gt;&lt;sup id=&quot;fnref:1:16&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; with the time-weighted return calculation.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;h4 id=&quot;when-to-use-it-1&quot;&gt;When to use it?&lt;/h4&gt;

&lt;p&gt;Calculating the time-weighted return of a portfolio makes it possible to answer the question&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;&lt;em&gt;How much did a specific investor’s portfolio grow thanks to its underlying strategy (asset allocation, exposure…)?&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;The time-weighted return of a portfolio is thus an appropriate measure of portfolio performance when it is desired to &lt;em&gt;compar[e] the performance of [a portfolio with] different asset managers […] and with benchmark indexes&lt;/em&gt;&lt;sup id=&quot;fnref:1:17&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;h3 id=&quot;hybrid-return&quot;&gt;Hybrid return&lt;/h3&gt;

&lt;p&gt;Bacon&lt;sup id=&quot;fnref:1:18&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; notes that &lt;em&gt;in practice, many asset managers use neither true time-weighted nor money-weighted calculations exclusively but rather a hybrid combination of both&lt;/em&gt;&lt;sup id=&quot;fnref:1:19&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;, some of which are depicted in Figure 2, taken from Bacon&lt;sup id=&quot;fnref:1:20&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;figure&gt;
  &lt;a href=&quot;/assets/images/blog/portfolio-return-different-returns-bacon.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/portfolio-return-different-returns-bacon-small.png&quot; alt=&quot;The evolution of performance returns methodologies. Source: Bacon.&quot; /&gt;&lt;/a&gt;
  &lt;figcaption&gt;Figure 2. The evolution of performance returns methodologies. Source: Bacon.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;h3 id=&quot;money-weighted-vs-time-weighted-return&quot;&gt;Money-weighted vs. time-weighted return&lt;/h3&gt;

&lt;p&gt;To conclude this section, Figure 3, taken from Feibel&lt;sup id=&quot;fnref:2:15&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;, summarizes the respective properties of money-weighted and time-weighted returns.&lt;/p&gt;

&lt;figure&gt;
  &lt;a href=&quot;/assets/images/blog/portfolio-return-mwr-vs-twr-feibel.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/portfolio-return-mwr-vs-twr-feibel-small.png&quot; alt=&quot;Properties of money-weighted and time-weighted returns. Source: Feibel.&quot; /&gt;&lt;/a&gt;
  &lt;figcaption&gt;Figure 3. Properties of money-weighted and time-weighted returns. Source: Feibel.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;To that summary, I would add that, for an individual investor, comparing the money-weighted return and the time-weighted return of a portfolio makes it possible to determine whether the timing of contributions and withdrawals has been successful ($r_{mw} \geq r_{tw}$) or not ($r_{mw} &lt; r_{tw}$).&lt;/p&gt;

&lt;p&gt;In the latter case, it might be worth reviewing&lt;sup id=&quot;fnref:17&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:17&quot; class=&quot;footnote&quot;&gt;16&lt;/a&gt;&lt;/sup&gt; the timing strategy for external cash flows (panic sells, &lt;a href=&quot;https://en.wikipedia.org/wiki/Fear_of_missing_out&quot;&gt;FOMO&lt;/a&gt; buys…).&lt;/p&gt;
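&lt;p&gt;To make that comparison concrete, below is a minimal, self-contained Python sketch of both computations, using hypothetical numbers (not taken from this blog post): the time-weighted return chain-links sub-period returns between external cash flow dates, while the money-weighted return is the internal rate of return of the cash flows, computed here by simple bisection.&lt;/p&gt;

```python
# Hypothetical example: contrasting time-weighted and money-weighted returns.
# An investor puts in 100, then contributes another 100 just before a -10%
# sub-period, i.e. an ill-timed external cash flow.

def time_weighted_return(values_before_flow, flows):
    """Chain-link sub-period returns; values_before_flow[i] is the portfolio
    value just before the external flow flows[i] hits the portfolio."""
    growth = 1.0
    for i in range(1, len(values_before_flow)):
        start = values_before_flow[i - 1] + flows[i - 1]  # value right after the previous flow
        growth *= values_before_flow[i] / start
    return growth - 1.0

def money_weighted_return(cash_flows, lo=-0.99, hi=10.0, tol=1e-10):
    """IRR by bisection over (time, amount) pairs: contributions negative,
    withdrawals and the final portfolio value positive."""
    def npv(r):
        return sum(cf / (1.0 + r) ** t for t, cf in cash_flows)
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if npv(mid) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0

values = [0.0, 110.0, 189.0]  # valuations just before each external flow
flows = [100.0, 100.0, 0.0]   # external flows at each valuation date

twr = time_weighted_return(values, flows)
mwr = money_weighted_return([(0, -100.0), (1, -100.0), (2, 189.0)])
print(f"r_tw = {twr:.2%}, r_mw = {mwr:.2%}")  # r_mw lags r_tw: ill-timed flow
```

&lt;p&gt;Here, the money-weighted return (~-3.7%) falls short of the time-weighted return (-1%), flagging the ill-timed contribution.&lt;/p&gt;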

&lt;h2 id=&quot;implementation-in-portfolio-optimizer&quot;&gt;Implementation in Portfolio Optimizer&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Portfolio Optimizer&lt;/strong&gt; makes it possible to:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Compute the money-weighted and the time-weighted return of a portfolio over a time period, through the endpoints &lt;a href=&quot;https://docs.portfoliooptimizer.io/&quot;&gt;&lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;/portfolios/analysis/return/money-weighted&lt;/code&gt;&lt;/a&gt; and &lt;a href=&quot;https://docs.portfoliooptimizer.io/&quot;&gt;&lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;/portfolios/analysis/return/time-weighted&lt;/code&gt;&lt;/a&gt;.&lt;/li&gt;
  &lt;li&gt;Transform portfolio values impacted by external cash flows into money-weighted or time-weighted portfolio values directly usable to compute portfolio performance indicators (return, Sharpe ratio, maximum drawdown…), through the endpoints &lt;a href=&quot;https://docs.portfoliooptimizer.io/&quot;&gt;&lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;/portfolios/transformation/money-weighted&lt;/code&gt;&lt;/a&gt; and &lt;a href=&quot;https://docs.portfoliooptimizer.io/&quot;&gt;&lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;/portfolios/transformation/time-weighted&lt;/code&gt;&lt;/a&gt;.&lt;/li&gt;
&lt;/ul&gt;
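&lt;p&gt;For illustration only, a call to the money-weighted return endpoint might be sketched as follows; the base URL and the payload field names below are assumptions, not the actual API contract, so please consult &lt;a href=&quot;https://docs.portfoliooptimizer.io/&quot;&gt;the API documentation&lt;/a&gt; for the real request schema.&lt;/p&gt;

```python
# ILLUSTRATIVE ONLY: the base URL and payload field names below are
# assumptions, not the documented Portfolio Optimizer API contract; see
# https://docs.portfoliooptimizer.io/ for the actual request schema.
import json

BASE_URL = "https://api.portfoliooptimizer.io/v1"  # assumed base URL

# Hypothetical portfolio values and external cash flows over the period.
payload = {
    "portfolioValues": [1000.0, 1150.0, 1210.0],
    "portfolioCashFlows": [0.0, 100.0, 0.0],
}

# The actual HTTP call would look something like the following; it is
# commented out because it needs the third-party `requests` package and
# network access:
#
# import requests
# response = requests.post(
#     f"{BASE_URL}/portfolios/analysis/return/money-weighted", json=payload
# )
# print(response.json())

print(json.dumps(payload, indent=2))
```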

&lt;h2 id=&quot;examples-of-usage&quot;&gt;Examples of usage&lt;/h2&gt;

&lt;p&gt;I propose to illustrate the differences between money-weighted returns and time-weighted returns through two examples:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Comparing a &lt;a href=&quot;https://en.wikipedia.org/wiki/Dollar_cost_averaging&quot;&gt;dollar cost averaging&lt;/a&gt; investment strategy in an ETF to a &lt;a href=&quot;https://en.wikipedia.org/wiki/Lump_sum&quot;&gt;lump sum&lt;/a&gt; investment strategy in the same ETF&lt;/li&gt;
  &lt;li&gt;Comparing the returns of an ETF to the returns of an “average” investor in that ETF&lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;dollar-cost-averaging-vs-lump-sum-investing-in-an-msci-world-etf&quot;&gt;Dollar cost averaging vs. lump sum investing in an MSCI World ETF&lt;/h3&gt;

&lt;p&gt;Let’s suppose that we would like to compare two investment strategies in the &lt;a href=&quot;https://www.amundietf.fr/fr/professionnels/produits/equity/amundi-msci-world-ucits-etf-eur-c/lu1681043599&quot;&gt;Amundi MSCI World UCITS ETF - EUR (C)&lt;/a&gt; over the period 29/12/2023 - 31/12/2024:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Dollar cost averaging (DCA) investing
    &lt;ul&gt;
      &lt;li&gt;Portfolio creation - 1000€ on 29/12/2023&lt;/li&gt;
      &lt;li&gt;Portfolio contributions - 100€ at each month’s end&lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;
  &lt;li&gt;Lump sum (LS) investing
    &lt;ul&gt;
      &lt;li&gt;Portfolio creation - 2200€ on 29/12/2023&lt;/li&gt;
      &lt;li&gt;Portfolio contributions - n/a&lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;In both cases, the amount of external portfolio cash flows is the same&lt;sup id=&quot;fnref:18&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:18&quot; class=&quot;footnote&quot;&gt;17&lt;/a&gt;&lt;/sup&gt; - equal to 2200€ - but the patterns of these cash flows are different, leading to different portfolio returns:&lt;/p&gt;

&lt;table&gt;
  &lt;thead&gt;
    &lt;tr&gt;
      &lt;th&gt;Portfolio return measure&lt;/th&gt;
      &lt;th&gt;Total return (%)&lt;/th&gt;
    &lt;/tr&gt;
  &lt;/thead&gt;
  &lt;tbody&gt;
    &lt;tr&gt;
      &lt;td&gt;Simple return (DCA)&lt;/td&gt;
      &lt;td&gt;161.42%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;Money-weighted return (DCA)&lt;/td&gt;
      &lt;td&gt;25.45%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;Time-weighted return (DCA)&lt;/td&gt;
      &lt;td&gt;26.33%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;Simple return (LS)&lt;/td&gt;
      &lt;td&gt;26.33%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;Money-weighted return (LS)&lt;/td&gt;
      &lt;td&gt;26.33%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;Time-weighted return (LS)&lt;/td&gt;
      &lt;td&gt;26.33%&lt;/td&gt;
    &lt;/tr&gt;
  &lt;/tbody&gt;
&lt;/table&gt;

&lt;blockquote&gt;
  &lt;p&gt;&lt;strong&gt;&lt;em&gt;Notes:&lt;/em&gt;&lt;/strong&gt;&lt;/p&gt;
  &lt;ul&gt;
    &lt;li&gt;Detailed calculations are available in &lt;a href=&quot;https://docs.google.com/spreadsheets/d/1dqHW6uGbrTZnv8uHCr0qHgHRXBtYLM6JJQiRbpb8TlQ/edit?usp=sharing&quot;&gt;the Google Sheet associated with this blog post&lt;/a&gt;.&lt;/li&gt;
  &lt;/ul&gt;
&lt;/blockquote&gt;

&lt;p&gt;The table above empirically confirms that:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;In the absence of external cash flows, all portfolio return measures discussed in this blog post become equal (lines #4-#6)&lt;/li&gt;
  &lt;li&gt;In the presence of external cash flows
    &lt;ul&gt;
      &lt;li&gt;
        &lt;p&gt;The simple return is not an appropriate measure of portfolio return (line #1)&lt;/p&gt;

        &lt;p&gt;Despite this, at the date of publication of this blog post, some financial services still use that measure to compute portfolio returns in the presence of external cash flows, like &lt;a href=&quot;https://www.trading212.com/&quot;&gt;Trading 212&lt;/a&gt;, cf. &lt;a href=&quot;https://community.trading212.com/t/returns-graph-instead-of-portfolio-graph/17053/15&quot;&gt;the associated discussion on their forum&lt;/a&gt;.&lt;/p&gt;

        &lt;p&gt;Usually, it is because of data availability problems like difficulties in identifying external cash flows, but sometimes it is simply because providing accurate portfolio performance measures is not a priority…&lt;/p&gt;
      &lt;/li&gt;
      &lt;li&gt;
        &lt;p&gt;The money-weighted return is impacted by the timing of external cash flows (line #2)&lt;/p&gt;

        &lt;p&gt;Here, there is a ~-1% difference between the portfolio money-weighted and time-weighted returns, mainly due to the ill-timed&lt;sup id=&quot;fnref:20&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:20&quot; class=&quot;footnote&quot;&gt;18&lt;/a&gt;&lt;/sup&gt; contribution made on 28/03/2024.&lt;/p&gt;

        &lt;p&gt;Such a difference is relatively insignificant over one year, but depending on the pattern of external cash flows, it could be much worse.&lt;/p&gt;

        &lt;p&gt;For example, if a single contribution of 1200€ had been made on 28/03/2024 instead of monthly contributions of 100€, the associated portfolio money-weighted return would decrease to 22.59%, this time a ~-4% difference!&lt;/p&gt;
      &lt;/li&gt;
      &lt;li&gt;
        &lt;p&gt;The time-weighted return is not impacted by external cash flows&lt;sup id=&quot;fnref:19&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:19&quot; class=&quot;footnote&quot;&gt;19&lt;/a&gt;&lt;/sup&gt; and is akin to a “lump sum investing”-equivalent return (line #3 is equal to lines #4-#6)&lt;/p&gt;
      &lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;investment-returns-vs-average-investor-returns-in-an-msci-world-etf&quot;&gt;Investment returns vs. average investor returns in an MSCI World ETF&lt;/h3&gt;

&lt;p&gt;Financial studies regularly show that investors tend to lag actual fund returns across a variety of asset classes, leading to what is usually called the &lt;em&gt;investor gap&lt;/em&gt;&lt;sup id=&quot;fnref:22&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:22&quot; class=&quot;footnote&quot;&gt;20&lt;/a&gt;&lt;/sup&gt;&lt;sup id=&quot;fnref:24&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:24&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;For example:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;The &lt;a href=&quot;https://www.dalbar.com/&quot;&gt;Dalbar’s&lt;/a&gt; &lt;a href=&quot;https://www.qaib.com/&quot;&gt;Quantitative Analysis of Investor Behavior (QAIB)&lt;/a&gt; annual study &lt;em&gt;measure[s] the effects of investor decisions to buy, sell and switch into and out of funds over short and long-term timeframes&lt;/em&gt;&lt;sup id=&quot;fnref:21&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:21&quot; class=&quot;footnote&quot;&gt;22&lt;/a&gt;&lt;/sup&gt; and &lt;em&gt;the results consistently show that the average investor earns less – in many cases, much less – than mutual fund performance reports would suggest&lt;/em&gt;&lt;sup id=&quot;fnref:21:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:21&quot; class=&quot;footnote&quot;&gt;22&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

    &lt;p&gt;For the U.S., Figure 4, adapted from Dalbar&lt;sup id=&quot;fnref:21:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:21&quot; class=&quot;footnote&quot;&gt;22&lt;/a&gt;&lt;/sup&gt;, depicts the returns of U.S. funds vs. the returns of the average investor in U.S. funds over the period 1994-2023.&lt;/p&gt;

    &lt;figure&gt;
    &lt;a href=&quot;/assets/images/blog/portfolio-return-dalbar-investor-gap.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/portfolio-return-dalbar-investor-gap.png&quot; alt=&quot;U.S. funds returns vs. average U.S. funds investor returns (1994-2023). Source: Dalbar.&quot; /&gt;&lt;/a&gt;
    &lt;figcaption&gt;Figure 4. U.S. funds returns vs. average U.S. funds investor returns (30-year returns, 1994-2023). Source: Dalbar.&lt;/figcaption&gt;
  &lt;/figure&gt;

    &lt;p&gt;The observed performance differential of ~2% per year creates dramatic cumulative effects over such a long period&lt;sup id=&quot;fnref:21:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:21&quot; class=&quot;footnote&quot;&gt;22&lt;/a&gt;&lt;/sup&gt;!&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The &lt;a href=&quot;https://www.morningstar.com/&quot;&gt;Morningstar’s&lt;/a&gt; &lt;a href=&quot;https://www.morningstar.com/lp/mind-the-gap&quot;&gt;Mind the Gap&lt;/a&gt; annual study compares &lt;em&gt;funds’ dollar-weighted returns [- that is, funds’ returns as experienced by their investors -] with their time-weighted returns to see how large the gap, or difference, has been over time&lt;/em&gt;&lt;sup id=&quot;fnref:23&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt; and analyses &lt;em&gt;where investors succeeded in capturing most of their funds’ returns&lt;/em&gt;&lt;sup id=&quot;fnref:23:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

    &lt;p&gt;Figure 5, directly taken from Morningstar&lt;sup id=&quot;fnref:23:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt;, illustrates that gap by U.S. funds category over the 10 years ended 31st December 2023.&lt;/p&gt;

    &lt;figure&gt;
    &lt;a href=&quot;/assets/images/blog/portfolio-return-morningstar-investor-gap.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/portfolio-return-morningstar-investor-gap.png&quot; alt=&quot; Investor return gaps by U.S. category group (10-year returns, 2013-2023). Source: Morningstar.&quot; /&gt;&lt;/a&gt;
    &lt;figcaption&gt;Figure 5. Investor return gaps by U.S. category group (10-year returns, 2013-2023). Source: Morningstar.&lt;/figcaption&gt;
  &lt;/figure&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;In this sub-section, I propose to apply the same methodology as these studies to the &lt;a href=&quot;https://www.amundietf.fr/fr/professionnels/produits/equity/amundi-msci-world-ucits-etf-eur-c/lu1681043599&quot;&gt;Amundi MSCI World UCITS ETF - EUR (C)&lt;/a&gt; over the period 30/04/2018 - 31/12/2024.&lt;/p&gt;

&lt;p&gt;In more detail:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;I will use &lt;em&gt;the month-end asset data [(AUM)] compared with the underlying total return to estimate a net inflow or outflow for [each] month&lt;/em&gt;&lt;sup id=&quot;fnref:23:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt;, using Morningstar’s approach to estimating funds’ monthly net flows&lt;sup id=&quot;fnref:25&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:25&quot; class=&quot;footnote&quot;&gt;24&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

    &lt;blockquote&gt;
      &lt;p&gt;The [net] cash flow estimate for a month $C_t$ is the difference in the beginning $NAV_{t-1}$ and ending $NAV_t$ total net assets that cannot be explained by the monthly total return $r_t$, that is, $ C_t = NAV_t - NAV_{t-1} \left( 1 + r_t \right) $.&lt;/p&gt;
    &lt;/blockquote&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;&lt;em&gt;Once all the monthly cash flows are available for the period&lt;/em&gt;&lt;sup id=&quot;fnref:23:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt;, I will calculate the associated money-weighted return - which corresponds to the average investor return in the ETF - and I will then compare it to the ETF money-weighted return&lt;sup id=&quot;fnref:26&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:26&quot; class=&quot;footnote&quot;&gt;25&lt;/a&gt;&lt;/sup&gt; - which corresponds to the ETF return itself.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;
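&lt;p&gt;The two steps above can be sketched as follows, with hypothetical AUM and return figures, and with simple bisection standing in for a proper IRR solver.&lt;/p&gt;

```python
# Sketch of the investor-gap methodology with hypothetical monthly figures.
# Step 1: estimate each month's net flow from AUM and total returns,
#         C_t = NAV_t - NAV_{t-1} * (1 + r_t).
# Step 2: take the money-weighted (average investor) return as the IRR of
#         those flows, computed here by simple bisection.

def estimate_net_flows(navs, returns):
    """navs[t] is the AUM at the end of month t; returns[t] is the total
    return of month t+1; a positive flow is a net inflow into the fund."""
    return [navs[t + 1] - navs[t] * (1.0 + returns[t]) for t in range(len(returns))]

def irr(cash_flows, lo=-0.99, hi=10.0, tol=1e-10):
    """Root of the net present value of (time, amount) cash flows, by bisection."""
    def npv(r):
        return sum(cf / (1.0 + r) ** t for t, cf in cash_flows)
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if npv(mid) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0

navs = [1000.0, 1150.0, 1000.0]  # hypothetical month-end AUM
returns = [0.10, -0.05]          # hypothetical monthly total returns

flows = estimate_net_flows(navs, returns)  # ~50 net inflow, then ~92.5 net outflow

# Aggregate investors' viewpoint: pay the starting AUM, pay (receive) each
# net inflow (outflow), receive the ending AUM.
cash_flows = [(0, -navs[0])]
cash_flows += [(t + 1, -flows[t]) for t in range(len(flows))]
cash_flows.append((len(returns), navs[-1]))

monthly_mwr = irr(cash_flows)
print(f"estimated net flows: {flows}")
print(f"monthly money-weighted (investor) return: {monthly_mwr:.2%}")
```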

&lt;blockquote&gt;
  &lt;p&gt;&lt;strong&gt;&lt;em&gt;Notes:&lt;/em&gt;&lt;/strong&gt;&lt;/p&gt;
  &lt;ul&gt;
    &lt;li&gt;Detailed calculations are available in &lt;a href=&quot;https://docs.google.com/spreadsheets/d/1dqHW6uGbrTZnv8uHCr0qHgHRXBtYLM6JJQiRbpb8TlQ/edit?usp=sharing&quot;&gt;the Google Sheet associated with this blog post&lt;/a&gt;.&lt;/li&gt;
  &lt;/ul&gt;
&lt;/blockquote&gt;

&lt;p&gt;The results are the following:&lt;/p&gt;

&lt;table&gt;
  &lt;thead&gt;
    &lt;tr&gt;
      &lt;th&gt;Portfolio return measure&lt;/th&gt;
      &lt;th&gt;Annualized return (%)&lt;/th&gt;
    &lt;/tr&gt;
  &lt;/thead&gt;
  &lt;tbody&gt;
    &lt;tr&gt;
      &lt;td&gt;Money-weighted return (AUM)&lt;/td&gt;
      &lt;td&gt;14.15%&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;Time-weighted return (AUM)&lt;/td&gt;
      &lt;td&gt;12.74%&lt;/td&gt;
    &lt;/tr&gt;
  &lt;/tbody&gt;
&lt;/table&gt;

&lt;p&gt;The table above confirms the presence of an investor gap for the considered MSCI World ETF, but, for once, to the benefit of the average investor, by ~1.4% per year!&lt;/p&gt;

&lt;p&gt;This is at odds with Morningstar’s findings&lt;sup id=&quot;fnref:23:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt; that &lt;em&gt;the gap between index ETFs’ dollar-weighted returns and their total return […] was quite a bit wider than the gap for index open-end funds&lt;/em&gt;&lt;sup id=&quot;fnref:23:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt;, but of course, the analysis of this sub-section carries no statistical significance compared to Morningstar’s.&lt;/p&gt;

&lt;p&gt;Or, maybe investors in the Amundi MSCI World UCITS ETF have specific timing abilities, who knows!&lt;/p&gt;

&lt;h2 id=&quot;conclusion&quot;&gt;Conclusion&lt;/h2&gt;

&lt;p&gt;As mentioned in the introduction, &lt;em&gt;the calculation of portfolio return is the first step in the performance measurement process&lt;/em&gt;&lt;sup id=&quot;fnref:1:21&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;Once properly done, the next step is &lt;em&gt;to know if the return is good or bad. In other words, [it is needed] to evaluate performance (risk and return), against an appropriate benchmark&lt;/em&gt;&lt;sup id=&quot;fnref:1:22&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;, which can for example be &lt;a href=&quot;/blog/random-portfolio-benchmarking-simulation-based-performance-evaluation-in-finance/&quot;&gt;random portfolios&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;For more discussions on portfolio performances, feel free to &lt;a href=&quot;https://www.linkedin.com/in/roman-rubsamen/&quot;&gt;connect with me on LinkedIn&lt;/a&gt; or to &lt;a href=&quot;https://twitter.com/portfoliooptim&quot;&gt;follow me on Twitter&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;–&lt;/p&gt;

&lt;div class=&quot;footnotes&quot; role=&quot;doc-endnotes&quot;&gt;
  &lt;ol&gt;
    &lt;li id=&quot;fn:1&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.wiley.com/en-us/Practical+Portfolio+Performance+Measurement+and+Attribution%2C+3rd+Edition-p-9781119831969&quot;&gt;Carl Bacon, Practical Portfolio Performance Measurement and Attribution, Third edition&lt;/a&gt;. &lt;a href=&quot;#fnref:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:1:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;10&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;11&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:11&quot; class=&quot;reversefootnote&quot; 
role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;12&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:12&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;13&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:13&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;14&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:14&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;15&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:15&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;16&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:16&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;17&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:17&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;18&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:18&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;19&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:19&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;20&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:20&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;21&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:21&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;22&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:1:22&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;23&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:3&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.jstor.org/stable/2629526&quot;&gt;Kenneth B. Gray, Jr. and Robert B. K. Dewar, Axiomatic Characterization of the Time-Weighted Rate of Return, Management Science, Vol. 18, No. 2, Application Series (Oct., 1971), pp. B32-B35&lt;/a&gt;. &lt;a href=&quot;#fnref:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:3:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:4&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://link.springer.com/book/10.1007/978-3-319-19812-5&quot;&gt;W. Marty, Portfolio Analytics. An Introduction to Return and Risk Measurement, Springer Texts in Business and Economics (2nd edition), Springer Berlin, 2015&lt;/a&gt;. &lt;a href=&quot;#fnref:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:5&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Internal cash flows out of the portfolio are sometimes referred to as &lt;em&gt;income&lt;/em&gt;&lt;sup id=&quot;fnref:1:23&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;&lt;sup id=&quot;fnref:2:16&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt; from the portfolio constituents. &lt;a href=&quot;#fnref:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:11&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;In contrast with the &lt;em&gt;price return&lt;/em&gt; of the portfolio, which would only take into account the price evolution of the portfolio constituents. &lt;a href=&quot;#fnref:11&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:2&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.wiley.com/en-us/Investment+Performance+Measurement-p-9780471445630&quot;&gt;Bruce J. Feibel, Investment Performance Measurement&lt;/a&gt;. &lt;a href=&quot;#fnref:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:2:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;10&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;11&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:11&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;12&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:12&quot; class=&quot;reversefootnote&quot; 
role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;13&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:13&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;14&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:14&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;15&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:15&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;16&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:16&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;17&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:6&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;As noted in Bacon&lt;sup id=&quot;fnref:1:24&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;, &lt;em&gt;the formal definition of the IRR is the discount rate that makes the net present value equal to zero in a discounted cash flow analysis&lt;/em&gt;&lt;sup id=&quot;fnref:1:25&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;; that equation is precisely the equation of the net present value in the specific case of a portfolio of assets. &lt;a href=&quot;#fnref:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:9&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;For example, the modified Dietz method is used in Interactive Brokers &lt;a href=&quot;https://www.interactivebrokers.com/en/portfolioanalyst/overview.php&quot;&gt;PortfolioAnalyst&lt;/a&gt; product to compute the money-weighted return of the investor’s portfolio. &lt;a href=&quot;#fnref:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:7&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Cf. for example &lt;a href=&quot;https://support.microsoft.com/en-us/office/xirr-function-de1242ec-6477-445b-b11b-a303ad9adc9d&quot;&gt;the documentation of Excel’s XIRR function&lt;/a&gt;. &lt;a href=&quot;#fnref:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:8&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Although Bacon&lt;sup id=&quot;fnref:1:26&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; notes that &lt;em&gt;you need to work very hard, with quite extreme data, to generate suitable examples&lt;/em&gt;&lt;sup id=&quot;fnref:1:27&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; because multiple solutions &lt;em&gt;are extremely unlikely to be experienced in practice&lt;/em&gt;&lt;sup id=&quot;fnref:1:28&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:13&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.cambridge.org/core/journals/journal-of-the-staple-inn-actuarial-society/article/abs/measurement-of-pension-fund-investment-performance/04093B956C53330BAF24A076756EE5C6&quot;&gt;Hager DP. Measurement of Pension Fund Investment Performance. Journal of the Staple Inn Actuarial Students’ Society. 1980;24:33-64&lt;/a&gt;. &lt;a href=&quot;#fnref:13&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:12&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;For Bacon&lt;sup id=&quot;fnref:1:29&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;, though, the money-weighted return &lt;em&gt;is unique and not really comparable with other portfolios enjoying a different pattern of cash flow&lt;/em&gt;&lt;sup id=&quot;fnref:1:30&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;; on this, I kindly disagree, because as an individual investor, I will for example certainly be pleased to have a higher money-weighted return than that of my father/brother in law! &lt;a href=&quot;#fnref:12&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:10&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;For example when testing for investor’s skill in timing external cash flows or when comparing &lt;a href=&quot;https://en.wikipedia.org/wiki/Dollar_cost_averaging&quot;&gt;dollar-cost averaging (DCA)&lt;/a&gt; investment strategies (weekly, monthly, quarterly…). &lt;a href=&quot;#fnref:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:15&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;That transformation is &lt;em&gt;in effect a normalised market value&lt;/em&gt;&lt;sup id=&quot;fnref:1:31&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt; transformation of the portfolio. &lt;a href=&quot;#fnref:15&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:16&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Or conversely, the time-weighted return of a portfolio can be negative while the overall portfolio is at a gain! &lt;a href=&quot;#fnref:16&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:17&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;For example, by switching to &lt;a href=&quot;https://en.wikipedia.org/wiki/Dollar_cost_averaging&quot;&gt;a dollar cost averaging&lt;/a&gt; strategy. &lt;a href=&quot;#fnref:17&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:18&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The last contribution of 100€ made on 31/12/2024 is excluded from this count, since it is invested at the end of the considered period. &lt;a href=&quot;#fnref:18&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:20&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;From 28/03/2024 to 30/04/2024, the MSCI World ETF under consideration returned -2.74%. &lt;a href=&quot;#fnref:20&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:19&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The time-weighted return is also equal to the NAV return. &lt;a href=&quot;#fnref:19&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:22&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Note that the investor gap as discussed here is not related to fund fees; for example, Morningstar&lt;sup id=&quot;fnref:23:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt; &lt;em&gt;didn’t find a strong link between fees and investor return gaps in the study&lt;/em&gt;&lt;sup id=&quot;fnref:23:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;23&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:22&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:24&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Also known as the &lt;em&gt;behaviour gap&lt;/em&gt;. &lt;a href=&quot;#fnref:24&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:21&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.qaib.com/&quot;&gt;Dalbar’s 2024 QAIB Report&lt;/a&gt;. &lt;a href=&quot;#fnref:21&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:21:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:21:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:21:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:23&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.morningstar.com/lp/mind-the-gap&quot;&gt;Morningstar’s Mind the Gap 2024 Report&lt;/a&gt;. &lt;a href=&quot;#fnref:23&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:23:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:23:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:23:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:23:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:23:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:23:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:23:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:23:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:25&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.morningstar.com/content/dam/marketing/shared/research/methodology/765555_Estimated_Net_Cash_Flow_Methodology.pdf&quot;&gt;Morningstar’s Estimated Net Cash Flow Methodology&lt;/a&gt;. &lt;a href=&quot;#fnref:25&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:26&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;As with most funds, ETF returns correspond to time-weighted returns. &lt;a href=&quot;#fnref:26&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
  &lt;/ol&gt;
&lt;/div&gt;</content><author><name>Roman R.</name></author><category term="portfolio return" /><summary type="html">Whether we manage our own investment assets or choose to hire others to manage the assets on our behalf we are keen to know how well our […] portfolio of assets is performing1 and the calculation of portfolio return is the first step in [that] performance measurement process1. Now, while the matter of measuring the rate of return of [a portfolio] appears, on the surface, to be simple enough2, the presence of external cash flows - either contributions to the portfolio or withdrawals from the portfolio - leads to the definition of several rate of returns, with no single rate of return measure [being] appropriate for every purpose2. In this blog post, strongly inspired by the book Practical Portfolio Performance Measurement and Attribution2 from Carl Bacon, I will describe the two main methods of portfolio return calculation in the presence of external cash flows. As an example of usage, I will illustrate the investor gap in the case of a MSCI World ETF, that is, I will show that the returns of that ETF are different from the returns achieved by the average investor in that ETF. Notes: A fully functional Google sheet corresponding to this post is available here External v.s. internal cash flows In the context of portfolio return calculation, two types of cash flows are usually distinguished: External cash flows, corresponding to any new money added to or taken from [a] portfolio, whether in the form of cash or other assets1 Internal cash flows, corresponding to transactions funded from within [a] portfolio1 like dividend and coupon payments, free shares attributed by companies, positions rebalancing, etc. In other words, external cash flows impact the valuation of a portfolio other than by the growth (positive or negative) of the funds invested in that portfolio while internal cash flows have no impact on the portfolio valuation. 
Portfolio return in the absence of external cash flows Let: $T$, the number of observations of the value of a portfolio over a time period $1..T$, with $t_1 &amp;lt; t_2 &amp;lt; … &amp;lt; t_T$ the $T$ observation times $V_1$ the value of the portfolio at the initial observation time $t_1$ $V_T$ the value of the portfolio at the final observation time $t_T$ When no external cash flow occurred over a time period, the return $r_p$ of a portfolio over that period is defined as the change of the portfolio value relative to its beginning value3. Mathematically, using the notations above, this gives: \[r_p = \frac{V_T - V_1}{V_1} = \frac{V_T}{V_1} - 1\] $r_p$ is called the simple rate of return or the arithmetic rate of return of the portfolio over the time period $1..T$. Note that, by definition, the arithmetic return of a portfolio over a time period is supposed to take into account any internal cash flow out of the portfolio constituents4 (dividends for stocks, coupon payments for bonds…); for this reason, the arithmetic return of a portfolio is also sometimes called the total (arithmetic) return of the portfolio5. 
Portfolio return in the presence of external cash flows Now let: $T$, the number of observations of the value of a portfolio over a time period $1..T$, with $t_1 &amp;lt; t_2 &amp;lt; … &amp;lt; t_T$ the $T$ observation times $V_i$ the value of the portfolio at the observation time $t_i$, $i=1..T$ $C_i$ the value of a potential external cash flow ($C_i &amp;gt; 0$ for a contribution, $C_i &amp;lt; 0$ for a withdrawal) at the observation time $t_i$, $i=1..T$, assumed to occur immediately after the observation time $t_i$ (i.e., the cash flow $C_i$ is assumed to be excluded from the portfolio value $V_i$) When an external cash flow occurred over a time period, the cash flow itself will contribute to the [portfolio] valuation1, so that the calculation of [the portfolio return] […] must compensate for the fact that the increase in market value is not entirely due to investment gain[/loss] during the period6. Since at least 19681, there are two main measures of such a compensated portfolio return: The portfolio money-weighted return (MWR), which integrates the timing and the amount of the external cash flows in the portfolio return, leading to a measure of portfolio return that includes the impacts of both the decisions to contribute money in (resp. withdraw money from) the portfolio and the decisions about asset allocation and security selection6. The portfolio time-weighted return (TWR), which eliminates the impact of the external cash flows from the portfolio return, leading to a measure of portfolio return that isolates the decisions about asset allocation and security selection6. Money-weighted return The money-weighted return of a portfolio is a performance statistic reflecting how much money was earned during the measurement period6 and is thus representative of the return an investor actually experiences6. 
As such, the money-weighted return is influenced by both: The decisions made by the [portfolio] manager [- who can be the investor herself] - to allocate assets and select securities within the portfolio6 The timing of [the investor’s] decisions to contribute to or withdraw money from [the portfolio]6, with the different cases being summarized in Figure 1, taken from Feibel6. Figure 1. Impact of external cash flows on money-weighted portfolio performances. Source: Feibel. Calculation To borrow a methodology used throughout finance1, the money-weighted return of a portfolio corresponds to the internal rate of return (IRR) of that portfolio, defined as the rate of return which reconciles the beginning market value and additional cash flows into the portfolio to the ending market value6. Internal rate of return method Mathematically, the calculation of the (annualized) money-weighted return $r_{mw,irr}$ of a portfolio over a time period through the IRR method is done by solving what is usually called the IRR equation7. Using the notations of this section, that equation is: \[V_1 - \sum_{i=1}^{T} \frac{C_i}{ \left(1 + r_{mw,irr}\right)^{yearfrac(t_i, t_1)}} - \frac{V_T}{ \left(1 + r_{mw,irr}\right)^{yearfrac(t_T, t_1)}} = 0\] , where $yearfrac$ is the number of (fractional) years between two dates using a given day count convention. Numerically, the IRR equation is usually solved by an iterative algorithm, like the Newton-Raphson method or similar other methods. Modified Dietz method The calculation of $r_{mw,irr}$ described in the previous sub-section was a problem when computer CPU time was very expensive and needed to be conserved1, which led to the development of various IRR estimation techniques that did not require [an] iterative algorithm1. One of these approximations - the modified Dietz method - is a first-order [closed-form] linear approximation to the IRR1 method and is the most common8 way of calculating periodic investment returns1. 
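The IRR equation above can be solved numerically as described; here is a minimal Python sketch (function name and sign conventions are mine, not the post's): all flows are signed from the investor's point of view - the initial value $V_1$ and contributions as outflows, withdrawals and the final value $V_T$ as inflows - and the equivalent net present value equation is solved with a fixed number of Newton-Raphson iterations.

```python
def money_weighted_return(flows, times, r0=0.1, iterations=50):
    # flows: signed external cash flows from the investor's viewpoint,
    # e.g. [-V_1, -C_1, ..., -C_T, +V_T]; times: year fractions from t_1.
    # Solves sum_i flows[i] * (1 + r) ** (-times[i]) = 0
    # with a fixed number of Newton-Raphson iterations.
    r = r0
    for _ in range(iterations):
        f = sum(c * (1.0 + r) ** (-t) for c, t in zip(flows, times))
        df = sum(-t * c * (1.0 + r) ** (-t - 1.0) for c, t in zip(flows, times))
        r = r - f / df
    return r
```

For example, with flows [-1000, 1100] at times [0, 1] years - a 1000€ investment growing to 1100€ in one year with no interim flows - the function returns an annualized money-weighted return of 10%.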
Mathematically, the calculation of the (total) money-weighted return $r_{mw,md}$ of a portfolio over a time period through the modified Dietz method is done by incorporating “weighted” external cash flows into the formula of the simple rate of return. Using the notations of this section, this gives: \[r_{mw,md} = \frac{V_T - V_1 - \sum_{i=1}^T C_i}{V_1 + \sum_{i=1}^T \left( 1 - \frac{yearfrac(t_1, t_i)}{yearfrac(t_1, t_T)} \right) C_i }\] , where the terms $ 1 - \frac{yearfrac(t_1, t_i)}{yearfrac(t_1, t_T)} $, $i=1..T$, are used to weight the [external cash] flows by the number of days they were available for investment during the period1. Caveats The two money-weighted return calculation methods discussed in the previous sub-sections have certain limitations to be aware of: The IRR method provides no theoretical guarantee in general9 as to the existence or the uniqueness of an internal rate of return. Indeed, the IRR equation might not have any solution or might have several solutions10. In addition, even when the IRR equation has a unique solution, it might be numerically very hard to find… The modified Dietz method has its own issues, for example the accuracy of the result [being] dependent on relatively small capital flows, low volatility and frequent valuations11. In particular, a very important point of attention is that the modified Dietz [method] is less useful as an approximation to [IRR] over longer time periods1. When to use it? Calculating the money-weighted return of a portfolio allows to answer the question How much did a specific investor’s portfolio grow, thanks to both its underlying strategy and its pattern of external cash flows? 
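As an illustration, the modified Dietz formula above translates directly into code; a minimal Python sketch (function name and argument conventions are illustrative, with day counts passed in as plain year fractions):

```python
def modified_dietz_return(v_start, v_end, flows, times, period):
    # v_start, v_end: portfolio values V_1 and V_T; flows: external cash
    # flows C_i (positive for contributions, negative for withdrawals);
    # times: year fractions yearfrac(t_1, t_i); period: yearfrac(t_1, t_T).
    total_flows = sum(flows)
    # Weight each flow by the fraction of the period it was invested for.
    weighted_flows = sum(c * (1.0 - t / period) for c, t in zip(flows, times))
    return (v_end - v_start - total_flows) / (v_start + weighted_flows)
```

For example, with v_start = 1000, a single contribution of 100 halfway through a one-year period and v_end = 1200, the estimated money-weighted return is 100/1050, or about 9.52%.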
Thus, the money-weighted return of a portfolio is an appropriate measure of portfolio performance when it is desired to: Compute the rate of return earned on a specific portfolio Compare the rate of return earned on a specific portfolio to another rate (rate of inflation, rate of return earned on another portfolio12, rate of return earned on an alternative investment like real estate or private equity…) Analyse the pattern of external cash flows for a specific portfolio13 Time-weighted return The time-weighted return of a portfolio is a form of total return that measures the performance […] for the complete measurement period6 in a way that fully eliminates the timing effect that external portfolio cash flows have6. So, contrary to its money-weighted counterpart, the time-weighted return of a portfolio measures only the effects of the market and manager decision6. As a side note, the time-weighted return of a portfolio is the only well-behaved rate of return that is not influenced by contributions or withdrawals2, c.f. for example Gray and Dewar2. Calculation The calculation of the time-weighted return of a portfolio over a time period is easily done through the unit price or unitised method1, described in Bacon1: A standardised unit price or “net asset value” price is calculated immediately before each external cash flow by dividing the market value by the number of units previously allocated. Units are then added or subtracted (bought or sold) in the portfolio at the unit price corresponding to the time of the cash flow […]. 
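The unit price procedure just quoted can be sketched in a few lines of Python (hypothetical function name; following this post's convention, each external cash flow C_i is assumed to occur immediately after the valuation V_i):

```python
def time_weighted_return(values, flows):
    # values: portfolio valuations V_1..V_T, each taken just before the
    # corresponding external cash flow; flows: cash flows C_1..C_{T-1},
    # each assumed to occur immediately after its valuation.
    u = 1.0  # unit price U_1, normalised to 1 at the first valuation
    for i in range(len(values) - 1):
        # U_{i+1} = U_i * V_{i+1} / (V_i + C_i): growth net of the flow
        u = u * values[i + 1] / (values[i] + flows[i])
    return u - 1.0  # r_tw = U_T / U_1 - 1
```

For example, with valuations [100, 220] and a single contribution of 100 right after the first valuation, the time-weighted return is 220/200 - 1 = 10%, even though the simple return would be 120%.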
Mathematically, using the notations of this section, let $U_i$, $i=1..T$, correspond to the following transformation14 of $V_i$, $i=1..T$: \[U_1 = 1 \newline U_{i+1} = U_i \frac{V_{i+1}}{V_i + C_i}, i=1..T-1\] The (total) time-weighted return $r_{tw}$ of the portfolio over the time period $1..T$ is then defined as the simple rate of return of the transformed portfolio values $U_i$, $i=1..T$: \[r_{tw} = \frac{U_T - U_1}{U_1} = \frac{U_T}{U_1} - 1\] Caveats The time-weighted return has two main limitations to be aware of: One is practical, with Feibel6 noting that there is […] a potentially significant hurdle to implementing this method: the time-weighted return methodology requires valuation of the portfolio before each cash flow6. For an individual investor, and depending on the exact tools used, this might or might not be a real issue though. One is methodological, with the time-weighted return of a portfolio being sometimes positive while the overall portfolio is at a loss15! Bacon1 and Feibel6 provide examples of such cases, the bottom line being that it is important [for the portfolio manager] to perform well in the […] period[s] when the majority of client money is invested1 with the time-weighted return calculation. When to use it? Calculating the time-weighted return of a portfolio allows to answer the question How much did a specific investor’s portfolio grow thanks to its underlying strategy (asset allocation, exposure…)? The time-weighted return of a portfolio is thus an appropriate measure of portfolio performance when it is desired to compar[e] the performance of [a portfolio with] different asset managers […] and with benchmark indexes1. Hybrid return Bacon1 notes that in practice, many asset managers use neither true time-weighted nor money-weighted calculations exclusively but rather a hybrid combination of both1, some of which are depicted in Figure 2, taken from Bacon1. Figure 2. The evolution of performance returns methodologies. 
Source: Bacon. Money-weighted v.s. time-weighted return To conclude this section, Figure 3, taken from Feibel6, summarizes the respective properties of money-weighted and time-weighted returns. Figure 3. Properties of money-weighted and time-weighted returns. Source: Feibel. To that summary, I would add that for an individual investor, comparing the money-weighted return and the time-weighted return of a portfolio allows to determine whether the timing of contributions and withdrawals has been successful ($r_{mw} \geq r_{tw}$) or not ($r_{mw} \leq r_{tw}$). In the latter case, it might be interesting to review16 the external cash flows timing strategy (panic sells, FOMO buys…). Implementation in Portfolio Optimizer Portfolio Optimizer allows to: Compute the money-weighted and the time-weighted return of a portfolio over a time period, through the endpoints /portfolios/analysis/return/money-weighted and /portfolios/analysis/return/time-weighted. Transform portfolio values impacted by external cash flows into money-weighted or time-weighted portfolio values directly usable to compute portfolio performance indicators (return, Sharpe ratio, maximum drawdown…), through the endpoints /portfolios/transformation/money-weighted and /portfolios/transformation/time-weighted. Examples of usage I propose to illustrate the differences between money-weighted returns and time-weighted returns through two examples: Comparing a dollar cost averaging investment strategy in an ETF to a lump sum investment strategy in the same ETF Comparing the returns of an ETF to the returns of an “average” investor in that ETF Dollar cost averaging v.s. 
lump sum investing in an MSCI World ETF Let’s suppose that we would like to compare two investment strategies in the Amundi MSCI World UCITS ETF - EUR (C) over the period 29/12/2023 - 31/12/2024: Dollar cost averaging (DCA) investing Portfolio creation - 1000€ on 29/12/2023 Portfolio contributions - 100€ at each month’s end Lump sum (LS) investing Portfolio creation - 2200€ on 29/12/2023 Portfolio contributions - n/a In both cases, the amount of external portfolio cash flows is the same17 - equal to 2200€ - but the patterns of these cash flows are different, leading to different portfolio returns: Portfolio return measure Total return (%) Simple return (DCA) 161.42% Money-weighted return (DCA) 25.45% Time-weighted return (DCA) 26.33% Simple return (LS) 26.33% Money-weighted return (LS) 26.33% Time-weighted return (LS) 26.33% Notes: Detailed calculations are available in the Google Sheet associated to this blog post. The table above empirically confirms that: In the absence of external cash flows, all portfolio return measures discussed in this blog post become equal (lines #4-#6) In the presence of external cash flows The simple return is not an appropriate measure of portfolio return (line #1) Despite this, and at the date of publication of this blog post, there are still some financial services using that measure to compute portfolio returns in the presence of external cash flows, like Trading 212, c.f. the associated discussion on their forum. Usually, it is because of data availability problems like difficulties in identifying external cash flows, but sometimes it is simply because providing accurate portfolio performance measures is not a priority… The money-weighted return is impacted by the timing of external cash flows (line #2) Here, there is a ~-1% difference between the portfolio money-weighted and time-weighted returns, mainly due to the ill-timed18 contribution made on 28/03/2024. 
Such a difference is relatively insignificant over one year, but depending on the pattern of external cash flows, it could be much worse. For example, if a single contribution of 1200€ on 28/03/2024 was made instead of monthly contributions of 100€, the associated portfolio money-weighted return would decrease to 22.59%, representing this time a ~-4% difference! The time-weighted return is not impacted by external cash flows19 and is akin to a “lump sum investing”-equivalent return (line #3 is equal to lines #4-#6) Investment returns v.s. average investor returns in an MSCI World ETF Financial studies regularly show that investors tend to lag actual fund returns across a variety of asset classes, leading to what is usually called the investor gap2021. For example: The Dalbar’s Quantitative Analysis of Investor Behavior (QAIB) annual study measure[s] the effects of investor decisions to buy, sell and switch into and out of funds over short and long-term timeframes22 and the results consistently show that the average investor earns less – in many cases, much less – than mutual fund performance reports would suggest22. For the U.S., Figure 4, adapted from Dalbar22, depicts the returns of U.S. funds v.s. the returns of the average investor in U.S. funds over the period 1994-2023. Figure 4. U.S. funds returns v.s. average U.S. funds investor returns (30-year returns, 1994-2023). Source: Dalbar. The observed performance differential of ~2% per year creates dramatic cumulative effects over such a long period22! The Morningstar’s Mind the Gap annual study compares funds’ dollar-weighted returns [- that is, funds’ returns as experienced by their investors -] with their time-weighted returns to see how large the gap, or difference, has been over time23 and analyses where investors succeeded in capturing most of their funds’ returns23. Figure 5, directly taken from Morningstar23, illustrates that gap by U.S. funds category over the 10 years ended 31st December 2023. Figure 5. 
Investor return gaps by U.S. category group (10-year returns, 2013-2023). Source: Morningstar. In this sub-section, I propose to apply the same methodology as these studies to the Amundi MSCI World UCITS ETF - EUR (C) over the period 30/04/2018 - 31/12/2024. In more detail: I will use the month-end asset data [(AUM)] compared with the underlying total return to estimate a net inflow or outflow for [each] month23, using Morningstar’s approach to estimating funds’ monthly net flows24: The [net] cash flow estimate for a month $C_t$ is the difference in the beginning $NAV_{t-1}$ and ending $NAV_t$ total net assets that cannot be explained by the monthly total return $r_t$, that is, $ C_t = NAV_t - NAV_{t-1} \left( 1 + r_t \right) $. Once all the monthly cash flows are available for the period23, I will calculate the associated money-weighted return - which corresponds to the average investor return in the ETF - and I will then compare it to the ETF time-weighted return25 - which corresponds to the ETF return itself. Notes: Detailed calculations are available in the Google Sheet associated to this blog post. Results are the following: Portfolio return measure Annualized return (%) Money-weighted return (AUM) 14.15% Time-weighted return (AUM) 12.74% The table above confirms the presence of an investor gap for the considered MSCI World ETF, but for once, to the benefit (of around ~1.5% yearly) of the average investor! This is at odds with Morningstar’s findings23 that the gap between index ETFs’ dollar-weighted returns and their total return […] was quite a bit wider than the gap for index open-end funds23, but of course, the analysis of this sub-section has no statistical significance compared to that of Morningstar. Or, maybe investors in the Amundi MSCI World UCITS ETF have specific timing abilities, who knows! Conclusion As mentioned in the introduction, the calculation of portfolio return is the first step in the performance measurement process1. 
Once properly done, the next step is to know if the return is good or bad. In other words, [it is needed] to evaluate performance (risk and return), against an appropriate benchmark1, which can for example be random portfolios. For more discussions on portfolio performance, feel free to connect with me on LinkedIn or to follow me on Twitter. – See Carl Bacon, Practical Portfolio Performance Measurement and Attribution, Third edition. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 &amp;#8617;7 &amp;#8617;8 &amp;#8617;9 &amp;#8617;10 &amp;#8617;11 &amp;#8617;12 &amp;#8617;13 &amp;#8617;14 &amp;#8617;15 &amp;#8617;16 &amp;#8617;17 &amp;#8617;18 &amp;#8617;19 &amp;#8617;20 &amp;#8617;21 &amp;#8617;22 &amp;#8617;23 See Kenneth B. Gray, Jr. and Robert B. K. Dewar, Axiomatic Characterization of the Time-Weighted Rate of Return, Management Science, Vol. 18, No. 2, Application Series (Oct., 1971), pp. B32-B35. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 See W. Marty, Portfolio Analytics. An Introduction to Return and Risk Measurement, Springer Texts in Business and Economics (2nd edition), Springer Berlin, 2015. &amp;#8617; Internal cash flows out of the portfolio are sometimes referred to as income16 from the portfolio constituents. &amp;#8617; In contrast with the price return of the portfolio, which would only take into account the price evolution of the portfolio constituents. &amp;#8617; See Bruce J. Feibel, Investment Performance Measurement. 
&amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 &amp;#8617;7 &amp;#8617;8 &amp;#8617;9 &amp;#8617;10 &amp;#8617;11 &amp;#8617;12 &amp;#8617;13 &amp;#8617;14 &amp;#8617;15 &amp;#8617;16 &amp;#8617;17 As noted in Bacon1, the formal definition of the IRR is the discount rate that makes the net present value equal to zero in a discounted cash flow analysis1; that equation is precisely the equation of the net present value in the specific case of a portfolio of assets. &amp;#8617; For example, the modified Dietz method is used in Interactive Brokers PortfolioAnalyst product to compute the money-weighted return of the investor’s portfolio. &amp;#8617; C.f. for example the documentation of Excel’s XIRR function. &amp;#8617; Although Bacon1 notes that you need to work very hard, with quite extreme data, to generate suitable examples1 because multiple solutions are extremely unlikely to be experienced in practice1. &amp;#8617; See Hager DP. Measurement of Pension Fund Investment Performance. Journal of the Staple Inn Actuarial Students’ Society. 1980;24:33-64. &amp;#8617; For Bacon1, though, the money-weighted return is unique and not really comparable with other portfolios enjoying a different pattern of cash flow1; on this, I respectfully disagree: as an individual investor, I would certainly be pleased to have a higher money-weighted return than that of, say, my father- or brother-in-law! &amp;#8617; For example, when testing for an investor’s skill in timing external cash flows, or when comparing dollar-cost averaging (DCA) investment strategies (weekly, monthly, quarterly…). &amp;#8617; That transformation is in effect a normalised market value1 transformation of the portfolio. &amp;#8617; Or conversely, with the time-weighted return of a portfolio being sometimes negative while the overall portfolio is at a gain! &amp;#8617; For example, by switching to a dollar cost averaging strategy. 
&amp;#8617; The last contribution of 100€ made on 31/12/2024 is excluded from this count, since it is invested at the end of the considered period. &amp;#8617; From 28/03/2024 to 30/04/2024, the MSCI World ETF under consideration returned -2.74%. &amp;#8617; The time-weighted return is also equal to the NAV return. &amp;#8617; Note that the investor gap as discussed here is not related to fund fees; for example, Morningstar23 didn’t find a strong link between fees and investor return gaps in the study23. &amp;#8617; Also known as the behaviour gap. &amp;#8617; See Dalbar’s 2024 QAIB Report. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 See Morningstar’s Mind the Gap 2024 Report. &amp;#8617; &amp;#8617;2 &amp;#8617;3 &amp;#8617;4 &amp;#8617;5 &amp;#8617;6 &amp;#8617;7 &amp;#8617;8 &amp;#8617;9 See Morningstar’s Estimated Net Cash Flow Methodology. &amp;#8617; As with most funds, ETF returns correspond to time-weighted returns. &amp;#8617;
simple and the exponentially weighted moving average covariance matrix forecasting models, 
which are straightforward extensions of &lt;a href=&quot;/blog/volatility-forecasting-simple-and-exponentially-weighted-moving-average-models/&quot;&gt;their respective univariate volatility forecasting models&lt;/a&gt; to a multivariate setting.&lt;/p&gt;

&lt;p&gt;With these reference models established, &lt;em&gt;we can now delve into more sophisticated approaches for forecasting covariance matrices&lt;/em&gt;&lt;sup id=&quot;fnref:15&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:15&quot; class=&quot;footnote&quot;&gt;1&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In this blog post, I will describe the &lt;em&gt;iterated exponentially weighted moving average (IEWMA) model&lt;/em&gt; that has recently&lt;sup id=&quot;fnref:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:4&quot; class=&quot;footnote&quot;&gt;2&lt;/a&gt;&lt;/sup&gt; been introduced in Johansson et al.&lt;sup id=&quot;fnref:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; 
and I will illustrate its empirical performance in the context of monthly covariance matrix forecasting for a multi-asset class ETF portfolio.&lt;/p&gt;

&lt;h2 id=&quot;mathematical-preliminaries&quot;&gt;Mathematical preliminaries&lt;/h2&gt;

&lt;h3 id=&quot;covariance-matrix-modelling-and-covariance-proxies-reminders&quot;&gt;Covariance matrix modelling and covariance proxies (reminders)&lt;/h3&gt;

&lt;p&gt;This sub-section contains reminders from a &lt;a href=&quot;/blog/from-volatility-forecasting-to-covariance-matrix-forecasting-the-return-of-simple-and-exponentially-weighted-moving-average-models/&quot;&gt;previous blog post&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Let $n$ be the number of assets in a universe of assets and $r_t \in \mathbb{R}^n$ be the vector of the (unknown) (&lt;a href=&quot;https://en.wikipedia.org/wiki/Rate_of_return#Logarithmic_or_continuously_compounded_return&quot;&gt;logarithmic&lt;/a&gt;) 
return process of these assets over a time period $t$ (a day, a week, a month...), over which the (conditional) mean return vector $\mu_t \in \mathbb{R}^n$ of these assets is assumed to be zero.&lt;/p&gt;

&lt;p&gt;Then:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;$r_t$ can be expressed as&lt;sup id=&quot;fnref:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:10&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt; $r_t = \epsilon_t$, with $\epsilon_t \in \mathbb{R}^n$ an unpredictable error term, often referred to as a vector of “shocks” or as a vector of “random disturbances”&lt;sup id=&quot;fnref:10:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:10&quot; class=&quot;footnote&quot;&gt;4&lt;/a&gt;&lt;/sup&gt;, over the time period $t$&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The asset (conditional) covariance matrix $\Sigma_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ over that time period $t$ is defined as $\Sigma_t = \mathbb{E} \left[ r_t r_t {}^t \right]$&lt;/p&gt;

    &lt;p&gt;From this definition, the &lt;a href=&quot;https://en.wikipedia.org/wiki/Outer_product&quot;&gt;outer product&lt;/a&gt; of the realized asset returns $ \tilde{r}_t \tilde{r}_t {}^t $ over a time period $t$ (a day, a week, a month...) is a covariance estimate $\tilde{\Sigma}_t$ - or covariance proxy&lt;sup id=&quot;fnref:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:9&quot; class=&quot;footnote&quot;&gt;5&lt;/a&gt;&lt;/sup&gt; - for the (unknown) asset returns covariance matrix over the considered time period $t$.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;The asset (conditional) correlation matrix $C_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ is defined as $ C_t = V_t^{-1} \Sigma_t V_t^{-1} $, where $V_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ is the diagonal matrix of the asset (conditional) standard deviations.&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;correlation-matrix-modelling&quot;&gt;Correlation matrix modelling&lt;/h3&gt;

&lt;p&gt;In order to &lt;em&gt;clarify the relation between conditional correlations and conditional variances&lt;/em&gt;&lt;sup id=&quot;fnref:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;, Engle&lt;sup id=&quot;fnref:2:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt; proposes to &lt;em&gt;write the [asset] returns as the conditional standard deviation 
times the standardized disturbance&lt;/em&gt;&lt;sup id=&quot;fnref:2:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;, that is&lt;/p&gt;

\[r_{i,t} = \epsilon_{i,t} = \sigma_{i,t} \varepsilon_{i,t}, i=1..n\]

&lt;p&gt;with:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$\sigma_{i,t} = \sqrt{ \mathbb{E} \left[ r_{i,t}^2 \right] } $&lt;/li&gt;
  &lt;li&gt;$\varepsilon_{i,t}$ a &lt;em&gt;standardized disturbance that has mean zero and variance one&lt;/em&gt;&lt;sup id=&quot;fnref:2:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This way, &lt;em&gt;the conditional correlation [between asset $i$ and asset $j$ becomes equal to] the conditional covariance between the standardized disturbances [$\varepsilon_{i,t}$ and $\varepsilon_{j,t}$]&lt;/em&gt;&lt;sup id=&quot;fnref:2:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;&lt;/p&gt;

\[\rho_{ij,t} = \frac{\mathbb{E} \left[ r_{i,t} r_{j,t} \right]}{\sqrt{ \mathbb{E} \left[ r_{i,t}^2 \right] \mathbb{E} \left[ r_{j,t}^2 \right] }} = \mathbb{E} \left[ \varepsilon_{i,t} \varepsilon_{j,t} \right]\]
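
&lt;p&gt;This identity can be checked numerically. The following standalone NumPy sketch (all names and parameter values are illustrative, not taken from any of the referenced papers) simulates returns $r_{i,t} = \sigma_{i,t} \varepsilon_{i,t}$ with known time-varying volatilities and correlated standardized disturbances: correlating the volatility-standardized returns recovers the disturbances’ correlation, while correlating the raw returns does not.&lt;/p&gt;

```python
import numpy as np

# Monte Carlo check: the conditional correlation between asset returns equals
# the covariance between the standardized disturbances. Volatilities here vary
# over time, so the raw-return sample correlation is biased, while the
# correlation of volatility-standardized returns recovers the true rho.
rng = np.random.default_rng(42)
T, rho = 200_000, 0.6
L = np.linalg.cholesky(np.array([[1.0, rho], [rho, 1.0]]))
eps = rng.standard_normal((T, 2)) @ L.T      # standardized disturbances, correlation rho
sigma = np.exp(rng.standard_normal((T, 2)))  # arbitrary time-varying volatilities
r = sigma * eps                              # returns r_{i,t} = sigma_{i,t} * eps_{i,t}

cov_eps = np.mean(eps[:, 0] * eps[:, 1])     # E[eps_1 eps_2], close to rho
corr_std = np.corrcoef((r / sigma).T)[0, 1]  # correlation of standardized returns
corr_raw = np.corrcoef(r.T)[0, 1]            # biased toward zero
```

&lt;p&gt;Under these assumptions, the raw-return correlation understates the true correlation, which is precisely why standardizing returns by volatility before estimating correlations is attractive.&lt;/p&gt;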

&lt;h2 id=&quot;the-iterated-exponentially-weighted-moving-average-covariance-matrix-forecasting-model&quot;&gt;The iterated exponentially weighted moving average covariance matrix forecasting model&lt;/h2&gt;

&lt;p&gt;The IEWMA covariance matrix forecasting model&lt;sup id=&quot;fnref:3:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; is a two-step model that:&lt;/p&gt;
&lt;ol&gt;
  &lt;li&gt;Uses &lt;a href=&quot;/blog/volatility-forecasting-simple-and-exponentially-weighted-moving-average-models/&quot;&gt;an EWMA volatility forecasting model&lt;/a&gt; with squared asset returns as variance proxies in order to forecast asset volatilities&lt;/li&gt;
  &lt;li&gt;Uses &lt;a href=&quot;/blog/from-volatility-forecasting-to-covariance-matrix-forecasting-the-return-of-simple-and-exponentially-weighted-moving-average-models/&quot;&gt;an EWMA correlation matrix forecasting model&lt;/a&gt; with outer products of (EWMA-)volatility-standardized asset returns as covariance matrix proxies in order to forecast asset correlations&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Johansson et al.&lt;sup id=&quot;fnref:3:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; highlights that the IEWMA model was originally&lt;sup id=&quot;fnref:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:5&quot; class=&quot;footnote&quot;&gt;7&lt;/a&gt;&lt;/sup&gt; proposed in Engle&lt;sup id=&quot;fnref:2:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt; &lt;em&gt;as an efficient alternative to the DCC-GARCH&lt;sup id=&quot;fnref:2:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt; predictor, although he did not refer to it as IEWMA&lt;/em&gt;&lt;sup id=&quot;fnref:2:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;In other words, the IEWMA covariance matrix forecasting model bridges the gap between the simple EWMA model and the more complex DCC-GARCH model.&lt;/p&gt;

&lt;h3 id=&quot;forecasting-formulas&quot;&gt;Forecasting formulas&lt;/h3&gt;

&lt;p&gt;Let:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$n$ be the number of assets in a universe of assets&lt;/li&gt;
  &lt;li&gt;$r_t r_t {}^t$, $t=1..T$, be the outer products of the realized asset returns over each of $T$ past periods&lt;/li&gt;
  &lt;li&gt;$\lambda_{vol} \in [0, 1]$ be a decay factor for volatilities&lt;/li&gt;
  &lt;li&gt;$\lambda_{cor} \in [0, 1]$ be a decay factor for correlations&lt;/li&gt;
&lt;/ul&gt;

&lt;h4 id=&quot;next-periods-asset-returns-covariancecorrelation-matrix&quot;&gt;Next period’s asset returns covariance/correlation matrix&lt;/h4&gt;

&lt;p&gt;The IEWMA covariance matrix forecasting model estimates the next period’s asset returns covariance matrix $\hat{\Sigma}_{T+1}$ and correlation matrix $\hat{C}_{T+1}$ as follows&lt;sup id=&quot;fnref:3:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;For each asset $i=1..n$ in the universe of assets
    &lt;ul&gt;
      &lt;li&gt;Forecast the asset one-period-ahead variances $\hat{\sigma}^2_{i,2}$, …, $\hat{\sigma}^2_{i,T+1}$ using an EWMA volatility forecasting model with decay factor $\lambda_{vol}$ and squared asset returns $r_{i,t}^2$, $t=1..T$, as variance proxies&lt;/li&gt;
      &lt;li&gt;
        &lt;p&gt;Compute the volatility-standardized asset returns $\tilde{r}_{i,2},…,\tilde{r}_{i,T}$ defined as the asset returns standardized by their one-period-ahead EWMA volatility forecasts&lt;/p&gt;

\[\tilde{r}_{i,t} = \frac{r_{i,t}}{\hat{\sigma}_{i,t}}, t = 2..T\]
      &lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Compute the diagonal matrix of the assets next period’s forecasted volatilities $V_{T+1}$&lt;/p&gt;

\[V_{T+1} = \begin{pmatrix}
                \hat{\sigma}_{1,T+1} &amp;amp; 0 &amp;amp; ... &amp;amp; 0 \\
                0 &amp;amp; \hat{\sigma}_{2,T+1} &amp;amp; ... &amp;amp; 0 \\
                ... &amp;amp; ... &amp;amp; ... &amp;amp; ... \\
                0 &amp;amp; 0 &amp;amp; ... &amp;amp; \hat{\sigma}_{n,T+1}
             \end{pmatrix}\]
  &lt;/li&gt;
  &lt;li&gt;Forecast the next period’s asset returns correlation matrix $\hat{C}_{T+1}$ using an EWMA covariance matrix forecasting model with decay factor $\lambda_{cor}$ and outer products of volatility-standardized asset returns $\tilde{r_t} \tilde{r_t} {}^t$, $t=2..T$, as covariance matrix proxies&lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Compute the next period’s forecasted asset returns covariance matrix $\hat{\Sigma}_{T+1}$&lt;/p&gt;

\[\hat{\Sigma}_{T+1} = V_{T+1} \hat{C}_{T+1} V_{T+1}\]
  &lt;/li&gt;
&lt;/ul&gt;
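
&lt;p&gt;The steps above can be sketched in a few lines of self-contained NumPy code - a minimal illustration under simplifying assumptions; in particular, the EWMA recursions are initialized with the first observation, which is an arbitrary choice of mine, not necessarily the one made in Johansson et al. or in &lt;strong&gt;Portfolio Optimizer&lt;/strong&gt;:&lt;/p&gt;

```python
import numpy as np

def ewma_variance_forecasts(r, lam):
    """One-period-ahead EWMA variance forecasts from squared returns r_t^2,
    t=1..T; entry t-1 of the output holds the forecast for period t+1.
    The recursion is crudely initialized with the first squared return."""
    T = len(r)
    var = np.empty(T)
    var[0] = r[0] ** 2
    for t in range(1, T):
        var[t] = lam * var[t - 1] + (1 - lam) * r[t] ** 2
    return var

def iewma_forecast(R, lam_vol, lam_cor):
    """IEWMA forecast of the next period's covariance and correlation
    matrices from a (T, n) matrix R of asset returns."""
    T, n = R.shape
    # Step 1: per-asset EWMA volatility forecasts (shared decay factor)
    var = np.column_stack([ewma_variance_forecasts(R[:, i], lam_vol) for i in range(n)])
    sigma = np.sqrt(var)
    # Volatility-standardized returns: r_{i,t} divided by the volatility
    # forecast made for period t (sigma[t-1] is the forecast for period t+1)
    R_std = R[1:] / sigma[:-1]
    # Step 2: EWMA on outer products of standardized returns
    C = np.outer(R_std[0], R_std[0])
    for t in range(1, R_std.shape[0]):
        C = lam_cor * C + (1 - lam_cor) * np.outer(R_std[t], R_std[t])
    # Normalize to a proper correlation matrix (unit diagonal)
    d = 1.0 / np.sqrt(np.diag(C))
    C_hat = d[:, None] * C * d[None, :]
    # Recombine: Sigma_hat = V C_hat V
    V = np.diag(sigma[-1])
    return V @ C_hat @ V, C_hat
```

&lt;p&gt;For example, calling &lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;iewma_forecast(R, 0.94, 0.97)&lt;/code&gt; on a $T \times n$ matrix of returns returns the pair $\left( \hat{\Sigma}_{T+1}, \hat{C}_{T+1} \right)$.&lt;/p&gt;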

&lt;h4 id=&quot;next-h-periods-ahead-asset-returns-covariancecorrelation-matrix&quot;&gt;Next $h$ periods ahead asset returns covariance/correlation matrix&lt;/h4&gt;

&lt;p&gt;The IEWMA covariance matrix forecasting model estimates the asset returns covariance matrix $\hat{\Sigma}_{T+h}$ and correlation matrix $\hat{C}_{T+h}$ $h$ periods ahead, $h \geq 2$, 
by the next period’s asset returns covariance matrix $\hat{\Sigma}_{T+1}$ and correlation matrix $\hat{C}_{T+1}$.&lt;/p&gt;

&lt;p&gt;Indeed, due to the properties of the EWMA volatility and covariance matrix forecasting models&lt;sup id=&quot;fnref:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:6&quot; class=&quot;footnote&quot;&gt;8&lt;/a&gt;&lt;/sup&gt;, we have:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$V_{T+h} = V_{T+1}$, $h \geq 2$&lt;/li&gt;
  &lt;li&gt;$\hat{C}_{T+h} = \hat{C}_{T+1}$, $h \geq 2$&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;So that $\hat{\Sigma}_{T+h} = \hat{\Sigma}_{T+1}$, $h \geq 2$.&lt;/p&gt;

&lt;h4 id=&quot;averaged-asset-returns-covariancecorrelation-matrix-over-the-next-h-periods&quot;&gt;Averaged asset returns covariance/correlation matrix over the next $h$ periods&lt;/h4&gt;

&lt;p&gt;The IEWMA covariance matrix forecasting model estimates the averaged&lt;sup id=&quot;fnref:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:11&quot; class=&quot;footnote&quot;&gt;9&lt;/a&gt;&lt;/sup&gt; asset returns covariance matrix $\hat{\Sigma}_{T+1:T+h}$ and correlation matrix $\hat{C}_{T+1:T+h}$ over the next $h$ periods, 
$h \geq 2$, by the next period’s asset returns covariance matrix $\hat{\Sigma}_{T+1}$ and correlation matrix $\hat{C}_{T+1}$.&lt;/p&gt;

&lt;p&gt;Indeed, from the previous sub-section, we have:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;$ \hat{\Sigma}_{T+1:T+h} = \frac{1}{h} \sum_{i=1}^{h} \hat{\Sigma}_{T+i} = \hat{\Sigma}_{T+1} $, $h \geq 2$&lt;/li&gt;
  &lt;li&gt;$ \hat{C}_{T+1:T+h} = \frac{1}{h} \sum_{i=1}^{h} \hat{C}_{T+i} = \hat{C}_{T+1} $, $h \geq 2$&lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;rationale&quot;&gt;Rationale&lt;/h3&gt;

&lt;p&gt;The rationale behind the IEWMA covariance matrix forecasting model is twofold:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;Separate volatility forecasting from correlation matrix forecasting while using the same baseline forecasting model for internal consistency.&lt;/p&gt;

    &lt;p&gt;This first idea is well known among practitioners, c.f. for example Menchero and Li&lt;sup id=&quot;fnref:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:1&quot; class=&quot;footnote&quot;&gt;10&lt;/a&gt;&lt;/sup&gt; which describes the usage of different EWMA half-lives for estimating volatilities and correlations&lt;sup id=&quot;fnref:13&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:13&quot; class=&quot;footnote&quot;&gt;11&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Use volatility-standardized asset returns instead of raw asset returns for correlation matrix forecasting.&lt;/p&gt;

    &lt;p&gt;This second idea, detailed for example in Engle&lt;sup id=&quot;fnref:2:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt;, originates from the fact that conditional correlations between asset returns are equal to conditional covariances between standardized disturbances, c.f. the previous section.&lt;/p&gt;

    &lt;p&gt;So, given an estimator for the (unobservable) standardized disturbances $\varepsilon_{i,t} = \frac{r_{i,t}}{\sigma_{i,t}}$, $i=1..n$, $t=1..T$, it is possible to estimate the conditional correlations between asset returns.&lt;/p&gt;

    &lt;p&gt;An example of such an estimator is the volatility-standardized asset returns $\tilde{r}_{i,t} = \frac{r_{i,t}}{\hat{\sigma}_{i,t}}$, with the drawback that the quality of the correlation forecasts is then influenced by the quality of the volatility forecasts.&lt;/p&gt;

    &lt;p&gt;And unfortunately, because &lt;em&gt;it is well known that […] asset returns […] fat tails are typically reduced but not eliminated when returns are standardized by volatilities estimated from popular [volatility forecasting] models&lt;/em&gt;&lt;sup id=&quot;fnref:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:12&quot; class=&quot;footnote&quot;&gt;12&lt;/a&gt;&lt;/sup&gt; and because &lt;em&gt;the use of correlation as a measure of dependence can be misleading in the case of (conditionally) non-Gaussian returns&lt;/em&gt;&lt;sup id=&quot;fnref:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:8&quot; class=&quot;footnote&quot;&gt;13&lt;/a&gt;&lt;/sup&gt;, we should not expect any magic here…&lt;/p&gt;
  &lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;how-to-choose-the-decay-factors&quot;&gt;How to choose the decay factors?&lt;/h3&gt;

&lt;p&gt;Due to its relationship with the vanilla EWMA volatility and covariance matrix forecasting models, there are two main procedures to choose the decay factors 
$\lambda_{vol}$ and $\lambda_{cor}$ of an IEWMA covariance matrix forecasting model:&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;
    &lt;p&gt;Using recommended values from the EWMA models literature (0.94, 0.97…).&lt;/p&gt;

    &lt;p&gt;On this, Johansson et al.&lt;sup id=&quot;fnref:3:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; notes that &lt;em&gt;empirical studies on real return data confirm that choosing a faster volatility half-life than correlation half-life yields better estimates&lt;/em&gt;&lt;sup id=&quot;fnref:2:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt; and uses the following pairs of decay factors $\left(\lambda_{vol}, \lambda_{cor}\right)$ in their experimental setup:&lt;/p&gt;
    &lt;ul&gt;
      &lt;li&gt;Short term - $\left(0.870,0.933\right)$, $\left(0.933,0.967\right)$&lt;/li&gt;
      &lt;li&gt;Medium term - $\left(0.967,0.989\right)$, $\left(0.989,0.994\right)$, $\left(0.994,0.997\right)$&lt;/li&gt;
      &lt;li&gt;Long term - $\left(0.997,0.998\right)$, $\left(0.998,0.999\right)$&lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Determining the optimal values w.r.t. the forecast horizon $h$, for example through the minimization of the &lt;a href=&quot;https://en.wikipedia.org/wiki/Root-mean-square_deviation&quot;&gt;root mean square error (RMSE)&lt;/a&gt; between the forecasted covariance matrix over the desired horizon and the observed covariance matrix over that horizon&lt;sup id=&quot;fnref:14&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:14&quot; class=&quot;footnote&quot;&gt;14&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

    &lt;p&gt;In practice, because there are two decay factors to choose, this can be done two ways:&lt;/p&gt;
    &lt;ul&gt;
      &lt;li&gt;
        &lt;p&gt;Either consider the two decay factors as two independent univariate parameters $\lambda_{vol} \in [0,1]$ and $\lambda_{cor} \in [0,1]$.&lt;/p&gt;

        &lt;p&gt;This choice is justified by the original desire to separate volatility forecasting from correlation matrix forecasting.&lt;/p&gt;
      &lt;/li&gt;
      &lt;li&gt;
        &lt;p&gt;Or consider the two decay factors as a single multivariate parameter $\left(\lambda_{vol}, \lambda_{cor}\right) \in [0,1]^2$.&lt;/p&gt;

        &lt;p&gt;This choice is justified by the observed dependency of the correlation forecasts on the volatility-standardized asset returns.&lt;/p&gt;
      &lt;/li&gt;
    &lt;/ul&gt;
  &lt;/li&gt;
&lt;/ul&gt;
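
&lt;p&gt;As an illustration of the second procedure, here is a sketch of a brute-force grid search treating $\left(\lambda_{vol}, \lambda_{cor}\right)$ as a single bivariate parameter; the one-step IEWMA recursion, its initialization and the error measure (mean squared error against the next period’s outer-product proxy) are all simplifying assumptions of mine, not the proprietary procedure used by &lt;strong&gt;Portfolio Optimizer&lt;/strong&gt;:&lt;/p&gt;

```python
import numpy as np
from itertools import product

def iewma_forecast_error(R, lam_vol, lam_cor):
    """Root mean squared error between rolling one-step IEWMA covariance
    forecasts and the next period's outer-product covariance proxies
    (simplified sketch with crude initialization)."""
    T, n = R.shape
    var = R[0] ** 2                   # EWMA variance state
    C = np.outer(R[0], R[0])          # EWMA state for standardized outer products
    errors = []
    for t in range(1, T):
        sigma = np.sqrt(var)
        d = 1.0 / np.sqrt(np.diag(C))
        C_hat = d[:, None] * C * d[None, :]                  # correlation forecast
        Sigma_hat = sigma[:, None] * C_hat * sigma[None, :]  # covariance forecast
        errors.append(np.mean((Sigma_hat - np.outer(R[t], R[t])) ** 2))
        # update both states with period t's return
        var = lam_vol * var + (1 - lam_vol) * R[t] ** 2
        r_std = R[t] / sigma
        C = lam_cor * C + (1 - lam_cor) * np.outer(r_std, r_std)
    return np.sqrt(np.mean(errors))

# Exhaustive search over a small grid of decay factor pairs
rng = np.random.default_rng(0)
R = rng.standard_normal((500, 4)) * 0.01   # synthetic returns, for illustration only
grid = [0.90, 0.94, 0.97, 0.99]
best = min(product(grid, grid), key=lambda p: iewma_forecast_error(R, *p))
```

&lt;p&gt;Restricting the grid to pairs with $\lambda_{vol} \leq \lambda_{cor}$ would encode the empirical observation, quoted above, that a faster volatility half-life than correlation half-life tends to work better.&lt;/p&gt;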

&lt;h2 id=&quot;extensions-of-the-iterated-exponentially-weighted-moving-average-covariance-matrix-forecasting-model&quot;&gt;Extensions of the iterated exponentially weighted moving average covariance matrix forecasting model&lt;/h2&gt;

&lt;h3 id=&quot;asset-specific-volatility-decay-factors&quot;&gt;Asset-specific volatility decay factors&lt;/h3&gt;

&lt;p&gt;The IEWMA covariance matrix forecasting model uses univariate EWMA models in order to forecast asset volatilities, all these models sharing the same decay factor $\lambda_{vol}$.&lt;/p&gt;

&lt;p&gt;Having an identical decay factor for all assets is parsimonious, but is somewhat at odds with the DCC-GARCH model of Engle&lt;sup id=&quot;fnref:2:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt; which uses asset-specific univariate GARCH models - 
that is, each with its own asset-specific parameters - in order to forecast asset volatilities.&lt;/p&gt;

&lt;p&gt;So, one natural extension of the IEWMA model is to allow asset-specific univariate EWMA models - each with its own asset-specific decay factor $\lambda_{i,vol}$, $i=1..n$ - 
in order to forecast asset volatilities.&lt;/p&gt;

&lt;h3 id=&quot;linear-combination-of-iewma-covariance-matrix-forecasting-models&quot;&gt;Linear combination of IEWMA covariance matrix forecasting models&lt;/h3&gt;

&lt;p&gt;Another interesting covariance matrix forecasting model is introduced in Johansson et al.&lt;sup id=&quot;fnref:3:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; as the &lt;em&gt;combined multiple iterated exponentially weighted moving average 
(CM-IEWMA) model&lt;/em&gt;, which consists of a time-varying linear combination of individual IEWMA models, each with its own pair of fixed decay factors.&lt;/p&gt;

&lt;p&gt;As explained in Johansson et al.&lt;sup id=&quot;fnref:3:6&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;The CM-IEWMA predictor is constructed from a modest number of IEWMA predictors, with different pairs of half-lives, which are combined using dynamically varying weights that are based on recent performance.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;The rationale behind the CM-IEWMA model is that &lt;em&gt;different pairs of half-lives may work better for different market conditions&lt;/em&gt;&lt;sup id=&quot;fnref:3:7&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;, with &lt;em&gt;short half-lives [typically performing] better 
in volatile markets [and] long half-lives [performing] better for calm markets where conditions are changing slowly&lt;/em&gt;&lt;sup id=&quot;fnref:3:8&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;This behaviour is illustrated in Figure 1, taken from Johansson et al.&lt;sup id=&quot;fnref:3:9&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;, which shows the evolution of the weights of a 5-IEWMA CM-IEWMA covariance matrix forecasting model 
applied to a universe of U.S. stocks.&lt;/p&gt;

&lt;figure&gt;
	&lt;a href=&quot;/assets/images/blog/covariance-matrix-forecasting-iewma-cm-iewma-weights-evolution.png&quot;&gt;&lt;img src=&quot;/assets/images/blog/covariance-matrix-forecasting-iewma-cm-iewma-weights-evolution-thumb.png&quot; alt=&quot;Evolution of the weights of a 5-IEWMA CM-IEWMA covariance matrix forecasting model applied to a universe of U.S. stocks, 4th January 2010 - 30th December 2022.&quot; /&gt;&lt;/a&gt;
	&lt;figcaption&gt;Figure 1. Evolution of the weights of a 5-IEWMA CM-IEWMA covariance matrix forecasting model applied to a universe of U.S. stocks, 4th January 2010 - 30th December 2022. Source: Johansson et al.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;From Figure 1, it is visible that although &lt;em&gt;substantial weight is put on the slower (longer halflife) IEWMAs most years&lt;/em&gt;&lt;sup id=&quot;fnref:3:10&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;, the CM-IEWMA model still 
&lt;em&gt;adapts the weights depending on market conditions&lt;/em&gt;&lt;sup id=&quot;fnref:3:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;The interested reader is referred to Johansson et al.&lt;sup id=&quot;fnref:3:12&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; for all the technicalities of the CM-IEWMA model, and in particular for the details about the computation of the dynamically varying 
weights of the individual IEWMA models through the resolution of a convex optimization problem&lt;sup id=&quot;fnref:16&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:16&quot; class=&quot;footnote&quot;&gt;15&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;p&gt;A last important remark to conclude this sub-section - the CM-IEWMA model is actually &lt;em&gt;a special case of [a more general] dynamically weighted prediction [model]&lt;/em&gt;&lt;sup id=&quot;fnref:3:13&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;, so that the same 
weighting logic can be applied to any combination of covariance matrix forecasting models.&lt;/p&gt;
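
&lt;p&gt;To give a flavour of such a dynamically weighted combination, below is a deliberately crude NumPy stand-in that weights each model by the inverse of its exponentially weighted past mean squared error against the realized covariance proxies - this is not the convex-optimization weighting of Johansson et al., only an illustration of the general idea:&lt;/p&gt;

```python
import numpy as np

def combined_forecast(model_forecasts, proxies, lam=0.9):
    """Crude dynamic combination of covariance matrix forecasts.

    model_forecasts: array of shape (m, T, n, n) with the one-step forecasts
                     of m individual models;
    proxies:         array of shape (T, n, n) with the realized covariance
                     proxies, e.g. outer products of realized returns.

    Returns an array of shape (T, n, n) of combined forecasts, where each
    model's weight is proportional to the inverse of its exponentially
    weighted past mean squared error (a stand-in for the convex-optimization
    weighting of the CM-IEWMA model)."""
    m, T = model_forecasts.shape[0], model_forecasts.shape[1]
    ewmse = np.ones(m)                   # running per-model error state
    out = np.empty_like(proxies)
    for t in range(T):
        w = 1.0 / ewmse
        w = w / w.sum()                  # weights sum to one
        out[t] = np.tensordot(w, model_forecasts[:, t], axes=1)
        # update the error state with period t's realized proxy
        err = np.mean((model_forecasts[:, t] - proxies[t]) ** 2, axis=(1, 2))
        ewmse = lam * ewmse + (1 - lam) * err
    return out
```

&lt;p&gt;With such a scheme, a model that has recently tracked the proxies well quickly receives most of the weight, mimicking the adaptive behaviour visible in Figure 1.&lt;/p&gt;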

&lt;h2 id=&quot;implementations&quot;&gt;Implementations&lt;/h2&gt;

&lt;h3 id=&quot;implementation-in-portfolio-optimizer&quot;&gt;Implementation in Portfolio Optimizer&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;Portfolio Optimizer&lt;/strong&gt; implements the IEWMA covariance and correlation matrix forecasting model through the endpoints &lt;a href=&quot;https://docs.portfoliooptimizer.io/&quot;&gt;&lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;/assets/covariance/matrix/forecast/ewma/iterated&lt;/code&gt;&lt;/a&gt; and &lt;a href=&quot;https://docs.portfoliooptimizer.io/&quot;&gt;&lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;/assets/correlation/matrix/forecast/ewma/iterated&lt;/code&gt;&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;These endpoints support the two covariance proxies below:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;Squared (close-to-close) returns&lt;/li&gt;
  &lt;li&gt;Demeaned squared (close-to-close) returns&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;These endpoints also allow:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;To use asset-specific univariate EWMA models in order to forecast asset volatilities&lt;/li&gt;
  &lt;li&gt;To automatically determine the optimal value of their parameters (the decay factors $\lambda_{vol}$ and $\lambda_{cor}$) using a proprietary procedure.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Note that &lt;strong&gt;Portfolio Optimizer&lt;/strong&gt; does not provide any implementation of the CM-IEWMA model&lt;sup id=&quot;fnref:20&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:20&quot; class=&quot;footnote&quot;&gt;16&lt;/a&gt;&lt;/sup&gt;, but c.f. the next sub-section.&lt;/p&gt;

&lt;h3 id=&quot;implementation-elsewhere&quot;&gt;Implementation elsewhere&lt;/h3&gt;

&lt;p&gt;Johansson et al.&lt;sup id=&quot;fnref:3:14&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; kindly provides an open source Python implementation of:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;The IEWMA covariance matrix forecasting model&lt;/li&gt;
  &lt;li&gt;The CM-IEWMA covariance matrix forecasting model&lt;/li&gt;
  &lt;li&gt;The general “covariance matrix forecasting models” combination model&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;at &lt;a href=&quot;https://github.com/cvxgrp/cov_pred_finance&quot;&gt;https://github.com/cvxgrp/cov_pred_finance&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;I definitely encourage anyone interested in the CM-IEWMA model to play with this code!&lt;/p&gt;

&lt;h2 id=&quot;example-of-usage---covariance-matrix-forecasting-at-monthly-level-for-a-portfolio-of-various-etfs&quot;&gt;Example of usage - Covariance matrix forecasting at monthly level for a portfolio of various ETFs&lt;/h2&gt;

&lt;p&gt;As an example of usage, I propose to evaluate the empirical performance of the IEWMA covariance matrix forecasting model within the framework of &lt;a href=&quot;/blog/from-volatility-forecasting-to-covariance-matrix-forecasting-the-return-of-simple-and-exponentially-weighted-moving-average-models/&quot;&gt;the previous blog post&lt;/a&gt;, whose aim is 
to forecast monthly covariance and correlation matrices for a portfolio of 10 ETFs representative&lt;sup id=&quot;fnref:17&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:17&quot; class=&quot;footnote&quot;&gt;17&lt;/a&gt;&lt;/sup&gt; of various asset classes:&lt;/p&gt;
&lt;ul&gt;
  &lt;li&gt;U.S. stocks (SPY ETF)&lt;/li&gt;
  &lt;li&gt;European stocks (EZU ETF)&lt;/li&gt;
  &lt;li&gt;Japanese stocks (EWJ ETF)&lt;/li&gt;
  &lt;li&gt;Emerging markets stocks (EEM ETF)&lt;/li&gt;
  &lt;li&gt;U.S. REITs (VNQ ETF)&lt;/li&gt;
  &lt;li&gt;International REITs (RWX ETF)&lt;/li&gt;
  &lt;li&gt;U.S. 7-10 year Treasuries (IEF ETF)&lt;/li&gt;
  &lt;li&gt;U.S. 20+ year Treasuries (TLT ETF)&lt;/li&gt;
  &lt;li&gt;Commodities (DBC ETF)&lt;/li&gt;
  &lt;li&gt;Gold (GLD ETF)&lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;results---covariance-matrix-forecasting&quot;&gt;Results - Covariance matrix forecasting&lt;/h3&gt;

&lt;p&gt;Results over the period 31st January 2008 - 31st July 2023&lt;sup id=&quot;fnref:22&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:22&quot; class=&quot;footnote&quot;&gt;18&lt;/a&gt;&lt;/sup&gt; for covariance matrices are the following&lt;sup id=&quot;fnref:23&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;19&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;table&gt;
  &lt;thead&gt;
    &lt;tr&gt;
      &lt;th&gt;Covariance matrix model&lt;/th&gt;
      &lt;th&gt;Covariance matrix MSE&lt;/th&gt;
    &lt;/tr&gt;
  &lt;/thead&gt;
  &lt;tbody&gt;
    &lt;tr&gt;
      &lt;td&gt;SMA, window size of all the previous months (historical average model)&lt;/td&gt;
      &lt;td&gt;9.59 $10^{-6}$&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;SMA, window size of the previous year&lt;/td&gt;
      &lt;td&gt;9.08 $10^{-6}$&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;EWMA, optimal&lt;sup id=&quot;fnref:24&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:24&quot; class=&quot;footnote&quot;&gt;20&lt;/a&gt;&lt;/sup&gt; $\lambda$&lt;/td&gt;
      &lt;td&gt;6.52 $10^{-6}$&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;EWMA, $\lambda = 0.97$&lt;/td&gt;
      &lt;td&gt;6.37 $10^{-6}$&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;&lt;strong&gt;IEWMA, $\left(\lambda_{vol},\lambda_{cor}\right) = \left(0.97,0.99\right)$&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;6.35 $10^{-6}$&lt;/strong&gt;&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;&lt;strong&gt;IEWMA, optimal&lt;sup id=&quot;fnref:24:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:24&quot; class=&quot;footnote&quot;&gt;20&lt;/a&gt;&lt;/sup&gt; $\lambda_{i,vol}$, $i=1..n$ and $\lambda_{cor}$&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;6.33 $10^{-6}$&lt;/strong&gt;&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;&lt;strong&gt;IEWMA, optimal&lt;sup id=&quot;fnref:24:2&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:24&quot; class=&quot;footnote&quot;&gt;20&lt;/a&gt;&lt;/sup&gt; $\left(\lambda_{vol},\lambda_{cor}\right)$&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;6.16 $10^{-6}$&lt;/strong&gt;&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;SMA, window size of the previous month (random walk model)&lt;/td&gt;
      &lt;td&gt;6.06 $10^{-6}$&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;EWMA, $\lambda = 0.94$&lt;/td&gt;
      &lt;td&gt;5.78 $10^{-6}$&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;&lt;strong&gt;IEWMA, $\left(\lambda_{vol},\lambda_{cor}\right) = \left(0.94,0.97\right)$&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;5.78 $10^{-6}$&lt;/strong&gt;&lt;/td&gt;
    &lt;/tr&gt;
  &lt;/tbody&gt;
&lt;/table&gt;

&lt;p&gt;Within this specific evaluation framework, the IEWMA covariance matrix forecasting model unfortunately does not seem to improve upon the previous models despite the added complexity.&lt;/p&gt;

&lt;p&gt;It is also noteworthy that the IEWMA model with asset-specific univariate EWMA models for volatility (line #6) does not exhibit better performance than the vanilla 
IEWMA model (line #7) when using automatically determined parameters&lt;sup id=&quot;fnref:19&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:19&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;

&lt;h3 id=&quot;results---correlation-matrix-forecasting&quot;&gt;Results - Correlation matrix forecasting&lt;/h3&gt;

&lt;p&gt;Results over the period 31st January 2008 - 31st July 2023&lt;sup id=&quot;fnref:22:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:22&quot; class=&quot;footnote&quot;&gt;18&lt;/a&gt;&lt;/sup&gt; for the correlation matrices associated to the covariance matrices of the previous sub-section are the following&lt;sup id=&quot;fnref:23:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:23&quot; class=&quot;footnote&quot;&gt;19&lt;/a&gt;&lt;/sup&gt;:&lt;/p&gt;

&lt;table&gt;
  &lt;thead&gt;
    &lt;tr&gt;
      &lt;th&gt;Covariance matrix model&lt;/th&gt;
      &lt;th&gt;Correlation matrix MSE&lt;/th&gt;
    &lt;/tr&gt;
  &lt;/thead&gt;
  &lt;tbody&gt;
    &lt;tr&gt;
      &lt;td&gt;SMA, window size of the previous month (random walk model)&lt;/td&gt;
      &lt;td&gt;8.19&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;SMA, window size of all the previous months (historical average model)&lt;/td&gt;
      &lt;td&gt;8.10&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;EWMA, $\lambda = 0.94$&lt;/td&gt;
      &lt;td&gt;7.67&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;SMA, window size of the previous year&lt;/td&gt;
      &lt;td&gt;6.50&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;EWMA, $\lambda = 0.97$&lt;/td&gt;
      &lt;td&gt;6.36&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;EWMA, optimal&lt;sup id=&quot;fnref:24:3&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:24&quot; class=&quot;footnote&quot;&gt;20&lt;/a&gt;&lt;/sup&gt; $\lambda$&lt;/td&gt;
      &lt;td&gt;5.87&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;&lt;strong&gt;IEWMA, $\left(\lambda_{vol},\lambda_{cor}\right) = \left(0.94,0.97\right)$&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;5.85&lt;/strong&gt;&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;&lt;strong&gt;IEWMA, $\left(\lambda_{vol},\lambda_{cor}\right) = \left(0.97,0.99\right)$&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;5.85&lt;/strong&gt;&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;&lt;strong&gt;IEWMA, optimal&lt;sup id=&quot;fnref:24:4&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:24&quot; class=&quot;footnote&quot;&gt;20&lt;/a&gt;&lt;/sup&gt; $\lambda_{i,vol}$, $i=1..n$ and $\lambda_{cor}$&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;5.72&lt;/strong&gt;&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;&lt;strong&gt;IEWMA, optimal&lt;sup id=&quot;fnref:24:5&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:24&quot; class=&quot;footnote&quot;&gt;20&lt;/a&gt;&lt;/sup&gt; $\left(\lambda_{vol},\lambda_{cor}\right)$&lt;/strong&gt;&lt;/td&gt;
      &lt;td&gt;&lt;strong&gt;5.70&lt;/strong&gt;&lt;/td&gt;
    &lt;/tr&gt;
  &lt;/tbody&gt;
&lt;/table&gt;

&lt;p&gt;This time, the IEWMA model - and especially the two IEWMA variants using automatically determined parameters (lines #9-#10) - does exhibit slightly better performance than all the previous models.&lt;/p&gt;

&lt;p&gt;Nevertheless, the improvement over the previously best-performing model (line #6 - the EWMA model with an automatically determined parameter) remains modest…&lt;/p&gt;

&lt;p&gt;So, the idea of removing the impact of volatility on asset returns in order to better estimate asset correlations seems to have some merit, but the EWMA volatility forecasting model 
might be insufficient to fully exploit it.&lt;/p&gt;

&lt;p&gt;Here again, the performance of the IEWMA model with asset-specific univariate EWMA volatility models is strictly worse than that of the vanilla IEWMA model&lt;sup id=&quot;fnref:19:1&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:19&quot; class=&quot;footnote&quot;&gt;21&lt;/a&gt;&lt;/sup&gt;.&lt;/p&gt;
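&lt;p&gt;For readers wishing to reproduce this kind of evaluation, below is a minimal Python sketch of how a correlation matrix MSE could be computed from a series of forecasted covariance matrices and covariance proxies. The function names are illustrative, and the exact construction of the covariance proxies (c.f. the footnotes) is deliberately left out.&lt;/p&gt;

```python
import numpy as np

def cov_to_corr(sigma):
    # C = V^{-1} Sigma V^{-1}, with V the diagonal matrix
    # of the standard deviations extracted from Sigma
    v = np.sqrt(np.diag(sigma))
    return sigma / np.outer(v, v)

def correlation_mse(forecasts, proxies):
    # Mean squared element-wise error between the correlation matrices
    # associated with the forecasted and proxy covariance matrices,
    # averaged over all evaluation periods
    errors = [np.mean((cov_to_corr(f) - cov_to_corr(p)) ** 2)
              for f, p in zip(forecasts, proxies)]
    return float(np.mean(errors))
```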

&lt;h2 id=&quot;conclusion&quot;&gt;Conclusion&lt;/h2&gt;

&lt;p&gt;One of the main characteristics of the IEWMA covariance matrix forecasting model of Johansson et al.&lt;sup id=&quot;fnref:3:15&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt; is that it forecasts asset correlations using (EWMA-)volatility-standardized asset returns 
instead of raw asset returns, so that the impact of asset volatilities on their correlations is (tentatively) minimized.&lt;/p&gt;
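&lt;p&gt;To make this two-step structure concrete, here is a minimal Python sketch of an IEWMA forecast - EWMA variance forecasts per asset, an EWMA correlation matrix of the volatility-standardized returns, then recombination. The function name and initialization choices are my own illustrative assumptions; the reference implementation remains the open source one of Johansson et al.&lt;/p&gt;

```python
import numpy as np

def ewma_iterated_forecast(returns, lambda_vol=0.94, lambda_cor=0.97):
    T, n = returns.shape
    # Step 1: one-period-ahead EWMA variance forecasts per asset,
    # using squared returns as variance proxies (initialized, as an
    # assumption of this sketch, with the first squared returns)
    var = returns[0] ** 2
    std_returns = []
    for t in range(1, T):
        # standardize r_t by its one-period-ahead volatility forecast
        std_returns.append(returns[t] / np.sqrt(var))
        var = lambda_vol * var + (1 - lambda_vol) * returns[t] ** 2
    vol_next = np.sqrt(var)  # next period's volatility forecasts
    # Step 2: EWMA of the outer products of the standardized returns,
    # rescaled to a proper correlation matrix (unit diagonal)
    S = np.outer(std_returns[0], std_returns[0])
    for r_tilde in std_returns[1:]:
        S = lambda_cor * S + (1 - lambda_cor) * np.outer(r_tilde, r_tilde)
    d = np.sqrt(np.diag(S))
    C = S / np.outer(d, d)
    # Step 3: recombine as Sigma = V C V
    V = np.diag(vol_next)
    return V @ C @ V, C
```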

&lt;p&gt;Unfortunately, the empirical performance of that model in terms of correlation matrix forecasting is not that different from that of the EWMA model, which raises the question of whether 
improving the volatility standardization might lead to better correlation forecasts.&lt;/p&gt;

&lt;p&gt;This will be the subject of a future blog post in this series.&lt;/p&gt;

&lt;p&gt;As usual, feel free to &lt;a href=&quot;https://www.linkedin.com/in/roman-rubsamen/&quot;&gt;connect with me on LinkedIn&lt;/a&gt; or to &lt;a href=&quot;https://twitter.com/portfoliooptim&quot;&gt;follow me on Twitter&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;–&lt;/p&gt;

&lt;div class=&quot;footnotes&quot; role=&quot;doc-endnotes&quot;&gt;
  &lt;ol&gt;
    &lt;li id=&quot;fn:15&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;ChatGPT-generated, as can be seen by the signature word “delve” :-) ! &lt;a href=&quot;#fnref:15&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:4&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;At the date of the initial publication of this blog post. &lt;a href=&quot;#fnref:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:3&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.nowpublishers.com/article/Details/ECO-047&quot;&gt;Kasper Johansson, Mehmet G. Ogut, Markus Pelger, Thomas Schmelzer and Stephen Boyd (2023), A Simple Method for Predicting Covariance Matrices of Financial Returns, Foundations and Trends in Econometrics: Vol. 12: No. 4, pp 324-407&lt;/a&gt;. &lt;a href=&quot;#fnref:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:3:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;10&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;11&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:11&quot; 
class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;12&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:12&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;13&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:13&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;14&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:14&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;15&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:3:15&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;16&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:10&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.pm-research.com/content/iijpormgmt/41/3/97&quot;&gt;Valeriy Zakamulin, A Test of Covariance-Matrix Forecasting Methods, The Journal of Portfolio Management  Spring 2015, 41 (3) 97-108&lt;/a&gt;. &lt;a href=&quot;#fnref:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:10:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:9&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://link.springer.com/chapter/10.1007/978-3-540-71297-8_36&quot;&gt;Patton, A.J., Sheppard, K. (2009). Evaluating Volatility and Correlation Forecasts. In: Mikosch, T., Kreiß, JP., Davis, R., Andersen, T. (eds) Handbook of Financial Time Series. Springer, Berlin, Heidelberg&lt;/a&gt;. &lt;a href=&quot;#fnref:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:2&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.tandfonline.com/doi/abs/10.1198/073500102288618487&quot;&gt;Engle, R. (2002). Dynamic Conditional Correlation. Journal of Business &amp;amp; Economic Statistics. 20(3): 339–350&lt;/a&gt;. &lt;a href=&quot;#fnref:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:2:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:7&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;9&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:9&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;10&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:2:10&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;11&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:5&quot; role=&quot;doc-endnote&quot;&gt;
&lt;p&gt;That being said, I find that what Engle&lt;sup id=&quot;fnref:2:11&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:2&quot; class=&quot;footnote&quot;&gt;6&lt;/a&gt;&lt;/sup&gt; describes is closer to an iterated model combining $n$ univariate GARCH volatility forecasting models with an EWMA correlation matrix forecasting model than to the iterated EWMA forecasting model of Johansson et al.&lt;sup id=&quot;fnref:3:16&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;… &lt;a href=&quot;#fnref:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:6&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;C.f. the associated blog posts &lt;a href=&quot;/blog/volatility-forecasting-simple-and-exponentially-weighted-moving-average-models/&quot;&gt;here&lt;/a&gt; and &lt;a href=&quot;/blog/from-volatility-forecasting-to-covariance-matrix-forecasting-the-return-of-simple-and-exponentially-weighted-moving-average-models/&quot;&gt;there&lt;/a&gt;. &lt;a href=&quot;#fnref:6&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:11&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.sciencedirect.com/science/article/pii/S0378426622000267&quot;&gt;Gianluca De Nard, Robert F. Engle, Olivier Ledoit, Michael Wolf, Large dynamic covariance matrices: Enhancements based on intraday data, Journal of Banking &amp;amp; Finance, Volume 138, 2022, 106426&lt;/a&gt;. &lt;a href=&quot;#fnref:11&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:1&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://joim.com/downloads/correlation-shrinkage-implications-for-risk-forecasting/&quot;&gt;Menchero, Jose and Peng Li. Correlation Shrinkage: Implications for Risk Forecasting, Journal of Investment Management (2020)&lt;/a&gt;. &lt;a href=&quot;#fnref:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:13&quot; role=&quot;doc-endnote&quot;&gt;
&lt;p&gt;Digging a little deeper, the old MAC1 and MAC2 Bloomberg multi-asset risk models used a half-life of 26 weeks for estimating volatilities and a half-life of 52 weeks for estimating correlations. &lt;a href=&quot;#fnref:13&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:12&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.mfsociety.org/modules/modDashboard/uploadFiles/journals/googleScholar/684.html&quot;&gt;Andersen, Torben G., Tim Bollerslev, Francis X. Diebold, and Paul Labys, 2000, Exchange Rate Returns Standardized by Realized Volatility are (Nearly) Gaussian, Multinational Finance Journal 4, 159-179.&lt;/a&gt;. &lt;a href=&quot;#fnref:12&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:8&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.sciencedirect.com/science/article/abs/pii/S0264999310001392&quot;&gt;Bahram Pesaran, M. Hashem Pesaran, Conditional volatility and correlations of weekly returns and the VaR analysis of 2008 stock market crash, Economic Modelling, Volume 27, Issue 6, 2010, Pages 1398-1416&lt;/a&gt;. &lt;a href=&quot;#fnref:8&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:14&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://www.msci.com/documents/10199/5915b101-4206-4ba0-aee2-3449d5c7e95a&quot;&gt;RiskMetrics. Technical Document, J.P.Morgan/Reuters, New York, 1996. Fourth Edition&lt;/a&gt;. &lt;a href=&quot;#fnref:14&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:16&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The maximization of &lt;em&gt;the average log-likelihood of the combined [covariance matrix] prediction over [a trailing number of periods]&lt;/em&gt;&lt;sup id=&quot;fnref:3:17&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:3&quot; class=&quot;footnote&quot;&gt;3&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:16&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:20&quot; role=&quot;doc-endnote&quot;&gt;
&lt;p&gt;This is because my own tests did not highlight any strong improvement in terms of forecasting ability vs. the IEWMA model when used with an optimal pair of decay factors. &lt;a href=&quot;#fnref:20&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:17&quot; role=&quot;doc-endnote&quot;&gt;
&lt;p&gt;These ETFs are used in the &lt;em&gt;Adaptive Asset Allocation&lt;/em&gt; strategy from &lt;a href=&quot;https://investresolve.com/&quot;&gt;ReSolve Asset Management&lt;/a&gt;, described in the paper &lt;em&gt;Adaptive Asset Allocation: A Primer&lt;/em&gt;&lt;sup id=&quot;fnref:18&quot; role=&quot;doc-noteref&quot;&gt;&lt;a href=&quot;#fn:18&quot; class=&quot;footnote&quot;&gt;22&lt;/a&gt;&lt;/sup&gt;. &lt;a href=&quot;#fnref:17&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:22&quot; role=&quot;doc-endnote&quot;&gt;
&lt;p&gt;(Adjusted) daily prices have been retrieved using &lt;a href=&quot;https://api.tiingo.com/&quot;&gt;Tiingo&lt;/a&gt;. &lt;a href=&quot;#fnref:22&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:22:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:23&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;Using the outer product of asset returns - assuming a mean return of 0 - as covariance proxy, and using an expanding historical window of asset returns. &lt;a href=&quot;#fnref:23&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:23:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:24&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The optimal decay factors ($\lambda$, $\lambda_{vol}$, $\lambda_{cor}$) are computed at the end of every month using all the available asset returns history up to that point in time, as implemented in &lt;strong&gt;Portfolio Optimizer&lt;/strong&gt;; thus, there is no look-ahead bias. &lt;a href=&quot;#fnref:24&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:24:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:24:2&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:24:3&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:24:4&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt; &lt;a href=&quot;#fnref:24:5&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:19&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;The difference between the two models is probably just noise, which, at worst, implies that the added complexity of the IEWMA model with asset-specific univariate EWMA models is useless in practice. &lt;a href=&quot;#fnref:19&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt; &lt;a href=&quot;#fnref:19:1&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
    &lt;li id=&quot;fn:18&quot; role=&quot;doc-endnote&quot;&gt;
      &lt;p&gt;See &lt;a href=&quot;https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2328254&quot;&gt;Butler, Adam and Philbrick, Mike and Gordillo, Rodrigo and Varadi, David, Adaptive Asset Allocation: A Primer&lt;/a&gt;. &lt;a href=&quot;#fnref:18&quot; class=&quot;reversefootnote&quot; role=&quot;doc-backlink&quot;&gt;&amp;#8617;&lt;/a&gt;&lt;/p&gt;
    &lt;/li&gt;
  &lt;/ol&gt;
&lt;/div&gt;</content><author><name>Roman R.</name></author><category term="covariance matrix" /><category term="correlation matrix" /><summary type="html">In the previous post of this series on covariance matrix forecasting, I reviewed both the simple and the exponentially weighted moving average covariance matrix forecasting models, which are straightforward extensions of their respective univariate volatility forecasting models to a multivariate setting. With these reference models established, we can now delve into more sophisticated approaches for forecasting covariance matrices1. In this blog post, I will describe the iterated exponentially weighted moving average (IEWMA) model that has recently2 been introduced in Johansson et al.3 and I will illustrate its empirical performances in the context of monthly covariance matrix forecasting for a multi-asset class ETF portfolio. Mathematical preliminaries Covariance matrix modelling and covariance proxies (reminders) This sub-section contains reminders from a previous blog post. Let $n$ be the number of assets in a universe of assets and $r_t \in \mathbb{R}^n$ be the vector of the (unknown) (logarithmic) return process of these assets over a time period $t$ (a day, a week, a month..), over which the (conditional) mean return vector $\mu_t \in \mathbb{R}^n$ of these assets is supposed to be null. Then: $r_t$ can be expressed as4 $r_t = \epsilon_t$, with $\epsilon_t \in \mathbb{R}^n$ an unpredictable error term, often referred to as a vector of “shocks” or as a vector of “random disturbances”4, over the time period $t$ The asset (conditional) covariance matrix $\Sigma_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ over that time period $t$ is defined as $\Sigma_t = \mathbb{E} \left[ r_t r_t {}^t \right]$ From this definition, the outer product of the realized asset returns $ \tilde{r}_t \tilde{r}_t {}^t $ over a time period $t$ (a day, a week, a month..) 
is a covariance estimate $\tilde{\Sigma}_t$ - or covariance proxy5 - for the (unknown) asset returns covariance matrix over the considered time period $t$. The asset (conditional) correlation matrix $C_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ is defined as $ C_t = V_t^{-1} \Sigma_t V_t^{-1} $, where $V_t \in \mathcal{M}(\mathbb{R}^{n \times n})$ is the diagonal matrix of the asset (conditional) standard deviations. Correlation matrix modelling In order to clarify the relation between conditional correlations and conditional variances6, Engle6 proposes to write the [asset] returns as the conditional standard deviation times the standardized disturbance6, that is \[r_{i,t} = \epsilon_{i,t} = \sigma_{i,t} \varepsilon_{i,t}, i=1..n\] , with: $\sigma_{i,t} = \sqrt{ \mathbb{E} \left[ r_{i,t}^2 \right] } $ $\varepsilon_{i,t}$ a standardized disturbance that has mean zero and variance one6 This way, the conditional correlation [between asset $i$ and asset $j$ becomes equal to] the conditional covariance between the standardized disturbances [$\varepsilon_{i,t}$ and $\varepsilon_{j,t}$]6 \[\rho_{ij,t} = \frac{\mathbb{E} \left[ r_{i,t} r_{j,t} \right]}{\sqrt{ \mathbb{E} \left[ r_{i,t}^2 \right] \mathbb{E} \left[ r_{j,t}^2 \right] }} = \mathbb{E} \left[ \varepsilon_{i,t} \varepsilon_{j,t} \right]\] The iterated exponentially weighted moving average covariance matrix forecasting model The IEWMA covariance matrix forecasting model3 is a two-step model that: Uses an EWMA volatility forecasting model with squared asset returns as variance proxies in order to forecast asset volatilities Uses an EWMA correlation matrix forecasting model with outer products of (EWMA-)volatility-standardized asset returns as covariance matrix proxies in order to forecast asset correlations Johansson et al.3 highlights that the IEWMA model was originally7 proposed in Engle6 as an efficient alternative to the DCC-GARCH6 predictor, although he did not refer to it as IEWMA6. 
In other words, the IEWMA covariance matrix forecasting model bridges the gap between the simple EWMA model and the more complex DCC-GARCH model. Forecasting formulas Let be: $n$ be the number of assets in a universe of assets $r_t r_t {}^t$, $t=1..T$ the outer products of the realized asset returns over each of $T$ past periods A decay factor $\lambda_{vol} \in [0, 1]$ A decay factor $\lambda_{cor} \in [0, 1]$ Next period’s asset returns covariance/correlation matrix The IEWMA covariance matrix forecasting model estimates the next period’s asset returns covariance matrix $\hat{\Sigma}_{T+1}$ and correlation matrix $\hat{C}_{T+1}$ as follows3: For each asset $i=1..n$ in the universe of assets Forecast the asset one-period-ahead variances $\hat{\sigma}^2_{i,2}$, …, $\hat{\sigma}^2_{i,T+1}$ using an EWMA volatility forecasting model with decay factor $\lambda_{vol}$ and squared asset returns $r_{i,t}^2$, $t=1..T$, as variance proxies Compute the volatility-standardized asset returns $\tilde{r}_{i,2},…,\tilde{r}_{i,T}$ defined as the asset returns standardized by their one-period-ahead EWMA volatility forecasts \[\tilde{r}_{i,t} = \frac{r_{i,t}}{\hat{\sigma}_{i,t}}, t = 2..T\] Compute the diagonal matrix of the assets next period’s forecasted volatilities $V_{T+1}$ \[V_{T+1} = \begin{pmatrix} \hat{\sigma}_{1,T+1} &amp;amp; 0 &amp;amp; ... &amp;amp; 0 \\ 0 &amp;amp; \hat{\sigma}_{2,T+1} &amp;amp; ... &amp;amp; 0 \\ ... &amp;amp; ... &amp;amp; ... &amp;amp; ... \\ 0 &amp;amp; 0 &amp;amp; ... 
&amp;amp; \hat{\sigma}_{n,T+1} \end{pmatrix}\] Forecast the next period’s asset returns correlation matrix $\hat{C}_{T+1}$ using an EWMA covariance matrix forecasting model with decay factor $\lambda_{cor}$ and outer products of volatility-standardized asset returns $\tilde{r_t} \tilde{r_t} {}^t$, $t=2..T$, as covariance matrix proxies Compute the next period’s forecasted asset returns covariance matrix $\hat{\Sigma}_{T+1}$ \[\hat{\Sigma}_{T+1} = V_{T+1} \hat{C}_{T+1} V_{T+1}\] Next $h$-period’s ahead asset returns covariance/correlation matrix The IEWMA covariance matrix forecasting model estimates the next $h$-period’s ahead asset returns covariance matrix $\hat{\Sigma}_{T+h}$ and correlation matrix $\hat{C}_{T+h}$, $h \geq 2$, by the next period’s asset returns covariance matrix $\hat{\Sigma}_{T+1}$ and correlation matrix $\hat{C}_{T+1}$. Indeed, due to the properties of the EWMA volatility and covariance matrix forecasting models8, we have: $V_{T+h} = V_{T+1}$, $h \geq 2$ $\hat{C}_{T+h} = \hat{C}_{T+1}$, $h \geq 2$ So that $\hat{\Sigma}_{T+h} = \hat{\Sigma}_{T+1}$, $h \geq 2$. Averaged asset returns covariance/correlation matrix over the next $h$ periods The IEWMA covariance matrix forecasting model estimates the averaged9 asset returns covariance matrix $\hat{\Sigma}_{T+1:T+h}$ and correlation matrix $\hat{C}_{T+1:T+h}$ over the next $h$ periods, $h \geq 2$, by the next period’s asset returns covariance matrix $\hat{\Sigma}_{T+1}$ and correlation matrix $\hat{C}_{T+1}$. Indeed, from the previous sub-section, we have: $ \hat{\Sigma}_{T+1:T+h} = \frac{1}{h} \sum_{i=1}^{h} \hat{\Sigma}_{T+i} = \hat{\Sigma}_{T+1} $, $h \geq 2$ $ \hat{C}_{T+1:T+h} = \frac{1}{h} \sum_{i=1}^{h} \hat{C}_{T+i} = \hat{C}_{T+1} $, $h \geq 2$ Rationale The rationale behind the IEWMA covariance matrix forecasting model is twofold: Separate volatility forecasting from correlation matrix forecasting while using the same baseline forecasting model for internal consistency. 
This first idea is well known among practitioners, c.f. for example Menchero and Li10 which describes the usage of different EWMA half-lives for estimating volatilities and correlations11. Use volatility-standardized asset returns instead of raw asset returns for correlation matrix forecasting. This second idea, detailled for example in Engle6, originates from the fact that conditional correlations between asset returns are equal to conditional covariances between standardized disturbances, c.f. the previous section. So, given an estimator for the (unobservable) standardized disturbances $\epsilon_{i,t} = \frac{r_{i,t}}{\sigma_{i,t}}$, $i=1..n$, $t=1..T$, it is possible to estimate the conditional correlations between asset returns. An example of such estimator is the volatility-standardized asset returns $\tilde{r}_{i,t} = \frac{r_{i,t}}{\hat{\sigma}_{i,t}}$, with the drawback that the quality of the correlation forecasts is then influenced by the quality of the volatility forecasts. And unfortunately, because it is well known that […] asset returns […] fat tails are typically reduced but not eliminated when returns are standardized by volatilities estimated from popular [volatility forecasting] models12 and because the use of correlation as a measure of dependence can be misdealing in the case of (conditionally) non-Gaussian returns13, we should not expect any magic here… How to choose the decay factors? Due to its relationship with the vanilla EWMA volatility and covariance matrix forecasting models, there are two main procedures to choose the decay factors $\lambda_{vol}$ and $\lambda_{cor}$ of an IEWMA covariance matrix forecasting model: Using recommended values from the EWMA models literature (0.94, 0.97…). 
On this, Johansson et al.3 notes that empirical studies on real return data confirm that choosing a faster volatility half-life than correlation half-life yields better estimates6 and uses the following pairs of decay factors $\left(\lambda_{vol}, \lambda_{cor}\right)$ in their experimental setup: Short term - $\left(0.870,0.933\right)$, $\left(0.933,0.967\right)$ Medium term - $\left(0.967,0.989\right)$, $\left(0.989,0.994\right)$, $\left(0.994,0.997\right)$ Long term - $\left(0.997,0.998\right)$, $\left(0.998,0.999\right)$ Determining the optimal values w.r.t. the forecast horizon $h$, for example through the minimization of the root mean square error (RMSE) between the forecasted covariance matrix over the desired horizon and the observed covariance matrix over that horizon14. In practice, because there are two decay factors to choose, this can be done two ways: Either consider the two decay factors as a two independent univariate parameters $\lambda_{vol} \in [0,1]$ and $\lambda_{cor} \in [0,1]$. This choice is justified by the original desire to separate volatility forecasting from correlation matrix forecasting. Or consider the two decay factors as a single multivariate parameter $\left(\lambda_{vol}, \lambda_{cor}\right) \in [0,1]^2$. This choice is justified by the observed dependency of the correlation forecasts on the volatility-standardized asset returns. Extensions of the iterated exponentially weighted moving average covariance matrix forecasting model Asset-specific volatility decay factors The IEWMA covariance matrix forecasting model uses univariate EWMA models in order to forecast asset volatilities, all these models sharing the same decay factor $\lambda_{vol}$. Having an identical decay factor for all assets is parsimonious, but is somewhat at odds with the DCC-GARCH model of Engle6 which uses asset-specific univariate GARCH models - that is, each with its own asset-specific parameters - in order to forecast asset volatilities. 
So, one natural extension of the IEWMA model is to allow asset-specific univariate EWMA models - each with its own asset-specific decay factor $\lambda_{i,vol}$, $i=1..n$ - in order to forecast asset volatilities. Linear combination of IEWMA covariance matrix forecasting models Another interesting covariance matrix forecasting model is introduced in Johansson et al.3 as the combined multiple iterated exponentially weighted moving average (CM-IEWMA) model, which consist in a time-varying linear combination of individual IEWMA models, each with its own pair of fixed decay factors. As explained in Johansson et al.3: The CM-IEWMA predictor is constructed from a modest number of IEWMA predictors, with different pairs of half-lives, which are combined using dynamically varying weights that are based on recent performance. The rationale behind the CM-IEWMA model is that different pairs of half-lives may work better for different market conditions3, with short half-lives [typically performing] better in volatile markets [and] long half-lives [performing] better for calm markets where conditions are changing slowly3. This behaviour is illustrated in Figure 1, taken from Johansson et al.3, which shows the evolution of the weights of a 5-IEWMA CM-IEWMA covariance matrix forecasting model applied to a universe of U.S. stocks. Figure 1. Evolution of the weights of a 5-IEWMA CM-IEWMA covariance matrix forecasting model applied to a universe of U.S. stocks, 4th January 2010 - 30th December 2022. Source: Johansson et al. From Figure 1, it is visible that although substantial weight is put on the slower (longer halflife) IEWMAs most years3, the CM-IEWMA model still adapts the weights depending on market conditions3. 
The interested reader is referenced to Johansson et al.3 for all the technicalities of the CM-IEWMA model, and in particular for the details about the computation of the dynamically varying weights of the individual IEWMA models through the resolution of a convex optimization problem15. A last important remark to conclude this sub-section - the CM-IEWMA model is actually a special case of [a more general] dynamically weighted prediction [model]3, so that the same weighting logic can be applied to any combination of covariance matrix forecasting models. Implementations Implementation in Portfolio Optimizer Portfolio Optimizer implements the IEWMA covariance and correlation matrix forecasting model through the endpoints /assets/covariance/matrix/forecast/ewma/iterated and /assets/correlation/matrix/forecast/ewma/iterated. These endpoints support the 2 covariance proxies below: Squared (close-to-close) returns Demeaned squared (close-to-close) returns These endpoints also allow: To use asset-specific univariate EWMA models in order to forecast asset volatilities To automatically determine the optimal value of their parameters (the decay factors $\lambda_{vol}$ and $\lambda_{cor}$) using a proprietary procedure. To be noted that Portfolio Optimizer does not provide any implementation of the CM-IEWMA model16, but c.f. the next sub-section. Implementation elsewhere Johansson et al.3 kindly provides an open source Python implementation of: The IEWMA covariance matrix forecasting model The CM-IEWMA covariance matrix forecasting model The general “covariance matrix forecasting models” combination model at https://github.com/cvxgrp/cov_pred_finance. I definitely encourage anyone interested in the CM-IEMWA model to play with this code! 
## Example of usage - Covariance matrix forecasting at monthly level for a portfolio of various ETFs

As an example of usage, I propose to evaluate the empirical performances of the IEWMA covariance matrix forecasting model within the framework of the previous blog post, whose aim is to forecast monthly covariance and correlation matrices for a portfolio of 10 ETFs representative17 of various asset classes:

- U.S. stocks (SPY ETF)
- European stocks (EZU ETF)
- Japanese stocks (EWJ ETF)
- Emerging markets stocks (EEM ETF)
- U.S. REITs (VNQ ETF)
- International REITs (RWX ETF)
- U.S. 7-10 year Treasuries (IEF ETF)
- U.S. 20+ year Treasuries (TLT ETF)
- Commodities (DBC ETF)
- Gold (GLD ETF)

### Results - Covariance matrix forecasting

Results over the period 31st January 2008 - 31st July 202318 for covariance matrices are the following19:

| Covariance matrix model | Covariance matrix MSE |
| --- | --- |
| SMA, window size of all the previous months (historical average model) | 9.59 $10^{-6}$ |
| SMA, window size of the previous year | 9.08 $10^{-6}$ |
| EWMA, optimal20 $\lambda$ | 6.52 $10^{-6}$ |
| EWMA, $\lambda = 0.97$ | 6.37 $10^{-6}$ |
| IEWMA, $\left(\lambda_{vol},\lambda_{cor}\right) = \left(0.97,0.99\right)$ | 6.35 $10^{-6}$ |
| IEWMA, optimal20 $\lambda_{i,vol}$, $i=1..n$ and $\lambda_{cor}$ | 6.33 $10^{-6}$ |
| IEWMA, optimal20 $\left(\lambda_{vol},\lambda_{cor}\right)$ | 6.16 $10^{-6}$ |
| SMA, window size of the previous month (random walk model) | 6.06 $10^{-6}$ |
| EWMA, $\lambda = 0.94$ | 5.78 $10^{-6}$ |
| IEWMA, $\left(\lambda_{vol},\lambda_{cor}\right) = \left(0.94,0.97\right)$ | 5.78 $10^{-6}$ |

Within this specific evaluation framework, the IEWMA covariance matrix forecasting model unfortunately does not seem to improve upon the previous models despite the added complexity.

It is also noteworthy that the IEWMA model with asset-specific univariate EWMA models for volatility (line #6) does not exhibit better performances than the vanilla IEWMA model (line #7) when using automatically determined parameters21.
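The MSE figures above compare each month's covariance matrix forecast against a noisy realized proxy - per footnote 19, the outer product of the period's asset returns, assuming a mean return of 0. A minimal sketch of this evaluation loop, assuming the forecasts have already been computed by some model:

```python
import numpy as np

def covariance_mse(forecasts, realized_returns):
    """Average element-wise MSE between covariance matrix forecasts and
    a realized covariance proxy (outer product of returns, zero mean).

    forecasts        : list of (n, n) forecast matrices, one per period
    realized_returns : (T, n) array of the returns realized in each period
    """
    errors = []
    for sigma_hat, r in zip(forecasts, realized_returns):
        proxy = np.outer(r, r)  # covariance proxy for the period
        errors.append(np.mean((sigma_hat - proxy) ** 2))
    return float(np.mean(errors))

# Toy usage: two periods, two assets, dummy forecasts
forecasts = [np.eye(2) * 1e-4, np.eye(2) * 2e-4]
rets = np.array([[0.01, -0.01], [0.02, 0.0]])
mse = covariance_mse(forecasts, rets)
```

Note that the outer-product proxy is itself a very noisy estimate of the true covariance matrix, which is one reason why the MSE differences between competing models in the tables are small.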
### Results - Correlation matrix forecasting

Results over the period 31st January 2008 - 31st July 202318 for the correlation matrices associated to the covariance matrices of the previous sub-section are the following19:

| Covariance matrix model | Correlation matrix MSE |
| --- | --- |
| SMA, window size of the previous month (random walk model) | 8.19 |
| SMA, window size of all the previous months (historical average model) | 8.10 |
| EWMA, $\lambda = 0.94$ | 7.67 |
| SMA, window size of the previous year | 6.50 |
| EWMA, $\lambda = 0.97$ | 6.36 |
| EWMA, optimal20 $\lambda$ | 5.87 |
| IEWMA, $\left(\lambda_{vol},\lambda_{cor}\right) = \left(0.94,0.97\right)$ | 5.85 |
| IEWMA, $\left(\lambda_{vol},\lambda_{cor}\right) = \left(0.97,0.99\right)$ | 5.85 |
| IEWMA, optimal20 $\lambda_{i,vol}$, $i=1..n$ and $\lambda_{cor}$ | 5.72 |
| IEWMA, optimal20 $\left(\lambda_{vol},\lambda_{cor}\right)$ | 5.70 |

This time, the IEWMA model - and especially the two IEWMA models using automatically determined parameters (lines #9-#10) - does exhibit slightly better performances than all the previous models.

Nevertheless, the improvement over the previously best-performing model (line #6 - the EWMA model with automatically determined parameter) is not that impressive… So, the idea of removing the impact of volatility on asset returns in order to better estimate asset correlations seems to have some merit, but the EWMA volatility forecasting model might be insufficient to fully exploit this idea.

Also, here again, the performances of the IEWMA model with asset-specific univariate EWMA models for volatility are strictly worse than the performances of the vanilla IEWMA model21.

## Conclusion

One of the main characteristics of the IEWMA covariance matrix forecasting model of Johansson et al.3 is to forecast asset correlations using (EWMA-)volatility-standardized asset returns instead of raw asset returns, so that the impact of asset volatilities on their correlations is (tentatively) minimized.
Unfortunately, the empirical performances of that model in terms of correlation matrix forecasting are not that different from those of the EWMA model, which raises the question of whether improving the volatility-standardization might lead to better correlation forecasts.

This will be the subject of a future blog post in this series.

As usual, feel free to connect with me on LinkedIn or to follow me on Twitter.

–

1. ChatGPT-generated, as can be seen by the signature word "delve" :-) !
2. At the date of the initial publication of this blog post.
3. See Kasper Johansson, Mehmet G. Ogut, Markus Pelger, Thomas Schmelzer and Stephen Boyd (2023), A Simple Method for Predicting Covariance Matrices of Financial Returns, Foundations and Trends in Econometrics: Vol. 12: No. 4, pp 324-407.
4. See Valeriy Zakamulin, A Test of Covariance-Matrix Forecasting Methods, The Journal of Portfolio Management, Spring 2015, 41 (3), 97-108.
5. See Patton, A.J., Sheppard, K. (2009). Evaluating Volatility and Correlation Forecasts. In: Mikosch, T., Kreiß, J.P., Davis, R., Andersen, T. (eds) Handbook of Financial Time Series. Springer, Berlin, Heidelberg.
6. See Engle, R. (2002). Dynamic Conditional Correlation. Journal of Business &amp; Economic Statistics, 20(3), 339-350.
7. That being said, I find that what Engle6 describes is closer to an iterated $n$-univariate GARCH volatility forecasting model/EWMA correlation matrix forecasting model than to the iterated EWMA forecasting model of Johansson et al.3…
8. C.f. the associated blog posts here and there.
9. See Gianluca De Nard, Robert F. Engle, Olivier Ledoit, Michael Wolf, Large dynamic covariance matrices: Enhancements based on intraday data, Journal of Banking &amp; Finance, Volume 138, 2022, 106426.
10. See Menchero, Jose and Peng Li, Correlation Shrinkage: Implications for Risk Forecasting, Journal of Investment Management (2020).
11. Digging a little deeper, the old MAC1 and MAC2 Bloomberg multi-asset risk models were using a half-life of 26 weeks for estimating volatilities and a half-life of 52 weeks for estimating correlations.
12. See Andersen, Torben G., Tim Bollerslev, Francis X. Diebold, and Paul Labys (2000), Exchange Rate Returns Standardized by Realized Volatility are (Nearly) Gaussian, Multinational Finance Journal 4, 159-179.
13. See Bahram Pesaran, M. Hashem Pesaran, Conditional volatility and correlations of weekly returns and the VaR analysis of 2008 stock market crash, Economic Modelling, Volume 27, Issue 6, 2010, Pages 1398-1416.
14. See RiskMetrics Technical Document, J.P. Morgan/Reuters, New York, 1996, Fourth Edition.
15. The maximization of the average log-likelihood of the combined [covariance matrix] prediction over [a trailing number of periods]3.
16. This is because my own tests did not highlight any strong improvement in terms of forecasting ability vs. the IEWMA model when used with an optimal pair of decay factors.
17. These ETFs are used in the Adaptive Asset Allocation strategy from ReSolve Asset Management, described in the paper Adaptive Asset Allocation: A Primer22.
18. (Adjusted) daily prices have been retrieved using Tiingo.
19. Using the outer product of asset returns - assuming a mean return of 0 - as covariance proxy, and using an expanding historical window of asset returns.
20. The optimal decay factors ($\lambda$, $\lambda_{vol}$, $\lambda_{cor}$) are computed at the end of every month using all the available asset returns history up to that point in time, as implemented in Portfolio Optimizer; thus, there is no look-ahead bias.
21. The difference between the two models is probably just noise, which, at worst, implies that the added complexity of the IEWMA model with asset-specific univariate EWMA models is useless in practice.
22. See Butler, Adam and Philbrick, Mike and Gordillo, Rodrigo and Varadi, David, Adaptive Asset Allocation: A Primer.</summary></entry></feed>