Regime-Aware Asset Allocation: a Statistical Jump Model Approach (2024)

Yizhan Shu111Department of Operations Research and Financial Engineering, Princeton University, Princeton, USA,  Chenyu Yu11footnotemark: 1,  John M. Mulvey11footnotemark: 1222Yizhan Shu (corresponding author), yizhans@princeton.edu,Chenyu Yu, chenyu@princeton.edu,John M. Mulvey, mulvey@princeton.edu

(February 7, 2024)

Abstract

This article investigates the impact of regime switching on asset allocation decisions, with a primary focus on comparing different regime identification models.In contrast to traditional Markov-switching models, we adopt the statistical jump model, a recently proposed robust model known for its ability to capture persistent market regimes by applying an explicit jump penalty.The feature set of our jump model comprises return and volatility features derived solely from the price series.We introduce a data-driven approach for selecting the jump penalty within a time-series cross-validation framework, which directly optimizes the performance metric of the regime-aware asset allocation strategy constructed following a comprehensive multi-step process.Through empirical analysis using daily return series from major US equity indices, we highlight the outperformance of employing jump models in comparison to both buy-and-hold strategies and Markov-switching asset allocation approaches.These results underline the enhanced robustness, interpretability, and realism inherent in asset allocation strategies guided by jump models, offering insights for portfolio managers.

Keywords: Dynamic Asset Allocation; Statistical Jump Models; Regime Switching; Temporal Clustering

1 Introduction

Throughout history, traded markets have exhibited changing patterns of performance, such as significant spikes in volatility and correlation during market crashes.These dynamic market conditions are driven by a complex interplay of behavioral, political, and economic factors.A natural approach to understanding these variations is the identification of financial regimes – distinct time periods characterized by relatively hom*ogeneous behavior that persists over extended durations.The study of regime-switching models originates from the work of Hamilton, (1989), which primarily focused on macroeconomics and the study of business cycles.It was further extended into the realm of finance by Turner etal., (1989) and Rydén etal., (1998), with a specific focus on stock market returns.Building upon these seminal works, various regime-switching models have been developed and applied to financial data (Hardy,, 2001; Ang and Bekaert,, 2002; Choi and Hammoudeh,, 2010).

Following the initial development of regime models, researchers began applying them to a range of downstream tasks, including portfolio construction.Many studies (Ang and Bekaert,, 2004; Guidolin and Timmermann,, 2007; Tu,, 2010) have highlighted the advantages of integrating regime information into portfolio management strategies.Particularly after the 2008 financial crisis, the limitations of traditional strategic asset allocation, which aimed to create static “all-weather” portfolios, became evident. This led to a growing interest in dynamic asset allocation, which adapts portfolio weights based on market regime information to leverage favorable conditions and mitigate potential drawdown (Sheikh and Sun,, 2012).Subsequently, various regime-switching asset allocation models have been introduced (van Vliet and Blitz,, 2011; Bae etal.,, 2014; Nystrup etal.,, 2015; Kim and Kwon,, 2023).

To incorporate regime information into portfolio decisions, we propose an interpretable four-step framework for the development of a regime-aware asset allocation strategy that includes 1. regime identification, 2. regime forecast, 3. regime-based portfolio model, and 4. out-of-sample testing.The first step constitutes an unsupervised learning problem, as the market regime remains a latent variable necessitating labeling for each period.The second step turns into a supervised problem focused on forecasting the probability of future regime states.Subsequently, regime forecasts are integrated into a portfolio optimization model, such as risk parity, in diverse ways.The final step involves the assessment of strategy performance over an independent test period.Numerous pieces of literature discussed in this article align with this overarching four-step process, and we find this framework valuable for its interpretability and flexibility.This multi-step approach draws from foundations in financial and behavioral economics and holds promise for achieving strong empirical performance.In contrast, an alternative methodology in the form of conditional portfolio optimization (Tuck etal.,, 2022; Chan etal.,, 2023) bases portfolio decisions on observable market conditions, with regime characteristics implicitly embedded into the strategy.While this method integrates regime dynamics indirectly, our direct, step-wise approach aims to provide enhanced strategy interpretability.

Among the studies on regime-switching asset allocation, the work of Bulla etal., (2011) holds particular relevance to our research.Their study explores a basic two-asset scenario, switching between an all-cash and all-index position based on forecasts of whether the next period belongs to the high-volatility regime.In this article, we follow their principles because their straightforward strategy framework forms a solid foundation for further study.We integrate their strategy seamlessly into our four-step process, introducing a key advancement in the first step of regime identification.Here, we replace hidden Markov models (HMMs) with statistical jump models333It is worth noting that the statistical jump models discussed herein are not related to jump-diffusion models, a common class of stochastic processes. (JMs), a recently introduced robust model known for its ability to identify persistent market regimes.This modification has the potential to address some of the limitations of HMMs in financial applications, including issues related to statistical accuracy (Nystrup etal., 2020b, ; Nystrup etal., 2020a, ) and the persistence of inferred hidden state sequences (Bulla,, 2011).

1.1 Contributions

This article employs jump models to construct a practical regime-aware asset allocation strategy. This strategy, developed following our proposed four-step framework, is rigorously evaluated in a realistic out-of-sample testing that considers transaction costs.To ensure a fair comparison with HMMs, we construct our jump model using a straightforward feature set comprising return and volatility features derived solely from the price series itself.While prior research has demonstrated the improved statistical accuracy achieved by JMs in identifying hidden state sequences, our main contribution lies in showcasing that this can have tangible financial implications to generate excess profits.

Our second contribution centers on the selection of the jump penalty, a critical hyperparameter in the application of jump models.We introduce a time-series cross-validation methodology, which directly optimizes the performance metric of downstream financial tasks linked to JMs.More precisely, we select the jump penalty that maximizes the Sharpe ratio of the JM-guided strategy over the validation period.In contrast to prior approaches that primarily rely on statistical criteria, our innovation lies in selecting a jump penalty that maximizes the practical benefits for financial applications associated with JMs.

The empirical results, based on the daily return series from three major US indices spanning over a 50-year period, suggest that during the validation period, the impact of the jump penalty on strategy performance is favorable, facilitating the selection of the optimal jump penalty at the point where performance metrics peak.In the subsequent test period, our strategy, guided by the chosen jump penalty, outperforms both HMM-based and buy-and-hold strategies across various performance metrics, including return, Sharpe ratio, and maximum drawdown.Visualization reveals that the regimes identified by JMs are more persistent and interpretable compared to those identified by HMMs, leading to higher returns.

The paper proceeds as follows:Section 2 delves into the methodology, explaining the details of JMs and the asset allocation strategy framework within the context of our proposed four-step process.Section 3 covers the data and features used in our analysis.In Section 4, we present empirical results, comparing strategy performance and visualizing identified regimes.Finally, our paper concludes with key insights and implications in Section 5.

2 Methodology

2.1 Statistical Jump Models

Hidden Markov models (HMMs) have been a foundational regime-switching model in financial data analysis, owing to their natural representation of hidden state variables as regimes.Nevertheless, recent studies have highlighted that inherent characteristics in financial time series, including state persistence & imbalance, time-varying parameters, and limited sample sizes, have led to questions about HMM’s statistical accuracy for regime identification (Nystrup etal., 2020b, ; Nystrup etal., 2020a, ), potentially affecting downstream performance444As highlighted by Hess, (2006), “a wrong regime forecast not only may lead to a non-optimal but to a detrimental allocation in the contrary direction relative to the ‘neutral’ single state”..Additionally, HMMs, as parametric models, are susceptible to model mis-specification in both the conditional distributions and the sojourn time distribution (Bulla and Bulla,, 2006), producing inferred state sequences that lack genuine persistence (Bulla,, 2011).The excess regime shifts can potentially lead to elevated turnover and transaction costs. To address these challenges, various non-parametric, data-driven alternatives like trend filtering (Mulvey and Liu,, 2016) and spectral clustering HMMs (Zheng etal.,, 2021) have emerged.This article focuses on the application of jump models as one such alternative.

In a recent work, Bemporad etal., (2018) introduced statistical jump models (JMs), an unsupervised learning model that fits multiple model parameters to a time series while incorporating temporal information.Specifically, given an observation sequence of D𝐷Ditalic_D (standardized) features 𝒀:={𝒚0,,𝒚T1}assign𝒀subscript𝒚0subscript𝒚𝑇1\bm{Y}:=\{\bm{y}_{0},\ldots,\bm{y}_{T-1}\}bold_italic_Y := { bold_italic_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , … , bold_italic_y start_POSTSUBSCRIPT italic_T - 1 end_POSTSUBSCRIPT }, where 𝒚tDsubscript𝒚𝑡superscript𝐷\bm{y}_{t}\in\mathbb{R}^{D}bold_italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ roman_ℝ start_POSTSUPERSCRIPT italic_D end_POSTSUPERSCRIPT for all t𝑡titalic_t, we estimate a JM with K𝐾Kitalic_K states by solving the optimization problem

argminΘ,𝑺t=0T1l(𝒚t,𝜽st)+λt=1T1𝟙{st1st},subscriptargminΘ𝑺superscriptsubscript𝑡0𝑇1𝑙subscript𝒚𝑡subscript𝜽subscript𝑠𝑡𝜆superscriptsubscript𝑡1𝑇1subscriptdouble-struck-𝟙subscript𝑠𝑡1subscript𝑠𝑡\operatorname*{arg\,min}_{\Theta,\bm{S}}\sum_{t=0}^{T-1}l(\bm{y}_{t},\bm{%\theta}_{s_{t}})+\lambda\sum_{t=1}^{T-1}\mathbb{1}_{\left\{s_{t-1}\neq s_{t}%\right\}}\,,start_OPERATOR roman_arg roman_min end_OPERATOR start_POSTSUBSCRIPT roman_Θ , bold_italic_S end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T - 1 end_POSTSUPERSCRIPT italic_l ( bold_italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) + italic_λ ∑ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T - 1 end_POSTSUPERSCRIPT blackboard_𝟙 start_POSTSUBSCRIPT { italic_s start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT ≠ italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT } end_POSTSUBSCRIPT ,(1)

where Θ:={𝜽kD:k=0,,K1}assignΘconditional-setsubscript𝜽𝑘superscript𝐷𝑘0𝐾1\Theta:=\left\{\bm{\theta}_{k}\in\mathbb{R}^{D}:k=0,\ldots,K-1\right\}roman_Θ := { bold_italic_θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ roman_ℝ start_POSTSUPERSCRIPT italic_D end_POSTSUPERSCRIPT : italic_k = 0 , … , italic_K - 1 } represents the collection of K𝐾Kitalic_K model parameters, 𝑺:={s0,,sT1}assign𝑺subscript𝑠0subscript𝑠𝑇1\bm{S}:=\{s_{0},\ldots,s_{T-1}\}bold_italic_S := { italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , … , italic_s start_POSTSUBSCRIPT italic_T - 1 end_POSTSUBSCRIPT } denotes the unobserved state sequence, l(,)𝑙l(\cdot,\cdot)italic_l ( ⋅ , ⋅ ) is a loss function between observations and model parameters, and λ+𝜆subscript\lambda\in\mathbb{R}_{+}italic_λ ∈ roman_ℝ start_POSTSUBSCRIPT + end_POSTSUBSCRIPT is a hyperparameter called the jump penalty. Each hidden state stsubscript𝑠𝑡s_{t}italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT takes on values in {0,,K1}0𝐾1\left\{0,\ldots,K-1\right\}{ 0 , … , italic_K - 1 } and is associated with a state-specific model parameter 𝜽stsubscript𝜽subscript𝑠𝑡\bm{\theta}_{s_{t}}bold_italic_θ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT. We adhere to the standard practice of employing the scaled squared 2subscript2\ell_{2}roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT-distance as the loss function, defined by l(𝒚,𝜽):=12𝒚𝜽22assign𝑙𝒚𝜽12subscriptsuperscriptnorm𝒚𝜽22l(\bm{y},\bm{\theta}):=\frac{1}{2}\left\|\bm{y}-\bm{\theta}\right\|^{2}_{2}italic_l ( bold_italic_y , bold_italic_θ ) := divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∥ bold_italic_y - bold_italic_θ ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT.After estimating a jump model, we can infer the transition probability matrix based on the optimal hidden state sequence and compute metrics such as mean and volatility within each cluster.In this article, the optimal hidden state s^tsubscript^𝑠𝑡\hat{s}_{t}over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT serves as the regime label for period t𝑡titalic_t, a crucial role in our analysis.

The objective function (1) represents a tradeoff between fitting the data with multiple model parameters and incorporating our prior beliefs about the persistence of the state sequence.At its core, the hyperparameter λ𝜆\lambdaitalic_λ acts as a regulator for the fixed-cost regularization term, which is incurred whenever there is a transition between states.With a jump penalty of zero, the model simplifies to a k𝑘kitalic_k-means clustering algorithm, disregarding temporal information.As λ𝜆\lambdaitalic_λ increases, state transitions become less frequent, until eventually all data points are classified into a single cluster.Previously, the optimal selection of the hyperparameter λ𝜆\lambdaitalic_λ has relied on statistical criteria such as information criteria (Cortese etal., 2023b, ) or in-sample accuracy (Nystrup etal., 2020b, ).This article, however, introduces a novel approach utilizing a cross-validation technique specifically tailored to the downstream financial applications, as elaborated in the Step four of the next subsection.

Minimizing objective function (1) jointly over model parameters ΘΘ\Thetaroman_Θ and the hidden state sequence 𝑺𝑺\bm{S}bold_italic_S typically involves a coordinate descent algorithm with two stages: 1. optimizing model parameters given a fixed state sequence, and 2. optimizing the state sequence given fixed model parameters.The second stage is crucial for online state inference in a real-time live setting, as discussed in Step one of the next subsection. Due to the non-convex nature of the objective function, we run the coordinate descent algorithm ten times, each starting from a different initial value generated by the k𝑘kitalic_k-means+++++ + algorithm, and retain the best result555Our HMM implementation using the Baum-Welch algorithm is also initialized ten times for improved performance..

Bemporad etal., (2018) demonstrate that, under certain assumptions, JMs nest HMMs in the probabilistic setting. Subsequently, Nystrup etal., 2020b introduced JMs to financial applications, utilizing the feature set initially proposed in Zheng etal., (2021)666A detailed explanation of the feature set we use can be found in Section 3.2..Their work showcases that JMs can achieve improved statistical accuracy compared to HMMs, when applied to financial data, and can yield a more persistent hidden state sequence.

Several extensions to JMs have emerged.Nystrup etal., (2021) introduced a feature selection procedure on top of the jump model for handling high-dimensional feature sets, which gives JMs an advantage over HMMs in such scenarios.Additionally, Aydinhan etal., (2023) expanded JMs into the continuous statistical jump model (CJM) by generalizing the discrete hidden state variable into a probability vector over all states.This extension establishes a probabilistic interpretation where the hidden state vector represents the probability of each period belonging to each regime.While these probability values can be useful for the Step three of portfolio optimization models, our studys employ a straightforward 0/1 weight in the portfolio model, making no significant difference between using JM and CJM in terms of final strategy performance. Consequently, we present results for JM alone for simplicity.

2.2 Regime-Aware Asset Allocation Strategy

In this subsection, we will detail our asset allocation strategy.Before explaining each of the four steps, we want to emphasize three key principles drawn from Bulla etal., (2011).First, we use daily data instead of monthly data to reduce the impact of incorrect regime forecasts, which now typically span only a few days instead of a whole month.Second, we employ a two-state regime model, with regimes distinguished primarily by their volatility.In terms of notation, we assume the inferred hidden state s^t{0,1}subscript^𝑠𝑡01\hat{s}_{t}\in\left\{0,1\right\}over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ { 0 , 1 }, with s^t=1subscript^𝑠𝑡1\hat{s}_{t}=1over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = 1 indicating the high-volatility regime.Throughout our discussion, we will occasionally refer to the high-volatility regime as the market crash or bear market.Additionally, we impose a conservative transaction cost of 10 basis points for each one-way trade throughout the study.

Step One: Regime Identification

The first step involves real-time regime identification, where we use JMs to infer each day’s hidden state s^tsubscript^𝑠𝑡\hat{s}_{t}over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT as the regime label, immediately after the market return data rtsubscript𝑟𝑡r_{t}italic_r start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT for that day becomes available, without using any data beyond day t+1𝑡1t+1italic_t + 1777It is worth emphasizing that, due to the unsupervised nature of regime identification, we can infer the regime for day t𝑡titalic_t only at the end of that day when market data for day t𝑡titalic_t becomes available, a task referred to as “nowcasting” by James etal., (2019). Regime forecasting, i.e. predicting the regime for day t+1𝑡1t+1italic_t + 1 at the end of day t𝑡titalic_t, is addressed as a separate task in Step two of our four-step process..To achieve this, we initially fit a full two-state JM every six months, using a 2000-day lookback window covering a typical bull/bear market cycle. In essence, we update the model parameters biannually.Then within each six-month interval, we perform online state inference to obtain the regime label s^tsubscript^𝑠𝑡\hat{s}_{t}over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT in a live setting.Specifically, for each day t𝑡titalic_t, we run the second stage of the coordinate descent algorithm, minimizing the objective function (1) over the hidden state sequence while keeping model parameters fixed at their most recently fitted values, using data from the beginning of the most recent fitting window up to day t𝑡titalic_t.After obtaining the optimal hidden state sequence, we extract the final day’s state as the online state inference s^tsubscript^𝑠𝑡\hat{s}_{t}over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT for day t𝑡titalic_t, ensuring that the regime label s^tsubscript^𝑠𝑡\hat{s}_{t}over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT relies solely on information available up to that day. Our approach differs from Bulla’s, which applied an HMM to a rolling window of the same length but fitted a full model every day without performing online inference.We believe that the regime model parameters ΘΘ\Thetaroman_Θ should remain consistent rather than changing daily.The jump penalty selection process is discussed in Step four.

Step Two: Regime Forecast

We follow the same approach as Bulla etal., (2011) by using the regime label s^tsubscript^𝑠𝑡\hat{s}_{t}over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT as the forecast ft+1subscript𝑓𝑡1f_{t+1}italic_f start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT for tomorrow. This assumption relies on the belief that daily market regimes exhibit high persistence and do not change frequently.Some studies have treated the regime forecasting task as a separate classification algorithm, using information up to day t𝑡titalic_t to predict the regime label s^t+1subscript^𝑠𝑡1\hat{s}_{t+1}over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT for day t+1𝑡1t+1italic_t + 1.These labels can either be learned from Step one, as demonstrated in Uysal and Mulvey, (2021), or obtained from external sources such as the NBER recession indicators, as shown in James etal., (2019).We leave the comparison between a simple forecast, as presented here, and a separate machine learning classification algorithm for future research to determine which approach is superior.

Step Three: Portfolio Model

We maintain a two-asset world, switching between a weight of 0.0 or 1.0 in the index based on the regime forecast ft+1subscript𝑓𝑡1f_{t+1}italic_f start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT.If the next day is predicted to be in the high-volatility regime, we allocate the entire investment to the risk-free asset; otherwise, it all goes into the index.As demonstrated by Bulla etal., (2011), this volatility-averse portfolio strategy effectively reduces volatility while modestly improving the overall strategy’s Sharpe ratio after accounting for transaction costs.While other portfolio models in a two-asset world, such as weighting as a linear function of probability forecast, exist, Nystrup etal., (2016) has shown that they fall short of the seemingly simple 0/1 strategy in terms of performance.Alternative possibilities, such as short selling the index, fail to achieve a reduction in volatility and consequently do not yield satisfactory results.We leave the exploration of integrating regime forecasts into advanced portfolio models like the Black–Litterman model (Kolm etal.,, 2021) within our four-step process to future research.

It’s worth noting that selling all index positions shortly after buying them is impractical for portfolio managers.Hence, the persistence of the inferred regime label sequence {s^t}subscript^𝑠𝑡\left\{\hat{s}_{t}\right\}{ over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT } is crucial.Bulla etal., (2011) address this by applying a median-filtering procedure on the label sequence, which, in essence, means that the investor only switches asset allocation positions when a majority of days within a waiting period indicate a regime shift.Nystrup etal., (2015), utilizing an adaptively estimated HMM, adopts a similar procedure, shifting positions only when “the confidence in the (regime shift) prediction is above 95%”.These procedures enhance performance by mitigating the lack of persistence seen in HMM’s inferred state sequence.In our approach, thanks to the inherent persistence offered by JMs, we can avoid such filtering altogether, eliminating the introduction of additional hyperparameters related to the number of waiting days or confidence levels.

Step Four: Out-of-sample Testing

We employ a single time-series data split, resulting in training, validation, and test periods888While a nested cross-validation process, which would update the optimal value of λ𝜆\lambdaitalic_λ periodically (e.g., annually), could potentially enhance performance, we prioritize robustness by seeking a unique optimal jump penalty that is effective over the entire test period.. We identify the optimal jump penalty λ𝜆\lambdaitalic_λ based on the strategy’s performance during the validation period and apply this λ𝜆\lambdaitalic_λ throughout the test period to assess its effectiveness.To achieve this, for each candidate value of λ𝜆\lambdaitalic_λ, we aggregate its online forecasts over the validation period and calculate various strategy performance metrics999A detailed list of performance metrics can be found in Section 4..The jump penalty that leads to the highest Sharpe ratio achieved during the validation period is selected.Our analysis indicates that this chosen λ𝜆\lambdaitalic_λ also excels in other performance metrics.To ensure the robustness of this selection, we employ a long enough validation period in the empirical studies.We then apply this selected penalty on the test period to evaluate and report the final performance metrics.

3 Data and Features

3.1 Data

The data used in this article includes daily total return series for three major US equity indices: S&P 500, DJIA, and Nasdaq-100, sourced from the Bloomberg Terminal.Table 1 provides details on their tickers, start dates, and the validation/test period durations, all extending until the end of October 2023.We clarify that the strategy is constructed independently for each of the three indices.For the US risk-free rate, we employ the US Treasury bills’ 3-month constant maturity yield from Global Financial Data, following the guidance of Fairbanksa etal., (2021).

Equity IndexTickerStart DateValidation PeriodTest Period
S&P 500SPX1960-11968 – 19901990 – 2023
DJIAINDU1960-11968 – 19901990 – 2023
Nasdaq-100NDX1985-71993 – 20022002 – 2023

3.2 Features

The quality of the feature set plays a pivotal role in enhancing the effectiveness of jump models.In this article, our goal is to develop a feature set for JMs that strikes a balance between informativeness and dimensionality, while solely relying on the log-return series itself.This approach allows a fair comparison with HMMs, our baseline method, which also focuses only on return series101010While incorporating exogenous variables like the VIX index (Nystrup etal.,, 2016) and economic indicators (Kim and Kwon,, 2023) could enhance regime identification, their exploration in JMs is reserved for future research..We prioritize lower dimensionality in the feature set to mitigate the sensitivity of general clustering algorithms to the challenges associated with high-dimensional data, such as collinearity.In clustering, unlike regression where a coefficient of zero can completely exclude a feature, every feature equally contributes to the squared 2subscript2\ell_{2}roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT-distance in the loss function, unless explicitly weighted otherwise.While the ideal process involves creating an extensive feature set and refining it via a feature selection mechanism such as sparse jump models (SJMs) used in cryptocurrency markets (Cortese etal., 2023a, ), we leave it for future research due to the added complexity and extra hyperparameters.

In our analysis, we focus on return and volatility features derived from index return series, as these are crucial components for understanding index performance.We utilize a set of exponentially weighted moving (EWM) measures with varying half-lives, a widely accepted method in both industry and academia.The selection of half-life for these EWM measures is crucial, balancing the consideration of effects over long and short horizons. Short-horizon EWM measures respond quickly to market fluctuations but often include substantial noise prevalent in index returns. In contrast, EWM measures with long half-lives filter out noise but may be less sensitive to recent price changes.This concept is directly relevant to jump models, where features based on short half-lives exhibit high variability and thus yield less persistent regimes even when heavily penalized, while features with longer half-lives produce smoother, more persistent regimes.Due to the substantial amount of noise in daily index returns, based on our experience, we prefer features with longer half-lives, which provide a stable basis for regime identification in jump models.

The composition of our final feature set is detailed in Table 2. As highlighted in previous research (Schwert,, 1989; Bulla,, 2011), market regimes are predominantly characterized by their volatility profiles. Consequently, our feature set comprises three volatility-related features and one return feature, with a primary focus on the volatility aspects.This feature set is applied uniformly across all three indices, without further fine-tuning the half-lives for individual indices.

No.CategoryFeature
1VolDD (hl=20)
2VolDD (hl=20) - DD (hl=60)
3VolDD (hl=60) - DD (hl=120)
4RetReturn (hl=120)

For the three volatility features, we choose exponentially weighted moving downside deviation (EWM-DD) over standard deviation (STD).DD focuses on downside risks by considering only negative returns and excluding potential upside in its calculation (Grootveld and Hallerbach,, 1999), reducing the likelihood of mis-classifying periods with high volatility while positive returns as bear markets, as seen during periods of 1997-1999 preceding the dot-com bubble burst.Another advantage we’ve observed with DD is its tendency to revert to the long-term average faster than STD, especially during market rebound phases that typically follow the end of a market crash.This characteristic allows our strategy to more promptly transition back to an all-index position, capitalizing on the market rebounds.Thus, EWM-DD better aligns with our goal of accurately identifying and profiting from different market regimes.

We explore the incorporation of downside deviation across various horizons, specifically considering half-lives of 20, 60, and 120 days.Figure 1 illustrates the three DD measures alongside the S&P 500 price presented on a logarithmic scale, over the period of 1960-2023.We observe that spikes in DD primarily correspond to periods of falling prices, highlighting the efficacy of using DD features in regime identification.However, directly utilizing these three measures reveals high correlation, leading to inefficiency in clustering algorithms.To overcome this, we utilize the differences between DD measures across different half-lives.This approach not only mitigates collinearity but also introduces a momentum aspect by contrasting long-horizon and short-horizon effects, efficiently synthesizing DD measures across various horizons.

Regime-Aware Asset Allocation: a Statistical Jump Model Approach (1)
Regime-Aware Asset Allocation: a Statistical Jump Model Approach (2)

In our analysis of return features, we initially considered exponentially weighted moving average returns (EWMAs) with half-lives of both 60 and 120 days, as depicted in Figure 2. However, as anticipated, the EWMA proves to be notably noisy even with a half-life as long as 60 days. Consequently, we include only the EWMA with a half-life of 120 days in our feature set.

4 Empirical Results

4.1 Validation Period

We start our empirical studies by analyzing the strategy’s performance during the validation period, focusing exclusively on the results for S&P 500 for brevity. The out-of-sample performance for the other two indices will be presented in the next subsection. Recall that our dataset begins in 1960, with a rolling window length of 8 years, and we desire a sufficiently long validation period to ensure a robust selection of the optimal jump penalty.Therefore, for S&P 500, we select a validation period spanning from 1968 to 1990.We consider a range of candidate jump penalty values and evaluate the strategy’s performance during the validation period across various metrics, including annualized return & volatility, Sharpe ratio, downside deviation (DD), Sortino ratio, maximum drawdown (MDD), Calmar ratio, and average daily turnover.Table 3 presents these performance metrics of the strategy across all candidate λ𝜆\lambdaitalic_λ values, with the performance of S&P 500 included in the last column as a reference.It is worth emphasizing that we impose a transaction cost of 10 basis points for each one-way trade throughout our empirical study.

λ=𝜆absent\lambda=italic_λ =10.22.50.100.220.500.1000.S&P 500
return0.09930.10550.10830.11560.10890.08780.10570.1057
vol0.13050.13160.13480.13580.13860.14220.15300.1530
Sharpe0.23930.28160.29720.34450.29620.15540.26250.2625
DD0.09480.09490.09710.09750.09930.10240.10930.1093
Sortino0.32460.38540.40750.47390.40860.21330.36410.3641
MDD-0.3293-0.3293-0.3293-0.3293-0.3293-0.3550-0.4480-0.4480
Calmar0.09350.11100.12010.14030.12330.06150.08880.0888
turnover0.00690.00430.00290.00180.00140.00110.00000.0000

In Table 3, we make several observations regarding how different λ𝜆\lambdaitalic_λ values affect the strategy’s performance.As λ𝜆\lambdaitalic_λ decreases, turnover increases, indicating more frequent regime shifts, as expected due to the smaller jump penalty.Simultaneously, both volatility and downside deviation decrease with smaller λ𝜆\lambdaitalic_λ values, resulting from the reduced market exposure during high-volatility periods.Another risk measure, maximum drawdown, also improves when regime switching is incorporated into the strategy.Additionally, concerning return and risk-adjusted return metrics including Sharpe, Sortino, and Calmar ratios, a clear trend is evident that they initially increase and then decrease with smaller λ𝜆\lambdaitalic_λ values, forming an inverted U shape.Four λ𝜆\lambdaitalic_λ values between 22.0 and 220.0 outperform the index across various metrics.Furthermore, nearly all the metrics reach their optimal values at λ=100.0𝜆100.0\lambda=100.0italic_λ = 100.0, indicating it as the best-performing jump penalty during the validation period, and we will apply this value throughout the test period in the next subsection.These observed patterns remain consistent for the other two indices, underscoring the robustness of our approach in selecting the optimal jump penalty.

The U-shape pattern arises because when λ𝜆\lambdaitalic_λ is set too large, regime switches become prohibitively costly, causing the model to overlook prolonged market crashes and exhibit delayed reactions to market conditions.Conversely, a too-small λ𝜆\lambdaitalic_λ leads to an excessive focus on separating daily returns into two clusters, akin to a k𝑘kitalic_k-means algorithm, disregarding temporal consistency.As a result, the model reacts to minor, noise-driven fluctuations in input features that do not persist into the following day.This lack of persistence, combined with high turnover, results in sub-optimal performance, mirroring the shortcomings observed in HMMs, as visualized in the next subsection.

For a visual representation of regime identification, in Figure 3 we plot the performance curve of our strategy with the optimally selected λ=100.0𝜆100.0\lambda=100.0italic_λ = 100.0, shown in orange, over the validation period from 1968 to 1990, together with the index price in blue color, both in a logarithmic scale.The shaded areas highlighted in red indicate the forecasted high-volatility regime, during which we allocate all assets into the risk-free asset.Notably, our jump model effectively identifies two major crashes around 1970 and 1974, during which our strategy outperforms the index.In the 1970 crash, our strategy avoids a significant drawdown with high volatility, while benefiting from a considerably high risk-free rate during the stagflation period.The crash in 1974 witnesses a substantial outperformance of our strategy in terms of return compared to the index.However, it’s worth noting that we were unable to detect the 1987 Black Monday due to its rapid and drastic drop, which our features typically do not respond to that quickly.Furthermore, our identified regimes exhibit high persistence, requiring only 10 asset re-allocations over a 22-year period.Although this may not align with the trading frequency of a real portfolio manager, our model provides valuable signal with reduced false alarms.

Regime-Aware Asset Allocation: a Statistical Jump Model Approach (3)

4.2 Test Period

After selecting the jump penalty that leads to the best performance during the validation period, we apply it consistently throughout the test period.Table 4 presents a comparison of out-of-sample performance metrics between the JM-based regime-switching strategy (with an optimal λ=100.0𝜆100.0\lambda=100.0italic_λ = 100.0), the HMM-based strategy, and the S&P 500 index.It is worth noting that the hidden state sequence generated by the HMM has undergone the median filtering procedure as implemented in Bulla etal., (2011); without this filtering, the performance is worse.Comparing the HMM-based strategy with the index itself, we observe that the former achieves higher risk-adjusted return metrics, such as Sharpe and Sortino ratios, primarily by significantly reducing volatility at the expense of return. Notably, it achieves the most substantial improvement in controlling drawdown.

In contrast, when comparing the HMM- and JM-based strategies, our approach realizes a boost in annualized return, increasing from 9.82% of the market to 12.55%.This substantial improvement translates into higher risk-adjusted return, with our Sharpe ratio reaching 0.78, compared to the HMM’s modest increase from 0.46 to 0.51.The outperformance seen in Sortino and Calmar ratios echoes the same story.Furthermore, our turnover is notably lower than that of the HMM, highlighting that the persistence inherent in JM not only reduces transaction costs but, more importantly, enhances returns by avoiding excessive trading.

JM (λ=100.0𝜆100.0\lambda=100.0italic_λ = 100.0)HMMS&P 500
ret0.12550.07510.0982
vol0.12920.09980.1822
Sharpe0.77840.51490.4628
DD0.09020.07210.1289
Sortino1.11370.71180.6532
MDD-0.2450-0.1905-0.5522
Calmar0.41020.26930.1525
turnover0.00160.01350.0000

In Figures 4 and 5, we provide visual representations of the regimes forecasted by JM and HMM, respectively, along with the strategy’s performance curve in orange, over the test period. We also include the index price in blue, both displayed in a logarithmic scale.The high-volatility regimes identified by JM exhibit notable persistence and align with major downturns in the U.S. stock market, particularly during the dot-com bubble, the 2008 financial crisis, and the 2020 COVID-19 crash. These periods account for the majority of the outperformance achieved by our model.It is worth noting that JM is not able to identify the 2022 market downturn caused by interest rate hikes, coinciding with the maximum drawdown period of our strategy.Additionally, while JM promptly identifies the conclusion of the 2008 financial crisis, it exhibits some delay in recognizing the end of other market downturns, such as the dot-com bubble and the 2020 COVID-19 crash, missing some of the subsequent market rebound opportunities.

Regime-Aware Asset Allocation: a Statistical Jump Model Approach (4)

Regime-Aware Asset Allocation: a Statistical Jump Model Approach (5)

In contrast, HMM exhibits a notable frequency of regime shifts, totaling 115 times compared to JM’s 14, resulting in increased transaction costs for the HMM-guided strategy.Furthermore, HMM identifies numerous short-lived regimes primarily driven by the noise in daily market returns.While reducing market exposure based on these false alarms can lower volatility, it does so at the expense of potential market returns.HMM spends more time in the risk-free asset, with 34.4% compared to JM’s 21.0%, explaining the HMM-based strategy’s lower volatility.These characteristics make interpreting and trading HMM signals challenging.When considering the broader regime shape, the main distinctions between HMM and JM lie in two periods that are the pre-dot-com bubble from 1997 to 1999 and the 2022 interest rate hike.HMM identifies both of these periods, missing growth opportunities in the former while avoiding the downturn in the latter.

JM (λ=220.0𝜆220.0\lambda=220.0italic_λ = 220.0)HMMDJIA
ret0.11020.07170.1025
vol0.12450.10520.1752
Sharpe0.69320.46390.4964
DD0.08780.07560.1237
Sortino0.98160.64440.7025
MDD-0.2073-0.3277-0.5178
Calmar0.41580.14880.1678
turnover0.00280.01570.0000
JM (λ=10.0𝜆10.0\lambda=10.0italic_λ = 10.0)HMMNasdaq-100
ret0.13570.09800.1156
vol0.16430.15070.2363
Sharpe0.77340.60480.5225
DD0.11580.10740.1656
Sortino1.09630.84690.7455
MDD-0.2777-0.3698-0.5345
Calmar0.45710.24610.2310
turnover0.00240.01510.0000

The comparative performance of our JM-based strategy against the HMM-based strategy, applied to DJIA and Nasdaq-100, is highlighted in Tables 5 and 6, respectively.This comparison aligns with the results for S&P 500: the JM-guided strategy achieves an increase in annualized return by 1-2%, resulting in a more substantial improvement in the Sharpe and Sortino ratios.Additionally, there is an enhancement in the maximum drawdown for both indices compared to both the index itself and the HMM-based strategy, leading to a higher Calmar ratio.Furthermore, the turnover is significantly reduced by approximately a factor of 5.It’s important to note that the optimally selected jump penalties vary for different indices.

Finally, Figure 6 presents the visualization of the regimes identified by JM for Nasdaq-100, which is considered distinct from the other two indices.The notable persistence observed in these regimes remains consistent with previous observations, and JM effectively identifies the 2022 market downturn caused by interest rate hikes.These achievements by JM are promising, given the simplicity of our approach, which doesn’t rely on advanced portfolio optimization techniques and utilizes only a minimal set of features.We demonstrate that the improved statistical accuracy of JMs in identifying market regimes has significant financial implications, enabling the generation of excess profits.

Regime-Aware Asset Allocation: a Statistical Jump Model Approach (6)

5 Conclusion

In this article, we present an interpretable multi-step framework for the development of a regime-aware asset allocation strategy.Adhering to this framework, we develop a strategy that employs statistical jump models for the first step of regime identification, followed by an empirical evaluation of its performance through rigorous out-of-sample testing that considers transaction costs.Our feature set for the jump model is straightforward, comprising return and volatility features derived exclusively from the return series.When it comes to selecting the jump penalty, we depart from traditional methods that rely solely on statistical criteria.Instead, we utilize a time-series cross-validation approach that directly optimizes the strategy’s performance metric over the validation period.This approach prioritizes the financial implications of the jump penalty, ensuring it benefits downstream tasks to the greatest extent.By doing so, we emphasize the practical and financial relevance of our methodology over a purely statistical perspective.

Our empirical results, spanning a 50-year period across three major US indices, suggest that the improved statistical accuracy of JMs in identifying persistent market regimes has significant financial implications for generating real profits.Specifically, throughout the validation period, the strategy’s performance exhibits an inverted U-shaped trend with decreasing jump penalty.This pattern aids in our selection of the optimal jump penalty, where we select the value corresponding to the peak performance.Subsequently, during the test period, our strategy, guided by the optimally chosen jump penalty, consistently outperforms HMM-based strategies and buy-and-hold strategies across various metrics, including annualized return, risk-adjusted return metrics like the Sharpe ratio, and maximum drawdown.Further, the visualization of regime forecasts highlights that this enhanced performance originates from the jump model’s ability to capture persistent market downturns, in contrast to HMMs, which tend to identify transient and short-lived regimes that lead to sub-optimal strategy performance.

Future directions include expanding the feature set to incorporate macroeconomic indicators and implementing feature selection in Step one.Further enhancements to Step two and three, such as exploring a multi-asset universe, developing a dedicated regime forecast algorithm, or employing advanced portfolio optimization models, hold promise for future research.

References

  • Ang and Bekaert, (2002)Ang, A. and Bekaert, G. (2002).Regime switches in interest rates.Journal of Business & Economic Statistics, 20(2):163–182.
  • Ang and Bekaert, (2004)Ang, A. and Bekaert, G. (2004).How regimes affect asset allocation.Financial Analysts Journal, 60(2):86–99.
  • Aydinhan etal., (2023)Aydinhan, A.O., Kolm, P.N., Mulvey, J.M., and Shu, Y. (2023).Identifying patterns in financial markets: Extending the statisticaljump model for regime identification.SSRN.
  • Bae etal., (2014)Bae, G.I., Kim, W.C., and Mulvey, J.M. (2014).Dynamic asset allocation for varied financial markets under regimeswitching framework.European Journal of Operational Research, 234(2):450–458.60 years following Harry Markowitz’s contribution to portfoliotheory and operations research.
  • Bemporad etal., (2018)Bemporad, A., Breschi, V., Piga, D., and Boyd, S.P. (2018).Fitting jump models.Automatica, 96:11–21.
  • Bulla, (2011)Bulla, J. (2011).Hidden Markov models with t components: Increased persistence andother aspects.Quantitative Finance, 11(3):459–475.
  • Bulla and Bulla, (2006)Bulla, J. and Bulla, I. (2006).Stylized facts of financial time series and hidden semi-Markovmodels.Computational Statistics & Data Analysis, 51(4):2192–2209.
  • Bulla etal., (2011)Bulla, J., Mergner, S., Bulla, I., Sesboüé, A., and Chesneau, C. (2011).Markov-switching asset allocation: Do profitable strategies exist?Journal of Asset Management, 12:310–321.
  • Chan etal., (2023)Chan, E., Fan, H., Sawal, S., and Viville, Q. (2023).Conditional portfolio optimization: Using machine learning to adaptcapital allocations to market regimes.SSRN.
  • Choi and Hammoudeh, (2010)Choi, K. and Hammoudeh, S. (2010).Volatility behavior of oil, industrial commodity and stock markets ina regime-switching environment.Energy Policy, 38(8):4388–4399.
  • (11)Cortese, F., Kolm, P., and Lindström, E. (2023a).What drives cryptocurrency returns? a sparse statistical jump modelapproach.Digit Finance.
  • (12)Cortese, F.P., Kolm, P.N., and Lindström, E. (2023b).Generalized information criteria for sparse statistical jump models.In Linde, P., editor, Symposium i anvendt statistik, Vol 44,Copenhagen. Copenhagen Business School.
  • Fairbanksa etal., (2021)Fairbanksa, J.C., Griffithsb, M.D., and Wintersc, D.B. (2021).On the use of the daily fama–french risk-free rate.Journal of Investment Management, 19(3):39–59.
  • Grootveld and Hallerbach, (1999)Grootveld, H. and Hallerbach, W. (1999).Variance vs downside risk: Is there really that much difference?European Journal of operational research, 114(2):304–319.
  • Guidolin and Timmermann, (2007)Guidolin, M. and Timmermann, A. (2007).Asset allocation under multivariate regime switching.Journal of Economic Dynamics and Control, 31(11):3503–3544.
  • Hamilton, (1989)Hamilton, J.D. (1989).A new approach to the economic analysis of nonstationary time seriesand the business cycle.Econometrica, 57(2):357–384.
  • Hardy, (2001)Hardy, M.R. (2001).A regime-switching model of long-term stock returns.North American Actuarial Journal, 5(2):41–53.
  • Hess, (2006)Hess, M.K. (2006).Timing and diversification: A state-dependent asset allocationapproach.European Journal of Finance, 12(3):189–204.
  • James etal., (2019)James, A., Abu-Mostafa, Y.S., and Qiao, X. (2019).Machine learning for recession prediction and dynamic assetallocation.The Journal of Financial Data Science, 1(3):41–56.
  • Kim and Kwon, (2023)Kim, M.J. and Kwon, D. (2023).Dynamic asset allocation strategy: An economic regime approach.Journal of Asset Management, 24:136–147.
  • Kolm etal., (2021)Kolm, P.N., Ritter, G., and Simonian, J. (2021).Black–litterman and beyond: The bayesian paradigm in investmentmanagement.The Journal of Portfolio Management, 47(5):91–113.
  • Mulvey and Liu, (2016)Mulvey, J.M. and Liu, H. (2016).Identifying economic regimes: Reducing downside risks for universityendowments and foundations.The Journal of Portfolio Management, 43(1):100–108.
  • Nystrup etal., (2015)Nystrup, P., Hansen, B.W., Madsen, H., and Lindström, E. (2015).Regime-based versus static asset allocation: Letting the data speak.The Journal of Portfolio Management, 42(1):103–109.
  • (24)Nystrup, P., Kolm, P.N., and Lindström, E. (2020a).Greedy online classification of persistent market states usingrealized intraday volatility features.The Journal of Financial Data Science, 2(3):25–39.
  • Nystrup etal., (2021)Nystrup, P., Kolm, P.N., and Lindström, E. (2021).Feature selection in jump models.Expert Systems with Applications, 184:115558.
  • (26)Nystrup, P., Lindström, E., and Madsen, H. (2020b).Learning hidden Markov models with persistent states by penalizingjumps.Expert Systems with Applications, 150:113307.
  • Nystrup etal., (2016)Nystrup, P., WilliamHansen, B., Madsen, H., and Lindström, E. (2016).Detecting change points in VIX and S&P 500: A new approach todynamic asset allocation.Journal of Asset Management, 17:361–374.
  • Rydén etal., (1998)Rydén, T., Teräsvirta, T., and Åsbrink, S. (1998).Stylized facts of daily return series and the hidden Markov model.Journal of Applied Econometrics, 13(3):217–244.
  • Schwert, (1989)Schwert, G.W. (1989).Why does stock market volatility change over time?The Journal of Finance, 44(5):1115–1153.
  • Sheikh and Sun, (2012)Sheikh, A.Z. and Sun, J. (2012).Regime change: Implications of macroeconomic shifts on asset classand portfolio performance.Journal of Investing, 21(3):36–54.
  • Tu, (2010)Tu, J. (2010).Is regime switching in stock returns important in portfoliodecisions?Management Science, 56(7):1198–1215.
  • Tuck etal., (2022)Tuck, J., Barratt, S., and Boyd, S. (2022).Portfolio construction using stratified models.In Capponi, A. and Lehalle, C.-A., editors, Machine Learning AndData Sciences For Financial Markets. Cambridge University Press.
  • Turner etal., (1989)Turner, C.M., Startz, R., and Nelson, C.R. (1989).A Markov model of heteroskedasticity, risk, and learning in thestock market.Journal of Financial Economics, 25(1):3–22.
  • Uysal and Mulvey, (2021)Uysal, A.S. and Mulvey, J.M. (2021).A machine learning approach in regime-switching risk parityportfolios.The Journal of Financial Data Science, 3(2):87–108.
  • van Vliet and Blitz, (2011)van Vliet, P. and Blitz, D. (2011).Dynamic strategic asset allocation: Risk and return across thebusiness cycle.Journal of Asset Management, 12:360–375.
  • Zheng etal., (2021)Zheng, K., Li, Y., and Xu, W. (2021).Regime switching model estimation: Spectral clustering hiddenMarkov model.Annals of Operations Research, 303:297–319.
Regime-Aware Asset Allocation: a Statistical Jump Model Approach (2024)
Top Articles
Latest Posts
Article information

Author: Barbera Armstrong

Last Updated:

Views: 6423

Rating: 4.9 / 5 (59 voted)

Reviews: 90% of readers found this page helpful

Author information

Name: Barbera Armstrong

Birthday: 1992-09-12

Address: Suite 993 99852 Daugherty Causeway, Ritchiehaven, VT 49630

Phone: +5026838435397

Job: National Engineer

Hobby: Listening to music, Board games, Photography, Ice skating, LARPing, Kite flying, Rugby

Introduction: My name is Barbera Armstrong, I am a lovely, delightful, cooperative, funny, enchanting, vivacious, tender person who loves writing and wants to share my knowledge and understanding with you.