Skip to main content
QUICK REVIEW

[论文解读] Algorithmic Monitoring: Measuring Market Stress with Machine Learning

Marc Schmitt|arXiv (Cornell University)|Feb 5, 2026
Financial Markets and Investment Strategies被引用 0
一句话总结

该论文构建了 MSPI,一种仅限股票市场的实时、概率性压力度量,面向美国股市一个月前瞄准,使用横截面脆弱性信号与 lasso-logit,并将其与非线性学习模型进行比较。

ABSTRACT

I construct a Market Stress Probability Index (MSPI) that estimates the probability of high stress in the U.S. equity market one month ahead using information from the cross-section of individual stocks. Using CRSP daily data, each month is summarized by a set of interpretable cross-sectional fragility signals and mapped into a forward-looking stress probability via an L1-regularized logistic regression in a real-time expanding-window design. Out of sample, MSPI tracks major stress episodes and improves discrimination and accuracy relative to a parsimonious benchmark based on lagged market return and realized volatility, delivering calibrated stress probabilities on an economically meaningful scale. Further, I illustrate how MSPI can be used as a probability-based measurement object in financial econometrics. The resulting index provides a transparent and easily updated measure of near-term equity-market stress risk.

研究动机与目标

  • Motivate the need for algorithmic, real-time market-stress monitoring in electronic markets.
  • Construct an equity-only, transparent Market Stress Probability Index (MSPI) that outputs calibrated one-month-ahead stress probabilities.
  • Compare a sparse lasso-logit MSPI to nonlinear learners (random forests, gradient boosting) under a real-time expanding-window protocol.
  • Demonstrate MSPI’s calibration, discrimination, and relation to realized market outcomes.
  • Highlight MSPI as a usable measurement object for financial econometrics.

提出的方法

  • Define cross-sectional fragility signals from CRSP daily stock data (dispersion, tails, skewness, kurtosis, trading intensity).
  • Aggregate signals monthly to form X_t as predictors for stress.
  • Define stress monthly indicator S_t based on a return cutoff and a volatility quantile q_{t-1}(α).
  • Model MSPI_t as a one-month-ahead stress probability using lasso-logit: MSPI_t = Λ(β_0 + X_t'β), with L1 regularization.
  • Employ an expanding-window real-time learning protocol with initial 120-month training and fixed λ chosen via time-series cross-validation.
  • Benchmark against a parsimonious benchmark using lagged market return and realized volatility with L2 regularization.
  • Assess nonlinear models (random forests, gradient boosting) with Platt-calibration within each training window for probability outputs.
Figure 1: MSPI out-of-sample. The solid line plots the predicted probability that the next month is a stress month. Dots mark months for which stress is realized in the subsequent month under the definition in equation ( 8 ).
Figure 1: MSPI out-of-sample. The solid line plots the predicted probability that the next month is a stress month. Dots mark months for which stress is realized in the subsequent month under the definition in equation ( 8 ).

实验结果

研究问题

  • RQ1Can an equity-only, real-time cross-sectional fragility framework produce a calibrated one-month-ahead probability of market stress?
  • RQ2How does a sparse lasso-logit MSPI compare to nonlinear learners in discrimination and probability accuracy under a real-time expanding-window design?
  • RQ3Is MSPI's probability output stable and economically interpretable for monitoring and econometric use?
  • RQ4Do MSPI levels and innovations relate to subsequent realized market outcomes like volatility?

主要发现

ModelAUCPR-AUCBrierLogLoss
MSPI0.8000.5380.1060.352
Benchmark0.7520.444--
Gradient Boosting0.7560.481--
Random Forest0.7270.447--
  • MSPI achieves out-of-sample AUC of 0.800 and PR-AUC of 0.538, outperforming the benchmark (AUC 0.752, PR-AUC 0.444).
  • Nonlinear learners (gradient boosting, random forest) do not consistently dominate MSPI in discrimination or calibrated probability accuracy.
  • MSPI records the lowest Brier score (0.106) and log loss (0.352) among evaluated models, indicating strong probability accuracy.
  • MSPI probability path is smoother and more interpretable for monitoring than jagged nonlinear models.
  • MSPI’s mean predicted probability (~0.180) is close to realized stress rate (~0.159), reflecting good calibration.
  • MSPI signals rise ahead of major stress episodes (e.g., 2008-09 crisis, 2020 COVID) and cluster over time.
Figure 2: Out-of-sample stress probability - model horse race (calibrated). MSPI (black) is the L1-logit probability of next-month stress. Random forest (blue) and gradient boosting (orange) are estimated under the same expanding-window design and Platt-calibrated in real time within each training w
Figure 2: Out-of-sample stress probability - model horse race (calibrated). MSPI (black) is the L1-logit probability of next-month stress. Random forest (blue) and gradient boosting (orange) are estimated under the same expanding-window design and Platt-calibrated in real time within each training w

更好的研究,从现在开始

从论文设计到论文写作,大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成,并经人工编辑审核。