QUICK REVIEW

[论文解读] Algorithmic Monitoring: Measuring Market Stress with Machine Learning

Marc Schmitt|arXiv (Cornell University)|Feb 5, 2026

Financial Markets and Investment Strategies被引用 0

一句话总结

该论文构建了 MSPI，一种仅限股票市场的实时、概率性压力度量，面向美国股市一个月前瞄准，使用横截面脆弱性信号与 lasso-logit，并将其与非线性学习模型进行比较。

ABSTRACT

I construct a Market Stress Probability Index (MSPI) that estimates the probability of high stress in the U.S. equity market one month ahead using information from the cross-section of individual stocks. Using CRSP daily data, each month is summarized by a set of interpretable cross-sectional fragility signals and mapped into a forward-looking stress probability via an L1-regularized logistic regression in a real-time expanding-window design. Out of sample, MSPI tracks major stress episodes and improves discrimination and accuracy relative to a parsimonious benchmark based on lagged market return and realized volatility, delivering calibrated stress probabilities on an economically meaningful scale. Further, I illustrate how MSPI can be used as a probability-based measurement object in financial econometrics. The resulting index provides a transparent and easily updated measure of near-term equity-market stress risk.

研究动机与目标

Motivate the need for algorithmic, real-time market-stress monitoring in electronic markets.
Construct an equity-only, transparent Market Stress Probability Index (MSPI) that outputs calibrated one-month-ahead stress probabilities.
Compare a sparse lasso-logit MSPI to nonlinear learners (random forests, gradient boosting) under a real-time expanding-window protocol.
Demonstrate MSPI’s calibration, discrimination, and relation to realized market outcomes.
Highlight MSPI as a usable measurement object for financial econometrics.

提出的方法

Define cross-sectional fragility signals from CRSP daily stock data (dispersion, tails, skewness, kurtosis, trading intensity).
Aggregate signals monthly to form X_t as predictors for stress.
Define stress monthly indicator S_t based on a return cutoff and a volatility quantile q_{t-1}(α).
Model MSPI_t as a one-month-ahead stress probability using lasso-logit: MSPI_t = Λ(β_0 + X_t'β), with L1 regularization.
Employ an expanding-window real-time learning protocol with initial 120-month training and fixed λ chosen via time-series cross-validation.
Benchmark against a parsimonious benchmark using lagged market return and realized volatility with L2 regularization.
Assess nonlinear models (random forests, gradient boosting) with Platt-calibration within each training window for probability outputs.

Figure 1: MSPI out-of-sample. The solid line plots the predicted probability that the next month is a stress month. Dots mark months for which stress is realized in the subsequent month under the definition in equation ( 8 ).

实验结果

研究问题

RQ1Can an equity-only, real-time cross-sectional fragility framework produce a calibrated one-month-ahead probability of market stress?
RQ2How does a sparse lasso-logit MSPI compare to nonlinear learners in discrimination and probability accuracy under a real-time expanding-window design?
RQ3Is MSPI's probability output stable and economically interpretable for monitoring and econometric use?
RQ4Do MSPI levels and innovations relate to subsequent realized market outcomes like volatility?

主要发现

Model	AUC	PR-AUC	Brier	LogLoss
MSPI	0.800	0.538	0.106	0.352
Benchmark	0.752	0.444	-	-
Gradient Boosting	0.756	0.481	-	-
Random Forest	0.727	0.447	-	-

MSPI achieves out-of-sample AUC of 0.800 and PR-AUC of 0.538, outperforming the benchmark (AUC 0.752, PR-AUC 0.444).
Nonlinear learners (gradient boosting, random forest) do not consistently dominate MSPI in discrimination or calibrated probability accuracy.
MSPI records the lowest Brier score (0.106) and log loss (0.352) among evaluated models, indicating strong probability accuracy.
MSPI probability path is smoother and more interpretable for monitoring than jagged nonlinear models.
MSPI’s mean predicted probability (~0.180) is close to realized stress rate (~0.159), reflecting good calibration.
MSPI signals rise ahead of major stress episodes (e.g., 2008-09 crisis, 2020 COVID) and cluster over time.

Figure 2: Out-of-sample stress probability - model horse race (calibrated). MSPI (black) is the L1-logit probability of next-month stress. Random forest (blue) and gradient boosting (orange) are estimated under the same expanding-window design and Platt-calibrated in real time within each training w

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。