QUICK REVIEW

[論文レビュー] LLM as a Risk Manager: LLM Semantic Filtering for Lead-Lag Trading in Prediction Markets

Sumin Kim, Minjae Kim|arXiv (Cornell University)|Feb 4, 2026

Stock Market Forecasting Methods被引用数 0

ひとこと要約

2段階のフレームワークがGranger因果推論とLLMベースの意味フィルタを組み合わせ、予測市場における先行–遅行関係をランキング。Kalshiデータに対する取引パフォーマンスを向上させ、下振れリスクを低減。

ABSTRACT

Prediction markets provide a unique setting where event-level time series are directly tied to natural-language descriptions, yet discovering robust lead-lag relationships remains challenging due to spurious statistical correlations. We propose a hybrid two-stage causal screener to address this challenge: (i) a statistical stage that uses Granger causality to identify candidate leader-follower pairs from market-implied probability time series, and (ii) an LLM-based semantic stage that re-ranks these candidates by assessing whether the proposed direction admits a plausible economic transmission mechanism based on event descriptions. Because causal ground truth is unobserved, we evaluate the ranked pairs using a fixed, signal-triggered trading protocol that maps relationship quality into realized profit and loss (PnL). On Kalshi Economics markets, our hybrid approach consistently outperforms the statistical baseline. Across rolling evaluations, the win rate increases from 51.4% to 54.5%. Crucially, the average magnitude of losing trades decreases substantially from 649 USD to 347 USD. This reduction is driven by the LLM's ability to filter out statistically fragile links that are prone to large losses, rather than relying on rare gains. These improvements remain stable across different trading configurations, indicating that the gains are not driven by specific parameter choices. Overall, the results suggest that LLMs function as semantic risk managers on top of statistical discovery, prioritizing lead-lag relationships that generalize under changing market conditions.

研究の動機と目的

イベントレベルの予測市場における堅牢な先行–遅行関係を識別する課題の動機づけ。
統計的発見と意味的検証を組み合わせた2段階フレームワークの提案。
LLMベースの再ランキングがGrangerのみのスクリーニングより取引パフォーマンスを改善するか評価。

提案手法

Stage 1 は対数オッズ変換市場価格に対するGranger因果を用いて、複数のラグ（p∈{1,2,3,4,5}）で候補のリーダー–フォロワー対を識別。
Stage 2 はイベントタイトル/説明を用いて、妥当な経済伝達機序を評価し、妥当性スコアを付与するLLM（GPT-5-nano）でこれら候補を再ランク付け。
取引プロトコル：各リーダー–フォロワー対について、リーダー価格変化が閾値を超えたときにフォロワー取引を開始、h日間保有し、アウト・オブ・サンプルPnLを測定。
最終ポートフォリオはStat.から上位M組の有向対（M=20）を選択。比較のためのハイブリッド手法。
評価はKalshi Economics市場で60日間の訓練と30日間のテストを用いたローリングウィンドウ。

Figure 1: Two-stage framework for leader–follower pair discovery in prediction markets. Stage 1 produces a candidate set of Top K directed pairs (K=100) ranked by Granger significance, and Stage 2 applies LLM-based semantic re-ranking to select the final Top M portfolio (M=20).

実験結果

リサーチクエスチョン

RQ1LLMsはGranger因果推定で識別される統計的相関と機械的に妥当な先行–後行関係を識別できるか？
RQ2意味的再ランキングはGrangerベースのスクリーニングを超えて取引パフォーマンスを改善し、下振れリスクを低減できるか？
RQ3保持期間や市場状況の異なる場合でも利益は頑健か？

主な発見

LLMベースの意味フィルタリングは、統計的ベースラインに対して総PnLの大幅な gains をもたらす（+205%）。
下振れリスクが低減し、平均損失が$649から$347へ減少（46.5%削減）。
デフォルト設定下で勝率が51.4%から54.5%へ上昇。
同一イベント対と異なるイベント対の両方で損失削減が継続（40%超の改善）。
意味的フィルタリングはリーダーの動きが大きい場合（5–10ポイントおよび10ポイント以上の動き）により大きな利益を提供。
LLM選択の対はGranger Top-M外の経済的に意味のあるリンクを含むことがあり、統計的ランキングを超える定性的な利得を示す。

Figure 2: Signal-triggered trading protocol used to evaluate ranked lead-lag relationships from Figure 1 : leader price moves trigger follower trades, with direction determined by the Granger-induced trade sign and out-of-sample PnL used to evaluate the ranked pair list.

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。