QUICK REVIEW

[論文レビュー] StormDiT: A generative AI model bridges the 2-6 hour 'gray zone' in precipitation nowcasting

Haofei Sun, Yunfan Yang|arXiv (Cornell University)|Jan 28, 2026

Tropical and Extratropical Cyclones Research被引用数 0

ひとこと要約

StormDiT は統一的な拡散-トランスフォーマーモデルであり、2–6時間の降水の現在予報を6分間隔で行い、分解ベースの手法を上回り、SEVIR のベースラインを新たに設定する。

ABSTRACT

Accurate short-term warnings for extreme precipitation are critical for global disaster mitigation but are hindered by a persistent predictability barrier at the 2-6 hour horizon -- the "nowcasting gray zone." In this window, traditional observation-based extrapolation fails due to error accumulation, while numerical weather prediction is computationally too slow to resolve storm-scale dynamics. Recent generative AI approaches attempt to bridge this gap by decomposing precipitation into separate deterministic advection and stochastic diffusion components. However, this decomposition can sever fundamental causal links between entangled atmospheric processes, such as the dynamic initiation of convection triggered by boundary advection. Here we present StormDiT, a unified generative model that treats weather evolution as a holistic spatiotemporal problem, learning the coupled physics of the gray zone without human-imposed structural priors. Trained on a massive dataset of 7,720 precipitation events from China, our model achieves a breakthrough in long-horizon stability. On a heavy-rainfall test set, it maintains skillful prediction for strong convection ($\ge$ 35 dBZ) with a Critical Success Index (CSI) near 0.2 across the full 6-hour forecast at 6-minute resolution. Crucially, the model exhibits superior probabilistic calibration, accurately quantifying operational risks. On the public SEVIR benchmark, our unified paradigm more than doubles the state-of-the-art 1-hour performance for heavy rain and establishes the first robust baseline for 3-hour forecasting. Furthermore, interpretability analysis reveals that the model attends to non-local physical precursors, such as outflow boundaries, explicitly validating its emergent understanding of convective organization.

研究の動機と目的

従来の外挿法と高速NWP が苦戦する2–6時間の現在予報のグレーゾーンでの予測を動機づける。
手作業での分解に頼らず、結合した運動学的・力学的大気過程を学習する統一的生成フレームワークを提案する。
大規模レーダ反射率データと公開ベンチマーク上で長期 horizon の安定性と高解像度性能を示す。
局所外流境界などの非局所的物理前駆体に対する注意機構の出現性を示す解釈性の証拠を提供する。

提案手法

ピクセル空間の冗長性を減らすために因果的VAEを介して圧縮された潜在的天気状態空間で運用する。
3D因果自己注意を備えたDiffusion Transformer を用いて時空発展をモデル化する。
ガウス先行分布から物理データ分布へ輸送するためにRectified Flow を採用し、一直線の最適輸送軌跡を可能にする。
連続時間ダイナミクスを強制するために時間埋め込み付きAdaptive Layer Normalization（adaLN）を取り入れる。
一般的な動画基盤モデル（Cosmos-Predict2.5）で事前学習し、大規模レーダ反射率データで後学習して堅牢なWorld Model を確立する。

実験結果

リサーチクエスチョン

RQ1StormDiT は実世界の状況で拡張された（2–6時間）予報の物理的一貫性と予報精度を維持できるか。
RQ2統一的 StormDiT フレームワークは SEVIR のような公開ベンチマークで最先端の分解ベース・決定論モデルと比べて一般化・性能をどう示すか。
RQ3統合的なエンタングルド・物理学アプローチは、従来の外挿法やNWPのスピンアップ制限と比較して高影響降水予報を改善するか。

主な発見

Model	CSI-M ↑	CSI-181 ↑	CSI-219 ↑	SSIM ↑	MSE ↓
ConvGRU	0.2903	0.0879	0.0350	0.6100	368.34
SimVP	0.3108	0.1106	0.0517	0.6508	383.56
Earthformer	0.2892	0.0844	0.0245	0.6633	360.11
PhyDNet	0.3017	0.1040	0.0278	0.6532	357.63
NowcastNet	0.2791	0.0770	0.0351	0.6839	412.94
DiffCast	0.3050	0.1300	0.0582	0.6482	559.59
AlphaPre	0.3259	0.1332	0.0545	0.6884	345.18
StormDiT (ours)	0.3142	0.1682	0.1301	0.7150	329.10

6時間全体の強対流（≥35 dBZ）に対する予報性能を維持し、6分間隔でCSIは約0.2に達する。
SEVIR では高強度イベントの1時間CSIベンチマークを超え、CSI-219 を0.054から0.130に倍増、3時間の堅牢なベースラインを設定。
Spread-Skill Ratio ≈ 0.96 の優れた確率的キャリブレーションを示し、予測分布がよくキャリブレーションされていることを示す。
分解ベースモデル（例：DiffCast）を1時間・3時間タスクで上回り、サイクロン列や台風減衰のような複雑なケースで高周波のシャープな反射率構造を保持する。
注意機構の解析は、非局所的な物理前駆体（外流境界、後方流入）が予報を導くことを示し、 memorization ではなく emergent 物理理解を支持する。
中国のレーダデータ（2,624件）で長期 horizon の性能が安定し、閾値（5–45 dBZ）および極端強度（≥45 dBZ）でCSIの崩壊が最小限に留まることを示す。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。