QUICK REVIEW

[論文レビュー] U-Net with Hadamard Transform and DCT Latent Spaces for Next-day Wildfire Spread Prediction

Yingyi Luo, Shuaiang Rong|arXiv (Cornell University)|Feb 12, 2026

Fire effects on ecosystems被引用数 0

ひとこと要約

要約: この論文は Transform Domain Fusion UNet (TD-FusionUNet) を提案する。訓練可能な Hadamard および DCT 層を用いた軽量な変換ドメイン UNet で、マルチモーダル衛星データを用いた翌日 wildfire の拡大予測を行い、パラメータ数を大幅に削減しつつ最先端の精度を達成する。

ABSTRACT

We developed a lightweight and computationally efficient tool for next-day wildfire spread prediction using multimodal satellite data as input. The deep learning model, which we call Transform Domain Fusion UNet (TD-FusionUNet), incorporates trainable Hadamard Transform and Discrete Cosine Transform layers that apply two-dimensional transforms, enabling the network to capture essential "frequency" components in orthogonalized latent spaces. Additionally, we introduce custom preprocessing techniques, including random margin cropping and a Gaussian mixture model, to enrich the representation of the sparse pre-fire masks and enhance the model's generalization capability. The TD-FusionUNet is evaluated on two datasets which are the Next-Day Wildfire Spread dataset released by Google Research in 2023, and WildfireSpreadTS dataset. Our proposed TD-FusionUNet achieves an F1 score of 0.591 with 370k parameters, outperforming the UNet baseline using ResNet18 as the encoder reported in the WildfireSpreadTS dataset while using substantially fewer parameters. These results show that the proposed latent space fusion model balances accuracy and efficiency under a lightweight setting, making it suitable for real time wildfire prediction applications in resource limited environments.

研究の動機と目的

資源配分と緊急対応を最適化するための翌日 wildfire 拡大の正確な予測を動機づける。
周波数ドメイン特徴を捉える軽量な変換ドメインニューラルネットワークを開発する。
プレマスクの sparsity と不均衡に対応するためのカスタム前処理を用いたトレーニングデータを充実させる。
低パラメータ数と計算効率の高さからエッジデバイスへのデプロイ可能性を示す。

提案手法

2D Hadamard 変換、訓練可能なチャネルごとの差動、ソフト閾値処理を適用した HT-パーセプトロンブロックを導入し、逆Hadamard再構成を行う。
Hadamard ドメイン特徴と DCT ドメイン特徴をフュージョン層で統合するデュアルブランチ構造（TD-FusionUNet）を拡張する。
並列ブランチに 2D DCT-パーセプトロンブロックを組み込み、デコード時に HT と DCT 特徴を統合する。
スパース性を緩和し一般化を改善するための二つのカスタム前処理技術—ランダムマージン切り抜きとガウシアン混合平滑化—を適用する。
Google Research Next-Day Wildfire Spread および WildfireSpreadTS データセットで評価し、UNet ベースラインおよび軽量版と比較する。

実験結果

リサーチクエスチョン

RQ1Hadamard および DCT 層を用いた変換ドメイン学習は、軽量モデルを維持しつつ翌日 wildfire 拡大予測を改善できるか？
RQ2デュアルブランチの transform-domain fusion（HT + DCT）は、単一路線の transform UNet や従来の UNet より sparsity 条件下で優れているか？
RQ3カスタム前処理は sparsely での pre-fire マスクの一般化を高めるか？
RQ4TD-FusionUNet は競争力の精度を保ちつつパラメータを大幅に減らしてエッジ展開に適しているか？

主な発見

Model	Parameters	Preprocessing/Settings	Prediction	Precision	Recall	IoU	F1
CNN Autoencoder	78k	Standard	Next day	0.346	0.373	0.153	0.359
CNN Autoencoder	78k	Customized	Both days	0.588	0.658	0.382	0.621
HT-UNet	45k	Standard	Next day	0.571	0.453	0.188	0.505
HT-UNet	45k	Customized	Both days	0.737	0.606	0.433	0.665
TD-FusionUNet (base=8)	93k	Customized	Both days	0.876	0.586	0.555	0.702
Baseline UNet (ResNet18)	14M	Next day	0.536	0.654	0.589
HT-UNet	169k	Next day	0.510	0.646	0.570
TD-FusionUNet (base=4)	159k	Next day	0.580	0.595	0.588
TD-FusionUNet (base=8)	370k	Next day	0.539	0.653	0.591

TD-FusionUNet（基礎チャネル幅 8、最適）は WildfireSpreadTS で F1 = 0.591 を達成し、370k パラメータで ResNet18 ベースの UNet ベースラインを上回る。
単一路HT-UNetは CNN 自動エンコーダー基準よりパラメータを 50% 超削減しつつ、標準前処理およびカスタム前処理の両方で IoU と F1 を改善。
HT 系統および DCT 系統の変換ブランチは相補的な周波数表現を提供し、結合時に単一路モデルより優れた予測性能を示す。
Google データセットでは HT-UNet および TD-FusionUNet がカスタム前処理で IoU と F1 を大きく改善し、スパースな火災マスクに対する変換ドメイン学習の利点を確認。
デュアルブランチ TD-FusionUNet はパラメータ増加を抑えつつスケーラブルな性能向上を示す。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。