QUICK REVIEW

[論文レビュー] Inferring Height from Earth Embeddings: First insights using Google AlphaEarth

Alireza Hamoudzadeh, Valeria Belloni|arXiv (Cornell University)|Feb 19, 2026

Remote Sensing and LiDAR Applications被引用数 0

ひとこと要約

この論文は AlphaEarth Embeddings が軽量 DL モデル（U-Net および U-Net++）を指示して 10 m AlphaEarth データから地域表面高度を推定できるかを評価し、リッジベースラインと比較し、新しい地域への一般化を評価します。

ABSTRACT

This study investigates whether the geospatial and multimodal features encoded in extit{Earth Embeddings} can effectively guide deep learning (DL) regression models for regional surface height mapping. In particular, we focused on AlphaEarth Embeddings at 10 m spatial resolution and evaluated their capability to support terrain height inference using a high-quality Digital Surface Model (DSM) as reference. U-Net and U-Net++ architectures were thus employed as lightweight convolutional decoders to assess how well the geospatial information distilled in the embeddings can be translated into accurate surface height estimates. Both architectures achieved strong training performance (both with $R^2 = 0.97$), confirming that the embeddings encode informative and decodable height-related signals. On the test set, performance decreased due to distribution shifts in height frequency between training and testing areas. Nevertheless, U-Net++ shows better generalization ($R^2 = 0.84$, median difference = -2.62 m) compared with the standard U-Net ($R^2 = 0.78$, median difference = -7.22 m), suggesting enhanced robustness to distribution mismatch. While the testing RMSE (approximately 16 m for U-Net++) and residual bias highlight remaining challenges in generalization, strong correlations indicate that the embeddings capture transferable topographic patterns. Overall, the results demonstrate the promising potential of AlphaEarth Embeddings to guide DL-based height mapping workflows, particularly when combined with spatially aware convolutional architectures, while emphasizing the need to address bias for improved regional transferability.

研究の動機と目的

AlphaEarth Embeddings が地域表面高度推定のデコード可能な高さ関連信号をエンコードしているかを評価する。
埋め込みを高品質な DSM 派生高度へマッピングする軽量 CNN デコーダ（U-Net および U-Net++）を評価する。
DL アプローチと同一の入力と評価指標を用いたリッジベースラインと比較する。
埋め込み guided 高さマッピングの一般化と転用性を大規模な研究領域に跨って調査する。

提案手法

10 m 解像度の AlphaEarth Embeddings を入力特徴量として使用する。
ResNet-18 エンコーダを用いて 512×512 パッチで二つの軽量デコーダ（U-Net と U-Net++）を訓練する。
埋め込み入力を uint8 から [0,1] に正規化し、10 m にリサンプリングされた 5×5 m DSM ターゲットと整合させる。
AdamW、MSE 損失、ReduceLROnPlateau、早期停止で最適化；80/20 の訓練/検証分割と別のテストエリア。
同一の入力と評価指標を用いて Ridge 回帰ベースラインと比較する。

実験結果

リサーチクエスチョン

RQ1AlphaEarth Embeddings が DSM 参照から地域地形高度を予測する DL 回帰を効果的に導けるか。
RQ2軽量で空間的に意識したデコーダは高度関連の空間パターンを保持し、新しい地域へ一般化できるか。
RQ3埋め込み guided 高さ推定は従来の線形ベースラインと比較してどうか。
RQ4異なる高度分布を持つ地域間で埋め込みベースの高度モデルを転用する際の強みと限界は何か。

主な発見

U-Net および U-Net++ は訓練時に高い性能を示し（R2 ≈ 0.97）、埋め込みがデコード可能な高度信号をエンコードしていることを示唆する。
テストデータでは U-Net++ が U-Net より一般化性が高く（R2 ≈ 0.84 対 0.78）、分布シフトに対する頑健性が向上。
U-Net++ はテストセットで RMSE が約 16 m と低く、参照 DSM に対する高度のサポートが Ridge ベースラインよりも近い。
DL モデルは訓練データとテストデータの両方で Ridge 回帰を上回り、埋め込み主導の高度推定の価値を示す。
高度推定はバイアスと残差分布の差異により依然課題となっており、地域間での転送性には限界がある。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。