QUICK REVIEW

[論文レビュー] So2Sat LCZ42: A Benchmark Dataset for Global Local Climate Zones Classification

Xiao Xiang Zhu, Jingliang Hu|arXiv (Cornell University)|Dec 19, 2019

Land Use and Ecosystem Services参考文献 42被引用数 53

ひとこと要約

この論文は So2Sat LCZ42 を提示する、約40万の Sentinel-1/2 の画像パッチをラベル付けしたグローバルに分布するオープンベンチマークデータセットで、17 の Local Climate Zone (LCZ) クラスと厳密なラベリングワークフローおよびベースライン分類結果を提供する。

ABSTRACT

Access to labeled reference data is one of the grand challenges in supervised machine learning endeavors. This is especially true for an automated analysis of remote sensing images on a global scale, which enables us to address global challenges such as urbanization and climate change using state-of-the-art machine learning techniques. To meet these pressing needs, especially in urban research, we provide open access to a valuable benchmark dataset named "So2Sat LCZ42," which consists of local climate zone (LCZ) labels of about half a million Sentinel-1 and Sentinel-2 image patches in 42 urban agglomerations (plus 10 additional smaller areas) across the globe. This dataset was labeled by 15 domain experts following a carefully designed labeling work flow and evaluation process over a period of six months. As rarely done in other labeled remote sensing dataset, we conducted rigorous quality assessment by domain experts. The dataset achieved an overall confidence of 85%. We believe this LCZ dataset is a first step towards an unbiased globallydistributed dataset for urban growth monitoring using machine learning methods, because LCZ provide a rather objective measure other than many other semantic land use and land cover classifications. It provides measures of the morphology, compactness, and height of urban areas, which are less dependent on human and culture. This dataset can be accessed from http://doi.org/10.14459/2018mp1483140.

研究の動機と目的

移動可能な都市 LCZ 分類モデルを実現するための、グローバルに分布した高品質な LCZ ラベリングデータセットを提供する。
信頼性の高いラベルを達成するため、厳密なラベリングワークフローと品質評価を確立する。
機械学習実験のためのコアレジストされた SAR/光学画像パッチ（Sentinel-1/2）のオープンアクセスを提供する。

提案手法

42 の大規模都市圏を厳選し、大陸横断的に LCZ ポリゴンを手作業でラベル付けする。
LCZ ラベルを Sentinel-1 SAR および Sentinel-2 マルチスペクトルパッチ（320m x 320m パッチ）へコアレジストする。
学習、ラベリング、視覚的検証、定量的検証の4段階のラベリングワークフローを実装する。
専門家の投票を用いてポリゴンおよびピクセルレベルのラベル検証を行い、ラベル信頼度を推定する（全体で 85% と報告）。
ML用途のラスタ化前に、後処理（ポリゴン縮小とクラスバランシング）によりクラスサンプルをバランスさせる。
Sentinel-2特徴量上でのベースライン分類器（RF、SVM、ResNeXt-CBAM）を提供する。

実験結果

リサーチクエスチョン

RQ1グローバルに分布する専門家ラベル付き LCZ データセットを作成して、一般化可能な LCZ 分類器を訓練できるか？
RQ2複数の専門家による検証時の LCZ 注釈のラベル品質と不確実性はどの程度か？
RQ3一般的な ML 手法を用いて LCZ42 データセットでどの程度のベースライン分類性能を達成できるか？

主な発見

分類器	OA	WA	AA	カッパ係数
RF	0.51	0.87	0.31	0.46
SVM	0.54	0.88	0.36	0.49
ResNeXt-CBAM	0.61	0.92	0.51	0.58

LCZ42 データセットは 400,673 件の Sentinel-1/2 パッチ対と LCZ ラベルを 52 の都市にわたり含む。
多数決による改良後の人間ラベリング信頼度は約 85%。
Sentinel-2特徴量上のベースライン結果は RF OA=0.51、SVM OA=0.54、ResNeXt-CBAM OA=0.61。
加重精度 WA の値は 0.87 (RF)、0.88 (SVM)、0.92 (ResNeXt-CBAM)。
平均精度 AA の値は 0.31 (RF)、0.36 (SVM)、0.51 (ResNeXt-CBAM)。
カッパ係数は 0.46 (RF)、0.49 (SVM)、0.58 (ResNeXt-CBAM)。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。