QUICK REVIEW

[論文レビュー] Self-Supervised Similarity Learning for Digital Pathology

Jacob Gildenblat, Eldad Klaiman|arXiv (Cornell University)|May 20, 2019

AI in cancer detection参考文献 17被引用数 33

ひとこと要約

本論文は、全スライド画像の空間的連続性を利用してデジタル病理のための128次元ディスクリプタを学習する自己監督型シアミーズネットワークを提案し、Camelyon16 で ImageNet ベースおよび他の自己監督法を上回る。

ABSTRACT

Using features extracted from networks pretrained on ImageNet is a common practice in applications of deep learning for digital pathology. However it presents the downside of missing domain specific image information. In digital pathology, supervised training data is expensive and difficult to collect. We propose a self-supervised method for feature extraction by similarity learning on whole slide images (WSI) that is simple to implement and allows creation of robust and compact image descriptors. We train a siamese network, exploiting image spatial continuity and assuming spatially adjacent tiles in the image are more similar to each other than distant tiles. Our network outputs feature vectors of length 128, which allows dramatically lower memory storage and faster processing than networks pretrained on ImageNet. We apply the method on digital pathology WSIs from the Camelyon16 train set and assess and compare our method by measuring image retrieval of tumor tiles and descriptor pair distance ratio for distant/near tiles in the Camelyon16 test set. We show that our method yields better retrieval task results than existing ImageNet based and generic self-supervised feature extraction methods. To the best of our knowledge, this is also the first published method for self-supervised learning tailored for digital pathology.

研究の動機と目的

データにラベルが乏しいデジタル病理領域におけるドメイン特異的特徴抽出の動機付け。
全スライド画像の内在的な空間連続性を活用して自動的にラベル付けされたペアを作成。
対比ロスを用いたシアミーズネットワークを訓練し、コンパクトな128次元ディスクリプタを生成。
Camelyon16 における baselines に対する画像検索とディスクリプタ分離性の改善を実証。

提案手法

組織タイルが2 mm 以内なら類似、10 mm を超える場合は非類似とラベル付けして画像ペアを作成。
対比ロスを用いた128-d出力のシアミーズネットワークを訓練: L_contrastive = (1-y)·||f1-f2||2 + y·max(0, m - ||f1-f2||2).
エンコーダとして両ブランチで共有される修正版のResNet-50バックボーンを使用。
評価前にLAB空間で染色正規化を用いてタイルを正規化。
ImageNet 事前学習済みのResNet-50特徴とNPID自己監督特徴と比較。
Camelyon16 で画像検索とグローバル ADDR 指標で評価。

実験結果

リサーチクエスチョン

RQ1自己監督的な類似性学習は手動ラベルなしでドメイン特有の組織病理特徴を捕捉できるか。
RQ2空間連続性を利用したディスクリプタベースの方法は、病理タスクにおいてImageNetベースの特徴や既存の自己監督法を上回るか。
RQ3学習済みディスクリプタは検索や距離ベースの評価において腫瘍組織と正常組織をどれくらいうまく分離できるか。
RQ4大規模WSI解析の実務的効率性とディマンション性はどの程度か。

主な発見

手法	グローバル ADDR
ResNet-50 pretrained on ImageNet	1.38
Non-Parametric Instance Discrimination	1.28
Ours	1.5

Global ADDR: ImageNet ResNet-50 = 1.38, NPID = 1.28, Ours = 1.50.
腫瘍タイル検索: ImageNet ResNet-50 = 26%, NPID = 21%, Ours = 34%.
我々のディスクリプタは基準よりUMAP可視化で腫瘍エンベディングがより凝集している。
我々の手法はスライド間の腫瘍タイル検索を改善し、ドメイン特異的特徴を捉えていることを示す。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。