QUICK REVIEW

[論文レビュー] Prompt-Free SAM-Based Multi-Task Framework for Breast Ultrasound Lesion Segmentation and Classification

Samuel Ebimobowei Johnny, Bernes L. Atabonfack|arXiv (Cornell University)|Jan 9, 2026

AI in cancer detection被引用数 0

ひとこと要約

本論文は、SAMビジョンエンコーダの特徴を用いて、誠実な提示なしで乳腺超音波病変を同時にセグメンテーションし、正常・良性・悪性の分類を行うマルチタスクフレームワークを提案し、マスク誘導注意機構が分類を改善します。

ABSTRACT

Accurate tumor segmentation and classification in breast ultrasound (BUS) imaging remain challenging due to low contrast, speckle noise, and diverse lesion morphology. This study presents a multi-task deep learning framework that jointly performs lesion segmentation and diagnostic classification using embeddings from the Segment Anything Model (SAM) vision encoder. Unlike prompt-based SAM variants, our approach employs a prompt-free, fully supervised adaptation where high-dimensional SAM features are decoded through either a lightweight convolutional head or a UNet-inspired decoder for pixel-wise segmentation. The classification branch is enhanced via mask-guided attention, allowing the model to focus on lesion-relevant features while suppressing background artifacts. Experiments on the PRECISE 2025 breast ultrasound dataset, split per class into 80 percent training and 20 percent testing, show that the proposed method achieves a Dice Similarity Coefficient (DSC) of 0.887 and an accuracy of 92.3 percent, ranking among the top entries on the PRECISE challenge leaderboard. These results demonstrate that SAM-based representations, when coupled with segmentation-guided learning, significantly improve both lesion delineation and diagnostic prediction in breast ultrasound imaging.

研究の動機と目的

共同セグメンテーションと分類を通じた強健な乳腺超音波解析の動機付け。
PROMPTなしのSAM埋め込みを用いたエンドツーエンド推論の活用。
病変領域に焦点を当てるセグメンテーション誘導注意による分類の向上。
PRECISE 2025データセットでの高いセグメンテーションと診断指標の実証。

提案手法

1024x1024のBUS画像から抽出されたSAMビジョンエンコーダ（ViT-B/16）特徴を共有埋め込みとして使用。
セグメンテーションのデコーダとして、軽量な畳み込みヘッドまたはU-Net風デコーダの2つのオプションを提供。
分類器をセグメンテーション予測で条件付けするマスク誘導注意を導入。
セグメンテーション（BCE+Dice）と分類（クロスエントロピー）損失を組み合わせた重み付きマルチタスク損失で訓練。
マルチタスク学習を安定化させるための3段階カリキュラム訓練を採用。

実験結果

リサーチクエスチョン

RQ1PROMPT不要のSAM埋め込みは、乳腺超音波病変の正確な共同セグメンテーションと分類をサポートできるか。
RQ2マスク誘導注意は病変領域に焦点を当てることで診断精度を向上させるか。
RQ3SAM特徴を用いた場合、軽量デコーダで高品質なセグメンテーションは十分か。
RQ4エンドツーエンドのマルチタスク訓練はセグメンテーションと分類の性能にどのような影響を与えるか。

主な発見

Metric	Baseline (Aumente-Maestro et al. 2025b)	BI-RADS (Bobowicz et al. 2024)	EDCNN (Islam et al. 2024)	Ours
Classification Accuracy	71.06	89.23	80.45	90.7
Classification F1-Score	0.693	0.893	0.7646	0.887
Classification AUC	0.761	0.964	0.9322	0.981
Segmentation DSC	0.153	0.806	0.583	0.887
Segmentation HD95 (mm)	83.11	30.63	45.48	60.838
Segmentation NSD	0.007	0.433	0.313	0.503

PRECISE 2025データセットでセグメンテーションのDice相似係数0.887、分類の正解率92.3%を達成。
分類AUCは0.981に達し、ベースラインを上回る。
報告テーブルにおける分類（精度90.7%、AUC0.981）とセグメンテーション（DSC 0.887）で最先端手法を上回る。
アブレーションにより、マスク誘導注意が分類を向上させる一方、より単純なデコーダでもSAM特徴のおかげで強力なセグメンテーションが得られることを示す。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。