QUICK REVIEW

[論文レビュー] SegRap2023: A Benchmark of Organs-at-Risk and Gross Tumor Volume Segmentation for Radiotherapy Planning of Nasopharyngeal Carcinoma

Xiangde Luo, Jia Fu|arXiv (Cornell University)|Dec 15, 2023

Head and Neck Cancer Studies被引用数 10

ひとこと要約

SegRap2023 は、NPC における 45 件の OARs と 2 つの GTV を対象とする自動分割の大規模公開ベンチマークと MICCAI 2023 チャレンジを提示し、ペアとなる ncCT/ceCT スキャンの大規模データセットを分析します。

ABSTRACT

Radiation therapy is a primary and effective NasoPharyngeal Carcinoma (NPC) treatment strategy. The precise delineation of Gross Tumor Volumes (GTVs) and Organs-At-Risk (OARs) is crucial in radiation treatment, directly impacting patient prognosis. Previously, the delineation of GTVs and OARs was performed by experienced radiation oncologists. Recently, deep learning has achieved promising results in many medical image segmentation tasks. However, for NPC OARs and GTVs segmentation, few public datasets are available for model development and evaluation. To alleviate this problem, the SegRap2023 challenge was organized in conjunction with MICCAI2023 and presented a large-scale benchmark for OAR and GTV segmentation with 400 Computed Tomography (CT) scans from 200 NPC patients, each with a pair of pre-aligned non-contrast and contrast-enhanced CT scans. The challenge's goal was to segment 45 OARs and 2 GTVs from the paired CT scans. In this paper, we detail the challenge and analyze the solutions of all participants. The average Dice similarity coefficient scores for all submissions ranged from 76.68\% to 86.70\%, and 70.42\% to 73.44\% for OARs and GTVs, respectively. We conclude that the segmentation of large-size OARs is well-addressed, and more efforts are needed for GTVs and small-size or thin-structure OARs. The benchmark will remain publicly available here: https://segrap2023.grand-challenge.org

研究の動機と目的

OAR と GTV セグメンテーションのための大規模で公開可能なデータセットを提供する。
2 つの臨床的に関連するタスク（OAR および GTV の delineation）で最先端のセグメンテーション手法を評価する。
セグメンテーション性能に対する多モダリティ入力の影響を評価する（ncCT 対 ceCT）。
公正でリーダーボード主導の比較フレームワークを提供し、NPC 自動 delineation ツールの進展を促す。
現在の手法の強みとギャップを浮き彫りにし、今後の研究と臨床導入を導く。

提案手法

ncCT および ceCT の事前整列済みスキャンと 45 の OARs および 2 つの GTV の ground truth マスクを 200 名の NPC 患者から収集する。
SegRap2023 を 2 つのタスクとして整理する。Task01 は 45 の OARs のセグメンテーション、Task02 は 2 つの GTV のセグメンテーションで、評価指標として DSC と NSD を用いる。
grand-challenge プラットフォーム上で Docker ベースの提出ワークフローとトレーニング/検証/テストの分割を提供する。
複数のチームからの提出物を収集・分析し、ランキングとパフォーマンスの要約を作成する。
Dice 相似係数（DSC）と正規化表面 Dice（NSD）を、それぞれ領域の重なりと表面距離の指標として採用する。

実験結果

リサーチクエスチョン

RQ1ncCT および ceCT データを用いた大規模な NPC の OAR および GTV delineation における現状のセグメンテーション手法の性能はどの程度か。
RQ245 の OAR と 2 つの GTV に対して DSC および NSD の観点から手法を比較するとどうなるか。
RQ3ceCT（造影 CT）と ncCT（非造影 CT）の使用がセグメンテーションの精度に与える影響はどうか。
RQ4現在の手法は大きな OAR や小さく薄い構造、GTV などでどこに苦戦しており、臨床転換にはどのギャップが残っているか。
RQ5包括的な注釈付き公開ベンチマークが、NPC 放射線治療計画の公正で再現性のある改善を促進できるか。

主な発見

提出物全体の OAR の平均 DSC は 76.68% から 86.70% の範囲だった。
提出物全体の GTV の平均 DSC は 70.42% から 73.44% の範囲だった。
大規模サイズの OAR のセグメンテーションは現手法でよく対応されている一方、GTV および小さく薄い構造の OAR は依然として課題である。
NSD の結果はチーム間で大きなばらつきを示し、臓器ごとの表面 delineation の品質の違いを反映している。
SegRap2023 データセット（ncCT と ceCT）とチャレンジ設定は、公正な複数チーム間の比較を可能にし、今後の臨床的課題につながる実践的なギャップを強調している。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。