QUICK REVIEW

[論文レビュー] Classification of Multiple Diseases on Body CT Scans Using Weakly Supervised Deep Learning- Model weights

Fakrul Islam Tushar, Vincent M. D’Anniballe|arXiv (Cornell University)|Aug 3, 2020

Radiomics and Machine Learning in Medical Imaging参考文献 33被引用数 19

ひとこと要約

本研究では、レントゲン報告書から自動抽出されたラベルを用いて、身体CTスキャンにおける複数疾患分類のための弱教師あり深層学習モデルを開発した。13,000件を超えるCT検査にルールベースのアルゴリズムを適用した結果、肺・胸膜、肝臓・胆嚢、腎臓・尿管の3つの臓器系にまたがる15の疾患ラベルにおいて、AUCが0.65から0.97の範囲に達し、手動ラベル付けに依存せずにスケーラブルで正確な複数疾患分類が実現された。

ABSTRACT

<h3>Model Documentation: Multidisease Classification Models for Body CT Scans</h3> This document provides an overview and usage guidance for three deep learning models developed to perform multidisease classification on body CT scans. The models are based on 3D convolutional neural networks implemented in Python using TensorFlow, and they were trained using weak supervision derived from radiology report text. <h4>Background and Purpose</h4> These models were developed as part of a retrospective study aiming to detect multiple common disease conditions across three major organ systems—lungs and pleura, liver and gallbladder, and kidneys and ureters—using body CT scans. Labels for training were extracted using rule-based natural language processing (NLP) from radiology reports, enabling efficient training without extensive manual annotation. The work demonstrates how weak supervision can support the development of clinically relevant, multi-organ disease classifiers on a large scale. <h4>Model Summary</h4> Each model targets a specific organ system and predicts the presence or absence of five disease categories (four pathologies + one "no apparent disease" class): <ol> <li> Lungs and Pleura:  <ul> <li> Labels: Atelectasis, Nodule, Emphysema, Effusion, No Apparent Disease </li> <li> Performance (AUCs): <ul> <li> Atelectasis: 0.77 </li> <li> Nodule: 0.65 </li> <li> Emphysema: 0.89 </li> <li> Effusion: 0.97 </li> <li> No Apparent Disease: 0.89 </li> </ul> </li> </ul> </li> <li> Liver and Gallbladder <ul> <li> Labels: Hepatobiliary Calcification, Lesion, Dilation, Fatty Liver, No Apparent Disease </li> <li> Performance (AUCs): <ul> <li> Calcification: 0.62 </li> <li> Lesion: 0.73 </li> <li> Dilation: 0.87 </li> <li> Fatty: 0.89 </li> <li> No Apparent Disease: 0.82 </li> </ul> </li> </ul> </li> <li> Kidneys and Ureters <ul> <li> Labels: Stone, Atrophy, Lesion, Cyst, No Apparent Disease </li> <li> Performance (AUCs): <ul> <li> Stone: 0.83 </li> <li> Atrophy: 0.92 </li> <li> Lesion: 0.68 </li> <li> Cyst: 0.70 </li> <li> No Apparent Disease: 0.79 </li> </ul> </li> </ul> </li> </ol> The models were trained on CT data from over 13,000 scans and evaluated on a subset of 2,158 volumes with 2,875 manually validated reference labels. Automated label extraction achieved between 91%–99% accuracy during internal validation. <h4>Implementation Details</h4> <ul> <li> Programming Language: Python </li> <li> Framework: TensorFlow </li> <li> Model Type: 3D Convolutional Neural Network (CNN) </li> <li> Preprocessing: Organ segmentation (via DenseVNet), intensity normalization, and cropping of CT volumes to organ-specific regions of interest. </li> </ul> <h4>Repository Links</h4> The source code, model weights, and usage instructions will be made publicly available through: <ul> <li> GitHub Repository: https://github.com/fitushar/multi-label-weakly-supervised-classification-of-body-ct </li> <li> GitLab Repository: https://gitlab.oit.duke.edu/railabs/LoGroup/multi-label-weakly-supervised-classification-of-body-ct </li> </ul> These repositories include: <ul> <li> Model loading and inference scripts </li> <li> Preprocessing pipeline details </li> <li> Instructions for applying the model to new CT data </li> <li> Evaluation tools and AUC reporting scripts </li> </ul> <h4>License and Citation</h4> These models are released for academic research purposes only. If you use them in your work, please cite the original study. Citation details will be provided in the repository README.

研究の動機と目的

手動ラベル付けのボトルネックを克服するため、身体CT用の複数疾患分類器を開発すること。
高価な手動ラベル付けに依存しないよう、既存のレントゲン報告書を活用した弱教師あり学習を実現すること。
肺、肝臓、腎臓などの複数の臓器系にまたがる、多様な疾患タイプを対象としたマルチラベル分類を可能にすること。
ルールベースのラベル抽出の正確性と、現実的で多様なCTデータにおけるモデル性能を検証すること。
臨床画像におけるスケーラブルで自動化された深層学習パイプラインの実現可能性を示すこと。

提案手法

13,667件のレントゲン報告書の所見セクションから、キーワードマッチングと否定論理を用いたルールベースのアルゴリズムで疾患ラベルを抽出した。
ラベルは手動で検証され、15の疾患カテゴリーにおいて91～99％の正確性を達成した。
肺・胸膜、肝臓・胆嚢、腎臓・尿管の各システムに対して、独立して3つの3D DenseVNetモデルを訓練した。
各モデルは、1つの臓器系あたり5つの疾患と「明らかな疾患なし」の合計6つのラベルを分類した。
計算コストを削減しながら診断的詳細を保持するため、2 mm × 2 mm × 2 mmの等方的ボクセルを用いた。
性能評価には、DeLong法を用いた95％信頼区間を伴う受信器操作特性（ROC）AUCが用いられた。

実験結果

リサーチクエスチョン

RQ1レントゲン報告書からのルールベースのラベル抽出は、身体CTにおける多様な腹部および胸部疾患に対して高い正確性を達成できるか？
RQ2弱教師あり3次元畳み込みニューラルネットワーク（3D CNN）は、最小限の手動ラベル付けデータで複数の臓器系に一般化できるか？
RQ3身体CTにおける局所的疾患と広範性疾患の両方において、モデル性能はどのように変動するか？
RQ4局所化や手動セグメンテーションが不要なテキストベースのラベルのみを用いて、多疾患分類を効果的に実現できるか？
RQ5画像解像度およびスキャンプロトコルの違いが、異なる疾患タイプにおけるモデル性能に与える影響は何か？

主な発見

手動検証により、ルールベースのラベル抽出が91～99％の正確性を達成した。これは弱教師あり学習の信頼性を裏付けた。
肺および胸膜モデルは、アテレアシス（AUC 0.77）、結節（0.65）、肺気腫（0.89）、胸水（0.97）、および「疾患なし」（0.89）の各疾患でAUCを達成した。
肝臓および胆嚢モデルは、石灰化（0.62）、病変（0.73）、拡張（0.87）、脂肪変性（0.89）、および「疾患なし」（0.82）の各疾患でAUCを達成した。
腎臓および尿管モデルは、石灰腫（0.83）、萎縮（0.92）、病変（0.68）、嚢胞（0.70）、および「疾患なし」（0.79）の各疾患でAUCを達成した。
広範性疾患（例：肺気腫、脂肪肝）の性能は局所的疾患（例：結節、病変）よりも高かったが、腎臓結石は特殊なプロトコルのおかげで高いAUC（0.83）を達成した。
本アプローチにより、手動ラベル付けに依存しない、スケーラブルで自動化された3つの臓器系にまたがる多疾患分類が実現された。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。