QUICK REVIEW

[論文レビュー] Advanced U-Net Architectures with CNN Backbones for Automated Lung Cancer Detection and Segmentation in Chest CT Images

Alireza Golkarieh, Kiana Kiashemshaki|ArXiv.org|Jul 14, 2025

Radiomics and Machine Learning in Medical Imaging被引用数 3

ひとこと要約

要旨：本論文は胸部 CT 画像上で U-Net セグメンテーションを VGG16、ResNet50、Xception バックボーンで評価し、CNN およびハイブリッド CNN+ML 分類器を用いた肺がん検出を分析して、U-Net セグメンテーションを用いた場合最大で 99.1% の分類精度を達成する。

ABSTRACT

This study investigates the effectiveness of U-Net architectures integrated with various convolutional neural network (CNN) backbones for automated lung cancer detection and segmentation in chest CT images, addressing the critical need for accurate diagnostic tools in clinical settings. A balanced dataset of 832 chest CT images (416 cancerous and 416 non-cancerous) was preprocessed using Contrast Limited Adaptive Histogram Equalization (CLAHE) and resized to 128x128 pixels. U-Net models were developed with three CNN backbones: ResNet50, VGG16, and Xception, to segment lung regions. After segmentation, CNN-based classifiers and hybrid models combining CNN feature extraction with traditional machine learning classifiers (Support Vector Machine, Random Forest, and Gradient Boosting) were evaluated using 5-fold cross-validation. Metrics included accuracy, precision, recall, F1-score, Dice coefficient, and ROC-AUC. U-Net with ResNet50 achieved the best performance for cancerous lungs (Dice: 0.9495, Accuracy: 0.9735), while U-Net with VGG16 performed best for non-cancerous segmentation (Dice: 0.9532, Accuracy: 0.9513). For classification, the CNN model using U-Net with Xception achieved 99.1 percent accuracy, 99.74 percent recall, and 99.42 percent F1-score. The hybrid CNN-SVM-Xception model achieved 96.7 percent accuracy and 97.88 percent F1-score. Compared to prior methods, our framework consistently outperformed existing models. In conclusion, combining U-Net with advanced CNN backbones provides a powerful method for both segmentation and classification of lung cancer in CT scans, supporting early diagnosis and clinical decision-making.

研究の動機と目的

胸部CTスキャンからの肺がん検出とセグメンテーションの精度向上を動機づける。
3つの事前学習済み CNN バックボーン（VGG16、ResNet50、Xception）を用いた U-Net セグメンテーション性能を評価する。
U-Net セグメンテーションに基づく CNN とハイブリッド CNN+ML 分類器（SVM、RF、GB）を用いた分類性能を評価する。
セグメンテーションと分類の結果を比較するため、頑健なクロスバリデーションと複数指標を適用する。
提案フレームワークを既存研究と比較して、セグメンテーションと検出精度の向上を強調する。

提案手法

CLAHE で前処理を行い、128×128 にリサイズしてピクセル値を正規化する。
VGG16、ResNet50、または Xception のバックボーンを用いた U-Net で肺をセグメント化する（エンコーダは事前学習済みCNNから取得）。
U-Net セグメンテーション上で癌・非癌検出のため独立した CNN 分類器を訓練する。
CNN で抽出した特徴を SVM、Random Forest、Gradient Boosting 分類器へ入力するハイブリッドモデルを開発する。
Dice、IoU、精度、適合率、再現率、F1、ROC-AUC を用いた 5-fold クロスバリデーションで評価し、平均と標準偏差を報告する。

実験結果

リサーチクエスチョン

RQ1異なる事前学習済みバックボーン（VGG16、ResNet50、Xception）を用いた U-Net セグメンテーションは、癌対非癌 CT 画像の肺セグメンテーションを向上させるか。
RQ2U-Net セグメンテーション上で訓練された CNN モデルの分類性能はどの程度か、そしてハイブリッド CNN+ML アプローチと比較してどうか。
RQ3胸部 CT データの肺がん検出において、どのバックボーンと分類器の組み合わせが、精度、再現率、ROC-AUC のバランスとして最も優れているか。

主な発見

ResNet50 を用いた U-Net は癌セグメンテーションの Dice が 0.9495、精度が 0.9735（±0.1015）。
VGG16 を用いた U-Net は非癌肺のセグメンテーションで最良の Dice 0.9532、精度 0.9513（±0.1006）。
Xception を用いた U-Net は一貫した結果を提供（癌 Dice 0.9383、非癌 Dice 0.9521）。
Xception バックボーンを用いた U-Net セグメンテーションで訓練した CNN モデルは 99.10% の精度、99.74% の再現率、99.42% の F1 スコア（ROC-AUC 0.9911）を達成。
ハイブリッド CNN+ML モデルも競争力のある性能を示し、CNN-SVM-Xception が高い精度と F1 を達成し、CNN-GB-VGG16 が高い ROC-AUC を達成。
事前訓練済みバックボーンは Xception がバランスの取れた結果を提供する強力な分類性能を示し（精度 0.9580、再現率 0.9950、F1 0.9739、ROC-AUC 0.9193）。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。