QUICK REVIEW

[論文レビュー] Artifact Reduction in Undersampled 3D Cone-Beam CTs using a Hybrid 2D-3D CNN Framework

Johannes Thalhammer, Tina Dorosti|arXiv (Cornell University)|Feb 9, 2026

Medical Imaging Techniques and Applications被引用数 0

ひとこと要約

この論文は、2D U-Net を用いてスライスごとの特徴を抽出し、それらを統合する3Dデコーダを用いるハイブリッドな2D-3D CNNフレームワークを提案し、欠陥のある undersampled 3D cone-beam CT のアーティファクトを低い計算オーバーヘッドを維持しつつ低減します。

ABSTRACT

Undersampled CT volumes minimize acquisition time and radiation exposure but introduce artifacts degrading image quality and diagnostic utility. Reducing these artifacts is critical for high-quality imaging. We propose a computationally efficient hybrid deep-learning framework that combines the strengths of 2D and 3D models. First, a 2D U-Net operates on individual slices of undersampled CT volumes to extract feature maps. These slice-wise feature maps are then stacked across the volume and used as input to a 3D decoder, which utilizes contextual information across slices to predict an artifact-free 3D CT volume. The proposed two-stage approach balances the computational efficiency of 2D processing with the volumetric consistency provided by 3D modeling. The results show substantial improvements in inter-slice consistency in coronal and sagittal direction with low computational overhead. This hybrid framework presents a robust and efficient solution for high-quality 3D CT image post-processing. The code of this project can be found on github: https://github.com/J-3TO/2D-3DCNN_sparseview/.

研究の動機と目的

低消費行為や被ばく量低減を可能にする undersampled CT ボリュームのアーティファクト低減を動機づける。
2D スライスごとの特徴抽出と3D体積デコーディングを組み合わせたハイブリッドアーキテクチャを提案する。
PSNRとSSIM 指標を用いてスパースビューデータに対するアーティファクト低減性能を評価する。

提案手法

512x512 axial スライスで2D U-Netを訓練し、スパースビュー入力からアーティファクトを除去する。
体積から2D特徴マップを抽出してスタックし、3D特徴表現を形成する。
3x3x3 畳み込みの3Dデコーダを用いて、スタックした特徴とスパース入力からアーティファクト低減を施した3Dボリュームを再構成する。
PSNR、SSIM、計算時間の観点から、2D のみの後処理と3Dデコーダ後処理を比較する。

Fig. 1: Overview of the 2D-3D training pipeline. A) First, a 2D U-Net is trained on 2D axial slices to remove artifacts. B) The encoder of the trained U-Net is used to extract features at different levels of an entire CT volume by looping through $N$ axial slices. Subsequently, the extracted feature

実験結果

リサーチクエスチョン

RQ1ハイブリッドな2D-3D CNNフレームワークは、2Dまたは3Dモデル単独と比べて undersampled cone-beam CT のアーティファクト低減を改善できるか。
RQ22D特徴を3D表現にスタックすることで体積的一貫性の改善が得られるか、計算コストは過大にならないか。
RQ3提案アプローチのPSNRとSSIMの定量的改善、および実行時間のトレードオフはどうなるか。

主な発見

Method	PSNR (dB)	SSIM
sparse-view	24.810 ± 0.522	0.637 ± 0.015
2D U-Net	39.289 ± 1.295	0.949 ± 0.015
3D decoder	38.086 ± 1.188	0.938 ± 0.016

2D U-Net と 3D デコーダは、スパースビュー再構成よりも PSNR と SSIM を大幅に改善する。
軸位ビューでは、どちらの手法も視覚的にアーティファクト低減が類似しているが、冠状位と矢状位ビューでは3Dデコーダがスライス間の一貫性を優れている。
テストデータのPSNR：sparse-view 24.810 ± 0.522; 2D U-Net 39.289 ± 1.295; 3D decoder 38.086 ± 1.188。
テストデータのSSIM：sparse-view 0.637 ± 0.015; 2D U-Net 0.949 ± 0.015; 3D decoder 0.938 ± 0.016。
ボリュームあたりの実行時間：2D U-Net 後処理で 2.408 ± 0.651 秒；3Dデコーダ（特徴抽出を含む）で 20.276 ± 5.768 秒（NVIDIA A100 上）。

Fig. 2: An axial, coronal, and sagittal slice of the full-view reconstruction from the test split (first row), 128-view reconstruction (second row) with artifact reduction by the 2D U-Net (third row) and by the Hybrid 2D-3D CNN framework (last row), respectively. Images are in the mediastinum window

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。