[论文解读] DDMM-Synth: A Denoising Diffusion Model for Cross-modal Medical Image Synthesis with Sparse-view Measurement Embedding
DDMM-Synth 使用 MRI 引导的扩散模型,结合范围-空域分解,在合成 CT 时从 MRI 融入稀疏视图 CT 测量,提高数据一致性和对噪声的鲁棒性。
Reducing the radiation dose in computed tomography (CT) is important to mitigate radiation-induced risks. One option is to employ a well-trained model to compensate for incomplete information and map sparse-view measurements to the CT reconstruction. However, reconstruction from sparsely sampled measurements is insufficient to uniquely characterize an object in CT, and a learned prior model may be inadequate for unencountered cases. Medical modal translation from magnetic resonance imaging (MRI) to CT is an alternative but may introduce incorrect information into the synthesized CT images in addition to the fact that there exists no explicit transformation describing their relationship. To address these issues, we propose a novel framework called the denoising diffusion model for medical image synthesis (DDMM-Synth) to close the performance gaps described above. This framework combines an MRI-guided diffusion model with a new CT measurement embedding reverse sampling scheme. Specifically, the null-space content of the one-step denoising result is refined by the MRI-guided data distribution prior, and its range-space component derived from an explicit operator matrix and the sparse-view CT measurements is directly integrated into the inference stage. DDMM-Synth can adjust the projection number of CT a posteriori for a particular clinical application and its modified version can even improve the results significantly for noisy cases. Our results show that DDMM-Synth outperforms other state-of-the-art supervised-learning-based baselines under fair experimental conditions.
研究动机与目标
- Motivate reducing CT radiation by enabling high-fidelity CT reconstruction from sparse measurements and cross-modal MRI information.
- Develop a diffusion-model framework that leverages MRI guidance and sparse-view CT priors for MRI-to-CT synthesis.
- Enable flexible adjustment to measurement settings and robustness to noisy measurements without retraining.
- Assess whether integrating MRI guidance and CT measurement embedding yields superior synthesis quality compared to state-of-the-art baselines.
提出的方法
- Conditional diffusion model trained with MRI guidance.
- Null-space range-null space decomposition to insert sparse-view CT information during reverse sampling.
- CT reconstruction consistency enforced via A†y and measurements y in the reverse process (Equations 5–7).
- Noise handling in diffusion steps with gamma_t and phi_t to account for measurement noise (Equations 7–13).
- DDIM sampling to accelerate reverse process (T = 2000 to 100 steps).
- A single framework adaptable to different linear measurement mappings without retraining.
实验结果
研究问题
- RQ1Can an MRI-guided diffusion model, combined with sparse-view CT measurements, produce higher-fidelity CT images than prior MRI-to-CT translation or purely data-driven reconstruction methods?
- RQ2Does the range-null space measurement embedding improve data consistency and structural fidelity in synthesized CT under sparse-view and noisy scenarios?
- RQ3How does DDMM-Synth perform relative to state-of-the-art supervised baselines on multi-modal pelvic MRI-CT (Gold Atlas) and BRATS2018 datasets?
- RQ4Is the method robust to different projection counts and to measurement noise without additional model training?
主要发现
| 方法 | N_p | PSNR (Gold Atlas) | SSIM (Gold Atlas) | PSNR (BRATS2018) | SSIM (BRATS2018) |
|---|---|---|---|---|---|
| pix2pix | 23 | 28.98 ± 1.87 | 0.887 ± 0.019 | 24.01 ± 1.91 | 0.797 ± 0.028 |
| pGAN | 23 | 29.19 ± 1.64 | 0.893 ± 0.012 | 24.16 ± 1.44 | 0.810 ± 0.027 |
| medSynth | 23 | 30.03 ± 1.81 | 0.891 ± 0.023 | 24.32 ± 2.06 | 0.813 ± 0.031 |
| SAGAN | 23 | 31.01 ± 2.01 | 0.912 ± 0.021 | 26.62 ± 1.97 | 0.826 ± 0.027 |
| DDMM-Synth | 10 | 29.82 ± 2.10 | 0.861 ± 0.020 | 26.47 ± 1.85 | 0.796 ± 0.025 |
| DDMM-Synth | 20 | 33.12 ± 1.99 | 0.936 ± 0.019 | 27.03 ± 1.27 | 0.863 ± 0.021 |
| DDMM-Synth | 23 | 33.79 ± 1.98 | 0.941 ± 0.019 | 27.54 ± 1.25 | 0.872 ± 0.022 |
- DDMM-Synth achieves higher PSNR/SSIM than several baselines on Gold Atlas and BRATS2018 under varying projection counts.
- With fewer projections (e.g., 20 vs 23) DDMM-Synth can outperform some methods that use more projections, indicating strong data-consistency with sparse views.
- DDMM-Synth-noise variant significantly improves performance in noisy acquisitions, as evidenced by FID reductions.
- Ablation studies show MRI guidance plus sparse-view CT embedding yields more faithful CT reconstructions than variants lacking either component.
- Diffusion-based synthesis provides higher anatomical fidelity and fewer artifacts in ROIs compared to non-diffusion baselines.
更好的研究,从现在开始
从论文设计到论文写作,大幅缩短您的研究时间。
无需绑定信用卡
本解读由 AI 生成,并经人工编辑审核。