QUICK REVIEW

[논문 리뷰] Synthetic Poisoning Attacks: The Impact of Poisoned MRI Image on U-Net Brain Tumor Segmentation

Tianhao Li, Tianyu Zeng|ArXiv.org|2025. 02. 06.

Forensic Toxicology and Drug Analysis인용 수 4

한 줄 요약

논문은 GAN으로 합성된 MRI 데이터 비율을 높일수록 학습 세트가 오염되고 U-Net 뇌종양 분할 성능이 저하되는지 연구합니다.

ABSTRACT

Deep learning-based medical image segmentation models, such as U-Net, rely on high-quality annotated datasets to achieve accurate predictions. However, the increasing use of generative models for synthetic data augmentation introduces potential risks, particularly in the absence of rigorous quality control. In this paper, we investigate the impact of synthetic MRI data on the robustness and segmentation accuracy of U-Net models for brain tumor segmentation. Specifically, we generate synthetic T1-contrast-enhanced (T1-Ce) MRI scans using a GAN-based model with a shared encoding-decoding framework and shortest-path regularization. To quantify the effect of synthetic data contamination, we train U-Net models on progressively "poisoned" datasets, where synthetic data proportions range from 16.67% to 83.33%. Experimental results on a real MRI validation set reveal a significant performance degradation as synthetic data increases, with Dice coefficients dropping from 0.8937 (33.33% synthetic) to 0.7474 (83.33% synthetic). Accuracy and sensitivity exhibit similar downward trends, demonstrating the detrimental effect of synthetic data on segmentation robustness. These findings underscore the importance of quality control in synthetic data integration and highlight the risks of unregulated synthetic augmentation in medical image analysis. Our study provides critical insights for the development of more reliable and trustworthy AI-driven medical imaging systems.

연구 동기 및 목표

의료 영상 분할에서 합성 데이터의 견고한 평가를 촉진한다.
합성 데이터 오염이 U-Net 뇌종양 분할 성능에 미치는 영향을 정량화한다.
성능이 크게 저하되는 오염 임계치를 식별한다.
의료 영상의 안전하고 신뢰할 수 있는 AI에 대한 시사점을 강조한다.

제안 방법

공유 인코딩-디코딩 및 최단 경로 정규화를 갖춘 GAN을 사용하여 paired CT-MRI 데이터를 바탕으로 합성 T1-Ce MRI 이미지를 생성한다.
학습 데이터에 실제 이미지와 합성 이미지를 지정된 비율(16.67% to 83.33%)로 혼합하여 오염된 학습 데이터를 생성한다.
기본 U-Net을 실제 데이터에 대해 학습시키고 각 오염 데이터 세트에서 오염된 U-Net을 학습시킨다.
Dice, Jaccard, 정확도, 민감도를 사용하여 실제 MRI 검증 세트에서 모델을 평가한다.

실험 결과

연구 질문

RQ1학습 데이터에서 합성 데이터 비율이 증가할수록 U-Net 뇌종양 분할 성능에 어떤 영향을 미치는가?
RQ2성능 저하가 본격적으로 나타나는 임계 비율은 무엇인가?
RQ3합성 데이터 오염의 증가에 따라 Dice, Jaccard, 정확도, 민감도가 일관되게 감소하는가?

주요 결과

Dice는 33.33% 합성에서 0.8937에서 83.33% 합성에서 0.7474로 감소한다.
Jaccard 및 민감도는 더 높은 합성 데이터 비율에서 비슷한 하향 추세를 보인다.
합성 데이터 증가에도 불구하고 정확도는 비교적 안정적으로 유지된다.
낮은 합성 데이터 비율(≤33.33%)은 비교적 안정적인 성능을 유지하고, ≥50%에서 현저한 저하가 발생한다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.