Skip to main content
QUICK REVIEW

[논문 리뷰] XLSor: A Robust and Accurate Lung Segmentor on Chest X-Rays Using Criss-Cross Attention and Customized Radiorealistic Abnormalities Generation

Youbao Tang, Yuxing Tang|arXiv (Cornell University)|2019. 04. 19.
COVID-19 diagnosis using AI참고 문헌 31인용 수 86
한 줄 요약

XLSor introduces criss-cross attention for global context in lung segmentation on chest X-rays and uses radiorealistic abnormal CXR data augmentation to improve robustness, achieving superior performance on abnormal and normal CXRs.

ABSTRACT

This paper proposes a novel framework for lung segmentation in chest X-rays. It consists of two key contributions, a criss-cross attention based segmentation network and radiorealistic chest X-ray image synthesis (i.e. a synthesized radiograph that appears anatomically realistic) for data augmentation. The criss-cross attention modules capture rich global contextual information in both horizontal and vertical directions for all the pixels thus facilitating accurate lung segmentation. To reduce the manual annotation burden and to train a robust lung segmentor that can be adapted to pathological lungs with hazy lung boundaries, an image-to-image translation module is employed to synthesize radiorealistic abnormal CXRs from the source of normal ones for data augmentation. The lung masks of synthetic abnormal CXRs are propagated from the segmentation results of their normal counterparts, and then serve as pseudo masks for robust segmentor training. In addition, we annotate 100 CXRs with lung masks on a more challenging NIH Chest X-ray dataset containing both posterioranterior and anteroposterior views for evaluation. Extensive experiments validate the robustness and effectiveness of the proposed framework. The code and data can be found from https://github.com/rsummers11/CADLab/tree/master/Lung_Segmentation_XLSor .

연구 동기 및 목표

  • Motivate accurate lung segmentation for chest X-rays to aid automated diagnosis across normal and abnormal cases.
  • Develop a segmentation framework that captures global contextual information using criss-cross attention (CCA).
  • Reduce annotation burden via radiorealistic abnormal CXR generation and pseudo-masks for robust training.
  • Evaluate on multiple datasets including NI H Chest X-rays with PA and AP views to test generalization.

제안 방법

  • Use criss-cross attention modules within a CNN backbone to capture horizontal and vertical global context for pixel-wise segmentation.
  • Replace last down-sampling layers with dilated convolutions to achieve a stride of 8 for high-resolution context.
  • Adopt a two-CCA recurrent arrangement to densely collect contextual information.
  • Generate radiorealistic abnormal CXRs from normal ones via MUNIT and propagate normal masks as pseudo masks for training augmentation.
  • Train XLSor with real data and augmented data; evaluate against U-Net baselines.
  • Annotate 100 abnormal NIH CXRs to test generalization.

실험 결과

연구 질문

  • RQ1Can criss-cross attention improve global contextual reasoning for accurate lung segmentation in CXRs, especially under abnormal/pathological conditions?
  • RQ2Does radiorealistic augmentation with pseudo masks boost robustness and generalization to unseen abnormal CXRs across PA and AP views?
  • RQ3How does XLSor compare to a standard U-Net baseline in normal and abnormal CXR segmentation settings?

주요 결과

CXR 마스크U-Net RU-Net A^4U-Net R+A^4XLSor RXLSor A^4XLSor R+A^4
REC0.976 ± 0.020.963 ± 0.030.972 ± 0.020.973 ± 0.020.967 ± 0.020.974 ± 0.02
PRE0.968 ± 0.030.979 ± 0.020.970 ± 0.030.979 ± 0.020.983 ± 0.010.976 ± 0.01
DICE0.972 ± 0.020.971 ± 0.020.975 ± 0.010.976 ± 0.010.973 ± 0.010.975 ± 0.01
AVD0.198 ± 0.560.162 ± 0.360.131 ± 0.340.149 ± 0.510.098 ± 0.070.078 ± 0.06
VS0.988 ± 0.020.989 ± 0.010.990 ± 0.010.992 ± 0.010.991 ± 0.010.993 ± 0.01
  • XLSor outperforms U-Net on both public testing and NIH datasets, with notable gains on unseen abnormal CXRs (e.g., Dice score improvements).
  • Adding radiorealistic augmented samples (A^i) consistently improves performance over using real data alone.
  • Using criss-cross attention with CCA modules enhances global context learning and yields better segmentation, especially in challenging cases.
  • Models trained with augmentation (R+A^4) achieve robust performance on NIH data, indicating good generalization to more complex abnormal CXRs.
  • Even when trained only on augmented data (A^4), XLSor achieves competitive results on public and NIH datasets, demonstrating effective pseudo-mask supervision.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.