QUICK REVIEW

[논문 리뷰] V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation

Fausto Milletarì, Nassir Navab|arXiv (Cornell University)|2016. 06. 15.

Medical Image Segmentation Techniques인용 수 257

한 줄 요약

V-Net은 Dice 기반 손실을 사용한 3D 완전 합성곱 네트워크를 도입하여 엔드-투-엔드 체적 MRI 전립선 분할을 수행하며, 빠른 추론 및 높은 정확도를 달성합니다.

ABSTRACT

Convolutional Neural Networks (CNNs) have been recently employed to solve problems from both the computer vision and medical image analysis fields. Despite their popularity, most approaches are only able to process 2D images while most medical data used in clinical practice consists of 3D volumes. In this work we propose an approach to 3D image segmentation based on a volumetric, fully convolutional, neural network. Our CNN is trained end-to-end on MRI volumes depicting prostate, and learns to predict segmentation for the whole volume at once. We introduce a novel objective function, that we optimise during training, based on Dice coefficient. In this way we can deal with situations where there is a strong imbalance between the number of foreground and background voxels. To cope with the limited number of annotated volumes available for training, we augment the data applying random non-linear transformations and histogram matching. We show in our experimental evaluation that our approach achieves good performances on challenging test data while requiring only a fraction of the processing time needed by other previous methods.

연구 동기 및 목표

엔드-투-엔드 의료 영상 분할을 위한 체적(3D) CNN의 시연.
Dice 기반 손실로 의료 체적에서의 클래스 불균형 문제를 다룬다.
잔차 학습과 체적 합성곱을 통한 학습 효율성과 수렴 개선.
실시간 기하학적/강도 변환으로 주석이 제한된 데이터를 확장.
PROMISE 2012 전립선 MRI 데이터에서 빠른 추론과 경쟁력 있는 성능를 보인다.

제안 방법

엔드-투-엔드로 128×128×64 MRI 체적을 처리하는 체적 완전합성곱 네트워크를 제안한다.
3D 인코더–디코더(압축 및 확장) 구조 내에서 잔차 학습 블록을 사용한다.
풀링을 합성곱으로 대체하고 2×2×2 스트라이드-2 다운샘플링으로 해상도는 낮추되 특징 채널은 늘린다.
이른 단계의 미세한 디테일과 더 깊은 전역 컨텍스트를 결합하기 위해 스킵 유사 수평 연결을 도입한다.
전경–배경 불균형을 다루기 위해 Dice 계수 기반 손실로 학습한다.
강건성을 위해 실시간 무작위 변형 변환과 히스토그램 매칭으로 학습 데이터를 증강한다.

실험 결과

연구 질문

RQ1전체 MRI 볼륨에서 슬라이스 단위 처리 없이 3D 완전 합성곱 네트워크가 전립선 구조를 직접 분할할 수 있는가?
RQ2재가중치를 적용한 교차 엔트로피와 비교해 Dice 기반 손실이 전경–배경 불균형에서 분할 성능을 향상시키는가?
RQ3잔차 연결과 체적 다운샘플링/업샘플링이 수렴 속도와 분할 정확도에 미치는 영향은 무엇인가?
RQ4기존 방법에 비해 PROMISE 2012 전립선 MRI 데이터에서 V-Net의 성능은 어떠한가?

주요 결과

V-Net은 Dice 기반 손실을 사용하여 PROMISE 2012 테스트 데이터에서 Avg. Dice 0.869±0.033를 달성한다.
V-Net은 Dice 기반 손실로 Avg. Hausdorff distance 5.71±1.20 mm를 달성한다.
V-Net은 Dice 기반 손실로 PROMISE 2012 챌린지 태스크에서 82.39를 점수로 얻었다.
다항 로지스틱 손실에 재가중치를 적용하면 Dice가 더 낮아지고(0.739±0.088), Hausdorff 거리는 더 나빠진다(10.55±5.38 mm).
V-Net의 Dice 손실은 같은 데이터셋의 여러 방법들을 상회한다(예: Imorphics 0.879±0.044 Dice, 5.935±2.14 mm HD; ScrAutoProstate 0.874±0.036 Dice; SBIA 0.835±0.055 Dice).
강력한 데이터 증강으로 50개 볼륨에서 학습하면 새로운 볼륨의 분할이 약 1초 안에 수행된다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.