QUICK REVIEW

[논문 리뷰] Classification of Histopathological Biopsy Images Using Ensemble of Deep Learning Networks

Sara Hosseinzadeh Kassani, Peyman Hosseinzadeh Kassani|arXiv (Cornell University)|2019. 09. 26.

AI in cancer detection참고 문헌 38인용 수 60

한 줄 요약

이 논문은 전이 학습을 이용한 사전 학습 CNN(VGG19, MobileNet, DenseNet201)의 세-path 앙상블을 제시하여 유방 조직 histopathology 이미지를 이진 분류하며, 네 가지 공개 데이터셋에서 최첨단 성능을 달성한다.

ABSTRACT

Breast cancer is one of the leading causes of death across the world in women. Early diagnosis of this type of cancer is critical for treatment and patient care. Computer-aided detection (CAD) systems using convolutional neural networks (CNN) could assist in the classification of abnormalities. In this study, we proposed an ensemble deep learning-based approach for automatic binary classification of breast histology images. The proposed ensemble model adapts three pre-trained CNNs, namely VGG19, MobileNet, and DenseNet. The ensemble model is used for the feature representation and extraction steps. The extracted features are then fed into a multi-layer perceptron classifier to carry out the classification task. Various pre-processing and CNN tuning techniques such as stain-normalization, data augmentation, hyperparameter tuning, and fine-tuning are used to train the model. The proposed method is validated on four publicly available benchmark datasets, i.e., ICIAR, BreakHis, PatchCamelyon, and Bioimaging. The proposed multi-model ensemble method obtains better predictions than single classifiers and machine learning algorithms with accuracies of 98.13%, 95.00%, 94.64% and 83.10% for BreakHis, ICIAR, PatchCamelyon and Bioimaging datasets, respectively.

연구 동기 및 목표

다양한 소스에서 얻은 이질적 조직병리 이미지 데이터의 도전 과제 해결.
핸드크래프트 피처 없이 양성/음성 유방 병리학 이미지에 대한 강건한 이진 분류기 개발.
전이 학습 및 데이터 증강을 활용하여 데이터셋 간 일반화 향상.
공개 벤치마크에서 단일 CNN 및 전통 머신러닝 방법과 비교 평가.

제안 방법

VGG19, MobileNetV2, DenseNet201를 결합한 세-path 앙상블 아키텍처를 통해 특징 추출.
ImageNet 가중치 CNN으로부터의 전이 학습, 두 클래스 출력으로 미세 조정.
최종 레이어를 평평화하고 연결하여 다-view 특징 벡터를 형성하고 이를 다층 퍼셉트론 분류기에 입력.
전처리에는 맥엔코 맨드(Macenko) 염료 표준화, 이미지 정규화, 데이터 증가(flips, 회전, 확대 등) 포함.
학습 세부 사항: 이미지 크기를 224x224로 리사이즈, 배치 크기 32, 에포크 1000, Adam 옵티마이저, 드롭아웃 0.5, 256-히든 뉴런 FC 레이어.

실험 결과

연구 질문

RQ13-경로 CNN 앙상블이 단일 CNN 및 전통 ML 방법에 비해 여러 데이터셋에서 이진 유방 histopathology 분류를 향상시킬 수 있는가?
RQ2염도 표준화 및 증가를 통한 전이 학습이 이질적 데이터셋(BreakHis, ICIAR, PatchCamelyon, Bioimaging)에 대한 일반화에 어떤 영향을 미치는가?
RQ3공개 벤치마크에서 앙상블과 개별 아키텍처의 비교 정확도 및 기타 지표는 어떤가?

주요 결과

Dataset	Accuracy	Precision	Recall	F-score
BreakHis	98.13%	98.75%	98.54%	98.64%
PatchCamelyon*	94.64%	95.70%	95.27%	95.50%
ICIAR	95.00%	95.91%	94.00%	94.94%
Bioimaging	83.10%	92.60%	71.42%	80.64%

BreakHis에서 앙상블은 98.13% 정확도, 98.75% 정밀도, 98.54% 재현율, 98.64% F-점수.
PatchCamelyon*에서 앙상블은 94.64% 정확도, 95.70% 정밀도, 95.27% 재현율, 95.50% F-점수.
ICIAR에서 앙상블은 95.00% 정확도, 95.91% 정밀도, 94.00% 재현율, 94.94% F-점수.
Bioimaging에서 앙상블은 83.10% 정확도, 92.60% 정밀도, 71.42% 재현율, 80.64% F-점수.
단일 CNN들(VGG19, MobileNetV2, DenseNet201) 및 다른 최첨단 CNN들은 일반적으로 BreakHis, ICIAR, PatchCamelyon* 데이터셋에서 앙상블보다 성능이 떨어졌고; Bioimaging은 전반적으로 성능이 낮아 어려움 지속.
여러 머신러닝 모델(의사결정 트리, 랜덤 포레스트, XGBoost, AdaBoost, Bagging)과 비교할 때, 앙상블이 대부분의 데이터셋에서 우수한 성능을 보였으나 특정 데이터셋에서 일부 문헌 방법이 더 높은 정확도를 달성한 사례도 있음(예: BreakHis에서 Pratiher & Chattoraj 2019).

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.