QUICK REVIEW

[论文解读] Split Learning for collaborative deep learning in healthcare

Maarten G. Poirot, Praneeth Vepakomma|arXiv (Cornell University)|Dec 27, 2019

Machine Learning in Healthcare参考文献 23被引用 82

一句话总结

分割学习使多家医疗中心能够在协同深度学习中实现协作，在视网膜和胸部X光任务上达到与集中式或非协作方案相当或更优的性能，并且随着客户端数量增加，收益仍然存在。

ABSTRACT

Shortage of labeled data has been holding the surge of deep learning in healthcare back, as sample sizes are often small, patient information cannot be shared openly, and multi-center collaborative studies are a burden to set up. Distributed machine learning methods promise to mitigate these problems. We argue for a split learning based approach and apply this distributed learning method for the first time in the medical field to compare performance against (1) centrally hosted and (2) non collaborative configurations for a range of participants. Two medical deep learning tasks are used to compare split learning to conventional single and multi center approaches: a binary classification problem of a data set of 9000 fundus photos, and multi-label classification problem of a data set of 156,535 chest X-rays. The several distributed learning setups are compared for a range of 1-50 distributed participants. Performance of the split learning configuration remained constant for any number of clients compared to a single center study, showing a marked difference compared to the non collaborative configuration after 2 clients (p < 0.001) for both sets. Our results affirm the benefits of collaborative training of deep neural networks in health care. Our work proves the significant benefit of distributed learning in healthcare, and paves the way for future real-world implementations.

研究动机与目标

Motivate and address data scarcity and privacy in healthcare deep learning.
Propose a split learning (U-shaped) framework suitable for healthcare settings.
Evaluate split learning against centralized and non-collaborative baselines on medical imaging tasks.
Assess scalability across 1-50 distributed participants and analyze performance stability.

提出的方法

Apply a U-shaped split learning configuration with three links (front local, center central, back local).
Train networks sequentially across clients without sharing raw data or labels; transfer local state when switching clients.
Use Resnet-34 for Diabetic Retinopathy and DenseNet-121 for CheXpert with standard Adam optimization.
Compare performance to centralized training and non-collaborative training; evaluate with accuracy (DR) and AUROC (CheXpert).
Partition data into 75% training and 25% validation, equal data split across clients, with no patient overlap.
Use one epoch per client in collaborative mode; non-collaborative mode mirrors data size of the collaborative setting.

实验结果

研究问题

RQ1Does split learning yield performance comparable to centralized training in medical imaging tasks?
RQ2How does collaboration via split learning scale with increasing numbers of distributed participants (1-50)?
RQ3What is the performance gap between split learning and non-collaborative training across tasks and client counts?
RQ4What are the privacy and bandwidth implications of the U-shaped split learning configuration in healthcare?
RQ5How does split learning performance compare to other distributed approaches (as referenced) in non-medical settings?

主要发现

number of clients	Split learning mean (DR)	Non collaborative mean (DR)
1	0.888 (0.896, 0.880)	0.869 (0.877, 0.861)
2	0.850 (0.857, 0.843)	0.852 (0.865, 0.839)
3	0.868 (0.875, 0.861)	0.753 (0.766, 0.742)
4	0.884 (0.891, 0.878)	0.754 (0.770, 0.739)
5	0.869 (0.877, 0.861)	0.755 (0.772, 0.738)
8	0.887 (0.894, 0.880)	0.717 (0.733, 0.701)
10	0.858 (0.868, 0.849)	0.676 (0.695, 0.657)
15	0.838 (0.848, 0.829)	0.627 (0.649, 0.603)
20	0.860 (0.868, 0.852)	0.613 (0.632, 0.594)
25	0.850 (0.858, 0.841)	0.607 (0.627, 0.588)
30	0.814 (0.831, 0.797)	0.620 (0.648, 0.590)
35	0.798 (0.819, 0.780)	0.633 (0.656, 0.611)
40	0.852 (0.859, 0.844)	0.595 (0.619, 0.568)
45	0.883 (0.891, 0.876)	0.608 (0.634, 0.581)
50	0.859 (0.869, 0.849)	0.588 (0.611, 0.565)

Split learning consistently outperformed non-collaborative training across both datasets.
On CheXpert, mean performance in non-collaborative settings was significantly lower than collaborative settings for more than 2 clients (p<0.005).
DR results show stable performance for split learning across 1-50 clients, with higher accuracy than non-collaborative configurations as client count increases.
Table 1 reports DR results where, for 1 client, split learning = 0.888 vs non-collaborative = 0.869; for 2 clients, 0.850 vs 0.852; and for higher client counts, split learning remains competitive (values shown in table).
CheXpert results use AUROC across five tasks; split learning beats non-collaborative configurations, with significant differences emerging as the number of clients grows.
The study concludes that distributed training in healthcare via split learning yields notable performance gains and supports future real-world implementations.

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。