QUICK REVIEW

[论文解读] FedMood: Federated Learning on Mobile Health Data for Mood Detection

Xiaohang Xu, Hao Peng|arXiv (Cornell University)|Feb 6, 2021

Mental Health Research Topics参考文献 31被引用 26

一句话总结

FedMood 提出一种用于情绪/抑郁检测的联邦多视图学习框架，利用移动端击键数据与加速度计数据，采用晚期融合架构，在 IID 与非 IID 数据分布上的实验显示性能提升及分布敏感性。

ABSTRACT

Depression is one of the most common mental illness problems, and the symptoms shown by patients are not consistent, making it difficult to diagnose in the process of clinical practice and pathological research. Although researchers hope that artificial intelligence can contribute to the diagnosis and treatment of depression, the traditional centralized machine learning needs to aggregate patient data, and the data privacy of patients with mental illness needs to be strictly confidential, which hinders machine learning algorithms clinical application. To solve the problem of privacy of the medical history of patients with depression, we implement federated learning to analyze and diagnose depression. First, we propose a general multi-view federated learning framework using multi-source data, which can extend any traditional machine learning model to support federated learning across different institutions or parties. Secondly, we adopt late fusion methods to solve the problem of inconsistent time series of multi-view data. Finally, we compare the federated framework with other cooperative learning frameworks in performance and discuss the related results.

研究动机与目标

在不集中敏感健康信息的前提下，使用移动数据来激励隐私保护的情绪/抑郁检测。
将传统模型扩展到跨机构/参与方的联邦多视图学习。
开发并评估晚期融合策略以应对异步多视图时间序列数据。
评估在 IID 与非 IID 数据分布下的性能，并比较联邦、局部训练和集中基线。

提出的方法

提出一个通用的多视图联邦学习框架，可将传统模型扩展到跨机构的联邦设置。
实现晚期融合以对齐并整合来自多个视图的异质时间序列数据。
评估三种融合策略：全连接层、因子分解机层和多视图机层（DMVM/DFM/DFM）。
采用 Google 风格的 FedAvg 联邦训练，包含本地更新和跨客户端的加权聚合。
引入隐私考量，包括对安全多方计算、差分隐私和同态加密的讨论（情境性概述）。
使用通过定制虚拟键盘收集的移动设备数据（击键元数据和加速度计）用于情绪预测。

实验结果

研究问题

RQ1联邦多视图学习在移动健康数据的情绪检测上与本地和集中方法相比的表现如何？
RQ2数据分布（IID 与非 IID）以及不同客户端数据量对 FedMood 模型准确性的影响是什么？
RQ3在联邦设置下，不同的晚期融合策略（全连接、FM、多视图机）在性能上是否存在差异？
RQ4参与方数量及每方数据量如何影响联邦学习的收敛性和准确性？

主要发现

参与方数量	模型	本地训练	CDS	FedAVG	IIL	CIIL
4	DNN	0.7431	0.8221	0.8114	0.7909	0.8112
4	DFM	0.7514	0.8144	0.8255	0.7851	0.8107
4	DMVM	0.7343	0.7953	0.8095	0.7781	0.8098
8	DNN	0.7344	0.8294	0.8283	0.7899	0.8266
8	DFM	0.7337	0.8366	0.8266	0.7774	0.8288
8	DMVM	0.7267	0.8295	0.8156	0.7751	0.8184
12	DNN	0.7406	0.8352	0.8462	0.7971	0.8324
12	DFM	0.7444	0.8526	0.8304	0.7827	0.8360
12	DMVM	0.7238	0.8309	0.8287	0.7703	0.8409
16	DNN	0.7356	0.8463	0.8383	0.7884	0.8352
16	DFM	0.7379	0.8507	0.8388	0.8039	0.8358
16	DMVM	0.7248	0.8478	0.8281	0.7827	0.8519
24	DNN	0.7388	0.8642	0.8513	0.7945	0.8493
24	DFM	0.7455	0.8597	0.8429	0.8144	0.8487
24	DMVM	0.7240	0.8479	0.8374	0.7940	0.8529

在 IID 设置中，联邦方法（FedAvg 和 CIIL）在本地训练和其他基线之上实现了显著的准确性提升，IID 实验中在 DMVM 情况下最佳情形约为 85%。
在非 IID 设置中，分布极端的情况下预测准确性下降，CDS 常常优于某些联邦变体；然而，CIIL 与某些融合变体仍显示出相对于纯本地训练的鲁棒提升。
增加参与方数量且每方数据固定时，可以提高或稳定联邦模型的准确性，但提升取决于融合方法和数据分布。
增加每个客户端的本地数据通常会提升本地和联邦性能，尤其在每方数据中等量级时，FedAvg 提供额外提升。
实验表明，多视图融合（DMVM/DFM）在若干 IID 情况下可以优于简单的 DNN 融合策略，但非 IID 条件可能削弱这些优势。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。