Skip to main content
QUICK REVIEW

[论文解读] Variational Federated Multi-Task Learning

Luca Corinzia, Joachim M. Buhmann|arXiv (Cornell University)|Jun 14, 2019
Privacy-Preserving Technologies in Data参考文献 66被引用 121
一句话总结

VIRTUAL 引入用于非凸模型的联邦多任务学习,采用星形贝叶斯网络和变分推断,在性能和更新稀疏性方面优于 FedAvg/FedProx。

ABSTRACT

In federated learning, a central server coordinates the training of a single model on a massively distributed network of devices. This setting can be naturally extended to a multi-task learning framework, to handle real-world federated datasets that typically show strong statistical heterogeneity among devices. Despite federated multi-task learning being shown to be an effective paradigm for real-world datasets, it has been applied only on convex models. In this work, we introduce VIRTUAL, an algorithm for federated multi-task learning for general non-convex models. In VIRTUAL the federated network of the server and the clients is treated as a star-shaped Bayesian network, and learning is performed on the network using approximated variational inference. We show that this method is effective on real-world federated datasets, outperforming the current state-of-the-art for federated learning, and concurrently allowing sparser gradient updates.

研究动机与目标

  • Address the challenge of federated multi-task learning with strongly non-IID client data.
  • Develop a non-convex federated MTL algorithm suitable for distributed networks.
  • Leverage a Bayesian network and variational/inference techniques to share knowledge while preserving privacy.
  • Enable sparse gradient updates to reduce communication costs.

提出的方法

  • Model the server and clients as a star-shaped Bayesian network with shared and private parameters.
  • Use an EP-like variational inference scheme to approximate the posterior over server and client parameters.
  • Adopt a Gaussian mean-field posterior for server and client parameters to facilitate tractable updates.
  • Compute client updates as deltas of their posterior factors and aggregate them at the server.
  • Retain privacy by only sharing aggregated server posterior and the product of client deltas, not individual client factors.
  • Allow a KL-divergence weighted free energy term to balance reconstruction and regularization between server and clients.

实验结果

研究问题

  • RQ1Can federated multi-task learning be effectively extended to generic non-convex models in a federated setting?
  • RQ2How can variational inference be applied to a star-shaped server–client Bayesian network to enable knowledge transfer while handling non-IID data?
  • RQ3Does VIRTUAL outperform existing FL baselines (FedAvg, FedProx) on real-world federated datasets while enabling sparse updates?
  • RQ4What is the impact of KL-divergence weighting on learning performance and sparsity of updates?

主要发现

  • VIRTUAL outperforms state-of-the-art federated learning baselines on several real-world datasets across multiple architectures.
  • The framework enables sparser client updates while maintaining or improving accuracy.
  • Client models can remain private and specialized to their data while still benefiting from shared server knowledge.
  • Using a KL-divergence regularization multiplier beta allows tuning performance and sparsity.
  • The method supports non-IID data and traditional FL challenges without requiring centralized data access.

更好的研究,从现在开始

从论文设计到论文写作,大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成,并经人工编辑审核。