[论文解读] Variational Federated Multi-Task Learning
VIRTUAL 引入用于非凸模型的联邦多任务学习,采用星形贝叶斯网络和变分推断,在性能和更新稀疏性方面优于 FedAvg/FedProx。
In federated learning, a central server coordinates the training of a single model on a massively distributed network of devices. This setting can be naturally extended to a multi-task learning framework, to handle real-world federated datasets that typically show strong statistical heterogeneity among devices. Despite federated multi-task learning being shown to be an effective paradigm for real-world datasets, it has been applied only on convex models. In this work, we introduce VIRTUAL, an algorithm for federated multi-task learning for general non-convex models. In VIRTUAL the federated network of the server and the clients is treated as a star-shaped Bayesian network, and learning is performed on the network using approximated variational inference. We show that this method is effective on real-world federated datasets, outperforming the current state-of-the-art for federated learning, and concurrently allowing sparser gradient updates.
研究动机与目标
- Address the challenge of federated multi-task learning with strongly non-IID client data.
- Develop a non-convex federated MTL algorithm suitable for distributed networks.
- Leverage a Bayesian network and variational/inference techniques to share knowledge while preserving privacy.
- Enable sparse gradient updates to reduce communication costs.
提出的方法
- Model the server and clients as a star-shaped Bayesian network with shared and private parameters.
- Use an EP-like variational inference scheme to approximate the posterior over server and client parameters.
- Adopt a Gaussian mean-field posterior for server and client parameters to facilitate tractable updates.
- Compute client updates as deltas of their posterior factors and aggregate them at the server.
- Retain privacy by only sharing aggregated server posterior and the product of client deltas, not individual client factors.
- Allow a KL-divergence weighted free energy term to balance reconstruction and regularization between server and clients.
实验结果
研究问题
- RQ1Can federated multi-task learning be effectively extended to generic non-convex models in a federated setting?
- RQ2How can variational inference be applied to a star-shaped server–client Bayesian network to enable knowledge transfer while handling non-IID data?
- RQ3Does VIRTUAL outperform existing FL baselines (FedAvg, FedProx) on real-world federated datasets while enabling sparse updates?
- RQ4What is the impact of KL-divergence weighting on learning performance and sparsity of updates?
主要发现
- VIRTUAL outperforms state-of-the-art federated learning baselines on several real-world datasets across multiple architectures.
- The framework enables sparser client updates while maintaining or improving accuracy.
- Client models can remain private and specialized to their data while still benefiting from shared server knowledge.
- Using a KL-divergence regularization multiplier beta allows tuning performance and sparsity.
- The method supports non-IID data and traditional FL challenges without requiring centralized data access.
更好的研究,从现在开始
从论文设计到论文写作,大幅缩短您的研究时间。
无需绑定信用卡
本解读由 AI 生成,并经人工编辑审核。