QUICK REVIEW

[论文解读] Practical Precoding via Asynchronous Stochastic Successive Convex Approximation

Basil M. Idrees, Javed Akhtar|arXiv (Cornell University)|Oct 3, 2020

Sparse and Compressive Sensing Techniques参考文献 58被引用 11

一句话总结

该论文提出了一种异步随机连续凸逼近（ASCA）算法，用于具有非光滑正则化项的非凸随机优化问题，通过利用超越标准随机梯度的结构化代理函数实现。该算法建立了非渐近收敛性，具有 O(𝜖⁻²) 的随机凸优化（SCO）复杂度，并推导出最大可容忍延迟的精确边界，从而实现无线资源分配中的实际部署，具备低复杂度实现和优越的实验性能。

ABSTRACT

We consider stochastic optimization of a smooth non-convex loss function with a convex non-smooth regularizer. In the online setting, where a single sample of the stochastic gradient of the loss is available at every iteration, the problem can be solved using the proximal stochastic gradient descent (SGD) algorithm and its variants. However in many problems, especially those arising in communications and signal processing, information beyond the stochastic gradient may be available thanks to the structure of the loss function. Such extra-gradient information is not used by SGD, but has been shown to be useful, for instance in the context of stochastic expectation-maximization, stochastic majorization-minimization, and stochastic successive convex approximation (SCA) approaches. By constructing a stochastic strongly convex surrogates of the loss function at every iteration, the stochastic SCA algorithms can exploit the structural properties of the loss function and achieve superior empirical performance as compared to the SGD. In this work, we take a closer look at the stochastic SCA algorithm and develop its asynchronous variant which can be used for resource allocation in wireless networks. While the stochastic SCA algorithm is known to converge asymptotically, its iteration complexity has not been well-studied, and is the focus of the current work. The insights obtained from the non-asymptotic analysis allow us to develop a more practical asynchronous variant of the stochastic SCA algorithm which allows the use of surrogates calculated in earlier iterations. We characterize precise bound on the maximum delay the algorithm can tolerate, while still achieving the same convergence rate. We apply the algorithm to the problem of linear precoding in wireless sensor networks, where it can be implemented at low complexity but is shown to perform well in practice.

研究动机与目标

为非凸、非光滑随机优化中的随机连续凸逼近（SCA）建立非渐近收敛性分析。
表征在不降低收敛速率的前提下，异步更新中可容忍的最大延迟。
通过利用超越一阶梯度的结构信息，实现 SCA 在实时无线资源分配中的实际部署。
在无线传感器网络的线性预编码任务中，展示其相对于近端 SGD 的改进实验性能。

提出的方法

提出一种使用损失函数结构信息构建的强凸代理函数的随机 SCA 异步变体。
引入一种延迟更新机制，重用早期迭代中的代理函数，并对最大可容忍延迟给出形式化边界。
采用类似近端的更新规则，使用满足当前迭代点处切线条件的代理函数 ˆf(x, x[t], ξ[t])。
采用三点更新方案，结合过去梯度的凸组合与类似动量的项，以稳定收敛性。
利用 Peter-Paul 不等式和光滑性假设来有界内积并推导收敛速率。
通过分析迭代过程中子梯度和误差项的期望范数，推导出 SCO 复杂度边界。

实验结果

研究问题

RQ1在具有延迟梯度的异步更新下，随机 SCA 算法的非渐近收敛速率是多少？
RQ2该算法在保持与同步版本相同收敛速率的前提下，可容忍多大程度的延迟？
RQ3使用结构化代理函数（超越一阶梯度）是否能在非凸问题中实现比标准近端 SGD 更快的收敛速度？
RQ4在异步 SCA 框架中，步长、动量参数与延迟之间存在怎样的最优权衡？
RQ5该算法在实际无线网络资源分配任务中的性能与近端 SGD 相比如何？

主要发现

所提出的异步随机 SCA 算法在随机凸优化（SCO）复杂度下实现了 O(𝜖⁻²) 的非渐近收敛速率，与近端 SGD 的最佳已知边界一致。
该算法可容忍最大延迟 𝜏 = o(√T)，同时保持相同的收敛速率，且对延迟容忍度给出了精确的解析边界。
当代理函数光滑性 ˆL 为 O(√T) 且强凸性参数 𝜇 设为 L 时，算法实现了子梯度范数收敛速率 O(1/√T)。
尽管计算复杂度较低，该算法在无线传感器网络的线性预编码任务中仍表现出优于近端 SGD 的实验性能。
理论分析证实，使用超越一阶梯度的结构化代理函数可实现比标准基于 SFO 的方法更快的收敛速度。
所推导的延迟容忍边界使得该算法能够在计算受限、实时性要求高的无线系统中实现实际部署。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。