[论文解读] Successive Convexification: A Superlinearly Convergent Algorithm for Non-convex Optimal Control Problems
本文提出了 SCvx 算法,一种连续凸化方法,通过迭代线性化动力学和约束并求解凸子问题,在全局收敛性和超线性收敛性保证下求解非凸、有约束的最优控制问题。
This paper presents the SCvx algorithm, a successive convexification algorithm designed to solve non-convex constrained optimal control problems with global convergence and superlinear convergence-rate guarantees. The proposed algorithm can handle nonlinear dynamics and non-convex state and control constraints. It solves the original problem to optimality by successively linearizing non-convex dynamics and constraints about the solution of the previous iteration. The resulting convex subproblems are numerically tractable, and can be computed quickly and reliably using convex optimization solvers, making the SCvx algorithm well suited for real-time applications. Analysis is presented to show that the algorithm converges both globally and superlinearly, guaranteeing i) local optimality recovery: if the converged solution is feasible with respect to the original problem, then it is also a local optimum; ii) strong convergence: if the Kurdyka-Lojasiewicz (KL) inequality holds at the converged solution, then the solution is unique. The superlinear rate of convergence is obtained by exploiting the structure of optimal control problems, showcasing that faster rate of convergence can be achieved by leveraging specific problem properties when compared to generic nonlinear programming methods. Numerical simulations are performed for a non-convex quad-rotor motion planning problem, and corresponding results obtained using Sequential Quadratic Programming (SQP) and general purpose Interior Point Method (IPM) solvers are provided for comparison. The results show that the convergence rate of the SCvx algorithm is indeed superlinear, and that SCvx outperforms the other two methods by converging in less number of iterations.
研究动机与目标
- 将 SCvx 扩展以处理超越非线性动力学的非凸状态和控制约束。
- 在温和假设下证明 SCvx 的全局(弱与强)收敛性。
- 通过利用最优控制问题的结构来建立超线性收敛。
- 在一个非凸四旋翼运动规划问题上展示实际性能,并与 SQP/IPM 方法进行比较。
提出的方法
- 将离散时间非凸最优控制问题形式化为具有凸的状态/控制局部集以及非凸动力学与约束的问题。
- 围绕上一次迭代的解对非凸部分进行迭代线性化,以得到一个凸子问题。
- 用无约束的虚拟控制来增强线性化后的动力学,以避免人为不可行性。
- 在状态/控制约束中引入无约束虚拟缓冲区,并使用精确罚项来处理非凸约束违反。
- 在每次迭代中把凸子问题求解到最优,并更新信任域以控制线性化的精度。
- 给出收敛性分析,证明弱收敛、强收敛,以及超线性收敛的条件。
实验结果
研究问题
- RQ1SCvx 框架在保持收敛性保证的前提下,能否扩展到非凸状态和控制约束?
- RQ2SCvx 的迭代是否全局收敛到一个极限点?在什么条件下极限点是唯一的?
- RQ3在温和假设下,SCvx 能否相比一般非线性规划方法达到超线性收敛?
- RQ4在非凸四旋翼运动规划问题上,SCvx 相对于 SQP 和 IPM 方法的表现如何?
主要发现
- 在所提出的框架下,SCvx 实现全局收敛,包含弱收敛和强收敛。
- 通过利用最优控制问题的结构,算法实现超线性收敛速率。
- 虚拟控制和缓冲区缓解了线性化带来的人工不可行性和无上界性。
- 每次迭代求解的凸子问题可处理,适合实时实现。
- 对非凸四旋翼问题的数值仿真表明 SCvx 收敛更快(迭代次数更少),优于 SQP 和 IPM 基线。
更好的研究,从现在开始
从论文设计到论文写作,大幅缩短您的研究时间。
无需绑定信用卡
本解读由 AI 生成,并经人工编辑审核。