QUICK REVIEW

[论文解读] Stochastic MPC with Dynamic Feedback Gain Selection and Discounted Probabilistic Constraints

Shuhao Yan, Paul J. Goulart|arXiv (Cornell University)|Jul 14, 2020

Advanced Control Systems Optimization参考文献 38被引用 13

一句话总结

本文提出了一种针对具有无界加法扰动的线性离散时间系统的随机模型预测控制（MPC）框架，采用动态反馈增益选择与折扣概率约束。通过利用切比雪夫不等式重构机会约束，并引入在线约束收紧技术，该方法在无需有界扰动假设的前提下，确保了递归可行性与二次稳定性，显著降低了保守性，并相较于固定增益MPC提升了闭环性能。

ABSTRACT

This paper considers linear discrete-time systems with additive disturbances, and designs a Model Predictive Control (MPC) law incorporating a dynamic feedback gain to minimise a quadratic cost function subject to a single chance constraint. The feedback gain is selected online and we provide two selection methods based on minimising upper bounds on predicted costs. The chance constraint is defined as a discounted sum of violation probabilities on an infinite horizon. By penalising violation probabilities close to the initial time and assigning violation probabilities in the far future with vanishingly small weights, this form of constraints allows for an MPC law with guarantees of recursive feasibility without a boundedness assumption on the disturbance. A computationally convenient MPC optimisation problem is formulated using Chebyshev's inequality and we introduce an online constraint-tightening technique to ensure recursive feasibility. The closed loop system is guaranteed to satisfy the chance constraint and a quadratic stability condition. With dynamic feedback gain selection, the closed loop cost is reduced and conservativeness of Chebyshev's inequality is mitigated. Also, a larger feasible set of initial conditions can be obtained. Numerical simulations are given to show these results.

研究动机与目标

解决传统随机MPC方法依赖最坏情况扰动边界而产生的保守性问题。
在无界加法扰动系统中实现MPC的递归可行性，这通常在不进行鲁棒约束收紧的情况下无法实现。
通过引入在线动态反馈增益选择，自适应地最小化预测代价上界，从而提升闭环性能。
在无需终端约束或有界扰动假设的前提下，确保二次稳定性和约束满足。
通过自适应反馈增益调优，降低机会约束近似中切比雪夫不等式的保守性。

提出的方法

MPC框架采用无限时域内违反概率的折扣和，以优先考虑近期约束满足，并确保机会约束的适定性。
仅利用扰动的一阶和二阶矩，通过切比雪夫不等式处理机会约束，将其转化为凸二次约束。
引入在线约束收紧技术，以在无需有界扰动的前提下，保证移动时域优化的递归可行性。
通过离线求解多目标优化问题生成一组镇定增益集合，再通过在线选择最小化预测代价上界的方式实现动态反馈增益选择。
提出了两种在线增益选择策略：一种是最小化最优预测代价的上界，另一种是确保几乎必然收敛至无约束LQ最优增益。
通过李雅普诺夫分析与Borel-Cantelli引理，证明了闭环系统满足机会约束和二次稳定性条件，并推导出性能界。

实验结果

研究问题

RQ1在无界加法扰动下，采用折扣概率约束的随机MPC公式能否在无需扰动边界或系统稳定性假设的前提下，确保递归可行性？
RQ2动态反馈增益选择如何降低机会约束MPC中切比雪夫不等式的保守性？
RQ3在线增益选择对初始条件可行集大小及长期期望平均代价有何影响？
RQ4在所提出的动态选择规则下，反馈增益能否以几乎必然方式收敛至无约束LQ最优解？
RQ5所提出的约束收紧技术如何在保持闭环运行中约束满足的同时，确保递归可行性？

主要发现

所提出的MPC算法在无需有界扰动或终端约束的前提下，保证了在线优化问题的递归可行性。
闭环系统满足机会约束并实现二次稳定性，即使在无终端代价或约束项的情况下亦成立。
动态反馈增益选择降低了切比雪夫不等式的保守性，从而在长期期望平均代价方面提升了闭环性能。
通过选择合适的初始反馈增益，可扩大允许的初始条件集合，增强吸引域。
在所提出的在线选择规则下，反馈增益几乎必然收敛至无约束LQ最优解，该结论通过Borel-Cantelli引理得到证明。
数值仿真结果表明，与固定增益MPC相比，所提方法显著降低了闭环代价与约束违反率，尤其在无界扰动下表现更优。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。