QUICK REVIEW

[論文レビュー] Safe Nonlinear Control Using Robust Neural Lyapunov-Barrier Functions

Charles Dawson, Zengyi Qin|arXiv (Cornell University)|Sep 14, 2021

Advanced Control Systems Optimization参考文献 41被引用数 27

ひとこと要約

この論文は、robust control Lyapunov barrier functions (rCLBFs) による安全性と安定性の保証を伴う頑健な非線形コントローラを合成する学習ベースの枠組みを提示し、リアルタイムのQPベース制御をモデル不確実性に対して強い頑健性を持って実現可能にする。様々な安全制御タスクにおけるシミュレーションでrobust MPCと比較して優れた性能を示す。

ABSTRACT

Safety and stability are common requirements for robotic control systems; however, designing safe, stable controllers remains difficult for nonlinear and uncertain models. We develop a model-based learning approach to synthesize robust feedback controllers with safety and stability guarantees. We take inspiration from robust convex optimization and Lyapunov theory to define robust control Lyapunov barrier functions that generalize despite model uncertainty. We demonstrate our approach in simulation on problems including car trajectory tracking, nonlinear control with obstacle avoidance, satellite rendezvous with safety constraints, and flight control with a learned ground effect model. Simulation results show that our approach yields controllers that match or exceed the capabilities of robust MPC while reducing computational costs by an order of magnitude.

研究の動機と目的

モデル不確実性下での非線形系に対して、安全で安定なコントローラを合成する枠組みを開発する。
制御Lyapunov barrier function理論を頑健な（不確実性を考慮した）設定へ拡張する（rCLBF）。
ニューラルネットワークを用いたモデルベース学習によって頑健な証明とコントローラを学習する。
安全性が重要なタスクのために、計算効率の高い証明ベースのrobust MPCの代替手段を提供する。

提案手法

CLBFを、有界パラメトリック不確実性へ拡張した robust CLBF (rCLBF) を定義し、rCLBFベースのQPコントローラ（rCLBF-QP）を導出する。
ダイナミクスを θ を含む凸包 Θ における制御包含形として表現し、θ 全体にわたって頑健な減少条件を課す。
Vとニューラルネットワークによる証明コントローラ π_NN をパラメータ化し、CLBF条件と rCLBF-QP の実現可能性を課す損失でオフライン訓練する。
オンラインの rCLBF-QP によって Lie導関数に基づく制約を用いて安全な制御を計算する。
訓練時には V(x)=w(x)^T w(x) を用いて非負性を保証し、頑健な減衰項のために V の時間微分を推定するのは π_NN に依存する。
学習された V をニューラルネット検証ツールやサンプリングで検証し、ほぼLyapunov理論の考慮とともにまれな違反領域を認識する。

実験結果

リサーチクエスチョン

RQ1有界なモデル不確実性の下で、非凸・非線形系に対して統一証明（rCLBF）が安全性と安定性を保証できるだろうか？
RQ2モデルベースの学習による rCLBF は、計算を削減しつつ、リアルタイムで頑健な安全性保証を robust MPC に匹敵させることができるか？
RQ3障害物回避、非線形接地効果、非凸の安全制約を伴うタスクで、rCLBF-QP コントローラはどのように性能を発揮するか？
RQ4オンラインコントローラの実現可能性と頑健性を保証する上で、神経証明（ neural certificates ）の役割は何か？

主な発見

タスク	アルゴリズム	安全性率	\|\|x_goal\|\|	評価時間（ms）
車の軌道追従 1	rCLBF-QP	0.7523	10.4
運動学モデル	Robust MPC ( dt=0.1 s, N=6 )	1.5148	194.6
(n=5,ℓ=2,ns=2)	Robust MPC ( dt=0.25 s, N=6 )	12.4438	172.8
車の軌道追従 1	rCLBF-QP	1.0340	9.6
サイドスリップモデル	Robust MPC ( dt=0.1 s, N=5 )	0.1560	336.5
(n=7,ℓ=2,ns=2)	Robust MPC ( dt=0.25 s, N=5 )	18.1939	316.9
3D Quadrotor	rCLBF-QP	100%	0.4647	9.7
(n=9,ℓ=4,ns=2)	Robust MPC ( dt=0.10 s, N=5 )	100%	0.0980	316.2
	Robust MPC ( dt=0.25 s, N=5 )	100%	63.6303	291.0
ニューラルランダー	rCLBF-QP	100%	0.1332	13.1
(n=6,ℓ=3,ns=1)	Robust MPC ( dt=0.10 s, N=5 )	100%	0.2086	247.2
	Robust MPC ( dt=0.25 s, N=5 )	100%	0.3267	253.2
セグウェイ	rCLBF-QP	100%	0.0447	4.4
(n=4,ℓ=1,ns=4)	Robust MPC ( dt=0.10 s, N=5 )	21%	1.3977	214.8
	Robust MPC ( dt=0.25 s, N=5 )	11%	1.9725	239.1
2D Quadrotor 2	rCLBF-QP	83%	18.6	4.4
(n=6,ℓ=2,ns=4)	Robust MPC ( dt=0.10 s, N=5 )	53%	276.9
	Robust MPC ( dt=0.25 s, N=5 )	0%	265.2
Satellite Rendezvous	rCLBF-QP	100%	0.1369	8.2
(n=4,ℓ=2)	Robust MPC ( dt=0.10 s, N=5 )	39%	6.3751	187.3
	Robust MPC ( dt=0.25 s, N=5 )	15%	9.0592	197.4

rCLBF-based controllers achieve safety and stability guarantees under parameter uncertainty across diverse benchmarks.
The rCLBF-QP controller runs in real time (≈10 ms) on laptop hardware, offering substantial speedups over robust MPC (often by an order of magnitude).
In several tasks, rCLBF-QP matches or exceeds robust MPC performance, especially when nonlinear dynamics (e.g., ground effect) are prominent.
Robust MPC's performance is sensitive to control frequency and can fail to stabilize nonlinear models or nonconvex constraints; rCLBF-QP maintains safety and reasonable tracking.
Across benchmarks, safety rates reach up to 100% where reported, with significantly lower evaluation times compared to robust MPC.

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。