QUICK REVIEW

[論文レビュー] Learning Compositional Koopman Operators for Model-Based Control

Yunzhu Li, Hao He|arXiv (Cornell University)|Oct 18, 2019

Model Reduction and Neural Networks参考文献 41被引用数 32

ひとこと要約

本論文は、グラフニューラルネットワークで学習される構成的な Koopman 演算子を用いて、変数オブジェクト数を持つ多-object 動力学をモデル化・制御することを提案し、識別と MPC に基づく制御を効率化します。

ABSTRACT

Finding an embedding space for a linear approximation of a nonlinear dynamical system enables efficient system identification and control synthesis. The Koopman operator theory lays the foundation for identifying the nonlinear-to-linear coordinate transformations with data-driven methods. Recently, researchers have proposed to use deep neural networks as a more expressive class of basis functions for calculating the Koopman operators. These approaches, however, assume a fixed dimensional state space; they are therefore not applicable to scenarios with a variable number of objects. In this paper, we propose to learn compositional Koopman operators, using graph neural networks to encode the state into object-centric embeddings and using a block-wise linear transition matrix to regularize the shared structure across objects. The learned dynamics can quickly adapt to new environments of unknown physical parameters and produce control signals to achieve a specified goal. Our experiments on manipulating ropes and controlling soft robots show that the proposed method has better efficiency and generalization ability than existing baselines.

研究の動機と目的

効率的な非線形・多-object ダイナミカルシステムの識別と制御を動機づける。
オブジェクト数にスケールする構成的 Koopman フレームワークを提案する。
オブジェクト中心の Koopman 埋め込みを得るためにグラフニューラルネットワークを活用する。
同様の相互作用間でダイナミクスを共有するブロック構造の Koopman 演算子を導入する。
rope およびソフトロボットの実験で一般化と制御性能の向上を示す。

提案手法

システム状態をオブジェクト中心の Koopman 埋め込み g^t = φ(x^t) に写像するためにグラフニューラルネットワークを用いる。
ブロック構造の Koopman 行列 K とブロック構造の制御行列 L を仮定して、オブジェクトの関係を反映する。
構造化された sigma テンソルを用いたデータから最小二乗回帰で K と L を推定する：K = σ ⊗ K̂ および L = σ ⊗ L̂。
オートエンコーダ、予測、メトリック損失（L_ae, L_pred, L_metric）を用いて φ とグラフデコーダ ψ を訓練する。
線形化された Koopman ダイナミクス g^{t+1} = K g^t + L u^t を用いて、二次計画法と MPC による開ループ制御を実現する。

実験結果

リサーチクエスチョン

RQ1可変数のオブジェクトを有するシステムへ、構成的でオブジェクト中心の Koopman 埋め込みが一般化できるか。
RQ2ブロック構造の Koopman 演算子は、非構造的または手作り基底と比較してデータ効率と一般化を改善するか。
RQ3グラフベースの Koopman 埋め込みは、二次計画法と MPC を介した効率的な制御をサポートするか。

主な発見

Koopman matrix structure	Simulation MSE	Control MSE
対角	0.133 (0.174)	2.337 (2.809)
なし	0.117 (0.083)	1.522 (1.288)
ブロック	0.105 (0.075)	0.854 (1.101)

Koopman 行列のブロック構造は、 rope、ソフト、泳ぐ環境全体でシミュレーションと制御の精度を向上させる。
Interaction Networks・Propagation Networks・手作り Koopman 基底と比較して、提案手法はシミュレーション誤差と制御誤差を低く抑える。
システム識別のデータ量を増やすとシミュレーション精度が向上し、オブジェクトあたりの埋め込み次元が約 16 で強い性能を達成する。
未知のオブジェクト数（10–14 質量へ外挿）にも一般化し、顕著な精度を示す。
識別された線形ダイナミクスを用いたオープンループ制御で MPC を適用し、テストタスクでターゲット状態を効率的に達成する。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。