QUICK REVIEW

[論文レビュー] Riemannian Motion Policies

Nathan Ratliff, Jan Issac|arXiv (Cornell University)|Jan 9, 2018

Robot Manipulation and Learning参考文献 12被引用数 48

ひとこと要約

リーマン幾何学的モーションポリシー（RMPs）を導入する、モジュラーなフレームワークで、モーションポリシーを局所幾何と組み合わせて、空間間で最適かつ幾何的一貫性を持って変換する。

ABSTRACT

We introduce the Riemannian Motion Policy (RMP), a new mathematical object for modular motion generation. An RMP is a second-order dynamical system (acceleration field or motion policy) coupled with a corresponding Riemannian metric. The motion policy maps positions and velocities to accelerations, while the metric captures the directions in the space important to the policy. We show that RMPs provide a straightforward and convenient method for combining multiple motion policies and transforming such policies from one space (such as the task space) to another (such as the configuration space) in geometrically consistent ways. The operators we derive for these combinations and transformations are provably optimal, have linearity properties making them agnostic to the order of application, and are strongly analogous to the covariant transformations of natural gradients popular in the machine learning literature. The RMP framework enables the fusion of motion policies from different motion generation paradigms, such as dynamical systems, dynamic movement primitives (DMPs), optimal control, operational space control, nonlinear reactive controllers, motion optimization, and model predictive control (MPC), thus unifying these disparate techniques from the literature. RMPs are easy to implement and manipulate, facilitate controller design, simplify handling of joint limits, and clarify a number of open questions regarding the proper fusion of motion generation methods (such as incorporating local reactive policies into long-horizon optimizers). We demonstrate the effectiveness of RMPs on both simulation and real robots, including their ability to naturally and efficiently solve complicated collision avoidance problems previously handled by more complex planners.

研究の動機と目的

多様な局所ポリシーを統合できるモジュラーな運動生成のアプローチを動機づける。
局所幾何を捉えるリーマン計量と運動ポリシーを結ぶ数学的オブジェクトを定義する。
RMPを空間間で変換・結合する演算子を、証明可能な最適な方法で提供する。
異なるパラダイムのポリシーの融合をRMPが可能にし、スケーラブルな計算をサポートすることを示す。

提案手法

RMPを、空間Xにおける二次のダイナミカルシステム f と、滑らかに変化する対称半正定値メトリック A の対として定義する。
共分散特性を持つように、タスク空間と配置空間の間でRMPを変換するプルバックとプッシュフォワード操作を導出する。
RMPの度量で重み付けされた和としての加算を導入し、その結合法則と交換法則を証明する。
RMPを組み合わせて、タスク空間の加速度の差をメトリクスで重み付けして最小化する2次コストを最小化できることを示す。
基本的な局所リアクティブポリシー（ターゲット、姿勢/方位、衝突回避）を関連メトリクスを持つRMPとして提示し、それらの結合を実演する。
計算集約的な挙動との統合を、Riemannian Motion Optimization（RieMO）フレームワークとMPCを介して議論する。

実験結果

リサーチクエスチョン

RQ1複数の局所モーションポリシーを、幾何学的一貫性と最適性を持って結合するにはどうすればよいか？
RQ2タスク空間と配置空間を横断してポリシーを変換・融合する際、リーマン計量はどのような役割を果たすか？
RQ3RMPは多様な運動生成パラダイム（DS、DMP、MPC など）を統合し、スケーラブルな計算をサポートできるか？
RQ4プルバックとプッシュフォワード操作は、ポリシー変換の共分散性と結合性をどのように保証するか？
RQ5実ロボットにおいて、衝突回避と長距離ナビゲーションをRMPとして構造化することで生じる実用的な利点は何か？

主な発見

RMPは単なる加算ではなく、度量で重み付けされた平均により、複数の局所ポリシーの幾何学的一貫性のある融合を可能にする。
プルバック/プッシュフォワード演算は線形で結合的であり、自然勾配に類似した共変変換を提供する。
ターゲット、姿勢/方位、衝突回避の局所リアクティブポリシーはRMPとして表現でき、複雑な問題を解くために効果的に結合できる。
このフレームワークは、MPC様の最適化子の出力をRMPとして表現し、忠実度を失うことなくリアクティブコアへストリーミングすることで、統合をサポートする。
RMPは、プロセスやデバイス間で、速い反応制御とより計算集約的な挙動生成の分離を促進する。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。