QUICK REVIEW

[논문 리뷰] Riemannian Motion Policies

Nathan Ratliff, Jan Issac|arXiv (Cornell University)|2018. 01. 09.

Robot Manipulation and Learning참고 문헌 12인용 수 48

한 줄 요약

Riemannian Motion Policies (RMPs)를 소개하는 모듈식 프레임워크로, 모션 정책을 로컬 기하학과 짝지어 서로 다른 공간에서 최적이고 기하학적으로 일관되게 결합·변환합니다.

ABSTRACT

We introduce the Riemannian Motion Policy (RMP), a new mathematical object for modular motion generation. An RMP is a second-order dynamical system (acceleration field or motion policy) coupled with a corresponding Riemannian metric. The motion policy maps positions and velocities to accelerations, while the metric captures the directions in the space important to the policy. We show that RMPs provide a straightforward and convenient method for combining multiple motion policies and transforming such policies from one space (such as the task space) to another (such as the configuration space) in geometrically consistent ways. The operators we derive for these combinations and transformations are provably optimal, have linearity properties making them agnostic to the order of application, and are strongly analogous to the covariant transformations of natural gradients popular in the machine learning literature. The RMP framework enables the fusion of motion policies from different motion generation paradigms, such as dynamical systems, dynamic movement primitives (DMPs), optimal control, operational space control, nonlinear reactive controllers, motion optimization, and model predictive control (MPC), thus unifying these disparate techniques from the literature. RMPs are easy to implement and manipulate, facilitate controller design, simplify handling of joint limits, and clarify a number of open questions regarding the proper fusion of motion generation methods (such as incorporating local reactive policies into long-horizon optimizers). We demonstrate the effectiveness of RMPs on both simulation and real robots, including their ability to naturally and efficiently solve complicated collision avoidance problems previously handled by more complex planners.

연구 동기 및 목표

다양한 로컬 정책을 통합할 수 있는 모듈식 모션 생성 방법의 필요성과 동기를 제시한다.
로컬 기하를 포착하기 위해 모션 정책과 Riemannian metric을 결합하는 수학적 객체를 정의한다.
공간 간에 RMP를 변환하고 결합하기 위한 최적성 보장이 있는 연산자를 제시한다.
RMP가 서로 다른 패러다임의 정책 융합을 가능하게 하고 확장 가능한 계산을 지원하는 것을 시연한다.

제안 방법

RMP를 (f, A)의 쌍으로 정의하는데, 여기서 f는 공간 X에서의 2차 미분 시스템이고 A는 매끄럽게 변화하는 대칭 양의 준정행렬이다.
공간 간의 공변성 속성으로 RMP를 작업 공간과 구성 공간 사이에서 변환하기 위한 pullback 및 pushforward 연산을 도출한다.
RMP의 메트릭 가중 합으로서의 덧셈을 도입하고 그 결합법칙의 결합법칙(Associativity)과 교환법칙(Commutativity)을 입증한다.
RMP를 결합하여 작업 공간 가속도의 차이를 각자의 메트릭으로 가중한 2차 비용을 최소화하도록 할 수 있음을 보인다.
기본 로컬 반응 정책들(타깃, 방향, 충돌 회피)을 연관 메트릭을 가진 RMP로 제시하고 이들의 결합을 시연한다.
계산적으로 집약적인 행동과의 통합을 Riemannian Motion Optimization (RieMO) 프레임워크와 MPC를 통해 논의한다.

실험 결과

연구 질문

RQ1여러 개의 로컬 모션 정책을 기하학적으로 일관되고 최적의 방식으로 어떻게 결합할 수 있는가?
RQ2작업 공간과 구성 공간 간 정책을 변환하고 융합하는 데 있어 Riemannian metric은 어떤 역할을 하는가?
RQ3RMP가 다양한 모션 생성 패러다임(DS, DMPs, MPC 등)을 하나로 묶고 확장 가능한 계산을 지원할 수 있는가?
RQ4pullback 및 pushforward 연산이 정책 변환에서 공변성과 결합 법칙을 어떻게 보장하는가?
RQ5충돌 회피와 원거리 내비게이션을 실제 로봇에서 RMP로 구조화함으로써 어떤 실용적 이점이 생기는가?

주요 결과

RMP는 단순 합계가 아닌 메트릭 가중 평균으로 다중 로컬 정책의 기하학적으로 일관된 융합을 가능하게 한다.
pullback/pushforward 연산은 선형이며 결합법칙을 제공하여 자연 기울기와 유사한 공변적 변환을 제공한다.
타깃, 방향, 충돌 회피에 대한 로컬 반응 정책은 RMP로 표현될 수 있고 효과적으로 결합되어 복잡한 문제를 해결한다.
이 프레임워크는 MPC 유사 최적화기의 출력을 RMP로 표현하고 반응 핵심으로 손실 없이 스트리밍함으로써 통합을 지원한다.
RMP는 빠른 반응 제어를 보다 계산 집약적인 행동 생성으로부터 프로세스나 디바이스 간에 분리하는 데 도움을 준다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.