QUICK REVIEW

[论文解读] Learning Particle Dynamics for Manipulating Rigid Bodies, Deformable Objects, and Fluids

Yunzhu Li, Jiajun Wu|arXiv (Cornell University)|Oct 3, 2018

Robot Manipulation and Learning参考文献 35被引用 105

一句话总结

本文学习一种可微分的颗粒基动力学模型（DPI-Nets），具备动态图和分层传播，用于模拟和控制刚性、可变形和流体材料，从而实现现实世界的操控和在线自适应。

ABSTRACT

Real-life control tasks involve matters of various substances---rigid or soft bodies, liquid, gas---each with distinct physical behaviors. This poses challenges to traditional rigid-body physics engines. Particle-based simulators have been developed to model the dynamics of these complex scenes; however, relying on approximation techniques, their simulation often deviates from real-world physics, especially in the long term. In this paper, we propose to learn a particle-based simulator for complex control tasks. Combining learning with particle-based systems brings in two major benefits: first, the learned simulator, just like other particle-based systems, acts widely on objects of different materials; second, the particle-based representation poses strong inductive bias for learning: particles of the same type have the same dynamics within. This enables the model to quickly adapt to new environments of unknown dynamics within a few observations. We demonstrate robots achieving complex manipulation tasks using the learned simulator, such as manipulating fluids and deformable foam, with experiments both in simulation and in the real world. Our study helps lay the foundation for robot learning of dynamic scenes with particle-based representations.

研究动机与目标

解决对多样物体动力学（刚性、可变形、流体）进行建模的挑战，以实现鲁棒的机器人操控。
提出一个可微分的颗粒基仿真器，使其在不同材料间具备泛化能力。
引入动态交互图和分层传播，以捕捉短程与长程相互作用。
利用所学动力学实现在线自适应和基于模型的控制（MPC/射击法）。

提出的方法

将交互网络扩展到颗粒级动力学，并进行动态图构造。
引入颗粒的分层聚类，以建模某些对象的长程依赖。
使用多阶段传播和动态图来捕捉瞬时和长程相互作用。
结合状态特异的运动预测器和在适用处的静止位置，对可变形和流体材料进行建模。
通过轨迹优化（射击方法）和带在线系统辨识的MPC，应用基于模型的控制。

实验结果

研究问题

RQ1一个可微分的颗粒基动力学模型是否能在刚性、可变形和流体物体之间实现泛化？
RQ2动态、实时构建的交互图是否优于静态图或完全图在前向预测上的表现？
RQ3所学的DPI-Nets是否能在仿真和现实任务中实现有效的控制和在线自适应？
RQ4分层传播对具有长程依赖的对象的学习与控制有何影响？
RQ5在未知物理参数下，在线系统辨识对任务性能的影响是什么？

主要发现

方法	FluidFall	BoxBath	FluidShake	RiceGrip
IN	2.74 ± 0.56	N/A	N/A	N/A
HRN	0.21 ± 0.04	3.62 ± 0.40	3.58 ± 0.77	0.17 ± 0.11
DPI-Net w/o hierarchy	0.15 ± 0.03	2.64 ± 0.69	1.89 ± 0.36	0.29 ± 0.13
DPI-Net	0.15 ± 0.03	2.03 ± 0.41	1.89 ± 0.36	0.13 ± 0.07

DPI-Nets在前向仿真MSE方面的表现低于基线（FluidFall: 0.15±0.03 与 HRN 0.21±0.04；BoxBath: 2.03±0.41 与 HRN 3.62±0.40）。
在 FluidShake 和 RiceGrip 中，DPI-Net 的表现优于 HRN 和无分层的 DPI-Net，数值为：FluidShake 1.89±0.36（有/无分层均为此值），RiceGrip 0.13±0.07（DPI-Net）对比 0.29±0.13（无分层）。
动态图优于静态图（IN），并能对流体与多状态交互进行建模（BoxBath、FluidShake）。
通过基于 SGD 的在线系统辨识实现在线自适应，在现实世界中提升操控（Real World 的 RiceGrip）。
该方法通过 MPC 和基于梯度的轨迹优化，使控制任务如塑形可变形物体和将流体引导至目标配置成为可能。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。