QUICK REVIEW

[论文解读] A Compositional Object-Based Approach to Learning Physical Dynamics

Michael B. Chang, Tomer Ullman|arXiv (Cornell University)|Dec 1, 2016

Computational Physics and Python Applications被引用 169

一句话总结

本文提出 Neural Physics Engine (NPE)，一个可微分的、基于对象的物理模拟器，使用成对相互作用，能够推广到变量对象数量和场景配置，并且可以推断潜在的对象属性如质量。

ABSTRACT

We present the Neural Physics Engine (NPE), a framework for learning simulators of intuitive physics that naturally generalize across variable object count and different scene configurations. We propose a factorization of a physical scene into composable object-based representations and a neural network architecture whose compositional structure factorizes object dynamics into pairwise interactions. Like a symbolic physics engine, the NPE is endowed with generic notions of objects and their interactions; realized as a neural network, it can be trained via stochastic gradient descent to adapt to specific object properties and dynamics of different worlds. We evaluate the efficacy of our approach on simple rigid body dynamics in two-dimensional worlds. By comparing to less structured architectures, we show that the NPE's compositional representation of the structure in physical interactions improves its ability to predict movement, generalize across variable object count and different scene configurations, and infer latent properties of objects such as mass.

研究动机与目标

激发并设计一个在不同对象数量和配置的场景中也能泛化的物理学先验。
提出一个可微分的、基于对象的物理引擎，将符号结构与基于梯度的学习相结合。
证明一种组合式、成对交互架构能够提升泛化和潜在属性推断。
在二维世界中的球体与障碍物场景上评估该方法，以测试预测、泛化和质量推断。

提出的方法

将物理场景分解为具有外在/内在/全局属性的基于对象的表示。
使用邻域掩码为每个关注对象选择上下文对象，实施类似粗略阶段的上下文选择。
将关注对象的速度建模为由成对编码器编码并解码为速度变化的成对相互作用之和。
预测速度变化（Δv）并通过一阶速度积分更新位置，在最近的时间步中保留加速度为隐式变量。
使用基于 matter-js 的仿真观测轨迹，以随机梯度下降对网络进行端到端训练。
将 NPE 与基线（No-Pairwise NP 和 LSTM）进行比较，以分离成对因子分解和组合性的好处。

实验结果

研究问题

RQ1NPE 是否能够在二维球体-障碍物世界中准确预测对象动力学？
RQ2NPE 是否能泛化到训练时未见到的更多对象的场景？
RQ3NPE 能否从动力学中推断出潜在的对象属性，如质量？
RQ4上下文邻域掩码对预测准确性与泛化的影响如何？
RQ5在保持性能的同时，NPE 如何处理不同场景配置（墙体几何、障碍物）？

主要发现

NPE 在速度预测和泛化任务中始终优于 NP 和 LSTM 基线，特别是在测试时对象数为 6–8 时。
NPE 的预测准确性和泛化随训练而提升，而 NP 和 LSTM 提早达到平台期。
NPE 在从观测到的碰撞中推断关注对象的质量方面约达到 90% 的准确率。
邻域掩码显著提升性能（速度预测损失约多一个数量级），并使每个关注对象的上下文处理达到常数时间（O(1)）。
NPE 能对训练中未见过的复杂场景配置（墙体与内部障碍物）进行鲁棒泛化，当场景复杂度增加时，性能超过基线。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。