QUICK REVIEW

[论文解读] Interaction Networks for Learning about Objects, Relations and Physics

Peter Battaglia, Razvan Pascanu|arXiv (Cornell University)|Dec 1, 2016

Topic Modeling参考文献 26被引用 576

一句话总结

本论文介绍了交互网络（interaction networks），一种图结构神经模型，能够分别对对象及其关系进行推理，以预测动力学并跨越多种物理领域推断抽象属性如能量。

ABSTRACT

Reasoning about objects, relations, and physics is central to human intelligence, and a key goal of artificial intelligence. Here we introduce the interaction network, a model which can reason about how objects in complex systems interact, supporting dynamical predictions, as well as inferences about the abstract properties of the system. Our model takes graphs as input, performs object- and relation-centric reasoning in a way that is analogous to a simulation, and is implemented using deep neural networks. We evaluate its ability to reason about several challenging physical domains: n-body problems, rigid-body collision, and non-rigid dynamics. Our results show it can be trained to accurately simulate the physical trajectories of dozens of objects over thousands of time steps, estimate abstract quantities such as energy, and generalize automatically to systems with different numbers and configurations of objects and relations. Our interaction network implementation is the first general-purpose, learnable physics engine, and a powerful general framework for reasoning about object and relations in a wide variety of complex real-world domains.

研究动机与目标

在复杂物理系统中激发并形式化对对象及其关系的推理。
开发一个通用、可学习的引擎，能够将面向对象的推理与面向关系的推理分离。
展示准确的多步轨迹预测和抽象属性推断。
展示对具有不同数量和配置的对象与关系的系统的泛化能力。

提出的方法

将系统表示为带属性的有向多重图，其中对象为节点，关系为边。
使用关系模型 f_R 为每个关系计算相互作用效应 e_k。
按接收对象聚合相互作用效应，并将它们与对象状态和外部效应一起输入对象模型 f_O，以预测下一个状态。
通过基于梯度的优化，在所有关系和对象上训练共享的 f_R 和 f_O。
可选地添加一个抽象模块 phi_A 以预测全局属性（如势能）。
使用带有编排函数 m 和聚合 a 的多层感知机（MLP）实现，以确保对变量大小的图具有交换性/结合性。

实验结果

研究问题

RQ1一个面向对象和关系的神经模型能否学习在复杂物理系统中预测未来状态？
RQ2模型在多大程度上能够泛化到具有不同数量和配置的对象与关系的系统？
RQ3模型能否从学习到的动力学中推断出抽象的系统属性（如势能）？
RQ4将关系中心和对象中心的推理分开是否能提高与基线相比的准确性和泛化能力？
RQ5该方法是否可扩展到多个物理领域（n-body、刚体碰撞、非刚性动力学）？

主要发现

该模型在所有域中以远低于基线的均方误差（MSE）预测下一步速度。
交互网络能够泛化到训练时看到的对象数量少于或多于的系统，通常优于在较小系统上训练的模型。
IN 在数千步骤中的展开与真实物理现象在视觉上相似，具备一致的长程动力学。
使用 IN 的抽象属性估计器（如势能）明显优于 MLP 基线（如 n-body 和字符串域）。
对相互作用效应的正则化提高了泛化并减少了长时展开的漂移。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。