QUICK REVIEW

[论文解读] Deep Lagrangian Networks: Using Physics as Model Prior for Deep Learning

Michael Lutter, Christian Ritter|arXiv (Cornell University)|Jul 10, 2019

Model Reduction and Neural Networks参考文献 38被引用 80

一句话总结

DeLaN 将欧拉-拉格朗日力学编码成神经网络，以学习具有物理可行性的机器人动力学，实现在控制中的鲁棒外推和在线学习。

ABSTRACT

Deep learning has achieved astonishing results on many tasks with large amounts of data and generalization within the proximity of training data. For many important real-world applications, these requirements are unfeasible and additional prior knowledge on the task domain is required to overcome the resulting problems. In particular, learning physics models for model-based control requires robust extrapolation from fewer samples - often collected online in real-time - and model errors may lead to drastic damages of the system. Directly incorporating physical insight has enabled us to obtain a novel deep model learning approach that extrapolates well while requiring fewer samples. As a first example, we propose Deep Lagrangian Networks (DeLaN) as a deep network structure upon which Lagrangian Mechanics have been imposed. DeLaN can learn the equations of motion of a mechanical system (i.e., system dynamics) with a deep network efficiently while ensuring physical plausibility. The resulting DeLaN network performs very well at robot tracking control. The proposed method did not only outperform previous model learning approaches at learning speed but exhibits substantially improved and more robust extrapolation to novel trajectories and learns online in real-time

研究动机与目标

激励为具象系统学习物理模型，以实现从少量样本的鲁棒外推和安全的在线更新。
提出一种网络拓扑，通过欧拉–拉格朗日方程强制物理性，同时保持对系统结构的通用性。
实现端到端训练，使用可微分的惯性表示和非保守力表示。
展示在仿真和物理机器人上使用在线数据的实时学习与控制性能。

提出的方法

将惯性矩阵 H(q) 表示为 L(q)L(q)^T，其中 L 的对角元素为正，以确保对称性和正定性。
用一个单独的神经网络头来建模非保守力 g(q)。
使用学习得到的 L 和 g 构造 f^{-1}(q, qdot, qddot; θ, ψ) 以满足欧拉–拉格朗日方程。
端到端训练，通过最小化预测扭矩和真实扭矩之间的损失来实现，并添加一个正则化项以解决 L 的非唯一性问题。
解析地计算导数，以实现通过 L 和 g 成分的实时反向传播。
使用一个专用的拉格朗日层，单次前向传播即可获得 H、dH/dt 和控制所需的偏导数。

实验结果

研究问题

RQ1通过 H(q) 和 g(q) 的基于神经网络的拉格朗日动力学表示，是否能为机器人系统提供物理可行且准确的模型？
RQ2强制拉格朗日结构是否比标准前馈模型在对未见轨迹和速度的外推方面有改善？
RQ3在实时控制环路中使用 DeLaN 进行动力学在线学习是否可行，同时不牺牲稳定性或可靠性？
RQ4在仿真和真实机器人中，DeLaN 相对于解析物理模型和纯数据驱动网络的表现如何？

主要发现

DeLaN 从数据中学习动力学分量：惯性、科里奥利/离心力和重力，在实验中与真实分量相匹配。
对新轨迹和更高速度的外推，DeLaN 比标准前馈网络显著更好，尤其在训练数据有限时。
在在线控制中，DeLaN 实现了鲁棒跟踪，并且可以从随机初始化开始适应新轨迹。
在真实 Barrett WAM 上，DeLaN 的跟踪性能与解析模型相当，并且能够在复杂的缆绳驱动效应下学习动力学。
在仿真和真实机器人实验中，DeLaN 展现出相较基线模型更好的样本效率和外推能力。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。