QUICK REVIEW

[论文解读] Semi-parametric Topological Memory for Navigation

Nikolay Savinov, Alexey Dosovitskiy|arXiv (Cornell University)|Mar 1, 2018

Advanced Image and Video Retrieval Techniques参考文献 34被引用 55

一句话总结

引入半参数拓扑记忆（SPTM），一种带检索网络的基于图的记忆，用于在未见环境中导航，在迷宫任务中对基线的成功率提升约3倍。

ABSTRACT

We introduce a new memory architecture for navigation in previously unseen environments, inspired by landmark-based navigation in animals. The proposed semi-parametric topological memory (SPTM) consists of a (non-parametric) graph with nodes corresponding to locations in the environment and a (parametric) deep network capable of retrieving nodes from the graph based on observations. The graph stores no metric information, only connectivity of locations corresponding to the nodes. We use SPTM as a planning module in a navigation system. Given only 5 minutes of footage of a previously unseen maze, an SPTM-based navigation agent can build a topological map of the environment and use it to confidently navigate towards goals. The average success rate of the SPTM agent in goal-directed navigation across test environments is higher than the best-performing baseline by a factor of three. A video of the agent is available at https://youtu.be/vRF7f4lhswo

研究动机与目标

利用受动物标志物导航启发的记忆结构，推动在未见环境中的导航。
提出一个仅存储连通性（拓扑）且使用神经检索器在该记忆中定位的记忆系统。
在自监督方式下训练检索与运动组件，无需人工标签或奖励。
证明SPTM在对新迷宫进行约5分钟探索后能够实现高效的目标导向导航。

提出的方法

提出一种记忆结构，包含一个非参数图G，其节点是位置，边表示邻接关系或基于视觉相似性的捷径。
引入一个参数化检索网络R，用于计算观测之间的相似性以在G中定位。
使用自监督的时间近邻性来训练R：若观测在时间上接近则标记为接近，若间隔一定边距则标记为远离。
构建一个运动网络L，将当前观测和目标观测映射到动作分布，使用随机探索的自监督方式进行训练。
导航时，在G中定位当前观测和目标观测，使用Dijkstra计算最短路径，并利用R在该路径上选择一个可达的中间目标点。

实验结果

研究问题

RQ1半参数拓扑记忆在有限探索后，是否能够在未见环境中实现可靠的目标导向导航？
RQ2具有学习检索的拓扑记忆在对新迷宫的泛化上，是否优于度量地图或反应式基线？
RQ3定位精度和捷径形成的质量如何影响导航性能？
RQ4R与L的自监督训练是否足以实现有效的近距离导航与规划？

主要发现

SPTM在未见迷宫上的平均导航成功率比最佳基线高出约三倍。
仅用5分钟的走访录像，代理构建拓扑地图并有效地朝目标导航。
消融研究显示移除视觉捷径会显著降低性能（Val-1:85%，Val-2:55%，Val-3:50% vs 全部:100%，100%，100%）。
使用逐帧定位或完整记忆纹理提高鲁棒性，完整SPTM在验证集Val-1和Val-3达到100%。
最短路径捷径显著缩短平均路径长度（例如Val-3的路径长度从990降至155步）。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。