QUICK REVIEW

[论文解读] Combining Optimal Control and Learning for Visual Navigation in Novel Environments

Somil Bansal, Varun Tolani|arXiv (Cornell University)|Mar 6, 2019

Robotic Path Planning Algorithms被引用 96

一句话总结

LB-WayPtNav 将一个学习驱动的感知模块与基于模型的规划和基于LQR的跟踪相结合，在未知室内环境中导航，在仿真和真实机器人测试中优于端到端学习和纯几何映射。

ABSTRACT

Model-based control is a popular paradigm for robot navigation because it can leverage a known dynamics model to efficiently plan robust robot trajectories. However, it is challenging to use model-based methods in settings where the environment is a priori unknown and can only be observed partially through on-board sensors on the robot. In this work, we address this short-coming by coupling model-based control with learning-based perception. The learning-based perception module produces a series of waypoints that guide the robot to the goal via a collision-free path. These waypoints are used by a model-based planner to generate a smooth and dynamically feasible trajectory that is executed on the physical system using feedback control. Our experiments in simulated real-world cluttered environments and on an actual ground vehicle demonstrate that the proposed approach can reach goal locations more reliably and efficiently in novel environments as compared to purely geometric mapping-based or end-to-end learning-based alternatives. Our approach does not rely on detailed explicit 3D maps of the environment, works well with low frame rates, and generalizes well from simulation to the real world. Videos describing our approach and experiments are available on the project website.

研究动机与目标

在未知、杂乱的室内环境中无需显式3D地图即可实现自主导航。
提出一个混合框架，使用学习来预测无碰撞的航点，并用基于模型的控制来生成可行轨迹。
证明该方法可从仿真推广到真实机器人，并优于纯学习或基于映射的基线。

提出的方法

感知模块：一个卷积神经网络从机载 RGB 图像 I_t、当前速度 u_t 和机器人坐标中的目标 p_t* 预测下一个航点。
规划：拟合三阶样条从当前状态到预测航点生成平滑的 x,y 轨迹，确保动态可行性。
跟踪：一个 LQR 控制器在样条轨迹周围线性化动力学，以产生用于在时域 H 内执行的前馈和反馈项。
训练：在训练期间使用来自已知地图的最优航点对感知进行监督，从而在测试时实现零-shot 性能，无需现实世界标注。
评估：在仿真中以及在 TurtleBot 2 硬件平台上与端到端学习和几何映射方法进行对比。

实验结果

研究问题

RQ1将感知驱动的航点预测器与基于模型的规划结合，是否能够在新颖、杂乱的室内环境中实现可靠导航？
RQ2在成功率、速度和轨迹平滑度方面，混合型 LB-WayPtNav 与端到端学习及基于映射的规划相比如何？
RQ3该方法是否能够在没有真实世界微调的情况下从仿真推广到真实硬件？
RQ4使用样条基规划和 LQR 跟踪对控制平滑性和对扰动的鲁棒性有什么影响？

主要发现

LB-WayPtNav 在仿真和硬件测试中实现了比端到端学习更高的成功率和更快的目标到达速度。
该方法产生更平滑的轨迹，较低的加速度和 jerk 相比于纯端到端策略。
在仿真中，LB-WayPtNav 优于无记忆深度映射基线，并在获得完美深度时接近基于深度的映射性能。
该方法可从仿真推广到真实机器人，域间差距最小，并且在测试时不需要显式的 3D 地图。
感知训练使用来自已知地图的最优航点进行监督，从而无需人工标注。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。