QUICK REVIEW

[论文解读] Backprop KF: Learning Discriminative Deterministic State Estimators

Tuomas Haarnoja, Anurag Ajay|arXiv (Cornell University)|May 23, 2016

Human Pose and Action Recognition参考文献 23被引用 101

一句话总结

论文提出 Backprop KF (BKF)，一个判别式、确定性的状态估计器，将卡尔曼滤波展开为可微分的计算图，通过反向传播端到端训练，使其能够从高维观测如图像中输入。

ABSTRACT

Generative state estimators based on probabilistic filters and smoothers are one of the most popular classes of state estimators for robots and autonomous vehicles. However, generative models have limited capacity to handle rich sensory observations, such as camera images, since they must model the entire distribution over sensor readings. Discriminative models do not suffer from this limitation, but are typically more complex to train as latent variable models for state estimation. We present an alternative approach where the parameters of the latent state distribution are directly optimized as a deterministic computation graph, resulting in a simple and effective gradient descent algorithm for training discriminative state estimators. We show that this procedure can be used to train state estimators that use complex input, such as raw camera images, which must be processed using expressive nonlinear function approximators such as convolutional neural networks. Our model can be viewed as a type of recurrent neural network, and the connection to probabilistic filtering allows us to design a network architecture that is particularly well suited for state estimation. We evaluate our approach on synthetic tracking task with raw image inputs and on the visual odometry task in the KITTI dataset. The results show significant improvement over both standard generative approaches and regular recurrent neural networks.

研究动机与目标

激励判别式状态估计，能够在不建模完整观测分布的情况下处理高维观测。
提出一种确定性计算图形式的状态估计器，能够实现端到端训练。
通过卡尔曼滤波核心融入领域知识，同时允许表达能力强的观测模型（如 CNNs）。
在基于视觉的跟踪和 KITTI 视觉里程计上，性能优于标准生成方法和基础 RNN。

提出的方法

通过对卡尔曼滤波展开并使用判别性训练的观测模型 g_theta(o_t) 构造确定性计算图。
通过时间反向传播（BPTT）端到端训练，方法是对 κ(s_{t-1}, z_t)、q(s_t) 和 l(φ_{y_t}) 求导。
从处理原始观测 o_t 的 CNN 输出 z_t 和 R_t，使滤波器能够对观测不确定性进行加权。
在状态更新中使用扩展卡尔曼滤波（或 KF 变体），同时优化观测网络和滤波器参数。
在概率滤波的基础上构建体系结构，同时将整体模型视为一个确定性的类似 RNN 的计算图。

实验结果

研究问题

RQ1在处理诸如图像等高维观测时，是否存在与概率滤波结构相匹配的判别式、确定性状态估计器，能够优于传统的生成滤波器？
RQ2将观测模型和滤波器进行端到端训练，是否能提升基于视觉的任务（带遮挡的跟踪和 KITTI 视觉里程计）的状态估计精度，相较于分段训练和通用 RNN？
RQ3将观测相关的不确定性（R_t）纳入后，在遮挡和噪声条件下对估计性能有何影响？

主要发现

BKF 在带遮挡的合成视觉跟踪任务上优于标准生成型 KF 基估计器和原生 LSTM。
BKF 实现的 RMS 跟踪误差更低（0.0537），比前馈网络（0.2322）、分段 KF（0.1160）及 LSTM 变体（0.1407–0.1423）更优。
在 KITTI 视觉里程计中，BKF 在不同训练集规模下都优于分段 KF 和 LSTM 基线，获得更高的平移和旋转精度。
采用基于 CNN 的观测模型进行端到端训练，使滤波器能够自适应观测不确定性（R_t），以最大化最终状态估计性能。
BKF 通过利用领域知识（KF 结构）来维持数据效率优势，同时受益于表达能力强的判别式观测处理。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。