QUICK REVIEW

[论文解读] A Unified Neural Network Approach for Estimating Travel Time and Distance for a Taxi Trip

Ishan Jindal, Tony Tony|arXiv (Cornell University)|Oct 12, 2017

Traffic Prediction and Management Techniques参考文献 15被引用 62

一句话总结

本文提出 ST-NN，一种统一的深度神经网络，使用仅有的 GPS bin 坐标和时间-日时，联合估计起点-终点对之间的 travel time 和 travel distance，在 NYC 出租车数据上实现优于现有方法的性能。

ABSTRACT

In building intelligent transportation systems such as taxi or rideshare services, accurate prediction of travel time and distance is crucial for customer experience and resource management. Using the NYC taxi dataset, which contains taxi trips data collected from GPS-enabled taxis [23], this paper investigates the use of deep neural networks to jointly predict taxi trip time and distance. We propose a model, called ST-NN (Spatio-Temporal Neural Network), which first predicts the travel distance between an origin and a destination GPS coordinate, then combines this prediction with the time of day to predict the travel time. The beauty of ST-NN is that it uses only the raw trips data without requiring further feature engineering and provides a joint estimate of travel time and distance. We compare the performance of ST-NN to that of state-of-the-art travel time estimation methods, and we observe that the proposed approach generalizes better than state-of-the-art methods. We show that ST-NN approach significantly reduces the mean absolute error for both predicted travel time and distance, about 17% for travel time prediction. We also observe that the proposed approach is more robust to outliers present in the dataset by testing the performance of ST-NN on the datasets with and without outliers.

研究动机与目标

推动城市出租车/网约车系统中对行驶时间与距离的准确估计。
利用大规模的 NYC 出租车数据来开发一个基于回归、无需特征工程的模型。
证明联合的 ST-NN 架构在行驶时间预测方面优于基线方法。
展示 ST-NN 对异常值的鲁棒性，并与现有方法进行比较。

提出的方法

将起点和终点表示为 2-D GPS 网格，时间-of-day 表示为 1-D 时间网格。
开发 Dist DNN 和 Time DNN 模块（三层多层感知机）分别预测距离和时间。
将模块融合为一个统一的 ST-NN，联合最小化 travel-time 和 travel-distance 损失之和。
在 NYC 出租车数据集上通过随机梯度下降进行训练，采用 80/20 的训练/测试分割。
与线性回归基线以及独立的 TimeNN/DistNN 模块进行比较，并参考方法（BTE）。
进行预处理，将 GPS 坐标映射到离散 cell，并将时间-日分成工作日/周末网格。

实验结果

研究问题

RQ1在没有路线信息的情况下，神经网络是否可以仅从起点-终点的 GPS 坐标和时间信息联合预测行驶时间和距离？
RQ2联合学习（ST-NN）相比仅时间模型和线性基线是否能改进行驶时间预测？
RQ3在存在异常值以及不同时间段（工作日 vs 周末）下模型的表现如何？
RQ4预测的行驶距离如何影响城市出租车数据中的时间估计？

主要发现

ST-NN 在行驶时间估计方面优于简单的线性回归基线。
在 NYC 数据上，ST-NN 的联合学习使行驶时间 MAE 相比 TimeNN 降低约 13 秒。
DistNN 在行驶距离预测方面优于简单线性回归；ST-NN 实现接近最佳的距离预测，MAE 约为 0.27（单位为提交中所述）且 R2 高 (~0.943)。
ST-NN 对异常值具有鲁棒性，在包含异常行程的数据上相对于基线方法仍保持优越性能。
相比仅曼哈顿区的基准，ST-NN 将行驶时间 MAE 降低约 17%。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。