QUICK REVIEW

[论文解读] Deep Spatio-Temporal Residual Networks for Citywide Crowd Flows Prediction

Junbo Zhang, Yu Zheng|arXiv (Cornell University)|Oct 1, 2016

Traffic Prediction and Management Techniques参考文献 14被引用 205

一句话总结

ST-ResNet 通过对时序的 closeness、period、trend 进行建模，并使用残差 CNN 分支，再与外部因素融合，联合预测全市各区域的流入流出。它在北京和纽约数据上优于若干基线模型。

ABSTRACT

Forecasting the flow of crowds is of great importance to traffic management and public safety, yet a very challenging task affected by many complex factors, such as inter-region traffic, events and weather. In this paper, we propose a deep-learning-based approach, called ST-ResNet, to collectively forecast the in-flow and out-flow of crowds in each and every region through a city. We design an end-to-end structure of ST-ResNet based on unique properties of spatio-temporal data. More specifically, we employ the framework of the residual neural networks to model the temporal closeness, period, and trend properties of the crowd traffic, respectively. For each property, we design a branch of residual convolutional units, each of which models the spatial properties of the crowd traffic. ST-ResNet learns to dynamically aggregate the output of the three residual neural networks based on data, assigning different weights to different branches and regions. The aggregation is further combined with external factors, such as weather and day of the week, to predict the final traffic of crowds in each and every region. We evaluate ST-ResNet based on two types of crowd flows in Beijing and NYC, finding that its performance exceeds six well-know methods.

研究动机与目标

推动面向交通管理与公共安全的全市人群流量预测的准确性。
开发一个端到端的深度学习模型，捕捉时空依赖性和外部因素。
用专门的残差网络对时间上的邻近性、周期和趋势进行建模。
动态融合来自多个时间分支和外部数据的输出，以预测全市各区域的流量。

提出的方法

将城市划分为 I x J 的网格，并将每个时间间隔的流入/流出表示为一个二维通道的类似图像的张量。
使用三个残差-CNN分支来建模 closeness（最近的时间）、period（跨日的同一时间）和 trend（较长期的模式）。
应用基于参数矩阵的融合来对区域和分量特定的输出（closeness、period、trend）进行加权。
通过一个两层全连接网络将外部因素（天气、假日、DayOfWeek）结合起来，并与时序输出融合。
通过 tanh 激活映射到 [-1,1] 的预测值，并通过最小化历史数据的均方误差进行训练。
使用 Adam 优化训练，并在残差单元中使用 BN 以提高收敛性。

实验结果

研究问题

RQ1如何利用时空数据为每个区域准确预测全市的人群流入/流出？
RQ2将 closeness、period 和 trend 分别用不同的残差网络是否比单一模型的预测更好？
RQ3外部因素是否可以有效整合以提升预测准确性？
RQ4区域和时间感知的融合机制是否能在不同城市区域提升性能？

主要发现

模型	RMSE
HA	57.69
ARIMA	22.78
SARIMA	26.88
VAR	22.88
ST-ANN	19.57
DeepST	18.18
ST-ResNet L2-E	17.67
ST-ResNet L4-E	17.51
ST-ResNet L12-E	16.89
ST-ResNet L12-E-BN	16.69
ST-ResNet L12-single-E	17.40
ST-ResNet L12	17.00
ST-ResNet L12-E-noFusion	17.96
BikeNYC ST-ResNet (4 RU)	6.33
DeepST-C	8.39
DeepST-CP	7.64
DeepST-CPT	7.56
DeepST-CPTM	7.43

在 TaxiBJ 北京数据上，ST-ResNet 的变体总体上优于六个基线方法（HA、ARIMA、SARIMA、VAR、ST-ANN、DeepST）。
在 TaxiBJ 上，最佳 ST-ResNet 配置（L12-E-BN）实现 RMSE 16.69，相比 16.89（L12-E）和 17.00（L12 不带融合）。
外部因素提升性能；带 E（外部）的模型优于不带的模型（如 L12-E vs L12）。
基于参数矩阵的融合显著优于简单加法融合（L12-E-noFusion）。
在 BikeNYC 上，四个残差单元的 ST-ResNet 实现 RMSE 6.33，超过所有基线，包括 DeepST 的变体（如 DeepST-CPTM 7.43）。
该方法对不同城市数据集具有良好的泛化能力（Beijing TaxiBJ 与 NYC BikeNYC）。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。