Skip to main content
QUICK REVIEW

[Paper Review] Graph WaveNet for Deep Spatial-Temporal Graph Modeling

Zonghan Wu, Shirui Pan|arXiv (Cornell University)|May 31, 2019
Advanced Graph Neural Networks26 references70 citations
TL;DR

Graph WaveNet introduces a self-adaptive adjacency matrix and stacked dilated causal convolutions to model spatial-temporal graphs, achieving state-of-the-art traffic forecasting with efficient end-to-end training.

ABSTRACT

Spatial-temporal graph modeling is an important task to analyze the spatial relations and temporal trends of components in a system. Existing approaches mostly capture the spatial dependency on a fixed graph structure, assuming that the underlying relation between entities is pre-determined. However, the explicit graph structure (relation) does not necessarily reflect the true dependency and genuine relation may be missing due to the incomplete connections in the data. Furthermore, existing methods are ineffective to capture the temporal trends as the RNNs or CNNs employed in these methods cannot capture long-range temporal sequences. To overcome these limitations, we propose in this paper a novel graph neural network architecture, Graph WaveNet, for spatial-temporal graph modeling. By developing a novel adaptive dependency matrix and learn it through node embedding, our model can precisely capture the hidden spatial dependency in the data. With a stacked dilated 1D convolution component whose receptive field grows exponentially as the number of layers increases, Graph WaveNet is able to handle very long sequences. These two components are integrated seamlessly in a unified framework and the whole framework is learned in an end-to-end manner. Experimental results on two public traffic network datasets, METR-LA and PEMS-BAY, demonstrate the superior performance of our algorithm.

Motivation & Objective

  • Address the limitations of fixed graph structures in spatial-temporal modeling.
  • Enable learning hidden spatial dependencies directly from data.
  • Capture long-range temporal dependencies efficiently without RNNs.
  • Integrate graph convolution with dilated causal convolutions in an end-to-end framework.

Proposed method

  • Introduce a graph convolution layer with a self-adaptive adjacency matrix learned from node embeddings.
  • Combine diffusion-based and self-adaptive graph convolutions to model spatial dependencies.
  • Employ stacked dilated causal convolutions (Gated TCN) to capture long-range temporal patterns.
  • Use a unified end-to-end framework where spatial-temporal layers are stacked with residual and skip connections.
  • Train the model to predict multi-step ahead outputs jointly to avoid train/infer mismatch.
  • Output predictions for all T steps in one forward pass rather than recursively.

Experimental results

Research questions

  • RQ1Can a self-adaptive adjacency matrix uncover hidden spatial dependencies beyond a fixed graph?
  • RQ2Does integrating adaptive spatial connections with dilated temporal convolutions improve long-horizon forecasting?
  • RQ3How does Graph WaveNet compare to existing spatial-temporal models on traffic datasets in terms of accuracy and efficiency?

Key findings

DatasetModel15 min MAE15 min RMSE15 min MAPE30 min MAE30 min RMSE30 min MAPE60 min MAE60 min RMSE60 min MAPE
METR-LAARIMA3.998.219.60%5.1510.4512.70%6.9013.2317.40%
METR-LAFC-LSTM3.446.309.60%3.777.2310.90%4.378.6913.20%
METR-LAWaveNet2.995.898.04%3.597.2810.25%4.458.9313.62%
METR-LADCRNN2.775.387.30%3.156.458.80%3.607.6010.50%
METR-LAGGRU2.715.246.99%3.126.368.56%3.647.6510.62%
METR-LASTGCN2.885.747.62%3.477.249.57%4.599.4012.70%
METR-LAGraph WaveNet2.695.156.90%3.076.228.37%3.537.3710.01%
PEMS-BAYARIMA1.623.303.50%2.334.765.40%3.386.508.30%
PEMS-BAYFC-LSTM2.054.194.80%2.204.555.20%2.374.965.70%
PEMS-BAYWaveNet1.393.012.91%1.834.214.16%2.355.435.87%
PEMS-BAYDCRNN1.382.952.90%1.743.973.90%2.074.744.90%
PEMS-BAYSTGCN1.362.962.90%1.814.274.17%2.495.695.79%
PEMS-BAYGraph WaveNet1.302.742.73%1.633.703.67%1.954.524.63%
  • Graph WaveNet achieves state-of-the-art results on METR-LA and PEMS-BAY for 15, 30, and 60 minute horizons.
  • The forward-backward-adaptive adjacency configuration yields the best performance across both datasets.
  • The adaptive-only configuration performs well when graph structure is unavailable, approaching forward-only performance.
  • Graph WaveNet is faster at inference and substantially faster than DCRNN during training, while remaining more accurate than many baselines.
  • Learning a self-adaptive adjacency matrix reveals influential nodes and uncovers hidden spatial dependencies.
  • The model remains stable and provides improved multi-step forecasts compared to WaveNet and STGCN.

Better researchstarts right now

From paper design to paper writing, dramatically reduce your research time.

No credit card · Free plan available

This review was created by AI and reviewed by human editors.