[Paper Review] Learning Lane Graph Representations for Motion Forecasting
The paper introduces LaneGCN, a lane-graph based model with actor-map fusion that outperforms state-of-the-art on Argoverse by learning structured map representations and modeling interactions between actors and HD maps for multi-modal motion forecasting.
We propose a motion forecasting model that exploits a novel structured map representation as well as actor-map interactions. Instead of encoding vectorized maps as raster images, we construct a lane graph from raw map data to explicitly preserve the map structure. To capture the complex topology and long range dependencies of the lane graph, we propose LaneGCN which extends graph convolutions with multiple adjacency matrices and along-lane dilation. To capture the complex interactions between actors and maps, we exploit a fusion network consisting of four types of interactions, actor-to-lane, lane-to-lane, lane-to-actor and actor-to-actor. Powered by LaneGCN and actor-map interactions, our model is able to predict accurate and realistic multi-modal trajectories. Our approach significantly outperforms the state-of-the-art on the large scale Argoverse motion forecasting benchmark.
Motivation & Objective
- Motivate leveraging high-definition map topology for accurate motion forecasting
- Propose a lane-graph representation learned by LaneGCN to capture complex lane topology
- Model comprehensive interactions between traffic actors and lane graphs via a fusion network
- Demonstrate end-to-end trainability and superior performance on Argoverse against raster-based methods
Proposed method
- Construct a lane graph from vectorized HD map data to preserve map topology without rasterization
- Develop LaneConv with multi-type adjacency (predecessor, successor, left, right) and dilations to capture long-range lane dependencies
- Represent actors and lanes as nodes; extract actor features with 1D CNNs (ActorNet) and lane features with LaneGCN (MapNet)
- Fuse actor and lane features with FusionNet through four interaction types: actor-to-lane, lane-to-lane, lane-to-actor, and actor-to-actor, using spatial attention and LaneGCN for L2L
- Predict multi-modal future trajectories via a two-branch prediction header (regression for trajectories and classification for mode confidences)
- Train end-to-end with a combined classification and regression loss, including a max-margin term for modality ranking
Experimental results
Research questions
- RQ1Does lane-graph based representation capture map topology more effectively than rasterized maps for motion forecasting?
- RQ2Can LaneConv and LaneGCN effectively model long-range dependencies in lane topology?
- RQ3Do actor-map interactions (A2L, L2L, L2A, A2A) improve forecasting accuracy over actor-only or map-only baselines?
- RQ4What is the impact of ablations on map/actor fusion and lane graph operators on predictive performance?
Key findings
| Model | minADE (K=1) | minFDE (K=1) | MR (K=1) | minADE (K=6) | minFDE (K=6) | MR (K=6) |
|---|---|---|---|---|---|---|
| Argoverse Baseline | 2.96 | 6.81 | 0.81 | 2.34 | 5.44 | 0.69 |
| Argoverse Baseline (NN) | 3.45 | 7.88 | 0.87 | 1.71 | 3.29 | 0.54 |
| Holmes (7th) | 2.91 | 6.54 | 0.82 | 1.38 | 2.66 | 0.42 |
| cxx (3rd) | 1.91 | 4.31 | 0.66 | 0.99 | 1.71 | 0.19 |
| uulm-mrm (2nd) | 1.90 | 4.19 | 0.63 | 0.94 | 1.55 | 0.22 |
| Jean (1st) | 1.86 | 4.18 | 0.63 | 0.93 | 1.49 | 0.19 |
| Our Model | 1.71 | 3.78 | 0.59 | 0.87 | 1.36 | 0.16 |
- Significant improvements over state-of-the-art on Argoverse across minADE, minFDE, and MR for both K=1 and K=6
- LaneGCN with multi-type and dilated LaneConv better captures lane topology than vanilla GCNs
- Incorporating A2L, L2L, L2A, and A2A interactions materially improves performance, with map-informed flows enhancing actor interactions
- Ablation studies show that each component (LaneConv, residual blocks, dilation, and fusion blocks) contributes to performance gains
- Qualitative results illustrate improved handling of hard cases such as missing history, left/right turns, and abrupt maneuvers
Better researchstarts right now
From paper design to paper writing, dramatically reduce your research time.
No credit card · Free plan available
This review was created by AI and reviewed by human editors.