QUICK REVIEW

[论文解读] HEATACO: Heatmap-Guided Ant Colony Decoding for Large-Scale Travelling Salesman Problems

Bo-Cheng Lin, Yi Mei|arXiv (Cornell University)|Jan 26, 2026

Vehicle Routing Optimization Methods被引用 0

一句话总结

HeatACO 将热力图边 confidences 视为软先验，并使用 Max–Min Ant System 解码器结合信息素反馈，在大规模下（多达 10K 节点）产生可行解的 TSP 线路，提供强劲的质量与时间权衡。

ABSTRACT

Heatmap-based non-autoregressive solvers for large-scale Travelling Salesman Problems output dense edge-probability scores, yet final performance largely hinges on the decoder that must satisfy degree-2 constraints and form a single Hamiltonian tour. Greedy commitment can cascade into irreparable mistakes at large $N$, whereas MCTS-guided local search is accurate but compute-heavy and highly engineered. We instead treat the heatmap as a soft edge prior and cast decoding as probabilistic tour construction under feasibility constraints, where the key is to correct local mis-rankings via inexpensive global coordination. Based on this view, we introduce HeatACO, a plug-and-play Max-Min Ant System decoder whose transition policy is softly biased by the heatmap while pheromone updates provide lightweight, instance-specific feedback to resolve global conflicts; optional 2-opt/3-opt post-processing further improves tour quality. On TSP500/1K/10K, using heatmaps produced by four pretrained predictors, HeatACO+2opt achieves gaps down to 0.11%/0.23%/1.15% with seconds-to-minutes CPU decoding for fixed heatmaps, offering a better quality--time trade-off than greedy decoding and published MCTS-based decoders. Finally, we find the gains track heatmap reliability: under distribution shift, miscalibration and confidence collapse bound decoding improvements, suggesting heatmap generalisation is a primary lever for further progress.

研究动机与目标

Motivate decoding for large-scale heatmap-based TSP solvers where the heatmap provides edge confidences.
Develop a modular, scalable decoder that respects degree-2 and subtour constraints while leveraging heatmaps.
Offer a lightweight alternative to MCTS-based or greedy decoders with competitive quality under practical budgets.
Analyze how heatmap reliability and distribution shift affect decoding performance and provide practical tuning guidance.

提出的方法

Introduce HeatACO, a Max–Min Ant System (MMAS) decoder that treats the heatmap as a multiplicative prior in transition probabilities.
Bias MMAS sampling with a heatmap-based factor tilde{H}_{ij} derived from H, using a gamma exponent to control guidance strength (p_{i→j} ∝ (τ_{ij})^{α}(η_{ij})^{β}(tilde{H}_{ij})^{γ}).
Construct sparse candidate lists from the heatmap (with distance-based fallbacks for connectivity) and optionally apply 2-opt/3-opt local improvement after decoding.
Keep heatmap predictor f_{θ} fixed; pheromone updates provide lightweight, instance-specific global feedback to resolve conflicts.
Optionally perform 2-opt/3-opt to tighten tours and reinforce pheromones with improved solutions.]
research_questions: [

实验结果

研究问题

RQ1How can a heatmap-based edge prior be integrated into a scalable, feasible-tour decoder without relying on heavy search?
RQ2To what extent does heatmap-guided MMAS decode large-scale TSPs efficiently while maintaining high solution quality?
RQ3How do heatmap strength and calibration affect the decoding performance and robustness under distribution shift?
RQ4What is the impact of optional local search (2-opt/3-opt) on final tour quality and runtime?

主要发现

HeatACO converts the same heatmaps into high-quality tours under practical CPU budgets, achieving gaps of 0.11%–1.27% on TSP500/1K/10K with seconds-to-minutes of decoding time.
Compared with greedy decoding, HeatACO offers substantially better quality under similar budgets, and with 2-opt, often outperforms published MCTS-guided decoding within comparable or smaller budgets.
Heatmap guidance speeds time-to-quality, especially on large instances (TSP10K), by biasing sampling toward high-probability edges while pheromones correct global conflicts.
The method is robust when heatmaps preserve meaningful edge rankings, but miscalibration and distribution shift (OOD) can limit gains, highlighting heatmap generalisation as a key lever.
A coarse gamma parameter (γ) controls the balance between heatmap following and exploration, with entropy-based or heuristic rules guiding selection when labels are unavailable.
Ablations show that heatmap prior improves over vanilla MMAS, and 2-opt/3-opt can significantly tighten results, with 3-opt offering further gains at higher compute cost.

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。