Skip to main content
QUICK REVIEW

[论文解读] Efficient Active Search for Combinatorial Optimization Problems

André Hottung, Yeong‐Dae Kwon|arXiv (Cornell University)|Jun 9, 2021
Metaheuristic Optimization Algorithms Research参考文献 28被引用 27
一句话总结

提出三种高效的主动搜索(EAS)策略,在搜索过程中仅更新模型参数的子集,从而提高用于TSP、CVRP和JSSP的基于ML的构造方法的性能,并且通常超越最先进的ML方法,有时甚至超越LKH3。

ABSTRACT

Recently numerous machine learning based methods for combinatorial optimization problems have been proposed that learn to construct solutions in a sequential decision process via reinforcement learning. While these methods can be easily combined with search strategies like sampling and beam search, it is not straightforward to integrate them into a high-level search procedure offering strong search guidance. Bello et al. (2016) propose active search, which adjusts the weights of a (trained) model with respect to a single instance at test time using reinforcement learning. While active search is simple to implement, it is not competitive with state-of-the-art methods because adjusting all model weights for each test instance is very time and memory intensive. Instead of updating all model weights, we propose and evaluate three efficient active search strategies that only update a subset of parameters during the search. The proposed methods offer a simple way to significantly improve the search performance of a given model and outperform state-of-the-art machine learning based methods on combinatorial problems, even surpassing the well-known heuristic solver LKH3 on the capacitated vehicle routing problem. Finally, we show that (efficient) active search enables learned models to effectively solve instances that are much larger than those seen during training.

研究动机与目标

  • Motivate and address the high computational cost of Bello et al.'s active search by proposing efficient alternatives.
  • Develop three strategies to update only a subset of model parameters during test-time search.
  • Demonstrate that EAS variants improve solution quality and generalization across multiple combinatorial optimization problems.
  • Show that EAS can outperform state-of-the-art ML-based methods and even a strong heuristic (LKH3) on CVRP and JSSP.

提出的方法

  • Define three EAS variants: Embedding updates (EAS-Emb), Added-layer updates (EAS-Lay), and Tabular updates (EAS-Tab).
  • Each variant updates a small, instance-specific component while keeping the rest of the model fixed during search.
  • Use RL and imitation learning losses to guide updates: L_RL based on REINFORCE and L_IL from imitation of incumbent best solutions; combine as L_RIL = L_RL + λ L_IL.
  • For EAS-Emb: update a subset of instance embeddings with gradients; for EAS-Lay: insert an instance-specific residual layer and train its weights; for EAS-Tab: adjust a look-up table influencing action probabilities without backpropagation.
  • Evaluate on TSP (POMO-based), CVRP (POMO-based), and JSSP (L2D-based); compare against Concorde, LKH3, and several ML baselines.

实验结果

研究问题

  • RQ1Can efficient active search (updating only a subset of parameters) achieve competitive or superior solution quality to full active search?
  • RQ2Which EAS variant (Emb, Lay, Tab) offers the best trade-off between runtime and solution quality across TSP, CVRP, and JSSP?
  • RQ3How do EAS methods affect generalization to larger instances than those seen during training?
  • RQ4Do EAS methods enable ML construction methods to outperform strong solvers like LKH3 on CVRP and JSSP?

主要发现

ProblemInstance SetnMethodObjective (Avg)Gap to Best/Opt (%)Time (Wall)Notes
TSPTesting (10k inst.)100Concorde7.7650.000%82MExact solver; baseline
TSPTesting (10k inst.)100LKH37.7650.000%8HHeuristic solver; baseline
TSPTesting (10k inst.)100POMO-Greedy7.7760.146%1MGreedy baseline from POMO family
TSPTesting (10k inst.)100POMO-Sampling7.7700.074%4HSampling baseline from POMO family
TSPTesting (10k inst.)100Active Search7.7680.046%5DOriginal active search; high cost
TSPTesting (10k inst.)100EAS-Emb7.7690.063%5HEAS embedding updates
TSPTesting (10k inst.)100EAS-Lay7.7690.053%7HEAS added-layer updates
TSPTesting (10k inst.)100EAS-Tab7.7680.048%5HEAS tabular updates
CVRPTesting (10k inst.)100LKH315.650.00%6DBaseline LKH3 on CVRP
CVRPTesting (10k inst.)100POMO-Greedy15.760.76%2MGreedy baseline
CVRPTesting (10k inst.)100POMO-Sampling15.670.17%7HSampling baseline
CVRPTesting (10k inst.)100Active Search15.63-0.07%8DOriginal active search; slower
CVRPTesting (10k inst.)100EAS-Emb15.63-0.08%9HEAS embedding updates
CVRPTesting (10k inst.)100EAS-Lay15.61-0.23%12HEAS added-layer updates
CVRPTesting (10k inst.)100EAS-Tab15.62-0.14%8HEAS tabular updates
JSSPTesting (100 inst.)10x10OR-Tools807.60.0%37SBaseline OR-Tools
JSSPTesting (100 inst.)10x10L2D-Greedy988.622.3%20SBaseline L2D greedy
JSSPTesting (100 inst.)10x10L2D-Sampling871.78.0%8HSampling baseline
JSSPTesting (100 inst.)10x10Active Search854.25.8%8HOriginal active search
JSSPTesting (100 inst.)10x10EAS-Emb837.03.7%7HEAS embedding updates
JSSPTesting (100 inst.)10x10EAS-Lay859.66.5%7HEAS added-layer updates
JSSPTesting (100 inst.)10x10EAS-Tab860.26.5%8HEAS tabular updates
  • EAS variants significantly reduce runtime compared to full active search while maintaining or improving solution quality.
  • On TSP, EAS-Emb, EAS-Lay, and EAS-Tab achieve gaps to Concorde/LKH3 comparable to or better than active search, with orders-of-magnitude faster runtimes.
  • On CVRP, EAS-Lay outperforms all baselines including LKH3 on test instances, and EAS-Tab achieves strong performance with faster runtimes; EAS-Tab shows sensitivity to α, needing tuning for some instances.
  • On JSSP, EAS-Emb yields the best performance among EAS variants, substantially reducing gaps versus sampling; EAS-Lay is competitive with active search; EAS-Tab lags on larger instances.
  • Across problems, EAS approaches improve generalization to larger instances by enabling effective search guidance without retraining the full model

更好的研究,从现在开始

从论文设计到论文写作,大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成,并经人工编辑审核。