QUICK REVIEW

[论文解读] Search Problems in Trees with Symmetries: Near Optimal Traversal Strategies for Individualization-Refinement Algorithms.

Markus Anders, Pascal Schweitzer|arXiv (Cornell University)|Jan 1, 2020

Optimization and Search Problems被引用 2

一句话总结

本文提出了一种随机化、近乎最优的遍历策略，用于在对称树中求解同构性搜索问题，建模了实际图同构性工具中的回溯行为。该方法证明了在树大小平方根范围内的预期叶节点访问次数为拟线性，且具有匹配的下界，解释了为何在 Traces 等工具中，随机化广度优先搜索优于确定性深度优先方法。

ABSTRACT

We define a search problem on trees that closely captures the backtracking behavior of all current practical graph isomorphism algorithms. Given two trees with colored leaves, the goal is to find two leaves of matching color, one in each of the trees. The trees are subject to an invariance property which promises that for every pair of leaves of equal color there must be a symmetry (or an isomorphism) that maps one leaf to the other. We describe a randomized algorithm with errors for which the number of visited leaves is quasilinear in the square root of the size of the smaller of the two trees. For inputs of bounded degree, we develop a Las Vegas algorithm with a similar running time. We prove that these results are optimal up to logarithmic factors. We show a lower bound for randomized algorithms on inputs of bounded degree that is the square root of the tree sizes. For inputs of unbounded degree, we show a linear lower bound for Las Vegas algorithms. For deterministic algorithms we can prove a linear bound even for inputs of bounded degree. This shows why randomized algorithms outperform deterministic ones. Our results explain why the randomized breadth-first with intermixed experimental path search strategy of the isomorphism tool Traces (Piperno 2008) is often superior to the depth-first search strategy of other tools such as nauty (McKay 1977) or bliss (Junttila, Kaski 2007). However, our algorithm also provides a new traversal strategy, which is theoretically near optimal with better worst case behavior than traversal strategies that have previously been used.

研究动机与目标

通过在对称树上建立搜索问题，建模实际图同构性算法中的回溯行为。
设计一种随机化算法，在尊重着色树对称性的前提下，最小化叶节点访问次数。
建立在有界与无界度数树上，针对随机化与确定性算法的理论下界。
解释 Traces 中的随机化广度优先策略为何在实践中优于 nauty 和 bliss 等工具中的确定性深度优先策略。
为个体化-精炼算法提供一种具有可证明近乎最优最坏情况性能的新遍历策略。

提出的方法

形式化定义了具有对称性不变性的着色树上的搜索问题：对于每对同色叶节点，存在一个对称映射将其中一个映射到另一个。
提出一种随机化算法，以最小化预期访问次数的方式访问叶节点，实现在较小树大小平方根范围内的拟线性复杂度。
为有界度数输入设计了一种拉斯维加斯变体，具有相似的预期性能保证。
利用概率分析与对称性利用，对预期叶节点探索次数进行上界估计。
通过对抗性树构造推导下界，证明结果在对数因子范围内达到最优。
通过从树同构性搜索问题到对称性约束下遍历难度的归约，分析其内在困难。

实验结果

研究问题

RQ1在随机策略下，为在对称树中找到一对匹配颜色的叶节点，所需的最优预期叶节点访问次数是多少？
RQ2对于有界度数与无界度数的树，随机化与确定性算法的下界有何不同？
RQ3为何 Traces 中的随机化广度优先搜索在实践中优于确定性深度优先策略？
RQ4能否设计一种新的遍历策略，使其在对称树中同构性搜索的最坏情况性能达到近乎最优？
RQ5着色树中的对称性在多大程度上限制了搜索算法的效率？

主要发现

所提出的随机化算法实现了预期叶节点访问次数在较小树大小平方根范围内的拟线性复杂度。
对于有界度数树，该算法的拉斯维加斯变体达到了相同的预期性能上界。
证明了在有界度数树上，随机化算法的下界为 Ω(√n)，表明该算法近乎最优。
对于无界度数树，证明了拉斯维加斯算法的线性下界为 Ω(n)，揭示其固有局限性。
即使在有界度数树上，也证明了确定性算法的线性下界为 Ω(n)，解释了其性能劣于随机化方法的原因。
理论结果解释了 Traces 中随机化广度优先策略在同构性工具中优于确定性深度优先方法的实证成功。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。