QUICK REVIEW

[论文解读] Log Diameter Rounds Algorithms for $2$-Vertex and $2$-Edge Connectivity

Alexandr Andoni, Clifford Stein|arXiv (Cornell University)|Jan 1, 2019

Complexity and Algorithms in Graphs参考文献 42被引用 4

一句话总结

该论文提出了适用于无向图中2-边连通性和2-顶点连通性的完全可扩展的MPC算法，分别实现了O(log D log log(m/n) n)和O(log D log² log(m/n) n + log D′ log log(m/n) n)的并行时间复杂度，其中D为图的直径，D′为双直径。这些算法利用树分解、最近公共祖先（LCA）和RMQ技术，在相对于图直径的亚对数轮次内高效识别桥和双连通分量。

ABSTRACT

Many modern parallel systems, such as MapReduce, Hadoop and Spark, can be modeled well by the MPC model. The MPC model captures well coarse-grained computation on large data --- data is distributed to processors, each of which has a sublinear (in the input data) amount of memory and we alternate between rounds of computation and rounds of communication, where each machine can communicate an amount of data as large as the size of its memory. This model is stronger than the classical PRAM model, and it is an intriguing question to design algorithms whose running time is smaller than in the PRAM model. In this paper, we study two fundamental problems, $2$-edge connectivity and $2$-vertex connectivity (biconnectivity). PRAM algorithms which run in $O(\log n)$ time have been known for many years. We give algorithms using roughly log diameter rounds in the MPC model. Our main results are, for an $n$-vertex, $m$-edge graph of diameter $D$ and bi-diameter $D'$, 1) a $O(\log D\log\log_{m/n} n)$ parallel time $2$-edge connectivity algorithm, 2) a $O(\log D\log^2\log_{m/n}n+\log D'\log\log_{m/n}n)$ parallel time biconnectivity algorithm, where the bi-diameter $D'$ is the largest cycle length over all the vertex pairs in the same biconnected component. Our results are fully scalable, meaning that the memory per processor can be $O(n^δ)$ for arbitrary constant $δ>0$, and the total memory used is linear in the problem size. Our $2$-edge connectivity algorithm achieves the same parallel time as the connectivity algorithm of Andoni et al. (FOCS 2018). We also show an $Ω(\log D')$ conditional lower bound for the biconnectivity problem.

研究动机与目标

设计在MPC模型下完全可扩展的并行算法，用于解决2-边连通性和2-顶点连通性问题，其时间复杂度优于经典的PRAM方法。
实现运行时间依赖于图直径D和双直径D′而非n的算法，从而在直径较小的图上实现更快的性能。
证明2-边连通性可在O(log D log log(m/n) n)时间内求解，与MPC模型中已知最佳连通性算法的时间复杂度一致。
建立双连通性的条件下界，表明在广泛接受的“单环 vs. 双环”猜想下，Ω(log D′)时间是必需的。

提出的方法

使用随机树分解技术构建深度不超过diam(G)^O(log log(m/n) n)的生成树。
将线性工作、O(log(dep(par)))时间的LCA算法作为子程序，用于计算树中节点的最近公共祖先。
应用范围最小值查询（RMQ）技术识别桥和双连通分量，对小的子树进行本地处理，对较大的子树则通过专用RMQ数据结构处理。
利用MPC模型中每个机器可使用O(n^δ)内存（对任意δ > 0）的能力，确保总空间复杂度为O(m^{1+γ})（对任意γ > 0），从而实现完全可扩展性。
通过从“单环 vs. 双环”问题的归约，证明双连通性的条件性下界，表明在该猜想成立的前提下，Ω(log D′)时间是必需的。
在算法最后一步，结合排序、LCA查询和连通分量标记，识别双连通分量。

实验结果

研究问题

RQ1在MPC模型中，2-边连通性是否可在O(log D log log(m/n) n)时间内求解，且每个处理器的内存完全可扩展？
RQ2双连通性是否可在O(log D log² log(m/n) n + log D′ log log(m/n) n)时间内求解，其中D′为双直径？
RQ3在MPC模型中，是否存在依赖于双直径D′的双连通性条件性下界？
RQ4所提出的算法是否完全可扩展，即每个机器使用O(n^δ)内存（对任意δ > 0），且总内存线性于输入规模？

主要发现

2-边连通性算法的运行时间为O(log D log log(m/n) n)，总空间复杂度为O(m^{1+γ})，与MPC模型中图连通性问题的最佳已知时间复杂度一致。
双连通性算法的运行时间为O(log D log² log(m/n) n + log D′ log log(m/n) n)，总空间复杂度相同，且完全可扩展。
论文建立了条件性下界：除非“单环 vs. 双环”猜想不成立，否则任何MPC模型下的双连通性算法都至少需要Ω(log D′)时间。
两个算法的正确性以高概率保证（双连通性至少0.97，2-边连通性至少0.99），依赖于随机化树构建及LCA/RMQ子程序。
算法完全可扩展，支持任意常数δ > 0的每台机器内存大小O(n^δ)，且总内存保持线性于输入规模。
该方法利用树分解、LCA和RMQ技术，将问题规约为具有已知高效MPC实现的子程序。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。