QUICK REVIEW

[论文解读] Minimizing Makespan in Sublinear Time via Weighted Random Sampling

Bin Fu, Yumei Huo|arXiv (Cornell University)|Feb 3, 2026

Scheduling and Optimization Algorithms被引用 0

一句话总结

该论文提出了两种对等机器的最短完成时间的子线性时间随机化近似方案，利用带权随机采样，得到已知和未知 n 的(1+ε)-近似完成时间及草拟调度。

ABSTRACT

We consider the classical makespan minimization scheduling problem where $n$ jobs must be scheduled on $m$ identical machines. Using weighted random sampling, we developed two sublinear time approximation schemes: one for the case where $n$ is known and the other for the case where $n$ is unknown. Both algorithms not only give a $(1+3ε)$-approximation to the optimal makespan but also generate a sketch schedule. Our first algorithm, which targets the case where $n$ is known and draws samples in a single round under weighted random sampling, has a running time of $ ilde{O}( frac{m^5}{ε^4} \sqrt{n}+A(\ceiling{m\over ε}, ε ))$, where $A(\mathcal{N}, α)$ is the time complexity of any $(1+α)$-approximation scheme for the makespan minimization of $\mathcal{N}$ jobs. The second algorithm addresses the case where $n$ is unknown. It uses adaptive weighted random sampling, % extit{that is}, it draws samples in several rounds, adjusting the number of samples after each round, and runs in sublinear time $ ilde{O}\left( frac{m^5} {ε^4} \sqrt{n} + A(\ceiling{m\over ε}, ε ) ight)$. We also provide an implementation that generates a weighted random sample using $O(\log n)$ uniform random samples.

研究动机与目标

为 m 台同类机器的经典最短完成时间问题设计子线性时间的随机化算法。
利用带权随机采样处理不平衡的作业大小，降低对输入的依赖。
提供草拟调度，并在所有数据可用时进行后验构建接近最优完成时间的具体调度。

提出的方法

在每个作业的采样概率与其处理时间成正比的前提下使用带权随机采样。
构建一个子线性大小的输入草拟，按估计处理时间区间对作业进行分组，并保留关键的 OPT(I) 属性。
应用一个元算法，针对最大的作业使用一个(1+α)近似方案的黑箱，并将其余作业分组以实现(1+ε)近似。
确保草拟具有(α,β1,β2)-草拟，對每个区间的计数和草拟的 OPT 提供保证，从而实现对 OPT(I) 的子线性时间计算。
区分已知 n（单轮采样）与未知 n（自适应带权采样）的算法。
提供从草拟到草拟调度再到在所有数据可用时推导出完整调度的方法。

实验结果

研究问题

RQ1是否可以在子线性时间内利用带权采样将同类机器上的完成时间近似到(1+ε)？
RQ2如何构造一个紧凑的输入草拟，以在已知与未知 n 情况下都保留本质的完成时间结构？
RQ3如何将一般的(1+α)-近似算法与草拟组合，以为原始实例实现(1+ε)-近似？
RQ4已知与未知 n 情况下的运行时间保证和草拟大小界限分别是多少？

主要发现

对于已知 n，存在在 tilde-O(m^5/ε^4 * sqrt(n) + A(ceil(m/ε), ε)) 时间内运行的随机(1+ε)-近似算法。
对于未知 n，存在在 tilde-O(m^5/ε^4 * sqrt(n) + A(ceil(m/ε), ε)) 时间内运行的随机(1+ε)-近似算法。
该算法产生的草拟调度表示为空间 O(m^2/ε^2 * log(mn^2))（已知 n）或 O(m^2/ε^2 * log(nm/ε))（未知 n），从而在后续数据可用时实现具体调度，使完成时间至多为(1+3ε)OPT(I)。
该方法将带权采样、广义生日悖论论证以及用于最短完成时间的黑箱近似方案结合起来，以实现子线性时间解决方案。
提供了一种实用实现，通过使用 O(log n) 个均匀样本生成带权样本。
基于草拟的方法允许在获得完整作业信息后将草拟调度转换为完整调度。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。