QUICK REVIEW

[论文解读] A Tight Runtime Analysis for the cGA on Jump Functions---EDAs Can Cross Fitness Valleys at No Extra Cost

Benjamin Doerr|arXiv (Cornell University)|Mar 26, 2019

Metaheuristic Optimization Algorithms Research参考文献 20被引用 33

一句话总结

本论文提供了紧致的运行时间分析，表明当假设种群大小 µ = Ω(√n log n) 时，紧凑遗传算法（cGA）以高概率在 O(µ√n) 次迭代内优化 n 维跳跃函数，跳跃大小 k < (1/20) ln n。与大多数进化算法不同，cGA 在无需额外代价的情况下穿越适应度山谷，即使对于对数尺度的跳跃大小，其性能也接近最优，与 OneMax 优化性能相当。

ABSTRACT

We prove that the compact genetic algorithm (cGA) with hypothetical population size $\mu = \Omega(\sqrt n \log n) \cap ext{poly}(n)$ with high probability finds the optimum of any $n$-dimensional jump function with jump size $k < \frac 1 {20} \ln n$ in $O(\mu \sqrt n)$ iterations. Since it is known that the cGA with high probability needs at least $\Omega(\mu \sqrt n + n \log n)$ iterations to optimize the unimodal OneMax function, our result shows that the cGA in contrast to most classic evolutionary algorithms here is able to cross moderate-sized valleys of low fitness at no extra cost. Our runtime guarantee improves over the recent upper bound $O(\mu n^{1.5} \log n)$ valid for $\mu = \Omega(n^{3.5+\varepsilon})$ of Hasen\"ohrl and Sutton (GECCO 2018). For the best choice of the hypothetical population size, this result gives a runtime guarantee of $O(n^{5+\varepsilon})$, whereas ours gives $O(n \log n)$. We also provide a simple general method based on parallel runs that, under mild conditions, (i)~overcomes the need to specify a suitable population size, but gives a performance close to the one stemming from the best-possible population size, and (ii)~transforms EDAs with high-probability performance guarantees into EDAs with similar bounds on the expected runtime.

研究动机与目标

为估计分布算法（EDAs）在非单峰问题上的理论理解提供补足，特别是 cGA 在跳跃函数上的表现。
解决先前 cGA 运行时间界中因种群大小过大（例如 µ = Ω(n^3.5+ε)）而导致的实际效率低下问题，避免产生不切实际的高运行时间（例如 O(n^5+ε)）。
证明 cGA 能够在不引起渐近性能下降的前提下，高效穿越中等规模的适应度山谷。
提出一种通用方法，将高概率运行时间保证转化为期望运行时间保证，并消除对种群大小参数 µ 的调优需求。

提出的方法

使用滴定分析和集中不等式分析 cGA 在跳跃函数上的行为，以限制频率更新和收敛时间。
应用加法滴定定理估计将到最优解的距离（Dt）从 D′ 减少到 D′′ 所需的时间，表明 E[T] = O(µ log n)。
采用一种改进的进度估计方法，考虑采样适应度山谷与频率更新之间的随机依赖关系，纠正了先前工作中使用的有缺陷的平均场近似。
提出一种新方法，通过并行运行将高概率运行时间保证转化为期望运行时间保证，而无需事先知道最优 µ。
采用两阶段分析：首先确保频率始终远离 0 和 1，然后在假设间隙很少被采样的前提下分析收敛至最优解的过程。
使用并集界和切尔诺夫型不等式控制在适应度山谷中采样的概率，表明在给定条件下该概率呈指数级小。

实验结果

研究问题

RQ1cGA 能否高效优化具有中等规模适应度山谷的跳跃函数？与单峰的 OneMax 相比，其运行时间是否会产生惩罚？
RQ2cGA 在跳跃函数上的最紧运行时间保证是什么？其与假设种群大小 µ 的依赖关系如何？
RQ3忽略适应度山谷采样与进度之间随机依赖关系的平均场近似是否合理？还是会导致不准确的界？
RQ4是否可能消除对种群大小 µ 手动调优的需求，同时保持接近最优的性能？
RQ5是否可以将 EDA 的高概率运行时间保证转化为期望运行时间保证？

主要发现

当 µ = Ω(√n log n) 且 µ 多项式有界时，cGA 以高概率在 O(µ√n) 次迭代内优化任意 n 维跳跃函数，其中 k < (1/20) ln n。
对于最小可接受的 µ = Θ(√n log n)，运行时间保证为 O(n log n)，显著优于 Hasenöhrl 和 Sutton（2018）提出的先前 O(n^5+ε) 的界。
cGA 在无需额外渐近代价的情况下穿越了大小 k < (1/20) ln n 的适应度山谷，而大多数经典 EA 需要 Ω(nk) 的时间。
本文识别并纠正了先前工作中存在的有缺陷的平均场近似，表明忽略山谷采样与进度之间的依赖关系会导致风险被低估和无效界。
提出了一种通用方法，通过并行运行将高概率运行时间保证转化为期望运行时间保证，无需指定 µ，从而实现接近最优 µ 的性能。
分析表明，在适应度间隙中采样的概率最多为 O(1/n)，在给定条件下呈指数级小，确保 cGA 很少陷入陷阱。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。