QUICK REVIEW

[论文解读] Bayesian optimization with virtual derivative sign observations

Eero Siivola, Aki Vehtari|arXiv (Cornell University)|Apr 4, 2017

Gaussian Processes and Bayesian Inference被引用 2

一句话总结

本文提出了一种贝叶斯优化方法，通过在搜索空间边界处注入虚拟导数观测值，利用高斯过程提升收敛速度。通过强制使目标函数在边界处不太可能达到最小值，该方法减少了过度探索，并在各种采集函数下一致地减少了所需的目标函数评估次数。

ABSTRACT

Bayesian optimization (BO) is a global optimization strategy designed to find the minimum of an expensive black-box function, typically defined on a compact subset of $\mathcal{R}^d$, by using a Gaussian process (GP) as a surrogate model for the objective. Although currently available acquisition functions address this goal with different degree of success, an over-exploration effect of the contour of the search space is typically observed. However, in problems like the configuration of machine learning algorithms, the function domain is conservatively large and with a high probability the global minimum does not sit on the boundary of the domain. We propose a method to incorporate this knowledge into the search process by adding virtual derivative observations in the \gp at the boundary of the search space. We use the properties of GPs to impose conditions on the partial derivatives of the objective. The method is applicable with any acquisition function, it is easy to use and consistently reduces the number of evaluations required to optimize the objective irrespective of the acquisition used. We illustrate the benefits of our approach in an extensive experimental comparison.

研究动机与目标

为解决贝叶斯优化中的过度探索问题，特别是在全局最小值不太可能位于边界的高维、保守型搜索空间中。
将先验知识融入模型，即全局最小值不太可能出现在搜索域的边界处。
通过引导优化过程向域的内部区域集中，减少昂贵的目标函数评估次数。
开发一种与任何采集函数兼容且易于实现的方法。

提出的方法

在搜索空间边界处向高斯过程代理模型中注入虚拟导数观测值。
利用高斯过程的性质，在边界点上施加偏导数为零的均值约束。
根据对边界行为的置信度，将导数观测值建模为噪声型或确定型。
将目标函数在边界处梯度为零的先验信念表述为对GP后验的一组线性约束。
在不修改采集函数的前提下，将虚拟观测值整合进GP训练过程。
通过将导数约束直接嵌入GP模型，确保与任何标准采集函数的兼容性。

实验结果

研究问题

RQ1在边界处注入虚拟导数观测值是否能提升贝叶斯优化的收敛速度？
RQ2所提出的方法是否能有效减少搜索空间中，特别是边界附近的过度探索？
RQ3该方法在不同采集函数和目标函数下的表现如何？
RQ4该方法是否可普遍适用，而无需对采集函数进行修改？
RQ5虚拟导数观测值对找到最小值所需的目标函数评估次数有何影响？

主要发现

所提出的方法在所有测试的采集函数下，均一致减少了优化目标函数所需的函数评估次数。
虚拟导数观测值能有效减少边界附近的过度探索，尤其在高维和保守型搜索空间中表现显著。
该方法在不修改现有采集函数的前提下，提升了优化效率。
基于GP的导数约束能够通过引导搜索向域内部推进，从而实现更快的收敛。
实验结果表明，该方法在合成数据和真实世界中的机器学习超参数调优任务中均表现出显著的性能提升。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。