QUICK REVIEW

[論文レビュー] Robust Bayesian Optimization via Tempered Posteriors

Jiguang Li, Hengrui Luo|arXiv (Cornell University)|Jan 11, 2026

Advanced Bandit Algorithms Research被引用数 0

ひとこと要約

本論文はベイズ最適化における tempered (alpha- tempered) 後方更新を導入し、 surrogate の過信を抑制。 tempered GP surrogates と generalization improvement acquisitions の後悔界を導出し、オンラインでの alpha 調整スキームを提案する。

ABSTRACT

Bayesian optimization (BO) iteratively fits a Gaussian process (GP) surrogate to accumulated evaluations and selects new queries via an acquisition function such as expected improvement (EI). In practice, BO often concentrates evaluations near the current incumbent, causing the surrogate to become overconfident and to understate predictive uncertainty in the region guiding subsequent decisions. We develop a robust GP-based BO via tempered posterior updates, which downweight the likelihood by a power $α\in (0,1]$ to mitigate overconfidence under local misspecification. We establish cumulative regret bounds for tempered BO under a family of generalized improvement rules, including EI, and show that tempering yields strictly sharper worst-case regret guarantees than the standard posterior $(α=1)$, with the most favorable guarantees occurring near the classical EI choice. Motivated by our theoretic findings, we propose a prequential procedure for selecting $α$ online: it decreases $α$ when realized prediction errors exceed model-implied uncertainty and returns $α$ toward one as calibration improves. Empirical results demonstrate that tempering provides a practical yet theoretically grounded tool for stabilizing BO surrogates under localized sampling.

研究の動機と目的

BO の surrogate が局所的ミススペックification および過信に対してロバストであることを動機づける。
BO 内のベイズ線形および GP surrogate に likelihood tempering を組み込み、更新を安定化する。
tempered GP surrogate に基づく generalized improvement ルール（EI を含む）に対する累積後悔界を導出する。
BO において tempering パラメータ alpha を選択するためのチューニング寄りのオンライン手法を提案する。
実験を通じてベンチマーク関数上で tempering の実践的な性能向上を示す。

提案手法

likelihood を冪乗 alpha (0,1] に上げて tempered posterior 更新を定式化する。
tempered posterior 更新を持つ GP surrogate を構築し、alpha に対する予測平均と分散を導出する（mu_{t,alpha}, sigma_{t,alpha}）。
tempered GP posteriors における generalized EI (g-EI) acquisition の閉形式を導出する（nu_t のリスケーリングと tau_g(v) 関数を含む）。
gamma term および determinant 増大境界を含む、alpha および g に依存する tempered BO の累積後悔界を確立する。
calibration と実現予測誤差に基づいて alpha をオンラインで適応する予め前計画的で調整が少ないスケジュールを提案する。
標準の alpha=1 理論との解析的比較を提供し、非パラメトリック GP 設定で tempered posteriors が最悪ケース保証をより鋭くする可能性を示す。

実験結果

リサーチクエスチョン

RQ1BO において likelihood を temper にすることで surrogate の局所的ミススペックification に対するロバスト性が改善されるか。
RQ2 tempered posteriors は PI や EI のような generalization improvement acquisitions との関係で後悔にどのような影響を与えるか。
RQ3alpha と g に依存する tempered GP ベースの BO に対する explicit な後悔界を導出できるか、またこれが標準後方とどう比較されるか。
RQ4 heavy tuning なしでボルド BO 中に alpha を適応させる実用的なオンライン手法は存在するか。
RQ5 tempered 戦略はベンチマーク BO 問題で経験的な性能向上をもたらすか。

主な発見

alpha (0,1] による surrogate の tempering は局所的ミススペックを低減し、BO における過信的な後方を防ぐことができる。
tempered GP に対して g-EI acquisition の閉形式が導出され、 tempered BO アルゴリズムの解析を可能にする。
後悔界は、固定された g に対して tempering が標準の alpha=1 より厳密に鋭い最悪ケース保証を与え、EI 設定近傍で最も大きな効果を示す。
線形サロゲートのベースラインでは tempered 更新は正しく仕様された場合の leading-order EI 後悔を改善しないことが示され、 tempering の価値は非線形 GP 領域に主としてあることを示唆する。
alpha を適応する予測的オンラインスケジュールを提案し、予測誤差がモデルの不確実性を上回るときに alpha を低下させ、較正が改善されると 1 に戻す。
実データ上の結果は tempered が BO の surrogate を安定化させ、局所サンプリング下でのロバスト性を向上させることを示している。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。