QUICK REVIEW

[論文レビュー] Dual Representation of Minimum Divergence Under Integral Constraints

Shubhanshu Shekhar, Shubhada Agrawal|arXiv (Cornell University)|Mar 22, 2026

Advanced Bandit Algorithms Research被引用数 0

ひとこと要約

要約: 本論文は、[0,1]^K 上の分布に対する積分制約下の最小発散のデュアル表現を導くための二段階の離散化ベースの方法を開発し、KL_inf および f-ダイバージェンスを拡張して逐次推論への適用を行う。

ABSTRACT

Minimum divergence problems under integral constraints appear throughout statistics and probability, including sequential inference, bandit theory, and distributionally robust optimization. In many such settings, dual representations are the key step that convert information-theoretic lower bounds into computationally tractable (and often near-optimal) algorithms. In this paper, we present a general two-stage recipe for deriving dual representations of constrained minimum divergence (in the second argument) for distributions supported on $[0,1]^K$. The first stage derives a dual representation for finitely-supported distributions using classical finite-dimensional convex duality techniques, while the second establishes an abstract interchange argument that lifts this discretized dual to arbitrary distributions. We begin with the simplest case of mean-constrained minimum relative entropy, commonly called $\mathrm{KL}_{\inf}$, and generalize an existing argument from multi-armed bandits literature for $K=1$ to arbitrary dimensions. Our main contribution is to significantly expand the scope of this approach to a broad class of $f$-divergences (beyond relative entropy) and to general integral constraint functionals (beyond the mean constraint). Finally, we illustrate the statistical implications of our results by constructing optimal procedures for sequential testing, estimation, and change detection with observations in $[0,1]^K$.

研究の動機と目的

[0,1]^K 上の分布に対する積分制約下で第二引数の最小発散のデュアル表現を導出する。
KL_inf から広範な f-ダイバージェンスと一般的な積分制約汎関数へデュアル性を拡張する。
制約をリミティング Argument を用いて一般分布へ拡張する、構成的な離散化ベースのパイプラインとリフティング議論を提供する。
得られたデュアル表現を逐次推論問題（検定、推定、変化検出）などへ適用する。

提案手法

古典的な凸対極性を用いて有限サポート分布のデュアリティを導出する。
二段階離散化パイプラインを導入：（i）制約を正確に満たす有限サポートへの離散化、（ii）DPIと下半連続性を活用したリミティング議論によりデュアルを任意分布へリフト。
フレームワークを相対エントロピーを超える一般的な f-ダイバージェンスへ拡張する、抽象的リミティング議論を用いる。
KL_inf の設定で平均制約を正確に保存する mean-preserving 離散化チャネルを使用し、一般的な制約満足度には近似を許容する形へ一般化。
離散化デュアルが連続デュアルへ収束する条件を確立し、これらの条件の下で I(P,g,C) の抽象的デュアル表現を定式化する。
[0,1]^K 値観測を用いた逐次検定、推定、変化検出などの統計的応用について議論する。

実験結果

リサーチクエスチョン

RQ1分布が [0,1]^K にあるとき、積分制約下で I(P,g,C) のデュアル表現をどのように得られるか？
RQ2有限サポートデュアルから始めてリミティング論を用いる二段階離散化アプローチは、平均以外の一般的な制約や他の f-ダイバージェンスへ適用可能か？
RQ3離散化デュアルが連続デュアルへ収束することを保証し、有限分布から一般分布へリフトする条件は何か？
RQ4一般の積分制約を用いた KL_inf のデュアル形と、ヘリンゲル距離やカイ二乗など他の発散のデュアル形はどうなるか？
RQ5これらのデュアル表現は [0,1]^K データに対する逐次推論タスクの最適手順にどう影響するか？

主な発見

有限サポートの場合に KL_inf のデュアル表現を導出し、一般分布へリミティング論を介して拡張した。
離散化を平均制約を正確に保存する mean-preserving なディスクリート化チャネルを導入した。
二段階アプローチを KL_inf からより広い f-ダイバージェンスのクラスおよび一般的な連続制約へ拡張した。
I(P,g,C) のための抽象的デュアル表現フレームワークを、緩やかな正則性・連続性仮定の下で開発し、適用範囲を広げた。
結果は逐次推論タスク（検定、推定、変化検出）に結びつき、[0,1]^K 観測に対する最適手順を示す。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。