QUICK REVIEW

[论文解读] Dual Representation of Minimum Divergence Under Integral Constraints

Shubhanshu Shekhar, Shubhada Agrawal|arXiv (Cornell University)|Mar 22, 2026

Advanced Bandit Algorithms Research被引用 0

一句话总结

该论文提出一种基于两阶段离散化的方法来导出在积分约束下对最小散度的对偶表示，针对在 [0,1]^K 上的分布，扩展了 KL_inf 与 f-散度，并有序列推断的应用。

ABSTRACT

Minimum divergence problems under integral constraints appear throughout statistics and probability, including sequential inference, bandit theory, and distributionally robust optimization. In many such settings, dual representations are the key step that convert information-theoretic lower bounds into computationally tractable (and often near-optimal) algorithms. In this paper, we present a general two-stage recipe for deriving dual representations of constrained minimum divergence (in the second argument) for distributions supported on $[0,1]^K$. The first stage derives a dual representation for finitely-supported distributions using classical finite-dimensional convex duality techniques, while the second establishes an abstract interchange argument that lifts this discretized dual to arbitrary distributions. We begin with the simplest case of mean-constrained minimum relative entropy, commonly called $\mathrm{KL}_{\inf}$, and generalize an existing argument from multi-armed bandits literature for $K=1$ to arbitrary dimensions. Our main contribution is to significantly expand the scope of this approach to a broad class of $f$-divergences (beyond relative entropy) and to general integral constraint functionals (beyond the mean constraint). Finally, we illustrate the statistical implications of our results by constructing optimal procedures for sequential testing, estimation, and change detection with observations in $[0,1]^K$.

研究动机与目标

在 [0,1]^K 的分布下，建立对第二个变量在积分约束下的最小散度的对偶表示。
将对偶性从 KL_inf 扩展到广义的 f-散度及一般积分约束泛函的结果。
提供一个基于离散化的构造性流程及提升论证，通过极限论证推广到一般分布。
将得到的对偶表示应用于序列推断问题，如检验、估计和变化检测。

提出的方法

利用经典凸对偶性推导具有有限支持分布的对偶性。
引入两阶段离散化流程：(i) 离散化至有限支集并保证约束的完全满足，(ii) 通过利用 DPI 与下半连续性的极限论证，将对偶提升至任意分布。
通过一个抽象的极限论证，将框架扩展到超越相对熵的广义 f-散度。
在 KL_inf 的设定中使用均值保持的离散化信道以在离散化中精确保持均值约束，并将其推广到对一般约束的近似约束满足。
在满足温和的正则性与连续性假设下，建立离散化对偶收敛到连续对偶的条件，并给出 I(P,g,C) 的抽象对偶表示。
讨论对序列检验、估计和变化检测等统计应用及对 [0,1]^K 值观测的适用性。

实验结果

研究问题

RQ1如何在 [0,1]^K 上的分布的积分约束下，为 I(P,g,C) 获得对偶表示？
RQ2两阶段离散化方法（有限支集对偶再通过极限论证提升）是否可推广到超越均值的广义 f-散度和约束？
RQ3哪些可验证的条件可以保证离散化对偶收敛到连续对偶，并允许从有限分布提升至一般分布？
RQ4对于具有一般积分约束的 KL_inf 以及其他散度如 Hellinger 与 Chi-squared，得到的对偶形式为何？
RQ5这些对偶表示如何为在 [0,1]^K 数据上的序列推断任务提供最优策略？

主要发现

在有限支集情形下推导出 KL_inf 的对偶表示，并通过极限论证推广到一般分布。
引入均值保持的离散化通道，以在离散化中精确保持均值约束。
将两阶段方法从 KL_inf 推广到更广泛的 f-散度类以及对一般连续约束的适用性。
建立一个适用于 I(P,g,C) 的抽象对偶表示框架，在温和的正则性与连续性假设下实现更广的适用性。
将结果与序列推断任务联系起来，展示在 [0,1]^K 观测下的检验、估计与变化检测的最优程序。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。