QUICK REVIEW

[论文解读] Bregman projection for calibration estimation

Jae Kwang Kim, Yonghyun Kwon|arXiv (Cornell University)|Mar 21, 2026

Advanced Statistical Methods and Models被引用 0

一句话总结

该论文提出一种基于 Bregman 散度的统一标定框架，用于调整调查权重，得到一个对偶、低维优化和设计高效的估计量，并扩展至未知倾向与高维数据。

ABSTRACT

Calibration weighting is a fundamental technique in survey sampling and data integration for incorporating auxiliary information and improving efficiency of estimators. Classical calibration methods are typically formulated through distance functions applied to weight ratios relative to design weights. In this paper we develop a unified framework for calibration estimation based on Bregman divergence defined directly on the weight vector. We show that calibration estimators obtained from Bregman divergence admit a dual representation that depends only on the dimension of the auxiliary variables and can be interpreted as a Bregman projection onto the calibration constraint set. This geometric structure leads to a general asymptotic representation showing that calibration estimators are equivalent to debiased regression estimators whose regression coefficient depends on the choice of the Bregman generator. The result provides a unifying perspective on classical calibration methods such as quadratic calibration and exponential tilting, and reveals how the choice of divergence influences efficiency. Under Poisson sampling we further characterize the generator that minimizes the asymptotic variance of the calibration estimator and obtain an optimal contrast entropy divergence. The framework also extends naturally to settings where inclusion probabilities are unknown and must be estimated, yielding cross-fitted estimators that remain root-n consistent under mild conditions. Finally, we develop a regularized calibration estimator suitable for high-dimensional auxiliary variables. Simulation studies and a real data application illustrate the practical advantages of the proposed approach.

研究动机与目标

引入在权重空间直接运行的基于 Bregman 散度的标定框架。
展示将原问题–对偶结构将 n 维权重优化降到 p 维乘子优化。
推导渐近表示，将标定与去偏回归估计量联系起来，并具备生成器相关的效率。
在泊松采样下表征设计最优对比熵，并通过跨拟合方法将方法扩展到未知包含概率。

提出的方法

定义用于权重标定的 Bregman 散度 D_G，并将约束最小化问题与标定约束相结合。
推导 KKT 条件，得到 omega_i^*(lambda) = g^{-1}{g(omega_i^{(0)}) + x_i^T lambda}。
获得通过凸共轭 F 的对偶目标 ell(lambda)，并展示原问题–对偶对称性。
证明在对偶空间求解等价于一个 p 维无约束最优化，由 lambda-hat 求得。
建立渐近展开式，表明 BCE 等价于带有生成器相关系数的去偏预测估计量。
将框架扩展至未知倾向，利用跨拟合并提出一个基线权重构造 (omega_i^{(0)}) 的跨拟合版本。

实验结果

研究问题

RQ1如何通过 Bregman 散度将标定权重重新表述为直接的权重空间投影？
RQ2对偶（乘子）表示如何依赖生成器 G，及其对效率的影响？
RQ3在泊松抽样下，哪种生成器产生设计最优的最小渐近方差？对应的最优对比熵是什么？
RQ4该 Bregman 标定框架能否有效处理未知包含概率与高维辅助变量？
RQ5在设计基与缺失数据情形下，BCE 的渐近性质与方差估计策略是什么？

主要发现

通过 Bregman 散度获得的标定权重存在仅依赖辅助变量维度 p 的对偶表示。
BCE 渐近等价于带有生成器相关系数的去偏回归估计量，从而实现效率调优。
在泊松采样下，某一对比熵生成器实现设计最优的最小渐近方差。
在未知倾向时，跨拟合在温和条件下得到一致的双重鲁棒估计量。
一种带有 l_q 容忍度和数据自适应变量选择的正则化 Bregman 标定可推广到高维 X。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。