QUICK REVIEW

[论文解读] Kernel Tests of Equivalence

Xing Liu, Axel Gandy|arXiv (Cornell University)|Mar 11, 2026

Markov Chains and Monte Carlo Methods被引用 0

一句话总结

论文提出基于核的等价性检验，使用核 Stein 不一致性（KSD）与最大均值差异（MMD）来比较分布，提供正态近似与自助法实现。

ABSTRACT

We propose novel kernel-based tests for assessing the equivalence between distributions. Traditional goodness-of-fit testing is inappropriate for concluding the absence of distributional differences, because failure to reject the null hypothesis may simply be a result of lack of test power, also known as the Type-II error. This motivates \emph{equivalence testing}, which aims to assess the \emph{absence} of a statistically meaningful effect under controlled error rates. However, existing equivalence tests are either limited to parametric distributions or focus only on specific moments rather than the full distribution. We address these limitations using two kernel-based statistical discrepancies: the \emph{kernel Stein discrepancy} and the \emph{Maximum Mean Discrepancy}. The null hypothesis of our proposed tests assumes the candidate distribution differs from the nominal distribution by at least a pre-defined margin, which is measured by these discrepancies. We propose two approaches for computing the critical values of the tests, one using an asymptotic normality approximation, and another based on bootstrapping. Numerical experiments are conducted to assess the performance of these tests.

研究动机与目标

将等价性检验作为弥补常规拟合优度检验在断言缺乏有意义分布差异方面局限性的手段。
开发非参数、基于核的检验，在预定义的边界下使用 KSD 与 MMD 来评估等价性。
为单样本与双样本情形提供两条实现路径（正态近似与自举）。
允许不等样本量并在不作参数分布假设的前提下，确保校准性与一致性。

提出的方法

提出 E-KSD-Normal 与 E-KSD-Boot 基于核 Stein 不一致性的单样本等价性检验。
提出 E-MMD-Normal 与 E-MMD-Boot 基于最大均值差异的双样本等价性检验。
推出在 Q ≠ P 下 KSD 的渐近正态性结果，并给出带 jackknife 方差估计的基于正态的检验统计量。
开发基于自举的检验（E-KSD-Boot），采用三角不等式方法以在边界较小的情况下获得更好的 I 型错误控制。
引入一种数据驱动方法来为自举检验选取等价边界 θ。
讨论通过 V-统计量和核均值嵌入进行计算，含有闭式 MMD 表达式与自举诊断。

Figure 1: Comparison of different hypothesis testing paradigms. The shaded area represents the space of distributions of interest. The null sets are shown in orange, and the alternative sets in grey. Left. standard testing with a point null hypothesis. Middle. Robust testing based on a statistical d

实验结果

研究问题

RQ1核基不一致性（KSD 与 MMD）是否能在预定义边界内可靠地检测到两个分布的等价性？
RQ2在不同边界与样本量下，正态近似与自举方法在 I 型错误控制与检验效能上有何比较？
RQ3在单样本与双样本等价性检验中，是否可以在不作参数假设的前提下容纳不等样本量？
RQ4哪些边界选择策略能为这些检验提供实用、数据驱动的等价性阈值？

主要发现

提出了两类基于核的等价性检验：单样本基于 KSD，双样本基于 MMD。
在给定假设下，E-KSD-Normal 已实现校准且一致，但对于较小边界可能会有较高的 I 型错误。
E-KSD-Boot 在有限样本下改善了 I 型错误控制，但可能以对检验功效的代价为代价。
E-MMD-Normal 与 E-MMD-Boot 可扩展至双样本情形并允许不等样本量。
基于自举的方法在正态近似对于小等价边界不可靠时提供鲁棒的控制。
提出一种数据驱动的方法用于自举检验中等价边界 θ 的选取。

Figure 2: Gaussian mean-shift experiments with varying sample sizes. The black dotted vertical line is the equivalence margin $\theta$ .

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。