QUICK REVIEW

[论文解读] Multivariate tests of association based on univariate tests

Ruth Heller, Yair Heller|arXiv (Cornell University)|Mar 10, 2016

Advanced Statistical Methods and Models被引用 8

一句话总结

本文提出一种通用的多元独立性检验框架，通过将问题简化为从任意中心点出发的距离的单变量检验。在最小假设下，一致的单变量检验可确保检测依赖关系的统计功效逐渐增强；若单变量检验为分布自由，则由此产生的多元检验亦为分布自由，且通过多个中心点及聚合可获得潜在的功效提升。

ABSTRACT

For testing two vector random variables for independence, we propose testing whether the distance of one vector from an arbitrary center point is independent from the distance of the other vector from another arbitrary center point by a univariate test. We prove that under minimal assumptions, it is enough to have a consistent univariate independence test on the distances, to guarantee that the power to detect dependence between the random vectors increases to one with sample size. If the univariate test is distribution-free, the multivariate test will also be distribution-free. If we consider multiple center points and aggregate the center-specific univariate tests, the power may be further improved, and the resulting multivariate test may be distribution-free for specific aggregation methods (if the univariate test is distribution-free). We show that certain multivariate tests recently proposed in the literature can be viewed as instances of this general approach. Moreover, we show in experiments that novel tests constructed using our approach can have better power and computational time than competing approaches.

研究动机与目标

提出一种通用且灵活的多元独立性检验方法，利用向量与中心点之间距离的单变量检验。
建立理论保证：一致的单变量检验在样本量增大时，可使检测随机向量之间依赖关系的统计功效持续增加。
探讨单变量检验的分布自由特性是否在多元扩展中得以保留。
通过多个中心点的聚合及特定聚合方法，提升检验功效。
证明现有多元检验可作为本框架的特例。

提出的方法

将多元独立性检验问题简化为检验每个向量与任意中心点之间距离的独立性。
对每个中心点所导出的距离对，应用一致的单变量独立性检验。
通过特定聚合方法，将多个中心点的结果进行聚合，以增强检验功效。
若底层单变量检验为分布自由，则确保该性质在所得多元检验中得以保留。
利用渐近理论证明：在最小假设下，随着样本量增加，统计功效趋近于1。
通过结合单变量检验、距离度量与多个中心点，构建新型多元检验。

实验结果

研究问题

RQ1在向量与中心点之间距离的单变量独立性检验中，若检验具有一致性，是否能确保检测多元依赖关系的统计功效随样本量增加而持续提升？
RQ2单变量检验的分布自由特性是否能传递至由此产生的多元检验？
RQ3通过在多个中心点上聚合单变量检验，能否提升多元检验的统计功效？
RQ4现有多元独立性检验是否可视为该通用框架的特例？
RQ5所提出的方法能否在计算效率和统计功效方面优于现有竞争方法？

主要发现

在最小假设下，若单变量检验具有一致性，则所提出的多元检验在样本量增大时，检测依赖关系的统计功效趋近于1。
若单变量检验为分布自由，则由此产生的多元检验亦为分布自由，该性质在所提出的框架下得以保持。
通过在多个中心点上聚合单变量检验，可提升统计功效，尤其在采用合适聚合方法时更为显著。
该框架可统一包含某些近期提出的多元检验作为特例，为其理论基础提供统一解释。
实验结果表明，基于该方法构建的新检验在统计功效与计算效率方面均优于现有方法。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。