QUICK REVIEW

[論文レビュー] Control of Generalized Error Rates in Multiple Testing

Joseph P. Romano, Michael Wolf|Zurich Open Repository and Archive (University of Zurich)|May 1, 2005

Statistical Methods in Clinical Trials参考文献 30被引用数 106

ひとこと要約

本稿では、多重仮説検定における一般化誤り率、特にk-FWER（少なくともk個の偽棄却の確率）およびFDP（偽発見率）を制御するリサンプリングに基づく手順を提案する。ブートストラップおよびサブサンプリング手法を用いて、検定統計量の間の依存性を考慮したステップダウン手順を開発し、部分集合ピボタル性を必要とせず、漸近的にこれらの誤り率を制御することで、遺伝学的応用など高次元設定における検出力の向上を実現する。

ABSTRACT

Consider the problem of testing $s$ hypotheses simultaneously. The usual approach restricts attention to procedures that control the probability of even one false rejection, the familywise error rate (FWER). If $s$ is large, one might be willing to tolerate more than one false rejection, thereby increasing the ability of the procedure to correctly reject false null hypotheses. One possibility is to replace control of the FWER by control of the probability of $k$ or more false rejections, which is called the $k$-FWER. We derive both single-step and step-down procedures that control the $k$-FWER in finite samples or asymptotically, depending on the situation. We also consider the false discovery proportion (FDP) defined as the number of false rejections divided by the total number of rejections (and defined to be 0 if there are no rejections). The false discovery rate proposed by Benjamini and Hochberg [J. Roy. Statist. Soc. Ser. B 57 (1995) 289--300] controls $E(FDP)$. Here, the goal is to construct methods which satisfy, for a given $γ$ and $α$, $P\{FDP>γ\}\le α$, at least asymptotically. In contrast to the proposals of Lehmann and Romano [Ann. Statist. 33 (2005) 1138--1154], we construct methods that implicitly take into account the dependence structure of the individual test statistics in order to further increase the ability to detect false null hypotheses. This feature is also shared by related work of van der Laan, Dudoit and Pollard [Stat. Appl. Genet. Mol. Biol. 3 (2004) article 15], but our methodology is quite different. Like the work of Pollard and van der Laan [Proc. 2003 International Multi-Conference in Computer Science and Engineering, METMBS'03 Conference (2003) 3--9] and Dudoit, van der Laan and Pollard [Stat. Appl. Genet. Mol. Biol. 3 (2004) article 13], we employ resampling methods to achieve our goals. Some simulations compare finite sample performance to currently available methods.

研究の動機と目的

弱い依存性の仮定の下で、計算的に実行可能な一般化誤り率を制御する手順を開発すること。
従来の家族単位誤り率（FWER）制御を超えて、許容可能なレベルの偽棄却を許容することで統計的検出力を向上させること。
リサンプリングを用いて検定統計量の依存構造を組み込むことにより、既存手法の限界を克服すること。
k-FWERおよび偽発見率が閾値γを超える確率を漸近的に制御すること。
FDRおよびFWER制御の実用的代替手法を提供し、高次元推論における第一種および第二種の誤りのトレードオフをバランスさせる。

提案手法

帰無仮説の下での検定統計量の連合分布を推定するためにブートストラップおよびサブサンプリングを用い、正確な臨界値の計算を可能にする。
k-FWER制御の根拠を形成するために、真の帰無仮説からのk番目に大きな検定統計量を特定するk-最大統計量を用いる。
順序付けられた検定統計量とリサンプリングによる臨界値に基づいて、段階的に仮説を棄却するステップダウン手順を適用する。
ブートストラップによる偽発見率の推定に基づき、動的に棄却閾値を調整するFDP制御のための新規アルゴリズムを導入する。
漸近的理論およびU統計量の議論を用いて、リサンプリングに基づく臨界値の妥当性を正当化する。
一部の他の手法が要請する部分集合ピボタル性の仮定を回避することで、依存する検定統計量への適用可能性を高める。

実験結果

リサーチクエスチョン

RQ1リサンプリングに基づく手順は、弱い依存性の仮定の下で有限標本または漸近的にk-FWERを制御できるか？
RQ2ユーザーが指定したγ ∈ [0,1) に対して、P(FDP > γ) ≤ α が漸近的に成立するように偽発見率（FDP）を制御できるか？
RQ3検定統計量の依存性を活用することで、誤り率制御を損なわずに多重検定手順の検出力を向上させられるか？
RQ4提案されたステップダウン手順は、従来のワンステップ法やFDR制御手法に比べ、検出力および誤り率制御の観点で優れているか？
RQ5提案手法は、部分集合ピボタル性条件を満たさない依存構造に対してもロバストであるか？

主な発見

提案されたステップダウン手順は、弱い正則性条件の下で、部分集合ピボタル性の仮定を必要とせず、k-FWERを漸近的に制御する。
任意のγ ∈ [0,1) に対して、P(FDP > γ) ≤ α を漸近的に制御でき、FDR制御に対するより柔軟な代替手法を提供する。
シミュレーションでは、検定統計量が依存する場合に特に従来手法に比べて検出力が向上している。
k-FWER手順はkに関して単調である：kを増加させるとより多くの仮説が棄却され、これはFDP制御アルゴリズムの妥当性を支持する。
理論的正当化はサブサンプリングおよびU統計量理論に依拠しており、一般の依存構造のもとで漸近的妥当性を保証する。
付録の反例により、[33]で提示された関連手法が、すべての帰無仮説が真である場合でさえ、FDPを漸近的に制御できないことが示され、本手法の新規性および必然性が強調される。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。