QUICK REVIEW

[論文レビュー] Robust Estimation of High-Dimensional Mean Regression

Jianqing Fan, Quefeng Li|arXiv (Cornell University)|Oct 8, 2014

Statistical Methods and Inference参考文献 25被引用数 18

ひとこと要約

本稿では、重たい尾を持つ誤差のもとで高次元平均回帰をロバストに実行できるように、発散するチューニングパラメータを備えた罰則付きハッパー損失推定器、RA-Lassoを提案する。ハッパー損失をバイアス低減に適応させ、第二モーメントしか存在しない場合でも最適な収束速度を保証することで、RA-Lassoは、軽尾設定と同一の最適な $L_2$ 誤差率を達成する。さらに、平均推定において指数的集中性を維持する。

ABSTRACT

Data subject to heavy-tailed errors are commonly encountered in various scientific fields, especially in the modern era with explosion of massive data. To address this problem, procedures based on quantile regression and Least Absolute Deviation (LAD) regression have been devel- oped in recent years. These methods essentially estimate the conditional median (or quantile) function. They can be very different from the conditional mean functions when distributions are asymmetric and heteroscedastic. How can we efficiently estimate the mean regression functions in ultra-high dimensional setting with existence of only the second moment? To solve this problem, we propose a penalized Huber loss with diverging parameter to reduce biases created by the traditional Huber loss. Such a penalized robust approximate quadratic (RA-quadratic) loss will be called RA-Lasso. In the ultra-high dimensional setting, where the dimensionality can grow exponentially with the sample size, our results reveal that the RA-lasso estimator produces a consistent estimator at the same rate as the optimal rate under the light-tail situation. We further study the computational convergence of RA-Lasso and show that the composite gradient descent algorithm indeed produces a solution that admits the same optimal rate after sufficient iterations. As a byproduct, we also establish the concentration inequality for estimat- ing population mean when there exists only the second moment. We compare RA-Lasso with other regularized robust estimators based on quantile regression and LAD regression. Extensive simulation studies demonstrate the satisfactory finite-sample performance of RA-Lasso.

研究の動機と目的

誤差分布が軽尾でない、第二モーメントしか存在しない場合の高次元平均回帰推定の課題に対処する。
従来のハッパー損失が平均回帰に与えるバイアスを軽減するため、チューニングパラメータを発散可能にする。
中央値・分位数関数ではなく条件付き平均関数を推定するロバストな正則化手法を開発する。これは、誤差分布が非対称または異分散性である場合に特に重要である。
次元数が標本サイズに指数関数的に増加する超高次元設定において、推定器の理論的最適性を確立する。
第二モーメントの存在のみを仮定したもとで、平均推定のための集中不等式を確立し、カトニの研究をスパース線形モデルに拡張する。

提案手法

発散するパラメータを用いた $L_1$-罰則付きハッパー損失に基づく、罰則付きロバストな近似2次損失（RA-quadratic）を提案し、これをRA-Lassoと呼ぶ。
$n \to \infty$ のとき $α \to \infty$ となる発散するハッパーのチューニングパラメータ $α$ を用いることで、平均推定におけるバイアス低減を保証する。
RA-Lassoの最適化問題を解くために、合成勾配降下法を適用し、十分な反復回数後には最適な収束速度に収束することを証明する。
ハッパー損失の2階テイラー展開を用いて推定誤差をバインドし、集中性の性質を導出する。
RA-Lasso損失に対して制限された強い凸性（RSC）条件を確立し、スパarsityのもとでの高次元一貫性を可能にする。
カトニのロバストM推定量フレームワークを高次元スパース線形モデルに拡張し、第二モーメントの存在のみを仮定したもとで指数的集中性を達成する。

実験結果

リサーチクエスチョン

RQ1誤差が第二モーメントしか持たない場合でも、高次元平均回帰において最適な $L_2$ 誤差率を達成できるか？
RQ2従来のハッパー損失が平均回帰に与えるバイアスを、重たい尾を持つ誤差に対してロバストなまま低減できるか？
RQ3誤差分布が重たい尾を持つ場合でも、RA-Lasso推定器は、軽尾設定と同一の最適な収束速度を達成できるか？
RQ4第二モーメントの存在のみを仮定したもとで、平均推定器のための集中不等式を確立できるか？これにより、高次元におけるロバストな推論が可能になるか？
RQ5有限標本における性能において、LAD や分位数回帰に基づく推定器と比較して、RA-Lasso推定器はどのように差をつけるか？

主な発見

RA-Lasso推定器は、超高次元設定のもとで、最適な $L_2$ 誤差率 $O(\sqrt{R_q (\log p)/n})$ を達成する。これは、軽尾設定における最適レートと一致する。
誤差が原点の周りで対称な場合、RA-Lasso推定器は $L_1$-罰則付きLAD推定器と同じレートを達成するため、この特殊な場合に効率損失が生じない。
RSC条件のもとで、推定誤差 $\|\widehat{\boldsymbol{\beta}} - \boldsymbol{\beta}^*\|_2$ は $O(\sqrt{R_q (\log p)/n})$ のレートで収束する。ここで $R_q$ はスパarsityを制御する。
第三モーメントが存在する場合、ハッパー損失のバイアスは $O(\alpha^2)$ に低減され、第二モーメントしか存在しない場合でも $O(\alpha)$ に低減される。これにより、一貫性のある平均推定が可能になる。
第二モーメントの存在のみを仮定したもとでも、RA-Lasso推定器は母平均に対して指数的集中性を示す。これは、カトニの結果を高次元回帰に拡張したものである。
合成勾配降下法は、十分な反復回数後、最適な収束速度の解に収束する。これにより、計算上の実行可能性が保証される。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。