QUICK REVIEW

[論文レビュー] Efficient Test-Time Model Adaptation without Forgetting

Shuaicheng Niu, Jiaxiang Wu|arXiv (Cornell University)|Apr 6, 2022

Domain Adaptation and Few-Shot Learning被引用数 66

ひとこと要約

tldr: Proposes EATA, an efficient fully test-time adaptation method with active sample selection and Fisher regularization to improve OOD performance while mitigating forgetting on ID data. It updates only BN affine parameters and uses entropy-based and diversity-based criteria to select samples for adaptation.

ABSTRACT

Test-time adaptation (TTA) seeks to tackle potential distribution shifts between training and testing data by adapting a given model w.r.t. any testing sample. This task is particularly important for deep models when the test environment changes frequently. Although some recent attempts have been made to handle this task, we still face two practical challenges: 1) existing methods have to perform backward computation for each test sample, resulting in unbearable prediction cost to many applications; 2) while existing TTA solutions can significantly improve the test performance on out-of-distribution data, they often suffer from severe performance degradation on in-distribution data after TTA (known as catastrophic forgetting). In this paper, we point out that not all the test samples contribute equally to model adaptation, and high-entropy ones may lead to noisy gradients that could disrupt the model. Motivated by this, we propose an active sample selection criterion to identify reliable and non-redundant samples, on which the model is updated to minimize the entropy loss for test-time adaptation. Furthermore, to alleviate the forgetting issue, we introduce a Fisher regularizer to constrain important model parameters from drastic changes, where the Fisher importance is estimated from test samples with generated pseudo labels. Extensive experiments on CIFAR-10-C, ImageNet-C, and ImageNet-R verify the effectiveness of our proposed method.

研究の動機と目的

Motivation: 訓練データへアクセスせず、重いバックプロパゲーションを必要としないように、訓練時とテスト時の分布シフトに対処する。
Goal: 訓練データ分布 (in-distribution) の性能を維持しつつ、効率的な test-time adaptation を達成する。
Aim: 各テストサンプルあたりの後方計算を削減しつつ、out-of-distribution の一般化を改善する。
Aim: Fisher information に基づく正則化項を用いて適応中の壊滅的忘却を防止する。

提案手法

S(x) というアクティブサンプルスコアを用いたサンプル適応性エントロピー最小化を導入し、後方更新の対象となる信頼性が高く冗長性の低いテストサンプルを選択する。
S(x) をエントロピーに基づく重み S_ent(x) と多様性に基づく重み S_div(x) の積として計算し、高エントロピーなサンプルや冗長なサンプルを除外する。
適応中は batch normalization 層のアフィンパラメータのみを更新する。
重要パラメータの大きな変化を抑制するため、Fisher information に基づく重要度 ω(θ_i) を用いた anti-forgetting 正則化項 R( tilde Theta, tilde Theta^o ) を提案する。
Fisher 重要度 ω(θ_i) を、元のモデル予測から構成される ID テストサンプルの小さなセットから推定する。
適応の全体目的関数でエントロピー最小化と正則化を組み合わせる。
各テストバッチごとにパラメータをリセットしない lifelong/adaptation 設定を提供する。

実験結果

リサーチクエスチョン

RQ1テスト時適応を、パラメータのサブセットと選択的サンプルだけを更新することでどのように効率化できるか？
RQ2エントロピーに基づくサンプル選択と多様性フィルタリングは、計算量を削減しつつOODに対するロバスト性を改善できるか？
RQ3Fisher に基づく正則化は、テスト時適応中のID性能の忘却を防ぐか？
RQ4提案手法は、CIFAR-10-C、ImageNet-C、ImageNet-R で従来の TTA 手法と比較してどの程度の性能を示すか？

主な発見

EATA は ImageNet-C および ImageNet-R の OOD 精度を、バックワードパスを減らしつつ、いくつかのベースラインより高く達成する。
ET A (efficient) は unreliable および redundant なサンプルを除外することで、TTT および MEMO と比較して平均バックワードパスを低減する。
EATA (lifelong) は継続的な適応中に ID クリーン精度を維持しつつ、OOD 性能を維持または向上させる。
アブレーションにより、高エントロピーおよび冗長サンプルを除去することで効率とOOD性能が向上することを示す。
Fisher 正則化は、OODの利得を犠牲にすることなく忘却を効果的に緩和する。
実験は、EATA が ETA および比較可能な手法より、効率性と頑健性の両面でベンチマークを上回ることを示している。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。