QUICK REVIEW

[論文レビュー] Dopamine: Brain Modes, Not Brains

Shervin Ghasemlou|arXiv (Cornell University)|Feb 12, 2026

Neurological disorders and treatments被引用数 0

ひとこと要約

本論文は、基本重みを凍結し、個々のニューロンの閾値とゲインを学習して活性化をゲートする活性化空間PEFTである TauGate を紹介し、少数のパラメータ予算で解釈可能なモード切替を実現。MNIST 回転モードの研究で、訓練可能パラメータが少なくても競争力のある精度と解釈可能なゲーティングを示す。

ABSTRACT

Parameter-efficient fine-tuning (PEFT) methods such as \lora{} adapt large pretrained models by adding small weight-space updates. While effective, weight deltas are hard to interpret mechanistically, and they do not directly expose \emph{which} internal computations are reused versus bypassed for a new task. We explore an alternative view inspired by neuromodulation: adaptation as a change in \emph{mode} -- selecting and rescaling existing computations -- rather than rewriting the underlying weights. We propose \methodname{}, a simple activation-space PEFT technique that freezes base weights and learns per-neuron \emph{thresholds} and \emph{gains}. During training, a smooth gate decides whether a neuron's activation participates; at inference the gate can be hardened to yield explicit conditional computation and neuron-level attributions. As a proof of concept, we study ``mode specialization'' on MNIST (0$^\circ$) versus rotated MNIST (45$^\circ$). We pretrain a small MLP on a 50/50 mixture (foundation), freeze its weights, and then specialize to the rotated mode using \methodname{}. Across seeds, \methodname{} improves rotated accuracy over the frozen baseline while using only a few hundred trainable parameters per layer, and exhibits partial activation sparsity (a minority of units strongly active). Compared to \lora{}, \methodname{} trades some accuracy for substantially fewer trainable parameters and a more interpretable ``which-neurons-fire'' mechanism. We discuss limitations, including reduced expressivity when the frozen base lacks features needed for the target mode.

研究の動機と目的

weight rewiring なしにニューロモデュレーションになぞらえて適応的計算を動機づける。
TauGate を開発し、基本重みを凍結したままニューロン毎の閾値とゲインを学習する。
少数パラメータ予算で MNIST 0° と 45° の回転モードでモード特化を実証する。
Bias のみを調整するチューニング、LoRA、完全微調整と制御された設定で比較する。
活性化空間PEFT の解釈性と限界に関する知見を提供する。

提案手法

TauGate を、スムーズなゲート g = sigmoid(s(z - tau)) を用いたニューロン毎の閾値とゲインとして定義する。
基本重みを凍結し、(tau, gamma) を各層ごとに訓練する。
推論時にゲートを硬直化して明示的な条件計算を実現することも可能。
ゲート活性化正則化子を用いてスパース性を促進する。
パラメータ数を定量化し、基準法（例：BitFit、LoRA、Full FT）と比較する。
少規模の MLP と DirectML を用いた再現可能な MNIST 回転実験を提供する。

Figure 1: Test accuracy on MNIST (0 ∘ ) and rotated MNIST (45 ∘ ) after specializing to the rotated mode. TauGate improves over the frozen foundation baseline with a small parameter budget and exhibits partial activation sparsity (“High-act frac” in Table 1 ).

実験結果

リサーチクエスチョン

RQ1 TauGate による活性化空間のゲーティングは、凍結したバックボーンでモード特化を達成できるか。
RQ2 TauGate は精度とパラメータ効率の点で重み空間PEFT法とどう比較されるか。
RQ3 硬ゲート化サブネットワークを通じてニューロンレベルの解釈可能な寄与を提供するか。
RQ4 データ少なめのおもちゃ設定におけるゲーティングベースPEFT のトレードオフと限界は。

主な発見

Method	Trainable params	Acc (0 ∘ )	Acc (45 ∘ )	High-act frac
Frozen (no adapt)	0	0.832 ± 0.017	0.833 ± 0.016	–
BitFit (bias-only)	266	0.851 ± 0.005	0.852 ± 0.009	–
TauGate (threshold tuning)	512	0.850 ± 0.006	0.850 ± 0.010	0.28
LoRA (r=8)	10448	0.855 ± 0.006	0.864 ± 0.005	–
Full FT	118282	0.783 ± 0.007	0.887 ± 0.005	–

TauGate は訓練可能パラメータ 512 を追加し、回転モードに特化した後、0° および 45° の MNIST でいずれも 0.850 の精度を達成。
凍結バックボーンと比べて、回転モードの精度を向上させつつ元のモードの性能を保つ。
BitFit は訓練可能パラメータ 266 で 0.851（0°）と 0.852（45°）を達成。
LoRA（r=8）は訓練可能パラメータ 10,448 を用い、0.855（0°）と 0.864（45°）を達成。
Full FT は訓練可能パラメータ 118,282 を用い、0.783（0°）と 0.887（45°）。
TauGate は高活性ユニットの 0.28 の高活性割合という、モードごとのゲーティングを示す部分的な活性化のスパース性を示す。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。