QUICK REVIEW

[論文レビュー] Sampling-guided exploration of active feature selection policies

Gabriel Bernardino, Anders Jönsson|arXiv (Cornell University)|Mar 16, 2026

Machine Learning and Data Classification被引用数 0

ひとこと要約

論文は、インスタンス固有のアクティブ特徴選択の強化学習フレームワークを、ヒューリスティックベースの方策探索と事後適合正則化を用いて大規模データセットに拡張し、精度を向上させつつ方針の複雑さを低減し、コンパクトで解釈可能な取得方針を生み出す。

ABSTRACT

Determining the most appropriate features for machine learning predictive models is challenging regarding performance and feature acquisition costs. In particular, global feature choice is limited given that some features will only benefit a subset of instances. In previous work, we proposed a reinforcement learning approach to sequentially recommend which modality to acquire next to reach the best information/cost ratio, based on the instance-specific information already acquired. We formulated the problem as a Markov Decision Process where the state's dimensionality changes during the episode, avoiding data imputation, contrary to existing works. However, this only allowed processing a small number of features, as all possible combinations of features were considered. Here, we address these limitations with two contributions: 1) we expand our framework to larger datasets with a heuristic-based strategy that focuses on the most promising feature combinations, and 2) we introduce a post-fit regularisation strategy that reduces the number of different feature combinations, leading to compact sequences of decisions. We tested our method on four binary classification datasets (one involving high-dimensional variables), the largest of which had 56 features and 4500 samples. We obtained better performance than state-of-the-art methods, both in terms of accuracy and policy complexity.

研究の動機と目的

取得コストの下で個々のインスタンスに対して情報量の多い特徴を選択する課題を動機づけ、解決する。
欠測データを補間せず、スーパーステート（分離した特徴サブセット）と DAG 構造を用いたスケーラブルな RL ベースのフレームワークを提案する。
大規模な特徴集合に対して有望な特徴組み合わせに方策学習を焦点化するサンプリング誘導ヒューリスティックを導入する。
異なる取得シーケンスの数を減らし、解釈性を高める正則化戦略を提示する。

提案手法

スーパーステート上のマルコフ決定過程としてアクティブ特徴選択をモデル化し、欠測データを補間せずに個々のインスタンスの特徴取得決定を可能にする。
Bellman ベースの Q 値計算を拡張し、各スーパーステートごとに独立して Q(s,a) を推定する別個の回帰器を訓練する。
有望なスーパーステートのサブセットに対する制限付き方策を構築するグラフ探索ヒューリスティックを用い、推定値の指数的成長を抑制する。
過度なスーパーステート訪問を抑制する仮想報酬項を追加するスパース方策正則化（REPS に触発）を組み込み、解釈性を高め、過学習を減少させる。

実験結果

リサーチクエスチョン

RQ1中規模/大規模な特徴集合に対して、すべての特徴組み合わせを列挙せずにアクティブ特徴選択をスケーリングできるか。
RQ2サンプリング誘導探索戦略は、個々のインスタンス決定のために最も価値のある特徴組み合わせを特定できるか。
RQ3正則化は、アクティブ特徴選択における精度を損なうことなく方策の複雑さを減らせるか。
RQ4提案手法は、精度とコスト効率の点で、さまざまなデータセットにおける最先端のアクティブ特徴選択手法と比較してどうか。

主な発見

提案されたサンプリング誘導探索は、EDD I および Wang のクラスタリングと比較して、いくつかのデータセット（高次元を含む）で競争力のある、または優れた精度を示した。
最大56特徴、最大4500サンプルのデータセットを処理可能で、方策の複雑さ（異なる取得シーケンスの数）が改善され、解釈性が向上した。
Heart データセットでの実コスト評価では、コストが変動する場合に安価で有益な特徴を選択する傾向を示し、同一コスト設定では高価な特徴を選択する傾向が強まることを示した。
ヒューリスティックベースの方策拡張により、大規模な特徴集合（例：Spam）で探索を加速し、精度とコストのトレードオフをランダム探索よりも良好に達成した。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。