QUICK REVIEW

[論文レビュー] ROOFS: RObust biOmarker Feature Selection

Bakhmach, Anastasiia, Paul Dufossé|SPIRE - Sciences Po Institutional REpository|Jan 8, 2026

Cancer Immunotherapy and Biomarkers被引用数 0

ひとこと要約

ROOFS は生物医療データの多くの特徴選択法を検討し、安定性、楽観性修正済み予測性能、 semi-synthetic の真偽陽性評価を用いてロバストなバイオマーカー署名を特定する。NSCLC 免疫療法抵抗データで実証。

ABSTRACT

Feature selection (FS) is essential for biomarker discovery and clinical predictive modeling. Over the past decades, methodological literature on FS has become rich and mature, offering a wide spectrum of algorithmic approaches. However, much of this methodological progress has not fully translated into applied biomedical research. Moreover, challenges inherent in biomedical data, such as high-dimensional feature space, low sample size, multicollinearity, and missing values, make FS non-trivial. To help bridge this gap between methodological development and practical application, we propose ROOFS (RObust biOmarker Feature Selection), a Python package available at https://gitlab.inria.fr/compo/roofs, designed to help researchers in the choice of FS method adapted to their problem. ROOFS benchmarks multiple FS methods on the user's data and generates reports summarizing a comprehensive set of evaluation metrics, including downstream predictive performance estimated using optimism correction, stability, robustness of individual features, and true positive and false positive rates assessed on semi-synthetic data with a simulated outcome. We demonstrate the utility of ROOFS on data from the PIONeeR clinical trial, aimed at identifying predictors of resistance to anti-PD-(L)1 immunotherapy in lung cancer. Of the 34 FS methods gathered in ROOFS, we evaluated 23 in combination with 11 classifiers (253 models) and identified a filter based on the union of Benjamini-Hochberg false discovery rate-adjusted p-values from t-test and logistic regression as the optimal approach, outperforming other methods including widely used LASSO. We conclude that comprehensive benchmarking with ROOFS has the potential to improve the reproducibility of FS discoveries and increase the translational value of clinical models.

研究の動機と目的

高次元・多モーダルな生物医療データにおけるバイオマーカー発見のための頑健な特徴選択を動機づける。
データ前処理・特徴選択・下流モデリング全体で多様な FS 法をベンチマークする自動化された Python パッケージ ROOFS を提供する。
安定性、楽観性補正付き予測性能、半合成アウトカムからの真偽陽性率など包括的な評価指標を提供する。
ROOFS の有用性を肺がん NSCLC コホート（PIONeeR）で実証し、抗PD-(L)1療法への抵抗性予測と方法選択の指針を示す。

提案手法

フィルタ、組込み、ラッパー、アンサンブルを網羅する 34 の FS 法を統合し、 benchmarking のために代表 23 法を選択。
選択後データに対して scikit-learn 互換モデルを用いた下流分類子を適用し予測性能を評価。
同一の前処理・モデリングパイプラインでブートストラップ再標本化（B = 100）を行い、性能と安定性を推定。
Harrell、 .632、.632+ 法による楽観性補正（OC）を性能推定に適用し、最終報告には .632+ を選択。
Nogueira ベースの頻度指標で FS の安定性を測定し、方法間での特徴レベルの頑健性を報告。
半合成アウトカムを用いて真偽発見を評価し、TPR、FPR、FDR、FOR を算出。
欠損値処理には中央値・最頻値の代入と z-score 正規化を含むデータ前処理手順を実装。

Figure 1: Overview of the roofs pipeline for comprehensive FS benchmarking.

実験結果

リサーチクエスチョン

RQ1異なるアルゴリズム系統の特徴選択法は、異質な生物医療データにおいて安定性と予測性能の最良のトレードオフを提供するのか。
RQ2多重共線性と欠損データは、さまざまな FS アプローチで選択されたバイオマーカー署名の頑健性にどのように影響するのか。
RQ3ROOFS の最適化とレポーティングは、臨床予測設定において偽陽性を抑えつつ真の発見を最大化する FS 法の選択に研究者を導けるのか。
RQ4現実データでのディスカバリ力を評価する上で、半合成の真の予測子の評価が追加価値をもたらすのか。

主な発見

フィルタ法は一般に強い安定性と競争力のAUCを示し、実データベンチマークでは t 検定とロジスティック回帰の BH 調整p値の結合である p.adjust が最高の性能を発揮した。
VIF ベースの事前フィルタリングを用いて214特徴に絞ると、方法間の FS 安定性がやや向上し、署名の信頼性が高まった。
LASSO は安定性は中程度だがブートストラップサンプル間で多重共線性のため変動が大きく、多くの特徴が一貫して選択されなかった。
ラッパー法とアンサンブル法は計算時間が長くなることが多く、LASSO や特定のフィルタよりも予測性能で一貫して優位にはならないことが多かった。
半合成ベンチマークはトレードオフを示し、高い TPR はしばしば高い FPR を伴い、安定性重視の方法は TPR が低いが FDR 制御が良い傾向があった；p.adjust フィルターは設定で高い TPR を達成しつつ許容範囲の FPR を達成。
PIONeeR データでは、グラデーションブースティングを用いた p.adjust ベースの署名が最も高い楽観性補正AUC（0.72）と安定性の高さ（S = 0.39）を示した。

Figure 2: Instability of LASSO. A: Bootstrap selection frequencies for features selected by LASSO in at least 1 bootstrap sample. B: Variability in selected subset sizes and out-of-bag AUC.

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。