QUICK REVIEW

[論文レビュー] Early Prediction of Liver Cirrhosis Up to Three Years in Advance: A Machine Learning Study Benchmarking Against the FIB-4 Score

Zhuqi Miao, Sujan Ravi|arXiv (Cornell University)|Jan 1, 2026

Artificial Intelligence in Healthcare被引用数 0

ひとこと要約

機械学習モデルは通常のEHRデータを用いて、診断の最大2年（含意として最大3年）前に肝硬変を予測し、複数の指標でFIB-4およびAPRIスコアを上回る。

ABSTRACT

Objective: Develop and evaluate machine learning (ML) models for predicting incident liver cirrhosis one, two, and three years prior to diagnosis using routinely collected electronic health record (EHR) data, and to benchmark their performance against the FIB-4 score. Methods: We conducted a retrospective cohort study using de-identified EHR data from a large academic health system. Patients with fatty liver disease were identified and categorized into cirrhosis and non-cirrhosis cohorts based on ICD-9/10 codes. Prediction scenarios were constructed using observation and prediction windows to emulate real-world clinical use. Demographics, diagnoses, laboratory results, vital signs, and comorbidity indices were aggregated from the observation window. XGBoost models were trained for 1-, 2-, and 3-year prediction horizons and evaluated on held-out test sets. Model performance was compared with FIB-4 using area under the receiver operating characteristic curve (AUC). Results: Final cohorts included 3,043 patients for the 1-year prediction, 1,981 for the 2-year prediction, and 1,470 for the 3-year prediction. Across all prediction windows, ML models consistently outperformed FIB-4. The XGBoost models achieved AUCs of 0.81, 0.73, and 0.69 for 1-, 2-, and 3-year predictions, respectively, compared with 0.71, 0.63, and 0.57 for FIB-4. Performance gains persisted with longer prediction horizons, indicating improved early risk discrimination. Conclusions: Machine learning models leveraging routine EHR data substantially outperform the traditional FIB-4 score for early prediction of liver cirrhosis. These models enable earlier and more accurate risk stratification and can be integrated into clinical workflows as automated decision-support tools to support proactive cirrhosis prevention and management.

研究の動機と目的

Routineに収集されるEHRデータを用いた肝硬変（LC）の早期予測を動機付ける。
LC診断の1年および2年予測 horizonに対するMLモデルの開発と評価。
MLモデルをFIB-4およびAPRIスコアとベンチマークする。
早期リスク層別化の識別性能と臨床的有用性の改善を示す。

提案手法

大規模な学術病院システムからの識別不能化済みEHRデータを用いた後向きコホート研究。
LC予測 horizon1年および2年のXGBoostモデルを開発。
モデル固有の特徴量選択とベイズハイパーパラメータチューニングを適用。
正確性、適合率、再現率、F1、PR AUC、AUCを用いた保持テストセットで評価。
指標を横断してMLモデルをFIB-4およびAPRIスコアと比較。

実験結果

リサーチクエスチョン

RQ1 routineなEHRデータを用いて、診断前1年および2年までにLCの新規発生をMLモデルは予測できるか？
RQ2 horizonを超えて早期LC予測でMLモデルはFIB-4およびAPRIスコアを上回るか？
RQ31年予測と2年予測の性能指標（AUC、PR AUC等）はどのように異なるか？

主な発見

XGBoostのAUCは1年で0.872、2年で0.839。
FIB-4のAUCは1年で0.756、2年で0.723。
APRIのAUCは1年で0.798、2年で0.761。
1年のPR AUCはXGBoostで0.657、2年で0.562。
1年のPR AUCはFIB-4で0.456、2年で0.373。
1年のPR AUCはAPRIで0.504、2年で0.421。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。