QUICK REVIEW

[논문 리뷰] Early Prediction of Liver Cirrhosis Up to Three Years in Advance: A Machine Learning Study Benchmarking Against the FIB-4 Score

Zhuqi Miao, Sujan Ravi|arXiv (Cornell University)|2026. 01. 01.

Artificial Intelligence in Healthcare인용 수 0

한 줄 요약

루틴 EHR 데이터를 사용하는 머신 러닝 모델은 two (and by implication up to three) years before diagnosis에 간경변 발생을 예측하여, 여러 지표에서 FIB-4 및 APRI 점수보다 우수하다.

ABSTRACT

Objective: Develop and evaluate machine learning (ML) models for predicting incident liver cirrhosis one, two, and three years prior to diagnosis using routinely collected electronic health record (EHR) data, and to benchmark their performance against the FIB-4 score. Methods: We conducted a retrospective cohort study using de-identified EHR data from a large academic health system. Patients with fatty liver disease were identified and categorized into cirrhosis and non-cirrhosis cohorts based on ICD-9/10 codes. Prediction scenarios were constructed using observation and prediction windows to emulate real-world clinical use. Demographics, diagnoses, laboratory results, vital signs, and comorbidity indices were aggregated from the observation window. XGBoost models were trained for 1-, 2-, and 3-year prediction horizons and evaluated on held-out test sets. Model performance was compared with FIB-4 using area under the receiver operating characteristic curve (AUC). Results: Final cohorts included 3,043 patients for the 1-year prediction, 1,981 for the 2-year prediction, and 1,470 for the 3-year prediction. Across all prediction windows, ML models consistently outperformed FIB-4. The XGBoost models achieved AUCs of 0.81, 0.73, and 0.69 for 1-, 2-, and 3-year predictions, respectively, compared with 0.71, 0.63, and 0.57 for FIB-4. Performance gains persisted with longer prediction horizons, indicating improved early risk discrimination. Conclusions: Machine learning models leveraging routine EHR data substantially outperform the traditional FIB-4 score for early prediction of liver cirrhosis. These models enable earlier and more accurate risk stratification and can be integrated into clinical workflows as automated decision-support tools to support proactive cirrhosis prevention and management.

연구 동기 및 목표

일상적으로 수집되는 EHR 데이터를 사용한 간경변(LC)의 조기 예측을 고무한다.
LC 진단 이전의 1년 및 2년 예측 구간에 대한 ML 모델을 개발하고 평가한다.
ML 모델을 FIB-4 및 APRI 점수와 비교하여 벤치마크한다.
조기 위험 구분을 위한 향상된 판별력과 임상적 활용성을 입증한다.

제안 방법

대형 학술 의료 시스템의 식별 불가 EHR 데이터를 사용한 회상적 코호트 연구.
1년 및 2년 예측 구간에 대해 XGBoost 모델을 개발했다.
모델별 특성 선택 및 베이지안 하이퍼파라미터 튜닝을 적용했다.
정확도, 정밀도, 재현율, F1, PR AUC 및 AUC를 사용하여 보류된 테스트 세트에서 평가했다.
지표 전반에서 ML 모델을 FIB-4 및 APRI 점수와 비교했다.

실험 결과

연구 질문

RQ1일상적으로 수집된 EHR 데이터를 사용하여 ML 모델이 진단까지 최대 1년 및 2년 전에 간경변 발생(incident LC)을 예측할 수 있는가?
RQ2조기 LC 예측에서 다양한 예측 구간에 대해 ML 모델이 FIB-4 및 APRI 점수보다 성능이 우수한가?
RQ31년 및 2년 예측 구간에서 성능 지표(AUC, PR AUC 등)는 어떻게 차이가 나타나는가?

주요 결과

XGBoost의 AUC는 0.872 (1-year) 및 0.839 (2-year) 달성했다.
FIB-4의 AUC는 0.756 (1-year) 및 0.723 (2-year).
APRI의 AUC는 0.798 (1-year) 및 0.761 (2-year).
PR AUC는 XGBoost가 0.657 (1-year) 및 0.562 (2-year).
PR AUC는 FIB-4가 0.456 (1-year) 및 0.373 (2-year).
PR AUC는 APRI가 0.504 (1-year) 및 0.421 (2-year).

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.