Skip to main content
QUICK REVIEW

[Paper Review] Machine Learning Model Interpretability for Precision Medicine

Gajendra J. Katuwal, Robert F. Chen|arXiv (Cornell University)|Oct 28, 2016
Explainable Artificial Intelligence (XAI)2 references61 citations
TL;DR

This paper proposes using Model-Agnostic Explanations (SHAP) to enhance interpretability of complex machine learning models in precision medicine. Using the MIMIC-II dataset, the authors achieve 80% balanced accuracy in predicting ICU mortality while providing individual-level feature importance, enabling clinicians to understand model decisions transparently.

ABSTRACT

Interpretability of machine learning models is critical for data-driven precision medicine efforts. However, highly predictive models are generally complex and are difficult to interpret. Here using Model-Agnostic Explanations algorithm, we show that complex models such as random forest can be made interpretable. Using MIMIC-II dataset, we successfully predicted ICU mortality with 80% balanced accuracy and were also were able to interpret the relative effect of the features on prediction at individual level.

Motivation & Objective

  • To improve interpretability of complex machine learning models used in precision medicine.
  • To enable clinicians to understand the contribution of individual features to model predictions at the individual patient level.
  • To demonstrate that high-performance models can be made interpretable without sacrificing predictive accuracy.
  • To validate the approach on a real-world critical care dataset (MIMIC-II).
  • To bridge the gap between high predictive performance and clinical trust in AI-driven decision support.

Proposed method

  • The authors apply the Model-Agnostic Explanations (SHAP) framework to interpret predictions from a trained random forest model.
  • SHAP values are computed to quantify the contribution of each clinical feature to the prediction for each individual patient.
  • The method is model-agnostic, allowing interpretation of any black-box model, including ensemble methods like random forests.
  • The MIMIC-II critical care dataset is used, containing clinical features such as vital signs, lab values, and comorbidities.
  • The model is trained to predict ICU mortality with a focus on both accuracy and interpretability.
  • Feature importance is visualized at both global and individual levels to support clinical insight.

Experimental results

Research questions

  • RQ1Can complex, high-performing machine learning models used in precision medicine be made interpretable without compromising performance?
  • RQ2What is the relative contribution of individual clinical features to ICU mortality predictions at the individual patient level?
  • RQ3How can model interpretability be systematically applied to clinical decision support systems?
  • RQ4Can SHAP-based explanations enhance trust and clinical utility of predictive models in intensive care settings?
  • RQ5What level of predictive accuracy can be achieved while maintaining sufficient interpretability for clinical deployment?

Key findings

  • The random forest model achieved 80% balanced accuracy in predicting ICU mortality on the MIMIC-II dataset.
  • SHAP-based explanations successfully identified the most influential clinical features for individual patient predictions.
  • The method enabled interpretation of model decisions at the individual level, revealing patient-specific feature impacts.
  • The interpretability framework was applied to a black-box model without requiring architectural changes.
  • The results demonstrate that high-performing models can be made transparent and clinically actionable through post-hoc explanation techniques.
  • The approach supports clinical decision-making by providing traceable, feature-level insights into model predictions.

Better researchstarts right now

From paper design to paper writing, dramatically reduce your research time.

No credit card · Free plan available

This review was created by AI and reviewed by human editors.